13 March 2024
Jason Harrop
Update code to work with docx4j 11.4.10
Bump pdfbox to address font initialization errors
Bump other deps
12 September 2022
Jason Harrop
MathML support (requires user-supplied mathml2omml.xslt)
Support converting altChunks of type mht (as emitted by BIRT) - proof of concept.
16 April 2022
Jason Harrop
11.4.6 is our first JPMS modularised release of ImportXHTML, so the jars contain module-info.class entries.
11.4.6 is compiled with Java 15, targeting Java 11.
Uses jakarta.xml.bind, rather than javax.xml.bind (hence the 11.4.x numbering); if you have existing code which imports javax.xml.bind, you'll need to search/replace across your code base, replacing javax.xml.bind with jakarta.xml.bind
This is also a Maven multi-module project.
From hereon, changes will generally be made to this branch first.
Use auto-generated id value when an image is added; fixes #71, requires docx4j 11.4.6 (hence this version number)
2 Dec 2021
Use openhtmltopdf 1.0.10; use docx4j 8.3.2
15 April 2021
Use openhtmltopdf 1.0.8, reflection no longer necessary
13 January 2021
Migrated from FlyingSaucer to https://github.com/danfickle/openhtmltopdf which uses pdfbox, not old iText
22 April 2019
Compiled for Java 8; aligning with docx4j 8.0.0 release.
Depends on docx4j 8.0.0.
No substantive changes in this release. Purpose of release is to align version numbers.
17 December 2018
Depends on docx4j 6.1.0 (6.0.1 would also work).
No substantive changes in this release. Purpose of release is to align version numbers.
3 August 2018
Depends on docx4j 6.0.1 (since 6.0.0 should be avoided).
docx4j-ImportXHTML.Bidi.Heuristic if set to true in your docx4j-ImportXHTML.properties file, will set bidi/rtl for RTL text including Arabic and Hebrew.
22 July 2018
None. This release is identical to 3.3.6-1 (except some sample code).
Docx4j 6.x will be the last series supporting Java 6.
(docx4j 7.x, when released, will require Java 7+)
This release is numbered 6.0.0 to signify it supports Java 6, and to align with docx4j's 6.0 release.
27 November 2017
Avoid inserting an additional list number where mixed content is encountered in a list item
7 October 2017
nested list improvements (including indentation where tables and lists are mixed)
indentation of nested divs
support max-width on image
Version number 3.3.6 assigned, to align with corresponding docx4j release
docx4j 3.3.6 or later
15 June 2017
rFonts.setHAnsi(font) for convenient handling of chars with diacritic
Hyperlinks: handle XML predefined entities; handle trailing text in mixed content
Version number 3.3.4 assigned, to align with docx4j release number of same date.
docx4j 3.3.4 or later recommended (though 3.3.1 or later ought to work)
16 August 2016
For image inside table cell, remove table margins from image width
Uses docx4j 3.3.1 (though not heavily dependent on it)
21 April 2016
Image size (where specified in HTML)
Nested table handling
Bump dependent jar versions
Uses docx4j 3.3.0 (though not heavily dependent on it)
28 Dec 2014
Haarli Jason Harrop
Paragraph borders Table cell padding (to margin) Support for
Create/add p for div
20 Sept 2014
Jason Harrop Roded
TR height support.
Support for @start, as in
- DivToSdt implementation: allows roundtripping of divs by ID See further: http://www.docx4java.org/blog/2014/09/xhtml-docx-roundtrip-content-tracking/ and samples/DivRoundtrip.java
- Convert @id (on most elements) to bookmark
- Refactor code into new BookmarkHelper
-
https://github.com/plutext/docx4j/commit/df5d726e3359b29893153433082c13ed6c4eb56d 17 Sept 2014 introduces
BindingHandler bh = new BindingHandler(wordMLPackage); bh.setStartingIdForNewBookmarks(bookmarkId);
Use of that code is commented out in the relevant samples here; those samples assume docx4j 3.2.0.
table cell support @valign="bottom", @style="vertical-align:bottom;"
Refactor: move table stuff into a helper class
Support for p and table elements in list item
interface DivHandler: Customisable div handling
Convert a relative link (#) to w:hyperlink/@w:anchor (in createHyperlink method).
We will try to ensure that any formal releases of 3.2.x of docx4j and 3.2.y of ImportXHTML can be used together.
At some point we might add to XHTMLImporter interface in docx4j:
public AtomicInteger getBookmarkIdLast();
public void setBookmarkIdNext(AtomicInteger val);
If we did that, a docx4j using that would need the implementation (in ImportXHTML 3.2.1 or greater).
Because a docx4j using that would not work with ImportXHTML 3.2.0, it would have to be numbered 3.3.
But we avoid all this by leaving the XHTMLImporter interface as-is, and using that method via reflection only.
26 August 2014
HTML 5 figure/figcaption converted to Figure SEQ field
Support for applying Word heading styles to heading (h1, h2, h3..) elements
FontHandler, which creates default font mappings
No release, since no changes in 3.0.1
8 Feb 2014
ai-github Jason Harrop leedavidr
Nil
0d6c106 - if (color.startsWith("#")) .. Fix for plutext/docx4j#101 a452b43 - BindingTraverser.XHTML.Block.rStyle.Adopt handling in list items 6a7b6fc - NPE in indent handling fixed e902f20 - xhtml to pptx Very rudimentary support for h1-3, ol|ul, li Write all paragraphs <a:p> into a single <p:txBody>
3.0.0 is the first release of the XHTML Import stuff as a separate project; so numbered to match corresponding docx4j version.
26 Nov 2013
ai bezda bsl-zcs EthanTsui hpeng Jason Harrop jlesquembre jeffbeard fmmfonseca meletis pnml siilk tj09 tstirrat vollewijn zluspai
Better table support, inc merged cells (pnml) and cell border support (bsl-zcs) Font handling Conversion of CSS class to existing matching Word style (FormattingOption enum: CLASS_TO_STYLE_ONLY, CLASS_PLUS_OTHER, IGNORE_CLASS) Nested list support Image resizing
35fae5e - Handle inline images (avoid forcing new paragraph afterwards) 008604c - Ability to require a CSS property to be on a white list 5b69f1d - addFontMapping(String cssFontFamily, RFonts rFonts) 3b11904 - tables: import cell border styles (bsl-zcs) bcfe159 - XHTML import hyperlink named anchors to bookmarks e486b05 - XHTMLImporter: improve logging; better support for pixel unit in Indent and SpaceAfter. 6a02270 - Support underline in XHTML import. e0034b2 - XHTML importer for presentation slide notes, needs work on element positioning to support imports of slides (bezda) c297662 - XHTML: margin-bottom -> space-after 7d8af58 - XHTML table import: support for adjacent vertically merged cells and cells merged both vertically and horizontally (pnml) 5553600 - fixed vertical merging of table cells in xhtml importer (cherry-picked from 05fdacb)