Releases · jgm/pandoc

08 Mar 20:07

jgm

2.12

31ca011

pandoc 2.12

Click to expand changelog

--resource-path now accumulates if specified multiple times (#6152). Resource paths specified later on the command line are prepended to those specified earlier. Thus, --resource-path foo --resource-path bar:baz is equivalent to --resource-path bar:bas:foo. (The previous behavior was for the last --resource-path to replace all the rest.) resource-path in defaults files behaves the same way: it will be prepended to the resource path set by earlier command line options or defaults files. This change facilitates the use of multiple defaults files: each can specify a directory containing resources it refers to without clobbering the resource paths set by the others.
Allow defaults files to refer to the home directory, the user data directory, and the directory containing the defaults file itself (#5871, #5982, #5977). In fields that expect file paths (and only in these fields),
- ${VARIABLE} will expand to the value of the environment variable VARIABLE (and in particular ${HOME} will expand to the path of the home directory). A warning will be raised for undefined variables.
- ${USERDATA} will expand to the path of the user data directory in force when the defaults file is being processed.
- ${.} will expand to the directory containing the defaults file. (This allows default files to be placed in a directory containing resources they make use of.)
When downloading content from URL arguments, be sensitive to the character encoding (#5600). We can properly handle UTF-8 and latin1 (ISO-8859-1); for others we raise an error. Fall back to latin1 if no charset is given in the mime type and UTF-8 decoding fails.
Allow abbreviations that don’t end in a period to be specified using --abbreviations (#7124).
Add new unexported module Text.Pandoc.XML.Light, as well as Text.Pandoc.XML.Light.Types, Text.Pantoc.XML.Light.Proc, Text.Pandoc.XML.Light.Output. (Closes #6001, #6565, #7091).

This module exports definitions of Element and Content that are isomorphic to xml-light’s, but with Text instead of String. This allows us to keep most of the code in existing readers that use xml-light, but avoid lots of unnecessary allocation.

We also add versions of the functions from xml-light’s Text.XML.Light.Output and Text.XML.Light.Proc that operate on our modified XML types, and functions that convert xml-light types to our types (since some of our dependencies, like texmath, use xml-light).

We export functions that use xml-conduit’s parser to produce an Element or [Content]. This allows existing pandoc code to use a better parser without much modification.

The new parser is used in all places where xml-light’s parser was previously used. Benchmarks show a significant performance improvement in parsing XML-based formats (with docbook, opml, jats, and docx almost twice as fast, odt and fb2 more than twice as fast).

In addition, the new parser gives us better error reporting than xml-light. We report XML errors, when possible, using the new PandocXMLError constructor in PandocError.

These changes revealed the need for some changes in the tests. The docbook-reader.docbook test lacked definitions for the entities it used; these have been added. And the docx golden tests have been updated, because the new parser does not preserve the order of attributes.
DocBook reader:
- Avoid expensive tree normalization step, as it is not necessary with the new XML parser.
- Support informalfigure (#7079) (Nils Carlson).
Docx reader:
- Use Map instead of list for Namespaces. This gives a speedup of about 5-10%. With this and the XML parsing changes, the docx reader is now about twice as fast as in the previous release.
HTML reader:
- Small performance tweaks.
- Also, remove exported class NamedTag(..) [API change]. This was just intended to smooth over the transition from String to Text and is no longer needed.
- As a result, the functions isInlineTag and isBlockTag are no longer polymorphic; they apply to a Tag Text [API change].
- Do a lookahead to find the right parser to use. This takes benchmarks from 34ms to 23ms, with less allocation.
- Fix bad handling of empty src attribute in iframe (#7099). If src is empty, we simply skip the iframe. If src is invalid or cannot be fetched, we issue a warning nd skip instead of failing with an error.
JATS reader:
- Avoid tree normalization, which is no longer necessary given the new XML parser.
LaTeX reader:
- Don’t export tokenize, untokenize [API change]. These are internal implementation details, which were only exported for testing. They don’t belong in the public API.
- Improved efficiency of the parser. With these changes the reader is almost twice as fast as in the last release in our benchmarks.
- Code cleanup, removing some unnecessary things.
- Rewrite withRaw so it doesn’t rely on fragile assumptions about token positions (which break when macros are expanded) (#7092). This requires the addition of sEnableWithRaw and sRawTokens in LaTeXState, and a new combinator disablingWithRaw to disable collecting of raw tokens in certain contexts. Add parseFromToks to Text.Pandoc.Readers.LaTeX.Parsing. Fix parsing of single character tokens so it doesn’t mess up the new raw token collecting. These changes slightly increase allocations and have a small performance impact.
- Handle some bibtex/biblatex-specific commands that used to be dealt with in pandoc-citeproc (#7049).
- Optimize satisfyTok, avoiding unnecessary macro expansion steps. Benchmarks after this change show 2/3 of the run time and 2/3 of the allocation of the Feb. 10 benchmarks.
- Removed sExpanded in state. This isn’t actually needed and checking it doesn’t change anything.
- Improve braced'. Remove the parameter, have it parse the opening brace, and make it more efficient.
- Factor out pieces of the LaTeX reader to make the module smaller. This reduces memory demands when compiling. Created Text.Pandoc.Readers.{LaTeX,Math,Citation,Table,Macro,Inline}. Changed Text.Pandoc.Readers.LaTeX.SIunitx to export a command map instead of individual commands.
- Handle table cells containing & in \verb (#7129).
Make Text.Pandoc.Readers.LaTeX.Types an unexported module [API change].
Markdown reader:
- Improved handling of mmd link attributes in references (#7080). Previously they only worked for links that had titles.
- Improved efficiency of the parser (benchmarks show a 15% speedup).
OPML reader:
- Avoid tree normalization, which is no longer necessary with the new XML parser.
ODT reader:
- Finer-grained errors on parse failure (#7091).
- Give more information if the zip container can’t be unpacked.
Org reader:
- Support task_lists extension (Albert Krewinkel, #6336).
- Fix bug in org-ref citation parsing (Albert Krewinkel, #7101). The org-ref syntax allows to list multiple citations separated by comma. Previously commas were accepted as part of the citation id, so all citation lists were parsed as one single citation.
RST reader:
- Use getTimestamp instead of getCurrentTime to fetch timestamp. Setting SOURCE_DATE_EPOCH will allow reproducible builds.
- RST reader: fix handling of header in CSV tables (#7064). The interpretation of this line is not affected by the delim option.
Jira reader:
- Modified the Doc parser to skip leading blank lines. This fixes parsing of documents which start with multiple blank lines (Albert Krewinkel, #7095).
- Prevent URLs within link aliases to be treated as autolinks (Albert Krewinkel, #6944).
Text.Pandoc.Shared
- Remove formerly exported functions that are no longer used in the code base: splitByIndices, splitStringByIndicies, substitute, and underlineSpan (which had been deprecated in April 2020) [API change].
- Export handleTaskListItem (Albert Krewinkel) [API change].
- Change defaultUserDataDirs to defaultUserDataDir [API change]. We determine what is the default user data directory by seeing whether the XDG directory and/or legacy directory exist.
BibTeX writer:
- BibTeX writer: use doclayout and doctemplate. This change allows bibtex/biblatex output to wrap as other formats do, depending on the settings of --wrap and --columns (#7068).
CSL JSON writer:
- Output [] if no references in input, instead of raising a PandocAppError as before.
Docx writer:
- Use getTimestamp instead of getCurrentTime for timestamp. Setting SOURCE_DATE_EPOCH will allow reproducible builds.
EPUB writer:
- Use getTimestamp instead of getCurrentTime for timestamp. Setting SOURCE_DATE_EPOCH will allow reproducible builds (#7093). This does not suffice to fully enable reproducible in EPUB, since a unique id is still being generated for each build.
- Support belongs-to-collection metadata (#7063) (Nick Berendsen).
JATS writer:
- Escape special chars in reference elements (Albert Krewinkel). Prevents the generation of invalid markup if a citation element contains an ampersand or another character with a special meaning in XML.
Jira writer:
- Use Span identifiers as anchors (Albert Krewinkel).
- Use {noformat} instead of {code} for unknown languages (Albert Krewinkel). Code blocks which are not marked as a language supported by Jira are rendered as preformatted text via {noformat} blocks.
LaTeX writer:
- Adjust hypertargets to beginnings of paragraphs (#7078). Use \vadjust pre so that the h...

Assets 10

23 Jan 23:18

jgm

2.11.4

54d8c69

pandoc 2.11.4

Click to expand changelog

Add biblatex, bibtex as output formats (closes #7040).
Recognize more extensions as markdown by default (#7034): mkdn, mkd, mdwn, mdown, Rmd.
Implement defaults file inheritance (#6924, David Martschenko). Allow defaults files to inherit options from other defaults files by specifying them with the following syntax: defaults: [list of defaults files or single defaults file].
Fix infinite HTTP requests when writing epubs from URL source (#7013). Due to a bug in code added to avoid overwriting the cover image if it had the form fileX.YYY, pandoc made an endless sequence of HTTP requests when writing epub with input from a URL.
Org reader:
- Allow multiple pipe chars in todo sequences (Albert Krewinkel, #7014). Additional pipe chars, used to separate “action” state from “no further action” states, are ignored. E.g., for the following sequence, both DONE and FINISHED are states with no further action required: #+TODO: UNFINISHED | DONE | FINISHED.
- Restructure output of captioned code blocks (Albert Krewinkel, #6977). The Div wrapper of code blocks with captions now has the class “captioned-content”. The caption itself is added as a Plain block inside a Div of class “caption”. This makes it easier to write filters which match on captioned code blocks. Existing filters will need to be updated.
- Mark verbatim code with class verbatim (Dimitri Sabadie, #6998).
LaTeX reader:
- Handle filecontents environment (#7003).
- Put contents of unknown environments in a Div when raw_tex is not enabled (#6997). (When raw_tex is enabled, the whole environment is parsed as a raw block.) The class name is the name of the environment. Previously, we just included the contents without the surrounding Div, but having a record of the environment’s boundaries and name can be useful.
Mediawiki reader:
- Allow space around storng/emph delimiters (#6993).
New module Text.Pandoc.Writers.BibTeX, exporting writeBibTeX and writeBibLaTeX. [API change]
LaTeX writer:
- Revert table line height increase in 2.11.3 (#6996). In 2.11.3 we started adding \addlinespace, which produced less dense tables. This wasn’t an intentional change; I misunderstood a comment in the discussion leading up to the change. This commit restores the earlier default table appearance. Note that if you want a less dense table, you can use something like \def\arraystretch{1.5} in your header.
EPUB writer:
- Adjust internal links to identifiers defined in raw HTML sections after splitting into chapters (#7000).
- Recognize Format "html4", Format "html5" as raw HTML.
- Adjust internal links to images, links, and tables after splitting into chapters. Previously we only did this for Div and Span and Header elements (see #7000).
Ms writer:
- Don’t justify text inside table cells.
JATS writer:
- Use <element-citation> if element_citations extension is enabled (Albert Krewinkel).
- Fix citations (Albert Krewinkel, #7018). By default we use formatted citations.
- Ensure that <disp-quote> is always wrapped in <p> (#7041).
Markdown writer:
- Cleaned up raw formats. We now react appropriately to gfm, commonmark, and commonmark_x as raw formats.
RST writer:
- Fix bug with dropped content from inside spans with a class in some cases (#7039).
Docx writer:
- Handle table header using styles (#7008). Instead of hard-coding the border and header cell vertical alignment, we now let this be determined by the Table style, making use of Word’s “conditional formatting” for the table’s first row. For headerless tables, we use the tblLook element to tell Word not to apply conditional first-row formatting.
Commonmark writer:
- Implement start number on ordered lists (#7009). Previously they always started at 1, but according to the spec the start number is respected.
HTML writer:
- Fix implicit_figure at end of footnotes (#7006).
ConTeXt template: Remove \setupthinrules from default template. The width parameter this used is not actually supported, and the command didn’t do anything.
Text.Pandoc.Extensions:
- Add Ext_element_citations constructor (Albert Krewinkel).
Text.Pandoc.Citeproc.BibTeX: New unexported function writeBibtexString.
Text.Pandoc.Citeproc:
- Use finer grained imports (Albert Krewinkel).
- Factor out and export getStyle [API change].
- Export getReferences [API change, #7106].
- Factor out getLang.
Text.Pandoc.Parsing: modify gridTableWith' for headerless tables. If the table lacks a header, the header row should be an empty list. Previously we got a list of empty cells, which caused an empty header to be emitted instead of no header. In LaTeX/PDF output that meant we got a double top line with space between.
ImageSize: use viewBox for SVG if no length, width attributes (#7045). This change allows pandoc to extract size information from more SVGs.
Add simple default.nix.
Use commonmark 0.1.1.3.
Use citeproc 0.3.0.5.
Update default CSL to use latest chicago-author-date.csl.
CONTRIBUTING.md: add note on GNU xargs.
MANUAL.txt:
- Update description of -L/--lua-filter.
- Document use of citations in note styles (#6828).

Assets 8

30 Dec 00:43

jgm

2.11.3.2

dcbd8d3

pandoc 2.11.3.2

Click to expand changelog

HTML reader: use renderTags’ from Text.Pandoc.Shared (Albert Krewinkel). A side effect of this change is that empty <col> elements are written as self-closing tags in raw HTML blocks.
Asciidoc writer: Add support for writing nested tables (#6972, timo-a). Asciidoc supports one level of nesting. If deeper tables are to be written, they are omitted and a warning is issued.
Docx writer: fix nested tables with captions (#6983). Previously we got unreadable content, because docx seems to want a <w:p> element (even an empty one) at the end of every table cell.
Powerpoint writer: allow arbitrary OOXML in raw inline elements (Albert Krewinkel). The raw text is now included verbatim in the output. Previously is was parsed into XML elements, which prevented the inclusion of partial XML snippets.
LaTeX writer: support colspans and rowspans in tables (#6950, Albert Krewinkel). Note that the multirow package is needed for rowspans. It is included in the latex template under a variable, so that it won’t be used unless needed for a table.
HTML writer: don’t include p tags in CSL bibliography entries (#6966). Fixes a regression in 2.11.3.
Add meta-description variable to HTML templates (#6982). This is populated by the writer by stringifying the description field of metadata (Jerry Sky). The description meta tag will make the generated HTML documents more complete and SEO-friendly.
Citeproc: fix handling of empty URL variables (DOI, etc.). The linkifyVariables function was changing these to links which then got treated as non-empty by citeproc, leading to wrong results (e.g. ignoring nonempty URL when empty DOI is present). See jgm/citeproc#41.
Use citeproc 0.3.0.3. Fixes an issue in author-only citations when both an author and translator are present, and an issue with citation group delimiters.
Require texmath 0.12.1. This improves siunitx support in math, fixes bugs with \*mod family operators and arrays, and avoids italicizing symbols and operator names in docx output.
Ensure that the perl interpreter used for filters with .pl extension (wuffi).
MANUAL: note that textarea content is never parsed as Markdown (Albert Krewinkel).

Assets 8

19 Dec 01:48

jgm

2.11.3.1

37ba5d5

pandoc 2.11.3.1

Click to expand changelog

Added some missing files to extra-source-files and data files, so they are included in the sdist tarball. Closes #6961. Cleaned up some extraneous data and test files, and added a CI check to ensure that the test and data files included in the sdist match what is in the git repository.
Use citeproc 0.3.0.1, which avoids removing nonbreaking space at the end of the initialize-with attribute. (Some journals require nonbreaking space after initials, and this makes that possible.)

Assets 8

18 Dec 08:01

jgm

2.11.3

ec0ec4a

pandoc 2.11.3

Click to expand changelog

With --bibliography (or bibliography in metadata), a URL may now be provided, and pandoc will fetch the resource. In addition, if a file path is provided and it is not found relative to the working directory, the resource path will be searched (#6940).
Add sourcepos extension for commonmark, gfm, commonmark_x (#4565). With the sourcepos extension set set, data-pos attributes are added to the AST by the commonmark reader. No other readers are affected. The data-pos attributes are put on elements that accept attributes; for other elements, an enlosing Div or Span is added to hold the attributes.
Change extensions for commonmark_x: replace auto_identifiers with gfm_auto_identifiers (#6863). commonmark_x never actually supported auto_identifiers (it didn’t do anything), because the underlying library implements gfm-style identifiers only. Attempts to add the auto_identifiers extension to commonmark will now fail with an error.
HTML reader:
- Split module into several submodules (Albert Krewinkel). Reducing module size should reduce memory use during compilation.
- Support advanced table features (Albert Krewinkel): block level content in captions, row and colspans, body headers, row head columns, footers, attributes.
- Disable round-trip testing for tables. Information for cell alignment in a column is not preserved during round-trips (Albert Krewinkel).
- Allow finer grained options for tag omission (Albert Krewinkel).
- Simplify list attribute handling (Albert Krewinkel).
- Pay attention to lang attributes on body element (#6938). These (as well as lang attributes on the html element) should update lang in metadata.
- Retain attribute prefixes and avoid duplicates (#6938). Previously we stripped attribute prefixes, reading xml:lang as lang for example. This resulted in two duplicate lang attributes when xml:lang and lang were both used. This commit causes the prefixes to be retained, and also avoids invald duplicate attributes.
Commonmark reader:
- Refactor specFor.
- Set input name to "" to avoid clutter in sourcepos output.
Org reader:
- Parse #+LANGUAGE into lang metadata field (#6845, Albert Krewinkel).
- Preserve targets of spurious links (#6916, Albert Krewinkel). Links with (internal) targets that the reader doesn’t know about are converted into emphasized text. Information on the link target is now preserved by wrapping the text in a Span of class spurious-link, with an attribute target set to the link’s original target. This allows to recover and fix broken or unknown links with filters.
DocBook reader:
- Table text width support (#6791, Nils Carlson). Table width in relation to text width is not natively supported by docbook but is by the docbook fo stylesheets through an XML processing instruction, <?dbfo table-width="50%"?>.
LaTeX reader:
- Improve parsing of command options (#6869, #6873). In cases where we run into trouble parsing inlines til the closing ], e.g. quotes, we return a plain string with the option contents. Previously we mistakenly included the brackets in this string.
- Preserve center environment (#6852, Igor Pashev). The contents of the center environment are put in a Div with class center.
- Don’t parse \rule with width 0 as horizontal rule. These are sometimes used as spacers in LaTeX.
- Don’t apply theorem default styling to a figure inside (#6925). If we put an image in italics, then when rendering to Markdown we no longer get an implicit figure.
Dokuwiki reader:
- Handle unknown interwiki links better (#6932). DokuWiki lets the user define his own Interwiki links. Previously pandoc reacted to these by emitting a google search link, which is not helpful. Instead, we now just emit the full URL including the wikilink prefix, e.g. faquk>FAQ-mathml. This at least gives users the ability to modify the links using filters.
Markdown writer:
- Properly handle boolean values in writing YAML metadata (#6388).
- Ensure that a new csl-block begins on a new line (#6921). This just looks better and doesn’t affect the semantics.
RST writer:
- Better image handling (#6948). An image alone in its paragraph (but not a figure) is now rendered as an independent image, with an alt attribute if a description is supplied. An inline image that is not alone in its paragraph will be rendered, as before, using a substitution. Such an image cannot have a “center”, “left”, or “right” alignment, so the classes align-center, align-left, or align-right are ignored. However, align-top, align-middle, align-bottom will generate a corresponding align attribute.
Docx writer:
- Keep raw openxml strings verbatim (#6933, Albert Krewinkel).
- Use Content instead of Element. This allows us to inject raw OpenXML into the document without reparsing it into an Element, which is necessary if you want to inject an open tag or close tag.
- Fix bullets/lists indentation, so that the first level is slightly indented to the right instead of right on the margin (cholonam).
- Support bold and italic in “complex script” (#6911). Previously bold and italics didn’t work properly in LTR text. This commit causes the w:bCs and w:iCs attributes to be used, in addition to w:b and w:i, for bold and italics respectively.
ICML writer:
- Fix image bounding box for custom widths/heighta (Mauro Bieg, #6936).
LaTeX writer:
- Improve table spacing (#6842, #6860). Remove the \strut that was added at the end of minipage environments in cells. Replace \tabularnewline with \\ \addlinespace.
- Improve calculation of column spacing (#6883).
- Extract table handling into separate module (Albert Krewinkel).
- Fix bug with nested csl- display Spans (#6921).
- Improve longtable output (#6883). Don’t create minipages for regular paragraphs. Put width and alignment information in the longtable column descriptors.
OpenDocument writer:
- Support for table width as a percentage of text width (#6792, Nils Carson).
- Implement Div and Span ident support (#6755, Nils Carson). Spans and Divs containing an ident in the Attr will become bookmarks or sections with idents in OpenDocument format.
- Add two extensions, xrefs_name and xrefs_number (#6774, Nils Carlson). Links to headings, figures and tables inside the document are substituted with cross-references that will use the name or caption of the referenced item for xrefs_name or the number for xrefs_number. For the xrefs_number to be useful heading numbers must be enabled in the generated document and table and figure captions must be enabled using for example the native_numbering extension. In order for numbers and reference text to be updated the generated document must be refreshed.
JATS writer:
- Support advanced table features (Albert Krewinkel).
- Support author affiliations (#6687, Albert Krewinkel).
Docbook writer:
- Use correct id attribute consistently (Jan Tojnar). DocBook5 should always use xml:id instead of id.
- Handle admonition titles better (Jan Tojnar). Docbook reader produces a Div with title class for <title> element within an “admonition” element. Markdown writer then turns this into a fenced div with title class attribute. Since fenced divs are block elements, their content is recognized as a paragraph by the Markdown reader. This is an issue for Docbook writer because it would produce an invalid DocBook document from such AST – the <title> element can only contain “inline” elements. Handle this special case separately by unwrapping the paragraph before creating the <title> element.
- Add XML namespaces to top-level elements (#6923, Jan Tojnar). Previously, we only added xmlns attributes to chapter elements, even when running with --top-level-division=section. These namespaces are now added to part and section elements too, when they are the selected top-level divisions. We do not need to add namespaces to documents produced with --standalone flag, since those will already have xmlns attribute on the root element in the template.
HTML writer:
- Fix handling of nested csl- display spans (#6921). Previously inner Spans used to represent CSL display attributes were not rendered as div tags as intended.
EPUB writer:
- Include title page in landmarks (#6919). Note that the toc is also included if --toc is specified.
- Add frontmatter type on body element for nav.xhtml (#6918).
EPUB templates: use preserveAspectRatio=“xMidYMid” for cover image (#6895, Shin Sang-jae). This change affects both the epub2 and the epub3 templates. It avoids distortion of the cover image by requiring that the aspect ratio be preserved.
LaTeX template:
- Include csquotes package if csquotes variable set.
- Put back amssymb. We need it for checkboxes in todo lists, and maybe for other things. In this location it seems compatible with the cases that prompted #6469 and PR #6762.
- Disable language-specific shorthands in babel (#6817, #6887). Babel defines “shorthands” for some languages, and these can produce unexpected results. For example, in Spanish, 1.22 gets rendered as 122, and et~al. as etal. One would think that babel’s shorthands=off option (which we were using) would disable these, but it doesn’t. So we remove shorthands=off and add some code that redefines the shorthands macro. Eventually this will be fixed in babel, I hope, and we can revert to something simpler.
JATS template: allow array of persistent institute ...

Assets 8

19 Nov 23:01

jgm

2.11.2

0c8ab8a

pandoc 2.11.2

Click to expand changelog

Default to using ATX (##-style) headings for Markdown output (#6662, Aner Lucero). Previously we used Setext (underlined) headings by default for levels 1–2.
Add option --markdown-headings=atx|setext, and deprecate --atx-headers (#6662, Aner Lucero).
Support markdown-headings in defaults files.
Fix corner case in YAML metadata parsing (#6823). Previously YAML metadata would sometimes not get recognized if a field ended with a newline followed by spaces.
--self-contained: increase coverage (#6854). Previously we only self-contained attributes for certain tag names (img, embed, video, input, audio, source, track, section). Now we self-contain any occurrence of src, data-src, poster, or data-background-image, on any tag; and also href on link tags.
Markdown reader:
- Fix detection of locators following in-text citations. Prevously, if we had @foo [p. 33; @bar], the p. 33 would be incorrectly parsed as a prefix of @bar rather than a suffix of @foo.
- Improve period suppression algorithm for citations in notes in note citation styles (#6835).
- Don’t increment stateNoteNumber for example list references. This helps with #6836 (a bug in which example list references disturb calculation of citation note number and affect when ibid is triggered).
LaTeX reader:
- Move getNextNumber from Readers.LaTeX to Readers.LaTeX.Parsing.
- Fix negative numbers in siunitx commands. A change in pandoc 2.11 broke negative numbers, e.g. \SI{-33}{\celcius} or \num{-3}. This fixes the regression.
DocBook reader: drop period in formalpara title and put it in a div with class formalpara-title, so that people can reformat with filters (#6562).
Man reader: improve handling of .IP (#6858). We now better handle .IP when it is used with non-bullet, non-numbered lists, creating a definition list. We also skip blank lines like groff itself.
Bibtex reader: fall back on en-US if locale for LANG not found. This reproduces earlier pandoc-citeproc behavior (jgm/citeproc#26).
JATS writer:
- Wrap all tables (Albert Krewinkel). All <table> elements are put inside <table-wrap> elements, as the former are not valid as immediate child elements of <body>.
- Move Table handling to separate module (Albert Krewinkel). Adds two new unexported modules: Text.Pandoc.Writers.JATS.Types, Text.Pandoc.Writers.JATS.Table.
Org writer:
- Replace org #+KEYWORDS with #+keywords (TEC). As of ~2 years ago, lower case keywords became the standard (though they are handled case insensitive, as always).
- Update org supported languages and identifiers according to the current list contained in https://orgmode.org/worg/org-contrib/babel/languages/index.html (TEC).
Only use filterIpynbOutput if input format is ipynb (#6841). Before this change content could go missing from divs with class output, even when non-ipynb was being converted.
When checking reader/writer name, check base name now that we permit extensions on formats other than markdown.
Text.Pandoc.PDF: Fix changePathSeparators for Windows (#6173). Previously a path beginning with a drive, like C:\foo\bar, was translated to C:\/foo/bar, which caused problems. With this fix, the backslashes are removed.
Text.Pandoc.Logging: Add constructor ATXHeadingInLHS constructor to LogMessage [API change].
Fix error that is given when people specify doc output (#6834, gison93).
LaTeX template: add a \break after parbox in CSLRightInline. This should fix spacing problems between entries with numeric styles. Also fix number of params on CSLReferences.
reveal.js template: Put quotes around controlsLayout, controlsBackArrows, and display, since these require strings. Add showSlideNumber, hashOneBasedIndex, pause.
Use citeproc 0.2. This fixes a bug with title case around parentheses.
pandoc.cabal: remove ‘static’ flag. This isn’t really necessary and can be misleading (e.g. on macOS, where a fully static build isn’t possible). cabal’s new option --enable-executable-static does the same. On stack you can add something like this to the options for your executable in package.yaml:
```
ld-options: -static -pthread
```
Remove obsolete bibutils flag setting in linux/make_artifacts.sh.
Manual:
- Correct link-citation -> link-citations.
- Add a sentence about pagetitle for HTML (#6843, Alex Toldaiev).
INSTALL.md: Remove references to pandoc-citeproc (#6857).
CONTRIBUTING: describe hlint and how it’s used (#6840, Albert Krewinkel).

Assets 8

08 Nov 04:44

jgm

2.11.1.1

cfb017c

pandoc 2.11.1.1

Click to expand changelog

Citeproc: improve punctuation in in-text note citations (#6813). Previously in-text note citations inside a footnote would sometimes have the final period stripped, even if it was needed (e.g. on the end of ‘ibid’).
Use citeproc 0.1.1.1. This improves the decision about when to use ibid in cases where citations are used inside a footnote (#6813).
Support nocase spans for csljson output.
Require latest commonmark, commonmark-extensions. This fixes a bug with autolink_bare_uris and commonmark.
LaTeX reader: better handling of \\ inside math in table cells (#6811).
DokuWiki writer: translate language names for code elements and improve whitespace (#6807).
MediaWiki writer: use syntaxhighlight tag instead of deprecated source for highlighted code (#6810). Also support startFrom attribute and numberLines.
Lint code in PRs and when committing to master (#6790, Albert Krewinkel).
doc/filters.md: describe technical details of filter invocations (#6815, Albert Krewinkel).

Assets 8

03 Nov 22:42

jgm

2.11.1

360fa88

pandoc 2.11.1

Click to expand changelog

DocBook Reader: fix duplicate bibliography bug (#6773, Nils Carlson).
HTML reader:
- Parse contents of iframes (#6770).
- Parse inline svg as image unless raw_html is set in the reader (in which case the svg is passed through as raw HTML) (#6770).
LaTeX reader:
- Fix bug parsing macro arguments (#6796). If \cL is defined as \mathcal{L}, and \til as \tilde{#1}, then \til\cL should expand to \tilde{\mathcal{L}}, but pandoc was expanding it to \tilde\mathcal{L}. This is fixed by parsing the arguments in “verbatim mode” when the macro expands arguments at the point of use.
- Properly support optional (cite) argument for \blockquote from csquotes (#6802).
LaTeX writer: Improved calculation of table column widths. We now have LaTeX do the calculation, using \tabcolsep. So we should now have accurate relative column widths no matter what the text width. The default template has been modified to load the calc package if tables are used.
HTML writer: Fix duplicate “class” attribute for table rows (Andy Morris).
Text.Pandoc.Filter: allow shorter YAML representation of Citeproc (Albert Krewinkel). The map-based YAML representation of filters expects type and path fields. The path field had to be present for all filter types, but is not used for citeproc filters. The field can now be omitted when type is “citeproc”, as described in the MANUAL.
Text.Pandoc.Error: Add PandocBibliographyError constructor for PandocError [API change]. This ensures that bibliography parsing errors generate messages that include the bibliography file name – otherwise it can be quite mysterious where it is coming from.
Citeproc: properly handle csl field with data: URI (#6783). This is used with the JATS writer, so this fixes a regression in pandoc 2.11 with JATS output and citeproc.
Allow citation-abbreviations in defaults file.
JATS templates: ensure jats_publishing output is valid (Albert Krewinkel).
LaTeX template: Fix CSLRightInline, so that it does not run over the right margin.
HTML template: default CSS tweaks (Mauro Bieg and John MacFarlane).
- Fix margin before codeblock
- Add monobackgroundcolor variable, making the background color and padding of code optional.
- Ensure that backgrounds from highlighting styles take precedence over monobackgroundcolor
- Remove list markers from TOC
- Add margin-bottom where needed
- Remove italics from blockquote styling
- Change borders and spacing in tables to be more consistent with other output formats
- Style h5, h6
- Set font-size for print media to 12pt.
- Reduce interline space.
- Reduce interparagraph space.
- Reduce line width.
- Remove the special line-height: 1 for table cells.
- Remove the special line-height for pre.
- Ensure that there is a bit more space before a heading than after.
- Slightly reduced space after title header.
- Add CSS example to MANUAL
man template: Change comment that triggers tbl from .\"t to '\" t, as specified in groff_man(7) (#6803).
Use latest commonmark, commonmark-extensions. This fixes a bug with nested blocks in footnotes with the footnote extension to commonmark. See jgm/commonmark-hs#63.
Citeproc: use comma for in-text citations inside footnotes. When an author-in-text citation like @foo occurs in a footnote, we now render it with: AUTHOR NAME + COMMA + SPACE + REST. Previously we rendered: AUTHOR NAME + SPACE + "(" + REST + ")". This gives better results. Note that normal citations are still rendered in parentheses.
Use latest citeproc:
- citeproc no longer capitalizes notes, so we do it in pandoc when appropriate.
- Closes #6783.
Clarify manual on --track-changes (#6801).
Add doc/jats.md to document pandoc’s handling of JATS (#6794, Albert Krewinkel).
Fix code example in lua-filters.md (#6795).

Assets 8

23 Oct 18:50

jgm

2.11.0.4

90f25c7

pandoc 2.11.0.4

Use latest citeproc (2.11.0.3, closes #6765). This fixes a problem with author-in-text citations for references including both an author and an editor. Previously, both were included in the text, but only the author should be.
With --citeproc, ensure that the final period is removed when citations that occur in notes in note-based styles get put in parentheses. See jgm/citeproc#20.
Commonmark writer: fix regression with fenced divs (#6768). Starting with 2.10.1, fenced divs no longer render with HTML div tags in commonmark output. This is a regression due to our transition from cmark-gfm. This commit fixes it.
Normalize rewritten image paths with --extract-media (#6761). This change will avoid mixed paths like this one when --extract-media is used with a Word file: ![](C:\Git\TIJ4\Markdown/media/image30.wmf). Instead we’ll get ![](C:\Git\TIJ4\Markdownmediaimage30.wmf).
Modify --version output. Use space more efficiently and report the citeproc and ipynb versions, along with skylighting, texmath, and pandoc-types. Drop the word “default” before “user data directory.”
DocBook reader: bibliomisc and anchor support (#6754, Nils Carlson). Also ensure that bibliodiv without a title no longer results in an empty Header.
ConTeXt template: adds \setupinterlinespace to fonts larger than normal (#6763, Denis Maier).
LaTeX template: Do not load amssymb if not needed (#6469, Angelo Peronio). See https://tex.stackexchange.com/a/549938.
Relax upper bound on hslua, allow hslua-1.3.* (Albert Krewinkel).
Use latest sylighting, with support for groovy.
MANUAL:
- Improve explanation of “indent” variable (#6767, Cyrus Yip).
- Remove org from list of input formats supporting raw_tex (#6753, Nick Fleisher).
- Document that –html-q-tags requires the smart extension on the reader (#6766).

Assets 8

16 Oct 04:07

jgm

2.11.0.2

6f60755

pandoc 2.11.0.2

Click to expand changelog

Fix handling of xdata in bibtex/biblatex bibliographies (#6752).
Fix some small typos in the API documentation (#6751, Michael Hoffmann).
Require citeproc 0.1.0.2. This fixes a regression from pandoc-citeproc involving spacing between components of a reference in certain styles (e.g. cell.csl).
Fix typos in comments, doc strings, error messages, and tests (Albert Krewinkel, #6738).

Assets 8

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: jgm/pandoc

pandoc 2.12

pandoc 2.11.4

pandoc 2.11.3.2

pandoc 2.11.3.1

pandoc 2.11.3

pandoc 2.11.2

pandoc 2.11.1.1

pandoc 2.11.1

pandoc 2.11.0.4

pandoc 2.11.0.2