path: root/test/jats-reader.xml
Age  Commit message  Author
2024-11-06  Fix typos (#10349)  Andreas Deininger
2024-04-26  Fix some typos (#9638)  guqicun
Signed-off-by: guqicun <[email protected]>
2023-10-17  JATS reader: fix handling of alt-text (#9134)  Julia Diaz
Previously we were looking for an attribute that doesn't exist in JATS; alt-text is provided by a child element. Closes #9130.
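The difference between the two lookups can be sketched in Python with the standard library. This is an illustration only, not the reader's Haskell code; the sample fragment and the helper name `graphic_alt_text` are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical JATS fragment: alt-text is a child element of <graphic>,
# not an attribute on it.
SAMPLE = """
<fig id="f1">
  <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="plot.png">
    <alt-text>Scatter plot of x against y</alt-text>
  </graphic>
</fig>
"""


def graphic_alt_text(graphic):
    """Return the text of the <alt-text> child, or None if it is absent."""
    child = graphic.find("alt-text")
    if child is None or child.text is None:
        return None
    return child.text.strip()


graphic = ET.fromstring(SAMPLE).find("graphic")

# Looking for an attribute finds nothing, because JATS defines no such attribute.
attribute_lookup = graphic.get("alt-text")

# Looking for the child element finds the description.
child_lookup = graphic_alt_text(graphic)
```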
2023-09-04  JATS reader: Ignore `<processing-meta>` element (#9057)  Julia Diaz
2023-08-30  JATS reader: Multilevel support for `<permissions>` metadata (#9037)  Julia Diaz
This revises the earlier support for `<permissions>`: now metadata objects with multiple fields are created, matching the structure in JATS.
2023-08-10  Fix display of block elements in JATS reader (PR #8971)  Julia Diaz
A number of block elements, like disp-quote, list, and disp-formula, were always treated as inlines if appearing inside paragraphs, even if their usage warranted a separate block. The function isElementBlock has been refined to prevent this, and a number of specific parse cases have been added to parseBlock. The test file also received some minimal cleanup so that it passes XML validation against the JATS DTD 1.3 (it was not compliant with the current or any previous versions of JATS). Closes #8889.
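One way to picture the behavior described here is to split a paragraph's mixed content wherever a block-level child appears. The Python sketch below is an assumption-laden illustration, not the logic of isElementBlock or parseBlock: `BLOCK_TAGS` contains only the three elements the commit names as examples, and `split_paragraph` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Assumed set of block-level tags; the commit message names these three as examples.
BLOCK_TAGS = {"disp-quote", "list", "disp-formula"}


def split_paragraph(p):
    """Split a <p> element into ("para", text) and ("block", tag) pieces."""
    pieces = []
    buffer = p.text or ""
    for child in p:
        if child.tag in BLOCK_TAGS:
            # Flush the inline run collected so far, then emit the block.
            if buffer.strip():
                pieces.append(("para", " ".join(buffer.split())))
            pieces.append(("block", child.tag))
            buffer = child.tail or ""
        else:
            # Inline element: keep its text in the current paragraph run.
            buffer += "".join(child.itertext()) + (child.tail or "")
    if buffer.strip():
        pieces.append(("para", " ".join(buffer.split())))
    return pieces


p = ET.fromstring(
    "<p>Before <italic>the</italic> quote."
    "<disp-quote><p>Quoted text.</p></disp-quote>"
    "After the quote.</p>"
)
pieces = split_paragraph(p)
```

In this sketch the quote becomes its own piece between two paragraph runs instead of being folded into the surrounding inline text.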
2023-06-08  Add footer and multiple body parsing to JATS table reader (#8795)  Noah Malmed
Closes #8765.
2023-06-06  Improve title and label parsing in the JATS reader (#8840)  Noah Malmed
Closes #8718.
2023-04-05  Add rowspan, colspan and alignment to cells in jats table reader (#8726)  Noah Malmed
Partially addresses #8408
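As a rough illustration of the cell properties this commit refers to, the Python sketch below reads `rowspan`, `colspan`, and `align` from table cells. The helper name `cell_layout` and the fallback values are assumptions made for the example, not the reader's actual defaults.

```python
import xml.etree.ElementTree as ET


def cell_layout(cell):
    """Collect span and alignment attributes from a <td> or <th> element."""
    return {
        # Assumed fallbacks: a span of 1 and a placeholder alignment name.
        "rowspan": int(cell.get("rowspan", "1")),
        "colspan": int(cell.get("colspan", "1")),
        "align": cell.get("align", "default"),
    }


row = ET.fromstring(
    '<tr><td rowspan="2" align="center">A</td><td colspan="3">B</td></tr>'
)
layouts = [cell_layout(cell) for cell in row]
```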
2021-02-10  Add new unexported module T.P.XMLParser.  John MacFarlane
This exports functions that use xml-conduit's parser to produce an xml-light Element or [Content]. This allows existing pandoc code to use a better parser without much modification. The new parser is used in all places where xml-light's parser was previously used. Benchmarks show a significant performance improvement in parsing XML-based formats (especially ODT and FB2). Note that the xml-light types use String, so the conversion from xml-conduit types involves a lot of extra allocation. It would be desirable to avoid that in the future by gradually switching to using xml-conduit directly. This can be done module by module. The new parser also reports errors, which we report when possible. A new constructor PandocXMLError has been added to PandocError in T.P.Error [API change]. Closes #7091, which was the main stimulus. These changes revealed the need for some changes in the tests. The docbook-reader.docbook test lacked definitions for the entities it used; these have been added. And the docx golden tests have been updated, because the new parser does not preserve the order of attributes. Add entity defs to docbook-reader.docbook. Update golden tests for docx.
2019-12-30  Update JATS dtd (#6020)  Arfon Smith
The current DTD for the JATS writer template is for Journal Publishing (JATS-journalpublishing1.dtd), which does not permit ext-link as a valid child (https://jats.nlm.nih.gov/publishing/tag-library/1.1/element/publisher-name.html). This update modifies the default output template to be the less restrictive JATS archiving and interchange DTD which systems like PubMed use internally to represent their articles.
2018-03-05  Remove extraneous, significant whitespace in JATS writer output (#4335)  Nokome Bentley
This patch fixes some cases where the JATS writer was introducing semantically significant whitespace by indenting and wrapping tags. Note that the JATS spec has a content model for `<p>` tags of `(#PCDATA | ...`. Any tag where `#PCDATA` children are possible should not have any indentation. The same is true for `<th>`, `<td>`, `<term>`, `<label>`.
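Why indentation matters in mixed content can be shown with a small Python sketch. It does not use the JATS writer; the fragment and the helper name `text_content` are hypothetical, and the comparison only demonstrates that whitespace inside a `#PCDATA` context becomes part of the text.

```python
import xml.etree.ElementTree as ET

# The same markup written compactly and with pretty-printing indentation.
compact = "<p>H<sub>2</sub>O</p>"
indented = "<p>\n  H\n  <sub>2</sub>\n  O\n</p>"


def text_content(xml):
    """Concatenate all character data of the parsed element, whitespace included."""
    return "".join(ET.fromstring(xml).itertext())


compact_text = text_content(compact)
indented_text = text_content(indented)

# Even after collapsing whitespace runs, the indented form keeps spaces
# between the characters that the compact form joins together.
normalized_indented = " ".join(indented_text.split())
```

Here the compact form yields the single token `H2O`, while the indented form can only be normalized to `H 2 O`, which is a different string.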
2017-12-23  JATS reader: process author metadata.  John MacFarlane
2017-12-20  Add Basic JATS reader based on DocBook reader  Hamish Mackenzie