Skip to content

Commit

Permalink
add new page on JATS dialects
Browse files Browse the repository at this point in the history
  • Loading branch information
castedo committed Apr 8, 2024
1 parent 229bbce commit 8d62ff5
Show file tree
Hide file tree
Showing 3 changed files with 74 additions and 2 deletions.
4 changes: 2 additions & 2 deletions docs/bdf/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -23,8 +23,8 @@ it is often temporarily stored in a file system directory.
However, for public long-term storage, BDF data is stored in a SWHID addressable Git tree or
an equivalent "directory" object in the Software Heritage Archive.

Inside BDF, there is a file named `article.xml` encoded in a subset of the JATS XML
format.
Inside BDF, there is a file named `article.xml` encoded in a subset of the
[JATS XML format](../jats.md).
This file format can informally be referred to as *Baseprint JATS XML*.
As of October 2023,
all applications that read Baseprint document snapshots encoded in BDF
Expand Down
69 changes: 69 additions & 0 deletions docs/jats.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,69 @@
JATS XML Open Dialects
======================

This page focuses on dialects of JATS XML that are highly relevant to open-source software
development.

JATS XML Specifications/Recommendations:

* <https://jats4r.niso.org/>
* <https://jats.nlm.nih.gov/>
* <https://www.niso.org/standards-committees/jats>


## JATS XML Open Corpora

A corpus of JATS XML articles embodies a particular "dialect" of JATS XML.

Archived Corpora:

* **PMC XML**: [PubMed Central (PMC) Open Access Subset](https://www.ncbi.nlm.nih.gov/pmc/tools/openftlist/)
* **eLife XML**: [eLife article XML repository](https://github.com/elifesciences/elife-article-xml)
* **?**: eLife Reviewed Preprints

Non-archived Corpora:

* **bioRxiv preprints**

## Open-source Full-Article XML Parsing

<table markdown>
<tr>
<th>XML Articles Parsed</th>
<th>Software</th>
<th>Language</th>
</tr>
<tr markdown>
<td>PubMed Central (HTML generation)</td>
<td>?</td>
<td></td>
</tr>
<tr markdown>
<td>bioRxiv JATS XML to eLife Reviewed Preprints</td>
<td><a href="https://github.com/stencila/encoda">Encoda</a></td>
<td>TypeScript</td>
</tr>
<tr markdown>
<td>American Mathematical Society (AMS) articles</td>
<td markdown>
[AMS texml-to-html](https://github.com/AmerMathSoc/texml-to-html)
</td>
<td>JavaScript</td>
</tr>
<tr markdown>
<td>Articles of journals using Open Journal Systems (OJS)</td>
<td><a href="https://github.com/pkp">OJS</a></td>
<td>PHP</td>
</tr>
<tr>
<td>Articles of some journals hosted by <a href="https://www.centre-mersenne.org">Centre Mersenne</a></td>
<td><a href="https://gricad-gitlab.univ-grenoble-alpes.fr/mathdoc/ptf">MathDoc platform</a></td>
<td>Python</td>
</tr>
<tr>
<td>?</td>
<td>eLife Lens and derivatives</td>
<td>JavaScript</td>
</tr>
</table>

3 changes: 3 additions & 0 deletions mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -18,10 +18,13 @@ markdown_extensions:
- attr_list
- admonition
- def_list
- md_in_html
# - tables
nav:
- Interoperability Forum: index.md
- Baseprint Document Format:
- bdf/index.md
- jats.md
watch:
- docs
- theme

0 comments on commit 8d62ff5

Please sign in to comment.