Skip to content

Commit

Permalink
pkp#492 Initial scaffold of XML Production guide
Browse files Browse the repository at this point in the history
  • Loading branch information
NateWr committed Oct 1, 2020
1 parent ee092a7 commit cbea90f
Show file tree
Hide file tree
Showing 6 changed files with 184 additions and 0 deletions.
6 changes: 6 additions & 0 deletions xml-production/en/SUMMARY.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
# Summary

* [Introduction](.)
* [Convert Files](./convert)
* [Copyedit and Typeset XML](./edit)
* [Publish and Distribute](./publish)
91 changes: 91 additions & 0 deletions xml-production/en/convert.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,91 @@
---
title: Convert - XML Production - OJS
description: How to convert journal articles prepared in Microsoft Word, Google Docs or LibreOffice to JATS XML.
---

# Convert to JATS XML

Scholarly articles are often written using software that is not compatible with JATS XML, such as Microsoft Word, Google Docs, and LibreOffice. These files must be converted to JATS XML.

The DocxConverter plugin for OJS can convert these documents when they have been prepared correctly. This guide will describe how to install the plugin, how to prepare the document, and how to convert it to JATS XML.

> System administrators may want to try other tools to convert documents to XML. These often require technical assistance to deploy, but in some cases they may provide a more accurate conversion. These tools include ...
{:.tip}

## Install the DocxConverter Plugin

... Step-by-step with screenshots or gifs...

## Prepare the Document

Limitations of DocxConverter (things not supported), need to prepare the document in advance, advice on training editorial staff

The article's title, abstract, contributors, and other metadata should be stripped from the document. In OJS, this metadata (the frontmatter and backmatter of a JATS XML document) is handled separately from the main body of the article. The document should only include the main body of the article, and the references and citations.

... Links to example documents in MS Word and Google Docs...

### Paragraphs

All paragraphs must have the "default" or "normal" style applied in the editor.

... screenshot ...

### Headings

All headings must have one of the heading styles applied in the editor. Each section, such as the Introduction or Conclusion, should start at **Heading 1**. Any sub-sections should use **Heading 2**.

... screenshot ...

### Lists

...

### Tables

All tables will be converted. However, table titles and captions can not be captured in the conversion process. After converting the document, table captions will appear as paragraphs and must be cleaned up when [copyediting and typesetting](./edit).

### Bold, Italic, Superscript, Subscript, Underlined and Strikethrough

All text must be formatted using the document editor's designated formatting tools, such as bold, italic, superscript, subscript, underlined and strikethrough. Any text formatting that indicates a font type or size will be removed when the document is converted in order to comply with the specifications of a JATS XML document.

> Never use underline in text formatting if it can be avoided. When the article is published to the web, underlined text will be confused for a link and users may think the document is "broken".
{:.warning}

### Figures, Charts and Images

All figures, charts and images must be inserted into the document in `JPEG` or `PNG` format. Some editors allow charts and graphs to be imported and displayed directly from data. These charts and graphs must be replaced with an image in `PNG` format in order to be converted correctly.

### References and Citations

References and citations must have the "bibliography" style applied in the editor. This style is not typically found alongside the other styles and must be applied by locating it in the editor's advanced style settings.

... screenshot or gif ...

> Google Docs does not support a bibliography style. When using Google Docs, references and citations will need to be entered manually when [copyediting and typesetting](./edit).
{:.warning}

References identified with the "bibliography" style will be imported as a `mixed-citation`. This means that the JATS XML will not be able to identify the distinct parts of a citation, such as the title, author or date. This information can be supplied manually when [copyediting and typesetting](./edit).

> The [Zotero plugin for Microsoft Word](https://www.zotero.org/support/word_processor_plugin_usage) will insert structured references so that the converter can identify the distinct parts of a citation. Citations inserted by Zotero do not need to be manually corrected when [copyediting and typesetting](./edit), which will save a lot of time.
{:.tip}

## Export the File

When the document is prepared it must be exported in `docx` format.

- In **Microsoft Word**, no additional action is needed to save the file in `docx` format.
- In **LibreOffice**, select File > Save as > Microsoft Word 2007-2013 XML (.docx).
- In **Google Docs**, select File > Download > Microsoft Word (.docx).

## Upload to OJS

... Step-by-step with screenshots or gifs...

## Convert to JATS XML

... Step-by-step with screenshots or gifs...


---

When the conversion is complete, editorial staff will need to [review and edit](./edit) the final document.
21 changes: 21 additions & 0 deletions xml-production/en/edit.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
---
title: Copyedit and Typeset - XML Production - OJS
---

# Copyedit and Typeset

...

## Install the Texture Plugin

...

## Edit the JATS XML File

...

## Limitations of Texture

---

Once the article is typeset it can be [published](./publish).
32 changes: 32 additions & 0 deletions xml-production/en/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,32 @@
---
title: XML Production - OJS
description: How to discover and use tools to convert, edit and publish journal articles in JATS XML.
---

# Introduction

This document is a guide to the tools and editorial procedures that can be used with Open Journal Systems to prepare and publish articles in [JATS XML](https://en.wikipedia.org/wiki/Journal_Article_Tag_Suite), a widely used standard for publishing and distributing scholarly articles. It is written for journal managers, managing editors or system administrators who want to produce XML as part of their publishing efforts.

JATS XML is a complex, technical specification for describing a journal in a machine-readable format. Typesetting an article in JATS XML can be a difficult process. However, once an article has been typeset in JATS XML it becomes possible to automate tasks, such as generating PDFs, depositing citation information, or data-mining research articles.

This guide will help you navigate the options that are available, understand what to expect from them, set up and configure the tools for a journal, and train editorial staff to get the best results.

## Production Workflow

The XML production workflow recommended in this guide mixes is a three-step process that includes automated and manual tasks.

1. The full text of an article's submission is [automatically converted](./convert) to JATS XML in a process that often introduces errors.
2. The JATS XML is then [manually copyedited and typeset](./edit) by editorial staff to correct errors and prepare the final proof.
3. When the article is [published](./publish), full-text HTML and PDFs are automatically generated from the JATS XML.

This guide will focus on free, scholar-owned tools that the Public Knowledge Project recommends in order to accomplish each step, along with the limitations of these tools and any editorial guidelines needed to use them correctly.

Many publishing services may be able to take advantage of other tools and services to perform some or all of these steps. For example, a publisher may contract with a professional typesetting service to perform the initial conversion process with fewer errors. Publishers with technical support staff may find it easier to implement their own tools for HTML and PDF generation. This guide will mention these alternatives but can not provide a recommendation.

Learn how to take an author's submission file and [convert it to JATS XML](./convert).

----

Copyright: Simon Fraser University holds the copyright for work produced by the Public Knowledge Project and has placed its documentation under a [Creative Commons Attribution 4.0 International License](https://creativecommons.org/licenses/by/4.0/).

[![](https://licensebuttons.net/l/by/4.0/88x31.png "Creative Commons Attribution 4.0 International")](https://creativecommons.org/licenses/by/4.0/)
26 changes: 26 additions & 0 deletions xml-production/en/publish.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
---
title: Publish - XML Production - OJS
description: How to publish from JATS XML to the web or a PDF.
---

# Publish with JATS XML

...

## Generate Full-Text

...JATSParser and eLens...

## Generate PDF Galley

...

## Deposit Citations

...

## A Note on Article Metadata

Publishing professionals familiar with JATS XML may be familiar with tools that handle metadata, such as an article's title, abstract and keywords, when converting, editing or publishing XML. In OJS, this metadata (the frontmatter and backmatter of a JATS XML document) is handled separately from the main body of the article.

For this reason, the conversion, editing and publishing tools described in this document focus on the article's main body and citations. Only when the article is published is the metadata and article body merged into a complete JATS XML document.
8 changes: 8 additions & 0 deletions xml-production/index.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
---
isBookIndex: true
---
# XML Production

How to discover and use tools to convert, edit and publish journal articles in JATS XML.

* [English](en/)

0 comments on commit cbea90f

Please sign in to comment.