Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

JATS4M: spec for <element-citation> #238

Closed
michael opened this issue Aug 10, 2017 · 60 comments
Closed

JATS4M: spec for <element-citation> #238

michael opened this issue Aug 10, 2017 · 60 comments
Assignees

Comments

@michael
Copy link
Member

michael commented Aug 10, 2017

elife publication-type scielo publication-type pattern
book ✓ book person-group[person-group-type=author], person-group[person-group-type=editor]?, edition, year, month?, day?, chapter-title, source?, publisher-loc?, publisher-name?, fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?, pub-id[pub-id-type='pmid']?, pub-id[pub-id-type='isbn']?
preprint ✓ - person-group[person-group-type='author'], article-title, source, year, month?, day?, pub-id[pub-id-type='doi']?
clinicaltrial ✓ - person-group[person-group-type=sponsor], year, month?, day?, article-title, source?,pub-id[pub-id-type='doi']?
confproc ✓ confproc person-group[person-group-type='author'],article-title, year, month?, day?, conf-name, source?, fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?
data ✓ database person-group[person-group-type='author'], data-title, source, year, month?, day?, pub-id[pub-id-type="accession, ark, doi, archive"]?
periodical ✓ - person-group[person-group-type=author], article-title, year, month, day, source, fpage?, lpage?, page-range?, volume?, pub-id[pub-id-type='doi']?
journal ✓ journal person-group[person-group-type=author], person-group[person-group-type=editor]?, year, month?, day?, article-title, source?, volume?, issue?, fpage?, lpage?, page-range?, elocation-id?, comment?, pub-id[pub-id-type='doi']?, pub-id[pub-id-type='pmid']?
patent ✓ patent person-group[person-group-type='inventor'], collab[type=assignee]?, article-title, year?, month?, day?, source?, patent[country='xxx']?
report ✓ report person-group[person-group-type='author'], source, year, month?, day?, publisher-name, publisher-loc?, pub-id[pub-id-type='isbn']?
software software person-group[person-group-type='author'], year, month?, day?, source?, version?, publisher-loc?, publisher-name?, pub-id[pub-id-type='doi']?
thesis thesis person-group[person-group-type='author'], year, month?, day?, article-title, publisher-name, publisher-loc?, pub-id[pub-id-type='doi']?
web webpage person-group[person-group-type='author'], article-title, uri, year, month, day, publisher-loc?, source?
- legal-doc -
- newspaper -

<person-group>

name, collab

<name>

surname, given-names?, prefix?, suffix?

HTML Preview in Texture

journal

${authors}. ${editors}. ${year}. ${article-title}. ${source} <strong>${vol}</strong>:${fpage}-${lpage}.

book

${source} (${edition}). ${editors}, editors. (${year}) ${publisher-loc}: ${publisher}.

chapter in book

${article-title}. ${authors} (${year}). In: ${editors}, editors. <emphasized>${source}<emphasized>.
${publisher-loc}: ${publisher}.  pp. $(first-page)-$(last-page).

software

${data-title}, ${version}. ${authors} (${year}). ${source}, editors. ${source}, ${publisher}, ${publisher-loc}

preprint

${data-title}. ${authors} (${year}). ${source}. ${ext-link}

clinicaltrial

${article-title}. ${sponsors}  (sponsors) (${year}). ${ext-link}

confproc

${article-title}. ${authors} (${year}). ${conf-name}, ${source}, $(fpage?}-${lpage}. ${doi}

data

${data-title}. ${authors} (${year}). ${source}, ${pub-id}

patent

${article-title}. ${inventor} (${year}). ${assignee}, ${country}

periodical

${article-title}. ${author} (${string-date}). <emphasis> ${source}<emphasis>, p. $(first-page}-$(last-page}

report

${article-title}. ${author} (${string-date}). <emphasis>${source}<emphasis>, p. $(first-page}-$(last-page}

thesis

${article-title}. ${author} (${year}). ${publisher-name}, ${publisher-loc}

This spec will define what the Texture user interface will support for editing. Note that this spec is incomplete, please comment on what you would like to have included and why. I will update this issue according to your comments

@fabiobatalha
Copy link
Collaborator

@michael

There are some journals that do not specify the article title in the references. Some of these mandatory elements may should not be mandatory. Eg: http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0100-40422016000300267&lng=en&nrm=iso&tlng=pt

@michael
Copy link
Member Author

michael commented Aug 11, 2017

@fabiobatalha is this more a legacy problem? Or are there good reasons not to specify an article title?

Making article-title an optional field is possible but I would assume that in most cases when it is empty this is by accident and then we should warn the user?

@fabiobatalha
Copy link
Collaborator

@michael There are no good reasons not to specify an article title, but this is our reality.

It is not a legacy problem, it could be described as a cultural issue, a standard (bad or good) used by some journals or knowledge areas, etc.

I'm not sure it is a common behavior. What I know is, we still have to deal with this kind of issues with our journals/editors. This is the newest issue of a journal (http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0100-40422017000600706&lng=en&nrm=iso&tlng=pt).

I don't know if our colleagues have the same issue in their collections.

@Melissa37
Copy link
Collaborator

@fabiobatalha eLife has produced some Schematron that enforce some pretty strict rules for eLife:
https://github.com/elifesciences/reference-schematron

We'd love for this schematron to be used by more people so we can collectively help increase quality of references - we're using it in the production process, so authors can be queried for missing data before final delivery and publication.

We've also reviewed our archive and come up with a list of 12 different @publication-type.

In addition to the tags you have listed for publication-type="journal", we'd add:

  • name (surname/given-names), suffix and collab to go within person-group
  • person-group needs @person-group-type="author"
  • we'd request an @iso-8601-date on the year - this allows publishers to list a,b,c suffixes on dates within the field (where it's the same authors and date for multiple refs) and allowing a proper date to still be machine readable.
  • We allow comment, but restrict it to only allow this text "In press"
  • pub-id with @pub-id-type of doi or pmid

Book (publication-type="book")

  • name (surname/given-names), suffix and collab to go within person-group
  • person-group needs @person-group-type with choice of author or editor
  • we'd request an @iso-8601-date on the year - this allows publishers to list a,b,c suffixes on dates within the field (where it's the same authors and date for multiple refs) and allowing a proper date to still be machine readable.
  • add edition
  • pub-id with @pub-id-type of doi, pmid, isbn

Some more to be added:

Data (publication-type="data")

  • name (surname/given-names), suffix and collab to go within person-group
  • person-group needs @person-group-type of author, compiler or curator
  • we'd request an @iso-8601-date on the year - this allows publishers to list a,b,c suffixes on dates within the field (where it's the same authors and date for multiple refs) and allowing a proper date to still be machine readable.
  • add data title, source
  • pub-id with @pub-id-type of these allowed values:
    Allowed values:
    "accession" ((bioinformatics) a unique identifier given to a DNA or protein sequence record for tracking the sequence record and the associated sequence over time in a data repository.)
    "archive" (assigned by an archive or other repository)
    "ark" (Archival Resource Key — a Uniform Resource Locator (URL) containing the word "ark" that is a multi-purpose identifier for information objects of any type)
    "doi"
    -@assigning-authority - Not required but nice to have (eg FigShare, PDB)
  • [URL of website]

I won't list out all the requirements in this ticket as I can point you to the info if decided it is of use. These are the rest of our reference types:
Patents (publication-type="patent")
Preprints (publication-type="preprint")
website
periodical
technical report
clinical trials
Theses

@fabiobatalha
Copy link
Collaborator

Hello @Melissa37,

We also have a schematron (https://github.com/scieloorg/packtools) to validate XML files against SciELO PS rules. We will look your schematron to check if we can use some of your validations.

Our reference types allowed are:

publication-type Description
book Refers to books. It can also represent only a part or chapter of a book.
confproc Identifies documents related to scientific events: minutes, annals, results, proceedings, conventions and conferences, among others.
database Specifies databases..
journal Characterized by serial publications, edited in successive units/ issues, with numerical and / or chronological designations, and destined to be continued indefinitely.
patent Refers to patents.
report Identifies a unique technical report, normally authored by an institution.
software Refers to a software distributed on CDs, DVDs, online media, USB devices, etc.
thesis Characterized by monographs, dissertations or theses to attain an academic degree (doctorate, master's, baccalaureate, bachelor's degree, etc.).
webpage Identifies information on websites and blogs.
legal-doc Refers to legal norms.
newspaper Identifies newspaper articles.
other Specifies types not found in the SciELO PS.

I will ask @gustavofonseca to check your schematron and find the differences and similarities between our schematron and yours. So we can set the common rules between both. I believe it could be our first requirements for Texture references metadata.

After that we may discuss together to try to mitigate the diferences between both schematrons. Defining a base standard for this consortium.

Thoughts?

@Melissa37
Copy link
Collaborator

Hi there

That sounds like a great idea, thanks.
We'll take a look at the extra reference types you have that we don't.
We'd really be interested in that as we'd planned for our Schematron to have wider impact than just eLife and we also wanted it to be used to help improve the quality of references (and references provided to Crossref)

@gustavofonseca
Copy link
Collaborator

Schematron produced by SciELO deals with a very specific use case that is to validate the XML representation structure of an article that has already gone through the editorial process. As far as I can see -- @Melissa37 correct me if I am wrong -- the use case of eLife is different and more convenient for the application of strict rules regarding the minimum set of metadata of the bibliographic references as there is the direct interaction with the author Of the article.

I believe that the Schematron pointed out by @Melissa37 will be very useful when SciELO happens to interact directly with the authors, but I do not think it will replace our need for more relaxed schemes for the case I mentioned earlier.

@michael
Copy link
Member Author

michael commented Aug 28, 2017

@Melissa37 @fabiobatalha great input, thank you! I'm already working on getting journal + book pubtypes implemented.

regarding pub-id with @pub-id-type of doi or pmid: I guess this is only valid for LifeScience journals? Are there other pub-id types you can think of? If it's just a hand-full i would just show them all in the user interface and you fill them out if you know them. But if there's like 10 different pub-id types then we would need to go for a more generic (but also more complicated) user interface (add and remove pub-id, select pub-id type etc.)

@michael
Copy link
Member Author

michael commented Aug 28, 2017

we'd request an @iso-8601-date on the year - this allows publishers to list a,b,c suffixes on dates within the field (where it's the same authors and date for multiple refs) and allowing a proper date to still be machine readable.

Can you give an example? I don't like that we'd have two different values. This would complicate the user interface (two input fields). Still not sure I understand... what would 2007a mean? :)

@michael
Copy link
Member Author

michael commented Aug 28, 2017

name (surname/given-names), suffix and collab to go within person-group

@Melissa37 can you give an example for group authors with a collab element present in person group? Not sure yet how we want to model this in Texture.

@michael
Copy link
Member Author

michael commented Aug 28, 2017

Any objections on making surname and given-names required fields?

@Melissa37
Copy link
Collaborator

@michael regarding pub-id with @pub-id-type of doi or pmid: I guess this is only valid for LifeScience journals?
DOI should be humanities and other too, as well as datasets (Figshare/DataCite etc) - should have very broad coverage now
Yes, PMID would be only for journals indexed on PubMed
Are there other pub-id types you can think of?
Yes - PMCID (any open content published on PMC), ISBNs for books
If it's just a hand-full i would just show them all in the user interface and you fill them out if you know them. But if there's like 10 different pub-id types then we would need to go for a more generic (but also more complicated) user interface (add and remove pub-id, select pub-id type etc.)
I would not introduce lots at this stage for authors to fill in as I don't think they will know many more, but for eLife for Data we're seeing accession and archive (generic catch all) too

@Melissa37
Copy link
Collaborator

@gustavofonseca re: I believe that the Schematron pointed out by @Melissa37 will be very useful when SciELO happens to interact directly with the authors, but I do not think it will replace our need for more relaxed schemes for the case I mentioned earlier.
Yes I agree.

@Melissa37
Copy link
Collaborator

@michael we'd request an @iso-8601-date on the year - this allows publishers to list a,b,c suffixes on dates within the field (where it's the same authors and date for multiple refs) and allowing a proper date to still be machine readable.
Can you give an example? I don't like that we'd have two different values. This would complicate the user interface (two input fields). Still not sure I understand... what would 2007a mean? :)

Ref List:

xml
<!-- Data reference: NCBI BioProject: author is a collab pub-id-type="accession" assigning-authority="NCBI" example of 3 references with same author/year -->
    <ref id="bib45">
        <element-citation publication-type="data">
            <person-group person-group-type="author">
                <collab>The <italic>Shigella</italic> Genome Sequencing Consortium</collab>
            </person-group>
            <year iso-8601-date="2015">2015a</year>
            <data-title>Global Diversity of Shigella Species</data-title>
            <source>NCBI BioProject</source>
            <pub-id pub-id-type="accession" assigning-authority="NCBI"
                xlink:href="http://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB2846"
                >PRJEB2846</pub-id>
        </element-citation>
    </ref>
    <ref id="bib46">
        <element-citation publication-type="data">
            <person-group person-group-type="author">
                <collab>The <italic>Shigella</italic> Genome Sequencing Consortium</collab>
            </person-group>
            <year iso-8601-date="2015">2015b</year>
            <data-title>Shigella sonnei and flexneri from around the world</data-title>
            <source>NCBI BioProject</source>
            <pub-id pub-id-type="accession" assigning-authority="NCBI"
                xlink:href="http://www.ncbi.nlm.nih.gov/bioproject/204320"
                >PRJEB2460</pub-id>
        </element-citation>
    </ref>
    <ref id="bib47">
        <element-citation publication-type="data">
            <person-group person-group-type="author">
                <collab>The <italic>Shigella</italic> Genome Sequencing Consortium</collab>
            </person-group>
            <year iso-8601-date="2015">2015c</year>
            <data-title>Shigella flexneri from around the world</data-title>
            <source>NCBI BioProject </source>
            <pub-id pub-id-type="accession" assigning-authority="NCBI"
                xlink:href="http://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB2542"
                >PRJEB2542</pub-id>
        </element-citation>
    </ref>

In-text citation:

xml
To describe the basic principles of BCV, we adopt the formalism of Bayesian graphs (The <italic>Shigella</italic> Genome Sequencing Consortium, 2015c</xref>) where a generative model is described by nodes or circles...

We use Harvard referencing style - name and year, not number (Vancouver)(http://site.uit.no/english/writing-style/citationstyles/).
Does that make sense?

@Melissa37
Copy link
Collaborator

@michael
name (surname/given-names), suffix and collab to go within person-group
@Melissa37 can you give an example for group authors with a collab element present in person group? Not sure yet how we want to model this in Texture.

Is this what you mean?:

xml
<ref id="bib47">
        <element-citation publication-type="data">
            <person-group person-group-type="author">
                <collab>The <italic>Shigella</italic> Genome Sequencing Consortium</collab>
            </person-group>
            <year iso-8601-date="2015">2015c</year>
            <data-title>Shigella flexneri from around the world</data-title>
            <source>NCBI BioProject </source>
            <pub-id pub-id-type="accession" assigning-authority="NCBI"
                xlink:href="http://www.ncbi.nlm.nih.gov/bioproject/?term=PRJEB2542"
                >PRJEB2542</pub-id>
        </element-citation>
    </ref>

Or is this linked to:
#239
In references, I would not be providing details of authors who are members of a group which is the author. I would only do that for the main metadata for an article, not bibs

@Melissa37
Copy link
Collaborator

@michael Any objections on making surname and given-names required fields?
No!

@jamcandrews
Copy link

@michael Any objections on making surname and given-names required fields?
okay with surnames but please don't make given-names required. we have a not insignificant number of authors who use one name only.
(~J. McAndrews, AIP Publishing -- really wishing I could participate more!)

@michael
Copy link
Member Author

michael commented Sep 14, 2017

@Melissa37 I've modelled book and journal mostly after your guidelines. They were really helpful! I wanted to ask if we could move on and specify all the 12 pub types you use at eLife so we can work towards having them signed off by each party.

You think you could translate each of the pub types into a formal spec like this one for book?

person-group, edition, year, chapter-title, source?, publisher-loc?, publisher-name?,
fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?,
pub-id[pub-id-type='pmid']?, pub-id[pub-id-type='isbn']?

That would great! :)

@Melissa37
Copy link
Collaborator

HI Michael

Sure I can do this - do I have a deadline/ :-)

Also, we are very strict so @gustavofonseca might want to edit it!

@michael
Copy link
Member Author

michael commented Sep 15, 2017

No hard deadline. :) Starting out strict is always good and then we have each org go through it and sign it off. Let's try to seek for the best possible solution and consider making updates to the existing guidelines. We will all benefit from a uniform tagging style for element-citations in the long run!

Thank you very much for helping @Melissa37 !

@michael
Copy link
Member Author

michael commented Sep 15, 2017

@Melissa37 maybe you could also specify an HTML rendering template for each pub type (see ' HTML Preview in Texture ' above). We need that to have a preview in Texture when we display the refs in the reference list at the end of the document.

@Melissa37
Copy link
Collaborator

Melissa37 commented Sep 20, 2017

publication-type="software"

person-group[person-group-type='author'], person-group[person-group-type='curator']
data-title, year, source?, version?, publisher-loc?, publisher-name?, ext-link[ext-link-type="uri"]?
publication-type="preprint"

person-group[person-group-type='author'], 
article-title, source, year, ext-link[ext-link-type="uri"]?, pub-id[pub-id-type='doi']?
publication-type="clinicaltrial"

person-group, year, article-title, ext-link[ext-link-type="uri"], source?
publication-type="confproc"

person-group[person-group-type='author'], 
article-title, year, ext-link[ext-link-type="uri"], conf-name, source?, fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?
publication-type="data"

person-group[person-group-type='author'], 
data-title, year, source, pub-id[pub-id-type="accession, ark, doi, archive"]
ext-link[ext-link-type="uri"]?, pub-id[assigning-authority="XXX"]?
publication-type="patent"

person-group[person-group-type='inventor'], 
article-title, year, source?, patent country,
ext-link[ext-link-type="uri"], person-group[person-group-type='assignee']
publication-type="periodical"

string-date, person-group, article-title, source, fpage, lpage, volume?
publication-type="report"

 person-group, source, year, publisher-name, publisher-loc?, pub-id[pub-id-type='isbn']?
publication-type="thesis"

person-group, year, article-title, publisher-name, publisher-loc?, pub-id[pub-id-type='doi']?, ext-link[ext-link-type="uri"]?
publication-type="web"

person-group, article-title, date?, ext-link[ext-link-type="uri"],date-in-citation, publisher-loc?, source?

@Melissa37
Copy link
Collaborator

Melissa37 commented Sep 20, 2017

@michael regarding HTML Preview in Texture - it is so subjective and there are so many different styles and people seem to really care (whereas I personally don't!).
The key thing has always been brevity because of the print legacy and we at eLife do not suffer from that.
However, I will provide you with the style we came up with for our HTML view:

Only comment on journal - vol not issue please (or both)

book

${source} (${edition}). ${editors}, editors. (${year}) ${publisher-loc}: ${publisher}.
xml
chapter in book

${article-title}. ${authors} (${year}). In: ${editors}, editors. <emphasized>${source}<emphasized>.
${publisher-loc}: ${publisher}.  pp. $(first-page)-$(last-page).

xml
software

${data-title}, ${version}. ${authors} (${year}). ${source}, editors. ${source}, ${publisher}, ${publisher-loc}

xml
preprint

${data-title}. ${authors} (${year}). ${source}. ${ext-link}

xml
clinicaltrial

${article-title}. ${sponsors}  (sponsors) (${year}). ${ext-link}

xml
confproc

${article-title}. ${authors} (${year}). ${conf-name}, ${source}, $(fpage?}-${lpage}. ${doi}
xml
data

${data-title}. ${authors} (${year}). ${source}, ${pub-id}
xml
patent

${article-title}. ${inventor} (${year}). ${assignee}, ${country}
xml
periodical

${article-title}. ${author} (${string-date}). <emphasis> ${source}<emphasis>, p. $(first-page}-$(last-page}
xml
report

${article-title}. ${author} (${string-date}). <emphasis>${source}<emphasis>, p. $(first-page}-$(last-page}
xml
publication-type="report"

${source}, ${author} (${year}). ${publisher-name}, ${publisher-loc}

xml
thesis

${article-title}. ${author} (${year}). ${publisher-name}, ${publisher-loc}

@michael
Copy link
Member Author

michael commented Sep 20, 2017

Thanks so much for taking the time to put this together @Melissa37. Now we have something to start with!

Please add your comments everyone!

@Melissa37
Copy link
Collaborator

Ah, I see now! Thanks! Please ignore the second one!

@fabiobatalha
Copy link
Collaborator

fabiobatalha commented Nov 1, 2017

A preview of the required data for each reference type.

It seems the publication-types used by SciELO and eLife are totally compatible, we will provide the pattern for legal-doc and newspaper.

elife publication-type scielo publication-type pattern
book book person-group[person-group-type=author], person-group[person-group-type=editor]?, edition, year, month?, day?, chapter-title, source?, publisher-loc?, publisher-name?, fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?, pub-id[pub-id-type='pmid']?, pub-id[pub-id-type='isbn']?
preprint - person-group[person-group-type='author'], article-title, source, year, ext-link[ext-link-type="uri"]?, pub-id[pub-id-type='doi']?
clinicaltrial - person-group, year, article-title, ext-link[ext-link-type="uri"], source?
confproc confproc person-group[person-group-type='author'],article-title, year, ext-link[ext-link-type="uri"], conf-name, source?, fpage?, lpage?, page-range?, elocation-id?, pub-id[pub-id-type='doi']?
data database person-group[person-group-type='author'], data-title, year, source, pub-id[pub-id-type="accession, ark, doi, archive"], ext-link[ext-link-type="uri"]?, pub-id[assigning-authority="XXX"]?
periodical - string-date, person-group, article-title, source, fpage, lpage, volume?
journal journal person-group[person-group-type=author], person-group[person-group-type=editor]?, year, month?, day?, article-title, source?, volume?, issue?, fpage?, lpage?, page-range?, elocation-id?, comment?, pub-id[pub-id-type='doi']?, pub-id[pub-id-type='pmid']?
patent patent person-group[person-group-type='inventor'], article-title, year, source?, patent country, ext-link[ext-link-type="uri"], person-group[person-group-type='assignee']
report report person-group, source, year, publisher-name, publisher-loc?, pub-id[pub-id-type='isbn']?
software software person-group[person-group-type='author'], person-group[person-group-type='curator'], data-title, year, source?, version?, publisher-loc?, publisher-name?, ext-link[ext-link-type="uri"]?
thesis thesis person-group, year, article-title, publisher-name, publisher-loc?, pub-id[pub-id-type='doi']?, ext-link[ext-link-type="uri"]?
web webpage person-group, article-title, date?, ext-link[ext-link-type="uri"],date-in-citation, publisher-loc?, source?
- legal-doc person-group[person-group-type=author], year, month?, day?, article-title?, source, volume?, issue?, fpage?, lpage?, page-range?, elocation-id?, publisher-name, publisher-loc
- newspaper person-group[person-group-type=author]?, year, month?, day?, article-title?, source, volume?, issue?, fpage?, lpage?, elocation-id?, comment?

@michael
Copy link
Member Author

michael commented Nov 1, 2017

Thanks for writing this up @fabiobatalha! Starting now with implementing this. :)

Using this table as the official spec. Thanks!

Oh, only one question to @fabiobatalha and @Melissa37: Instead of ext-link could we use the uri tag? It would be more consistent to use ext-link only within text elements, not in structured elements such as element-citation.

@mfenner
Copy link

mfenner commented Nov 1, 2017

Thanks @fabiobatalha. The DataCite implementation is pretty close to this. Some of the questions I have include:

  • data-title for software? I used source.
  • article-title for many things, including webpage?
  • publisher-loc I typically see this only for book and maybe conferences. You want to support this for all @publication-type?
  • why not optionally include month and day for data?
  • what is the pattern for using ext-link? Do we for example need it if we have pub-id for data?

@michael
Copy link
Member Author

michael commented Nov 1, 2017

Agree with @mfenner on having month + day available everywhere as an optional field. Also would like to get rid of ext-link. I will make changes in table (in the issue text) so it reflects the latest implementation implementation plan in Texture.

@fabiobatalha
Copy link
Collaborator

fabiobatalha commented Nov 1, 2017

  • Agree on having month + day available everywhere as an option field
  • Agree on having source instead of data-title for software
  • Not sure if publisher-loc should be considered only for book and conferences, The ABNT standard ask for the publication-loc for journals. It is not frequently used in SciELO but some journals use the ABNT standard in Brazil.
  • No objections for uri instead of ext-link

@michael
Copy link
Member Author

michael commented Nov 1, 2017

What is date-in-citation in web? Shouldn't this be year,month?,day? also?

@fabiobatalha
Copy link
Collaborator

@michael
Copy link
Member Author

michael commented Nov 1, 2017

Ah didn't first spot it in the JATS spec. I'm not sure we should use that tag, as it seems to be there for some visual clues (like freeform text such as "updated on March 23 2017"). If we captured that semantically with year+month+day (all required) we could use the same structure as in the other elements.

Do we need multiple dates? Like updated-at and accessed-on? Maybe accessed-on is enough? Hard to proof when a website actually has been updated anyways.

I'm sure how to deal with pub-id[assigning-authority="XXX"] would be easier to come up with a list of supported assigning authorities rather than letting the user enter something here? Shouldn't we just use a pub-id[type=doi] here instead. Hard to support any kind of id provider, and we will never be done. :/

@fabiobatalha
Copy link
Collaborator

@michael
for us the is enough, not sure about the others.

@michael
Copy link
Member Author

michael commented Nov 1, 2017

I will make a few decisions (will change the spec in the issue text) so I can come up with an initial implementation fast. We can then start a round of user testing and adjust things were needed. All ok with that?

@michael
Copy link
Member Author

michael commented Nov 1, 2017

@Melissa37 do you have one or two elife examples for the periodical pubtype? Wonder if we could avoid using string-date there.

@michael
Copy link
Member Author

michael commented Nov 1, 2017

@Melissa37 (sorry for the email flood...) For patents JATS suggests using the collab element like so <collab collab-type="assignee">Hospira, Inc.</collab>. In your spec there's a person-group[person-group-type=assignee] used. Not sure which one is better, would need to see some real eLife examples for patents... if you got time. :) thx!

@michael
Copy link
Member Author

michael commented Nov 2, 2017

I've added examples to the Texture kitchen sink file (one per publication type):

https://github.com/substance/texture/blob/ref-editing/data/kitchen-sink.xml#L133-L373

Please review and suggest changes via "Edit file on Github".

@pigeonm
Copy link

pigeonm commented Nov 2, 2017

Hi,

Currently at Erudit we do not finely tag citations, we separate each citation, but the citation itself is a text string. Though we’re willing to implement a better tagging in near future, you're way farther than us into the reflexion surrounding the tagging of citations, both with the publication types and the fine tagging of these different types of citation.

At Erudit, we do not deal directly with the authors and can’t impose a citation style on them. The citations we have to process may sometimes be incomplete (no given-names, no title, etc.), we have no control over it, we mark the reference as they are in the source files. We don’t clean/edit/complete citations to make the citations valid. Knowing this, our needs are more oriented toward greater flexibility. Thus, we are of the opinion that everything in the element-citation tag must be optional.

Also, it seems important to us to preserve the possibility of being able to include untagged text in citations, something that the element-citation tag does not allow (but allowed in the mixed-citation tag which seems to be also permitted in JATS4R). The support of mixed-citation tag in Texture and the UX of choosing between mixed-citation and element-citation might be discussed in a dedicated issue

@pigeonm
Copy link

pigeonm commented Nov 2, 2017

About the publication ID, What is suggested here (DOI, ISBN, PMID,PMCID) covers our needs.

@pigeonm
Copy link

pigeonm commented Nov 2, 2017

About iso-8601 (date). The use of an iso-8601 covers our needs, especially if it allow a, b, c suffixes on dates.

Since we think everything should be optional, we are OK with having month and day available for every type of publication as an option field.

@Melissa37
Copy link
Collaborator

Melissa37 commented Nov 8, 2017

Sorry, long entry - catch up:

@michael RE: Oh, only one question to @fabiobatalha and @Melissa37: Instead of ext-link could we use the uri tag? It would be more consistent to use ext-link only within text elements, not in structured elements such as element-citation.

<ext-link> is explicitly stating it's a link to an external file or resource, and I think that is why we chose to use it in references over <uri>. You cannot just use uri to create a link either, you still have to use the @xLink:href attribute. But if everyone else has considered this item and agree on <uri> I won't stand in the way!

@michael re date-in-citation - we require this and a publication date currently, but I think I agree that the publication date is ambiguous - is it the original page published date, or most recent update?
The date in citation is when our author accessed it. This is meant to cover issues where the content on a website changes over time and when someone cites it that information is accurate and correct at that time. I think I'd be OK with having a date when it was accessed without the source's publication date, but it would need to be clear the date being recorded is the accessed date.

Regarding software citations - I have a concern that to simplify things the software citations will go down a different route to what the data citations working group went down and will change the use/meaning of tags used, so in the larger picture will be less useful. I would be happy to review data and software citations together to try and keep them in synch.

@michael re I'm sure how to deal with pub-id[assigning-authority="XXX"] would be easier to come up with a list of supported assigning authorities rather than letting the user enter something here? Shouldn't we just use a pub-id[type=doi] here instead. Hard to support any kind of id provider, and we will never be done. :/
I think this is a nice-to-have and we should drop it from the initial spec. Writers won't want to put this level of detail into a reference they are manually entering and if there is a url, it's usually clear what site it comes from from the URL - although I know this is not machine-readable accuracy, sorry!

@michael RE do you have one or two elife examples for the periodical pubtype? Wonder if we could avoid using string-date there.

<ref id="bib40">
        <element-citation publication-type="periodical">
            <person-group person-group-type="author">
                <name>
                    <surname>Schwartz</surname>
                    <given-names>J</given-names>
                </name>
            </person-group>
            <string-date><month>September</month> <day>9</day>, <year iso-8601-date="1993-09-09">1993</year></string-date>
            <article-title>Obesity affects economic, social status</article-title>
            <source>The Washington Post</source>
            <fpage>A1</fpage>
            <lpage>A4</lpage>
        </element-citation>
    </ref>

Yes I agree re avoiding string date - we're using it for presentation only.

@michael Re: For patents JATS suggests using the collab element like so Hospira, Inc.. In your spec there's a person-group[person-group-type=assignee] used. Not sure which one is better, would need to see some real eLife examples for patents

Below's an example from eLife. I guess we went with person group to keep in in the same structure as the "inventor". I guess there can only be one assignee, where as there could be multiple inventors? I am not sure! We did not have any examples or other evidence so went with our basic structure which is to put everyone/org into a person-group

xml
<ref id="bib32">
        <element-citation publication-type="patent">
            <person-group person-group-type="inventor">
                <name>
                    <surname>Patterson</surname><given-names>JB</given-names>
                </name>
                <name>
                    <surname>Lonergan</surname><given-names>DG</given-names>
                </name>
                <name>
                    <surname>Flynn</surname><given-names>GA</given-names>
                </name>
                <name>
                    <surname>Qingpeng</surname><given-names>Z</given-names>
                </name>
                <name>
                    <surname>Pallai</surname><given-names>PV</given-names>
                </name>
            </person-group>
            <person-group person-group-type="assignee">
                <collab>Mankind Corp</collab>
            </person-group>
            <year iso-8601-date="2011">2011</year>
            <article-title>IRE-1alpha inhibitors</article-title>
            <source>United States patent</source>
            <patent country="United States">US20100941530</patent>
            <ext-link ext-link-type="uri" xlink:href="http://europepmc.org/patents/PAT/US2011065162">http://europepmc.org/patents/PAT/US2011065162</ext-link>
        </element-citation>
    </ref>

@pigeonm RE The support of mixed-citation tag in Texture and the UX of choosing between mixed-citation and element-citation might be discussed in a dedicated issue
I agree that needs a lot of thought as a user of Texture does not and should not have to care about XML, so should not know the difference between the reference tagging options :-)

@michael michael changed the title JATS4R: spec for <element-citation> JATS4M: spec for <element-citation> Nov 8, 2017
@michael
Copy link
Member Author

michael commented Nov 10, 2017

@pigeonm thank you for your update! I think we could solve this by providing a publication-type="raw" type. There you can just enter an annotated article title which will be rendered as is.
We would not support mixed-citations in Texture as they are in our opinion a bad concept (mixing structure and style). Anyways, I hope we we could move towards structured info where-ever possible. At Erudit do you think you could work towards that goal, at least with some journals?

Do you have time to do a conference call with Oliver and I so we can better assess the requirements of Erudit for references and affiliations?

@Melissa37 thanks for your update too. Will answer when I get time!

@fabiobatalha
Copy link
Collaborator

@michael

I think the publication-type is more related to the type of the citation rather than the XML structure inside the element-citation. Creating a publication-type=raw could bring misinterpretations regarding to the meaning of this attribute.

@michael and @pigeonm

Maybe in the Erudit case it should be important to have at least the meaning/type of the bibliographic reference content.

SciELO deal with the element-citation and mixed-citation in a combined way, where the element-citation have the structured data and mixed-citation have the citation in the way it was written by the author. Of course, this scenario deals with the content already produced by the author without texture and could also be considered a hacking of the JAT's structure to deal with a SciELO internal detail of implementation.

So maybe we could find another approach to deal with untagged references.

ex:

<ref-list>
    <ref id="B01">
        <label>1</label>
        <mixed-citation>ARRETCHE, M. Federalism and territorial equality: a contradiction in terms? Dados, Rio de Janeiro, v. 5, n. 02, 2010 . Disponível em: &lt;http://socialsciences.scielo.org.</mixed-citation>
        <element-citation publication-type="journal">
            <person-group person-group-type="author">
                <name>
                    <surname>ARRETCHE</surname>
                    <given-names>M</given-names>
                </name>
            </person-group>
            <article-title>Federalism and territorial equality: a contradiction in terms?</article-title>
            <source>Dados</source>
            <publisher-loc>Rio de Janeiro</publisher-loc>
            <volume>5</volume>
            <issue>02</issue>
            <year>2010</year>
            <ext-link ext-link-type="uri" xlink:href="http://socialsciences.scielo.org">http://socialsciences.scielo.org</ext-link>
        </element-citation>
    </ref>
<ref-list>

@Melissa37

I agree with your point of view regarding to the meaning of the ext-link in references. For SciELO both solutions will work once we will need to produce the SciELO PS XML from the Texture XML. Now we are using the ext-link so it will be better for us, but not mandatory once it is feasible to convert from uri to ext-link.

@pigeonm
Copy link

pigeonm commented Nov 16, 2017

@fabiobatalha How do you see this working inside Texture ? Would it be required to always have the citation tagged both with mixed-citation and element-citation ? Would Texture gives the possibility to the user to only tag a citation only with mixed-citation in a case where we only have the text string of the citation and where a citation lookup tool would have returned no results ?

@fabiobatalha
Copy link
Collaborator

fabiobatalha commented Nov 21, 2017

@pigeonm

One usage and implementation could be.

  1. Texture works the way it is being planed until now. It means, keep using the element-citation for the identification of each element. Avoiding the mixing of structure and style.

  2. Include a new field "styled-citation" in the interface to allow the users/author to include the reference as it was written by the author, following journal/area standards, patterns, look and feel, etc..... It could be considered as a kind of template per cite.

  3. The content of this field "styled-citation" could be used to include a mixed-citation element inside the ref element when exporting the document to XML. Te user could choose to include or not the mixed-citation element with the contend of the styled-citation field while exporting his article to XML.

  4. If the user have the styled-citation field and all the other fields of the element-citation filled, could be feasible to create an XML exportation with the mixed-citation; automatically tagging the mixed-citation content according to the filled fields.

  5. Finally, the user could decide to export the XML with element-citation, mixed-citation (without tagged data), mixed-citation (with tagged data) or both.

@michael michael self-assigned this Dec 2, 2017
@michael michael added this to the Alpha 4 milestone Dec 4, 2017
@michael
Copy link
Member Author

michael commented Dec 5, 2017

Thank you everyone for the fruitful discussions here. I've consolidated all the specification work into a single document: https://github.com/substance/texture/blob/master/docs/JATS4M.md

I will close this issue now, as I think we created a good base. Please open new issues anytime if you have questions, suggestions how to improve the current spec.

@michael michael closed this as completed Dec 5, 2017
@katrinleinweber
Copy link

katrinleinweber commented Jun 5, 2018

The document moved to substance/dar/DarArticle.md, I presume.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

9 participants