Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrating stats about journals cited by Wikipedia #1957

Open
prototyperspective opened this issue May 8, 2022 · 0 comments
Open

Integrating stats about journals cited by Wikipedia #1957

prototyperspective opened this issue May 8, 2022 · 0 comments
Labels
enhancement some suggestions to improve Scholia

Comments

@prototyperspective
Copy link

prototyperspective commented May 8, 2022

Is your feature request related to a problem? Please describe.
It would be nice if this data was integrated into Scholia: https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Academic_Journals/Journals_cited_by_Wikipedia

Describe the solution you'd like
This would allow you to for example see:

  • the most popular/used/important/constructive/central journals on Wikipedia (to some degree also in general) for a specific scientific field that is also a Wikipedia category and
  • stats visualizing changes over time and/or other things
  • such stats on the Scholia pages for the journals (for example to identify the different focuses/specialization of different journals)

https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Academic_Journals/Journals_cited_by_Wikipedia/Popular1 only show journals most popular in Wikipedia overall but this isn't really useful*, it would more interesting to see how it looks like for https://en.wikipedia.org/wiki/Category:Agriculture or https://en.wikipedia.org/wiki/Category:Types_of_neoplasia for example and I don't know if the data available there currently suffices for this (it could get improved later on after it's embedded in a more general way).

*not just because this data may be "polluted" – for example with references in templates that are used in many pages or were added routinely by a bot, reducing the meaningfulness / invalidating the stats

Later on this could be improved with things like semantically evaluating the references similar to (or using) scite.ai (see also #1888) – this would allow you to cleanse the data a bit to address the issue mentioned above and to take into account that some references are only used for something trivial / minor while other references (each a review or paper) are covering lengthy statements with lots of substance in terms of meaning.

There are many use-cases that can be built or improved with this over time – this data becomes far more useful when integrated into Scholia.

Describe alternatives you've considered
Maybe this puts to much focus on journals. That wasn't my intention – it's just because that's the data that's available and more similar things could also be added later on.

Additional context
It could be a successor to Wikipedia-Cite-o-Meter and/or enable its revival.

Please see the discussion there for a few more details: wpoa/Wikipedia-Cite-o-Meter#12

See also: #1955

Here is a recent study about something related: "Dataset of first appearances of the scholarly bibliographic references on Wikipedia articles" in case this is relevant, or you're looking for inspiration for similar things one could add or use-cases of this.

I don't think this is currently an important issue but given that this data is already there, it may not the be too difficult to add it despite having quite some potential for usefulness.

@prototyperspective prototyperspective added the enhancement some suggestions to improve Scholia label May 8, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement some suggestions to improve Scholia
Projects
None yet
Development

No branches or pull requests

1 participant