Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluate sending quality info to the ELK stack #190

Open
gothub opened this issue Jan 30, 2019 · 1 comment
Open

Evaluate sending quality info to the ELK stack #190

gothub opened this issue Jan 30, 2019 · 1 comment
Assignees
Labels
metadig All issues related to metadig metadig-engine

Comments

@gothub
Copy link
Contributor

gothub commented Jan 30, 2019

Evaluate sending quality scores to the ElasticSearch Logstash Kibana (ELK) system used by the Making Data Count project. This would replace the current Solr index used by metadig-engine.

Changes to metadig-engine would involve:

  • updating metadig-worker so that data would be sent to a Solr server based on a configuration parameter. This capability would not be removed completely, to allow for flexibility in the future.
  • develop an alternative method of parsing the quality reports in order to extract and calculate the quality scores. Currently this is done using a variant of the DataONE d1_index_processor component. A likely alternative would be a Java class using XSLT, then calculating the scores manually.
  • metadig-worker would write out a file that contains the quality score info for each pid/authoritative MN/suite id.
  • a Filebeat instance would be available on a known port on a k8s cluster machine that would provide info from the data file metadig-worker writes
@gothub gothub added metadig-engine metadig All issues related to metadig labels Jan 30, 2019
@gothub gothub added this to the 2.1.0 milestone Jan 30, 2019
@gothub gothub self-assigned this Jan 30, 2019
@gothub
Copy link
Contributor Author

gothub commented Feb 20, 2019

After reviewing the metadig-engine code, it appears that the d1_index_processor code could still be used for parsing quality reports. A new Java class can be written to used the parsed and calculated info (quality scores, etc) to write out the JSON file that the Filebeat listener will read.

@gothub gothub modified the milestones: 2.1.0, 3.0 Apr 2, 2020
@jeanetteclark jeanetteclark removed this from the 3.0 milestone Jul 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
metadig All issues related to metadig metadig-engine
Projects
None yet
Development

No branches or pull requests

2 participants