Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Harvard Dataverse file retriever implementation for Heal SDK #11

Open
wants to merge 15 commits into
base: master
Choose a base branch
from

Conversation

piotrsenkow
Copy link

@piotrsenkow piotrsenkow commented Oct 2, 2024

Link to JIRA ticket if there is one: HP-1648

New Features

Nearly identical to the QDR File retriever, this heal sdk function, get_harvard_dataverse_files(), parses a list of metadata hosted in harvard dataverse, utilizing the same dataverse api that QDR adopted.

Breaking Changes

Bug Fixes

Improvements

Dependency updates

Deployment changes

@piotrsenkow piotrsenkow changed the title Harvard Dataverse file retriever implementation for Heal SDK draft:Harvard Dataverse file retriever implementation for Heal SDK Oct 2, 2024
@piotrsenkow piotrsenkow changed the title draft:Harvard Dataverse file retriever implementation for Heal SDK draft: Harvard Dataverse file retriever implementation for Heal SDK Oct 2, 2024
Copy link

github-actions bot commented Oct 2, 2024

The style in this PR agrees with black. ✔️

This formatting comment was generated automatically by a script in uc-cdis/wool.

@piotrsenkow piotrsenkow marked this pull request as draft October 2, 2024 16:56
@piotrsenkow piotrsenkow changed the title draft: Harvard Dataverse file retriever implementation for Heal SDK Harvard Dataverse file retriever implementation for Heal SDK Dec 12, 2024
@piotrsenkow piotrsenkow marked this pull request as ready for review December 12, 2024 19:26
This module includes an external file retriever function intended to be called
by the external_files_download module in the Gen3-SDK.

The retriever function sends requests to the Syracuse QDR API for downloading studies or files.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There are many places in this file that is referencing to Syracuse QDR or QDR, which should be replaced by Harvard Dataverse

return completed


def download_from_url(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this function (any many other functions in this file) are identical to their siblings in the QDR retriever code, we should move them to a shared utils.py module and use from there to reduce code reduncency

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants