Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HTML input: formats.html.import_set #245

Closed
wants to merge 3 commits into from
Closed

Conversation

klothe
Copy link

@klothe klothe commented Jul 5, 2016

formats.html.import_set imports a Dataset from an HTML table.

It assumes there is exactly one table in the HTML document. (An import_book function would be more appropriate to handle multiple tables, but that doesn't seem very useful.) Headers are taken from elements if they exist.

BeautifulSoup (bs4) is used for HTML parsing, assuming it's OK to add that dependency. I didn't add a copy of bs4 to the "packages" directory since I'm not sure if that is necessary, or why it was done with other dependencies. Alternately, it could be added as a requirement in setup.py. Which way would be best?

@timofurrer
Copy link
Member

Please resolve the merge conflicts. Thanks 🎉

@claudep
Copy link
Contributor

claudep commented Jul 2, 2023

I worked on similar functionality, but trying without any external dependency: #555.

@claudep claudep closed this Jul 2, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants