Skip to content

Issues: Unstructured-IO/unstructured

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

docx: access images in document order docx Related to Microsoft Word (.docx) file format enhancement New feature or request
#1615 by scanny was closed Jun 13, 2024
Add file information to document metadata good first issue Good for newcomers python Pull requests that update Python code
#230 by MthwRobinson was closed Dec 18, 2023
Add email header information to metadata python Pull requests that update Python code
#231 by MthwRobinson was closed Apr 4, 2023
ISD dictionaries are not JSON serializable if the filename has a POSIX path bug Something isn't working good first issue Good for newcomers python Pull requests that update Python code
#232 by MthwRobinson was closed Feb 16, 2023
Add filetype check based on file extension if libmagic isn't available good first issue Good for newcomers python Pull requests that update Python code
#228 by MthwRobinson was closed Feb 24, 2023
Install python-magic-bin instead of python-magic for Windows python Pull requests that update Python code
#234 by MthwRobinson was closed Dec 18, 2023
Sync detectron2 versions in docs documentation Improvements or additions to documentation
#241 by MthwRobinson was closed Apr 17, 2023
Too many ListItem's generated for some PDF's bug Something isn't working pdf
#242 by cragwolfe was closed Dec 18, 2023
Create a data connector for Google Drive enhancement New feature or request python Pull requests that update Python code
#244 by cragwolfe was closed Mar 7, 2023
partition_html incorrect encoding bug Something isn't working
#250 by asai95 was closed Feb 23, 2023
Create a data connector for Discord enhancement New feature or request
#253 by mallorih was closed Jun 21, 2023
Create a data connector for processing the biomedical literature enhancement New feature or request
#254 by ajjimeno was closed Mar 16, 2023
Create a data connector for Slack enhancement New feature or request
#252 by mallorih was closed Apr 17, 2023
Create a data connector for processing social media sites enhancement New feature or request json Related to partitioning JSON
#255 by ajjimeno was closed Dec 18, 2023
Create a new data connector for Notion enhancement New feature or request
#260 by ajjimeno was closed Nov 1, 2023
Create a data conector for Azure Blob Storage enhancement New feature or request
#257 by benjats07 was closed Mar 10, 2023
Create a new data connector for Obsidian enhancement New feature or request
#261 by ajjimeno was closed Dec 18, 2023
Create a data connector for Sharepoint enhancement New feature or request
#258 by ajjimeno was closed Nov 1, 2023
Create a data connector for JIRA enhancement New feature or request
#263 by LaverdeS was closed Sep 6, 2023
6 tasks
ModuleNotFoundError: No module named 'unstructured.documents.pdf' bug Something isn't working documentation Improvements or additions to documentation
#267 by iliasmansouri was closed Feb 27, 2023
Create a data connector for Substack enhancement New feature or request
#271 by cragwolfe was closed Dec 18, 2023
ProTip! no:milestone will show everything without a milestone.