Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

A question about data access #6

Open
alexey-milovidov opened this issue Jun 4, 2023 · 5 comments
Open

A question about data access #6

alexey-milovidov opened this issue Jun 4, 2023 · 5 comments

Comments

@alexey-milovidov
Copy link

I'm trying to download the data, but the command

$ aws s3 sync s3://gpt4all-datalake ./datalake_dump

returns an error:

fatal error: An error occurred (AccessDenied) when calling the ListObjectsV2 operation: Access Denied
@AndriyMulyar
Copy link
Contributor

Working on this, S3 misconfigured. If you want a dump ask in the discord and will send it to you.

The latest data dump is located at.
https://atlas.nomic.ai/map/gpt4all-datalake

@typoworx-de
Copy link

How large is the data-dump at all? @AndriyMulyar?
Can i somehow also download from here: https://atlas.nomic.ai/map/gpt4all-datalake?
Didn't find anything to download the dump or is it only accessible from there without full-dump download?

@xnought
Copy link

xnought commented Jul 5, 2023

Are there any plans to make the download process easier? (I still get the error the original poster had)

@AndriyMulyar
Copy link
Contributor

@xnought The bucket is configured for public export now.

Will be updating with an easier method for download soon.

@secretyjc
Copy link

any update?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants