-
-
Notifications
You must be signed in to change notification settings - Fork 32
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Coding for tommorrow incomplete file #215
Comments
Clicking on a download link, you get an error message:
As you can see, this URL doesn't share the same prefix as the URL of the recipe ( |
@rgaudin I tried to change the scope to Any, page, prefix and still the resulted file is the same. |
That's exactly why some documentation is needed. All those scopes have different effects. You haven't tested I advise you try with There's no documentation on those scopes ; code is at https://github.com/webrecorder/browsertrix-crawler/blob/165a9787af8a7dce6b0acb5f91e6803ef525fd5b/util/seeds.js#L75 |
I tried changing the scopes, the host scraped the website but without the needed projects I disabled the recipe and marked the resulted file for deletion |
Now that the URL configured is https://coding-for-tomorrow.de, what did you expected by changing the scope from the default (prefix) to host?
I don't get what you expected by making this change. That being said, I analyzed a bit the issue:
All that being said, as you see there is a significant effort needed by a developer to make the scraping of this website be enhanced, and I'm not even sure it will succeed (at least there is a significant chance that stuff like the Youtube videos will not be available). @Popolechien what are your views on this, do you think this is worth the effort? |
It's in German, not a core target audience. We can drop it I think. |
The issue related is marked as upstream |
Let's keep this issue open, I doubt we will make any progress in the coming months due to lack of resources but the ZIM request is legit, I've identified a potential solution and we should fix this at some point, it is not purely impossible or an immense effort, just not a priority for now. |
The recipe of coding for tomorrow has been successful but the file in dev library is incomplete, all internal links are not clickable.
https://farm.openzim.org/recipes/codingfortomorrow_de_all
https://dev.library.kiwix.org/viewer#codingfortomorrow_de_all_2023-08/A/coding-for-tomorrow.de/downloads/
Can you check please?
The text was updated successfully, but these errors were encountered: