stylesheet issue - 404 Not Found #281
Comments
This issue can be easily reproduced by downloading a book, e.g. 9781098122553.
Download on Oct 15, 2021: the C__ Crash Course (9781098122553)/OEBPS/Styles output directory contains 2 .css files, but the second one (Style01.css) is invalid; its content shows 404 Not Found.
Compared with a previous download of the same book on Nov 4, 2020: the C__ Crash Course (9781098122553)/OEBPS/Styles output directory contained 2 .css files, both valid. Style01.css contained valid content. Also, a diff of Style00.css shows it is identical to the one in the recent download.
The issue appears to affect other book downloads as well. Is any other information needed? Thanks.
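To check whether other downloads are affected, a small script along these lines could scan a book's output directory. This is only a sketch: the function name `find_broken_stylesheets` is made up for illustration, and it assumes a bad stylesheet contains the literal text "404 Not Found", as described above.

```python
from pathlib import Path
from typing import List


def find_broken_stylesheets(styles_dir: Path, marker: str = "404 Not Found") -> List[Path]:
    """Return the .css files in styles_dir whose content contains the marker text.

    Assumption: a failed download leaves the server's error page in the file,
    so the marker string appears somewhere in it.
    """
    broken = []
    for css in sorted(styles_dir.glob("*.css")):
        # errors="replace" keeps the scan going even if a file is not valid UTF-8
        text = css.read_text(encoding="utf-8", errors="replace")
        if marker in text:
            broken.append(css)
    return broken
```

For example, `find_broken_stylesheets(Path("C__ Crash Course (9781098122553)/OEBPS/Styles"))` would list the affected files for the book mentioned above, if the directory layout matches.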
I can confirm that the API lists only 1 CSS file, but this program attempts to download another CSS file, and one of the requests for CSS hits a 404. Below you can find the API that I assume is used to fetch the CSS file.
Chapters:
Files:
{
"ourn": "urn:orm:book:9781098122553:asset:styles%2f9781593278892.css",
"url": "https://learning.oreilly.com/api/v2/epubs/urn:orm:book:9781098122553/files/styles/9781593278892.css",
"full_path": "styles/9781593278892.css",
"filename": "9781593278892.css",
"filename_ext": ".css",
"media_type": "text/css",
"has_mathml": false,
"kind": "stylesheet",
"created_time": "2020-10-27T13:21:31.244025Z",
"last_modified_time": "2021-02-11T00:14:31.128271Z",
"virtual_pages": null,
"file_size": 14342,
"epub_archive": "https://learning.oreilly.com/api/v2/epubs/urn:orm:book:9781098122553/"
}
Log File
Running the program with
The Problem
Crawler is fetching the following CSS file that doesn't exist
Possible Solution
Don't parse a response that has an HTTP status code of 404.
My Environment:
I'm running the latest commit (e016ad3) as of this post.
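The possible solution above could look roughly like the following. This is a minimal sketch, not the project's actual code: `save_stylesheet` is a hypothetical helper, and it assumes the caller already has the HTTP status code and the response body in hand.

```python
from pathlib import Path


def save_stylesheet(status_code: int, body: bytes, dest: Path) -> bool:
    """Write body to dest only if the server answered 200 OK.

    Returns True when the file was written, False when the response was
    skipped (for example a 404), so no error page ends up saved as CSS.
    """
    if status_code != 200:
        return False
    dest.parent.mkdir(parents=True, exist_ok=True)
    dest.write_bytes(body)
    return True
```

With a guard like this, a 404 response would simply be skipped rather than written to OEBPS/Styles. The caller could also log the skipped URL so the missing stylesheet is visible in the log file.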
For some downloads, when viewing a downloaded OEBPS/*.xhtml page in a browser, the rendered page does not seem to be styled correctly (quite different from what is shown online, which offers better readability).
Looking at OEBPS/Styles/, I found that some downloaded stylesheet files (OEBPS/Styles/Style*.css) have the following content:
Could this be an error in the download process, or something else?
Is there anything that can be done to fix it?
Thanks!