Error: probably not a valid book #113

Open
michalastocki opened this issue May 15, 2020 · 7 comments

Comments

@michalastocki

Hello, do you have any idea why it's not downloading any of the books? I have all the dependencies and the Python code seems to be running, but it doesn't download anything right now and skips every book.

Does this mean that Springer has withdrawn all the books from their site?

Thanks!

[image]

@wallacefsilva

I'm having the same issue. It seems they implemented a security check via reCAPTCHA...

@renanxcortes

renanxcortes commented May 16, 2020

Yep, it seems I'm facing the same type of issue in the R version of this package (renanxcortes/springerQuarantineBooksR#53). Any clue on how to solve it?

@lgabs

lgabs commented May 24, 2020

I'm getting the same errors:

[image]

@renanxcortes

It seems like Springer tweaked the reCAPTCHA step, and the recent workaround is no longer enough to avoid getting flawed files :(
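To see which already-downloaded files are flawed, one option is to check each file's header. The snippet below is only a minimal sketch: it assumes a flawed file is something other than a PDF (for example an HTML challenge page) saved under a `.pdf` name, which this thread does not confirm. The helper names `looks_like_pdf` and `find_flawed_files` are hypothetical and are not part of this project.

```python
from pathlib import Path
from typing import List

# Every well-formed PDF starts with this signature.
PDF_MAGIC = b"%PDF-"


def looks_like_pdf(data: bytes) -> bool:
    """Return True if the payload begins with the PDF header signature."""
    return data.startswith(PDF_MAGIC)


def find_flawed_files(folder: str) -> List[Path]:
    """List *.pdf files in `folder` whose first bytes are not a PDF header.

    Assumption: a flawed download is a non-PDF payload (e.g. HTML)
    stored with a .pdf extension.
    """
    flawed = []
    for path in sorted(Path(folder).glob("*.pdf")):
        with path.open("rb") as fh:
            if not looks_like_pdf(fh.read(len(PDF_MAGIC))):
                flawed.append(path)
    return flawed
```

Calling `find_flawed_files("downloads")` on a hypothetical `downloads` folder returns the paths worth re-fetching by hand. It only inspects the header, so a truncated but otherwise genuine PDF would still pass.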

@chaosAD
Contributor

chaosAD commented May 25, 2020

I have a rough working prototype that gets around the latest hurdle. My code also manipulates the HTTP cookies to get past it, but in a more elaborate way. I see that Springer also sets other cookies to track downloading users, which makes it easier for their monitoring software to analyze traffic and flag a downloading bot. Obviously, they do not allow downloading with bots, and I feel we are playing cat and mouse with Springer. For that reason, I am not going to pursue this.

@SanJJ1

SanJJ1 commented May 28, 2020

So does this mean that this project is dead?

@pbl987

pbl987 commented Jul 14, 2020

> I have a rough working prototype that gets around the latest hurdle. My code also manipulates the HTTP cookies to get past it, but in a more elaborate way. I see that Springer also sets other cookies to track downloading users, which makes it easier for their monitoring software to analyze traffic and flag a downloading bot. Obviously, they do not allow downloading with bots, and I feel we are playing cat and mouse with Springer. For that reason, I am not going to pursue this.

There is no need to play cat and mouse: JDownloader does that!
It can solve weak captchas automatically, and Google's semi-automatically.

I want to save at least the COVID package, as it won't be available in two weeks!
Currently 389 books are available, so entering a captcha every 5 downloads or so would be doable.

Could you please reconsider helping the project?

7 participants