Link to the downloaded books #119

kangli-bionic · 2020-05-26T06:11:21Z

The script is not easy to run to completion if downloading all books. Finally was able to finish downloading on my fifth attempt. So to save people some time I'm sharing a link to the downloaded books.

Currently missing the last 13 titles on the excel file.

https://drive.google.com/drive/folders/1JC15m__PbPaowQ7k2zS1-Us72yvROCQs?usp=sharing

valahna · 2020-06-24T07:19:22Z

For those who want the recently released 1000 books for summer and to learn how I approached downloading any set that is freely available:

I grabbed the CSV report from the search page that contains the links and some meta data. I parsed the URLs and DOIs into direct download links, and then created two text files: one for the pdfs and one for the epubs version. I then imported these files into downthemall (download manager extension), which then proceeded to download all one thousand of these books. Not all have an epub version, so some will fail in that regard. I kept the simultaneous download limit to 8-10 at a time, and the it worked fine, is this due to the extension acting as a complete browser and handling the cookies, headers, and all that for you, or because it was limited the amount downloaded at a time to prevent being flagged as a bot/script, I don't know. Further testing and data would need to be done to determine this.

Then I wrote a script to parse the csv and update the PDFs with the meta data using exiftool, and renamed the files to something besides the DOI. I compressed them into three files, one with the PDFs and two with the epubs. You can find the csv I used and the archives with all the books here: Mega Hosted

To chaosAD's point, my approach is certainly a more "cat and mouse" approach, and not as elegant and refined as a script that handles all of this for you; however, I think it is a little impractical for someone to visit each books' springer page and click on two donwload buttons for all one thousand of these titles.

CyclopeanBee · 2020-06-29T01:31:55Z

The script is not easy to run to completion if downloading all books. Finally was able to finish downloading on my fifth attempt. So to save people some time I'm sharing a link to the downloaded books.

Currently missing the last 13 titles on the excel file.

https://drive.google.com/drive/folders/1JC15m__PbPaowQ7k2zS1-Us72yvROCQs?usp=sharing

I have 11 of the missing books! The final two didn't have download links any more when I checked.

AntoineSoetewey · 2020-10-29T19:14:39Z

Hello @kangli-bionic,

Can you confirm that your google drive link and the books will remain accessible as long as possible?

I would like to include your link at the top of this article, so I would like to make sure books are not removed soon.

Thanks again for having downloaded the books!

Regards,
Antoine

valahna mentioned this issue Jun 24, 2020

[Feature Request] Springer's 1000 open-access books #118

Open

AntoineSoetewey mentioned this issue Oct 29, 2020

Somebody please mirror and make a torrent #117

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Link to the downloaded books #119

Link to the downloaded books #119

kangli-bionic commented May 26, 2020

valahna commented Jun 24, 2020 •

edited

Loading

CyclopeanBee commented Jun 29, 2020

AntoineSoetewey commented Oct 29, 2020

Link to the downloaded books #119

Link to the downloaded books #119

Comments

kangli-bionic commented May 26, 2020

valahna commented Jun 24, 2020 • edited Loading

CyclopeanBee commented Jun 29, 2020

AntoineSoetewey commented Oct 29, 2020

valahna commented Jun 24, 2020 •

edited

Loading