This script provides an interface for searching academic papers across multiple sources, including IEEE Xplore, arXiv, PubMed, CrossRef, and more. It uses a sequential combined search strategy: the configured sources are queried one by one, in the order given, until the requested number of paper abstracts has been found.
For more information, read: https://medium.com/@zakaria.hamane1/how-to-automate-academic-paper-searches-with-python-part-i-65bd1c837f4c
- Clone the repository to your local machine.
- Install the required Python packages using the requirements.txt file:
pip install -r requirements.txt
- Set up your environment variables by copying the .env template to your project root and filling in your API keys and email for PubMed.
cp env_files/local.env .env
Edit .env with your actual credentials.
Initialize the SearchHelper with the desired search strategies in a preferred order. Then, execute a search with a query and the number of abstracts you wish to retrieve.
Example:
from search_helper import SearchHelper
helper = SearchHelper()
helper.set_sequential_combined_strategy(["ieee", "arxiv"])
papers = helper.execute("Artificial Intelligence", 10)
for paper in papers:
    print(paper)
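To make the sequential combined strategy more concrete, here is a minimal sketch of the idea, not the actual SearchHelper implementation. The function `sequential_combined_search`, the `SearchFn` type, and the two stand-in sources are hypothetical names used only for illustration; skipping a source that raises an error is likewise an assumption.

```python
from typing import Callable, Dict, List

# Hypothetical signature: a source takes (query, limit) and returns abstracts.
SearchFn = Callable[[str, int], List[str]]


def sequential_combined_search(
    sources: Dict[str, SearchFn],
    order: List[str],
    query: str,
    num_abstracts: int,
) -> List[str]:
    """Query sources in the given order until enough abstracts are collected."""
    results: List[str] = []
    for name in order:
        remaining = num_abstracts - len(results)
        if remaining <= 0:
            break  # already have everything that was requested
        try:
            found = sources[name](query, remaining)
        except Exception:
            # Assumed behaviour: a failing source is skipped, not fatal.
            continue
        # Never keep more than what is still needed.
        results.extend(found[:remaining])
    return results


# Stand-in sources (no network access), for demonstration only.
def fake_ieee(query: str, limit: int) -> List[str]:
    return [f"ieee: {query} #{i}" for i in range(min(limit, 3))]


def fake_arxiv(query: str, limit: int) -> List[str]:
    return [f"arxiv: {query} #{i}" for i in range(limit)]


papers = sequential_combined_search(
    {"ieee": fake_ieee, "arxiv": fake_arxiv},
    ["ieee", "arxiv"],
    "Artificial Intelligence",
    5,
)
for paper in papers:
    print(paper)
```

In this sketch the first source can only supply three abstracts, so the second source is asked for the remaining two. Placing the most relevant or most reliable source first in the order therefore determines where most results come from.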
The .env file defines the following variables:
- S2_API_KEY_1: Your Semantic Scholar API key.
- CORE_API_KEY: Your CORE API key.
- SPRINGER_API_KEY: Your Springer API key.
- SS_EMAIL: Your email for Entrez PubMed API.
- SCOPUS_API_KEY: Your Scopus API key.
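Before running a search, it can help to check which of the variables above are set. The following is a minimal sketch using only the standard library. The helper `missing_env_vars` is a hypothetical name and not part of this project, and it assumes the values from .env have already been loaded into the process environment (for example by a package such as python-dotenv). Treating all five variables as required is also an assumption; you may only need the keys for the sources you actually use.

```python
import os
from typing import List, Mapping

# Variable names taken from the list above.
REQUIRED_VARS = [
    "S2_API_KEY_1",
    "CORE_API_KEY",
    "SPRINGER_API_KEY",
    "SS_EMAIL",
    "SCOPUS_API_KEY",
]


def missing_env_vars(env: Mapping[str, str] = os.environ) -> List[str]:
    """Return the names of variables that are unset or blank in `env`."""
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]


missing = missing_env_vars()
if missing:
    print("Missing or empty variables:", ", ".join(missing))
else:
    print("All expected variables are set.")
```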