An automated, AI-driven daily paper tracking and intelligent filtering system tailored for the AI for Science (Bio + Chem) domain.
This project utilizes a stateful workflow built with LangGraph to automatically fetch, score, and summarize the latest research from top sources, ensuring you never miss a critical breakthrough in geometric deep learning, AIDD, single-cell omics, and protein structure.
- Multi-source Fetching: Automatically parses the latest daily papers from arXiv, bioRxiv, and select Nature journals (e.g., Nature Biotechnology, Nature Machine Intelligence).
- Intelligent LLM Filtering: Powered by DeepSeek-R1, the system evaluates and scores papers on a 1-10 scale based on relevance to predefined core subjects (e.g., GNNs, Flow Matching, Perturb-seq, Molecular Glue). Only high-quality papers (score >= 7) pass the filter.
- Automated Summarization: For papers that pass the threshold, the agent generates multidimensional summaries covering the core contributions, technical routes, and innovations.
- Markdown Reports: Automatically compiles an easy-to-read daily Markdown report into the
reports/directory.
The pipeline is implemented as a LangGraph StateGraph to guarantee structured, predictable, and fully automated processing:
- Fetch Node: Retrieves feed data.
- Filter Node: Runs batch inferences to score abstract relevance.
- Summarize Node: Provides in-depth analysis on valid papers.
- Report Node: Generates the markdown output and saves it locally.
We recommend using pixi for dependency management:
pixi installAlternatively, use pip:
pip install -r requirements.txt- Copy the environment configuration template:
cp .env.example .env- Fill in the required API keys (e.g., your SiliconCloud/DeepSeek API key) and email configurations in your newly created
.envfile.
Start the agent pipeline by simply running:
python main.py(Ensure that your main.py entrypoint executes the LangGraph workflow.)
- Langchain & LangGraph: Workflow management and orchestration
- DeepSeek-R1: Core reasoning and selection engine
- Python-dotenv: Environment and secrets management
- Feedparser: RSS and feed processing
Contributions, issues, and feature requests are welcome!
This project is licensed under the MIT License.