Pdf2Notes📝

Turn PDF into Notes in seconds

If you find Pdf2Notes userful, please consider to donate and support the project:

Pdf2Notes is a simple AI-powered open-source chatbot that helps you speed up your learning by turning your PDF documents into notes, in a matter of seconds. It's powered by LlamaIndex, Groq, Gradio, FastAPI and Postgres

Install and launch🚀

The first step, common to both the Docker and the source code setup approaches, is to clone the repository and access it:

git clone https://github.com/AstraBert/pdf2notes.git
cd pdf2notes

Once there, you can choose one of the two following approaches:

Docker (recommended)🐋

Required: Docker and docker compose

Add the groq_api_key and the llamacloud_api_key variables in the .env.example file and modify the name of the file to scripts/.env. Get these keys:
- On Groq Console
- On LlamaCloud

mv .env.example .env

Launch the Docker application through the dedicated scripts:

# If you are on Linux/macOS
bash start_services.sh
# If you are on Windows
.\start_services.ps1

Or do it manually:

docker compose up postgres adminer -d
docker compose up pdf2notes -d

You will see the application running on http://localhost:6500/app and you will be able to use it. Depending on your connection and on your hardware, the set up might take some time (up to 15 mins to set up) - but this is only for the first time your run it!

Source code🗎

Required: Docker, docker compose and conda

Add the groq_api_key and the llamacloud_api_key variables in the .env.example file and modify the name of the file to scripts/.env. Get these keys:
- On Groq Console
- On LlamaCloud

mv .env.example scripts/.env

Set up Pdf2Notes using the dedicated script:

# For MacOs/Linux users
bash setup.sh
# For Windows users
.\setup.ps1

Or you can do it manually, if you prefer:

docker compose up postgres adminer -d

conda env create -f environment.yml

conda activate pdf2notes

cd scripts

uvicorn main:app --host 0.0.0.0 --port 6500

conda deactivate

You will see the application running on http://localhost:6500/app and you will be able to use it.

How it works

Database services

Postgres manages the chat-based memory that the application can update and access, containing all the chat history
Adminer is a database management and control system, that lets you check your Postgres databases

Workflow

The workflow is split into two parts:

First, you upload a PDF document
The document is processed by LlamaParse with Gemini 2.0 as a multimodal parsing model
The extracted text is returned to Llama-3.3-70B, provisioned through Groq, which produces notes about the document

In the second part, you can modify the nodes by interacting with the chatbot:

You message will be passed to the chatbot, along will retrieve the last 10 messages from the memory
The LLM will reply based on the memory-enhanced context

Contributing

Contributions are always welcome! Follow the contributions guidelines reported here.

License and rights of usage

The software is provided under MIT license.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
docker		docker
scripts		scripts
shell		shell
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
compose.yaml		compose.yaml
environment.yml		environment.yml
setup.ps1		setup.ps1
setup.sh		setup.sh
start_services.bash		start_services.bash
start_services.ps1		start_services.ps1
workflow.png		workflow.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Pdf2Notes📝

Turn PDF into Notes in seconds

If you find Pdf2Notes userful, please consider to donate and support the project:

Install and launch🚀

Docker (recommended)🐋

Source code🗎

How it works

Database services

Workflow

Contributing

License and rights of usage

About

Releases

Packages

Languages

License

AstraBert/pdf2notes

Folders and files

Latest commit

History

Repository files navigation

Pdf2Notes📝

Turn PDF into Notes in seconds

If you find Pdf2Notes userful, please consider to donate and support the project:

Install and launch🚀

Docker (recommended)🐋

Source code🗎

How it works

Database services

Workflow

Contributing

License and rights of usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages