Insightify is a PDF analysis tool that leverages OpenAI's API to extract and summarize key insights from PDF documents. It helps users to easily analyze company reports and financial documents by summarizing critical information like future growth prospects, key business changes, and factors influencing next year’s earnings.
- Extracts text from PDF files using the
fitzlibrary (PyMuPDF). - Uses OpenAI to analyze and summarize key insights from the extracted text.
- Identifies key elements such as:
- Future growth prospects
- Key business changes
- Triggers for growth or decline
- Material impacts on future earnings and growth
-
Install
pipenvif you don't have it:pip install pipenv
-
Activate the virtual environment:
pipenv shell
-
Install required libraries from the
Pipfile:pipenv install
-
Ensure that the
.envfile contains your OpenAI API key.
-
Make sure the environment is activated (step 2).
-
Run the script:
python pdf-analyzer.py
-
Optional: For Streamlit-based app:
streamlit run app.py
-
The script will extract text from the specified PDF and send it to OpenAI for analysis. It will output the key findings from the document.
- .env: Contains environment variables (e.g., OpenAI API key).
- app.py: Streamlit based version of the app.
- pdf-analyzer.py: Main Python script to run the PDF analysis.
- Pipfile: Specifies the dependencies for the project.
- Pipfile.lock: Ensures consistent dependency versions.
- Sample PDF.pdf: A sample file for testing the summarization.