This tool helps you find the largest number in a PDF document. It could also apply the appropriate multipliers such as "in millions," "in billions," or "in thousands" to the numbers found in charts or tables. The tool is designed to process documents that contain financial summaries, technical tables, and other large-number data.
- Automatically applies multipliers based on the context (e.g., "in millions").
- Handles PDF documents with tables or charts.
- Resets multipliers when encountering sentences or textual boundaries.
- Provides the largest number and the page it was found on.
To install and run the project, follow the steps below.
Clone the repository using Git:
git clone git@github.com:morgzmatty/ConductorAITakeHome.git
cd ConductorAITakeHomeInstall the required dependencies using pip:
pip install .Once installed, you can use the tool by running the following command:
largest-number path/to/your/document.pdfReplace path/to/your/document.pdf with the actual path to the PDF file you’d like to analyze.
To find the largest number when taking natural language guidance from the document into consideration,
you can run the command with the --include-bonus flag:
largest-number path/to/your/document.pdf --include-bonusIf you want to modify or contribute to the project, follow these steps:
To install the project in a way that allows you to edit it while testing, run:
pip install -e .This allows you to make changes to the code and immediately see the effects without reinstalling the package.
You can also run the script directly by executing:
python -m my_project.main path/to/your/document.pdf