A React-based web application for extracting tabular data from PDF documents with a user-friendly interface for defining and configuring table columns.
- 📄 PDF file upload and viewing
- 🔍 Interactive area selection
- 📏 Precise coordinate-based selection
- 📊 Column management system
- 💾 Configuration save/load functionality
- 📤 CSV export capability
- 📱 Responsive split-panel layout
- Framework: React 18
- Package Manager: pnpm
- (npm or other package manager may be used instead but only the pnpm lock file is maintained)
- Build Tool: Vite
- Language: TypeScript
- PDF Processing: react-pdf
- State Management: Zustand
- UI Components:
- Radix UI primitives
- Tailwind CSS
- Lucide React icons
- Development Tools:
- ESLint
- Node.js (v16 or higher)
- pnpm
- Clone the repository:
git clone https://github.com/yourusername/pdf-table-extractor.git
cd pdf-table-extractor- Install dependencies:
pnpm install- Start the development server:
pnpm run dev- Open your browser and navigate to
http://localhost:5173
- Upload a PDF file by dragging and dropping or using the file selector
- Use the selection tools to define table areas
- Configure columns using the side panel
- Export the extracted data as CSV