This is the open-source repository for GreenMine, introduced in the paper titled: Visual Text Mining with Progressive Taxonomy Construction for Environmental Studies
Since reproducing the system from scratch costs API tokens for querying GPT models, you can start directly from Step 3 with the pre-generated data.
Disclaimer: For privacy reasons, we can not release the original data as shown in the paper. Therefore, the data available here is synthetically generated by gpt-4o-mini, and are only provided to demonstrate the visual components of the system, rather than for practical usage.
1. Generating Synthetic Data (Optional)
2. Preprocessing (Optional)
- Request for an OpenAI's api key, and then create a file called
openai_api_key(no file extension needed) under the directoryserver/. Copy and paste your api key into the file. - Under the directory
server/, execute the following command:
pip install -r requirements.txt
flask run- The backend should be launched at
localhost:5000. You can check by accessinglocalhost:5000/test/in your browser, you should seeHello GreenMineshowing on the web page.
- Install
Node.js - Under the repository root directory, execute the following command:
npm i
npm run dev- The front-end should be launched at
localhost:5173. You can access it in your browser.

