The following instructions are for setting up a version of Amundsen using Docker.
- Install
dockeranddocker-compose. - Clone this repo and its submodules by running:
$ git clone --recursive git@github.com:lyft/amundsen.git
- Enter the cloned directory and run:
$ docker-compose -f docker-amundsen.yml up
- Ingest dummy data into Neo4j by doing the following:
- Change directory to the amundsendatabuilder submodule.
- Run the following commands in the
amundsendatabuilderupstream directory:
$ python3 -m venv venv $ source venv/bin/activate $ pip3 install -r requirements.txt $ python3 setup.py install $ python3 example/scripts/sample_data_loader.py - View UI at
http://localhost:5000and try to searchtest, it should return some result.
- You can verify dummy data has been ingested into Neo4j by by visiting
http://localhost:7474/browser/and runMATCH (n:Table) RETURN n LIMIT 25in the query box. You should see two tables:hive.test_schema.test_table1dynamo.test_schema.test_table2
- You can verify the data has been loaded into the metadataservice by visiting:
- If the docker container doesn't have enough heap memory for Elastic Search,
es_amundsenwill fail duringdocker-compose.- docker-compose error:
es_amundsen | [1]: max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144] - Increase the heap memory detailed instructions here
- Edit
/etc/sysctl.conf - Make entry
vm.max_map_count=262144. Save and exit. - Reload settings
$ sysctl -p - Restart
docker-compose
- Edit
- docker-compose error: