devmunoz/travel-flights-scraping

eDreams Web Scraping 🛫🤑

Last update: 12/2024

Web Scraping and Data Collection Project

Disclaimer

The script may stop working if eDreams changes its site. Web scraping code needs constant upkeep to keep pace with the target page, and this project will not be maintained. The source code is provided for educational purposes only.

Description

This repository is part of an original project carried out by José Dos Reis (josedosr), Pamela Colman (pamve), and me.

If you want more information, please check the original project here.

The script searches eDreams for the given dates and source airports (IATA codes) and collects the results. The collected data can then be processed with pandas or similar libraries.
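As a rough illustration of the pandas step, here is a minimal sketch that picks the cheapest flight per source airport. The record fields (`source`, `destination`, `departure`, `price_eur`) and the sample values are hypothetical; the actual output schema of the scraper is not documented here, so adjust the column names to whatever the script produces.

```python
import pandas as pd

# Hypothetical records: field names and values are made up for illustration
# and are NOT the scraper's documented output format.
records = [
    {"source": "MAD", "destination": "LIS", "departure": "2024-12-06", "price_eur": 54.0},
    {"source": "MAD", "destination": "PAR", "departure": "2024-12-06", "price_eur": 89.5},
    {"source": "BCN", "destination": "ROM", "departure": "2024-12-06", "price_eur": 41.2},
    {"source": "BCN", "destination": "LIS", "departure": "2024-12-06", "price_eur": 63.0},
    {"source": "VLC", "destination": "PAR", "departure": "2024-12-06", "price_eur": 72.9},
]


def cheapest_per_source(rows):
    """Return one row per source airport: the flight with the lowest price."""
    df = pd.DataFrame(rows)
    # Parse the departure column so it can be filtered or sorted as a date later.
    df["departure"] = pd.to_datetime(df["departure"])
    # idxmin gives the row label of the minimum price within each source group.
    idx = df.groupby("source")["price_eur"].idxmin()
    return df.loc[idx].sort_values("price_eur").reset_index(drop=True)


cheapest = cheapest_per_source(records)
print(cheapest)
```

The same pattern works for other groupings, for example the cheapest flight per destination or per departure date.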

Demo Video

See the demo video

Installation and Execution

  • Prerequisites:

  • Perform the scraping:

    • Create the virtual environment:

      python -m venv .venv
      
    • Activate the virtual environment:

      source .venv/bin/activate
      
    • Install the requirements:

      pip install -r requirements.txt
      
    • Usage 😄:

      usage: scraper_edreams.py [-h] --dates DATES --sources SOURCES
      
      eDreams flights scraping script
      
      options:
      -h, --help         show this help message and exit
      --dates DATES      Input dates dict (JSON). Example: '[{"from": "2024-12-06", "to": "2025-01-10"}]'
      --sources SOURCES  Input sources list, IATA codes (JSON). Example: '["MAD","VLC","BCN"]'
      
    • Extra script to retrieve IATA codes (thanks to ip2location-iata-icao project) 😄:

      python get_iata_codes.py -> returns the complete list of IATA codes per airport.
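Both `--dates` and `--sources` take JSON strings, so they have to be decoded and checked before any search can run. The sketch below shows one way this could be done with `argparse`. It is not the code in `scraper_edreams.py`: the helper names `parse_dates`, `parse_sources`, and `build_parser` are hypothetical, and the validation rules (ISO dates, three uppercase letters per IATA code) are assumptions based on the examples in the help text.

```python
import argparse
import json
import re
from datetime import date

# Assumption: an IATA airport code is exactly three uppercase letters.
IATA_RE = re.compile(r"^[A-Z]{3}$")


def parse_dates(raw):
    """Decode the --dates JSON into a list of (from, to) date pairs."""
    try:
        ranges = [
            (date.fromisoformat(item["from"]), date.fromisoformat(item["to"]))
            for item in json.loads(raw)
        ]
    except (KeyError, TypeError, ValueError) as exc:
        # argparse turns ArgumentTypeError into a clean usage error.
        raise argparse.ArgumentTypeError(f"invalid dates JSON: {exc}")
    for start, end in ranges:
        if end < start:
            raise argparse.ArgumentTypeError(f"'to' ({end}) is before 'from' ({start})")
    return ranges


def parse_sources(raw):
    """Decode the --sources JSON into a list of IATA codes."""
    try:
        codes = json.loads(raw)
    except ValueError as exc:
        raise argparse.ArgumentTypeError(f"invalid sources JSON: {exc}")
    if not isinstance(codes, list) or not all(
        isinstance(code, str) and IATA_RE.match(code) for code in codes
    ):
        raise argparse.ArgumentTypeError("expected a JSON list of 3-letter IATA codes")
    return codes


def build_parser():
    """Build a parser that mirrors the two required options from the usage text."""
    parser = argparse.ArgumentParser(
        prog="scraper_edreams.py", description="eDreams flights scraping script"
    )
    parser.add_argument("--dates", required=True, type=parse_dates,
                        help="Input dates dict (JSON)")
    parser.add_argument("--sources", required=True, type=parse_sources,
                        help="Input sources list, IATA codes (JSON)")
    return parser


# Parse the example values given in the help text.
args = build_parser().parse_args([
    "--dates", '[{"from": "2024-12-06", "to": "2025-01-10"}]',
    "--sources", '["MAD","VLC","BCN"]',
])
print(args.dates, args.sources)
```

On the command line, wrap each JSON value in single quotes as the help text does, so the shell passes the inner double quotes through unchanged.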
      

Contribution

Feel free to improve or update the code.

License

This project is licensed under the MIT License. See the LICENSE file for more details.
