Skip to content

soudaburger/scrape_website

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

How to Scrape a website with Node.js?

After publishing this Gist; contacting some friends, they said that it would be nice to have public access, with LICENSE and a more detailed README on how to actually scrape a website, thus, here is the repo.

Gaining access to this software and installing packages

First of all, fork this repo and clone it in you local machine.

As I utilized yarn, you have to navigate to this folder and type in the terminal yarn install, witch take some time because it is installing chromium -if you don't already have it.

After the install process, go to index.js and change the urls array with the site(s) that you want to clone; then, change the "output" string with the desire folder name that you want the HTML, CSS and JS files to go.

Finally, type yarn install in the terminal to run the program.

That's it, all the process is done in the backend; just wait a little time.

LICENSE

The software is licensed under MIT

About

How to scrape and clone HTML, CSS, and JS files from a website with node js.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • JavaScript 100.0%