Skip to content

mzhang0/fcrawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

fcrawler

fcrawler is a focused web crawling library for Node.js.

Installation

git clone https://github.com/mzhang0/fcrawler.git
cd fcrawler && npm install

Usage

var FocusedCrawler = require('fcrawler');
var fc = new FocusedCrawler();
var searchTerms = [
	'javascript','code','programming',
	'language','js','ecmascript','web'
];
var seedLinks = ['https://developer.mozilla.org/en-US/docs/Web/JavaScript'];
fc.crawl({
searchTerms: searchTerms,
	seedLinks: seedLinks,
	onCrawl: function(link, response, body, terms, priority) {
		console.log(link);
	},
	onComplete: function() {
		console.log("Focused crawl completed!");
	}
});

About

Focused web crawling library for Node.js

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors