Skip to content

commoncrawl/cc-quick-scripts

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

41 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Common Crawl Logo

Common Crawl Quick Scripts

This repository contains a number of useful scripts for attacking the CommonCrawl dataset and WARC/WET/WAT files.

License

MIT License, as per LICENSE

About

Scripts to verify Common Crawl segments and WARC/WET/WAT files

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 90.3%
  • Shell 9.7%