This project extracts structured event data from the 10times Top 100 category pages. It automates the collection of rankings, event details, dates, locations, and other essential metadata. The scraper is built for accurate, stable runs and is suited to event research workflows, including SEO-oriented research.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
The 10times Scraper gathers comprehensive event information from dozens of industries listed in the 10times Top 100 rankings. It solves the challenge of manually collecting event data across categories by offering a fully automated extraction tool. It is ideal for researchers, analysts, event aggregators, travel businesses, and market intelligence teams.
- Supports 30+ industry categories from 10times Top 100 rankings
- Automatically extracts detailed event metadata
- Handles blocking and anti-bot measures with session rotation
- Outputs fully structured JSON for easy data processing
- Works with both default and custom category URLs
| Feature | Description |
|---|---|
| Multi-Category Support | Scrapes events across more than 30 major industry categories. |
| Comprehensive Event Details | Captures rank, name, location, category, rating, and timing info. |
| Proxy & Session Management | Uses session rotation and recommended proxy settings for higher success rates. |
| Error Handling | Retries failed requests and gracefully manages redirects or blocked pages. |
| Configurable Inputs | Supports custom URLs, proxy settings, timeouts, and retry limits. |
| Structured JSON Output | Provides clean, ready-to-use JSON data for each scraped event. |
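The retry and session-rotation behaviour listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: `ScraperInput`, its field names, `BlockedError`, and `fetch_with_retries` are all hypothetical, and the `fetch` callable stands in for whatever HTTP or browser client the scraper really uses.

```python
import time
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ScraperInput:
    # Hypothetical input schema; the real field names may differ.
    start_urls: List[str]
    proxy_url: Optional[str] = None
    timeout_secs: float = 30.0
    max_retries: int = 3


class BlockedError(Exception):
    """Raised by the fetch callable when a page looks blocked or redirected."""


def fetch_with_retries(
    url: str,
    fetch: Callable[[str, str, float], str],
    cfg: ScraperInput,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch(url, session_id, timeout), using a new session id per attempt."""
    last_error: Optional[Exception] = None
    for attempt in range(cfg.max_retries + 1):
        session_id = f"session-{attempt}"  # rotate: never reuse a blocked session
        try:
            return fetch(url, session_id, cfg.timeout_secs)
        except BlockedError as exc:
            last_error = exc
            sleep(min(2 ** attempt, 30))  # exponential backoff, capped at 30 s
    raise RuntimeError(f"giving up on {url}") from last_error
```

Passing `sleep` as a parameter keeps the backoff easy to disable in tests; a real run would leave it at the default.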
| Field Name | Field Description |
|---|---|
| rank | Event ranking on the Top 100 page. |
| eventLink | Direct URL to the event page. |
| eventName | Official event title. |
| eventDates | Raw date text displayed on the page. |
| dateStart | Parsed start timestamp in compact ISO 8601 basic form (e.g. `20250210T000000Z`). |
| dateEnd | Parsed end timestamp in compact ISO 8601 basic form (e.g. `20250212T235959Z`). |
| location | City or venue where the event takes place. |
| category | Industry category of the event. |
| rating | User rating score. |
| extractedAt | Timestamp of data extraction. |
| sourceUrl | The specific Top 100 category URL scraped. |
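To illustrate how `eventDates` could map to `dateStart` and `dateEnd`, here is a minimal sketch. It assumes the raw text looks like `10 - 12 Feb 2025`, `28 Feb - 02 Mar 2025`, or a single day such as `10 Feb 2025`, and that the start is midnight and the end is 23:59:59. The function name `parse_event_dates` is hypothetical and is not taken from the project's `date_utils.py`.

```python
from datetime import datetime
from typing import Tuple

STAMP = "%Y%m%dT%H%M%SZ"  # compact form used by dateStart / dateEnd


def parse_event_dates(raw: str) -> Tuple[str, str]:
    """Turn raw date text into (dateStart, dateEnd) strings.

    Assumes English month abbreviations (the default C locale) and treats
    the page's dates as UTC, which is a simplification.
    """
    parts = [p.strip() for p in raw.split(" - ")]
    end = datetime.strptime(parts[-1], "%d %b %Y")
    if len(parts) == 1:
        start = end
    else:
        tokens = parts[0].split()
        day = int(tokens[0])
        # Month and year missing from the start part are taken from the end part.
        month = datetime.strptime(tokens[1], "%b").month if len(tokens) > 1 else end.month
        year = int(tokens[2]) if len(tokens) > 2 else end.year
        start = datetime(year, month, day)
        if start > end:  # range crosses New Year, e.g. "30 Dec - 02 Jan 2026"
            start = start.replace(year=year - 1)
    end = end.replace(hour=23, minute=59, second=59)
    return start.strftime(STAMP), end.strftime(STAMP)


print(parse_event_dates("10 - 12 Feb 2025"))
```

Any other date layout on the page would need its own branch; this sketch raises `ValueError` on text it does not recognise.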
[
  {
    "rank": "1",
    "eventLink": "https://10times.com/magic",
    "eventName": "MAGIC LAS VEGAS",
    "eventDates": "10 - 12 Feb 2025",
    "dateStart": "20250210T000000Z",
    "dateEnd": "20250212T235959Z",
    "location": "Las Vegas",
    "category": "Apparel & Clothing",
    "rating": "4.3",
    "extractedAt": "2025-01-15T10:30:00.000Z",
    "sourceUrl": "https://10times.com/top100/travel-tourism"
  }
]
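In the sample above every value is a string, including `rank` and `rating`. A downstream consumer may want typed values, so here is a small sketch of one way to convert them. The helper `normalize_event` is hypothetical and not part of the scraper, and the timestamp formats are inferred from the sample record only.

```python
import json
from datetime import datetime, timezone

SAMPLE = """
[{"rank": "1", "eventName": "MAGIC LAS VEGAS",
  "dateStart": "20250210T000000Z", "dateEnd": "20250212T235959Z",
  "rating": "4.3", "extractedAt": "2025-01-15T10:30:00.000Z"}]
"""


def normalize_event(record: dict) -> dict:
    """Return a copy of the record with numeric and datetime fields typed."""
    out = dict(record)
    out["rank"] = int(record["rank"])
    # A missing or empty rating becomes None instead of raising.
    out["rating"] = float(record["rating"]) if record.get("rating") else None
    for key in ("dateStart", "dateEnd"):
        parsed = datetime.strptime(record[key], "%Y%m%dT%H%M%SZ")
        out[key] = parsed.replace(tzinfo=timezone.utc)
    extracted = datetime.strptime(record["extractedAt"], "%Y-%m-%dT%H:%M:%S.%fZ")
    out["extractedAt"] = extracted.replace(tzinfo=timezone.utc)
    return out


events = [normalize_event(r) for r in json.loads(SAMPLE)]
print(events[0]["rank"], events[0]["rating"], events[0]["dateStart"].isoformat())
```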
10times/
├── src/
│   ├── runner.py
│   ├── extractors/
│   │   ├── times_parser.py
│   │   └── date_utils.py
│   ├── outputs/
│   │   └── exporter.py
│   └── config/
│       └── settings.example.json
├── data/
│   ├── inputs.sample.json
│   └── sample-output.json
├── requirements.txt
└── README.md
- Market researchers gather event intelligence to identify emerging industry trends.
- Travel and tourism companies collect event data to plan packages and seasonal promotions.
- Event directories automate enrichment of listings to maintain up-to-date catalogs.
- Analysts use structured event metadata to power dashboards and reporting tools.
- Business development teams monitor top events across industries for outreach opportunities.
Q: Can I scrape any category on the Top 100 list? Yes. Any URL from the 10times Top 100 categories is supported, and the scraper adapts automatically.
Q: What if the website blocks my requests? Using residential proxies—especially from India—significantly increases success rates. The scraper also includes retry logic and session rotation.
Q: Can I increase performance by raising concurrency? Raising concurrency is possible, but keeping it at 1 offers the best stability because of the site's strict anti-bot systems.
Q: What should I do if no data appears in the output? Check that the URL is correct, try different proxy settings, or review debug images generated during the run.
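As a first check when the output is empty, it can help to confirm that the input URL has the expected shape. The sketch below assumes Top 100 category URLs follow the pattern of the sample `sourceUrl` (`https://10times.com/top100/<category>`); that pattern is inferred from the sample, and `is_top100_category_url` is a hypothetical helper, not part of the scraper.

```python
from urllib.parse import urlparse


def is_top100_category_url(url: str) -> bool:
    """Return True if the URL looks like https://10times.com/top100/<category>."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    segments = [s for s in parsed.path.split("/") if s]
    return (
        parsed.scheme in ("http", "https")
        and host in ("10times.com", "www.10times.com")
        and len(segments) == 2
        and segments[0] == "top100"
    )


print(is_top100_category_url("https://10times.com/top100/travel-tourism"))
```

A URL that fails this check, such as an individual event page, is a likely cause of an empty result.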
Primary Metric: Average scraping speed of 1–2 seconds per event block under recommended proxy settings.
Reliability Metric: Approximately 92–96% success rate across all supported categories when using residential IPs.
Efficiency Metric: Handles 100+ events in a single run without degraded performance or session failures.
Quality Metric: Data completeness typically exceeds 98%, with accurate parsing of dates, locations, and event names.
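The completeness figure can be checked against your own runs. The sketch below uses one possible definition, which is an assumption and not necessarily how the number above was measured: the share of expected fields that are present and non-empty across all records. `completeness` is a hypothetical helper.

```python
EXPECTED_FIELDS = (
    "rank", "eventLink", "eventName", "eventDates", "dateStart", "dateEnd",
    "location", "category", "rating", "extractedAt", "sourceUrl",
)


def completeness(records: list) -> float:
    """Fraction of expected fields that are present and non-empty."""
    if not records:
        return 0.0
    filled = sum(
        1
        for record in records
        for field in EXPECTED_FIELDS
        if record.get(field) not in (None, "")
    )
    return filled / (len(records) * len(EXPECTED_FIELDS))


full = {field: "x" for field in EXPECTED_FIELDS}
partial = dict(full, rating="", location=None)  # two of eleven fields missing
print(f"{completeness([full, partial]):.1%}")
```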
