Extract any information from websites or analyze any data using AI. This Apify Actor combines traditional web scraping with AI capabilities to extract contact information, generate summaries, analyze data, or create custom structured outputs from any website or raw data.
1. Basic Contact Extraction (No Configuration Needed)
Click "Start" with default settings to extract emails, phone numbers, and social links from any website. No AI knowledge required.
```json
{
  "startUrls": [{"url": "https://example.com/contact"}],
  "useAI": false
}
```
2. AI-Powered Extraction (Tell It What You Want)
Enable AI and describe what you need in plain English. The AI understands natural language.
```json
{
  "startUrls": [{"url": "https://apify.com/about"}],
  "useAI": true,
  "aiInstructions": "Extract company name, CEO, and main contact email"
}
```
3. Data Analysis Mode (No Website Needed)
Analyze your own data directly without scraping. Perfect for transforming CSV, JSON, or any text data.
```json
{
  "data": "name,email\nAlice,alice@example.com\nBob,bob@example.com",
  "aiInstructions": "Convert to JSON array with name and email fields"
}
```
New to web scraping? Start here:
- Sign up for a free Apify account
- Find this Actor by searching "AI Lead Extractor"
- Add a URL in the `startUrls` field (e.g., "https://example.com")
- Click Start and wait 30-60 seconds
- Download results as JSON, CSV, or Excel
That's it! No coding required.
- Features
- Two Operating Modes
- Input Configuration
- Output Structure
- Pricing & Billing
- Use Cases
- API Usage
- Advanced Configuration
- Troubleshooting
- FAQ
- Support
- Normal Mode: Web scraping with full browser automation for JavaScript-heavy sites
- Standby Mode: Direct data analysis without browser overhead for faster processing
- Free Tier: Basic extraction plus AI via OpenRouter (pay-as-you-go)
- Paid Tier: Advanced AI included with superior accuracy and 60% cost savings
- Natural Language Instructions: Describe what you want in plain English (max 75 words)
- Flexible Output: JSON, CSV, markdown, or any custom format you specify
- Full Browser Rendering: Handles dynamic JavaScript websites seamlessly
- Automatic Screenshots: Captures page screenshots for visual reference and verification
- Smart Retry Logic: Automatically recovers from temporary failures
- Token Management: Built-in 30K token limit with automatic overflow protection
- Graceful Fallback: Falls back to basic extraction if AI encounters issues
- Memory-Based Pricing: Predictable costs based on allocated memory (Normal Mode only)
- Word-Based AI Billing: Pay only for words processed (input + output)
- Detailed Usage Tracking: Complete breakdown of all charges in results
- Tier-Based Discounts: Up to 75% savings for Bronze, Silver, and Gold subscribers
When to use: Extract data from live websites
How it works:
- Launches a full browser using Playwright
- Navigates to your specified URLs
- Extracts content using AI or basic regex patterns
- Captures screenshots for reference
- Returns structured data in your preferred format
Best for: Company websites, contact pages, product listings, competitor analysis
Example:
```json
{
  "startUrls": [
    {"url": "https://apify.com/about"},
    {"url": "https://example.com/contact"}
  ],
  "useAI": true,
  "aiInstructions": "Extract company description, CEO name, and all contact information"
}
```
Result: Structured data with company info, contacts, and a screenshot of each page
When to use: Analyze existing data without web scraping
How it works:
- No browser launched (faster and cheaper)
- Accepts any data format: JSON, CSV, text, XML, HTML
- AI analyzes and transforms your data
- Returns processed results in seconds
- Cost-effective for pure data processing tasks
Best for: Data transformation, format conversion, text analysis, report generation
Example using API:
```bash
curl -X POST "https://dz-omar--ai-lead-extractor.apify.actor?token=YOUR_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "data": "Company A: info@companya.com, +1-555-0100\nCompany B: contact@companyb.com, +1-555-0200",
    "aiInstructions": "Extract as JSON array with company name, email, and phone"
  }'
```
Result: Instant JSON transformation without any web scraping overhead
Key Difference:
- Normal Mode = Browser required, processes URLs, memory billing applies
- Standby Mode = No browser, processes your data, no memory billing for creator
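The Standby call above can also be issued from Python. The sketch below uses only the standard library and reuses the endpoint from the curl example; `build_standby_request` and `analyze` are illustrative names rather than part of the Actor, and the response is assumed to be JSON.

```python
import json
import urllib.parse
import urllib.request

# Endpoint copied from the curl example above.
STANDBY_URL = "https://dz-omar--ai-lead-extractor.apify.actor"


def build_standby_request(token, data, instructions):
    """Build (but do not send) a POST request for Standby Mode data analysis."""
    payload = json.dumps({"data": data, "aiInstructions": instructions}).encode("utf-8")
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        url=f"{STANDBY_URL}?{query}",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def analyze(token, data, instructions, timeout=60.0):
    """Send the request and decode the body (assumes the reply is JSON)."""
    request = build_standby_request(token, data, instructions)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `analyze("YOUR_TOKEN", "Company A: info@companya.com", "Extract as JSON array with company name and email")` would send the same request as the curl command. Keeping request construction separate from sending makes the payload easy to inspect before any charge is incurred.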
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
| `startUrls` | Array | Yes\* | `[]` | URLs to scrape (\*not needed in Standby Mode for data analysis) |
| `useAI` | Boolean | No | `false` | Enable AI-powered extraction |
| `aiInstructions` | String | No | Auto | Natural language instructions (max 75 words/500 chars) |
| `data` | String | No | `null` | Your data to analyze (Standby Mode only) |
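The constraints in this table can be checked on the client before a run is started. The helper below is a hypothetical pre-flight check, not the Actor's own validation; it assumes that the 75-word and 500-character limits both apply and that `data` can stand in for `startUrls`.

```python
MAX_WORDS = 75    # limit stated in the input table
MAX_CHARS = 500   # limit stated in the input table


def validate_input(actor_input):
    """Return a list of problems found in an input dict (empty list means OK)."""
    problems = []
    has_urls = bool(actor_input.get("startUrls"))
    has_data = bool(actor_input.get("data"))
    if not has_urls and not has_data:
        problems.append("provide startUrls (Normal Mode) or data (Standby Mode)")
    for entry in actor_input.get("startUrls") or []:
        url = entry.get("url", "") if isinstance(entry, dict) else ""
        if not url.startswith(("http://", "https://")):
            problems.append(f"invalid startUrls entry: {entry!r}")
    instructions = actor_input.get("aiInstructions") or ""
    if len(instructions.split()) > MAX_WORDS:
        problems.append(f"aiInstructions exceeds {MAX_WORDS} words")
    if len(instructions) > MAX_CHARS:
        problems.append(f"aiInstructions exceeds {MAX_CHARS} characters")
    return problems


print(validate_input({"startUrls": [{"url": "https://example.com/contact"}], "useAI": False}))
```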
Default AI Instructions (when useAI: true but no instructions provided):
"Extract all useful information: emails, phones, social links, descriptions, etc."
Perfect for simple contact scraping without AI costs.
```json
{
  "startUrls": [
    {"url": "https://example.com/contact"},
    {"url": "https://example.com/about"}
  ],
  "useAI": false
}
```
What you get:
- Email addresses
- Phone numbers
- Social media links
- Page screenshots
Cost: Memory only (~$0.01 per page)
AI extraction with usage-based billing for free users.
```json
{
  "startUrls": [{"url": "https://apify.com/about"}],
  "useAI": true,
  "aiInstructions": "Summarize this company in 3 bullet points and extract the CEO's name"
}
```
What you get:
- AI-generated summary
- CEO name
- Basic contacts
- Full cost breakdown
Cost: Memory + AI words (~$0.02-$0.05 per page)
Advanced AI with better accuracy and discounted rates.
```json
{
  "startUrls": [{"url": "https://apify.com/about"}],
  "useAI": true,
  "aiInstructions": "Extract: CEO full name, primary support email, company mission statement, year founded, and number of employees if mentioned"
}
```
What you get:
- More accurate extraction
- Better understanding of complex requests
- 60% lower costs than free tier
Cost: ~$0.01 per page (GOLD tier example)
Analyze your own data without any web scraping.
```json
{
  "data": "Product A: $299, 4.5 stars, 128 reviews\nProduct B: $399, 4.8 stars, 256 reviews\nProduct C: $199, 4.2 stars, 89 reviews",
  "aiInstructions": "Convert to JSON array with fields: name, price (as number), rating, reviewCount"
}
```
What you get:
- Structured JSON output
- Fast processing (no browser)
- No memory charges (in Standby)
Cost: AI words only
Process many pages in one run.
```json
{
  "startUrls": [
    {"url": "https://company1.com/about"},
    {"url": "https://company2.com/contact"},
    {"url": "https://company3.com/team"},
    {"url": "https://company4.com/careers"}
  ],
  "useAI": true,
  "aiInstructions": "Extract company name, industry, and all email addresses"
}
```
What you get:
- Consistent data structure across all URLs
- Parallel processing for speed
- One dataset with all results
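Writing a long `startUrls` array by hand is error-prone, so a batch input can be generated from a plain list of URLs. `build_batch_input` is an illustrative helper that assumes only the `startUrls`, `useAI`, and `aiInstructions` fields documented above; it also drops duplicate URLs while preserving order.

```python
import json


def build_batch_input(urls, instructions=None):
    """Create an Actor input dict from an iterable of URLs, skipping duplicates."""
    seen = set()
    start_urls = []
    for url in urls:
        url = url.strip()
        if url and url not in seen:
            seen.add(url)
            start_urls.append({"url": url})
    # AI is enabled only when instructions are supplied.
    actor_input = {"startUrls": start_urls, "useAI": instructions is not None}
    if instructions is not None:
        actor_input["aiInstructions"] = instructions
    return actor_input


batch = build_batch_input(
    [
        "https://company1.com/about",
        "https://company2.com/contact",
        "https://company1.com/about",  # duplicate, will be dropped
    ],
    "Extract company name, industry, and all email addresses",
)
print(json.dumps(batch, indent=2))
```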
```json
{
  "url": "https://example.com/contact",
  "title": "Contact Us - Example Company",
  "basicExtraction": {
    "emails": ["info@example.com", "support@example.com"],
    "phones": ["+1-555-0100", "+1-800-EXAMPLE"],
    "socialLinks": [
      "https://twitter.com/example",
      "https://linkedin.com/company/example"
    ],
    "extractionMethod": "regex"
  },
  "screenshot": {
    "available": true,
    "url": "https://api.apify.com/v2/key-value-stores/xyz/records/screenshot-123456"
  },
  "extractionTier": "FREE",
  "extractionMethod": "Basic Only",
  "scrapedAt": "2025-01-10T14:30:00.000Z"
}
```
```json
{
  "url": "https://apify.com/about",
  "title": "About · Apify",
  "aiExtraction": {
    "company_name": "Apify",
    "ceo_name": "Jan Čurn",
    "support_email": "support@apify.com",
    "company_mission": "Make the web more programmable",
    "year_founded": "2015",
    "summary": "Apify is a web scraping and automation platform that helps developers extract data from websites and automate web workflows."
  },
  "basicExtraction": {
    "emails": ["support@apify.com", "hello@apify.com"],
    "phones": ["+420-123-456-789"],
    "socialLinks": [
      "https://linkedin.com/company/apify",
      "https://twitter.com/apify"
    ]
  },
  "aiCost": {
    "inputWords": 2836,
    "outputWords": 2606,
    "totalWords": 5442,
    "cost": "$0.015981",
    "breakdown": "Processed 2,836 input words, generated 2,606 output words"
  },
  "screenshot": {
    "available": true,
    "url": "https://api.apify.com/v2/key-value-stores/xyz/records/screenshot-789012"
  },
  "extractionTier": "PAID",
  "extractionMethod": "AI-Powered + Basic",
  "userTier": "GOLD",
  "scrapedAt": "2025-01-10T14:30:00.000Z"
}
```
| Field | Description | Always Present |
|---|---|---|
| `url` | The scraped website URL | ✅ Yes |
| `title` | HTML page title | ✅ Yes |
| `aiExtraction` | AI-extracted data (structure varies by instructions) | Only with AI |
| `basicExtraction` | Contact info (emails, phones, social links) | ✅ Yes |
| `aiCost` | Detailed AI usage breakdown (words processed, cost) | Only with AI |
| `screenshot` | Screenshot URL for visual reference | ✅ Yes |
| `extractionMethod` | Method used (Basic, AI-Powered, etc.) | ✅ Yes |
| `extractionTier` | User tier (FREE, PAID) | ✅ Yes |
| `userTier` | Subscription level (FREE, BRONZE, SILVER, GOLD) | ✅ Yes |
| `scrapedAt` | ISO 8601 timestamp | ✅ Yes |
Note: The structure of aiExtraction varies based on your aiInstructions. The AI automatically determines the best format for your requested data.
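Because `aiExtraction` and `aiCost` appear only when AI is enabled, code that consumes results should treat them as optional. A minimal sketch, assuming items shaped like the examples above (`summarize_item` is a hypothetical helper, not something the Actor provides):

```python
def summarize_item(item):
    """Flatten one dataset item, tolerating missing or null optional fields."""
    basic = item.get("basicExtraction") or {}   # may be null on failed pages
    ai = item.get("aiExtraction") or {}         # only present with AI enabled
    cost = item.get("aiCost") or {}             # only present with AI enabled
    return {
        "url": item.get("url"),
        "emails": sorted(set(basic.get("emails", []))),
        "phones": basic.get("phones", []),
        # aiExtraction has no fixed shape, so only report keys when it is an object.
        "ai_fields": sorted(ai.keys()) if isinstance(ai, dict) else [],
        "ai_total_words": cost.get("totalWords", 0),
    }


example = {
    "url": "https://example.com/contact",
    "basicExtraction": {"emails": ["info@example.com", "info@example.com"], "phones": []},
}
print(summarize_item(example))
```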
This actor has two billing components:
- Memory-Based Billing (Normal Mode only)
- AI Word-Based Billing (when AI is enabled)
Charged every 30 seconds based on allocated memory. Does NOT apply in Standby Mode.
| Memory | Events per 30s | FREE Tier | GOLD Tier | Savings |
|---|---|---|---|---|
| 128 MB | 1 | $0.0008 | $0.0002 | 75% |
| 256 MB | 2 | $0.0016 | $0.0004 | 75% |
| 512 MB | 4 | $0.0032 | $0.0008 | 75% |
| 1024 MB | 8 | $0.0064 | $0.0016 | 75% |
Formula: Events = Memory (MB) ÷ 128
Example: Running at 256 MB for 2 minutes
- FREE tier: 4 intervals × 2 events × $0.0016 = $0.0128
- GOLD tier: 4 intervals × 2 events × $0.0004 = $0.0032 (75% savings)
Charged per 1,000 words processed when AI is enabled.
| Type | FREE | BRONZE | SILVER | GOLD |
|---|---|---|---|---|
| Input Words (per 1K) | $0.0015 | $0.0012 | $0.0010 | $0.0008 |
| Output Words (per 1K) | $0.0045 | $0.0036 | $0.0030 | $0.0024 |
Conversion: ~1 token = 0.75 words (approximately 4 characters)
Example AI extraction (typical page):
- Input: 2,836 words (page content)
- Output: 2,606 words (extracted data)
- FREE tier: $0.00425 + $0.01173 = $0.01598
- GOLD tier: $0.00227 + $0.00625 = $0.00852 (47% savings)
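The word-based arithmetic above can be reproduced with a few lines of Python. The rates are copied from the table; the function names are illustrative, and the token estimate simply applies the stated rule of thumb of about 0.75 words per token.

```python
# Rates in USD per 1,000 words, copied from the pricing table above.
RATES = {
    "FREE":   {"input": 0.0015, "output": 0.0045},
    "BRONZE": {"input": 0.0012, "output": 0.0036},
    "SILVER": {"input": 0.0010, "output": 0.0030},
    "GOLD":   {"input": 0.0008, "output": 0.0024},
}


def ai_cost(input_words, output_words, tier="FREE"):
    """Estimated AI charge in USD for one extraction."""
    rate = RATES[tier]
    return (input_words * rate["input"] + output_words * rate["output"]) / 1000


def estimate_tokens(words):
    """Rough token count using the ~0.75 words-per-token rule of thumb."""
    return round(words / 0.75)


free = ai_cost(2836, 2606, "FREE")
gold = ai_cost(2836, 2606, "GOLD")
print(f"FREE ${free:.6f}  GOLD ${gold:.6f}  savings {1 - gold / free:.0%}")
print("approx. tokens:", estimate_tokens(2836 + 2606))
```

An estimate like this is useful for checking a page against the 30K token limit before enabling AI on a large batch.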
Scenario: Extract contacts from 5 pages, 2 minutes total, 256 MB memory
- Memory Cost (FREE): $0.0128
- AI Cost: $0 (AI disabled)
- Total: $0.0128
Scenario: AI extract from 3 pages, 3 minutes total, 256 MB memory
- Memory Cost: 6 intervals × 2 events × $0.0016 = $0.0192
- AI Cost (3 pages avg): 3 × $0.016 = $0.048
- Total: $0.0672
Scenario: Same as Example 2, but with GOLD subscription
- Memory Cost: 6 intervals × 2 events × $0.0004 = $0.0048
- AI Cost (3 pages avg): 3 × $0.0085 = $0.0255
- Total: $0.0303 (55% savings vs FREE)
Scenario: Analyze CSV data with 5,000 words
- Memory Cost: $0 (no browser in Standby)
- AI Cost (GOLD): ~$0.005
- Total: $0.005
Note: In Standby Mode with PPE (Pay-Per-Event), the user pays execution costs, not the creator.
| Feature | Free Tier | Paid Tiers (BRONZE/SILVER/GOLD) |
|---|---|---|
| Basic Extraction | ✅ Always included | ✅ Always included |
| AI Model | Simple (OpenRouter) | Advanced (higher accuracy) |
| AI Billing | Pay-per-use | 20-47% discounts |
| Memory Rates | Standard | Up to 75% savings |
| AI Accuracy | Good | Superior |
| Complex Instructions | Limited | Excellent |
| Support Priority | Standard | Priority response |
- Use Basic Extraction First: If you only need emails/phones, disable AI to save costs
- Batch Processing: Process multiple URLs in one run to amortize startup costs
- Lower Memory: Start with 256 MB and increase only if needed
- Upgrade for Volume: If processing 50+ pages/month with AI, paid tiers save money
- Use Standby Mode: For data analysis tasks, Standby Mode avoids memory charges
Extract comprehensive contact information and company data for sales prospecting.
Example Tasks:
- Build targeted prospect lists from company websites
- Extract decision-maker names and contact details
- Gather company descriptions for personalized outreach
- Find direct phone numbers and email addresses
Sample Instructions:
"Extract CEO name, CFO name, main company email, phone number, and LinkedIn profile"
"Find all team members listed with their job titles and email addresses"
"Get company size, industry, and all available contact methods"
Real-World Workflow:
- Input list of 50 competitor/prospect websites
- Extract key decision-makers and contacts
- Export to CSV for CRM import
- Use data for personalized outreach campaigns
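Apify can export a dataset as CSV directly, but a CRM import often needs specific columns. The sketch below shows one way to flatten downloaded JSON items into such a file; the column choice and the `;` separator for multi-valued fields are assumptions, and `items_to_csv` is a hypothetical helper.

```python
import csv
import io


def items_to_csv(items):
    """Write one CSV row per scraped page, joining multi-valued fields with ';'."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["url", "title", "emails", "phones", "socialLinks"])
    for item in items:
        basic = item.get("basicExtraction") or {}  # null when a page failed
        writer.writerow([
            item.get("url", ""),
            item.get("title", ""),
            ";".join(basic.get("emails", [])),
            ";".join(basic.get("phones", [])),
            ";".join(basic.get("socialLinks", [])),
        ])
    return buffer.getvalue()


sample = [{
    "url": "https://example.com/contact",
    "title": "Contact Us - Example Company",
    "basicExtraction": {
        "emails": ["info@example.com", "support@example.com"],
        "phones": ["+1-555-0100"],
        "socialLinks": [],
    },
}]
print(items_to_csv(sample))
```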
Analyze competitor offerings, pricing, and market positioning.
Example Tasks:
- Compare pricing plans across multiple competitors
- Extract product features and specifications
- Monitor changes in competitor messaging
- Analyze value propositions
Sample Instructions:
"List all pricing tiers with features and monthly costs"
"Summarize the company's main value proposition in 2 sentences"
"Extract all product names with their descriptions and target audience"
"Compare this company's features against industry standard offerings"
Real-World Workflow:
- Scrape 20 competitor pricing pages
- Extract structured pricing data
- Generate comparison matrix
- Identify pricing gaps and opportunities
Transform unstructured data into structured formats.
Example Tasks:
- Convert CSV to JSON with custom fields
- Clean and normalize messy datasets
- Extract specific information from documents
- Merge and deduplicate data sources
Sample Instructions:
"Convert this CSV to JSON array with fields: name, email, company, title"
"Extract all dates, amounts, and invoice numbers from this text"
"Normalize phone numbers to E.164 format and remove duplicates"
"Parse job postings and extract: position, salary range, location, requirements"
Real-World Workflow:
- Upload raw CSV/text data via Standby Mode
- Specify desired output structure
- Get clean, structured JSON
- Import directly into your database
Gather information for content creation and fact-checking.
Example Tasks:
- Research topics with source attribution
- Extract statistics and data points
- Generate article summaries
- Collect citation information
Sample Instructions:
"Extract all statistics mentioned with their sources"
"Generate a 5-bullet-point summary of the main arguments"
"List all studies/papers cited with publication dates and authors"
"Extract key takeaways and actionable insights"
**Real-World Workflow**:
1. Input list of research articles/blogs
2. Extract key facts and statistics
3. Generate summary report
4. Use as research base for content creation
---
### E-Commerce & Product Intelligence
Monitor product listings, reviews, and market trends.
**Example Tasks**:
- Extract product specifications
- Monitor price changes
- Analyze review sentiment
- Track competitor inventory
**Sample Instructions**:
"Extract product name, price, rating, and number of reviews"
"List all product features and specifications in JSON format"
"Summarize top 3 positive and negative points from reviews"
**Real-World Workflow**:
1. Scrape competitor product pages
2. Extract pricing and features
3. Monitor changes daily
4. Adjust your pricing strategy accordingly
---
### Email Finder & Contact Enrichment
Discover and verify contact information for outreach.
**Example Tasks**:
- Find verified email addresses
- Extract phone numbers with validation
- Discover social media profiles
- Build contact databases
**Sample Instructions**:
"Find all email addresses and categorize as: sales, support, general"
"Extract contact form URL, main phone, and support email"
"List team members with their roles and LinkedIn profiles"
**Real-World Workflow**:
1. Input list of target company websites
2. Extract all contact methods
3. Verify and categorize contacts
4. Export to your CRM or email tool
---
## API Usage
### Method 1: Run Actor via Apify API
Standard actor execution for batch processing.
```bash
curl -X POST "https://api.apify.com/v2/acts/IeZZMR1Uv6J9h7pdS/runs?token=$APIFY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "startUrls": [
      {"url": "https://apify.com/about"},
      {"url": "https://example.com/contact"}
    ],
    "useAI": true,
    "aiInstructions": "Extract company info and contact details"
  }'
```
Response:
```json
{
  "data": {
    "id": "run_abc123xyz",
    "status": "RUNNING",
    "defaultDatasetId": "dataset_xyz789"
  }
}
```
Then fetch results:
```bash
curl "https://api.apify.com/v2/datasets/dataset_xyz789/items?token=$APIFY_TOKEN"
```
Standby Mode returns fast, synchronous responses for real-time applications:
```bash
curl -X POST "https://dz-omar--ai-lead-extractor.apify.actor?token=YOUR_STANDBY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/about",
    "useAI": true,
    "aiInstructions": "Extract company name, description, and contact email"
  }'
```
Response (immediate):
```json
{
  "url": "https://example.com/about",
  "aiExtraction": {
    "company_name": "Example Corp",
    "description": "Leading provider of...",
    "contact_email": "info@example.com"
  },
  "basicExtraction": {
    "emails": ["info@example.com"],
    "phones": ["+1-555-0100"]
  }
}
```
The same endpoint accepts raw data instead of a URL:
```bash
curl -X POST "https://dz-omar--ai-lead-extractor.apify.actor?token=YOUR_STANDBY_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "data": "Alice,alice@example.com,CEO\nBob,bob@example.com,CTO\nCarol,carol@example.com,CFO",
    "aiInstructions": "Convert to JSON array with fields: name, email, role"
  }'
```
Response (immediate):
```json
[
  {"name": "Alice", "email": "alice@example.com", "role": "CEO"},
  {"name": "Bob", "email": "bob@example.com", "role": "CTO"},
  {"name": "Carol", "email": "carol@example.com", "role": "CFO"}
]
```
Be Specific:
✅ "Extract: CEO full name, primary support email, company mission (max 20 words)"
❌ "Get company info"
Specify Format:
✅ "Return as JSON with fields: name, email, phone. Phone in E.164 format."
❌ "Give me contact details"
Set Boundaries:
✅ "Summarize in exactly 3 bullet points, max 15 words each"
❌ "Summarize the page"
Use Examples:
✅ "Extract pricing like: {tier: 'Pro', price: 29, features: ['A', 'B', 'C']}"
❌ "Extract pricing"
Choose memory based on your needs:
| Memory | Use Case | Typical Speed | Cost (FREE) |
|---|---|---|---|
| 128 MB | Simple pages, no JS | 15-30 sec | $0.0008/30s |
| 256 MB | Recommended default | 10-20 sec | $0.0016/30s |
| 512 MB | Complex JS sites | 5-10 sec | $0.0032/30s |
| 1024 MB | Heavy pages, many images | 3-7 sec | $0.0064/30s |
Recommendation: Start with 256 MB. Increase only if you see timeout errors or slow performance.
Batch Processing (Normal Mode), up to 1000+ URLs per run:
```json
{
  "startUrls": [
    {"url": "https://site1.com"},
    {"url": "https://site2.com"},
    {"url": "https://site3.com"}
  ],
  "useAI": true,
  "aiInstructions": "Extract contacts"
}
```
Pagination Support:
```json
{
  "startUrls": [
    {"url": "https://example.com/page?p=1"},
    {"url": "https://example.com/page?p=2"},
    {"url": "https://example.com/page?p=3"}
  ]
}
```
The actor automatically handles common errors:
- Timeouts: Automatic retry with increased timeout
- Network errors: 3 retry attempts with exponential backoff
- AI failures: Graceful fallback to basic extraction
- Invalid URLs: Skipped with error logged
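The retries listed above happen inside the Actor. A client that calls the API may want the same protection against temporary failures, and the sketch below shows the general exponential-backoff technique. It is not the Actor's implementation: `retry_with_backoff`, the one-second base delay, and treating `attempts` as the total number of tries are all assumptions.

```python
import time


def retry_with_backoff(operation, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Call operation(); on failure wait base_delay * 2**n seconds and try again."""
    for attempt in range(attempts):
        try:
            return operation()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            sleep(base_delay * 2 ** attempt)  # 1s, 2s, 4s, ...


# Demonstration with a function that fails twice before succeeding.
calls = []
delays = []


def flaky():
    calls.append(1)
    if len(calls) < 3:
        raise ConnectionError("temporary failure")
    return "ok"


result = retry_with_backoff(flaky, sleep=delays.append)  # record delays instead of sleeping
print(result, delays)
```

Passing `sleep` as a parameter keeps the helper testable without real waiting.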
Error Output Example:
```json
{
  "url": "https://invalid-site.com",
  "error": "Page timeout after 60 seconds",
  "errorType": "TIMEOUT",
  "basicExtraction": null,
  "scrapedAt": "2025-01-10T14:30:00.000Z"
}
```
- Website: flowextractapi.com
- Email: flowextractapi@outlook.com
- Apify Profile: FlowExtract API
- GitHub Issues: FlowExtractAPI
- LinkedIn: flowextract-api
- Twitter: @FlowExtractAPI
- Facebook: flowextractapi
- Apify Docs: docs.apify.com
- API Reference: docs.apify.com/api/v2
- Tutorials: Check our blog for guides and examples
- YouTube Transcript Extractor - Extract transcripts with timestamps
- YouTube Scraper Pro - Complete channel and playlist extraction
- Zoom Scraper - Download recordings and transcripts
- Loom Scraper - Loom video and transcript extraction
- Idealista Scraper API - Spanish property data with API
- Idealista Scraper - Real estate listings extractor
- Screenshot - Fast webpage screenshots
- Ultimate Screenshot - Advanced screenshot tool
- Network Security Scanner - Security vulnerability scanner
- Facebook Ads Scraper Pro - Extract Facebook ads data
This actor is provided as-is for use on the Apify platform. Use responsibly and in accordance with applicable laws and website terms of service.
This actor extracts publicly available information from websites. Ensure your use complies with:
- ✅ Website Terms of Service
- ✅ Copyright laws
- ✅ Data protection regulations (GDPR, CCPA)
- ✅ Robots.txt directives
- ✅ Your jurisdiction's laws
You are responsible for how you use extracted data. Use ethically and legally.
- Sign up for Apify (free tier available)
- Try AI Lead Extractor
- Configure your first extraction
- Get results in seconds
Have questions? Contact us at flowextractapi@outlook.com