AllProfanity

A blazing-fast, multi-language profanity filter for JavaScript/TypeScript with advanced algorithms (Aho-Corasick, Bloom Filters) delivering 664% faster performance on large texts, intelligent leet-speak detection, and pattern-based context analysis.

What's New in v2.2.0

Aho-Corasick Algorithm: 664% faster on large texts (1KB+) with O(n) multi-pattern matching
Bloom Filters: Lightning-fast probabilistic lookups reduce unnecessary checks
Result Caching: 123x speedup on repeated inputs (perfect for chat apps and forms)
Pattern-Based Context Detection: Reduces false positives in medical/negation contexts
Word Boundary Detection: Smart whole-word matching prevents flagging "assassin" or "assistance"
Flexible Configuration: Choose algorithm and trade-offs based on your use case

Read the full Performance Analysis →

Features

Performance & Speed

Multiple Algorithm Options: Choose between Trie (default), Aho-Corasick, or Hybrid modes
664% Faster on Large Texts: Aho-Corasick delivers O(n) multi-pattern matching
123x Speedup with Caching: Result cache perfect for repeated checks (chat, forms, APIs)
~27K ops/sec: Default Trie mode handles short texts incredibly fast
Single-Pass Scanning: O(n) complexity regardless of dictionary size
Batch Processing Ready: Optimized for high-throughput API endpoints

Accuracy & Detection

Word Boundary Matching: Smart whole-word detection prevents false positives like "assassin" or "assistance"
Pattern-Based Context Detection: Recognizes medical terms ("anal region") and negation patterns ("not bad")
Advanced Leet-Speak: Detects obfuscated profanities (f#ck, a55hole, sh1t, etc.)
Comprehensive Coverage: Catches profanity while minimizing false flags
Configurable Strictness: Tune detection sensitivity to your needs

Multi-Language & Flexibility

Multi-Language Support: Built-in dictionaries for English, Hindi, French, German, Spanish, Bengali, Tamil, Telugu, Brazilian Portuguese
Multiple Scripts: Latin/Roman (Hinglish) and native scripts (Devanagari, Tamil, Telugu, etc.)
Custom Dictionaries: Add/remove words or entire language packs at runtime
Whitelisting: Exclude safe words from detection
Severity Scoring: Assess content offensiveness (MILD, MODERATE, SEVERE, EXTREME)

Developer Experience

TypeScript Support: Fully typed API with comprehensive documentation
Zero 3rd-Party Dependencies: Only internal code and data
Configurable: Tune performance vs accuracy for your use case
No Dictionary Exposure: Secure by design - word lists never exposed
Universal: Works in Node.js and browsers

Installation

npm install allprofanity
# or
yarn add allprofanity

Generate configuration file (optional):

npx allprofanity
# Creates allprofanity.config.json and config.schema.json in your project

Quick Start

import profanity from 'allprofanity';

// Simple check
profanity.check('This is a clean sentence.');        // false
profanity.check('What the f#ck is this?');           // true (leet-speak detected)
profanity.check('यह एक चूतिया परीक्षण है।');           // true (Hindi)
profanity.check('Ye ek chutiya test hai.');          // true (Hinglish Roman script)

Algorithm Configuration

AllProfanity v2.2+ offers multiple algorithms optimized for different use cases. You can configure via constructor options or config file.

Configuration Methods

Method 1: Constructor Options (Inline)

import { AllProfanity } from 'allprofanity';

const filter = new AllProfanity({
  algorithm: { matching: "hybrid" },
  performance: { enableCaching: true }
});

Method 2: Config File (Recommended)

# Generate config files in your project
npx allprofanity

# This creates:
# - allprofanity.config.json (main config)
# - config.schema.json (for IDE autocomplete)

import { AllProfanity } from 'allprofanity';
import config from './allprofanity.config.json';

// Load from generated config file
const filter = AllProfanity.fromConfig(config);

// Or directly from object (no file needed)
const filter2 = AllProfanity.fromConfig({
  algorithm: { matching: "hybrid", useContextAnalysis: true },
  performance: { enableCaching: true, cacheSize: 1000 }
});

Example Config File (allprofanity.config.json):

{
  "algorithm": {
    "matching": "hybrid",
    "useAhoCorasick": true,
    "useBloomFilter": true,
    "useContextAnalysis": true
  },
  "contextAnalysis": {
    "enabled": true,
    "contextWindow": 50,
    "languages": ["en"],
    "scoreThreshold": 0.5
  },
  "profanityDetection": {
    "enableLeetSpeak": true,
    "caseSensitive": false,
    "strictMode": false
  },
  "performance": {
    "enableCaching": true,
    "cacheSize": 1000
  }
}

Config File: Run npx allprofanity to generate config files in your project. The JSON schema provides IDE autocomplete and validation.

Quick Configuration Examples

1. Default (Best for General Use)

import { AllProfanity } from 'allprofanity';
const filter = new AllProfanity();
// Uses optimized Trie - fast and reliable (~27K ops/sec)

2. Large Text Processing (Documents, Articles)

const filter = new AllProfanity({
  algorithm: { matching: "aho-corasick" }
});
// 664% faster on 1KB+ texts

3. Reduced False Positives (Social Media, Content Moderation)

const filter = new AllProfanity({
  algorithm: {
    matching: "hybrid",
    useBloomFilter: true,
    useAhoCorasick: true,
    useContextAnalysis: true
  },
  contextAnalysis: {
    enabled: true,
    contextWindow: 50,
    languages: ["en"],
    scoreThreshold: 0.5
  }
});
// Pattern-based context detection reduces medical/negation false positives

4. Repeated Checks (Chat, Forms, APIs)

const filter = new AllProfanity({
  performance: {
    enableCaching: true,
    cacheSize: 1000
  }
});
// 123x speedup on cache hits

5. Medical/Professional Content

const filter = new AllProfanity({
  algorithm: {
    matching: "hybrid",
    useContextAnalysis: true
  },
  contextAnalysis: {
    enabled: true,
    contextWindow: 100,
    scoreThreshold: 0.7  // Higher threshold = less sensitive
  }
});
// Reduces false positives from medical terms using keyword patterns

Performance Characteristics

Use Case	Algorithm	Speed	Detection	Best For
Short texts (<500 chars)	Trie (default)	~27K ops/sec	Excellent	Chat, comments
Large texts (1KB+)	Aho-Corasick	~9.6K ops/sec	Excellent	Documents, articles
Repeated patterns	Any + Caching	123x faster	Excellent	Forms, validation
Content moderation	Hybrid + Context	Moderate	Good (fewer false positives)	Social media, UGC
Professional content	Hybrid + Context (strict)	Moderate	Reduced false flags	Medical, academic

See detailed benchmarks and comparisons →

API Reference & Examples

`check(text: string): boolean`

Returns true if the text contains any profanity.

profanity.check('This is a clean sentence.');  // false
profanity.check('This is a bullshit sentence.'); // true
profanity.check('What the f#ck is this?'); // true (leet-speak)
profanity.check('यह एक चूतिया परीक्षण है।'); // true (Hindi)

`detect(text: string): ProfanityDetectionResult`

Returns a detailed result:

hasProfanity: boolean
detectedWords: string[] (actual matched words)
cleanedText: string (character-masked)
severity: ProfanitySeverity (MILD, MODERATE, SEVERE, EXTREME)
positions: Array<{ word: string, start: number, end: number }>

const result = profanity.detect('This is fucking bullshit and chutiya.');
console.log(result.hasProfanity); // true
console.log(result.detectedWords); // ['fucking', 'bullshit', 'chutiya']
console.log(result.severity); // 3 (SEVERE)
console.log(result.cleanedText); // "This is ******* ******** and ******."
console.log(result.positions); // e.g. [{word: 'fucking', start: 8, end: 15}, ...]

`clean(text: string, placeholder?: string): string`

Replace each character of profane words with a placeholder (default: *).

profanity.clean('This contains bullshit.'); // "This contains ********."
profanity.clean('This contains bullshit.', '#'); // "This contains ########."
profanity.clean('यह एक चूतिया परीक्षण है।'); // e.g. "यह एक ***** परीक्षण है।"

`cleanWithPlaceholder(text: string, placeholder?: string): string`

Replace each profane word with a single placeholder (default: ***).
(If the placeholder is omitted, uses ***.)

profanity.cleanWithPlaceholder('This contains bullshit.'); // "This contains ***."
profanity.cleanWithPlaceholder('This contains bullshit.', '[CENSORED]'); // "This contains [CENSORED]."
profanity.cleanWithPlaceholder('यह एक चूतिया परीक्षण है।', '####'); // e.g. "यह एक #### परीक्षण है।"

`add(word: string | string[]): void`

Add a word or an array of words to the profanity filter.

profanity.add('badword123');
profanity.check('This is badword123.'); // true

profanity.add(['mierda', 'puta']);
profanity.check('Esto es mierda.'); // true (Spanish)
profanity.check('Qué puta situación.'); // true

`remove(word: string | string[]): void`

Remove a word or an array of words from the profanity filter.

profanity.remove('bullshit');
profanity.check('This is bullshit.'); // false

profanity.remove(['mierda', 'puta']);
profanity.check('Esto es mierda.'); // false

`addToWhitelist(words: string[]): void`

Whitelist words so they are never flagged as profane.

profanity.addToWhitelist(['fuck', 'idiot','shit']);
profanity.check('He is an fucking idiot.'); // false
profanity.check('Fuck this shit.'); // false
// Remove from whitelist to restore detection
profanity.removeFromWhitelist(['fuck', 'idiot','shit']);

`removeFromWhitelist(words: string[]): void`

Remove words from the whitelist so they can be detected again.

profanity.removeFromWhitelist(['anal']);

`setPlaceholder(placeholder: string): void`

Set the default placeholder character for clean().

profanity.setPlaceholder('#');
profanity.clean('This is bullshit.'); // "This is ########."
profanity.setPlaceholder('*'); // Reset to default

`updateConfig(options: Partial<AllProfanityOptions>): void`

Change configuration at runtime.
Options include: enableLeetSpeak, caseSensitive, strictMode, detectPartialWords, defaultPlaceholder, languages, whitelistWords.

profanity.updateConfig({ caseSensitive: true, enableLeetSpeak: false });
profanity.check('FUCK'); // false (if caseSensitive)
profanity.updateConfig({ caseSensitive: false, enableLeetSpeak: true });
profanity.check('f#ck'); // true

`loadLanguage(language: string): boolean`

Load a built-in language.

profanity.loadLanguage('french');
profanity.check('Ce mot est merde.'); // true

`loadLanguages(languages: string[]): number`

Load multiple built-in languages at once.

profanity.loadLanguages(['english', 'french', 'german']);
profanity.check('Das ist scheiße.'); // true (German)

`loadIndianLanguages(): number`

Convenience: Load all major Indian language packs.

profanity.loadIndianLanguages();
profanity.check('यह एक बेंगाली गाली है।'); // true (Bengali)
profanity.check('This is a Tamil profanity: புண்டை'); // true

`loadCustomDictionary(name: string, words: string[]): void`

Add your own dictionary as an additional language.

profanity.loadCustomDictionary('swedish', ['fan', 'jävla', 'skit']);
profanity.loadLanguage('swedish');
profanity.check('Det här är skit.'); // true

`getLoadedLanguages(): string[]`

Returns the names of all currently loaded language packs.

console.log(profanity.getLoadedLanguages()); // ['english', 'hindi', ...]

`getAvailableLanguages(): string[]`

Returns the names of all available built-in language packs.

console.log(profanity.getAvailableLanguages());
// ['english', 'hindi', 'french', 'german', 'spanish', 'bengali', 'tamil', 'telugu', 'brazilian']

`clearList(): void`

Remove all loaded languages and dynamic words (start with a clean filter).

profanity.clearList();
profanity.check('fuck'); // false
profanity.loadLanguage('english');
profanity.check('fuck'); // true

`getConfig(): Partial<AllProfanityOptions>`

Get the current configuration.

console.log(profanity.getConfig());
/*
{
  defaultPlaceholder: '*',
  enableLeetSpeak: true,
  caseSensitive: false,
  strictMode: false,
  detectPartialWords: false,
  languages: [...],
  whitelistWords: [...]
}
*/

Configuration File Structure

AllProfanity supports JSON-based configuration for easy setup and deployment. The config file structure supports all algorithm and detection options.

Full Configuration Schema

{
  "algorithm": {
    "matching": "trie" | "aho-corasick" | "hybrid",  // Algorithm selection
    "useAhoCorasick": boolean,                        // Enable Aho-Corasick
    "useBloomFilter": boolean,                        // Enable Bloom Filter
    "useContextAnalysis": boolean                     // Enable context analysis
  },
  "bloomFilter": {
    "enabled": boolean,                               // Enable/disable
    "expectedItems": number,                          // Expected dictionary size (default: 10000)
    "falsePositiveRate": number                       // Acceptable false positive rate (default: 0.01)
  },
  "ahoCorasick": {
    "enabled": boolean,                               // Enable/disable
    "prebuild": boolean                               // Prebuild automaton (default: true)
  },
  "contextAnalysis": {
    "enabled": boolean,                               // Enable/disable pattern-based context detection
    "contextWindow": number,                          // Characters around match to check (default: 50)
    "languages": string[],                            // Languages for keyword patterns (default: ["en"])
    "scoreThreshold": number                          // Detection threshold 0-1 (default: 0.5)
  },
  "profanityDetection": {
    "enableLeetSpeak": boolean,                       // Detect l33t speak (default: true)
    "caseSensitive": boolean,                         // Case sensitive matching (default: false)
    "strictMode": boolean,                            // Require word boundaries (default: false)
    "detectPartialWords": boolean,                    // Detect within words (default: false)
    "defaultPlaceholder": string                      // Default censoring character (default: "*")
  },
  "performance": {
    "enableCaching": boolean,                         // Enable result cache (default: false)
    "cacheSize": number                               // Cache size limit (default: 1000)
  }
}

Pre-configured Templates

High Performance (Large Texts)

{
  "algorithm": { "matching": "aho-corasick" },
  "ahoCorasick": { "enabled": true, "prebuild": true },
  "profanityDetection": { "enableLeetSpeak": true }
}

Reduced False Positives (Content Moderation)

{
  "algorithm": {
    "matching": "hybrid",
    "useContextAnalysis": true,
    "useBloomFilter": true
  },
  "contextAnalysis": {
    "enabled": true,
    "contextWindow": 50,
    "scoreThreshold": 0.5
  },
  "performance": { "enableCaching": true }
}

Balanced (Production)

{
  "algorithm": {
    "matching": "hybrid",
    "useAhoCorasick": true,
    "useBloomFilter": true
  },
  "profanityDetection": { "enableLeetSpeak": true },
  "performance": { "enableCaching": true, "cacheSize": 1000 }
}

Using Config Files

Step 1: Generate Config Files

# Run this in your project directory
npx allprofanity

# Output:
# ✅ AllProfanity configuration files created!
#
# Created files:
#   📄 allprofanity.config.json - Main configuration
#   📄 config.schema.json - JSON schema for IDE autocomplete

Step 2: Load Config in Your Code

// ES Modules / TypeScript
import { AllProfanity } from 'allprofanity';
import config from './allprofanity.config.json';

const filter = AllProfanity.fromConfig(config);

// CommonJS (Node.js)
const { AllProfanity } = require('allprofanity');
const config = require('./allprofanity.config.json');

const filter = AllProfanity.fromConfig(config);

Step 3: Customize Config

Edit allprofanity.config.json to enable/disable features. Your IDE will provide autocomplete thanks to the JSON schema!

Severity Levels

Severity reflects the number and variety of detected profanities:

Level	Enum Value	Description
MILD	1	1 unique/total word
MODERATE	2	2 unique or total words
SEVERE	3	3 unique/total words
EXTREME	4	4+ unique or 5+ total profane words

Language Support

Built-in: English, Hindi, French, German, Spanish, Bengali, Tamil, Telugu, Brazilian Portuguese
Scripts: Latin/Roman, Devanagari, Tamil, Telugu, Bengali, etc.
Mixed Content: Handles mixed-language and code-switched sentences.

profanity.check('This is bullshit and चूतिया.'); // true (mixed English/Hindi)
profanity.check('Ce mot est merde and पागल.');   // true (French/Hindi)
profanity.check('Isso é uma merda.');             // true (Brazilian Portuguese)

Use Exported Wordlists

For sample words in a language (for UIs, admin, etc):

import { englishBadWords, hindiBadWords } from 'allprofanity';
console.log(englishBadWords.slice(0, 5)); // ["fuck", "shit", ...]

Security

No wordlist exposure: There is no .list() function for security and encapsulation. Use exported word arrays for samples.
TRIE-based: Scales easily to 50,000+ words.
Handles leet-speak: Catches obfuscated variants like f#ck, a55hole.

Full Example

import profanity, { ProfanitySeverity } from 'allprofanity';

// Multi-language detection
profanity.loadLanguages(['english', 'french', 'tamil']);
console.log(profanity.check('Ce mot est merde.')); // true

// Leet-speak detection
console.log(profanity.check('You a f#cking a55hole!')); // true

// Whitelisting
profanity.addToWhitelist(['anal', 'ass']);
console.log(profanity.check('He is an associate professor.')); // false

// Severity
const result = profanity.detect('This is fucking bullshit and chutiya.');
console.log(ProfanitySeverity[result.severity]); // "SEVERE"

// Custom dictionary
profanity.loadCustomDictionary('pirate', ['barnacle-head', 'landlubber']);
profanity.loadLanguage('pirate');
console.log(profanity.check('You barnacle-head!')); // true

// Placeholder configuration
profanity.setPlaceholder('#');
console.log(profanity.clean('This is bullshit.')); // "This is ########."
profanity.setPlaceholder('*'); // Reset

FAQ

Q: How do I see all loaded profanities?
A: For security, the internal word list is not exposed. Use englishBadWords etc. for samples.

Q: How do I reset the filter?
A: Use clearList() and reload languages/dictionaries.

Q: Is this safe for browser and Node.js?
A: Yes! AllProfanity is universal.

Middleware Examples

Looking for Express.js/Node.js middleware to use AllProfanity in your API or chat app?
Check the examples/ folder for ready-to-copy middleware and integration samples.

Roadmap

🚧 Multi-language context analysis (Hindi, Spanish, etc.)
🚧 Phonetic matching (sounds-like detection)
🚧 More language packs (Arabic, Russian, Japanese, etc.)
🚧 Machine learning integration for adaptive scoring
🚧 Plugin system for custom detection algorithms

License

MIT — See LICENSE

Contributing

We welcome contributions! Please see our CONTRIBUTORS.md for:

How to add your name to our contributors list
Guidelines for adding new languages
Test requirements (must include passing test screenshots in PRs)
Code of conduct and PR guidelines

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
benchmark		benchmark
bin		bin
docs		docs
examples		examples
src		src
tests		tests
.gitignore		.gitignore
.npmignore		.npmignore
CONTRIBUTORS.md		CONTRIBUTORS.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
allprofanity.config.example.json		allprofanity.config.example.json
config.json		config.json
config.schema.json		config.schema.json
jest.config.js		jest.config.js
package.json		package.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

AllProfanity

What's New in v2.2.0

Features

Performance & Speed

Accuracy & Detection

Multi-Language & Flexibility

Developer Experience

Installation

Quick Start

Algorithm Configuration

Configuration Methods

Method 1: Constructor Options (Inline)

Method 2: Config File (Recommended)

Quick Configuration Examples

1. Default (Best for General Use)

2. Large Text Processing (Documents, Articles)

3. Reduced False Positives (Social Media, Content Moderation)

4. Repeated Checks (Chat, Forms, APIs)

5. Medical/Professional Content

Performance Characteristics

API Reference & Examples

check(text: string): boolean

detect(text: string): ProfanityDetectionResult

clean(text: string, placeholder?: string): string

cleanWithPlaceholder(text: string, placeholder?: string): string

add(word: string | string[]): void

remove(word: string | string[]): void

addToWhitelist(words: string[]): void

removeFromWhitelist(words: string[]): void

setPlaceholder(placeholder: string): void

updateConfig(options: Partial<AllProfanityOptions>): void

loadLanguage(language: string): boolean

loadLanguages(languages: string[]): number

loadIndianLanguages(): number

loadCustomDictionary(name: string, words: string[]): void

getLoadedLanguages(): string[]

getAvailableLanguages(): string[]

clearList(): void

getConfig(): Partial<AllProfanityOptions>

Configuration File Structure

Full Configuration Schema

Pre-configured Templates

High Performance (Large Texts)

Reduced False Positives (Content Moderation)

Balanced (Production)

Using Config Files

Severity Levels

Language Support

Use Exported Wordlists

Security

Full Example

FAQ

Middleware Examples

Roadmap

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`check(text: string): boolean`

`detect(text: string): ProfanityDetectionResult`

`clean(text: string, placeholder?: string): string`

`cleanWithPlaceholder(text: string, placeholder?: string): string`

`add(word: string | string[]): void`

`remove(word: string | string[]): void`

`addToWhitelist(words: string[]): void`

`removeFromWhitelist(words: string[]): void`

`setPlaceholder(placeholder: string): void`

`updateConfig(options: Partial<AllProfanityOptions>): void`

`loadLanguage(language: string): boolean`

`loadLanguages(languages: string[]): number`

`loadIndianLanguages(): number`

`loadCustomDictionary(name: string, words: string[]): void`

`getLoadedLanguages(): string[]`

`getAvailableLanguages(): string[]`

`clearList(): void`

`getConfig(): Partial<AllProfanityOptions>`

Packages