FractalTextGuard

Structural Text Analysis via Long-Range Dependence Metrics

Overview

FractalTextGuard analyzes text structure using Detrended Fluctuation Analysis (DFA) and related metrics. It calculates the Hurst exponent (H) and other statistical properties that characterize long-range correlations in sequential data.

What this tool does:

Calculates Hurst exponent via DFA (orders 2-4)
Measures n-gram repetition rates
Computes compression ratio and Shannon entropy
Provides structured verdicts: HEALTHY / WARNING / CRITICAL

What this tool does NOT guarantee:

100% accurate AI detection (no tool can)
Works equally well on all text types
Replaces human judgment

⚠️ ## ⚠️ Important Warnings & Limitations

FractalTextGuard is an experimental research tool. It is not a standalone AI-text detector for high-stakes decisions (e.g., academic misconduct accusations, legal/disciplinary actions).

Short texts are unreliable: DFA/Hurst-based metrics are unstable on small samples. Treat results as low confidence unless the text is sufficiently long.
Human writing variation matters: non-native speakers and some neurodivergent/very formal writing styles can look more repetitive/structured and may increase false positives.
This method is best viewed as a signal (fractal/LRD structure), not proof. Always combine with other evidence and human review.

Installation

git clone https://github.com/muhomor2/FractalTextGuard_v3.0.2.git
cd FractalTextGuard
# Python 3.8+ required. No external dependencies.

Usage

# Single file analysis
python analyze.py --file document.txt

# Batch processing
python analyze.py --folder documents/ --output results.json

# Detailed metrics
python analyze.py --file document.txt --detailed

Output

Verdicts

Verdict	Meaning
HEALTHY	No strong structural anomalies detected
WARNING	Mixed signals; manual review recommended
CRITICAL	Strong structural anomalies (high repetition, abnormal H)

Metrics Provided

Metric	Description	Typical Range
H (Hurst exponent)	Long-range correlation measure	0.3 - 0.8
Compression ratio	zlib compressibility	0.3 - 0.7
Trigram repeat rate	3-gram repetition	0.05 - 0.30
5-gram repeat rate	5-gram repetition	0.02 - 0.20
Entropy	Shannon entropy (bits/byte)	3.5 - 5.0

Limitations

Read before use: See docs/LIMITATIONS.md and docs/VERDICT_POLICY.md

Short texts (< 500 tokens): DFA estimates become unstable
Code-heavy files: May trigger false positives
Domain-specific text: Thresholds may need calibration
Adversarial evasion: Sophisticated attacks can bypass detection
No ground truth: Verdicts are probabilistic, not deterministic

Calibration

Default thresholds are based on limited testing. For production use:

Run stress tests on your domain data:

python stress_test_rule_based.py --cases your_data/ --labels labels.json

Adjust thresholds in config.json based on results
Document your calibration methodology

Package Structure

FractalTextGuard/
├── analyze.py              # CLI entry point
├── src/
│   ├── analyzer_core.py    # Core DFA/metrics
│   └── gsl_lrd_v25.py      # Extended analyzer
├── docs/
│   ├── LIMITATIONS.md
│   └── VERDICT_POLICY.md
├── examples/               # Test files
├── tests/                  # Unit tests
├── stress_test_*.py        # Calibration tools
├── config.json             # Thresholds
├── LICENSE                 # CC BY-NC 4.0
└── CITATION.cff

Related Work

This tool is part of the LRD Time Series Analysis research community:

Community: https://zenodo.org/communities/lrd-time-series/records
Related publications: See CITATION.cff

License

Non-Commercial Use

Use	Status
Personal / Academic / Educational	✅ Free
Commercial	❌ Requires written permission

See LICENSE for full terms.

Citation

@software{chechelnitsky2026FractalTextGuard_v3.0.2,
  author = {Chechelnitsky, Igor},
  title = {FractalTextGuard: Structural Text Analysis via Long-Range Dependence},
  year = {2026},
  publisher = {Zenodo},
  version = {3.0.2},
  doi = {10.5281/zenodo.18207859},
  url = {https://github.com/IgorChechelnitsky/FractalTextGuard}
}

Author

Igor Chechelnitsky
ORCID: 0009-0007-4607-1946

Changelog

See CHANGELOG.md for version history.

v3.0.2: Standardized verdicts (HEALTHY/WARNING/CRITICAL), added limitations docs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FractalTextGuard

Overview

Installation

Usage

Output

Verdicts

Metrics Provided

Limitations

Calibration

Package Structure

Related Work

License

Citation

Author

Changelog

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
docs		docs
examples		examples
src		src
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
ZENODO_UPLOAD.md		ZENODO_UPLOAD.md
analyze.py		analyze.py
config.json		config.json
requirements.txt		requirements.txt
setup.py		setup.py
stress_test_hybrid_industrial.py		stress_test_hybrid_industrial.py
stress_test_rule_based.py		stress_test_rule_based.py

Folders and files

Latest commit

History

Repository files navigation

FractalTextGuard

Overview

Installation

Usage

Output

Verdicts

Metrics Provided

Limitations

Calibration

Package Structure

Related Work

License

Citation

Author

Changelog

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages