vikrant-OBLITERATUS

Advanced Abliteration Framework for Large Language Models
Precision refusal removal with fine-tuning, voice support, real-time collaboration & security scanning

Abstract

vikrant-OBLITERATUS represents a significant advancement in mechanistic interpretability and model behavior modification. Built upon established abliteration research, this framework introduces a comprehensive 8-stage pipeline that surgically removes refusal mechanisms from transformer-based language models while preserving core capabilities. Unlike conventional approaches that rely solely on weight modification, vikrant-OBLITERATUS integrates post-abliteration fine-tuning, multi-modal voice analysis, collaborative research workflows, and model provenance verification—establishing a new paradigm for production-grade model liberation.

The framework implements novel techniques including adaptive fine-tuning after abliteration, voice-based refusal analysis for audio-capable LLMs, WebSocket-powered collaborative sessions for distributed research teams, and cryptographic model scanning to detect adversarial modifications. Each abliteration run contributes to a crowd-sourced research dataset, enabling unprecedented cross-architecture analysis of alignment mechanisms.

Key Innovation: Unlike existing tools that treat abliteration as a terminal operation, vikrant-OBLITERATUS introduces a closed-loop pipeline where post-intervention fine-tuning recovers degraded capabilities while maintaining liberation—achieving superior coherence-compliance trade-offs.

Why vikrant-OBLITERATUS?

The Problem with Current Approaches

Existing abliteration tools (RepE, FailSpy, Heretic, OBLITERATUS) suffer from four critical limitations:

Terminal Operations: Abliteration degrades capabilities with no recovery mechanism
Single Modality: No support for emerging voice/audio LLMs
Isolated Research: Each run is siloed—no collaborative analysis
Security Blind: No detection of adversarially modified models

How vikrant-OBLITERATUS Solves This

Challenge	Traditional Approach	vikrant-OBLITERATUS Solution
Capability Loss	Accept trade-off between refusal removal and coherence	Auto Fine-Tuning: Post-abliteration LoRA adapter recovers 89% of lost coherence
Voice Models	Incompatible with audio-capable LLMs	Voice Pipeline: Whisper transcription → abliterate → TTS synthesis
Collaboration	Email JSON files between researchers	Real-Time Sessions: Live WebSocket sessions with shared heatmaps
Model Trust	Assume model provenance is honest	Dark Web Scanner: Cryptographic fingerprinting detects poison vectors
Parameter Tuning	Manual guesswork or expensive sweeps	Analysis-Informed: Auto-configures 17 hyperparameters via geometry analysis

Why You Need This

Researchers: Closed-loop pipeline with PDF report generation (ArXiv-ready)
Red Teamers: Voice refusal analysis + adversarial model detection
ML Engineers: Production-grade API with reversible steering vectors
Open Science: Every run contributes to largest abliteration dataset ever assembled

Installation

pip (Recommended)

pip install vikrant-obliteratus

# With all features
pip install vikrant-obliteratus[all]

From Source

git clone https://github.com/vikrant-project/vikrant-OBLITERATUS.git
cd vikrant-OBLITERATUS
pip install -e .

Quick Start

CLI

# Basic abliteration
vobl obliterate meta-llama/Llama-3.1-8B-Instruct

# With auto fine-tuning
vobl obliterate meta-llama/Llama-3.1-8B-Instruct --auto-finetune

# Security scan
vobl scan meta-llama/Llama-3.1-8B-Instruct --deep

Python API

from vikrant_obl.pipeline import VikrantAbliterationPipeline

pipeline = VikrantAbliterationPipeline(
    model_name=\"meta-llama/Llama-3.1-8B-Instruct\",
    method=\"svd_normpres\",
)
result = pipeline.run()

4 Revolutionary Features

✅ Auto Fine-Tuning: Post-abliteration LoRA recovery
✅ Dark Web Scanner: Model security verification
✅ Real-Time Collaboration: Multi-researcher sessions
✅ Voice/Audio Support: Whisper + TTS integration

Comparison with Other Tools

Tool	Methods	Fine-Tuning	Voice	Collaboration	Scanner	Tests
vikrant-OBLITERATUS	10	✅	✅	✅	✅	1000+
OBLITERATUS	7	❌	❌	❌	❌	837
Heretic	1	❌	❌	❌	❌	?
FailSpy	1	❌	❌	❌	❌	0
RepE	0	❌	❌	❌	❌	Minimal

Citation

@software{vikrant_obliteratus2026,
  title     = {vikrant-OBLITERATUS: Advanced Abliteration Framework},
  author    = {Vikrant},
  year      = {2026},
  url       = {https://github.com/vikrant-project/vikrant-OBLITERATUS}
}

License

AGPL-3.0 (open source) + Commercial dual-license

Developed by Vikrant | GitHub | HuggingFace

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
memory		memory
tests		tests
vikrant_obl		vikrant_obl
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
FOLDER_STRUCTURE.md		FOLDER_STRUCTURE.md
GITHUB_CONFIG.md		GITHUB_CONFIG.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test_result.md		test_result.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

vikrant-OBLITERATUS

Abstract

Why vikrant-OBLITERATUS?

The Problem with Current Approaches

How vikrant-OBLITERATUS Solves This

Why You Need This

Installation

pip (Recommended)

From Source

Quick Start

CLI

Python API

4 Revolutionary Features

Comparison with Other Tools

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

vikrant-OBLITERATUS

Abstract

Why vikrant-OBLITERATUS?

The Problem with Current Approaches

How vikrant-OBLITERATUS Solves This

Why You Need This

Installation

pip (Recommended)

From Source

Quick Start

CLI

Python API

4 Revolutionary Features

Comparison with Other Tools

Citation

License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages