TONL API Documentation v2.5.2

Version: 2.5.2 · Status: Stable & Production Ready · Last Updated: 2025-12-20

This document provides detailed API documentation for the TONL TypeScript library.

🎉 What's New in v2.5.2

  • Enhanced Test Coverage: 698+ tests with comprehensive coverage for all modules
  • Browser Documentation: Complete browser API guide (see BROWSER.md)
  • Error Handling Guide: Detailed error handling documentation (see ERROR_HANDLING.md)
  • CLI Documentation: Full CLI documentation with all commands
  • Zero Breaking Changes: All existing code continues to work unchanged

🎉 What's New in v2.5.0

  • Enterprise Security: Enhanced security with centralized error messages
  • Performance Optimization: Improved caching and query performance
  • Schema Generation: TypeScript type generation from schemas
  • Compound Indexing: Multi-field index support for complex queries
  • REPL Improvements: Enhanced interactive exploration

🎉 What's New in v2.1.0

  • Buffer Size Reporting: Fixed buffer size reporting so encode-stream overflow error messages show accurate sizes
  • Test Suite Stability: Resolved incorrect test expectations for buffer overflow scenarios
  • Enhanced Error Handling: Improved error message accuracy for stream buffer overflow
  • Production Ready: Fully tested and stable release

🎉 What's New in v2.0.9

  • Version Consistency Update: Synchronized version numbers across all distribution channels
  • Documentation Alignment: Updated version references throughout project documentation
  • Website Version Sync: Aligned website with current version information
  • Zero Breaking Changes: All existing code continues to work unchanged
  • Production Ready: Release preparation with consistent versioning

🎉 What's New in v2.0.6

  • Fixed Nested Array Round-Trip: Perfect encode/decode for [[]], [[[]]], and complex nested arrays
  • Enhanced Parser Logic: Improved handling of [index][length]: format in nested contexts
  • Zero Breaking Changes: All existing code continues to work unchanged
  • Production Ready: Critical data integrity fix for nested array usage

🎉 What's New in v2.0.5

  • Dual-Mode System: Choose between perfect round-trip (quoting) and clean output (preprocessing)
  • Enhanced CLI Support: --preprocess flag for handling problematic JSON keys
  • Browser Preprocessing: preprocessJSON() function for key transformation
  • Advanced Key Quoting: Smart handling of #, @, spaces, and special characters

Table of Contents

  1. TONLDocument API (Primary Interface)
  2. Core Functions (Legacy/Lower-level)
  3. Dual-Mode System
  4. Utility Functions
  5. Streaming API
  6. Schema API
  7. Query API
  8. Modification API
  9. Navigation API
  10. Indexing API
  11. File Operations
  12. Error Handling → See ERROR_HANDLING.md
  13. Performance
  14. TypeScript Integration
  15. Browser Compatibility
  16. Optimization API
  17. Browser API → See BROWSER.md
  18. CLI Reference → See CLI.md

TONLDocument API

TONLDocument is the primary class for working with TONL data. It provides a high-level interface for querying, modifying, and navigating TONL documents.

Static Factory Methods

TONLDocument.parse(tonlText, options?)

Parse a TONL string into a document.

static parse(tonlText: string, options?: DecodeOptions): TONLDocument

Example:

const doc = TONLDocument.parse(`
#version 1.0
users[2]{id,name}:
  1, Alice
  2, Bob
`);

TONLDocument.fromJSON(data)

Create a document from JavaScript data.

static fromJSON(data: any): TONLDocument

Example:

const doc = TONLDocument.fromJSON({
  users: [
    { id: 1, name: 'Alice' },
    { id: 2, name: 'Bob' }
  ]
});

TONLDocument.fromFile(path)

Load a TONL document from a file (async).

static async fromFile(path: string): Promise<TONLDocument>

Example:

const doc = await TONLDocument.fromFile('data.tonl');

TONLDocument.fromFileSync(path)

Load a TONL document from a file (sync).

static fromFileSync(path: string): TONLDocument

Example:

const doc = TONLDocument.fromFileSync('data.tonl');

Query Methods

get(pathExpression)

Get a value at a specific path.

get(pathExpression: string): any

Examples:

doc.get('user.name')           // 'Alice'
doc.get('users[0]')            // { id: 1, name: 'Alice' }
doc.get('users[-1]')           // Last user

query(pathExpression)

Query the document with advanced expressions.

query(pathExpression: string): any

Examples:

doc.query('users[*].name')                    // ['Alice', 'Bob']
doc.query('users[?(@.role == "admin")]')      // Filter users
doc.query('$..email')                         // All emails recursively

exists(pathExpression)

Check if a path exists.

exists(pathExpression: string): boolean

typeOf(pathExpression)

Get the type of value at a path.

typeOf(pathExpression: string): string | undefined

Returns: 'string' | 'number' | 'boolean' | 'null' | 'array' | 'object' | undefined
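
Example (illustrative paths; results depend on the document's contents):

doc.exists('users[0].name')    // true if the path is present
doc.exists('users[5]')         // false for a missing path

doc.typeOf('users')            // 'array'
doc.typeOf('users[0].name')    // 'string'
doc.typeOf('missing.path')     // undefined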


Modification Methods

set(path, value)

Set a value at a path (creates intermediate objects/arrays).

set(path: string, value: any): TONLDocument

Example:

doc
  .set('user.name', 'Alice')
  .set('user.age', 30)
  .set('user.verified', true);

delete(path)

Delete a value at a path.

delete(path: string): TONLDocument

push(path, ...items)

Push items to an array.

push(path: string, ...items: any[]): number

Returns: New array length


pop(path)

Remove and return the last item from an array.

pop(path: string): any

merge(path, object)

Shallow merge an object at a path.

merge(path: string, object: object): TONLDocument
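
Example (illustrative; assumes the document contains a users array and a user object):

doc.delete('user.temp');
doc.push('users', { id: 3, name: 'Carol' });   // returns the new array length
const last = doc.pop('users');                 // removes and returns the last item
doc.merge('user', { verified: true, age: 30 });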

Navigation Methods

entries()

Iterate over [key, value] pairs at root level.

*entries(): Generator<[string, any]>

Example:

for (const [key, value] of doc.entries()) {
  console.log(`${key}: ${value}`);
}

keys() / values()

Iterate over keys or values at root level.

*keys(): Generator<string>
*values(): Generator<any>

deepEntries() / deepKeys() / deepValues()

Recursively iterate over all [path, value] pairs, paths, or values.

*deepEntries(): Generator<[string, any]>
*deepKeys(): Generator<string>
*deepValues(): Generator<any>
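
Example (illustrative output):

for (const key of doc.keys()) {
  console.log(key);                   // top-level keys, e.g. 'users'
}

for (const [path, value] of doc.deepEntries()) {
  console.log(path, value);           // e.g. 'users[0].name' and 'Alice'
}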

walk(callback, options?)

Walk the document tree with a callback.

walk(callback: WalkCallback, options?: WalkOptions): void

Example:

doc.walk((path, value, depth) => {
  console.log(`[Depth ${depth}] ${path}: ${value}`);
});

find(predicate) / findAll(predicate)

Find values matching a predicate.

find(predicate: (value: any, path: string) => boolean): any
findAll(predicate: (value: any, path: string) => boolean): any[]

some(predicate) / every(predicate)

Check if any/all values match a predicate.

some(predicate: (value: any, path: string) => boolean): boolean
every(predicate: (value: any, path: string) => boolean): boolean

countNodes()

Count total nodes in the document.

countNodes(): number
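
Example (illustrative predicates):

const firstAdmin = doc.find((value, path) => value?.role === 'admin');
const allAdmins = doc.findAll((value, path) => value?.role === 'admin');

doc.some(value => value === null);        // true if any value is null
doc.every(value => value !== undefined);  // true if no value is undefined
doc.countNodes();                         // total node count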

Change Tracking Methods

snapshot()

Create an independent copy of the document.

snapshot(): TONLDocument

diff(other)

Compare with another document and generate a diff.

diff(other: TONLDocument): DiffResult

Returns:

interface DiffResult {
  changes: DiffEntry[];
  summary: {
    added: number;
    modified: number;
    deleted: number;
    total: number;
  };
}

diffString(other)

Generate a human-readable diff string.

diffString(other: TONLDocument): string
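
Example (illustrative; the summary values depend on the changes made):

const before = doc.snapshot();
doc.set('user.name', 'Bob');

const result = doc.diff(before);
console.log(result.summary);          // e.g. { added: 0, modified: 1, deleted: 0, total: 1 }
console.log(doc.diffString(before));  // human-readable change listing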

Indexing Methods

createIndex(options)

Create an index for fast lookups.

createIndex(options: IndexOptions): void

interface IndexOptions {
  name: string;
  fields: string | string[];
  type?: 'hash' | 'btree' | 'compound';
  unique?: boolean;
}

Example:

// Hash index (O(1) lookups)
doc.createIndex({
  name: 'userById',
  fields: 'id',
  type: 'hash',
  unique: true
});

// BTree index (O(log n) range queries)
doc.createIndex({
  name: 'userByAge',
  fields: 'age',
  type: 'btree'
});

// Compound index (multiple fields)
doc.createIndex({
  name: 'userByNameAndAge',
  fields: ['name', 'age'],
  type: 'compound'
});

getIndex(name)

Get an existing index.

getIndex(name: string): IIndex | undefined

Example:

const idx = doc.getIndex('userById');
const paths = idx.find(123);  // O(1) lookup

listIndices()

List all index names.

listIndices(): string[]

dropIndex(name)

Remove an index.

dropIndex(name: string): void
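
Example:

doc.listIndices();          // e.g. ['userById', 'userByAge']
doc.dropIndex('userByAge');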

Export Methods

toJSON()

Export to JavaScript object.

toJSON(): any

toTONL(options?)

Export to TONL string.

toTONL(options?: EncodeOptions): string

save(path, options?)

Save to file (async).

async save(path: string, options?: EncodeOptions): Promise<void>

saveSync(path, options?)

Save to file (sync).

saveSync(path: string, options?: EncodeOptions): void
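
Example (illustrative file names):

const obj = doc.toJSON();
const text = doc.toTONL({ includeTypes: true });

await doc.save('data.tonl');
doc.saveSync('backup.tonl', { delimiter: '|' });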

Metadata Methods

stats()

Get document statistics.

stats(): DocumentStats

interface DocumentStats {
  sizeBytes: number;
  nodeCount: number;
  maxDepth: number;
  arrayCount: number;
  objectCount: number;
  primitiveCount: number;
}
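
Example (values depend on the document):

const s = doc.stats();
console.log(`${s.nodeCount} nodes, max depth ${s.maxDepth}, ${s.sizeBytes} bytes`);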

Core Functions

Lower-level encode/decode functions for direct use.

encodeTONL(input, options?)

Encodes JavaScript/TypeScript data to a TONL-format string.

function encodeTONL(input: any, options?: EncodeOptions): string

interface EncodeOptions {
  delimiter?: "," | "|" | "\t" | ";";    // Field delimiter (default: ",")
  includeTypes?: boolean;                // Add type hints (default: false)
  version?: string;                      // TONL version (default: "1.0")
  indent?: number;                       // Spaces per level (default: 2)
  singleLinePrimitiveLists?: boolean;    // Single line for primitives (default: true)
}

Example:

import { encodeTONL } from 'tonl';

const data = {
  users: [
    { id: 1, name: "Alice", active: true },
    { id: 2, name: "Bob", active: false }
  ]
};

const tonl = encodeTONL(data, {
  delimiter: "|",
  includeTypes: true,
  indent: 4
});

decodeTONL(text, options?)

Decodes a TONL-format string back to JavaScript objects.

function decodeTONL(text: string, options?: DecodeOptions): any

interface DecodeOptions {
  delimiter?: "," | "|" | "\t" | ";";    // Field delimiter (auto-detected)
  strict?: boolean;                      // Strict mode validation (default: false)
}

Example:

import { decodeTONL } from 'tonl';

const tonlText = `#version 1.0
users[2]{id:u32,name:str,active:bool}:
  1, Alice, true
  2, Bob, false`;

const data = decodeTONL(tonlText);
// { users: [{ id: 1, name: "Alice", active: true }, ...] }

encodeSmart(input, options?)

Automatically chooses optimal encoding settings based on data analysis.

function encodeSmart(input: any, options?: EncodeOptions): string

Smart Optimization:

  1. Delimiter selection to minimize quoting
  2. Layout optimization for compactness
  3. Type hint optimization

Example:

import { encodeSmart } from 'tonl';

const data = {
  items: [
    { name: "Item A", category: "Tools, Hardware" },
    { name: "Item B", category: "Electronics" }
  ]
};

// Smart encoding will use "|" delimiter to avoid quoting commas
const optimized = encodeSmart(data);

Dual-Mode System v2.0.6 ⭐ UPDATED

The dual-mode system provides two approaches for handling problematic JSON keys:

Mode 1: Default (Quoting Only)

  • Perfect Round-trip: Data integrity guaranteed
  • Smart Quoting: Automatically quotes problematic keys
  • Special Characters: Handles #, @, spaces, empty keys, etc.

Mode 2: Preprocessing (Key Transformation)

  • Clean Output: Transforms problematic keys to safe identifiers
  • Enhanced Readability: Better for LLM prompts and data analysis
  • Automatic Mapping: Handles key transformation transparently

Browser Preprocessing Function

preprocessJSON(input, options?)

Preprocess JSON data to clean up problematic keys.

function preprocessJSON(
  input: string | object,
  options?: PreprocessOptions
): string | object

interface PreprocessOptions {
  renameEmptyKeys?: boolean;    // Rename empty string keys (default: true)
  renameSpecialChars?: boolean; // Rename keys with special chars (default: true)
  renameSpaces?: boolean;       // Rename keys with spaces (default: true)
  renameReserved?: boolean;     // Rename reserved keywords (default: true)
}

Examples:

import { preprocessJSON, encodeTONL } from 'tonl/browser';

const problematicJSON = `{
  "#": "hash-key",
  "": "empty-key",
  "key with spaces": "spaced-key",
  "@type": "at-symbol-key"
}`;

// Preprocess for clean TONL output
const preprocessed = preprocessJSON(problematicJSON);
console.log(preprocessed);
// {
//   "comment": "hash-key",
//   "empty": "empty-key",
//   "key_with_spaces": "spaced-key",
//   "type": "at-symbol-key"
// }

// Encode to clean TONL
const tonl = encodeTONL(JSON.parse(preprocessed));
console.log(tonl);
// comment[1]:
//   "hash-key"
// empty[1]:
//   "empty-key"
// key_with_spaces[1]:
//   spaced-key
// type[1]:
//   "at-symbol-key"

Node.js Key Transformation

transformObjectKeys(obj, transformer)

Transform object keys using a custom function.

function transformObjectKeys(
  obj: any,
  transformer: (key: string, path: string) => string
): any

Example:

import { transformObjectKeys } from 'tonl';

const data = {
  "#": "hash-value",
  "": "empty-value",
  "user name": "Alice"
};

// Custom transformation
const transformed = transformObjectKeys(data, (key, path) => {
  if (key === '#') return 'comment';
  if (key === '') return 'empty';
  if (key.includes(' ')) return key.replace(/ /g, '_');
  return key;
});

console.log(transformed);
// {
//   "comment": "hash-value",
//   "empty": "empty-value",
//   "user_name": "Alice"
// }

CLI Integration

The CLI automatically supports preprocessing through the --preprocess flag:

# Default mode (perfect round-trip)
tonl encode messy-data.json

# Preprocessing mode (clean output)
tonl encode messy-data.json --preprocess

When to Use Each Mode

Default Mode (Quoting)

  • Configuration files
  • API responses
  • Database exports
  • When exact round-trip is critical
  • Production data pipelines

Preprocessing Mode

  • Data analysis and exploration
  • LLM prompts and training data
  • Temporary files and scripts
  • When readability is priority
  • Development and debugging

Advanced Key Quoting

The encoding system automatically detects and quotes problematic keys:

import { encodeTONL } from 'tonl';

const data = {
  "": "empty-key",
  "#": "hash-key",
  "@type": "at-key",
  "key with spaces": "spaced-key",
  "key:with:colons": "colon-key",
  "key{braces}": "brace-key"
};

const tonl = encodeTONL(data);
console.log(tonl);
// ""[1]:
//   "empty-key"
// "#"[1]:
//   "hash-key"
// "@type"[1]:
//   "at-key"
// "key with spaces"[1]:
//   "spaced-key"
// "key:with:colons"[1]:
//   "colon-key"
// "key{braces}"[1]:
//   "brace-key"

Characters That Trigger Quoting:

  • Empty strings ""
  • Hash #
  • At symbol @
  • Colon :
  • Comma ,
  • Braces {}
  • Quotes "
  • Leading/trailing spaces
  • Tab characters
  • Newline characters

Utility Functions

parseTONLLine(line, delimiter)

Parses a single TONL line into an array of field values.

function parseTONLLine(line: string, delimiter: TONLDelimiter): string[]
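
Example (illustrative; exact whitespace handling follows the parser):

parseTONLLine('1, Alice, true', ',');
// field strings, e.g. ['1', 'Alice', 'true']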

inferPrimitiveType(value)

Infers the primitive type of a value for type hint generation.

function inferPrimitiveType(value: unknown): TONLTypeHint

type TONLTypeHint = "u32" | "i32" | "f64" | "bool" | "null" | "str" | "obj" | "list"

Examples:

inferPrimitiveType(42);        // "u32"
inferPrimitiveType(-10);       // "i32"
inferPrimitiveType(3.14);      // "f64"
inferPrimitiveType(true);      // "bool"
inferPrimitiveType(null);      // "null"
inferPrimitiveType("hello");   // "str"
inferPrimitiveType([1,2,3]);   // "list"
inferPrimitiveType({a: 1});    // "obj"

isUniformObjectArray(arr)

Check if an array contains uniform objects.

function isUniformObjectArray(arr: any[]): boolean

getUniformColumns(arr)

Get stable column order for uniform object array.

function getUniformColumns(arr: any[]): string[]
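
Example (illustrative; results assume the behavior described above):

const rows = [
  { id: 1, name: 'Alice' },
  { id: 2, name: 'Bob' }
];

isUniformObjectArray(rows);   // true: every element has the same keys
getUniformColumns(rows);      // ['id', 'name']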

Streaming API (v0.7.5+)

For handling large datasets efficiently.

streamQuery(filePath, pathExpression, options?)

Stream query results from a file.

async function* streamQuery(
  filePath: string,
  pathExpression: string,
  options?: StreamQueryOptions
): AsyncGenerator<any>

interface StreamQueryOptions {
  filter?: (value: any) => boolean;
  limit?: number;
  skip?: number;
  highWaterMark?: number;
}

Example:

import { streamQuery } from 'tonl';

// Process 10GB file with constant memory
for await (const record of streamQuery('huge-data.tonl', 'records[*]', {
  filter: r => r.active,
  limit: 1000
})) {
  handleRecord(record);  // placeholder for your own processing logic
}

streamAggregate(filePath, pathExpression, reducer, initialValue)

Aggregate data from a stream.

async function streamAggregate<T, R>(
  filePath: string,
  pathExpression: string,
  reducer: (accumulator: R, value: T) => R,
  initialValue: R
): Promise<R>

Example:

const total = await streamAggregate(
  'sales.tonl',
  'sales[*].amount',
  (sum, amount) => sum + amount,
  0
);

StreamPipeline

Chainable stream transformations.

import { StreamPipeline } from 'tonl';

const pipeline = new StreamPipeline('data.tonl')
  .filter(item => item.active)
  .map(item => ({ ...item, processed: true }))
  .limit(100);

for await (const item of pipeline) {
  console.log(item);
}

Schema API (v0.8.0+)

For data validation and type generation.

parseSchema(schemaText)

Parse TONL Schema Language (TSL) into schema object.

function parseSchema(schemaText: string): TONLSchema

Example:

import { parseSchema } from 'tonl/schema';

const schemaText = `
@schema v1
User: obj
  id: u32 required
  name: str required min:2 max:100
  email: str required pattern:email
`;

const schema = parseSchema(schemaText);

validateTONL(data, schema, options?)

Validate data against a schema.

function validateTONL(
  data: any,
  schema: TONLSchema,
  options?: { strict?: boolean }
): ValidationResult

interface ValidationResult {
  valid: boolean;
  errors: ValidationError[];
}

Example:

import { parseSchema, validateTONL } from 'tonl/schema';

const schema = parseSchema(schemaText);
const data = { id: 123, name: 'Alice', email: 'alice@example.com' };

const result = validateTONL(data, schema);
if (!result.valid) {
  result.errors.forEach(err => console.error(err.message));
}

generateTypeScript(schema, options?)

Generate TypeScript type definitions from schema.

function generateTypeScript(
  schema: TONLSchema,
  options?: GenerateOptions
): string

interface GenerateOptions {
  exportAll?: boolean;
  readonly?: boolean;
  strict?: boolean;
}
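
Example (a minimal sketch; the exact generated output depends on the schema and options):

import { parseSchema, generateTypeScript } from 'tonl/schema';

const schema = parseSchema(schemaText);
const types = generateTypeScript(schema, { exportAll: true });
// types contains TypeScript definitions generated from the schema,
// e.g. an exported User interface for the User schema shown above
console.log(types);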

Query API (v0.6.0+)

See QUERY_API.md for detailed query syntax and examples.

Path Syntax:

  • Property access: user.name
  • Array indexing: users[0], users[-1]
  • Wildcards: users[*].name, data.*
  • Recursive descent: $..email
  • Array slicing: users[0:5], users[::2]
  • Filters: users[?(@.age > 18)]

Operators:

  • Comparison: ==, !=, >, <, >=, <=
  • Logical: &&, ||, !
  • String: contains, startsWith, endsWith, matches
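
Example (illustrative; assumes a users array with age and active fields):

doc.query('users[0:5]')                                 // slice: first five users
doc.query('users[?(@.age > 18 && @.active == true)]')   // filter with a logical operator
doc.query('$..email')                                   // recursive descent: all emails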

Modification API (v0.6.5+)

See MODIFICATION_API.md for detailed modification examples.

Operations:

  • set(path, value) - Create/update values
  • delete(path) - Remove values
  • push(path, ...items) - Add to arrays
  • pop(path) - Remove from arrays
  • merge(path, object) - Merge objects

Change Tracking:

  • snapshot() - Create backups
  • diff(other) - Generate diffs
  • diffString(other) - Human-readable diffs

Navigation API (v0.6.0+)

See NAVIGATION_API.md for detailed navigation examples.

Iterators:

  • entries(), keys(), values() - Root level
  • deepEntries(), deepKeys(), deepValues() - Recursive
  • walk(callback, options?) - Tree walking

Search:

  • find(predicate) - First match
  • findAll(predicate) - All matches
  • some(predicate), every(predicate) - Predicates

Indexing API (v0.7.0+)

Index Types:

  • Hash Index: O(1) exact matches
  • BTree Index: O(log n) range queries
  • Compound Index: Multi-field indexing

Operations:

  • createIndex(options) - Create index
  • getIndex(name) - Retrieve index
  • listIndices() - List all indices
  • dropIndex(name) - Remove index

File Operations

FileEditor

Atomic file editing with automatic backups.

import { FileEditor } from 'tonl';

// Open file (creates backup)
const editor = await FileEditor.open('config.tonl', {
  backup: true,
  backupSuffix: '.bak'
});

// Modify data
editor.data.app.version = '2.0.0';

// Check if modified
if (editor.isModified()) {
  // Save atomically (temp file + rename)
  await editor.save();
}

// Restore from backup if needed
await editor.restoreBackup();

Error Handling

Error Classes

TONLError - Base error class

class TONLError extends Error {
  line?: number;
  column?: number;
  source?: string;
}

TONLParseError - Syntax errors

class TONLParseError extends TONLError {
  suggestion?: string;
}

TONLValidationError - Schema validation errors

class TONLValidationError extends TONLError {
  field: string;
  expected?: string;
  actual?: string;
}

TONLTypeError - Type mismatch errors

class TONLTypeError extends TONLError {
  expected: string;
  actual: string;
}

Example

try {
  const doc = TONLDocument.parse('invalid syntax');
} catch (error) {
  if (error instanceof TONLParseError) {
    console.error(`Parse error at line ${error.line}: ${error.message}`);
    if (error.suggestion) {
      console.log(`Suggestion: ${error.suggestion}`);
    }
  }
}

Performance Considerations

Encoding Performance

  • Linear time O(n) where n = data size
  • Memory efficient with array joins
  • Type inference is cached

Decoding Performance

  • Single-pass parsing
  • Efficient state machine
  • Lazy type coercion

Query Performance

  • Simple path access: <0.1ms
  • Wildcard queries (1000 nodes): <20ms
  • Filter queries (1000 nodes): <50ms
  • With indices: O(1) for hash, O(log n) for btree

Optimization Tips

  1. Use encodeSmart() for automatic optimization
  2. Create indices for repeated lookups
  3. Use streaming for large files (>100MB)
  4. Enable strict mode only when needed
  5. Batch modifications before saving

TypeScript Integration

Type Safety

interface User {
  id: number;
  name: string;
  role: string;
}

const doc = TONLDocument.fromJSON({
  users: [] as User[]
});

// Type-safe queries (with assertion)
const users = doc.query('users[*]') as User[];

Generic Helpers

function loadTyped<T>(filePath: string): T {
  const doc = TONLDocument.fromFileSync(filePath);
  return doc.toJSON() as T;
}

// Usage
interface Config {
  database: {
    host: string;
    port: number;
  };
}

const config = loadTyped<Config>('config.tonl');

Browser Compatibility

TONL works in all modern browsers and Node.js environments.

ES Module

<script type="module">
  import { TONLDocument, encodeTONL, decodeTONL } from 'https://cdn.skypack.dev/tonl';

  const doc = TONLDocument.fromJSON({ hello: 'world' });
  const tonl = doc.toTONL();
</script>

CommonJS (Node.js)

const { TONLDocument, encodeTONL, decodeTONL } = require('tonl');

Bundle Size

  • Core: 8.84 KB gzipped
  • Full: ~25 KB gzipped (with all features)
  • Tree-shakeable: Import only what you need

Optimization API v2.0.0 ⭐ NEW

The Optimization API provides advanced token and byte compression strategies for TONL documents. This is the most powerful feature in v2.0.0, offering up to 60% additional savings beyond standard TONL compression.

Overview

The optimization system includes 10 different strategies that can be applied individually or automatically:

  1. Dictionary Encoding - Compress repetitive values
  2. Column Reordering - Optimize field order for compression
  3. Numeric Quantization - Reduce decimal precision safely
  4. Delta Encoding - Compress sequential numeric data
  5. Run-Length Encoding (RLE) - Compress repeated patterns
  6. Bit Packing - Optimized binary encoding for booleans/flags
  7. Schema Inheritance - Reuse type definitions
  8. Hierarchical Grouping - Structure-based optimization
  9. Tokenizer Awareness - LLM-specific optimization
  10. Adaptive Optimization - Multi-strategy automatic optimization

AdaptiveOptimizer (Recommended)

The AdaptiveOptimizer automatically analyzes your data and selects the best combination of optimization strategies.

import { AdaptiveOptimizer } from 'tonl';

const optimizer = new AdaptiveOptimizer();
const data = [
  { id: 1, name: "Alice", department: "Engineering", salary: 75000 },
  { id: 2, name: "Bob", department: "Engineering", salary: 80000 },
  { id: 3, name: "Carol", department: "Marketing", salary: 65000 }
];

// Analyze data for optimization opportunities
const analysis = optimizer.analyzeDataset(data);
console.log('Recommended strategies:', analysis.recommendedStrategies);
console.log('Estimated savings:', analysis.estimatedSavings + '%');

// Apply automatic optimization
const result = optimizer.optimize(data);
console.log('Optimized data:', result.optimizedData);
console.log('Directives:', result.directives);

// Example output:
// Directives: [
//   '@dict department: {0:Engineering,1:Marketing}',
//   '@delta salary',
//   '@map name: {A:Alice,B:Bob,C:Carol}'
// ]

Individual Optimizers

DictionaryBuilder

Compress repetitive values by creating lookup dictionaries:

import { DictionaryBuilder } from 'tonl';

const dictBuilder = new DictionaryBuilder();
const values = ["Engineering", "Marketing", "Engineering", "Sales"];

const dictionary = dictBuilder.analyzeDictionaryCandidates(values, 'department');
if (dictionary) {
  console.log('Savings:', dictionary.totalSavings, 'bytes');
  console.log('Encoding strategy:', dictionary.encoding);

  // Generate TONL directive
  const directive = dictBuilder.generateDictionaryDirective(dictionary);
  console.log('Directive:', directive); // @dict department: {0:Engineering,1:Marketing,2:Sales}

  // Encode values
  const encoded = dictBuilder.encodeWithDictionary(values, dictionary);
  console.log('Encoded:', encoded); // [0, 1, 0, 2]
}

DeltaEncoder

Compress sequential numeric data using delta encoding:

import { DeltaEncoder } from 'tonl';

const delta = new DeltaEncoder();
const timestamps = [1704067200000, 1704067201000, 1704067202000];

// Analyze sequence
const analysis = delta.analyzeSequence(timestamps);
console.log('Recommended:', analysis.recommended);
console.log('Compression ratio:', analysis.compressionRatio);

// Encode sequence
const encoded = delta.encode(timestamps, 'timestamp');
console.log('Delta encoded:', encoded); // [1704067200000, 1000, 1000]

// Generate directive
const directive = delta.generateDirective('timestamp');
console.log('Directive:', directive); // @delta timestamp

BitPacker

Compress boolean values and small integers using bit packing:

import { BitPacker } from 'tonl';

const packer = new BitPacker();
const flags = [true, false, true, true, false];

// Analyze packing potential
const analysis = packer.analyzeBitPacking(flags);
console.log('Recommended:', analysis.recommended);
console.log('Bit savings:', analysis.bitSavings);

// Pack values
const packed = packer.packBooleans(flags);
console.log('Packed:', packed); // Bit-packed binary representation

// Generate directive
const directive = packer.generateDirective('flags');
console.log('Directive:', directive); // @bitpack flags:bool

RunLengthEncoder (RLE)

Compress repeated consecutive values:

import { RunLengthEncoder } from 'tonl';

const rle = new RunLengthEncoder();
const values = ["A", "A", "A", "B", "B", "C"];

// Analyze RLE potential
const analysis = rle.analyzeSequence(values);
console.log('Recommended:', analysis.recommended);
console.log('Compression ratio:', analysis.compressionRatio);

// Encode sequence
const encoded = rle.encode(values);
console.log('RLE encoded:', encoded); // [{value: "A", count: 3}, {value: "B", count: 2}, {value: "C", count: 1}]

ColumnReorderer

Optimize column order for better compression:

import { ColumnReorderer } from 'tonl';

const reorderer = new ColumnReorderer();
const data = [
  { name: "Alice", id: 1, department: "Engineering" },
  { name: "Bob", id: 2, department: "Engineering" }
];

// Analyze reordering potential
const shouldReorder = reorderer.shouldReorder(data, ['name', 'id', 'department']);
if (shouldReorder) {
  const result = reorderer.reorderColumns(data, ['name', 'id', 'department']);
  console.log('New column order:', result.reorderedColumns);
  console.log('Mapping:', result.mapping);

  // Generate directive
  const directive = reorderer.generateMappingDirective(result.mapping);
  console.log('Directive:', directive); // @map {0:id,1:name,2:department}
}

Integration with TONLDocument

Optimization integrates seamlessly with TONLDocument:

import { TONLDocument, AdaptiveOptimizer } from 'tonl';

const doc = TONLDocument.fromJSON({
  users: [
    { id: 1, name: "Alice", role: "admin", active: true },
    { id: 2, name: "Bob", role: "user", active: false }
  ]
});

// Optimize the document
const optimizer = new AdaptiveOptimizer();
const userData = doc.get('users');
const optimization = optimizer.optimize(userData);

// Create new document with optimizations
const optimizedDoc = TONLDocument.fromJSON({
  users: optimization.optimizedData
});

// Export with optimization directives
const tonlWithOptimizations = optimizedDoc.toTONL();
console.log(tonlWithOptimizations);

// Output includes directives like:
// @dict role: {0:admin,1:user}
// @bitpack active:bool

Performance Impact

  • Additional Savings: 15-60% beyond standard TONL compression
  • Processing Time: O(n) linear time, typically <10ms for 10K records
  • Memory Usage: Minimal overhead, optimized for streaming
  • Decoding: Full round-trip compatibility with all optimizers

Best Practices

  1. Use AdaptiveOptimizer for automatic optimization selection
  2. Apply to large datasets (>100 records) for maximum benefit
  3. Combine with Smart Encoding for best results
  4. Profile your data first to identify optimization opportunities
  5. Consider decode cost vs compression benefit for real-time applications

Version

Current version: 2.5.2

  • ✅ Production ready and stable
  • ✅ Full feature set (query, modify, index, stream, schema, optimize)
  • ✅ Comprehensive test coverage (698+ tests)
  • ✅ Zero runtime dependencies
  • ✅ TypeScript-first with full type safety
  • ✅ Browser and Node.js support
  • 🆕 Advanced optimization system with 10 strategies
  • 🆕 Dual-mode system for handling problematic JSON keys
  • 🆕 Enhanced CLI with preprocessing support
  • 🆕 Advanced key quoting for special characters

See Also

  • Browser API: See BROWSER.md
  • CLI Reference: See CLI.md
  • Error Handling: See ERROR_HANDLING.md
  • Query Syntax: See QUERY_API.md
  • Modification API: See MODIFICATION_API.md
  • Navigation API: See NAVIGATION_API.md

Happy coding with TONL! 🚀