Tonina is a comprehensive multi-agent bioinformatics analysis system that extends the capabilities of gffutilsAI to handle multiple types of biological data through specialized AI agents.
- GFF Agent: Complete GFF file analysis (ported from gffutilsAI)
- Sequence Agent: FASTA sequence analysis and manipulation
- Proteomics Agent: Protein structure and functional analysis
- Phylogenetics Agent: Evolutionary analysis and tree operations
- General Agent: Cross-format utilities and file operations
Tonina automatically routes your queries to the appropriate specialized agent based on:
- File types (.gff, .fasta, .pdb, etc.)
- Query content and keywords
- Domain-specific terminology
- Local Models: Ollama (llama3.1, codellama, etc.)
- Cloud Models: OpenAI GPT-4, Anthropic Claude, Google Gemini
- Flexible Configuration: Easy switching between providers
# Clone the repository
git clone <repository-url>
cd tonina
# Install dependencies
pip install -e .
# Or using uv (recommended)
uv pip install -e .- Copy the example environment file:
cp .env.example .env- Edit
.envwith your API keys:
# For cloud models (optional)
OPENAI_API_KEY=your_key_here
ANTHROPIC_API_KEY=your_key_here
GOOGLE_API_KEY=your_key_here# Interactive mode with local model
tonina --model llama3.1 --server local
# Single query
tonina --query "What genes are in chromosome 1?" --model gpt-4o
# Batch processing
tonina --batch queries.txt --model claude-3-5-sonnet-latest
# Specific provider
tonina --anthropic --model claude-3-5-sonnet-latest
tonina --gemini --model gemini-2.0-flash-exp
tonina --openai --model gpt-4otonina "Find all genes in chromosome 1 between positions 1000-5000"
tonina "Calculate feature statistics for my GFF file"
tonina "Export all membrane proteins to CSV"tonina "Analyze this FASTA file for ORFs"
tonina "Calculate GC content for all sequences"
tonina "Find motifs in this protein sequence"tonina "Extract genes from GFF, get their sequences, and analyze domains"
# This automatically coordinates: GFF Agent → Sequence Agent → Proteomics Agent- Coordinate-based queries and region analysis
- Feature statistics and length distributions
- Attribute searches and GO function analysis
- Intergenic region identification
- Comprehensive export capabilities
- FASTA file parsing and validation
- Sequence composition analysis
- ORF finding and translation
- Motif searching and pattern matching
- Multiple sequence alignment
- Protein structure analysis (PDB files)
- Domain prediction and annotation
- Functional classification
- Comparative protein analysis
- Phylogenetic tree parsing
- Evolutionary distance calculations
- Species comparison
- Ortholog identification
- Cross-format file operations
- Data integration workflows
- Format conversion utilities
- Batch processing tools
- Create agent directory:
src/tonina/agents/new_agent/ - Implement tools with
@tooldecorator - Create agent class extending
BaseAgent - Register in
AgentManager
from strands import tool
@tool
def my_analysis_tool(input_file: str, parameter: str) -> dict:
\"\"\"Tool description for AI understanding\"\"\"
# Implementation
return {"result": "analysis_output"}GPL-3.0 License - see LICENSE file for details.
- Fork the repository
- Create a feature branch
- Add tests for new functionality
- Submit a pull request
For issues and questions, please use the GitHub issue tracker.