llm-cascade

Here are 4 public repositories matching this topic...

n24q02m / tacet

A self-distilling neuro-symbolic cascade that amortises LLM cost across knowledge-graph QA and regulatory-compliance checking, with auditable Datalog proof trees.

datalog knowledge-graph compliance proof-trees kgqa neuro-symbolic llm auditable-ai llm-cascade rule-distillation cost-amortization

Updated Jun 18, 2026
Python

A verifier-anchored self-distilling neuro-symbolic cascade: a cheap, teacher-independent correctness verifier gates and corrects a frontier LLM before its answers distil into auditable, type-checked Datalog rules with proof trees.

datalog knowledge-distillation proof-trees verifier neuro-symbolic llm auditable-ai llm-cascade cost-amortization

Updated Jun 18, 2026
Python

aws-samples / sample-nvidia-nemotron-cascade-workshop

Star

Sample code for two-tier LLM inference (cascading) on Amazon Bedrock, using a single Bedrock Converse API surface. NVIDIA Nemotron Nano handles routine support-ticket classification on every request; Anthropic Claude Sonnet handles cases the routing logic flags as harder. Includes a bake-off harness, sample data, and tests. Next.js + TypeScript.

typescript workshop nextjs mit-0 anthropic-claude amazon-bedrock aws-samples nvidia-nemotron model-routing llm-cost-optimization llm-cascade bedrock-converse-api two-tier-inference

Updated Jun 19, 2026
TypeScript

nabindev3 / llm-sae-difficulty

Star

A leakage-controlled, paired-bootstrap-rigorous study of whether an LLM's internal TopK Sparse Autoencoder (SAE) feature spaces encode a difficulty-predictive routing signal. Benchmarked on HellaSwag (rigorous predictive null) and SQuAD continuous perplexity (positive Pareto cascade), with Platt recalibration and causal ablatings.

calibration pythia interpretability sparse-autoencoder selective-prediction llm-cascade

Updated Jun 19, 2026
Python

Improve this page

Add a description, image, and links to the llm-cascade topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-cascade topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly