Strix Halo local LLM guide: 65-87 t/s on Ryzen AI Max+ 395 128GB mini PCs. Benchmarks, setup, backend comparisons, and failure cases.
vLLM Qwen 3.6-27B (AWQ-INT4) + DFlash speculative decoding on AMD Strix Halo (gfx1151 iGPU, 128 GB UMA, ROCm 7.13). 24.8 t/s single-stream, vision, tool calling, 256K context, OpenAI-compatible, Docker. Matches DGX Spark FP8+DFlash+MTP at a third of the cost. No CUDA.
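Because the vLLM stack above serves an OpenAI-compatible API, any OpenAI-style client can talk to it. A minimal sketch of building a chat-completions request in Python; the endpoint URL, port, and model id are placeholder assumptions, not values taken from the repo:

```python
import json
import urllib.request

# Hypothetical values -- point these at your actual vLLM server and model.
BASE_URL = "http://localhost:8000/v1"
MODEL = "placeholder-model-id"

def build_chat_request(prompt: str) -> urllib.request.Request:
    """Build a POST request against an OpenAI-compatible /chat/completions endpoint."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 256,
    }
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_chat_request("Summarize ROCm in one sentence.")
```

The same request shape works for tool calling and vision inputs by extending the `messages` payload per the OpenAI chat schema.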
Claude Code skill for AMD Strix Halo (Ryzen AI MAX+ 395) ML setup. Handles PyTorch installation (official wheels don't work with gfx1151), GTT memory config, and environment setup. Enables 30B parameter models.
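Official PyTorch ROCm wheels do not ship gfx1151 kernels, so setups like the one above typically export an HSA target override before initializing PyTorch. A sketch of assembling that environment in Python; the 11.0.0 override value is a widely reported community workaround for Strix Halo, not an officially documented mapping:

```python
import os

def rocm_env_overrides(gfx_override: str = "11.0.0") -> dict:
    """Return a copy of the environment with the gfx1151 workaround applied.

    HSA_OVERRIDE_GFX_VERSION makes the ROCm runtime treat the iGPU as a
    supported gfx target (here gfx1100/RDNA3 kernels). The value is a
    community workaround for Strix Halo, not an official AMD mapping.
    An existing user-set value is left untouched.
    """
    env = dict(os.environ)
    env.setdefault("HSA_OVERRIDE_GFX_VERSION", gfx_override)
    return env

env = rocm_env_overrides()
```

Pass the returned dict as `env=` to `subprocess.run` when launching training or inference processes, or export the variable in the shell before `import torch`.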
Talos-O (Omni): A sovereign, embodied agentic organism forged on AMD Strix Halo. Integrating the Chimera Kernel (Linux 7.0), Zero-Copy Introspection, and the Phronesis Engine. Built from First Principles.
Docker stack: Ollama v0.21.0 built from source against ROCm 7.2.2 with native gfx1151 (Strix Halo) support — serves Gemma 4 at up to 256K context on AMD Ryzen AI MAX+ 395 / Radeon 8060S. Includes a 9-layer "make validate" ladder covering host firmware, the ROCm runtime, the container, and long-context inference.
Drop-in recipe for running faster-whisper on AMD Strix Halo (Ryzen AI Max+ 395, gfx1151) with Ubuntu 26.04 + ROCm 7.2.2 — no source build required