Local AI music generator with smart lyrics: Gradio web UI for HeartMuLa + Ollama/OpenAI, tags, history, and high-fidelity audio.

strnad/HeartMuse


🎵 HeartMuse - AI Music Generator with Smart Lyrics

HeartMuse is an intuitive web-based interface for creating high-quality AI-generated music entirely on your own machine. It combines HeartMuLa, a state-of-the-art open-source music generation model built for local inference, with intelligent lyrics generation from local LLMs, giving you complete creative control without relying on cloud services.

✨ What Makes HeartMuse Special?

While HeartMuLa provides state-of-the-art music generation capabilities, HeartMuse extends it with:

  • 🎨 User-Friendly Web Interface - No command-line expertise needed
  • πŸ“ Smart Lyrics Generation - Leverages local Ollama models or OpenAI API to automatically generate coherent, themed lyrics from simple descriptions
  • 🏷️ Intelligent Tagging - Automatically generates appropriate music style tags
  • πŸ’Ύ Complete Privacy - Run 100% locally with Ollama (no data leaves your machine)
  • πŸ“š Generation History - Browse, replay, and manage all your previous creations
  • βš™οΈ Flexible Configuration - Easy-to-use controls for fine-tuning generation parameters

🎯 Features

Smart Text Generation

  • Describe Your Vision: Simply write what kind of song you want (e.g., "upbeat pop song about summer adventures")
  • Automatic Lyrics: AI generates full lyrics matching your description and chosen theme
  • Song Titles: Creative, relevant titles generated automatically
  • Style Tags: Intelligent tagging system for music genre, mood, and instrumentation

Powerful Music Generation

  • HeartMuLa 3B Model: State-of-the-art open-source model for local music generation (3 billion parameters, RL-trained)
  • High-Fidelity Audio: Uses HeartCodec for superior audio quality
  • Customizable Parameters: Control temperature, CFG scale, Top-K sampling, and duration
  • GPU Acceleration: CUDA support with efficient memory management
  • Memory Efficient: Lazy loading reduces VRAM usage, enabling generation on GPUs with limited memory

Dual LLM Backend Support

  • Ollama (Recommended): Run completely locally with models like glm-4.7-flash, llama3, mistral, etc.
  • OpenAI API: Use GPT-4o, GPT-4o-mini, or other OpenAI models for lyrics generation

Seamless Workflow

  1. Enter a song description
  2. Let AI generate lyrics, title, and tags (or write your own)
  3. Click "Generate Music" and get professional-quality audio
  4. Browse your creation history anytime

🚀 Quick Start

Prerequisites

  • Git - For cloning repositories and submodules
  • Python 3.10 - 3.12 (3.10 is recommended by the HeartMuLa authors; versions newer than 3.12 may not work)
  • NVIDIA GPU with CUDA 12.4+ (12 GB VRAM recommended, 8 GB minimum for the HeartMuLa-3B model)
  • Ollama (optional, for local lyrics generation) - Download Ollama

Installation

Linux / macOS:

git clone https://github.com/yourusername/heartmuse.git
cd heartmuse
./install.sh

Windows:

git clone https://github.com/yourusername/heartmuse.git
cd heartmuse
install.bat

The installer will:

  • Create a Python virtual environment
  • Clone the HeartMuLa library
  • Install all dependencies
  • Prepare your system for music generation

Running HeartMuse

Linux / macOS:

./run.sh

Windows:

run.bat

Open your browser to http://localhost:7860 and start creating!

βš™οΈ Configuration

Copy .env.example to .env and customize. See the file for available options and their descriptions.
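A minimal `.env` sketch using only the variables mentioned in this README; the values shown here are illustrative, not the project's actual defaults, so check `.env.example` for the authoritative list:

```env
# LLM backend for lyrics: "Ollama" (local) or "OpenAI" (cloud)
LLM_BACKEND=Ollama

# Where your local Ollama server listens (Ollama's standard port is 11434)
OLLAMA_URL=http://localhost:11434

# Only needed when LLM_BACKEND=OpenAI
OPENAI_API_KEY=sk-...

# Cap generated clip length in seconds; lower this on low-VRAM GPUs
MUSIC_MAX_LENGTH_SEC=120
```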

Using Ollama (100% Local)

  1. Install Ollama from ollama.ai
  2. Download a model: ollama pull glm-4.7-flash (or llama3, mistral, etc.)
  3. Make sure Ollama is running: ollama serve
  4. Set LLM_BACKEND=Ollama in your .env

Using OpenAI API

  1. Get your API key from platform.openai.com
  2. Set OPENAI_API_KEY in your .env
  3. Set LLM_BACKEND=OpenAI
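The backend switch can be pictured with a small sketch. This is not HeartMuse's actual code: the helper name and model choices are ours, but the payload shapes match Ollama's public `/api/generate` endpoint and OpenAI's chat completions API:

```python
import json
import os

def build_lyrics_request(description: str) -> dict:
    """Hypothetical helper: build the HTTP request for whichever
    lyrics backend LLM_BACKEND selects."""
    backend = os.environ.get("LLM_BACKEND", "Ollama")
    prompt = f"Write song lyrics for: {description}"
    if backend == "Ollama":
        # Ollama's /api/generate endpoint expects "model" and "prompt"
        base = os.environ.get("OLLAMA_URL", "http://localhost:11434")
        return {
            "url": base.rstrip("/") + "/api/generate",
            "body": {"model": "llama3", "prompt": prompt, "stream": False},
        }
    # OpenAI's chat completions endpoint expects a "messages" list
    return {
        "url": "https://api.openai.com/v1/chat/completions",
        "body": {"model": "gpt-4o-mini",
                 "messages": [{"role": "user", "content": prompt}]},
    }

if __name__ == "__main__":
    os.environ["LLM_BACKEND"] = "Ollama"
    req = build_lyrics_request("upbeat pop song about summer adventures")
    print(json.dumps(req, indent=2))
```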

📖 How It Works

HeartMuse orchestrates a two-stage generation pipeline:

Stage 1: Text Generation (LLM)

  • Takes your song description
  • Generates contextually appropriate lyrics
  • Creates a catchy title
  • Suggests music style tags (genre, mood, instruments)

Stage 2: Music Generation (HeartMuLa)

  • Processes lyrics and tags through HeartMuLa's 3B parameter model
  • Generates high-fidelity audio using HeartCodec
  • Saves output with complete metadata

All generations are saved to the output/ directory with JSON metadata, making it easy to track your creative journey.
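The two-stage flow and the metadata saving can be sketched as follows. Every name here is hypothetical (the stages are stubbed, and the JSON schema is an assumption, not HeartMuse's actual one); it only illustrates the orchestration described above:

```python
import json
import tempfile
from pathlib import Path

def generate_text(description: str) -> dict:
    # Stage 1 (LLM): lyrics, title, and style tags. Stubbed here.
    return {"title": "Dreams in Motion",
            "lyrics": "...",
            "tags": ["pop", "upbeat"]}

def generate_music(lyrics: str, tags: list) -> bytes:
    # Stage 2 (HeartMuLa + HeartCodec): returns audio bytes. Stubbed here.
    return b"RIFF...WAVE"

def run_pipeline(description: str, output_dir: Path) -> Path:
    meta = generate_text(description)
    audio = generate_music(meta["lyrics"], meta["tags"])
    output_dir.mkdir(parents=True, exist_ok=True)
    (output_dir / "song.wav").write_bytes(audio)
    # JSON metadata is saved alongside the audio, as described above
    meta_path = output_dir / "song.json"
    meta_path.write_text(json.dumps({"description": description, **meta}, indent=2))
    return meta_path

if __name__ == "__main__":
    out = run_pipeline("energetic pop song", Path(tempfile.mkdtemp()) / "output")
    print(out.read_text())
```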

🎓 Examples

Example 1: Upbeat Pop Song

Description: "Energetic pop song about chasing dreams"

Generated Output:

  • Title: "Dreams in Motion"
  • Lyrics: Full verses and chorus about ambition and perseverance
  • Tags: pop, upbeat, energetic, electronic, synthesizer
  • Audio: 2-3 minute high-quality music track

Example 2: Melancholic Ballad

Description: "Slow, emotional ballad about lost love"

Generated Output:

  • Title: "Fading Echoes"
  • Lyrics: Heartfelt verses about memories and longing
  • Tags: ballad, slow, melancholic, piano, emotional
  • Audio: Emotive instrumental with appropriate pacing

πŸ™ Credits & Acknowledgments

HeartMuse is built on top of the incredible work by the HeartMuLa team:

Huge thanks to the HeartMuLa authors for open-sourcing their state-of-the-art music generation technology and making professional-quality, locally run AI music generation accessible to everyone!

πŸ› οΈ Technology Stack

  • HeartMuLa - 3B parameter music generation model
  • Gradio - Web interface framework
  • Ollama - Local LLM inference
  • OpenAI API - Cloud LLM option
  • PyTorch - Deep learning backend
  • Python 3.10 - 3.12 - Core runtime

📋 System Requirements

Required:

  • Git
  • Python 3.10 - 3.12 (3.10 recommended)
  • NVIDIA GPU with 8GB+ VRAM (e.g., RTX 3070, RTX 4060, or better)
  • CUDA 12.4+
  • 16GB system RAM
  • 20GB disk space (for models and generated audio)

Memory Optimization:

  • Lazy loading is enabled by default (reduces VRAM footprint)
  • Manual "Unload Model" button frees GPU memory between generations
  • For GPUs with less VRAM, reduce MUSIC_MAX_LENGTH_SEC to generate shorter clips
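The lazy-loading and "Unload Model" behavior boils down to a generic pattern, sketched here in plain Python (this is an illustration of the technique, not HeartMuse's actual implementation):

```python
class LazyModel:
    """Lazy-loading wrapper: the heavy model is only loaded on first
    use and can be unloaded to free (GPU) memory between generations."""

    def __init__(self, loader):
        self._loader = loader  # callable that actually loads the weights
        self._model = None

    @property
    def loaded(self) -> bool:
        return self._model is not None

    def generate(self, prompt: str):
        if self._model is None:  # lazy load on first call
            self._model = self._loader()
        return self._model(prompt)

    def unload(self):
        # Drop the reference; a real PyTorch app would also call
        # gc.collect() and torch.cuda.empty_cache() afterwards.
        self._model = None


if __name__ == "__main__":
    model = LazyModel(lambda: (lambda p: f"audio for {p!r}"))
    print(model.loaded)            # False: nothing loaded yet
    print(model.generate("pop song"))
    print(model.loaded)            # True: loaded on first use
    model.unload()
    print(model.loaded)            # False again: memory freed
```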

πŸ› Troubleshooting

Models Not Downloading

The first run automatically downloads ~3GB of model weights from Hugging Face. Ensure you have:

  • Stable internet connection
  • Sufficient disk space in the ckpt/ directory

Out of Memory Errors

  • Use the "Unload Model" button between generations
  • Reduce MUSIC_MAX_LENGTH_SEC in GUI or .env

Installation Problems

  • Make sure you are using Python 3.10 - 3.12 (other versions are not supported)
  • Update your NVIDIA drivers to the latest version

Ollama Connection Issues

  • Ensure Ollama is running: ollama serve
  • Check OLLAMA_URL matches your Ollama installation
  • Verify the model is downloaded: ollama list

💖 Support the Project

If HeartMuse saves you time or helps you create something cool, consider supporting development 🙏

Sponsor via GitHub

Donate with Bitcoin

bc1qgsn45g02wran4lph5gsyqtk0k7t98zsg6qur0y

πŸ“ License

This project is released under the MIT License. See LICENSE for details.

The HeartMuLa library is licensed separately; please refer to the HeartMuLa repository for its licensing terms.

🤝 Contributing

Contributions are welcome! Feel free to:

  • Report bugs via GitHub Issues
  • Suggest new features
  • Submit pull requests

📧 Support

For questions and support:


Made with ❀️ using HeartMuLa | Developed with assistance from Claude Code

Create music with AI, own your creativity
