Contributing to AudioMuse-AI

Thank you for considering a contribution to AudioMuse-AI. Open-source projects thrive on the collective effort and expertise of their communities, and every contribution, regardless of size, is highly valued.

The vision of AudioMuse-AI is to make sonic analysis open and free for as many users as possible. Each change should therefore aim to make it more useful to more users.

The project is currently in a BETA release stage. This means there is a wealth of opportunity to make a significant and lasting impact. Whether it involves fixing a bug, implementing a new feature, or improving the documentation, your contributions are essential to the project's growth and stability.

Remember that contributing does not only mean writing code. Suggesting a new feature, reporting a bug, or simply sharing your feedback (good or bad, it is always important) through an issue is also contributing.

High-Level Architecture

To contribute effectively, it is crucial to understand that AudioMuse-AI is not a monolithic program. It is a multi-service, containerized application designed for robustness, scalability, and a clear separation of concerns. This architecture is composed of several core components that work in concert.

Flask Web Application (audiomuse-ai-flask): This is the front end of the application, serving both the HTML pages and the API. It also hosts the logic for synchronous operations, such as finding similar songs.
Redis Queue (RQ) Workers (audiomuse-ai-worker): These run whatever needs to execute asynchronously, such as analyzing songs, clustering, or rebuilding the index used for similar-song search. With the Redis queue and a Kubernetes deployment, you can spawn more workers to increase scalability and availability.
PostgreSQL Database (postgres-deployment): The database. It stores not only the analysis results but also the logged status of the asynchronous tasks.
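
The split between synchronous and asynchronous work described above can be sketched with the standard library alone. This is not the project's code: `queue.Queue` stands in for the Redis queue, a plain dict stands in for the task-status table in PostgreSQL, and every function name is hypothetical.

```python
import queue
import threading

# Stand-ins (assumptions): the real project uses Redis Queue and PostgreSQL.
job_queue = queue.Queue()
task_status = {}  # task_id -> "queued" or "finished"


def get_similar_songs(song_id):
    """Synchronous path: answered directly by the web process."""
    # Placeholder result; a real lookup would query the similarity index.
    return [f"{song_id}-similar-{i}" for i in range(3)]


def enqueue_analysis(task_id, song_id):
    """Asynchronous path: the web process only records and queues the job."""
    task_status[task_id] = "queued"
    job_queue.put((task_id, song_id))


def worker():
    """Worker loop: pulls jobs and records their status when done."""
    while True:
        job = job_queue.get()
        if job is None:  # sentinel that stops this sketch's worker
            job_queue.task_done()
            break
        task_id, _song_id = job
        # ... the long-running analysis would happen here ...
        task_status[task_id] = "finished"
        job_queue.task_done()


def run_demo():
    """Queue one job, let a single worker process it, return its status."""
    thread = threading.Thread(target=worker)
    thread.start()
    enqueue_analysis("task-1", "song-42")
    job_queue.put(None)
    job_queue.join()
    thread.join()
    return task_status["task-1"]
```

In this sketch, starting more worker threads loosely corresponds to spawning more worker containers that consume the same queue.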

Supported Architecture and Mediaserver

Remember that this software supports both Intel and ARM architectures, so avoid code that does not work on both except in very specific cases. If you are not able to test on both architectures, say so in the PR description.
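
If you are unsure which architecture you tested on, a small helper like the following can tell you. It is only a sketch: the mapping of machine strings and the wording of the note are assumptions, and "intel" here simply means any x86-64 CPU.

```python
import platform

# Common values of platform.machine(); this mapping is an illustrative assumption.
_FAMILIES = {
    "x86_64": "intel",
    "amd64": "intel",
    "aarch64": "arm",
    "arm64": "arm",
}


def cpu_family(machine=None):
    """Return 'intel', 'arm', or 'unknown' for a machine string."""
    name = (machine or platform.machine()).lower()
    return _FAMILIES.get(name, "unknown")


def pr_note(machine=None):
    """Build a one-line note for the PR description (hypothetical wording)."""
    name = machine or platform.machine()
    return f"Tested on: {cpu_family(name)} ({name})"
```

Calling `pr_note()` without arguments reports the machine the script runs on, which you can paste into the PR description.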

Remember also that this application supports multiple media servers, currently Jellyfin and Navidrome, so try not to introduce changes that could disrupt either one. If you are not able to test on all media servers, say so in the PR description.
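
One way to keep a change for one media server from affecting the other is to route calls through a generic function that dispatches on the configured server type. The sketch below illustrates that idea only. The function names, the returned values, and the `MEDIASERVER_TYPE` variable name are assumptions made for this example, not the project's actual API.

```python
import os


# Hypothetical per-server implementations; real ones would call each server's API.
def _jellyfin_get_all_songs():
    return ["song from Jellyfin"]


def _navidrome_get_all_songs():
    return ["song from Navidrome"]


_GET_ALL_SONGS = {
    "jellyfin": _jellyfin_get_all_songs,
    "navidrome": _navidrome_get_all_songs,
}


def get_all_songs(server_type=None):
    """Generic entry point: pick the implementation for the configured server."""
    # MEDIASERVER_TYPE is an assumed variable name used only in this sketch.
    chosen = server_type or os.environ.get("MEDIASERVER_TYPE", "jellyfin")
    try:
        implementation = _GET_ALL_SONGS[chosen.lower()]
    except KeyError:
        raise ValueError(f"Unsupported media server: {chosen}") from None
    return implementation()
```

With this shape, a fix for Navidrome touches only the Navidrome function, and callers of `get_all_songs` stay unchanged.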

The Codebase Map

The following table lists the most important paths in the repository and their purpose.

Path	Purpose
app.py, app_*.py	The main entry point for the Flask web application. It handles the initialization of the Flask app, database connections, and the registration of API routes and blueprints.
tasks/	The core logic hub. This is where the most intensive computations occur. Each API endpoint or async task points to a specific implementation in this directory.
tasks/mediaserver.py	In this file, the generic methods for interacting with the media servers dispatch to the server-specific implementations.
ai.py	This module centralizes all interactions with Large Language Models (LLMs). It contains the logic for communicating with services like self-hosted Ollama or the Google Gemini API for tasks such as AI-powered playlist naming and translating natural language requests into SQL queries.
config.py	Contains the application's default, non-sensitive configuration parameters. These values serve as fallbacks and can be easily overridden by environment variables, providing a flexible and secure configuration system.
static/ & templates/	These directories contain all frontend assets.
deployment/	Contains deployment examples as well as the supervisord configuration.
Dockerfile, Dockerfile.nvidia	These files contain the instructions for building the OCI-compatible container images for the application.
.github/	This directory holds GitHub-specific configuration files, such as issue templates, pull request templates, and potentially continuous integration/continuous deployment (CI/CD) workflows.
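
The pattern described for config.py, defaults that environment variables can override, can be sketched as follows. The helper functions, setting names, and default values are hypothetical and chosen only for illustration.

```python
import os


def env_int(name, default):
    """Read an integer setting from the environment, falling back to a default."""
    raw = os.environ.get(name)
    return default if raw is None else int(raw)


def env_str(name, default):
    """Read a string setting from the environment, falling back to a default."""
    return os.environ.get(name, default)


# Hypothetical setting names and defaults, for illustration only.
EXAMPLE_NUM_SONGS = env_int("EXAMPLE_NUM_SONGS", 100)
EXAMPLE_SERVER_URL = env_str("EXAMPLE_SERVER_URL", "http://example.invalid")
```

Keeping only non-sensitive defaults in the file and reading everything else from the environment means secrets never need to be committed.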

Prerequisites

The development environment for AudioMuse-AI is fully containerized to ensure consistency and simplify setup. The only required tools are:

Git: For version control and interacting with the GitHub repository.
Docker and Docker Compose: For building and running the containerized application stack.

Because the entire application runs inside Docker containers, there is no need to install Python, PostgreSQL, or Redis directly on the local development machine.

How to compile

If you have a K3s (Kubernetes) cluster at home, I highly recommend deploying a local registry and then deploying the image directly from it. To do that, I suggest following my private-registry how-to.

If you don't have K3s (Kubernetes) at home, you can use docker-compose to build the image on the fly from the docker-compose.yaml file. Remember to point it to your local image by changing this:

audiomuse-ai-flask:
    image: ghcr.io/neptunehub/audiomuse-ai:0.6.5-beta
    # ... rest of the service definition

to something like this:

audiomuse-ai-flask:
    build:
      context: .  # <-- ADD THIS: Tells Docker to look for the Dockerfile in the current directory
      dockerfile: Dockerfile # <-- ADD THIS: Specifies the name of the Dockerfile
    image: audiomuse-ai:dev # <-- CHANGE THIS: Give your local build a new, clear name
    # ... rest of the service definition

Do this for both the Flask and worker containers. Then you can build and run with this command:

docker-compose up --build -d

PR

Before You Start

Check existing PRs and issues to avoid duplicate work
Open a Draft PR early for significant changes to discuss your approach and get feedback before investing too much time
- When creating a PR on GitHub, click the dropdown next to "Create Pull Request" and select "Create Draft Pull Request"
- This gives visibility to other contributors, and maintainers can provide early guidance
Discuss breaking changes or major architectural decisions in an issue or draft PR first

PR Requirements

When submitting a pull request, ensure:

Clear description: Explain what the PR achieves and why the change is needed. Also clearly explain how to test it.
Testing: Verify core features work on at least one architecture (Intel/ARM) and one media server:
- Analysis and Clustering
- Instant Playlist
- Playlist from Similar Song
- Song Path
- Sonic Fingerprint
- (Basically, test each function in the integrated front-end menu at least once)
License compliance: Your code must align with AudioMuse-AI's license
CPU Compatibility: AudioMuse-AI supports both Intel and ARM CPUs, including older Intel processors. PRs that introduce dependencies breaking compatibility with older CPUs will not be merged

How to Open a Draft PR

Push your branch to your fork
Go to the main repository and click "New Pull Request"
Select your fork and branch
Click the dropdown arrow next to "Create Pull Request"
Select "Create Draft Pull Request"
Once ready for review, click "Ready for review" in the PR

This workflow helps avoid spending time on PRs that may not align with project goals.

Related Repositories

AudioMuse-AI: the core application; it runs the Flask and worker containers that provide all the features;

AudioMuse-AI Helm Chart: helm chart for easy installation on Kubernetes;

AudioMuse-AI Plugin for Jellyfin: Jellyfin Plugin.

Questions

If you have any questions, you can raise an issue.
