Ft_transcendence is the final project of the 42 Common Core. Its goal is to make us work with unfamiliar technologies (mainly web-related) and in a larger team (between 3 and 5 developers). It requires a solid understanding of the subject, strong communication skills, and good planning and task distribution.
We used Jira as our project management tool and divided the work into three main areas — backend, frontend, and DevOps. Each member also took the time to learn and experiment with technologies outside their assigned domain, to ensure complete control over the codebase and to handle potential absences within the team.
We also chose to develop inside a virtual machine, since the rootless container setup on campus machines is too restrictive for our stack; the DevOps section below explains this choice in detail.
As we were only a team of three, we had to focus on modules that interested us while remaining consistent with each other and not too time-consuming. That's why we didn't take the server-side modules (CLI, API) at all, nor did we spend much time on graphics. Above all, we wanted a solid backend that communicates logically with the frontend, on top of a DevOps infrastructure that makes the site production-ready.
#### 🛠️ Web (backend/frontend)
├── Major: Use a framework to build the backend (Fastify + Node.js)
├── Minor: Use a framework or toolkit to build the frontend (Tailwind + TS)
└── Minor: Use a database for the backend (SQLite)
#### 👤 User Management
├── Major: Standard user management, authentication and users across tournaments
└── Major: Remote authentication. (Google Sign-In)
#### 🎮 Gameplay & UX
└── Major: Remote Players
#### 🔐 Cybersecurity
└── Major: Two-Factor Authentication (2FA) and JWT.
#### ⚙️ DevOps
├── Major: Designing the Backend as Microservices
├── Major: Infrastructure Setup with ELK for Log Management
└── Minor: Monitoring system (Prometheus + Grafana)
#### 🌍 Accessibility
├── Minor: Expanding Browser Compatibility
└── Minor: Multiple language support
The entire project is designed as services, which are isolated, metered, and orchestrated by a docker-compose.yml file, itself managed by a Makefile (this allows for greater flexibility during the development and debugging phase). For readability, we have divided the compose into several files according to their use.
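For instance, here is a sketch of how such split compose files can be stacked by the Makefile (the file names below are illustrative, not necessarily our exact layout):

```bash
# The Makefile can combine one compose file per concern (illustrative names):
docker compose \
  -f docker-compose.yml \
  -f docker-compose.monitoring.yml \
  -f docker-compose.elk.yml \
  up -d --build
```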
To run the project, we need to ensure we have:
- VirtualBox or similar (for 42 campus environment)
- 8GB RAM minimum, 10GB recommended
- 20GB free disk space
⚠️ NOTE: When running in a 42 campus environment, execute the `make fullstack` command only after commenting out the `grafana`, `prometheus`, and `kibana` server blocks in `reverse-proxy/config/nginx.conf`.
- Installed Docker
- Installed Docker Compose
- Make sure our user has root privileges and is in the docker group:

```bash
sudo usermod -aG docker $USER
```

- 1️⃣ Clone the repo

```bash
git clone https://github.com/Cimeci/ft_transcendence.git
cd ft_transcendence
```

- 2️⃣ Configure env variables

You can find a .env.example file in the repository. Use it to set up your environment variables, such as passwords and API keys. If you want to enable Google or GitHub OAuth, you'll need to create the corresponding client IDs and secrets. You can also set up a Discord webhook in a server and add its URL to the .env file.
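As a hedged sketch, the `.env` holds variables of this kind (the names below are illustrative; refer to `.env.example` for the real ones):

```bash
# Illustrative .env values; the actual variable names are in .env.example.
GOOGLE_CLIENT_ID=your-google-client-id
GOOGLE_CLIENT_SECRET=your-google-client-secret
DISCORD_WEBHOOK_URL=https://discord.com/api/webhooks/...
GRAFANA_ADMIN_PASSWORD=change-me
```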
- 3️⃣ Launch the app

```bash
make
```

The Makefile manages the full Docker Compose lifecycle and provides convenient shortcuts for common operations:
| Command | Description |
|---|---|
| `make help` | Displays all available commands. |
| `make all` | Builds and starts all services (`make build` + `make up`). |
| `make fullstack` | Launches the full web stack only (without ELK and monitoring services). |
| `make down` | Stops and removes all containers. |
| `make down-v` | Stops and removes all containers, networks, and volumes. |
| `make clean` | Stops and removes all containers, networks, volumes, and the SQLite database. |
To get closer to a production-like environment, we placed all services behind a reverse proxy, allowing access to every component via HTTPS. This setup also enables connections from other machines if we want to offload resources from the main server or VM — for example, running Kibana or Grafana on a different host.
A core pillar of observability, monitoring allows us to visualize metrics from the different components of our architecture, troubleshoot bottlenecks, manage resource allocation, and be notified of component health issues.
- Prometheus: Scrapes metrics and stores them in its time-series database (TSDB) for querying.
- Alertmanager: Handles alerting, records issues, and sends notifications when critical problems occur.
- Grafana: A powerful monitoring and visualization tool, used here primarily as a UI to build dashboards from our metrics.
- Various Exporters: Expose component metrics through APIs for Prometheus to scrape.
You can connect to Prometheus at `https://localhost:4443/prometheus/` and Grafana at `https://localhost:4443/grafana/` with the `admin` user name and password defined in the `.env`.
Grafana and Prometheus, which are accessible from outside the system, require a secure connection with a username and password defined in the .env file.
Since Prometheus only accepts bcrypt-hashed passwords for basic auth, we created a small Python script that reads the .env file, hashes the plain-text password, and writes it back as a new environment variable.
This hashed value is then used in Prometheus’ configuration files.
That’s why the Makefile includes the prom-crypt rule — it generates the hashed password before starting the monitoring stack.
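A minimal sketch of the idea behind that helper (the variable names are illustrative; the real script lives in the repo):

```bash
# Hash the plain-text password with bcrypt (the format Prometheus expects)
# and append it to .env as a new variable. Names are illustrative.
source .env
HASH=$(python3 -c "import bcrypt, sys; print(bcrypt.hashpw(sys.argv[1].encode(), bcrypt.gensalt()).decode())" "$PROMETHEUS_PASSWORD")
echo "PROMETHEUS_PASSWORD_HASH='$HASH'" >> .env
```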
Grafana manages its own security policies, and in our setup, we only have one admin user who can modify dashboards and data sources. Anonymous access is disabled, and connections are restricted to HTTPS over localhost through a reverse proxy. We don’t expose any sensitive data through metrics, so visualization security is not a major concern. (Logs are handled by Filebeat and filtered in the backend.)
Only exporters communicate with the sub-networks of the services they monitor. By placing them on the monitoring subnet, Prometheus, Alertmanager, and Grafana can access only the exported metrics—not the services themselves. This isolation limits entry points and strengthens overall security.
However, since each service uses fastify-metrics, it is safer to allow Prometheus to access the backend rather than letting every service access the monitoring network (we want to keep services isolated from monitoring components).
We chose fastify-metrics, which uses the official prom-client under the hood, as it provides a framework-specific abstraction with automatic /metrics routes and built-in HTTP metrics.
```js
// fastify-metrics wraps prom-client: it exposes a /metrics route
// and records built-in HTTP metrics for each service.
import fastifyMetrics from 'fastify-metrics';

await app.register(fastifyMetrics, {
  endpoint: '/metrics'
});
```

Note: To avoid additional latency that could distort the metrics, we do not route them through the API Gateway; they remain accessible at http://localhost:4443//metrics, and to secure this, Nginx blocks all external access to these endpoints.
In production, the ELK stack would typically be monitored using the native Elastic ecosystem (Metricbeat + Elastic Agent).
However, since this project required us to monitor components using Prometheus, we combined the community-maintained elasticsearch-exporter with telegraf, which collects JSON metrics from the pipeline components (Filebeat and Logstash) and reformats them into Prometheus-compatible metrics.
We also monitor Telegraf itself, since this is a small-scale architecture. In larger setups, the container should be monitored separately, and if resource usage grows too high, resilience and buffering mechanisms could be added to prevent data loss in case of crashes.
A simple configuration is sufficient since the stack runs temporarily and has limited disk space (6 GB for all services):
- Prometheus automatically deletes the oldest blocks in its TSDB to keep storage under 1 GB or to remove metrics older than 7 days (see the flags sketched after this list).
- Local Docker volume persistence ensures data durability if services are restarted.
- Because this is a short-lived academic project, we did not implement snapshots.
- Write-Ahead Log (WAL) compression is enabled by default in Prometheus versions later than 2.20.0, reducing storage usage and protecting the write journal in case of a crash.
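As a sketch, the retention rules above map to two standard Prometheus launch flags (the config path is illustrative; the real values live in our compose files):

```bash
# Cap TSDB size at 1 GB and drop metrics older than 7 days.
prometheus \
  --config.file=/etc/prometheus/prometheus.yml \
  --storage.tsdb.retention.size=1GB \
  --storage.tsdb.retention.time=7d
```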
Since we use SQLite as our database to keep storage simple, there is no server to monitor or query. Monitoring our DB with custom exporters would have risked bloating the backend code and, above all, locking the DB file on every monitoring inspection. We would therefore have obtained few metrics (because they would be spaced out in time) while increasing the number of requests on the DB.
- Avoid Prometheus and Alertmanager `${VAR}` substitution issues

  - Current: We created a custom Dockerfile based on Alpine to replace environment variables at runtime, keeping encrypted passwords and webhooks in a .env file.
  - Recommendation / Production: In large-scale production, tools like Kubernetes Secrets would handle this.

- Monitor our DB

  - Current: We use SQLite for simplicity, so there's no dedicated database server to monitor.
  - Why skipped: Using a custom exporter for SQLite would increase backend complexity and could lock the database file during inspections, resulting in fewer and delayed metrics.
  - Production: Switching to PostgreSQL or another server-based database would be a solution at a larger scale.
ELK stands for Elasticsearch, Logstash, and Kibana - a powerful stack for centralized log management:
- Elasticsearch: Search and analytics engine that stores and indexes logs
- Logstash: Data processing pipeline that enriches and transforms logs
- Kibana: Visualization interface for exploring and analyzing data
- Filebeat: Lightweight log shipper (added to the traditional ELK stack)
This stack allows us to:
- Centralize logs from multiple services
- Search through millions of log entries in seconds
- Create real-time dashboards and alerts
- Monitor application health and performance
You can connect to Kibana at `https://localhost:4443/kibana/` with the `elastic` user name and password defined in the `.env`.
As the subject itself specifies:

> Several container technologies exist: Docker, containerd, podman, etc. On the computers of your campus, you may access the container software in rootless mode for security reasons. This could lead to the following extra constraints:
> - Your runtime needs to be located in /goinfre or /sgoinfre.
> - You are not able to use "bind-mount volumes" between the host and the container if non-root UIDs are used in the container.
>
> Depending on the current requirements of the subject (highlighted in green above) and the local configuration in clusters, you may need to adopt different strategies, such as: container solution in virtual machine, rebuild your container after your changes, craft your own image with root as unique UID.
We decided to build our project in a virtual machine because:
- The rootless containers on campus limit bind mounts and memory/JVM options.
- The ELK stack requires these features to function properly (and we're limited in RAM and disk space on our machines).
- The VM allows us to regain a full root environment and use Docker normally.
The subject specifies that the ELK tools (Elasticsearch, Logstash, Kibana) are required. But to keep the stack lightweight, we chose to add Filebeat, which handles log ingestion and ships logs to Logstash, for several reasons:
- Lighter than Logstash (low memory and CPU consumption)
- It handles offset tracking on its own (no duplicates if restarted)
- It buffers logs while waiting for Logstash to respond
- It can manage ILM policies on its own (but we discovered that using Logstash as an intermediary for ES disables this feature)
- Logstash only manages log enrichment and is therefore more efficient
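For illustration, a minimal Filebeat configuration of this shape (the paths and host names are assumptions, not our exact file):

```yaml
# Read application logs and ship them to Logstash (illustrative values).
filebeat.inputs:
  - type: filestream
    id: app-logs
    paths:
      - /var/log/app/*.log

output.logstash:
  hosts: ["logstash:5044"]
```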
Our stack consists of seven different containers, each with its own role, launched in a specific order. Each of them communicates through the `elk` Docker network:
- 1️⃣ certs - Temporary container that generates all SSL/TLS certificates for secure communication between components. Stops after certificate generation.
- 2️⃣ es01 (Elasticsearch) - Exposes ports `9200` (HTTPS API) and `9300` (cluster communication) on the `elk` network only. Not exposed to the host for security.
- 3️⃣ ilm-manager - Temporary container that configures Index Lifecycle Management (ILM) policies in Elasticsearch. Stops after configuration.
- 4️⃣ kibana - Exposes port `5601` (HTTPS API) on the network.
- 5️⃣ kibana-dashboards - Temporary container that imports pre-configured dashboards into Kibana. Stops after import.
- 6️⃣ logstash - Exposes ports `5044` (Beats input) and `9600` (HTTP monitoring API) on the `elk` network only.
- 7️⃣ filebeat - Reads application logs from `/var/log/app` and forwards them to Logstash on port `5044`.
ELK Stack architecture with secure mTLS communication between components
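A hedged compose sketch of how this startup order can be expressed (service names match the list above; our actual healthchecks may differ):

```yaml
# One-shot containers gate the services that depend on them.
services:
  es01:
    depends_on:
      certs:
        condition: service_completed_successfully
  ilm-manager:
    depends_on:
      es01:
        condition: service_healthy
```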
Note: The stack runs in basic mode, so some warnings are displayed at launch (AI Assistant or AWS S3 Region). These tools are not used, so the logs can be ignored.
The elk_certs container is built from an Elasticsearch image, which contains the elasticsearch-certutil command and allows us to create the different certificates.
Services communicate over mutual TLS (mTLS): each service has its own certificate, signed by the internal CA ca.crt (which is how all the different certificates can trust each other).
The stack runs on an isolated network: Elasticsearch (9200) is NOT exposed to the host, and Kibana (5601) is accessible via the /kibana endpoint of our reverse proxy. All other communications happen within the private Docker elk network.
The ilm-manager container waits for Elasticsearch to be ready and uses the REST API to configure Index Lifecycle Management:
- Creates and applies ILM policies
- Creates index templates
- Bootstraps the first index
Our policy manages indices through four phases based on age and size:
- 🔥 Hot Phase (Active writing)
Creates a new index when ANY condition is met.
- Primary shard reaches 1GB
- Index is 7 days old
- Index contains 1,000,000 documents.
- 🌡️ Warm Phase (After 3 days) Sets the index to read-only; used for recent historical data that's queried less frequently.
- ❄️Cold Phase (After 7 days) Rarely accessed data, stored for compliance or historical analysis
- 🗑️ Delete Phase (After 30 days) Index is permanently deleted, frees up storage space.
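As a sketch, creating such a policy boils down to one REST call (the policy name and credentials are illustrative, and the cold phase is omitted for brevity):

```bash
# Create/replace the ILM policy matching the phases above (illustrative values).
curl -sk -X PUT "https://es01:9200/_ilm/policy/transcendence-policy" \
  -u "elastic:${ELASTIC_PASSWORD}" \
  -H 'Content-Type: application/json' \
  -d '{
    "policy": {
      "phases": {
        "hot": { "actions": { "rollover": {
          "max_primary_shard_size": "1gb",
          "max_age": "7d",
          "max_docs": 1000000
        } } },
        "warm": { "min_age": "3d", "actions": { "readonly": {} } },
        "delete": { "min_age": "30d", "actions": { "delete": {} } }
      }
    }
  }'
```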
We also create an alias, transcendence, pointing to the real current index, whose name is suffixed with a number, e.g. transcendence-000001. This means that after a rollover we get:

```
transcendence (ALIAS)
 ↓ pointing to
transcendence-000002 (new index, active for writing)
--------------------
transcendence-000001 (old index, read-only)
```
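Bootstrapping the first index then simply binds the alias as the write target; a minimal sketch (credentials illustrative):

```bash
# Create the first index and attach the write alias (illustrative values).
curl -sk -X PUT "https://es01:9200/transcendence-000001" \
  -u "elastic:${ELASTIC_PASSWORD}" \
  -H 'Content-Type: application/json' \
  -d '{ "aliases": { "transcendence": { "is_write_index": true } } }'
```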
This architecture allows:
- ✅ Seamless index rotation without downtime
- ✅ Applications always write to the `transcendence` alias
- ✅ Automatic lifecycle management
- ✅ Optimized storage and performance
Note: While this is somewhat overkill for an educational project like Transcendence, it demonstrates best practices for log management at scale and was a requirement of the subject.
As this stack is dedicated to a school project and cannot be maintained on a server permanently, we couldn't simply create dashboards from the Kibana UI and save them. To work around this, we exported 3 dashboard views and set up a kibana-dashboards container, built from an Alpine image, which uses the REST API to POST the ndjson dashboards.
- 1️⃣ 📊 Application Overview - Checks our services' stats (by request, attempt).
- 2️⃣ 🚨 Errors and Security - Monitors all `ERROR` and `FATAL` logs to keep a maintainable infra, and builds a top list of users making several connection attempts.
- 3️⃣ ⚡ Performances - Uses the `response_time` field to build performance views of our webapp.
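A sketch of the import call such a container can make against Kibana's saved-objects API (the file name and credentials are illustrative):

```bash
# Import the exported ndjson dashboards into Kibana (illustrative values).
curl -sk -X POST "https://kibana:5601/api/saved_objects/_import?overwrite=true" \
  -u "elastic:${ELASTIC_PASSWORD}" \
  -H "kbn-xsrf: true" \
  --form file=@dashboards.ndjson
```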
As this project is built in a limited school environment with educational constraints rather than production requirements, we chose to keep a KISS (Keep It Simple, Stupid) workflow. However, in a production environment, we would consider:
- Keep TLS outside the ELK only

  - Current: TLS is enabled both on the reverse proxy and between each ELK component.
  - Why this happened: Initially, as this was our first web project, we overcomplicated the architecture by securing each component individually.
  - Recommendation / Production: In production, it's generally simpler and sufficient to terminate TLS at the reverse proxy.

- Persistent Queue (Logstash)

  - Current: In-memory queue only
  - Why skipped: No critical audit data, easy service restart, memory constraints (1GB limit)
  - Production: Redis or Kafka for durable queuing and replay capability

- Secrets Management

  - Current: Logstash uses its keystore, other services use environment variables
  - Why skipped: The Kibana keystore presented integration issues, Docker Compose simplicity was preferred, and a dedicated Vault module exists in the subject.
  - Production: HashiCorp Vault, Docker Swarm secrets, or Kubernetes secrets

- Multi-node Cluster

  - Current: Single-node Elasticsearch (master + data)
  - Why skipped: Resource constraints, added complexity
  - Production: Separate master and data nodes for better scalability and reliability
- Official JS Documentation
- Official Fastify Documentation
- Fastify JWT plugin
- Fastify Better-sqlite plugin
- Fastify-metrics plugin
- Official Prometheus Documentation
- Alerting philosophy by Rob Ewaschuk, from observations at Google
- Github issue for variable substitution
- Official Telegraf Documentation
- Official Elasticsearch-exporter Documentation
- Official cAdvisor Documentation
- Official Grafana Documentation
- Official ELK Documentation
- Official Docker Documentation
- Official Pino Documentation
- Guides ELK [slightly outdated but still a good starting point]
- Repo inspiration
📍[42 Angouleme - October 25]

