Budget App Backend — Event-Driven

Backend system powering an offline-first collaborative budgeting app. Accepts immutable client events, resolves conflicts via server-authoritative versioning, and delivers ordered change streams to keep multiple devices eventually consistent.

Android App · LinkedIn · GitHub · Email

Why This Project Exists

Standard CRUD APIs assume the client and server are always in sync. This backend was built to solve a harder problem: what happens when multiple users edit shared data independently, sometimes offline, and changes arrive out of order?

Instead of mutating state directly, this system accepts immutable, idempotent events from clients. Every change is validated against server-side version numbers, conflicts are resolved deterministically, and clients receive an ordered stream of accepted events to reconstruct consistent local state.

This is the backend counterpart to a collaborative Android budgeting app that implements the Outbox/Inbox sync pattern. Both sides were designed together — the API contract, event model, and conflict resolution strategy are deliberately co-designed across client and server.

System Overview

Client
  │
  ├── POST /events          → Railway (Node.js server) → PostgreSQL (Aiven)
  │                                 ↓
  │                         Firebase Auth (per-request token validation)
  │
  ├── GET  /events          → long-poll for ordered change stream (inbox)
  │
  └── GET  /user/photo-upload-url → S3 presigned URL → client uploads direct
                                         ↓
                                    CloudFront CDN (serves profile photos)

Server hosted on Railway. S3 and CloudFront provisioned via Terraform. Database managed by Aiven.

Architecture

Single Node.js Server on Railway

All routes are handled by one Express server deployed as a persistent Node.js process on Railway. This replaces the previous Lambda-based setup entirely — there is no API Gateway or Lambda in the current architecture.

Running as a long-lived process means the server initialises once and stays up. There are no cold starts, and no per-invocation initialisation overhead.

Connection Pooling

The PostgreSQL connection pool is initialised at startup and reused across all incoming requests for the lifetime of the process. Because Railway runs a persistent server (not ephemeral function instances), the pool behaves as expected — connections are held and reused rather than torn down between requests. If Railway scales to multiple instances, each maintains its own pool; at current scale this is not a concern.

Authentication

Every request passes through a middleware layer backed by the Firebase Admin SDK. The middleware validates the Firebase JWT, extracts the user ID, and injects it into the request context. Business logic never handles raw tokens.

The Firebase serviceAccount.json is not committed to the repository. Instead it is base64 URL-safe encoded and passed to the server as the FIREBASE_SERVICE_ACCOUNT_BASE64 environment variable. The middleware decodes it at startup to initialise the Admin SDK.

Request Flow

Client Request
  → Railway (Node.js server)
  → Auth Middleware (Firebase token validation)
  → Route Handler
  → Service Layer (business logic + conflict resolution)
  → Drizzle ORM
  → PostgreSQL / Aiven (ACID transaction)
  → Response

API Reference

The API surface covers four domains: events (the core sync mechanism), budgets, user profile, and public profiles.

Events

Method	Endpoint	Description
`POST`	`/events`	Submit a batch of events (max 25). Processed in order; stops on first error.
`GET`	`/events?budgetId=&key=&count=`	Fetch ordered event stream for inbox (long-poll cursor-based).

Budgets

Method	Endpoint	Description
`POST`	`/budget`	Create a new budget
`GET`	`/budgets?key=&count=`	List budgets for authenticated user (paginated)
`GET`	`/budgets/:budgetId`	Get full budget snapshot
`GET`	`/budgets/:budgetId/join`	Join a shared budget via invite link
`DELETE`	`/budgets/:budgetId/leave`	Leave a shared budget
`GET`	`/budgets/:budgetId/last-event-sequence`	Get last processed sequence number (sync cursor)
`GET`	`/budgets/:budgetId/participants`	List all participants
`GET`	`/budgets/:budgetId/participants/:participantId`	Get single participant
`GET`	`/budgets/:budgetId/categories`	List all categories
`GET`	`/budgets/:budgetId/categories/:categoryId`	Get single category
`GET`	`/budgets/:budgetId/expenses/:expenseId`	Get single expense

User Profile

Method	Endpoint	Description
`GET`	`/user`	Get authenticated user's profile
`PUT`	`/user`	Update profile fields
`GET`	`/user/photo-upload-url`	Get S3 presigned URL for photo upload
`POST`	`/user/confirm-photo-upload`	Confirm upload complete, update profile record
`DELETE`	`/user/profile-photo`	Remove profile photo
`GET`	`/profiles/:userId`	Get any user's public profile

Profile Photo Upload Flow

Direct-to-S3 upload avoids routing binary data through the server:

Client → GET /user/photo-upload-url → receives presigned S3 URL
Client → PUT <presigned URL> (uploads directly to S3)
Client → POST /user/confirm-photo-upload → server updates DB record
User   → photo served via CloudFront CDN

Event Processing Model

What an Event Is

Every client action is represented as an immutable event with a globally unique eventId. Events describe what happened, not what the new state should be.

{
  "eventId": "d0e9f472-0992-4841-a43a-537b9a89d9b6",
  "eventType": "expense.add",
  "budgetId": "64143031-e16c-4c97-859d-8997e94ed98e",
  "recordId": "813667ca-8a40-4678-9602-f198d86eff1e",
  "when": 1774718381798,
  "amount": "150.00",
  "note": "coffee",
  "date": "2025-05-06"
}

Create events carry no version. Update and delete events must include the current version — this is how conflicts are detected.

Processing Pipeline

Receive batch → Validate schema → For each event (in order):
  ├── Check accepted_events for duplicate eventId
  │     └── Duplicate → return cached result, skip processing
  ├── Check entity version (update/delete only)
  │     └── Mismatch → return latest record, stop batch
  ├── Apply change within ACID transaction
  │     ├── Write to entity table (budget / category / expense)
  │     └── Write to accepted_events (idempotency record)
  └── Return result for this event

Batch Error Behaviour

Events in a batch are processed sequentially in submission order. If an event fails (version mismatch, validation error), processing stops at that point. Events already successfully processed in the same batch are not rolled back — they are committed and their results are included in the response. The client receives a partial success response and can resume from the failed event.

This mirrors how the Android client's Outbox worker handles partial batch responses — it deletes confirmed events and retries from the failure point.

Idempotency

accepted_events has a unique constraint on eventId. If a client retries a previously processed event (e.g., network dropped after the server responded), the server detects the duplicate, returns the original cached result, and skips reprocessing. Retries are always safe.

Conflict Resolution

Each entity row carries a server-assigned version number, incremented on every accepted mutation. The server is the only system that assigns versions.

Resolution Table

Scenario	Server Action	Client Response
Version matches	Apply event, increment version, return updated record	Apply response to local DB
Version mismatch	Return current latest record (no mutation)	Overwrite local record with server state
Duplicate `eventId`	Return original cached response	Treat as success, move on
Client needs full state	`GET /budgets/:id` snapshot	Replace local data entirely

Why This Approach

Version-based resolution is simpler than CRDT or operational transform strategies — there's no merge logic to implement or test. The trade-off is that the losing client's change is silently discarded rather than merged. For a budgeting app where precision matters over leniency, this is the correct trade-off.

Database Design

Normalized relational schema in PostgreSQL. All mutations use ACID transactions.

Schema

users
  └── id (PK)
  └── ...profile fields

budgets
  └── id (PK)
  └── owner_id (FK → users)

participants
  └── budget_id (FK → budgets)   ← composite PK
  └── user_id   (FK → users)     ←

categories
  └── id (PK)
  └── budget_id (FK → budgets)
  └── version

expenses
  └── id (PK)
  └── category_id (FK → categories)
  └── created_by  (FK → users)
  └── version

accepted_events
  └── event_id (PK / unique) ← idempotency enforcement
  └── budget_id
  └── response_payload        ← cached for duplicate requests
  └── processed_at

Relationships

User → owns many Budgets
Budget → has many Participants (join table with users)
Budget → has many Categories
Category → has many Expenses
Expense → belongs to one User (creator)
accepted_events → stores every processed eventId with its response payload for idempotent replay

Version Column

All budgets, categories and expenses carry a version integer. The server increments this on every accepted update or delete. Clients must echo the current version in mutation events — a mismatch is the conflict signal.

Cloud Infrastructure

S3 and CloudFront are provisioned and managed via Terraform. The Node.js server runs on Railway. The database is hosted on Aiven. No manual console configuration for any of these.

Resources

Resource	Purpose
Railway	Hosts the Node.js server; auto-deploys on push to main
PostgreSQL (Aiven)	Primary data store. See Aiven
S3 Bucket	Profile photo storage (presigned upload target)
CloudFront	CDN for serving profile photos

Environment

Currently a single production environment. Multi-environment (dev/staging/prod) separation is on the roadmap.

CI/CD Pipeline

Implemented with GitHub Actions, triggered on push to main. Railway detects the same push and auto-deploys the server independently — GH Actions does not trigger Railway directly, but Terraform and migrations are prerequisites that should complete before Railway's new build starts serving traffic.

Pipeline Steps

Push to main
  │
  ├── 1. Apply Terraform changes (S3 + CloudFront)
  │         Storage infrastructure changes applied first
  │
  ├── 2. Deploy database migrations
  │         Drizzle migrations run against Aiven PostgreSQL
  │
  └── 3. Railway auto-deploys (triggered by the same push)
            New server build goes live after infra and schema are current

Deploy only triggers on main. Feature branches do not trigger any of these steps.

The ordering matters: infrastructure is updated before migrations, and migrations run before the new server build is live — preventing the updated handler from running against a stale schema.

Tech Stack

Layer	Technology	Reason
Runtime	Node.js	Persistent server process, no cold start overhead
Language	TypeScript	Type safety across API contracts and DB schema
Hosting	Railway	Managed Node.js deployment, auto-deploy from GitHub
ORM	Drizzle	TypeScript-native, schema-as-code, migration tooling
Database	PostgreSQL (Aiven)	ACID transactions, referential integrity, versioning
Auth	Firebase Admin SDK	Offloads user management; JWT validation in middleware
Storage	AWS S3 + presigned URLs	Binary uploads bypass the server; no size or timeout limits
CDN	CloudFront	Low-latency profile photo delivery
Infrastructure	Terraform	Manages S3 + CloudFront; reproducible and version-controlled
CI/CD	GitHub Actions	Terraform + migration pipeline on push to main

Project Structure

src/
  ├── routes/           ← Route handlers (one file per domain)
  ├── services/         ← Business logic (event processing, conflict resolution)
  ├── db/               ← Drizzle schema definitions and query helpers
  └── middleware/       ← Auth context extraction, error handling

drizzle/
  └── migrations/       ← Generated migration files (version-controlled)
  └── seed/            ← Sample data

terraform/
  ├── storage.tf        ← S3, CloudFront
  └── variables.tf

scripts/                ← Utility scripts (seed, migration runner)

Setup & Installation

Prerequisites

Node.js 22+
Docker (local PostgreSQL)
AWS CLI configured with credentials (for S3 + CloudFront via Terraform)
Firebase project with Admin SDK service account

Local Development

git clone https://github.com/rahulstech/budget-app-backend.git
cd budget-app-backend
npm install

Rename example.env to dev.env with. Fill necessary fileds. Comment unnecessary field.

Start local database:

docker run -d \
  --name budgetdb \
  -e POSTGRES_USER=postgres \
  -e POSTGRES_PASSWORD=postgres \
  -e POSTGRES_DB=budget_db \
  -p 5432:5432 \
  postgres:15

Run migrations and start server:

# create tables 
npm run drizzle:migrate

# (optional) add fake users
npm run drizzle:seed:fake_users

# start development server
npm run dev

Deploying

Before pushing to main, complete these one-time manual steps in order:

1. Provision the Aiven PostgreSQL database

Create the PostgreSQL service on Aiven first. The database must exist before migrations can run. Copy the connection string — you'll need it for the Railway environment variables below.

2. Configure Railway environment variables

In the Railway project dashboard, open the Variables table and add all required environment variables manually. Key ones to set:

Variable	Notes
`DATABASE_URL`	Aiven PostgreSQL connection string
`CDN_BASE_URL`	CloudFront distribution URL — only available after Terraform has provisioned CloudFront (step 3). Add or update this after the first Terraform run.
`FIREBASE_SERVICE_ACCOUNT_BASE64`	Base64 URL-safe encoded `serviceAccount.json` (see Authentication note in Architecture)
(other env vars)	Refer to `example.env` for the full list

3. Apply Terraform (S3 + CloudFront)

Ensure Terraform is initialised and AWS credentials are set, then push to main — the GH Actions pipeline applies Terraform changes and runs database migrations automatically. After the first Terraform run, copy the CloudFront URL into CDN_BASE_URL in Railway if not already set.

4. Server (Railway)

Railway is connected to this repository and auto-deploys on every push to main. No manual deploy step is needed once environment variables are configured.

Testing

Current State

Manual API testing via Postman, covering:

Valid event submission (single and batch)
Duplicate eventId — verifies idempotent response
Version mismatch on update/delete — verifies latest record returned
Partial batch — verifies stop-on-error behaviour and partial response
Long-poll event fetch with sequence cursor
Snapshot download via budget GET

What Needs Automated Tests (and Why)

The highest-risk logic in this system is the event processing pipeline and conflict resolution — both are pure functions over database state and are straightforward to unit test with a test database or mocked Drizzle client.

Priority test targets:

Target	Type	Why
Idempotency check	Unit	Core correctness guarantee
Version conflict path	Integration	Most complex branching logic
Partial batch response	Integration	Client depends on exact response shape
Schema migrations	Migration test	Prevents data loss on deploy

This is the most significant gap in the current project and the first area for improvement.

Limitations & Trade-offs

Stop-on-First-Error Batch Processing

The current batch model stops at the first failing event. This is simple and predictable, but means a single version conflict blocks the rest of a batch. An alternative is to process all events independently and return a per-event result map — more complex to implement on both client and server, but more efficient for large batches with isolated failures.

No Real-Time Push

The inbox model uses client-initiated long polling. The server has no mechanism to push changes to connected clients. This means update propagation latency is bounded by the polling interval, not by the event arrival time. WebSockets or Server-Sent Events would close this gap.

Single Environment

Terraform currently manages one environment. There's no dev/staging separation, which means infrastructure changes go directly to production.

No Automated Tests

Covered in the Testing section above. This is the most important gap for production readiness.

Roadmap

Priority	Item	Reason
High	Integration tests for event pipeline and conflict resolution	Core correctness, highest-risk code
Medium	Per-event batch result map (replace stop-on-first-error)	More resilient client sync
Medium	Cursor-based pagination standardization across all list endpoints	Consistency and scalability
Medium	Terraform multi-environment (dev/staging/prod)	Safe infra changes
Medium	Delta sync endpoint	Avoid full snapshot on every client join
Low	Authorization result caching	Reduce Firebase SDK call per request
Low	Reporting and analytics endpoints	Category-wise spending summaries

👤 Author — Rahul Bagchi

Backend and Android developer focused on distributed systems and offline-first architecture. Built both sides of this system — the Android client and this backend — as a cohesive end-to-end exploration of sync reliability and eventual consistency.

GitHub: rahulstech
LinkedIn: Rahul Bagchi
Email: rahulstech18@gmail.com
Android counterpart: budget-app-android

Open to backend, Android, and full-stack roles — particularly where distributed data or sync complexity is part of the problem.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.github/workflows		.github/workflows
drizzle		drizzle
scripts		scripts
src		src
terraform		terraform
.gitignore		.gitignore
ReadMe.md		ReadMe.md
budgetify.png		budgetify.png
example.env		example.env
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Folders and files

Latest commit

History

Repository files navigation

Budget App Backend — Event-Driven

Contents

Why This Project Exists

System Overview

Architecture

Single Node.js Server on Railway

Connection Pooling

Authentication

Request Flow

API Reference

Events

Budgets

User Profile

Profile Photo Upload Flow

Event Processing Model

What an Event Is

Processing Pipeline

Batch Error Behaviour

Idempotency

Conflict Resolution

Resolution Table

Why This Approach

Database Design

Schema

Relationships

Version Column

Cloud Infrastructure

Resources

Environment

CI/CD Pipeline

Pipeline Steps

Tech Stack

Project Structure

Setup & Installation

Prerequisites

Local Development

Deploying

Testing

Current State

What Needs Automated Tests (and Why)

Limitations & Trade-offs

Stop-on-First-Error Batch Processing

No Real-Time Push

Single Environment

No Automated Tests

Roadmap

👤 Author — Rahul Bagchi

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages