SME Credit Risk Analysis

A Node.js + Express API that analyzes SME bank statement CSV files and generates a counterparty concentration risk report.

The service processes inflow and outflow transactions, normalizes noisy counterparty names, detects concentration risks, and provides a simple risk recommendation.

Features

Upload SME bank statement CSV files
Analyze top inflow and outflow counterparties
Normalize and deduplicate noisy counterparty names
Detect concentration risk flags
Monthly inflow concentration trend analysis
Risk scoring system (0 = diversified, 100 = high concentration risk)
Recommendation engine
Swagger API documentation
Unit test coverage

Tech Stack

Node.js
Express.js
Multer
CSV Parser
Jest
Swagger UI

API Endpoint

POST `/analyse`

Upload a CSV bank statement file and receive a concentration risk analysis report.

Example Output

{
  "concentration_score": 50,
  "recommendation": {
    "decision": "investigate_concentration"
  }
}

Running Locally

Prerequisites

Make sure Node.js is installed on your machine.

Check installation:

node -v
npm -v

Install Dependencies

npm ci

Run in Development Mode

npm run dev

Run in Production Mode

npm run main

Run Test Cases

npm run test

Swagger API Documentation

After starting the server, open:

http://localhost:3000

You can directly upload a CSV file and test the API from Swagger UI.

Project Structure

src/
 ├── index.js
 ├── services/
 ├── utils/
tests/

index.js → API routes and Swagger setup
services/ → core business logic
utils/ → normalization, fuzzy matching, scoring
tests/ → unit tests

Design Decisions & Tradeoffs

Used rule-based scoring instead of ML to keep the system explainable and easier to review.
Used normalization + fuzzy matching instead of hardcoded aliases for better scalability across datasets.
Focused specifically on concentration risk instead of building a full credit underwriting engine.

AI Usage

AI was used as a pair-programming assistant for:

project structuring
Swagger setup
debugging Express/multer issues
improving normalization logic
creating test cases
validating output against requirements

AI initially suggested a reversed scoring model where 100 = good, but the requirement expected 100 = high concentration risk. This was corrected after validating the assessment requirements manually.

AI also missed some counterparty deduplication edge cases initially, which were identified by reviewing the generated JSON output and refining normalization/fuzzy matching logic.

Future Improvements

Better entity resolution and fuzzy matching
Docker support
Improved audit trail for merged counterparties
More edge-case and performance tests
Better CSV validation and error handling

Author

Ashit Chuphal

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SME Credit Risk Analysis

Features

Tech Stack

API Endpoint

POST `/analyse`

Example Output

Running Locally

Prerequisites

Install Dependencies

Run in Development Mode

Run in Production Mode

Run Test Cases

Swagger API Documentation

Project Structure

Design Decisions & Tradeoffs

AI Usage

Future Improvements

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SME Credit Risk Analysis

Features

Tech Stack

API Endpoint

POST /analyse

Example Output

Running Locally

Prerequisites

Install Dependencies

Run in Development Mode

Run in Production Mode

Run Test Cases

Swagger API Documentation

Project Structure

Design Decisions & Tradeoffs

AI Usage

Future Improvements

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

POST `/analyse`

Packages