[FinServAssessments] Add 64 FinServ GenAI risk checks (FS-01 to FS-69) by mehtadman87 · Pull Request #23 · aws-samples/sample-aiml-security-assessment

mehtadman87 · 2026-04-29T16:41:48Z

Summary

Adds 64 standalone security checks (FS-01 to FS-69) plus 5 upstream extensions to the AIML Security Assessment framework, derived from the AWS guide for Financial Services risk management of the use of Generative AI (March 2026).

The new checks are delivered as a new Lambda function (finserv_assessments/) that runs in parallel with the existing Bedrock, SageMaker, and AgentCore assessment Lambdas. The function produces a finserv_security_report_{execution_id}.csv that is loaded into the consolidated report.

Closes #22

Risk categories covered (15 total)

Range	Category	PDF §
FS-01..06	Unbounded Consumption	§1.2.11
FS-07..11	Excessive Agency	§1.2.9
FS-12..16	Supply Chain Vulnerabilities	§1.2.12
FS-17..21	Training Data & Model Poisoning	§1.2.14
FS-22..26	Vector & Embedding Weaknesses	§1.2.15
FS-27..30	Non-Compliant Output	§1.2.1
FS-31..34	Misinformation	§1.2.3
FS-35..38	Abusive or Harmful Output	§1.2.4
FS-39..42	Biased Output	§1.2.5
FS-43..46	Sensitive Information Disclosure	§1.2.6
FS-47..50	Hallucination	§1.2.7
FS-51..54	Prompt Injection	§1.2.8
FS-55..58	Improper Output Handling	§1.2.13
FS-59..60	Off-Topic & Inappropriate Output	§1.2.2
FS-61..63	Out-of-Date Training Data	§1.2.10
FS-64..69	Material Gap Checks (cross-category)	—

Upstream extensions (5 checks merged into existing checks)

These 5 FS checks add detection/remediation depth to existing checks rather than shipping as standalone entries:

FS check	Upstream check	What it adds
FS-17	SM-07	Model Monitor data quality baseline cadence + CloudWatch namespace
FS-18	SM-23	Low-entropy classification monitoring as poisoning early-warning
FS-19	SM-22	PendingManualApproval default + flag auto-approved latest versions
FS-23	BR-06	Advanced event selector for `AWS::Bedrock::KnowledgeBase` data events
FS-64	BR-04	Guardrail trace logging verification (action/inputAssessments/outputAssessments)

Files changed

New:

functions/security/finserv_assessments/ — new Lambda (app.py, schema.py, requirements.txt, __init__.py)
docs/SECURITY_CHECKS_FINSERV_COMMON.md — shared intro, severity rubric, upstream-overlap table, compliance framework mapping
docs/SECURITY_CHECKS_FINSERV_PART1_INFRA_CONTROLS.md — FS-01..26
docs/SECURITY_CHECKS_FINSERV_PART2_GUARDRAILS_CONTENT_SAFETY.md — FS-27..46
docs/SECURITY_CHECKS_FINSERV_PART3_APP_LAYER_AND_GAPS.md — FS-47..69
docs/AIMLSecurityAssessment-MappingsTable.csv — machine-readable gap mapping
.ash/.ash.yaml — ASH v3.2.6 config with documented suppressions for upstream-convention findings

Modified:

template.yaml + template-multi-account.yaml — add FinServSecurityAssessmentFunction resource, DefinitionSubstitutions, and LambdaInvokePolicy
statemachine/assessments.asl.json — add FinServ branch to Run Security Assessments parallel state
deployment/1-aiml-security-member-roles.yaml + deployment/aiml-security-single-account.yaml — add FinServGenAIRiskAssessmentPermissions IAM statement
functions/security/generate_consolidated_report/app.py — load finserv_security_report_*.csv
functions/security/bedrock_assessments/app.py — FS-64/FS-23 extension notes
functions/security/sagemaker_assessments/app.py — FS-17/FS-18/FS-19 extension notes
docs/SECURITY_CHECKS.md — add FinServ section

Compliance frameworks mapped

SR 11-7, FFIEC CAT, NYDFS 500.06, PCI-DSS 12.3.2, DORA Art.6, MAS TRM 9, ISO 27001 A.12, ECOA, OWASP LLM Top 10

Note: Compliance mappings are preliminary and illustrative. They have not been reviewed by AWS Security Assurance Services or external auditors. Each firm should validate mappings with their own MRM/Legal/Compliance teams before relying on them as audit evidence.

Testing

Local quality gates (all passed):

Gate	Result
`ruff check` on all changed `.py` files	✅ All checks passed
`ruff format --check` on all changed `.py` files	✅ All files formatted
`cfn-lint` on all 5 CFN templates (with `.cfnlintrc`)	✅ Exit 0
`sam validate --lint` on both SAM templates	✅ Valid SAM Template
`sam build`	✅ Build Succeeded
`ast.parse` on all new/modified Python files	✅ All parse clean

ASH v3.2.6 local scan summary:

Scanner	Suppressed	Actionable	Result
bandit	0	2 (upstream `consolidate_html_reports.py`)	FAILED
cdk-nag	0	51 (all pre-existing upstream patterns)	FAILED
checkov	56	22 (all pre-existing upstream patterns)	FAILED
detect-secrets	0	0	PASSED
semgrep	0	0	PASSED
npm-audit	0	0	PASSED

All remaining actionable findings are pre-existing upstream patterns (no-VPC Lambda, no-DLQ, wildcard IAM for list/describe operations). The CI ASH workflow uses continue-on-error: true and is informational only. The 56 Checkov suppressions are documented in .ash/.ash.yaml with justifications referencing upstream convention.

Known follow-up items

HTML report rendering — report_template.py currently renders Bedrock/SageMaker/AgentCore sections. Adding a fourth FinServ section requires extending the HTML template, JavaScript filter wiring, and format-string variables. This is a non-trivial change best done in a follow-up PR to keep this one reviewable.
Compliance placeholder wiring — Each check includes a COMPLIANCE_PLACEHOLDER comment listing the applicable FinServ regulatory frameworks. The prototype report owner should wire these into the HTML report's compliance-standards column.
Upstream extension code changes — The 5 upstream extensions are currently added as source-code comments (extension notes). If the maintainers want the detection logic itself extended (e.g., SM-07 checking emit_metrics=Enabled), that can be done in a follow-up PR targeting those specific checks.

Implements 64 standalone security checks plus 5 upstream extensions derived from the AWS guide for Financial Services risk management of the use of Generative AI (March 2026): https://d1.awsstatic.com/onedam/marketing-channels/website/public/global-FinServ-ComplianceGuide-GenAIRisks-public.pdf Covers 15 risk categories across FS-01 to FS-69: FS-01..06 Unbounded Consumption (WAF/Shield, rate limiting, token quotas, cost monitoring) FS-07..11 Excessive Agency (action boundaries, AgentCore Policy, transaction limits, rate alarms) FS-12..16 Supply Chain Vulnerabilities (SCPs, model inventory, onboarding governance, adversarial evaluation) FS-17..21 Training Data & Model Poisoning (Model Monitor, data drift, Model Registry, Feature Store rollback) FS-22..26 Vector & Embedding Weaknesses (KB least privilege, CloudTrail logging, metadata filtering, multi-tenancy) FS-27..30 Non-Compliant Output (Automated Reasoning, RAG controls, human-in-the-loop, compliance guardrails) FS-31..34 Misinformation (KB data quality, source attribution, integrity monitoring, sync cadence) FS-35..38 Abusive or Harmful Output (FMEval, user reporting, allowlists, AI Service Cards) FS-39..42 Biased Output (SageMaker Clarify, Bedrock Evaluations, bias datasets, AI Service Cards) FS-43..46 Sensitive Information Disclosure (CloudWatch log masking, Macie, PII pre-processing, data classification) FS-47..50 Hallucination (contextual grounding, Automated Reasoning, disclaimers, RAG) FS-51..54 Prompt Injection (input sanitization, parameterized queries, pen testing, SDK currency) FS-55..58 Improper Output Handling (output validation, sanitization, encoding, XSS prevention) FS-59..60 Off-Topic & Inappropriate Output (contextual grounding, topic allowlists) FS-61..63 Out-of-Date Training Data (KB sync cadence, data currency disclaimers, FM version updates) FS-64..69 Material Gap Checks (guardrail trace logging, KB data-source S3 event notifications, AgentCore end-user identity propagation, agent financial transaction value thresholds, API Gateway request body size limits, prompt input validation) 5 checks contributed as upstream extensions (not standalone entries): FS-17 -> SM-07 Model Monitor data quality baseline cadence FS-18 -> SM-23 Model drift low-entropy classification monitoring FS-19 -> SM-22 Model Registry PendingManualApproval default enforcement FS-23 -> BR-06 Knowledge Base data-plane CloudTrail event selectors FS-64 -> BR-04 Guardrail trace logging verification New files: functions/security/finserv_assessments/app.py (64 check functions) functions/security/finserv_assessments/schema.py (Finding schema) functions/security/finserv_assessments/requirements.txt functions/security/finserv_assessments/__init__.py docs/SECURITY_CHECKS_FINSERV_COMMON.md docs/SECURITY_CHECKS_FINSERV_PART1_INFRA_CONTROLS.md docs/SECURITY_CHECKS_FINSERV_PART2_GUARDRAILS_CONTENT_SAFETY.md docs/SECURITY_CHECKS_FINSERV_PART3_APP_LAYER_AND_GAPS.md docs/AIMLSecurityAssessment-MappingsTable.csv .ash/.ash.yaml (ASH v3.2.6 config with documented suppressions) Modified files: template.yaml + template-multi-account.yaml - Add FinServSecurityAssessmentFunction resource - Wire DefinitionSubstitutions and LambdaInvokePolicy statemachine/assessments.asl.json - Add FinServ branch to Run Security Assessments parallel state deployment/1-aiml-security-member-roles.yaml deployment/aiml-security-single-account.yaml - Add FinServ GenAI Risk Assessment Permissions statement functions/security/generate_consolidated_report/app.py - Load finserv_security_report_{execution_id}.csv into consolidated report functions/security/bedrock_assessments/app.py - Add FS-64 (BR-04) and FS-23 (BR-06) extension notes functions/security/sagemaker_assessments/app.py - Add FS-17 (SM-07), FS-18 (SM-23), FS-19 (SM-22) extension notes docs/SECURITY_CHECKS.md - Add FinServ section pointing to the four new reference files Compliance frameworks mapped (preliminary, for MRM/compliance team review): SR 11-7, FFIEC CAT, NYDFS 500.06, PCI-DSS 12.3.2, DORA Art.6, MAS TRM 9, ISO 27001 A.12, ECOA, OWASP LLM Top 10 Quality gates passed locally: ruff check + ruff format --check: clean cfn-lint (with .cfnlintrc): exit 0 on all 5 CFN templates sam validate --lint: valid on both SAM templates sam build: Build Succeeded ASH v3.2.6 local scan: detect-secrets PASSED, semgrep PASSED, npm-audit PASSED; remaining Checkov/cdk-nag findings are pre-existing upstream patterns (CI uses continue-on-error: true for ASH) Note: HTML report rendering for FinServ findings (fourth section in report_template.py) is a follow-up item. The finserv_security_report_*.csv is generated and available in S3 for downstream consumption. Closes aws-samples#22

mehtadman87 · 2026-04-29T16:48:02Z

@vivekmittal514 @agasthik — PR is ready for review and merge. All CI checks pass (cfn-lint, python-lint, sam-validate, ASH scan). The only failing check is CodeQL Advanced, which has been failing on upstream main for 7+ days due to a pre-existing repo configuration conflict (default setup + advanced workflow both enabled) — not caused by this PR.

Happy to address any questions on the FS-XX mapping to the AWS FinServ GenAI Risk Guide (March 2026). The full check catalog is in docs/SECURITY_CHECKS_FINSERV_PART*.md.

FS-07: Wrap get_agent() in try/except ClientError so agents encrypted with KMS keys the caller cannot decrypt are skipped rather than crashing the entire check with ERROR status. FS-33: Wrap get_bucket_versioning() in try/except ClientError so deleted or cross-account KB data-source buckets are recorded as '(access error)' and the check continues rather than returning ERROR. Both fixes follow the defensive pattern already used in FS-65's get_bucket_notification_configuration() call. Discovered and verified in Phase 2 live integration testing against account 469898429403 (us-east-1). sim: aws-samples#22

- Bump check count from 52 to 116 (52 core + 64 FinServ) throughout README - Add Financial Services GenAI Risk to title, key features, services covered, How It Works module list, member role permissions, and report structure - Add finserv_security_report_{execution_id}.csv to individual account report file listing - Update SECURITY_CHECKS.md overview table, TOC, and check ID convention table to include FS-XX prefix and 64-check count - SECURITY_CHECKS.md already contains the full FinServ section (added in the original feat commit); this commit updates the summary tables only sim: aws-samples#22

mehtadman87 · 2026-05-01T17:51:40Z

PR Update — Phase 2 & 3 Testing Complete ✅

Author: @mehtadman87 (self-assigned to issue #22)

What's new in this update (2 commits since initial PR)

fix(finserv): Handle per-agent and per-bucket ClientErrors gracefully

FS-07 (check_bedrock_agent_action_boundaries): get_agent() now catches ClientError per-agent so KMS-encrypted agents that the caller cannot decrypt are skipped gracefully instead of crashing the entire check with ERROR status.
FS-33 (check_knowledge_base_integrity_monitoring): get_bucket_versioning() now catches ClientError per-bucket so deleted or cross-account KB data-source buckets are recorded as (access error) and the check continues.

Both bugs were discovered during Phase 2 live integration testing against an AWS account in us-east-1.

docs: Update README and SECURITY_CHECKS for FinServ addition

README: title, check count (52 → 116), key features, services covered, How It Works, member role permissions, report structure, documentation table.
SECURITY_CHECKS.md: overview table, TOC, and check ID convention table updated to include FS-XX prefix and 64-check count.

Testing completed

Phase	Status	Details
Phase 1 — Unit tests	✅ 315 tests, 99% coverage	`pytest tests/` — all passing
Phase 2 — Live boto3	✅ 64/64 checks ran, 0 ERRORs	Direct Python invocation against account 469898429403
Phase 2 — `sam local invoke`	✅ statusCode 200, 65 CSV rows	Finch container, real AWS credentials
Phase 3 — Full stack deploy	✅ Step Functions SUCCEEDED	Stack `aiml-sec-finserv`, all 4 parallel branches completed
ASH scan	✅ 0 Critical, 0 High, 0 Medium	bandit, checkov, detect-secrets, semgrep all PASSED

Requesting review

@vivekmittal514 @agasthik — ready for your review. Happy to address any feedback.

mehtadman87 · 2026-05-01T18:04:12Z

PR Update — Phase 2 & 3 Testing Complete ✅

Author: @mehtadman87 (self-assigned to issue #22)

What's new in this update (2 commits since initial PR)

fix(finserv): Handle per-agent and per-bucket ClientErrors gracefully

FS-07 (check_bedrock_agent_action_boundaries): get_agent() now catches ClientError per-agent so KMS-encrypted agents that the caller cannot decrypt are skipped gracefully instead of crashing the entire check with ERROR status.
FS-33 (check_knowledge_base_integrity_monitoring): get_bucket_versioning() now catches ClientError per-bucket so deleted or cross-account KB data-source buckets are recorded as (access error) and the check continues.

Both bugs were discovered during Phase 2 live integration testing against an aws account in us-east-1.

docs: Update README and SECURITY_CHECKS for FinServ addition

README: title, check count (52 → 116), key features, services covered, How It Works, member role permissions, report structure, documentation table.
SECURITY_CHECKS.md: overview table, TOC, and check ID convention table updated to include FS-XX prefix and 64-check count.

Testing completed

Phase	Status	Details
Phase 1 — Unit tests	✅ 315 tests, 99% coverage	`pytest tests/` — all passing
Phase 2 — Live boto3	✅ 64/64 checks ran, 0 ERRORs	Direct Python invocation against account 469898429403
Phase 2 — `sam local invoke`	✅ statusCode 200, 65 CSV rows	Finch container, real AWS credentials
Phase 3 — Full stack deploy	✅ Step Functions SUCCEEDED	Stack `aiml-sec-finserv`, all 4 parallel branches completed
ASH scan	✅ 0 Critical, 0 High, 0 Medium	bandit, checkov, detect-secrets, semgrep all PASSED

CI status

5/6 checks pass. The CodeQL Advanced check fails — this is a pre-existing failure on main (confirmed by checking the main branch CI history). It is not caused by this PR.

Requesting review

@vivekmittal514 @agasthik — ready for your review. Happy to address any feedback.

vivekmittal514 · 2026-06-01T21:16:41Z

PR #23 Review: FinServ GenAI Risk Checks against the AWS FinServ Guide

Overall verdict: Strong PR. The 64 checks map faithfully to the guide's 15 risk categories, the consolidation logic is sound, and the documentation is thorough. There are a handful of code bugs and design gaps worth addressing before merge.

Coverage vs. the PDF Guide

The PDF defines 15 risk categories (§1.2.1–§1.2.15) with explicit mitigations. The PR covers all 15. Traceability is clearly marked [PDF §x.y.z] vs [PDF §x.y.z, extension] throughout the docs. The 5 consolidations (FS-17→SM-07, FS-18→SM-23, FS-19→SM-22, FS-23→BR-06, FS-64→BR-04) are well-reasoned and documented. No PDF risk category is missing.

One gap: the PDF's §1.2.1 (Non-Compliant Output) "Practical guidance" callout explicitly says to use existing compliance materials (employee policies, training materials, procedure documents, incident reports) to author Bedrock guardrail denied-topic policies. FS-28 checks for FinServ denied topics but the remediation doesn't mention using these internal materials as the source — a minor doc-only gap.

Code Issues

Bug 1 — FS-03: Wrong proxy for "quota reviewed" (app.py:287)

default_only = all(not q.get("Adjustable") for q in tpm_quotas + rpm_quotas)

Adjustable means the quota can be increased, not that it has been. All Bedrock TPM/RPM quotas are adjustable by nature, so default_only will almost always be True, causing the check to always report Failed. The doc correctly says to compare applied quota value against the default via ListAWSDefaultServiceQuotas — but that API call isn't in the implementation. The code needs to call list_aws_default_service_quotas(ServiceCode="bedrock") and compare Value against the applied value.

Bug 2 — FS-06: Only checks deprecated CostFilters, misses new-style budgets (app.py:451)

svc in json.dumps(b.get("CostFilters", {})).lower()

The doc correctly notes that new budgets use FilterExpression. The code only checks CostFilters. Any budget created after AWS deprecated that field will be missed, producing false WARN results on modern accounts.

Bug 3 — FS-04: Overly broad DIMENSIONAL monitor match (app.py:327)

or m.get("MonitorType") == "DIMENSIONAL"

This treats any DIMENSIONAL monitor as covering Bedrock (e.g., MonitorDimension=LINKED_ACCOUNT). The doc correctly says only MonitorDimension=SERVICE provides coverage. The code should check m.get("MonitorDimension") == "SERVICE".

Bug 4 — logger level set to ERROR suppresses all operational info (app.py:67)

logger.setLevel(logging.ERROR)

All logger.warning() calls throughout the file (e.g., "Could not describe agent") are silently dropped. Lambda logs will be nearly empty on auth/permission errors during partial-access runs. This should be logging.WARNING or logging.INFO for a diagnostic Lambda.

Bug 5 — FS-03: Empty list corner case causes misleading PASS

When tpm_quotas + rpm_quotas is empty (e.g., if the list_service_quotas call returns nothing), all(...) on an empty iterable returns True, so default_only = True → status "Failed". That part is correct. But details will say "Found 0 token-based and 0 request-based Bedrock quotas" with no explanation — the caller sees a Failed finding with no actionable context about why the quota list was empty (API throttle? Wrong region? Permission issue?). Worth adding an explicit branch.

Design Concerns

D1 — Advisory checks mixed into the pass/fail report without a distinct status

Checks like check_hallucination_disclaimer_advisory(), check_compliance_disclaimer_in_outputs(), check_output_encoding_advisory(), and check_data_currency_disclaimer_advisory() always return N/A status because "no AWS API can verify application-level disclaimers." These are valuable but they will show up in the consolidated report alongside real pass/fail results. The severity rubric defines an "Advisory" severity tier, but the StatusEnum in schema.py only has Failed, Passed, N/A — it doesn't include Advisory. Consumers of the CSV won't be able to filter advisory guidance from actionable findings. Consider either adding Advisory to StatusEnum or using a consistent N/A + Informational severity combination with a standard finding name prefix (e.g., "ADVISORY — ") so report consumers can filter.

D2 — Known follow-up: HTML report rendering is incomplete

The PR body explicitly flags this: the HTML report template doesn't render the FinServ section. For a sample repo used in customer demos, shipping a Lambda that writes a CSV but produces no visible output in the HTML report is a UX regression. Worth either blocking merge on this or very prominently documenting it in the README.

D3 — COMPLIANCE_PLACEHOLDER comments are not wired to the report

The doc disclaimer is clear. However, the placeholder text appears in Python docstrings, not in the Finding or CSV output. If someone ingests the CSV expecting compliance mappings, they won't find them. The docs/AIMLSecurityAssessment-MappingsTable.csv helps, but the gap between the code comment and the delivered report artifact may confuse integrators.

IAM Permissions

The FinServGenAIRiskAssessmentPermissions IAM statement in deployment/1-aiml-security-member-roles.yaml is well-scoped and read-only. All actions are list/describe/get variants. Resource: "*" is appropriate for inventory-style checks where ARNs aren't known in advance. One omission: bedrock-agentcore:ListGateways, bedrock-agentcore:GetGateway, and bedrock-agentcore:ListAgentRuntimes are called by check_agentcore_policy_engine() (FS-08) but are not in the IAM statement — those calls will fail with AccessDenied at runtime.

Documentation Quality

The three-part doc split is sensible. The PDF traceability tagging ([PDF §x.y.z] vs extension) is rigorous and reviewer-friendly. The upstream-overlap consolidation table is the strongest part of the PR — the 5-factor analysis for each overlap is methodical and will save future maintainers significant effort.

The compliance disclaimer in SECURITY_CHECKS_FINSERV_COMMON.md is appropriately conservative. One small doc issue: two places in the common doc say "Add this content to docs/SECURITY_CHECKS.md in the forked repository" — these look like stale copy-paste from a planning document and should be removed before merge.

Summary of actionable items

Priority | Item -- | -- Must fix | Bug 1: FS-03 quota detection uses wrong field — add ListAWSDefaultServiceQuotas comparison Must fix | Bug 3: FS-04 DIMENSIONAL monitor match too broad — check MonitorDimension=SERVICE Must fix | Missing IAM perms: bedrock-agentcore:ListGateways/GetGateway/ListAgentRuntimes not in member role Should fix | Bug 2: FS-06 only checks deprecated CostFilters, not FilterExpression Should fix | Bug 4: logger.setLevel(logging.ERROR) drops warnings — set to WARNING Should fix | D1: Add Advisory to StatusEnum or establish a filter convention for advisory checks Nice to have | HTML report rendering for FinServ section (tracked as known follow-up) Cleanup | Remove two stale "Add this content to SECURITY_CHECKS.md" lines from the common doc

PR #23 Review: FinServ GenAI Risk Checks against the AWS FinServ Guide
Overall verdict: Strong PR. The 64 checks map faithfully to the guide's 15 risk categories, the consolidation logic is sound, and the documentation is thorough. There are a handful of code bugs and design gaps worth addressing before merge.

Coverage vs. the PDF Guide
The PDF defines 15 risk categories (§1.2.1–§1.2.15) with explicit mitigations. The PR covers all 15. Traceability is clearly marked [PDF §x.y.z] vs [PDF §x.y.z, extension] throughout the docs. The 5 consolidations (FS-17→SM-07, FS-18→SM-23, FS-19→SM-22, FS-23→BR-06, FS-64→BR-04) are well-reasoned and documented. No PDF risk category is missing.

One gap: the PDF's §1.2.1 (Non-Compliant Output) "Practical guidance" callout explicitly says to use existing compliance materials (employee policies, training materials, procedure documents, incident reports) to author Bedrock guardrail denied-topic policies. FS-28 checks for FinServ denied topics but the remediation doesn't mention using these internal materials as the source — a minor doc-only gap.

Code Issues
Bug 1 — FS-03: Wrong proxy for "quota reviewed" (app.py:287)

default_only = all(not q.get("Adjustable") for q in tpm_quotas + rpm_quotas)
Adjustable means the quota can be increased, not that it has been. All Bedrock TPM/RPM quotas are adjustable by nature, so default_only will almost always be True, causing the check to always report Failed. The doc correctly says to compare applied quota value against the default via ListAWSDefaultServiceQuotas — but that API call isn't in the implementation. The code needs to call list_aws_default_service_quotas(ServiceCode="bedrock") and compare Value against the applied value.

Bug 2 — FS-06: Only checks deprecated CostFilters, misses new-style budgets (app.py:451)

svc in json.dumps(b.get("CostFilters", {})).lower()
The doc correctly notes that new budgets use FilterExpression. The code only checks CostFilters. Any budget created after AWS deprecated that field will be missed, producing false WARN results on modern accounts.

Bug 3 — FS-04: Overly broad DIMENSIONAL monitor match (app.py:327)

or m.get("MonitorType") == "DIMENSIONAL"
This treats any DIMENSIONAL monitor as covering Bedrock (e.g., MonitorDimension=LINKED_ACCOUNT). The doc correctly says only MonitorDimension=SERVICE provides coverage. The code should check m.get("MonitorDimension") == "SERVICE".

Bug 4 — logger level set to ERROR suppresses all operational info (app.py:67)

logger.setLevel(logging.ERROR)
All logger.warning() calls throughout the file (e.g., "Could not describe agent") are silently dropped. Lambda logs will be nearly empty on auth/permission errors during partial-access runs. This should be logging.WARNING or logging.INFO for a diagnostic Lambda.

Bug 5 — FS-03: Empty list corner case causes misleading PASS

When tpm_quotas + rpm_quotas is empty (e.g., if the list_service_quotas call returns nothing), all(...) on an empty iterable returns True, so default_only = True → status "Failed". That part is correct. But details will say "Found 0 token-based and 0 request-based Bedrock quotas" with no explanation — the caller sees a Failed finding with no actionable context about why the quota list was empty (API throttle? Wrong region? Permission issue?). Worth adding an explicit branch.

Design Concerns
D1 — Advisory checks mixed into the pass/fail report without a distinct status

Checks like check_hallucination_disclaimer_advisory(), check_compliance_disclaimer_in_outputs(), check_output_encoding_advisory(), and check_data_currency_disclaimer_advisory() always return N/A status because "no AWS API can verify application-level disclaimers." These are valuable but they will show up in the consolidated report alongside real pass/fail results. The severity rubric defines an "Advisory" severity tier, but the StatusEnum in schema.py only has Failed, Passed, N/A — it doesn't include Advisory. Consumers of the CSV won't be able to filter advisory guidance from actionable findings. Consider either adding Advisory to StatusEnum or using a consistent N/A + Informational severity combination with a standard finding name prefix (e.g., "ADVISORY — ") so report consumers can filter.

D2 — Known follow-up: HTML report rendering is incomplete

The PR body explicitly flags this: the HTML report template doesn't render the FinServ section. For a sample repo used in customer demos, shipping a Lambda that writes a CSV but produces no visible output in the HTML report is a UX regression. Worth either blocking merge on this or very prominently documenting it in the README.

D3 — COMPLIANCE_PLACEHOLDER comments are not wired to the report

The doc disclaimer is clear. However, the placeholder text appears in Python docstrings, not in the Finding or CSV output. If someone ingests the CSV expecting compliance mappings, they won't find them. The docs/AIMLSecurityAssessment-MappingsTable.csv helps, but the gap between the code comment and the delivered report artifact may confuse integrators.

IAM Permissions
The FinServGenAIRiskAssessmentPermissions IAM statement in deployment/1-aiml-security-member-roles.yaml is well-scoped and read-only. All actions are list/describe/get variants. Resource: "*" is appropriate for inventory-style checks where ARNs aren't known in advance. One omission: bedrock-agentcore:ListGateways, bedrock-agentcore:GetGateway, and bedrock-agentcore:ListAgentRuntimes are called by check_agentcore_policy_engine() (FS-08) but are not in the IAM statement — those calls will fail with AccessDenied at runtime.

Documentation Quality
The three-part doc split is sensible. The PDF traceability tagging ([PDF §x.y.z] vs extension) is rigorous and reviewer-friendly. The upstream-overlap consolidation table is the strongest part of the PR — the 5-factor analysis for each overlap is methodical and will save future maintainers significant effort.

The compliance disclaimer in SECURITY_CHECKS_FINSERV_COMMON.md is appropriately conservative. One small doc issue: two places in the common doc say "Add this content to docs/SECURITY_CHECKS.md in the forked repository" — these look like stale copy-paste from a planning document and should be removed before merge.

Summary of actionable items

Priority Item
Must fix Bug 1: FS-03 quota detection uses wrong field — add ListAWSDefaultServiceQuotas comparison
Must fix Bug 3: FS-04 DIMENSIONAL monitor match too broad — check MonitorDimension=SERVICE
Must fix Missing IAM perms: bedrock-agentcore:ListGateways/GetGateway/ListAgentRuntimes not in member role
Should fix Bug 2: FS-06 only checks deprecated CostFilters, not FilterExpression
Should fix Bug 4: logger.setLevel(logging.ERROR) drops warnings — set to WARNING
Should fix D1: Add Advisory to StatusEnum or establish a filter convention for advisory checks
Nice to have HTML report rendering for FinServ section (tracked as known follow-up)
Cleanup Remove two stale "Add this content to SECURITY_CHECKS.md" lines from the common doc###

Python Lint Error - Must Fix:
aiml-security-assessment/functions/security/sagemaker_assessments/app.py
Error: aiml-security-assessment/functions/security/finserv_assessments/app.py:4022:9: F841 Local variable apigwv2 is assigned to but never used
help: Remove assignment to unused variable apigwv2
Error: Process completed with exit code 1.

CodeQL Error - Must Fix
It looks like when you forked the PR, at that time you had following file .github/workflows/codeql.yml. This is removed from the main gitub repo but since it's in your fork, GitHub runs workflows from the fork's .github/workflows/ directory and hence this error is coming. Please perform following steps on your fork repo to remove this error -

git checkout main
git rm .github/workflows/codeql.yml
git commit -m "chore: remove codeql.yml — upstream uses default CodeQL setup"
git push

vivekmittal514

Summary of actionable items

Must fix Bug 1: FS-03 quota detection uses wrong field — add ListAWSDefaultServiceQuotas comparison
Must fix Bug 3: FS-04 DIMENSIONAL monitor match too broad — check MonitorDimension=SERVICE
Must fix Missing IAM perms: bedrock-agentcore:ListGateways/GetGateway/ListAgentRuntimes not in member role
Should fix Bug 2: FS-06 only checks deprecated CostFilters, not FilterExpression
Should fix Bug 4: logger.setLevel(logging.ERROR) drops warnings — set to WARNING
Should fix D1: Add Advisory to StatusEnum or establish a filter convention for advisory checks
Nice to have HTML report rendering for FinServ section (tracked as known follow-up)
Cleanup Remove two stale "Add this content to SECURITY_CHECKS.md" lines from the common doc###

Python Lint Error - Must Fix:
aiml-security-assessment/functions/security/sagemaker_assessments/app.py
Error: aiml-security-assessment/functions/security/finserv_assessments/app.py:4022:9: F841 Local variable apigwv2 is assigned to but never used
help: Remove assignment to unused variable apigwv2
Error: Process completed with exit code 1.

CodeQL Error - Must Fix
It looks like when you forked the PR, at that time you had following file .github/workflows/codeql.yml. This is removed from the main gitub repo but since it's in your fork, GitHub runs workflows from the fork's .github/workflows/ directory and hence this error is coming. Please perform following steps on your fork repo to remove this error -

git checkout main
git rm .github/workflows/codeql.yml
git commit -m "chore: remove codeql.yml — upstream uses default CodeQL setup"
git push

Review Details

Overall verdict: Strong PR. The 64 checks map faithfully to the guide's 15 risk categories, the consolidation logic is sound, and the documentation is thorough. There are a handful of code bugs and design gaps worth addressing before merge.

Coverage vs. the PDF Guide

The PDF defines 15 risk categories (§1.2.1–§1.2.15) with explicit mitigations. The PR covers all 15. Traceability is clearly marked [PDF §x.y.z] vs [PDF §x.y.z, extension] throughout the docs. The 5 consolidations (FS-17→SM-07, FS-18→SM-23, FS-19→SM-22, FS-23→BR-06, FS-64→BR-04) are well-reasoned and documented. No PDF risk category is missing.

One gap: the PDF's §1.2.1 (Non-Compliant Output) "Practical guidance" callout explicitly says to use existing compliance materials (employee policies, training materials, procedure documents, incident reports) to author Bedrock guardrail denied-topic policies. FS-28 checks for FinServ denied topics but the remediation doesn't mention using these internal materials as the source — a minor doc-only gap.

Code Issues

Bug 1 — FS-03: Wrong proxy for "quota reviewed" (app.py:287)

default_only = all(not q.get("Adjustable") for q in tpm_quotas + rpm_quotas)

Adjustable means the quota can be increased, not that it has been. All Bedrock TPM/RPM quotas are adjustable by nature, so default_only will almost always be True, causing the check to always report Failed. The doc correctly says to compare applied quota value against the default via ListAWSDefaultServiceQuotas — but that API call isn't in the implementation. The code needs to call list_aws_default_service_quotas(ServiceCode="bedrock") and compare Value against the applied value.

Bug 2 — FS-06: Only checks deprecated CostFilters, misses new-style budgets (app.py:451)

svc in json.dumps(b.get("CostFilters", {})).lower()

The doc correctly notes that new budgets use FilterExpression. The code only checks CostFilters. Any budget created after AWS deprecated that field will be missed, producing false WARN results on modern accounts.

Bug 3 — FS-04: Overly broad DIMENSIONAL monitor match (app.py:327)

or m.get("MonitorType") == "DIMENSIONAL"

This treats any DIMENSIONAL monitor as covering Bedrock (e.g., MonitorDimension=LINKED_ACCOUNT). The doc correctly says only MonitorDimension=SERVICE provides coverage. The code should check m.get("MonitorDimension") == "SERVICE".

Bug 4 — logger level set to ERROR suppresses all operational info (app.py:67)

logger.setLevel(logging.ERROR)