Update rate limits configuration and add graph search endpoint by jfrench9 · Pull Request #525 · RoboFinSystems/robosystems

jfrench9 · 2026-03-22T07:10:49Z

Summary

This PR refactors the rate limiting configuration, simplifies its structure, and introduces a new search endpoint for graphs. The changes span rate limit definitions, middleware enforcement, and supporting infrastructure.

Key Accomplishments

Rate Limits Refactor

Simplified rate limits configuration (rate_limits.py): Reduced complexity with a net removal of ~40 lines, streamlining how rate limits are defined and managed across the application.
Updated rate limiting middleware to align with the revised configuration structure, ensuring consistent enforcement across routes.
Adjusted repository rate limits to reflect the new configuration approach.

Search Functionality

Added a new search endpoint in robosystems/routers/graphs/search.py (+75 lines) to support graph search capabilities.
Updated the graph configuration (.github/configs/graph.yml) to register the new search route.

SEC Adapter Update

Extended the SEC manifest (robosystems/adapters/sec/manifest.py) with additional configuration to support the new functionality.

Breaking Changes

Rate limit configuration structure has changed. Any external consumers or tooling that depend on the specific shape of rate limit definitions (e.g., subscription tiers, endpoint-level limits) should be reviewed and updated accordingly.
Middleware behavior for rate limit enforcement has been adjusted — verify that all subscription tiers are still correctly rate-limited.

Testing Notes

Updated 4 test files covering:
- test_rate_limits.py — validates the new rate limit configuration structure
- test_repository_rate_limits.py — ensures repository-level limits are correctly applied
- test_subscription_rate_limiting.py — significant rework (~60 lines changed) to align with the refactored middleware behavior
- test_subscription_rate_limits.py — updated assertions to match the simplified configuration
All existing rate limiting tests have been adapted rather than removed, indicating backward-compatible behavior where possible.
The new search endpoint should be verified for proper rate limit enforcement under all subscription tiers.

Infrastructure Considerations

The graph service configuration has been updated to include the new search route — ensure deployment pipelines pick up the updated YAML configuration.
No new dependencies or service requirements are introduced; changes are confined to configuration and application logic.

🤖 Generated with Claude Code

Branch Info:

Source: chore/search-rate-limits
Target: main
Type: chore

Co-Authored-By: Claude noreply@anthropic.com

- Adjusted API rate limits in graph.yml for production environments, reducing multipliers for team and enterprise usage. - Enhanced SEC_MANIFEST to include search rate limits for minute, hour, and day. - Introduced GRAPH_SEARCH category in rate_limits.py and updated related rate limit configurations. - Implemented search rate limiting in search.py, ensuring proper access control for shared repositories. - Updated tests to reflect changes in rate limits and added coverage for new search functionality.

jfrench9 · 2026-03-22T07:10:50Z

@claude please review this PR

claude · 2026-03-22T07:11:12Z

Claude finished @jfrench9's task in 2m 25s —— View job

PR Review: Update rate limits configuration and add graph search endpoint

Todo List:

Summary

The PR is well-structured and the search endpoint integrates cleanly with the existing dual-layer rate limiting system. However, there are two bugs, a significant (likely unintentional) breaking change in production rate limits, and a few minor issues worth addressing before merging.

🐛 Bugs

1. Hardcoded user_tier in search rate limiting (search.py:75)

user_tier = "ladybug-standard"  # ← always hardcoded

The _check_search_rate_limit function uses "ladybug-standard" for every user, bypassing the actual user subscription tier. While all tiers currently share the same base GRAPH_SEARCH limit of 10/min, this will silently break when limits are differentiated by tier. It also means base-tier users (anonymous/unrecognized) are granted ladybug-standard burst limits for search — a tier they don't have.

Fix this →

2. Hardcoded Retry-After: 60 ignores actual window (search.py:99)

headers={"Retry-After": "60"},  # ← always 60s, even if day-window was exceeded

The _check_repository_limit can return reset_in of 60, 3600, or 86400 depending on which window triggered. The Retry-After header should be set to limit_check.get("detail", {}).get("reset_in", 60) so clients can back off correctly.

Fix this →

⚠️ Potentially Breaking: Production Multiplier Reductions

The graph.yml diff reduces production api_rate_multiplier values:

ladybug-large: 2.5x → 1.5x
ladybug-xlarge: 5.0x → 2.5x

Combined with the base limit reductions in rate_limits.py (e.g., GRAPH_READ 500 → 120), the effective limits for paying customers dropped dramatically:

Tier	Category	Old effective	New effective	Change
ladybug-large	GRAPH_READ	500 × 2.5 = 1250/min	120 × 1.5 = 180/min	−86%
ladybug-xlarge	GRAPH_READ	500 × 5.0 = 2500/min	120 × 2.5 = 300/min	−88%
ladybug-large	GRAPH_WRITE	100 × 2.5 = 250/min	30 × 1.5 = 45/min	−82%
ladybug-large	GRAPH_MCP	100 × 2.5 = 250/min	30 × 1.5 = 45/min	−82%

If this is intentional (infra protection, new baseline more aligned with actual capacity), it should be clearly documented and existing customers notified. If not, the base limits for paid tiers need to be revisited.

🔍 Code Quality

3. Unused req: Request parameter (search.py:108-109, search.py:121-122)

Both search_documents and get_document_section declare req: Request but never use it. FastAPI doesn't require it to be declared if not used. Remove to reduce noise.

4. Stale test function name (test_rate_limits.py:16)

def test_unknown_tier_falls_back_to_free_limits():  # ← "free" no longer exists

The tier was renamed from "free" to "base" in this PR. The test body correctly references "base", but the function name is now misleading.

🧪 Test Gaps

5. AllowedSharedEndpoints.SEARCH not asserted (test_repository_rate_limits.py:15-21)

The new SEARCH = "search" enum member was added to AllowedSharedEndpoints, and _operation_to_category("search") is tested at line 139, but the test_values test doesn't assert AllowedSharedEndpoints.SEARCH.value == "search".

6. No test coverage for _check_search_rate_limit — the new non-shared-repo early-return path, 403 on missing repo_access, and the full rate-limit-exceeded path from search.py have no direct test.

💡 Minor Notes

The service.search_documents(graph_id, request) call at search.py:115 is synchronous (service.search_documents), but the endpoint is async. If the search service does I/O, it should be awaited or offloaded to a thread pool to avoid blocking the event loop.
The test_rate_limit_hierarchy test in test_subscription_rate_limits.py:192 will pass since large and xlarge base values equal standard (all 120/min for GRAPH_READ), but it no longer truly validates a hierarchy — consider a note or adjusting the assertion to reflect the multiplier-based approach.

Overall this is a clean refactor and the new GRAPH_SEARCH category is well integrated. The hardcoded tier and Retry-After bugs should be fixed before merge, and the production multiplier reductions deserve explicit sign-off given their impact on paying customers.

- Added 'searches_per_hour' to the service offerings rate limits in offering.py. - Updated user tier retrieval in search.py to fallback to 'ladybug-standard' if not set. - Improved HTTP exception handling to dynamically set 'Retry-After' header based on rate limit response. - Removed unused 'req' parameter from search_documents and get_document_section functions. - Updated tests to reflect changes in rate limit handling and removed unnecessary parameters.

jfrench9 · 2026-03-22T07:33:48Z

@claude can you review the latest changes

claude · 2026-03-22T07:34:10Z

Claude finished @jfrench9's task in 1m 48s —— View job

Todo List:

Gather context and understand the request
Read changed files on the current branch
Compare with previous review findings
Post updated review

Review: Latest Changes (commit `0ea1cd0`)

Good follow-up commit — all five actionable issues from the previous review have been addressed. Here's the status and what still needs attention.

✅ Issues Fixed

#	Previous Issue	Status
1	Hardcoded `user_tier = "ladybug-standard"`	Fixed via `getattr` fallback
2	Hardcoded `Retry-After: 60`	Fixed — now uses `reset_in` from response
3	Unused `req: Request` parameter	Removed from both endpoints
4	Stale test name (`_free_limits`)	Renamed to `_base_limits`
5	`AllowedSharedEndpoints.SEARCH` not asserted	Added to `test_values`

⚠️ Remaining Issue: `subscription_tier` Not on `User` Model

The tier fix at search.py:75 is:

user_tier = getattr(current_user, "subscription_tier", None) or "ladybug-standard"

subscription_tier doesn't appear to exist on the User model (no matches in robosystems/models/iam/). This means getattr always returns None, and the fallback is always "ladybug-standard" — functionally identical to the original hardcode. The real tier isn't being resolved.

Other rate-limited endpoints (e.g., subscription_aware_rate_limit_dependency in rate_limiting.py:484) hardcode "ladybug-standard" for all authenticated users too, so this is consistent with the current approach. If tier differentiation isn't needed yet, the fallback is fine — but the getattr implies there's an intent to use it later, which will silently not work until the attribute is added.

Fix this →

🔍 Still Open: Minor Code Quality

Synchronous calls in async endpoints (search.py:115, search.py:127):

return service.search_documents(graph_id, request)   # sync call in async endpoint
result = service.get_document_section(graph_id, document_id)  # same

If the search service does I/O (OpenSearch HTTP calls), these block the event loop. If it's CPU-only or already uses an async client internally, this is fine — but worth confirming.

test_rate_limit_hierarchy passes trivially (test_subscription_rate_limits.py:192):

assert standard_limit[0] <= large_limit[0]   # 120 <= 120 — always passes
assert large_limit[0] <= xlarge_limit[0]     # 120 <= 120 — always passes

The base values are intentionally identical across tiers (multiplier applied externally via graph.yml), so the hierarchy test will always pass regardless of whether differentiation is working. Consider testing the multiplier-applied values, or adding a comment explaining why equality is expected.

No test coverage for _check_search_rate_limit — the 403 path on missing repo_access, the rate-limit-exceeded 429 path, and the early return for non-shared repos remain untested.

Overall the critical bugs are resolved and this is in good shape. The subscription_tier attribute question and the sync/async concern are worth a quick decision before merge.
| Branch

jfrench9 merged commit 6ff516f into main Mar 22, 2026
7 checks passed

jfrench9 deleted the chore/search-rate-limits branch March 22, 2026 07:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update rate limits configuration and add graph search endpoint#525

Update rate limits configuration and add graph search endpoint#525
jfrench9 merged 2 commits intomainfrom
chore/search-rate-limits

jfrench9 commented Mar 22, 2026

Uh oh!

jfrench9 commented Mar 22, 2026

Uh oh!

claude bot commented Mar 22, 2026 •

edited

Loading

Uh oh!

jfrench9 commented Mar 22, 2026

Uh oh!

claude bot commented Mar 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jfrench9 commented Mar 22, 2026

Summary

Key Accomplishments

Rate Limits Refactor

Search Functionality

SEC Adapter Update

Breaking Changes

Testing Notes

Infrastructure Considerations

Uh oh!

jfrench9 commented Mar 22, 2026

Uh oh!

claude bot commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Review: Update rate limits configuration and add graph search endpoint

Summary

🐛 Bugs

⚠️ Potentially Breaking: Production Multiplier Reductions

🔍 Code Quality

🧪 Test Gaps

💡 Minor Notes

Uh oh!

jfrench9 commented Mar 22, 2026

Uh oh!

claude bot commented Mar 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review: Latest Changes (commit 0ea1cd0)

✅ Issues Fixed

⚠️ Remaining Issue: subscription_tier Not on User Model

🔍 Still Open: Minor Code Quality

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude bot commented Mar 22, 2026 •

edited

Loading

claude bot commented Mar 22, 2026 •

edited

Loading

Review: Latest Changes (commit `0ea1cd0`)

⚠️ Remaining Issue: `subscription_tier` Not on `User` Model