Release v6.5.0 into Main#921
Merged
Merged
Conversation
* add context window to models * test coverage * pre-commit * pr comments * tests * pre-commit * metrics dashboard token usage update * unit tests * cr comments * pre-commit --------- Co-authored-by: jmharold <jmharold@amazon.com> Co-authored-by: Evan Stohlmann <estohlmann@gmail.com>
…ly runs with cleanup
* fix streamed guardrail responses --------- Co-authored-by: jmharold <jmharold@amazon.com>
Replace aws_s3_tools with aws_operator_tools for broader AWS API coverage. Update mcp-workbench docs and README; refresh package-lock.json. Made-with: Cursor
| } | ||
|
|
||
| const accountNumbersEcr = getConfigArray('.accountNumbersEcr[]'); | ||
| const ecrAccounts = [...new Set([...accountNumbersEcr, accountNumber].filter(Boolean))]; |
estohlmann
approved these changes
Apr 6, 2026
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
v6.5.0
Key Features
Self-Service RAG Administration
A new RAG Admin role gives designated users full control over RAG repository operations, document ingestion, collection management, and pipeline configuration without granting full system administrator privileges. This reduces the operational bottleneck where every RAG change required a system administrator. Self-service RAG is especially useful in multi-tenant environments.
Operations Metrics Dashboard
New dashboard reports track metrics across models and clusters, including inference latency, token usage, and batch ingestion job status. For example, customers can use the new input/output token reports to derive costs across users, groups, and models. This is useful for multi-tenant environments with a variety of end-user orgs. Also, model containers publish Prometheus metrics for vLLM, TEI, and TGI, and batch ingestion jobs report totals and failures for RAG document ingestion.
Integrating Externally Deployed Models
Administrators can register deployed models that are not LISA-managed by providing a URL that can be accessed from the LiteLLM ECS cluster. These models appear and behave like other models in the platform.
AWS Session Credentials
LISA now lets you attach AWS credentials to a chat session. While that session is active, MCP tools can use those credentials to call AWS APIs, so tool-based workflows can reach AWS resources in the same context as the conversation instead of requiring separate per-tool setup.
An example of a tool using this can be seen: lib/serve/mcp-workbench/src/examples/sample_tools/aws_operator_tools.py
Other Key Changes
Acknowledgements
Full Changelog: https://github.com/awslabs/LISA/compare/v6.4.0..v6.5.0