Improvement: Simplified management for MCP Servers
MCP Server integrations are now standalone, top-level resources with their own permissions and simplified management.
More Updates
- Added support for Google Gemini image models
- Tiered pricing in Google Vertex AI models
- Behavior Change Guardrails would now run in parallel to reduce latency in AI Gateway. Read more
- Added support for certificate based auth in Azure OpenAI Integration
- Added suport for External identity in MCP Gateway
Release instructions
- Update
truefoundryhelm chart version to0.112.1.
Create your own custom roles and assign to User
You can now create your custom tenant level roles and assign to users. Read more
More Updates
- Bugfix - fixed SLA cutoff in priority based routing config
- Added support for xAI model provider in AI Gateway
- Added Request Failure metrics for tools in MCP metrics
- Behavior Change: For production GPU workloads, truefoundry now automatically adds pod disruption budget for max availbility of 25%. This reduces disruption of GPU workloads in case of node consolidation.
Release instructions
- Update
truefoundryhelm chart version to0.111.2.
New: Support added for dynamic TrueFoundry authentication for MCP access
When using MCP Servers in Cursor/VSCode, you can now use OAuth for authentication without need to hardcode the token in mcp.jsonMore Updates
- Behavior Change In AWS Paramter store, we now store the secrets as
SecureStringinstead ofStringparameter type. Read more - A request body size limit added in AI Gateway requests (default: 50 MB)
- Bugfix - Fixed Assumed Role based auth for AWS Bedrock Guardrails
- Bugfix - Removed default request timeout of 5 min within AI Gateway
Release instructions
- Update
truefoundryhelm chart version to0.110.3.
New: Support for SCIM
TrueFoundry now supports SCIM for SAML based SSO. SCIM enabled automatic user/team management using IdP users/groups. Read more
Improved Rate Limit Config
- Rule IDs must be static (no
{}placeholders). Userate_limit_applies_perto create per-entity rate limit instead of dynamic rule IDs. Read more
More Updates
- Added support for API key based auth in AWS Bedrock model integration.
- Behavior Change: Tenant Admin can now access all entities(Models, MCP Server, Guardrail, Agent) in AI Gateway.
Release instructions
- Update
truefoundryhelm chart version to0.109.3.
New: Request Caching
You can now support both Exact match and Semantic caching in AI Gateway requests. Read moreMore Updates
- Error message improvement in Self-Hosted models response via AI Gateway.
- Added Embedding model support in Cloudera.
Release instructions
- Update
truefoundryhelm chart version to0.107.1.
New: Support for External Identity
You can now use externally vended JWT tokens to authenticate to TrueFoundry. Read moreMore Updates
- Fixed Gemini 3 Pro Model usage in Agent Response.
-
Added support for custom Slug in Model integrations.

- Added TrueFoundry integration with Goose. Read more
Release instructions
- Update
truefoundryhelm chart version to0.106.2.
New: Configure location to store your AI Gateway Request and Metrics
We have added support to configure location on which AI Gateway Request and Metrics would be stored. This helps in complying with local Data Residency laws and privacy policies..
This feature is only available in SaaS TrueFoundry AI Gateway
Updates and Bug Fixes
- Added support for Finetune API in Vertex Model as well. Read more
- Added support for
thought_singnaturein Google Gemini and Vertex model response
Release instructions
- Update
truefoundryhelm chart version to0.104.2.
Updates and Bug Fixes
- Added
404in status codes used for default Fallback. - Added support for GCP workload identify authentication for Vertex models. Read more
- Added support for Media resolution in Vertex Models. Read more
- Added support for
noneas value inreasoning_effort. Read more
Release instructions
- Update
truefoundryhelm chart version to0.103.1.
New: we now support Virtual Models
Create reusable virtual models with intelligent routing configurations to distribute requests across multiple model providers. Read more
More Updates
-
Updates on Prisma Guardrail:
- We now pass
tfy.request.conversation_idandtraceIdto all the request allowing to group messages. - We slice the payload to max size of 1.5 MB when sending to Prisma.
- We now pass
-
New Routing Metrics: visualize effect of different AI Gateway configs like Rate limit, Budget limit, Routing, Load Balancing or Fallback.

-
New Metrics introduced:
- Latency per output token
-
Model usage per user and per model

- Added basic tracing support for requests via Gemini CLI.
- Added support for AWS IAM role based auth in AWS SQS based async service. Read more
Release instructions
- Update
truefoundryhelm chart version to0.102.5.
Improved Budget Limit Config
- Added support for Budget per week.
- Windows for budget usage will now be a fixed. (Day starts with 00:00 UTC, Week starts with Mon, Month starts with 1st) Read more
- Added support for setting Alerts to get warned before limit reaches. Read more

More Updates
- Behavior Change: Rate limit and Budget limit rules will now be evaluated for all matching rules and the first one in order will be applied. This enables user to set priority of rules by adjusting the order in config.
- We now support AWS IAM role based auth when adding AWS bedrock integrations in TrueFoundry SaaS Gateway. Read more
- Bug fix: fixed Patronus AI guardrail validation, fixed adding ‘Origin’ header to API request options in Promptfoo guardrail.
-
Added support for configuring multiple AI Gateways based on BU, region, etc. Read more

Release instructions
- Update
truefoundryhelm chart version to0.101.2.


