Truefoundry Docs

v0.112.1

2026-01-14

Improvement: Simplified management for MCP Servers

MCP Server integrations are now standalone, top-level resources with their own permissions and simplified management.

Breaking Change: We are removing MCP Server Groups. MCP Servers will no longer be nested under groups. Read more

More Updates

Added support for Google Gemini image models
Tiered pricing in Google Vertex AI models
Behavior Change Guardrails would now run in parallel to reduce latency in AI Gateway. Read more
Added support for certificate based auth in Azure OpenAI Integration
Added suport for External identity in MCP Gateway

Release instructions

Update truefoundry helm chart version to 0.112.1.

v0.111.2

2026-01-12

Create your own custom roles and assign to User

You can now create your custom tenant level roles and assign to users. Read more

More Updates

Bugfix - fixed SLA cutoff in priority based routing config
Added support for xAI model provider in AI Gateway
Added Request Failure metrics for tools in MCP metrics
Behavior Change: For production GPU workloads, truefoundry now automatically adds pod disruption budget for max availbility of 25%. This reduces disruption of GPU workloads in case of node consolidation.

Release instructions

Update truefoundry helm chart version to 0.111.2.

v0.110.3

2026-01-05

New: Support added for dynamic TrueFoundry authentication for MCP access

When using MCP Servers in Cursor/VSCode, you can now use OAuth for authentication without need to hardcode the token in mcp.json

More Updates

Behavior Change In AWS Paramter store, we now store the secrets as SecureString instead of String parameter type. Read more
A request body size limit added in AI Gateway requests (default: 50 MB)
Bugfix - Fixed Assumed Role based auth for AWS Bedrock Guardrails
Bugfix - Removed default request timeout of 5 min within AI Gateway

Release instructions

Update truefoundry helm chart version to 0.110.3.

v0.109.3

2025-12-22

New: Support for SCIM

TrueFoundry now supports SCIM for SAML based SSO. SCIM enabled automatic user/team management using IdP users/groups. Read more

Enable SCIM checkbox in SAML SSO configuration

Improved Rate Limit Config

Rule IDs must be static (no {} placeholders). Use rate_limit_applies_per to create per-entity rate limit instead of dynamic rule IDs. Read more

More Updates

Added support for API key based auth in AWS Bedrock model integration.
Behavior Change: Tenant Admin can now access all entities(Models, MCP Server, Guardrail, Agent) in AI Gateway.

Release instructions

Update truefoundry helm chart version to 0.109.3.

Breaking Change: Removed support for Dynamic Rule ID in Rate Limit config. Read more

v0.108.2

2025-12-17

New: Use git tags to deploy your code

You can now use tags to deploy your code to have better version control of your releases.

More Updates

Bug fixes and improvements

Release instructions

Update truefoundry helm chart version to 0.108.2.

v0.107.1

2025-12-15

New: Request Caching

You can now support both Exact match and Semantic caching in AI Gateway requests. Read more

More Updates

Error message improvement in Self-Hosted models response via AI Gateway.
Added Embedding model support in Cloudera.

Release instructions

Update truefoundry helm chart version to 0.107.1.

v0.106.2

2025-12-10

New: Support for External Identity

You can now use externally vended JWT tokens to authenticate to TrueFoundry. Read more

More Updates

Fixed Gemini 3 Pro Model usage in Agent Response.
Added support for custom Slug in Model integrations.
Added TrueFoundry integration with Goose. Read more

Release instructions

Update truefoundry helm chart version to 0.106.2.

v0.105.2

2025-12-08

New: Budget Usage

You can now monitor the usage metrics for each Budget rule added. Read more

More Updates

Added support for Deepseek model in Vertex AI.

Release instructions

Update truefoundry helm chart version to 0.105.2.

v0.104.2

2025-12-02

New: Configure location to store your AI Gateway Request and Metrics

We have added support to configure location on which AI Gateway Request and Metrics would be stored. This helps in complying with local Data Residency laws and privacy policies..

This feature is only available in SaaS TrueFoundry AI Gateway

Updates and Bug Fixes

Added support for Finetune API in Vertex Model as well. Read more
Added support for thought_singnature in Google Gemini and Vertex model response

Release instructions

Update truefoundry helm chart version to 0.104.2.

v0.103.1

2025-11-28

Updates and Bug Fixes

Added 404 in status codes used for default Fallback.
Added support for GCP workload identify authentication for Vertex models. Read more
Added support for Media resolution in Vertex Models. Read more
Added support for none as value in reasoning_effort. Read more

Release instructions

Update truefoundry helm chart version to 0.103.1.

v0.102.5

2025-11-26

New: we now support Virtual Models

Create reusable virtual models with intelligent routing configurations to distribute requests across multiple model providers. Read more

More Updates

Updates on Prisma Guardrail:
- We now pass tfy.request.conversation_id and traceId to all the request allowing to group messages.
- We slice the payload to max size of 1.5 MB when sending to Prisma.
New Routing Metrics: visualize effect of different AI Gateway configs like Rate limit, Budget limit, Routing, Load Balancing or Fallback.
New Metrics introduced:
- Latency per output token
- Model usage per user and per model
Added basic tracing support for requests via Gemini CLI.
Added support for AWS IAM role based auth in AWS SQS based async service. Read more

Release instructions

Update truefoundry helm chart version to 0.102.5.

v0.101.2

2025-11-21

Improved Budget Limit Config

Added support for Budget per week.
Windows for budget usage will now be a fixed. (Day starts with 00:00 UTC, Week starts with Mon, Month starts with 1st) Read more
Added support for setting Alerts to get warned before limit reaches. Read more

More Updates

Behavior Change: Rate limit and Budget limit rules will now be evaluated for all matching rules and the first one in order will be applied. This enables user to set priority of rules by adjusting the order in config.
We now support AWS IAM role based auth when adding AWS bedrock integrations in TrueFoundry SaaS Gateway. Read more
Bug fix: fixed Patronus AI guardrail validation, fixed adding ‘Origin’ header to API request options in Promptfoo guardrail.
Added support for configuring multiple AI Gateways based on BU, region, etc. Read more

Breaking Change: Removed support for Dynamic Rule ID in Budget config. Read more

Release instructions

Update truefoundry helm chart version to 0.101.2.

v0.100.2

2025-11-18

Override prompts when re-routing/falling back to different model

More Updates

Added support for MiniMax model in Vertex.
Bug fix: resolved issue of index number handling in case of parallel tool calls with Anthropic models

Release instructions

Update truefoundry helm chart version to 0.100.2.

​Improvement: Simplified management for MCP Servers

​More Updates

​Release instructions

​Create your own custom roles and assign to User

​More Updates

​Release instructions

​New: Support added for dynamic TrueFoundry authentication for MCP access

​More Updates

​Release instructions

​New: Support for SCIM

​Improved Rate Limit Config

​More Updates

​Release instructions

​New: Use git tags to deploy your code

​More Updates

​Release instructions

​New: Request Caching

​More Updates

​Release instructions

​New: Support for External Identity

​More Updates

​Release instructions

​New: Budget Usage

​More Updates

​Release instructions

​New: Configure location to store your AI Gateway Request and Metrics

​Updates and Bug Fixes

​Release instructions

​Updates and Bug Fixes

​Release instructions

​New: we now support Virtual Models

​More Updates

​Release instructions

​Improved Budget Limit Config

​More Updates

​Release instructions

​Override prompts when re-routing/falling back to different model

​More Updates

​Release instructions

Improvement: Simplified management for MCP Servers

More Updates

Release instructions

Create your own custom roles and assign to User

More Updates

Release instructions

New: Support added for dynamic TrueFoundry authentication for MCP access

More Updates

Release instructions

New: Support for SCIM

Improved Rate Limit Config

More Updates

Release instructions

New: Use git tags to deploy your code

More Updates

Release instructions

New: Request Caching

More Updates

Release instructions

New: Support for External Identity

More Updates

Release instructions

New: Budget Usage

More Updates

Release instructions

New: Configure location to store your AI Gateway Request and Metrics

Updates and Bug Fixes

Release instructions

Updates and Bug Fixes

Release instructions

New: we now support Virtual Models

More Updates

Release instructions

Improved Budget Limit Config

More Updates

Release instructions

Override prompts when re-routing/falling back to different model

More Updates

Release instructions