Skip to main content
v0.112.1
2026-01-14

Improvement: Simplified management for MCP Servers

MCP Server integrations are now standalone, top-level resources with their own permissions and simplified management.
Breaking Change: We are removing MCP Server Groups. MCP Servers will no longer be nested under groups. Read more

More Updates

  • Added support for Google Gemini image models
  • Tiered pricing in Google Vertex AI models
  • Behavior Change Guardrails would now run in parallel to reduce latency in AI Gateway. Read more
  • Added support for certificate based auth in Azure OpenAI Integration
  • Added suport for External identity in MCP Gateway

Release instructions

  • Update truefoundry helm chart version to 0.112.1.
v0.111.2
2026-01-12

Create your own custom roles and assign to User

You can now create your custom tenant level roles and assign to users. Read more

More Updates

  • Bugfix - fixed SLA cutoff in priority based routing config
  • Added support for xAI model provider in AI Gateway
  • Added Request Failure metrics for tools in MCP metrics
  • Behavior Change: For production GPU workloads, truefoundry now automatically adds pod disruption budget for max availbility of 25%. This reduces disruption of GPU workloads in case of node consolidation.

Release instructions

  • Update truefoundry helm chart version to 0.111.2.
v0.110.3
2026-01-05

New: Support added for dynamic TrueFoundry authentication for MCP access

When using MCP Servers in Cursor/VSCode, you can now use OAuth for authentication without need to hardcode the token in mcp.json

More Updates

  • Behavior Change In AWS Paramter store, we now store the secrets as SecureString instead of String parameter type. Read more
  • A request body size limit added in AI Gateway requests (default: 50 MB)
  • Bugfix - Fixed Assumed Role based auth for AWS Bedrock Guardrails
  • Bugfix - Removed default request timeout of 5 min within AI Gateway

Release instructions

  • Update truefoundry helm chart version to 0.110.3.
v0.109.3
2025-12-22

New: Support for SCIM

TrueFoundry now supports SCIM for SAML based SSO. SCIM enabled automatic user/team management using IdP users/groups. Read more
Enable SCIM checkbox in SAML SSO configuration

Improved Rate Limit Config

  • Rule IDs must be static (no {} placeholders). Use rate_limit_applies_per to create per-entity rate limit instead of dynamic rule IDs. Read more

More Updates

  • Added support for API key based auth in AWS Bedrock model integration.
  • Behavior Change: Tenant Admin can now access all entities(Models, MCP Server, Guardrail, Agent) in AI Gateway.

Release instructions

  • Update truefoundry helm chart version to 0.109.3.
Breaking Change: Removed support for Dynamic Rule ID in Rate Limit config. Read more
v0.108.2
2025-12-17

New: Use git tags to deploy your code

You can now use tags to deploy your code to have better version control of your releases.

More Updates

  • Bug fixes and improvements

Release instructions

  • Update truefoundry helm chart version to 0.108.2.
v0.107.1
2025-12-15

New: Request Caching

You can now support both Exact match and Semantic caching in AI Gateway requests. Read more

More Updates

  • Error message improvement in Self-Hosted models response via AI Gateway.
  • Added Embedding model support in Cloudera.

Release instructions

  • Update truefoundry helm chart version to 0.107.1.
v0.106.2
2025-12-10

New: Support for External Identity

You can now use externally vended JWT tokens to authenticate to TrueFoundry. Read more

More Updates

  • Fixed Gemini 3 Pro Model usage in Agent Response.
  • Added support for custom Slug in Model integrations.
  • Added TrueFoundry integration with Goose. Read more

Release instructions

  • Update truefoundry helm chart version to 0.106.2.
v0.105.2
2025-12-08

New: Budget Usage

You can now monitor the usage metrics for each Budget rule added. Read more

More Updates

  • Added support for Deepseek model in Vertex AI.

Release instructions

  • Update truefoundry helm chart version to 0.105.2.
v0.104.2
2025-12-02

New: Configure location to store your AI Gateway Request and Metrics

We have added support to configure location on which AI Gateway Request and Metrics would be stored. This helps in complying with local Data Residency laws and privacy policies..
This feature is only available in SaaS TrueFoundry AI Gateway

Updates and Bug Fixes

  • Added support for Finetune API in Vertex Model as well. Read more
  • Added support for thought_singnature in Google Gemini and Vertex model response

Release instructions

  • Update truefoundry helm chart version to 0.104.2.
v0.103.1
2025-11-28

Updates and Bug Fixes

  • Added 404 in status codes used for default Fallback.
  • Added support for GCP workload identify authentication for Vertex models. Read more
  • Added support for Media resolution in Vertex Models. Read more
  • Added support for none as value in reasoning_effort. Read more

Release instructions

  • Update truefoundry helm chart version to 0.103.1.
v0.102.5
2025-11-26

New: we now support Virtual Models

Create reusable virtual models with intelligent routing configurations to distribute requests across multiple model providers. Read more

More Updates

  • Updates on Prisma Guardrail:
    • We now pass tfy.request.conversation_id and traceId to all the request allowing to group messages.
    • We slice the payload to max size of 1.5 MB when sending to Prisma.
  • New Routing Metrics: visualize effect of different AI Gateway configs like Rate limit, Budget limit, Routing, Load Balancing or Fallback.
  • New Metrics introduced:
    • Latency per output token
    • Model usage per user and per model
  • Added basic tracing support for requests via Gemini CLI.
  • Added support for AWS IAM role based auth in AWS SQS based async service. Read more

Release instructions

  • Update truefoundry helm chart version to 0.102.5.
v0.101.2
2025-11-21

Improved Budget Limit Config

  • Added support for Budget per week.
  • Windows for budget usage will now be a fixed. (Day starts with 00:00 UTC, Week starts with Mon, Month starts with 1st) Read more
  • Added support for setting Alerts to get warned before limit reaches. Read more

More Updates

  • Behavior Change: Rate limit and Budget limit rules will now be evaluated for all matching rules and the first one in order will be applied. This enables user to set priority of rules by adjusting the order in config.
  • We now support AWS IAM role based auth when adding AWS bedrock integrations in TrueFoundry SaaS Gateway. Read more
  • Bug fix: fixed Patronus AI guardrail validation, fixed adding ‘Origin’ header to API request options in Promptfoo guardrail.
  • Added support for configuring multiple AI Gateways based on BU, region, etc. Read more
    Breaking Change: Removed support for Dynamic Rule ID in Budget config. Read more

Release instructions

  • Update truefoundry helm chart version to 0.101.2.
v0.100.2
2025-11-18

Override prompts when re-routing/falling back to different model

More Updates

  • Added support for MiniMax model in Vertex.
  • Bug fix: resolved issue of index number handling in case of parallel tool calls with Anthropic models

Release instructions

  • Update truefoundry helm chart version to 0.100.2.