Cline with TrueFoundry AI Gateway: Setup Guide for VS Code

If you use VS Code and AI while you code, Cline is a great fit. When you run Cline through TrueFoundry’s AI Gateway, you get the same coding experience with enterprise guardrails, observability, and cost control. This guide walks you through what Cline is, why routing it through the Gateway helps, and how to set it up in a few minutes. Cline runs inside your editor and can write files, modify code, and help you debug through a natural conversation. It feels like a teammate who knows your repo and can act on it from the editor.

Why pair Cline with TrueFoundry

One key place for access and keys: You give Cline a single TrueFoundry API key and a base URL. The Gateway handles provider keys behind the scenes, so you do not spread raw model keys across laptops and scripts. You can also rotate and expire tokens when needed.
Spend only what you intend. Set hard budgets per user, team, app, or model. If a limit is crossed, calls stop. This prevents surprise bills from loops or heavy usage.
Keep traffic healthy. Apply rate limits per user, team, app, or model.
See what is happening. Use dashboards for latency, token usage, cost, errors, and the rules that were triggered. You can group by model, user, team, or your own labels.
Log what you need. Turn request logging on or off with a header when you need a trace and keep it off when you do not.
Tag every call with context. Add metadata like project, environment, tenant, or feature to each request and then filter, chart, and limit with those tags.

What you need

A TrueFoundry account and Gateway access. The quick start shows how to get set up.
VS Code with the Cline extension installed.

Step-by-step setup

Open VS Code with Cline installed. Open the command palette with Cmd or Ctrl plus Shift plus P. Run “Cline: Open in new tab”. Click the gear icon in the Cline tab.

Point Cline to the Gateway
In Cline’s API settings, set:‍
1. API Provider: OpenAI Compatible‍
2. Base URL: your TrueFoundry Gateway URL‍
3. API Key: your TrueFoundry API key‍
4. Model ID: for example openai-main/gpt-4o, or any model you expose via the Gateway
Save the settings.

That is it. Cline will now send requests through the Gateway with your chosen model.

Tip: If you prefer to test with a short script, the OpenAI clients can talk to the Gateway by setting the base URL and your TrueFoundry key as shown in the access control guide.

Try these first prompts in Cline

Create: “Create a Python function that returns the first N Fibonacci numbers.”
Modify: “Add input validation and error handling to that function.”
Explain: “Explain what this function does in plain English.”
Debug: “I am getting a ValueError in this file. Help me fix it.”

Recommended Gateway setup for teams

1) Use the right kind of key

Personal Access Tokens work well for individual developers.
Virtual Access Tokens work well for apps and shared tools. They are not tied to a person and can be scoped and revoked by an admin.

2) Add budgets

Create a simple budget config so one person or one tool cannot overspend.

Daily or monthly limits are supported.
You can scope limits to users, teams, virtual accounts, models, or any mix of these.
If any matching rule is over the limit, the call is blocked.

3) Add rate limits

Protect backends and enforce fair use.

Limit by tokens or by requests.
Apply per minute, hour, or day.
Match by user, team, virtual account, model, or metadata like environment or project.
The Gateway uses a sliding window token bucket.

4) Tag every request

Ask your team to add a small header with tags like customer, project, environment, or feature.

Send X-TFY-METADATA with string values.
Use these tags to filter logs, build custom charts, and even scope rate or budget rules.

5) Be deliberate about logging

Toggle logging per request using the X-TFY-LOGGING-CONFIG header.
For self-hosted Gateway, you can also set a global mode to always log or never log.
View logs in the Monitor section of the UI when you need them.

Observability that helps you ship

Open the Metrics Dashboard to watch latency, time to first token, inter-token latency, cost, token counts, error codes, and more. Group by model to compare performance. Group by user or team to see who uses what. Group by your own metadata to track tenants or features. You can also export raw metrics to CSV for deeper analysis.

What you gain

Speed in the editor with Cline’s autonomous code help.
Control over who can use which models and how.
Safety with budgets and rate limits.
Clarity with logs, analytics, and your own metadata.

Closing thoughts

Cline makes coding feel lighter. The Gateway makes it safe to roll that power out across a team. Set the URL, pick the model, add a key, and you are ready to code. As you scale, add budgets, rate limits, logging, and metadata, and you will keep speed without losing control.

Built for Speed: ~10ms Latency, Even Under Load

Blazingly fast way to build, track and deploy your models!

Handles 350+ RPS on just 1 vCPU — no tuning needed
Production-ready with full enterprise support

Get Started with Truefoundry Now Talk to the Expert

TrueFoundry AI Gateway delivers ~3–4 ms latency, handles 350+ RPS on 1 vCPU, scales horizontally with ease, and is production-ready, while LiteLLM suffers from high latency, struggles beyond moderate RPS, lacks built-in scaling, and is best for light or prototype workloads.

Built for Speed: ~10ms Latency, Even Under Load

Schedule your Demo Now

The fastest way to build, govern and scale your AI

Book a Demo

Cline Integration with TrueFoundry AI Gateway

Why pair Cline with TrueFoundry

What you need

Step-by-step setup

Try these first prompts in Cline

Recommended Gateway setup for teams

1) Use the right kind of key

2) Add budgets

3) Add rate limits

4) Tag every request

5) Be deliberate about logging

Observability that helps you ship

What you gain

Closing thoughts

Built for Speed: ~10ms Latency, Even Under Load

Top 5 Azure ML Alternatives of 2025

Exploring Alternatives to VertexAI

Top 6 SageMaker Alternatives of 2025

AI Security Platforms & Gateways: Safeguarding LLMs and Agentic AI

GPT-5.1 vs GPT-5: 9 Major Improvements You Need to Know

Data Residency in the Age of Agentic AI: How AI Gateways Enable Sovereign Scale and Compliance

Multi-Agent Systems Explained: Why the Future of AI Is Collaborative

The Complete Guide to AI Gateways and MCP Servers

Cline Integration with TrueFoundry AI Gateway

Why pair Cline with TrueFoundry

What you need

Step-by-step setup

Try these first prompts in Cline

Recommended Gateway setup for teams

1) Use the right kind of key

2) Add budgets

3) Add rate limits

4) Tag every request

5) Be deliberate about logging

Observability that helps you ship

What you gain

Closing thoughts

Built for Speed: ~10ms Latency, Even Under Load

Discover More

Top 5 Azure ML Alternatives of 2025

Exploring Alternatives to VertexAI

Top 6 SageMaker Alternatives of 2025

AI Security Platforms & Gateways: Safeguarding LLMs and Agentic AI

GPT-5.1 vs GPT-5: 9 Major Improvements You Need to Know

Data Residency in the Age of Agentic AI: How AI Gateways Enable Sovereign Scale and Compliance

Multi-Agent Systems Explained: Why the Future of AI Is Collaborative

The Complete Guide to AI Gateways and MCP Servers

Subscribe to our newsletter