> ## Documentation Index
> Fetch the complete documentation index at: https://www.truefoundry.com/llms.txt
> Use this file to discover all available pages before exploring further.

# TrueFoundry Mocker Provider Setup Guide

> Set up a mock LLM provider for benchmarking, load testing, and performance evaluation

This guide walks you through setting up a benchmarking provider using the TrueFoundry platform for load testing and performance evaluation.

### 1. Deploy the Benchmarking Service

* Deploy this [repository](https://github.com/truefoundry/tfy-openai-benchmarking-provider) as a service on TrueFoundry.
* Configure environment variables:
  1. `TOKEN_COUNT`: Number of tokens to return in responses (default: 100)
  2. `LATENCY`: Latency in seconds (default: 0)

     <img src="https://mintcdn.com/truefoundry/FeKcq2n1MMm83Par/images/Screenshot2025-07-09at5.12.26PM-min.png?fit=max&auto=format&n=FeKcq2n1MMm83Par&q=85&s=b69e6c4578184f166709cf234ac8f8bd" alt="Service deployment configuration panel showing environment variable settings for TOKEN_COUNT and LATENCY" width="2308" height="2008" data-path="images/Screenshot2025-07-09at5.12.26PM-min.png" />

### 2. Create a Provider Account

### 2.1. **Navigate to the AI Gateway**

* In the **TrueFoundry Dashboard**, go to **AI Gateway** → **Models**
* Select **Self Hosted Models** as your model provider

  <img src="https://mintcdn.com/truefoundry/FeKcq2n1MMm83Par/images/Screenshot2025-07-09at5.38.06PM-min.png?fit=max&auto=format&n=FeKcq2n1MMm83Par&q=85&s=a134d6f042dc59953bbea66a4ce05c18" alt="TrueFoundry AI Gateway interface showing Self Hosted Models provider selection" width="3598" height="2002" data-path="images/Screenshot2025-07-09at5.38.06PM-min.png" />

### 2.2. **Configure Models**

* **Model Type**: Select `Vllm-Openai`
* **Base URL**: Enter the URL of your deployed benchmarking service
* **Name**: Add a proper display name to your models

  <img src="https://mintcdn.com/truefoundry/FeKcq2n1MMm83Par/images/Screenshot2025-07-09at5.12.50PM-min.png?fit=max&auto=format&n=FeKcq2n1MMm83Par&q=85&s=6a6888966eb0122f0fd763337e4a2b8e" alt="Model configuration form with fields for model type, base URL, and display name" width="2696" height="2010" data-path="images/Screenshot2025-07-09at5.12.50PM-min.png" />

### 3. Generate Load Traffic

Create a client for producing traffic at your desired RPS (Requests Per Second). You can use any HTTP client or load testing tool such as:

* **Locust**: For advanced load testing scenarios
* **Custom scripts**: Using any HTTP client library

Make sure to keep the provider service on auto-scaling so it can handle high RPS traffic without getting throttled.