- Maintains a safe and positive environment for users.
- Protects against harmful or illegal content.
- Ensures compliance with regulations and builds user trust.
Key Areas for Validation
- Explicit Content: Inappropriate sexual or violent content.
- Hate Speech: Discriminatory language targeting individuals or groups.
- Misinformation: False or misleading information, especially in health and safety.
- Illegal Content: Instructions for illegal activities or regulatory violations.
TrueFoundry’s Content Moderation Solutions
TrueFoundry offers comprehensive content moderation through various integrations:Truefoundry’s Content Moderation Solutions
-
OpenAI Moderation
You can use the OpenAI Moderation integration on TrueFoundry to achieve real-time analysis of text and images for harmful or inappropriate content. A brief tutorial for setting this up is provided at the end of this page. -
AWS Bedrock Guardrails
You can leverage AWS Bedrock Guardrails integration on TrueFoundry to enforce pre-defined or custom content policies, enabling real-time blocking of policy violations and context-aware filtering. -
Azure Content Safety
You can use the Azure Content Safety integration on TrueFoundry to detect and mitigate harmful, unsafe, or inappropriate content in model inputs and outputs. -
Guardrails AI (Custom Guardrail Integration)
You can implement custom content moderation logic using the custom guardrail integration option on TrueFoundry Gateway. As a starting point, refer to the TrueFoundry Guardrail Template Repository, which can be extended to suit your specific requirements.
Link to the TrueFoundry Guardrail Template Repository
How to set up content moderation using OpenAI Moderation on Truefoundry?
- Get your OpenAI API key from OpenAI
- Create a new or Add to an existing guardrail group on Truefoundry Gateway (AI Gateway -> Guardrails -> Add New Guardrail Group or Add/Edit Guardrails)
- Add OpenAI guardrail and fill the details such as:
- Name
- API Key
- Base URL (optional)
- Model (optional)
- Create a new or Edit existing guardrail configuration on Truefoundry Gateway (AI Gateway -> Config -> Guardrail -> Create/Edit)
- Test out the guardrail in playground (AI Gateway -> Playground)