Theme
AI Model Integrations
AI model integrations connect your workspace to large language model providers. These integrations power LLM Response nodes, RAG Query nodes, and other AI-driven features in your bot flows.
Supported Providers
| Provider | Models | Authentication |
|---|---|---|
| Anthropic | Claude Opus, Claude Sonnet, Claude Haiku | API key |
| OpenAI | GPT-4o, GPT-4, GPT-3.5 Turbo | API key |
| Google Vertex AI | Gemini Pro, Gemini Flash | GCP service account |
Configuring Anthropic (Claude)
- Go to Settings > Integrations > Add Integration.
- Select AI Model as the type, then choose Anthropic.
- Enter a label (e.g., "Anthropic Production").
- Paste your API key from the Anthropic Console.
- Select a default model (e.g., Claude Sonnet).
- Click Test Connection, then Save.
Configuring OpenAI (GPT)
- Go to Settings > Integrations > Add Integration.
- Select AI Model as the type, then choose OpenAI.
- Enter a label (e.g., "OpenAI Development").
- Paste your API key from the OpenAI dashboard.
- Optionally enter an Organization ID if your account belongs to an organization.
- Select a default model (e.g., GPT-4o).
- Click Test Connection, then Save.
Configuring Google Vertex AI (Gemini)
- Go to Settings > Integrations > Add Integration.
- Select AI Model as the type, then choose Vertex AI.
- Enter a label (e.g., "Vertex AI Production").
- Enter your GCP Project ID.
- Enter the region (e.g.,
us-central1). - Upload or paste your service account JSON key.
- Select a default model (e.g., Gemini Pro).
- Click Test Connection, then Save.
AI provider configuration form showing fields for label, API key, default model selector, and test connection button, with Anthropic Claude selected as the provider
TIP
For Vertex AI, the service account needs the Vertex AI User role in your GCP project. You can create a dedicated service account with minimal permissions for better security.
Setting the Default Model
Each workspace has a default AI model that is used when a flow node does not specify a particular integration.
- Go to Settings > Integrations.
- Find the AI model integration you want as the default.
- Click the three-dot menu and select Set as Default.
The default integration is indicated with a badge in the integrations list.
Per-Node Override
Individual flow nodes can override the workspace default by selecting a specific integration. In the flow builder:
- Select an LLM Response or RAG Query node.
- In the configuration panel, find the AI Integration dropdown.
- Choose the desired integration from the list.
This allows you to use different models for different tasks within the same flow. For example, use a faster, cheaper model for simple classification and a more capable model for complex responses.
Rate Limits
You can configure rate limits on each AI model integration to control usage and costs.
| Setting | Description |
|---|---|
| Requests per minute | Maximum API calls per minute across all bots using this integration |
| Tokens per minute | Maximum total tokens (input + output) per minute |
| Monthly budget | Optional spending cap that disables the integration when reached |
WARNING
When a rate limit is reached, bot nodes using that integration return a fallback response or an error, depending on your flow design. Set rate limits conservatively at first and adjust based on actual usage patterns.
Cost Tracking
OmniBots tracks token usage and estimated costs for each AI model integration. To view usage:
- Go to Settings > Integrations.
- Click on an AI model integration.
- Select the Usage tab.
The usage view shows daily token consumption, request counts, and estimated costs broken down by model. Use this data to optimize model selection and manage spending.
Token usage and cost tracking chart showing daily token consumption bars with a cost trend line overlay, broken down by AI model
Next Steps
- Connect storage providers for Knowledge Base document sources
- Set up CCaaS platforms for live agent handoff
- Learn how to use AI integrations in flow builder nodes
