Models Version
LIMITED TIME OFFER
Get $5 Free Credit on First Payment
No strings attached — add funds and get $5 bonus instantly
Gemini 3.1 Flash TTS Text to Speech API Documentation
https://gateway.pixazo.ai/gemini-flash-tts/v1
Authentication
All requests require an API key passed via header.
| Header | Type | Required | Description |
|---|---|---|---|
| Ocp-Apim-Subscription-Key | string | Yes | Your API subscription key |
Gemini Flash TTS generate request - Gemini Flash TTS API
Request Code
POST https://gateway.pixazo.ai/gemini-flash-tts/v1/gemini-flash-tts/generate
Content-Type: application/json
Cache-Control: no-cache
Ocp-Apim-Subscription-Key: YOUR_SUBSCRIPTION_KEY
{
"text": "Hey there, I am a new text to speech model and I can say things in many different ways.",
"voice": "Algenib",
"prompt": "Say this in a dramatic, theatrical tone",
"language_code": "en-GB"
}
import requests
url = "https://gateway.pixazo.ai/gemini-flash-tts/v1/gemini-flash-tts/generate"
headers = {
"Content-Type": "application/json",
"Cache-Control": "no-cache",
"Ocp-Apim-Subscription-Key": "YOUR_SUBSCRIPTION_KEY"
}
data = {
"text": "Hey there, I am a new text to speech model and I can say things in many different ways.",
"voice": "Algenib",
"prompt": "Say this in a dramatic, theatrical tone",
"language_code": "en-GB"
}
response = requests.post(url, json=data, headers=headers)
print(response.json())
const url = 'https://gateway.pixazo.ai/gemini-flash-tts/v1/gemini-flash-tts/generate';
const data = {
text: 'Hey there, I am a new text to speech model and I can say things in many different ways.',
voice: 'Algenib',
prompt: 'Say this in a dramatic, theatrical tone',
language_code: 'en-GB'
};
fetch(url, {
method: 'POST',
headers: {
'Content-Type': 'application/json',
'Cache-Control': 'no-cache',
'Ocp-Apim-Subscription-Key': 'YOUR_SUBSCRIPTION_KEY'
},
body: JSON.stringify(data)
})
.then(response => response.json())
.then(data => console.log(data))
.catch(error => console.error('Error:', error));
curl -X POST "https://gateway.pixazo.ai/gemini-flash-tts/v1/gemini-flash-tts/generate" \
-H "Content-Type: application/json" \
-H "Cache-Control: no-cache" \
-H "Ocp-Apim-Subscription-Key: YOUR_SUBSCRIPTION_KEY" \
--data-raw '{
"text": "Hey there, I am a new text to speech model and I can say things in many different ways.",
"voice": "Algenib",
"prompt": "Say this in a dramatic, theatrical tone",
"language_code": "en-GB"
}'
Output
{
"request_id": "gemini-flash-tts_019dxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx",
"status": "QUEUED",
"polling_url": "https://gateway.pixazo.ai/v2/requests/status/gemini-flash-tts_019dxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
}
Webhook (Optional)
Add the X-Webhook-URL header to your generate request to receive a POST callback instead of polling.
X-Webhook-URL: https://your-server.com/webhook/callback
Request Parameters - Gemini Flash TTS generate request
| Parameter | Required | Type | Description |
|---|---|---|---|
| text | Yes | string | The textual content to convert into speech. Must be a valid string of readable language. |
| voice | No | string | Voice preset name used for narration (e.g., Algenib). Controls timbre and speaker identity. |
| prompt | No | string | Style instruction that shapes delivery, emotion, pacing, or accent (e.g., Say this in a dramatic, theatrical tone). |
| language_code | No | string | BCP-47 language code hint for pronunciation (e.g., en-US, en-GB). |
Example Request
{
"text": "Hey there, I am a new text to speech model and I can say things in many different ways.",
"voice": "Algenib",
"prompt": "Say this in a dramatic, theatrical tone",
"language_code": "en-GB"
}
Response
{
"request_id": "gemini-flash-tts_019dxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx",
"status": "QUEUED",
"polling_url": "https://gateway.pixazo.ai/v2/requests/status/gemini-flash-tts_019dxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
}
Request Headers
| Header | Value |
|---|---|
| Content-Type | application/json |
| Cache-Control | no-cache |
| Ocp-Apim-Subscription-Key | YOUR_SUBSCRIPTION_KEY |
Response Handling
Common status codes.
| Code | Meaning |
|---|---|
| 202 | Accepted — Request queued |
| 400 | Bad Request |
| 401 | Unauthorized |
| 402 | Insufficient Balance |
| 403 | Forbidden |
| 429 | Too Many Requests |
| 500 | Internal Server Error |
Error Responses
Queue system errors and model validation errors.
Queue System Errors
// 402 — Insufficient balance
{
"error": "Insufficient Balance",
"message": "Your wallet does not have enough balance. Required: $0.04"
}
// 400 — Model not found
{
"error": "Model not found",
"message": "Model 'gemini-flash-tts' not found or is disabled"
}
Error via Status/Webhook
{
"request_id": "gemini-flash-tts_019dxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx",
"status": "ERROR",
"model_id": "gemini-flash-tts",
"error": "Description of the error",
"output": null
}
Retrieving Results
Poll the universal status endpoint to check progress and retrieve results.
Endpoint
GET https://gateway.pixazo.ai/v2/requests/status/{request_id}
Ocp-Apim-Subscription-Key: YOUR_API_KEY
cURL Example
curl -H "Ocp-Apim-Subscription-Key: YOUR_API_KEY" \
"https://gateway.pixazo.ai/v2/requests/status/gemini-flash-tts_019d9a88-bba1-7f39-2655-a0c5f4a109529"
Response (Completed)
{
"request_id": "gemini-flash-tts_019d9a88-bba1-7f39-2655-a0c5f4a109529",
"status": "COMPLETED",
"model_id": "gemini-flash-tts",
"error": null,
"output": {
"media_url": [
"https://pub-582b7213209642b9b995c96c95a30381.r2.dev/v1/gemini-flash-tts_019d9a88-bba1-7f39-2655-a0c5f4a109529/output.wav"
],
"media_type": "audio/wav"
},
"created_at": "2026-04-17T08:22:25.893Z",
"updated_at": "2026-04-17T08:22:43.000Z",
"completed_at": "2026-04-17T08:22:43.000Z"
}
Response Fields
| Field | Type | Description |
|---|---|---|
| request_id | string | Unique request identifier |
| status | string | QUEUED, PROCESSING, COMPLETED, FAILED, or ERROR |
| model_id | string | Model that processed the request |
| error | string|null | Error message if failed |
| output.media_url | array | URLs to generated media (R2 CDN) |
| output.media_type | string | MIME type (audio/wav) |
| created_at | string | When request was created |
| completed_at | string|null | When request completed |
| polling_url | string | Status URL (initial response only) |
Status Values
| Status | Description |
|---|---|
| QUEUED | Request accepted, waiting to be processed |
| PROCESSING | Being processed by the model |
| COMPLETED | Done — output contains the result |
| FAILED | Failed — check error field |
| ERROR | System error — not charged |
Status Flow
QUEUED → PROCESSING → COMPLETED
→ FAILED
→ ERROR
Typical Workflow
- Send a generate request to the API endpoint
- Save the
request_idfrom the response - Poll every 5-10 seconds:
GET /v2/requests/status/{request_id} - When
statusis"COMPLETED", download fromoutput.media_url
Tip: Use X-Webhook-URL header to get a callback instead of polling.
Gemini 3.1 Flash TTS Text to Speech API Pricing
| Resolution | Price (USD) |
|---|---|
| default | $0.04 |
