Models
List models
GET
/v1/modelsReturns the full model catalog with live pricing. Pricing is dynamic — fetch this endpoint rather than hardcoding rates, as they can change without notice. Authentication required (API key or session token).
Example
curl https://api.pinstripes.io/v1/models \ -H "Authorization: Bearer sk-ps-..."
Response
{
"object": "list",
"data": [
{
"id": "ps/deepseek-v4-flash",
"object": "model",
"created": 1748995200,
"owned_by": "pinstripes",
"context_length": 1000000,
"max_output_tokens": 16384,
"pricing": {
"prompt": "0.000000087",
"completion": "0.000000173",
"cached_tokens": "0.00000005"
}
},
{
"id": "ps/llama-4-maverick",
"object": "model",
"created": 1748995200,
"owned_by": "pinstripes",
"context_length": 1000000,
"max_output_tokens": 16384,
"pricing": {
"prompt": "0.00000013",
"completion": "0.00000053",
"cached_tokens": "0.00000005"
}
},
{
"id": "ps/qwen3.6-35b-a3b",
"object": "model",
"created": 1748995200,
"owned_by": "pinstripes",
"context_length": 262144,
"max_output_tokens": 32768,
"pricing": {
"prompt": "0.00000012",
"completion": "0.00000085",
"cached_tokens": "0.00000005"
}
}
]
}Pricing fields are per token. Multiply by 1,000,000 to get the $/1M figure shown on the pricing page. The cached_tokens rate is $0.05/1M across all models.
Model reference
| Model ID | Context | Input /1M | Output /1M | Cached /1M |
|---|---|---|---|---|
| ps/deepseek-v4-flash | 1M | $0.087 | $0.173 | $0.050 |
| ps/deepseek-v4-pro | 1M | $0.390 | $0.780 | $0.050 |
| ps/llama-4-maverick | 1M | $0.130 | $0.530 | $0.050 |
| ps/llama-4-scout | 10M | $0.071 | $0.265 | $0.050 |
| ps/qwen3.6-35b-a3b | 262K | $0.120 | $0.850 | $0.050 |
| ps/qwen3.5-35b-a3b | 262K | $0.120 | $0.850 | $0.050 |
| ps/kimi-k2 | 131K | $0.510 | $2.000 | $0.050 |
| ps/step-3.7-flash | 256K | $0.175 | $1.000 | $0.050 |
| ps/glm-4.5-air | 131K | $0.110 | $0.750 | $0.050 |
| ps/gpt-oss-120b | 131K | $0.033 | $0.153 | $0.050 |
| ps/qwen3-coder-30b-a3b | 160K | $0.061 | $0.235 | $0.050 |