Comprehensive technical documentation for developers, including model architecture, parameters, and performance metrics.
View Model SpecsGemini is a 17.5B parameter multimodal foundation model capable of processing text, code, and images across 100+ languages.
17.5 billion total
100+ supported
100TB real-world data
128 tokens/second latency on consumer-grade GPUs
512 tokens/second on A100 cloud GPUs
Controls randomness in generation (0=deterministic, 1=creative).
Limits output length. Maximum 32768 tokens (256k per request).
Number of alternative responses per request (1-5).
Cumulative probability threshold for nucleus sampling (0-1).
Code | Description | Solution |
---|---|---|
400 | Bad Request | Check JSON syntax |
401 | Unauthorized | Validate API key |
429 | Rate limit exceeded | Retry after timeout header |
500 | Internal Error | Contact support |
Check our GitHub repositories for code samples or search the documentation. Enterprise developers: Contact your account team for technical support.
Contact Support