Gemma 3 27B
Gemma 3 27B is Google's open-source lightweight model with text and image capabilities, featuring a 128K token context window and multimodal understanding.
API Pricing
Cheapest on Together AI — 81% below avg| Provider | Input / 1M | Output / 1M | Speed | TTFT | Updated |
|---|---|---|---|---|---|
| $0.020 | $0.040 | 17.4 t/s | 386ms | 4/14/2026 | |
| $0.080 | $0.160 | 17.4 t/s | 386ms | 4/4/2026 | |
| $0.080 | $0.160 | 17.4 t/s | 386ms | 4/14/2026 | |
| $0.120 | $0.190 | 17.4 t/s | 386ms | 4/14/2026 | |
| $0.230 | $0.380 | 17.4 t/s | 386ms | 4/14/2026 |
Prices updated daily. Last check: 4/14/2026
Model Details
General
- Creator
- Family
- Gemma
- Tier
- Lightweight
- Context Window
- 128K
- Modalities
- Text, Image
Capabilities
- Tool Calling
- No
- Open Source
- Yes
- Subtypes
- Chat Completion
- Aliases
- gemma-3-27b-it, gemma-3n-e4b-it
Strengths & Limitations
- Open-source with accessible model weights for local deployment
- Multimodal support for both text and image inputs
- 128K token context window for processing lengthy documents
- Fast inference speed at 39.07 tokens per second
- Low time to first token at 308ms for responsive applications
- Lightweight 27B parameter size reduces computational requirements
- Created by Google with established model development expertise
- No function calling or tool use capabilities
- Smaller parameter count than flagship models in other families
- Limited to chat completion format without structured output modes
- Fewer advanced API features compared to proprietary alternatives
- May have reduced reasoning capabilities versus larger models
Key Features
About Gemma 3 27B
Common Use Cases
Gemma 3 27B is well-suited for organizations needing cost-effective multimodal AI capabilities with the flexibility of open-source deployment. Its combination of text and image processing makes it valuable for document analysis, content moderation, educational applications, and customer support systems that handle both text and visual inputs. The model's lightweight nature and fast inference speed make it practical for high-volume applications where response time matters, while its open-source license enables customization for domain-specific tasks, on-premises deployment for data privacy requirements, and integration into existing infrastructure without vendor dependencies.
Frequently Asked Questions
How much does Gemma 3 27B cost per million tokens?
Gemma 3 27B pricing varies by provider and deployment method. As an open-source model, you can run it locally without API costs, or use hosted providers with different pricing structures. Check the pricing table above for current rates across all providers offering Gemma 3 27B.
What is Gemma 3 27B best used for?
Gemma 3 27B excels at multimodal tasks requiring both text and image understanding, such as document analysis, content moderation, and educational applications. Its lightweight design and fast inference make it suitable for high-volume deployments, while its open-source nature enables customization and on-premises deployment for organizations with specific requirements.
Can I run Gemma 3 27B locally or do I need an API?
Gemma 3 27B is open-source, so you can download the model weights and run it locally on your own infrastructure. This gives you full control over deployment, data privacy, and costs. Alternatively, you can use hosted API providers for easier setup without managing infrastructure yourself.