LightweightAmazon

Nova Micro

Name: Nova Micro
Availability: InStock
Author: Amazon

Nova Micro is Amazon's lightweight text model in the Nova family, designed for fast, cost-effective tasks with a 128K token context window.

Context 128K

Tier Lightweight

Input from

$0.018 / 1M tokens

across 2 providers

Compare Prices

API Pricing

Cheapest on Amazon AWS — 40% below avg

Provider	Input / 1M	Output / 1M	Speed	TTFT	Updated
Amazon AWSBatch	$0.018	$0.070	329 t/s	598ms	7/13/2026
OpenRouter	$0.035	$0.140	329 t/s	598ms	7/13/2026
Amazon AWS	$0.035	$0.140	329 t/s	598ms	7/13/2026

Prices updated daily. Last check: Jul 13, 2026

Performance & Benchmarks

Source: Artificial Analysis →

Intelligence

4.7 / 100

Math

6.0 / 100

Output Speed

329 t/s

Latency (TTFT)

598ms

Reasoning & Knowledge

MMLU-Pro53.1%
GPQA Diamond35.8%
Humanity's Last Exam4.7%

Coding

LiveCodeBench14.0%
SciCode9.4%

Math

AIME 20256.0%
AIME8.0%
MATH-50070.3%

Agentic & Tool Use

Terminal-Bench Hard1.5%
τ²-bench14.0%

Instruction & Long Context

IFBench29.4%
Long-Context Reasoning9.7%

Benchmarks measured Jul 2026. Scores are independent evaluations, not vendor-reported.

Model Details

General

Creator: Amazon
Family: Nova
Tier: Lightweight
Context Window: 128K
Modalities: Text

Capabilities

Tool Calling: No
Open Source: No

Strengths & Limitations

Strengths

Fast inference speed at 173.71 output tokens per second
Quick response initiation with 404ms time to first token
128K token context window for processing lengthy documents
Lightweight architecture optimized for cost efficiency
Integration with Amazon Bedrock ecosystem
Suitable for high-volume automated workflows
Low latency for real-time applications

Limitations

No tool calling or function execution capabilities
Text-only input - no image or multimodal support
Proprietary model with no open source weights available
Limited reasoning capabilities compared to higher-tier Nova models
No advanced features like structured output modes

Key Features

•128K token context window

•Text-only input and output

•Streaming response support

•Amazon Bedrock API integration

•Batch processing capabilities

•Real-time inference optimization

•UTF-8 text encoding support

•JSON API response format

About Nova Micro

Nova Micro is Amazon's entry-level model in the Nova family, positioned as a lightweight option for high-volume, cost-sensitive applications. As part of Amazon's proprietary model lineup, it sits at the foundation tier below more capable Nova variants, focusing on speed and efficiency over advanced reasoning capabilities. The model supports a 128K token context window and handles text-only interactions, making it suitable for straightforward language tasks. With benchmark speeds of 173.71 output tokens per second and a time to first token of 404ms, Nova Micro prioritizes rapid response times. However, it lacks tool calling functionality and multimodal capabilities, reflecting its positioning as a streamlined model for basic text processing. Nova Micro targets use cases where speed and cost efficiency matter more than complex reasoning or advanced features. Organizations typically deploy it for high-volume text classification, content moderation, simple Q&A, and other automated workflows where faster, more capable models would be unnecessary overhead.

Common Use Cases

Nova Micro excels in high-volume, cost-sensitive applications where basic language understanding is sufficient. Its fast inference speed and lightweight design make it ideal for content moderation pipelines, simple chatbots, text classification systems, and automated customer service responses. Organizations use it for sentiment analysis, basic summarization, simple Q&A systems, and content filtering where the 128K context window provides adequate document processing capability. The model's speed optimization makes it particularly valuable for real-time applications requiring immediate responses, such as live chat systems or automated email routing, where complex reasoning is unnecessary but consistent, fast text processing is essential.

Frequently Asked Questions

How much does Nova Micro cost per million tokens?

Nova Micro pricing varies by provider and usage type (standard vs batch processing). Check the pricing table above for current rates across all available providers offering Nova Micro access.

What is Nova Micro best used for?

Nova Micro is best suited for high-volume, cost-sensitive text processing tasks like content moderation, simple classification, basic Q&A, and automated responses. Its fast inference speed and lightweight design make it ideal when you need quick, consistent text processing without complex reasoning or multimodal capabilities.

Does Nova Micro support tool calling or function execution?

No, Nova Micro does not support tool calling or function execution capabilities. It's designed as a lightweight text model focused on speed and cost efficiency. For tool calling features, you would need to use higher-tier models in the Nova family or other providers that offer function calling support.