Overview
Nova Micro is the compact tier of the Nova family, tuned for low-latency assistants and high-throughput apps. It follows instructions cleanly and supports long context, tool and function calling, and reliable JSON output, with steady performance on multilingual and lightweight coding tasks.
Description
Nova Micro is built for speed, cost control, and predictable behavior in production. It keeps multi-turn conversations coherent, summarizes documents, answers questions with clear reasoning, and handles common coding and data tasks without heavy compute. The API supports function calls for retrieval and actions, streams tokens for a responsive UX, and returns JSON that conforms to the requested schema, so pipelines can parse outputs without brittle post-processing. It quantizes well for modest GPUs or edge devices, which makes fleet deployment practical. Teams typically use Nova Micro as the default engine for chat, support, and workflow automation, then route the hardest prompts to a larger Nova tier when extra depth is required.
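The default-then-escalate pattern described above could be sketched roughly as follows. This is a minimal illustration, not the Nova API: the model callables, the `REQUIRED_FIELDS` shape, and the rule "escalate when the compact tier's output fails validation" are all assumptions made for the example, and the stand-in models return canned strings so the sketch runs offline.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical fields for a support reply; not taken from any Nova schema.
REQUIRED_FIELDS = {"intent": str, "reply": str}


def parse_structured(raw: str) -> Dict[str, Any]:
    """Parse model output and check it has the expected fields and types."""
    data = json.loads(raw)  # raises json.JSONDecodeError (a ValueError) on bad JSON
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    for name, expected_type in REQUIRED_FIELDS.items():
        if not isinstance(data.get(name), expected_type):
            raise ValueError(f"missing or mistyped field: {name}")
    return data


def answer(prompt: str,
           small_model: Callable[[str], str],
           large_model: Callable[[str], str]) -> Dict[str, Any]:
    """Try the compact tier first; escalate if its output fails validation."""
    try:
        result = parse_structured(small_model(prompt))
        result["tier"] = "small"
    except ValueError:
        result = parse_structured(large_model(prompt))
        result["tier"] = "large"
    return result


# Stand-in models (assumptions) so the sketch runs without network access.
def fake_small(prompt: str) -> str:
    if "refund" in prompt:
        return '{"intent": "refund", "reply": "I can help with that."}'
    return "Sorry, I am not sure."  # not JSON, so the caller escalates


def fake_large(prompt: str) -> str:
    return '{"intent": "other", "reply": "Here is a detailed answer."}'
```

In a real deployment the two callables would wrap whatever client the team already uses, and the escalation rule might instead depend on prompt length, a classifier, or a confidence signal; validation failure is used here only because it is easy to demonstrate.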
About Amazon
Global e-commerce and cloud giant behind Prime and AWS.