Last updated February 12, 2026
Nova Lite is a fast and cost-effective large language model (LLM) from Amazon. It offers a multimodal solution that can process image, video, and text inputs.
- Model ID: nova-lite
- Region: us, eu
When to Use This Model
Nova Lite is optimized for high-throughput tasks and supports a variety of common use cases, including rapid text generation, summarization, and copywriting.
Usage
Nova Lite follows our /v1/chat/completions API schema.
To provision access to the model, attach a Managed Inference and Agents add-on to your app $APP_NAME:
heroku addons:create heroku-inference:standard -a $APP_NAME
Using config variables, you can invoke the model in various ways:
- Heroku CLI ai plugin (heroku ai:models:call)
- curl
- Python
- Ruby
- JavaScript
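As a rough illustration of the Python option, the sketch below builds a request for the /v1/chat/completions endpoint using only the standard library. The helper names build_chat_request and send are hypothetical, and the sketch assumes the INFERENCE_URL and INFERENCE_KEY config variables are available as environment variables.

```python
import json
import urllib.request


def build_chat_request(prompt, base_url, api_key, model="nova-lite"):
    """Build (but do not send) a POST request for /v1/chat/completions."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Example call (requires network access and valid credentials):
#   import os
#   req = build_chat_request("Hello!", os.environ["INFERENCE_URL"], os.environ["INFERENCE_KEY"])
#   print(send(req))
```

Keeping request construction separate from sending makes the payload easy to inspect or unit test without contacting the service.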
Multimodal Support
- Supported inputs: text, image, video
- Supported outputs: text
Rate Limits
- Maximum requests per minute: 150
- Maximum tokens per minute: 800,000
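One way a client might stay under the requests-per-minute limit is a sliding-window limiter, sketched below. The class SlidingWindowLimiter is hypothetical and not part of any Heroku tooling. It tracks only request counts, so the tokens-per-minute limit would need separate accounting.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side guard that allows at most max_requests per rolling window."""

    def __init__(self, max_requests=150, window_seconds=60.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested without sleeping
        self._sent = deque()  # timestamps of requests inside the current window

    def wait_time(self):
        """Return seconds to wait before the next request (0.0 if allowed now)."""
        now = self.clock()
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) < self.max_requests:
            return 0.0
        return self.window_seconds - (now - self._sent[0])

    def record(self):
        """Record that a request was just sent."""
        self._sent.append(self.clock())
```

A caller would sleep for wait_time() seconds when it is nonzero, then call record() immediately before sending each request.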
Prompt Caching
Prompt caching is supported for system prompts but not for tools. Prompt caching requires a minimum of 1,000 tokens.
Example curl Requests
Text to Text
export INFERENCE_KEY=$(heroku config:get -a example-app INFERENCE_KEY)
export INFERENCE_URL=$(heroku config:get -a example-app INFERENCE_URL)
curl $INFERENCE_URL/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_KEY" \
  -H "Content-Type: application/json" \
  -d @- <<EOF
{
  "model": "nova-lite",
  "messages": [
    { "role": "user", "content": "Hello!" },
    { "role": "assistant", "content": "Hi there! How can I assist you today?" },
    { "role": "user", "content": "What's the weather like in Portland, Oregon right now?" }
  ],
  "temperature": 0.5,
  "max_tokens": 100,
  "stream": false,
  "tools": [
    {
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Fetches the current weather for a given city.",
        "parameters": {
          "type": "object",
          "properties": {
            "city": {
              "type": "string",
              "description": "The name of the city to get weather for."
            }
          },
          "required": ["city"]
        }
      }
    }
  ],
  "tool_choice": "auto",
  "top_p": 0.9
}
EOF
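When the request includes tools, the model may reply with a tool call instead of plain text. The sketch below shows one way a client might dispatch such calls. It assumes the response uses the common chat-completions shape (choices[0].message.tool_calls, with arguments encoded as a JSON string), which this article does not spell out. The get_weather stub and run_tool_calls helper are hypothetical.

```python
import json


def get_weather(city):
    """Hypothetical local stub; a real app would call a weather service here."""
    return {"city": city, "forecast": "unknown"}


# Maps tool names declared in the request to local Python callables.
TOOLS = {"get_weather": get_weather}


def run_tool_calls(response):
    """Return one 'tool' role message per tool call in the first choice."""
    message = response["choices"][0]["message"]
    results = []
    for call in message.get("tool_calls") or []:
        function = call["function"]
        handler = TOOLS[function["name"]]
        arguments = json.loads(function["arguments"])  # assumed to be a JSON string
        results.append({
            "role": "tool",
            "tool_call_id": call["id"],
            "content": json.dumps(handler(**arguments)),
        })
    return results
```

Under the same assumption, the returned messages would be appended to the conversation, after the assistant message that requested them, and sent in a follow-up request so the model can produce its final answer.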
Image to Text
curl -X POST $INFERENCE_URL/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_KEY" \
  -H "Content-Type: application/json" \
  -H "X-Forwarded-Proto: https" \
  -d @- <<EOF
{
  "model": "nova-lite",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "What do you see in this image?"},
      {"type": "image_url", "image_url": {"url": "https://upload.wikimedia.org/wikipedia/commons/thumb/0/09/A_chinstrap_penguin_%28Pygoscelis_antarcticus%29_on_Deception_Island_in_Antarctica.jpg/960px-A_chinstrap_penguin_%28Pygoscelis_antarcticus%29_on_Deception_Island_in_Antarctica.jpg"}}
    ]
  }]
}
EOF
Video to Text
curl -X POST $INFERENCE_URL/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_KEY" \
  -H "Content-Type: application/json" \
  -H "X-Forwarded-Proto: https" \
  -d @- <<EOF
{
  "model": "nova-lite",
  "stream": true,
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What do you see happening in this video?"
        },
        {
          "type": "video_url",
          "video_url": {
            "url": "https://freestockfootagearchive.com/wp-content/uploads/2025/10/Green-Screen-Glitch-Grunge-Overlay-Effect-Layer.mp4"
          }
        }
      ]
    }
  ]
}
EOF
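With "stream": true, the response arrives incrementally rather than as a single JSON body. The sketch below shows one way to reassemble the text. It assumes the stream uses server-sent events in the common chat-completions style: "data: {...}" lines carrying choices[].delta.content, ending with "data: [DONE]". This article does not document that format, and the helper iter_stream_text is hypothetical.

```python
import json


def iter_stream_text(lines):
    """Yield text fragments from an iterable of SSE lines (str or bytes)."""
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators, comments, and other SSE fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break  # assumed end-of-stream sentinel
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

Because the response object returned by urllib.request.urlopen can be iterated line by line as bytes, it could be passed directly to iter_stream_text, for example with "".join(iter_stream_text(response)).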