Meta / Llama 3.1 8B
meta/llama-3.1-8b
Llama 3.1 8B brings powerful performance in a smaller, more efficient package. With improved multilingual support, tool use, and a 128K context length, it enables sophisticated use cases like interactive agents and compact coding assistants while remaining lightweight and accessible.
Run predictions with the official JavaScript SDK. Use your API key from Settings.
This model may apply different rates depending on input fields (for example tokens, seconds, megapixels, or prediction type). When you run a prediction, the rules are checked in order and the first matching rule sets the effective price.
Pricing formula
Pricing is calculated separately for input, cached input, and output tokens.
Terms
| Rule | Condition | Rate |
|---|---|---|
| Base pricing (default when no criteria match) | none | $0.12 / million tokens |
| Input token pricing | token_type == input | $0.12 / million tokens |
| Cached input token pricing | token_type == cached_input | $0.12 / million tokens |
| Output token pricing | token_type == output | $0.12 / million tokens |
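As a rough illustration of how the rules in the table combine, the sketch below estimates the cost of one prediction from its token counts. It is not the provider's billing code. The function names (`rate_for`, `estimate_cost`) and the shape of the `usage` dictionary are hypothetical, and it assumes cached input tokens are reported separately from regular input tokens. Only the rates and the `token_type` values come from the table.

```python
from decimal import Decimal

# Rates in USD per million tokens, copied from the pricing table.
# Rules are checked in order; the first matching rule wins.
RULES = [
    ("input", Decimal("0.12")),
    ("cached_input", Decimal("0.12")),
    ("output", Decimal("0.12")),
]
BASE_RATE = Decimal("0.12")  # default rate when no rule matches
TOKENS_PER_UNIT = Decimal(1_000_000)


def rate_for(token_type: str) -> Decimal:
    """Return the rate of the first rule whose condition matches."""
    for rule_type, rate in RULES:
        if token_type == rule_type:
            return rate
    return BASE_RATE


def estimate_cost(usage: dict[str, int]) -> Decimal:
    """Sum the cost of each token type, priced separately.

    `usage` maps a token_type to a token count (hypothetical shape).
    """
    total = Decimal(0)
    for token_type, count in usage.items():
        total += rate_for(token_type) * count / TOKENS_PER_UNIT
    return total


cost = estimate_cost({"input": 50_000, "cached_input": 10_000, "output": 2_000})
print(f"${cost:.6f}")
```

Because every rule currently carries the same rate, the result equals the total token count times $0.12 per million. Keeping the rules separate means the estimate still holds if the rates later diverge. `Decimal` is used instead of floats to avoid rounding drift on very small amounts.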