Developer/Model ID | Type | Context length (batch size) | Features and optimizations | View on Hugging Face |
---|---|---|---|---|
Meta | ||||
Meta-Llama-3.3-70B-Instruct | Text | View
| View
| Model card |
Meta-Llama-3.1-8B-Instruct | Text | View
| View
| Model card |
Llama-4-Maverick-17B-128E-Instruct | Image, Text | View
| View
| Model card |
DeepSeek | ||||
DeepSeek-R1-0528 | Reasoning, Text | View
| View
| Model card |
DeepSeek-R1-Distill-Llama-70B | Reasoning, Text | View
| View
| Model card |
DeepSeek-V3-0324 | Text | View
| View
| Model card |
DeepSeek-V3.1 | Reasoning, Text | View
| View
| Model card |
OpenAI | ||||
Whisper-Large-v3 | Audio | View
| View
| Model card |
Qwen | ||||
Qwen3-32B | Reasoning, Text | View
| View
| Model card |
Tokyotech-llm | ||||
Llama-3.3-Swallow-70B-Instruct-v0.4 | Text | View
| View
| Model card |
Other | ||||
E5-Mistral-7B-Instruct | Embedding | View
| View
| Model card |