Qwen2.5 1.5B

Generate text responses to prompts, enabling natural language understanding, multilingual support, and content creation

Model Properties

The pipeline consists of a prefill and tbt models

License name: Apache License 2.0
Number of parameters: 1.5B

Select device..

Technical Details

Context Length: 2048
Operations: 29.4GOPs

Performance Metrics

Prefill Token Per Sec 400
Time To First Token Per Sec 0.24
TPS 11
Accuracy
Test Evaluation Metric Full Precision Accuracy Post Quantization Accuracy
MMLU Accuracy 32.5 31

Explore More Models

GenAI Models
Qwen2.5-Coder-1.5B-Instruct
Generate text responses to prompts, enabling natural language understanding, multilingual support, and code generation
GenAI Models
DeepSeek-R1-Distill-Qwen-1.5B
Generate text responses to prompts, enabling natural language understanding, multilingual support, content creation and advanced reasoning