Qwen2.5-Coder-1.5B-Instruct

Generate text responses to prompts, enabling natural language understanding, multilingual support, and code generation

Model Properties

The pipeline, optimized for coding tasks, consists of a prefill model and a TBT model.

License name: Apache License 2.0
Number of parameters: 1.5B


Technical Details

Context Length: 2048
Operations: 29.4 GOPs
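
As a rough illustration of what the 2048-token context length means in practice, the sketch below computes how many new tokens can still be generated for a given prompt. It assumes the context window is shared between the prompt and the generated tokens, which is the usual convention but is not stated on this page. `max_new_tokens_for` is a hypothetical helper name, not part of any vendor API.

```python
CONTEXT_LENGTH = 2048  # Context Length from the Technical Details above


def max_new_tokens_for(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can be generated before the window is full.

    Assumption: prompt tokens and generated tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # Clamp at zero when the prompt alone fills or exceeds the window.
    return max(context_length - prompt_tokens, 0)


print(max_new_tokens_for(500))   # 1548
print(max_new_tokens_for(3000))  # 0
```

A caller could use the returned value as an upper bound when choosing a generation length for a given prompt.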

Performance Metrics

Prefill Tokens Per Second: 400
Time To First Token (sec): 0.24
Tokens Per Second (TPS): 11
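
The throughput figures above can be combined into a back-of-the-envelope latency estimate. The sketch below assumes that time to first token is roughly the prompt length divided by the prefill rate, and that each generated token then takes 1/TPS seconds. Both are simplifying assumptions, and `estimate_latency` is a hypothetical helper rather than a vendor API. Under this model, the reported 0.24 s time to first token would correspond to a prompt of about 96 tokens; the page itself does not state the prompt length used.

```python
PREFILL_TPS = 400.0  # prefill tokens per second, from the metrics above
DECODE_TPS = 11.0    # generated tokens per second (TPS), from the metrics above


def estimate_latency(prompt_tokens: int, new_tokens: int) -> tuple[float, float]:
    """Return (time_to_first_token, total_time) in seconds.

    Assumption: prefill cost is linear in prompt length and decode cost is
    linear in the number of generated tokens.
    """
    ttft = prompt_tokens / PREFILL_TPS
    decode_time = new_tokens / DECODE_TPS
    return ttft, ttft + decode_time


ttft, total = estimate_latency(prompt_tokens=96, new_tokens=110)
print(f"TTFT: {ttft:.2f} s, total: {total:.2f} s")  # TTFT: 0.24 s, total: 10.24 s
```

Real latency also depends on the device, host overhead, and prompt content, so this is only a first-order estimate.
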
Accuracy

Test: MMLU
Evaluation Metric: Accuracy
Full Precision Accuracy: 32.5
Post Quantization Accuracy: 31
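
To put the two MMLU figures in perspective, the short sketch below computes the absolute and relative accuracy change after quantization. It uses only the two values reported above; `quantization_drop` is a hypothetical helper name.

```python
def quantization_drop(full_precision: float, quantized: float) -> tuple[float, float]:
    """Return (absolute drop in points, relative drop in percent)."""
    absolute = full_precision - quantized
    relative = 100.0 * absolute / full_precision
    return absolute, relative


absolute, relative = quantization_drop(32.5, 31.0)
print(f"{absolute:.1f} points, {relative:.1f}% relative")  # 1.5 points, 4.6% relative
```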

Explore More Models

GenAI Models
DeepSeek-R1-Distill-Qwen-1.5B
Generate text responses to prompts, enabling natural language understanding, multilingual support, content creation and advanced reasoning
GenAI Models
Qwen2 1.5B
Generate text responses to prompts, enabling natural language conversations and content creation