Qwen2.5 1.5B

Generate text responses to prompts, enabling natural language understanding, multilingual support, and content creation

Model Properties

The pipeline consists of a prefill and tbt models

License name: Apache License 2.0
Number of parameters: 1.5B

Select device..

Technical Details

Context Length: 2048
Operations: 29.4GOPs

Performance Metrics

Prefill Token Per Sec 400
Time To First Token Per Sec 0.24
TPS 11

Explore More Models

GenAI Models
Qwen2.5-Coder-1.5B-Instruct
Generate text responses to prompts, enabling natural language understanding, multilingual support, and code generation
GenAI Models
DeepSeek-R1-Distill-Qwen-1.5B
Generate text responses to prompts, enabling natural language understanding, multilingual support, content creation and advanced reasoning