Qwen2.5-Coder-1.5B

Generate text responses to prompts, enabling natural language understanding,
multilingual support, and code generation.

Model Properties

The pipeline consists of a prefill model and a TBT model, optimized
for coding tasks.

License name: Apache License 2.0
Number of parameters: 1.5B
Model Size: 1.64 GB

Technical Details

Operations: 29.4 GOPs per input token
Context Length: 2048
Numerical Scheme: A8W4, symmetric, channel-wise
Inference API: C++, Hailo-Ollama
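The Numerical Scheme entry above is commonly read as 8-bit activations with 4-bit weights, quantized symmetrically with one scale per output channel. The following is a minimal sketch of symmetric channel-wise 4-bit weight quantization for illustration only. The function names, the sample weights, and the choice of the clipped range [-7, 7] are assumptions, and the quantizer actually used to compile this model may differ.

```python
def quantize_channel(weights, bits=4):
    """Symmetrically quantize one output channel to signed integers.

    Assumption: values are clipped to [-qmax, qmax] with qmax = 2**(bits-1) - 1,
    so the zero point is fixed at 0 (that is what "symmetric" means here).
    """
    qmax = 2 ** (bits - 1) - 1                  # 7 for 4-bit weights
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return quantized, scale


def quantize_channelwise(matrix, bits=4):
    """Quantize each row (one output channel) with its own scale."""
    return [quantize_channel(row, bits) for row in matrix]


def dequantize(quantized, scale):
    """Map integer codes back to approximate real-valued weights."""
    return [q * scale for q in quantized]


# Hypothetical weights: two channels with very different magnitudes.
weights = [
    [0.70, -0.30, 0.10, 0.00],
    [0.02, -0.01, 0.04, -0.03],
]

for row, (codes, scale) in zip(weights, quantize_channelwise(weights)):
    restored = dequantize(codes, scale)
    worst = max(abs(a - b) for a, b in zip(row, restored))
    print(f"codes={codes} scale={scale:.5f} max_error={worst:.5f}")
```

Because each channel gets its own scale, the small-magnitude second channel still uses the full 4-bit range instead of collapsing to zeros, which is the usual motivation for channel-wise over tensor-wise scaling.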

Performance Metrics

First Load Time (s): 8.58971
Time To First Token (s): 0.322522
Tokens Per Second (TPS): 8.08952
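As a rough guide to what these figures mean for a whole response, the sketch below estimates generation time from the listed time to first token and tokens-per-second values. It assumes the first token arrives at the time to first token and each later token takes 1/TPS seconds; the function name is hypothetical, the one-time first load is excluded, and real latency will vary with prompt length and device.

```python
TTFT_S = 0.322522   # time to first token, from the metrics above
TPS = 8.08952       # tokens per second, from the metrics above


def estimate_generation_time(num_output_tokens, ttft_s=TTFT_S, tps=TPS):
    """Estimate seconds to produce `num_output_tokens` tokens.

    Assumption: token 1 arrives after `ttft_s`, and every further token
    takes 1 / tps seconds. Model load time is not included.
    """
    if num_output_tokens <= 0:
        return 0.0
    return ttft_s + (num_output_tokens - 1) / tps


for n in (1, 32, 128, 512):
    print(f"{n:4d} tokens: about {estimate_generation_time(n):.1f} s")
```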

Explore More Models

GenAI Models

DeepSeek-R1-Distill-Qwen-1.5B
Generate text responses to prompts, enabling natural language understanding, multilingual support, content creation, and advanced reasoning.

Qwen2 1.5B Instruct
Generate text responses to prompts, enabling natural language conversations and content creation.