DeepSeek-R1-Distill-Qwen-1.5B

Generate text responses to prompts, enabling natural language understanding,
multilingual support, content creation and advanced reasoning

Model Properties

Utilizes a transformer-based architecture with instruction
tuning to enhance logical reasoning, natural language understanding, and content
generation

License name: MIT License
Number of parameters: 1.5B
Model Size: 1.58 GB
Select device..

Technical Details

Operations: 29.4GOPs
Context Length: 2048
Numerical Scheme: A8W4, symmetric, channel-wise
Inference API: CPP, Hailo-Ollama

Performance Metrics

Load Time In Sec 9.81788
Time To First Token In Sec 0.680284
TPS 7.83009

Explore Related Models

GenAI Models
Qwen2.5-Coder-1.5B
Generate text responses to prompts, enabling natural language understanding, multilingual support, and code generation
GenAI Models
Qwen2.5 1.5B-Instruct
Generate text responses to prompts, enabling natural language understanding, multilingual support, and content creation