Home GenAI Models Qwen2.5-Coder-1.5B-Instruct

Qwen2.5-Coder-1.5B-Instruct

Generate text responses to prompts, enabling natural language understanding, multilingual support, and code generation

Model Properties

The pipeline consists of a prefill step and token by token step, optimized for coding tasks

License name: Apache License 2.0
Number of parameters: 1.5B
Model Size: 1.64 GB

Select device..

Hailo-10H

Operations: 29.4 GOPs per input token

Context Length: 2048

Numerical Scheme: A8W4, symmetric, channel-wise

Inference Api: C++, Python, Hailo-Ollama

Compiled Model:

First Load Time In Sec 4.75

Time To First Token In Sec 0.32

Tps 8.13

GenAI Models

Generate text responses to prompts, enabling natural language understanding, multilingual support, content creation and advanced reasoning

GenAI Models

Generate text responses to prompts, enabling natural language conversations and content creation