Qwen2-1.5B-Instruct

Name: Qwen2-1.5B-Instruct
Brand: Hailo AI

Generate text responses to prompts, enabling natural language conversations and content creation

Model Properties

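The pipeline consists of two steps: a prefill step, which processes the full prompt in one pass, and a token-by-token step, which generates the response one token at a time.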
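
The two-step pipeline can be illustrated with a minimal sketch. It does not use the Hailo API: `generate`, `toy_next_token`, and the list-based `cache` are hypothetical stand-ins, and the toy "model" simply counts upward so that the control flow is easy to follow.

```python
from typing import Callable, List, Sequence


def generate(prompt: Sequence[int],
             next_token_fn: Callable[[List[int]], int],
             max_new_tokens: int,
             eos_token: int) -> List[int]:
    """Toy generation loop: one prefill pass, then token-by-token decoding."""
    # Prefill step: the whole prompt is consumed in one pass, which fills
    # the cache and yields the first candidate output token.
    cache = list(prompt)
    token = next_token_fn(cache)

    output: List[int] = []
    # Token-by-token step: each iteration adds exactly one new token.
    while token != eos_token and len(output) < max_new_tokens:
        output.append(token)
        cache.append(token)
        token = next_token_fn(cache)
    return output


def toy_next_token(cache: List[int]) -> int:
    """Hypothetical stand-in for the model: previous token plus one, mod 10."""
    return (cache[-1] + 1) % 10


if __name__ == "__main__":
    # Token 0 plays the role of end-of-sequence in this toy setup.
    print(generate([3, 4], toy_next_token, max_new_tokens=16, eos_token=0))
```

In a real deployment the prefill cost is what "time to first token" measures, while the loop body determines the tokens-per-second rate.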

License name: Apache License 2.0
Number of parameters: 1.5B
Model Size: 1.56 GB

Hailo-10H

Operations: 29.4 GOPs per input token

Context Length: 2048
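
A small sketch of how the context length constrains a request, assuming (as is usual for decoder-only models, but not stated on this page) that the 2048-token window is shared between the prompt and the generated tokens. The helper name `max_new_tokens_for` is hypothetical.

```python
CONTEXT_LENGTH = 2048  # from the model properties above


def max_new_tokens_for(prompt_tokens: int,
                       context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens can still be generated for a given prompt size.

    Assumption: prompt and generated tokens share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens >= context_length:
        raise ValueError("prompt does not fit in the context window")
    return context_length - prompt_tokens


if __name__ == "__main__":
    for n in (100, 1500, 2047):
        print(f"prompt of {n} tokens leaves room for {max_new_tokens_for(n)}")
```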

Numerical Scheme: A8W4, symmetric, channel-wise
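
To make the weight side of this scheme concrete, here is a minimal sketch of symmetric, channel-wise 4-bit quantization in plain Python. It reads "A8W4" as 8-bit activations and 4-bit weights. The rounding mode, the clipping range of [-7, 7], and the function names are assumptions for illustration, not a description of Hailo's actual compiler.

```python
from typing import List, Tuple


def quantize_channel(weights: List[float], bits: int = 4) -> Tuple[List[int], float]:
    """Symmetric quantization of one output channel with a single scale."""
    qmax = 2 ** (bits - 1) - 1                      # 7 for 4-bit weights
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0  # zero point is always 0
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def quantize_channelwise(matrix: List[List[float]],
                         bits: int = 4) -> List[Tuple[List[int], float]]:
    """Channel-wise: every row (output channel) gets its own scale."""
    return [quantize_channel(row, bits) for row in matrix]


def dequantize_channel(q: List[int], scale: float) -> List[float]:
    """Map integer codes back to approximate floating-point weights."""
    return [v * scale for v in q]


if __name__ == "__main__":
    weights = [[3.5, -1.0, 0.5], [0.07, 0.01, -0.02]]
    for codes, scale in quantize_channelwise(weights):
        print(codes, scale, dequantize_channel(codes, scale))
```

Using one scale per channel keeps a channel with small weights (the second row here) from being crushed by the range of a channel with large ones.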

Inference API: C++, Python, Hailo-Ollama

First Load Time: 3.79 s

Time To First Token: 0.32 s

Tokens Per Second (TPS): 8.08
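
As a rough guide to what the time-to-first-token and TPS figures above imply for a full response, the sketch below estimates end-to-end latency. It assumes that the first token arrives after 0.32 s regardless of prompt length (the page does not say which prompt was benchmarked) and that decoding then proceeds at a steady 8.08 tokens per second. The function name is hypothetical.

```python
def estimate_response_seconds(new_tokens: int,
                              ttft_s: float = 0.32,
                              tokens_per_s: float = 8.08) -> float:
    """Rough latency, in seconds, for a reply of `new_tokens` tokens."""
    if new_tokens < 1:
        raise ValueError("new_tokens must be at least 1")
    # The first token arrives after the time to first token; the remaining
    # tokens are assumed to follow at the steady decode rate.
    return ttft_s + (new_tokens - 1) / tokens_per_s


if __name__ == "__main__":
    for n in (1, 64, 256):
        print(f"{n:4d} tokens: ~{estimate_response_seconds(n):.1f} s")
```

The first load time (3.79 s) is a one-off cost paid when the model is loaded, so it is left out of the per-response estimate.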
