Qwen2 1.5B

Generate text responses to prompts, enabling natural language conversations and content creation

Model Properties

The pipeline consists of a prefill and TBT models, optimized for coding tasks

License name: Apache License 2.0
Number of parameters: 1.5B

Select device..

Technical Details

Context Length: 2048
Operations: 29.4GOPs

Performance Metrics

Prefill Token Per Sec 400
Time To First Token Per Sec 0.24
TPS 11
Accuracy
Test Evaluation Metric Full Precision Accuracy Post Quantization Accuracy
MMLU Accuracy 32.5 31