Generate text responses to prompts, enabling natural language understanding,
multilingual support, and code generation.
The pipeline consists of a prefill and TBT models, optimized
for coding tasks
Test | Evaluation Metric | Full Precision Accuracy | Post Quantization Accuracy |
---|---|---|---|
MMLU | accuracy | 48 | 43 |