Generate text responses to prompts, enabling natural language understanding,
multilingual support, and content creation.
The pipeline consists of two models: a prefill model and a tbt model.
License name: Apache License 2.0

Test | Evaluation Metric | Full Precision Accuracy | Post Quantization Accuracy |
---|---|---|---|
MMLU | accuracy | 59 | 51 |
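
To put the MMLU row in perspective, the short sketch below computes the absolute and relative accuracy change between the full-precision and post-quantization results. It assumes the table values are percentages, which the table does not state explicitly; the function name `accuracy_drop` is hypothetical and used only for illustration.

```python
def accuracy_drop(full_precision: float, post_quantization: float) -> tuple[float, float]:
    """Return (absolute drop in points, relative drop in percent).

    Assumes both inputs are accuracies on the same scale (e.g. percent).
    """
    absolute = full_precision - post_quantization
    relative = absolute / full_precision * 100.0
    return absolute, relative


# MMLU values taken from the table above (assumed to be percentages).
absolute, relative = accuracy_drop(59.0, 51.0)
print(f"Absolute drop: {absolute:.1f} points")
print(f"Relative drop: {relative:.1f}% of the full-precision accuracy")
```

Under that assumption, quantization costs 8 points on MMLU, or roughly 13.6% of the full-precision score.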