Generate text responses to prompts, enabling natural language conversations
and content creation.
The pipeline consists of a prefill and tbt models
License name: Apache License 2.0Test | Evaluation Metric | Full Precision Accuracy | Post Quantization Accuracy |
---|---|---|---|
MMLU | accuracy | 55 | 51 |