Microsoft's Deberta Large Mnli is a chat model fine-tuned for natural language understanding tasks, particularly exceling at tasks like MNLI, where it achieves an accuracy of 91.3/91.1. Notably, it utilizes disentangled attention and an enhanced mask decoder, and has a context window of 512 tokens. With top-notch performance on several GLUE benchmark tasks, including SST-2 with an accuracy of 96.5, it demonstrates strong capabilities in text generation and understanding.
Input
Output
Context
1K
Max Output
1K
Parameters
350M
Input Modalities
Output Modalities
Estimates based on INT8 quantization. Actual requirements vary by framework and configuration.
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.