BigBird RoBERTa Base is a sparse-attention transformer model developed by Google that excels at long inputs such as full documents, with a context window of 4,096 tokens. It uses block sparse attention, whose compute cost grows roughly linearly with sequence length rather than quadratically, as it does in the full self-attention used by BERT.
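The cost difference between block sparse and full attention can be illustrated with a rough count of query-key score computations. The sketch below is a simplified estimate, not the model's actual implementation: the function names are hypothetical, and the block size of 64, the 3-block sliding window, the 2 global blocks, and the 3 random blocks are assumed values chosen for illustration. Edge effects, and the fact that global blocks themselves attend to the whole sequence, are ignored.

```python
def full_attention_scores(seq_len: int) -> int:
    """Number of query-key scores when every token attends to every token."""
    return seq_len * seq_len


def block_sparse_scores(
    seq_len: int,
    block_size: int = 64,     # assumed block size
    window_blocks: int = 3,   # assumed sliding-window width, in blocks
    global_blocks: int = 2,   # assumed number of global blocks
    random_blocks: int = 3,   # assumed number of random blocks
) -> int:
    """Rough number of query-key scores under block sparse attention.

    Each query block attends to a fixed number of key blocks, so the
    total grows linearly with the number of blocks.
    """
    if seq_len % block_size != 0:
        raise ValueError("seq_len must be a multiple of block_size")
    num_blocks = seq_len // block_size
    # A query block cannot attend to more key blocks than exist.
    blocks_per_query = min(num_blocks, window_blocks + global_blocks + random_blocks)
    return num_blocks * blocks_per_query * block_size * block_size


if __name__ == "__main__":
    for n in (1024, 2048, 4096):
        full = full_attention_scores(n)
        sparse = block_sparse_scores(n)
        print(f"seq_len={n}: full={full}, block_sparse={sparse}, ratio={full / sparse:.1f}")
```

Under these assumptions, doubling the sequence length doubles the block sparse count but quadruples the full attention count, which is why the gap widens as inputs approach the 4,096-token limit.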
Input
Output
Context: 4K
Max Output: -
Parameters: -
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026