NVIDIA's Segformer B 5 Finetuned Ade 640 640 is a vision model exceling at semantic segmentation tasks, particularly on benchmarks like ADE20K. Its architecture features a hierarchical Transformer encoder paired with a lightweight all-MLP decode head, allowing for efficient processing of high-resolution images like 640x640 inputs. With its pre-training on ImageNet-1k and fine-tuning on ADE20K, this model is well-suited for tasks involving detailed image understanding.
Input
Output
Context
-
Max Output
-
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 24, 2026
Automatically route workloads to the right model for every task, every time.