SegFormer is a semantic segmentation model that pairs a hierarchical Transformer encoder with a lightweight all-MLP decoder. It was built by researchers and released by NVIDIA. This checkpoint is fine-tuned on the ADE20K dataset at a resolution of 512x512. A notable technical trait is that its encoder is first pre-trained on ImageNet-1k; the decoder is then added and the whole model is fine-tuned.
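To make the encoder/decoder split concrete, here is a minimal sketch of the tensor shapes involved for a 512x512 input. It rests on assumptions the card does not state: the four-stage strides (4, 8, 16, 32) follow the layout described in the SegFormer paper, 150 is the ADE20K class count, and every name below (`FeatureMap`, `encoder_feature_maps`, `decoder_logits_shape`) is hypothetical and used only for illustration. It is shape arithmetic, not the model itself.

```python
from dataclasses import dataclass

# Assumptions (not stated in the card): four encoder stages with these
# downsampling strides, and 150 semantic classes for ADE20K.
STAGE_STRIDES = (4, 8, 16, 32)
NUM_ADE20K_CLASSES = 150


@dataclass(frozen=True)
class FeatureMap:
    """Spatial size of one encoder stage's output (hypothetical helper type)."""
    stride: int
    height: int
    width: int


def encoder_feature_maps(height, width):
    """Return the spatial size of each hierarchical encoder stage's output."""
    return [FeatureMap(s, height // s, width // s) for s in STAGE_STRIDES]


def decoder_logits_shape(height, width, num_classes=NUM_ADE20K_CLASSES):
    """Shape of the decoder output as (classes, H, W).

    The all-MLP decoder projects every stage to a common width, upsamples
    each to the finest (stride-4) grid, fuses them, and classifies per pixel.
    """
    finest = encoder_feature_maps(height, width)[0]
    return (num_classes, finest.height, finest.width)


maps = encoder_feature_maps(512, 512)
logits_shape = decoder_logits_shape(512, 512)

for fm in maps:
    print(f"stride {fm.stride:>2}: {fm.height}x{fm.width}")
print("decoder logits (classes, H, W):", logits_shape)
```

Under these assumptions the logits come out at one quarter of the input resolution, so a caller would typically upsample them back to 512x512 before taking the per-pixel argmax.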
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026