Microsoft's Swinv 2 Tiny Patch 4 Window 16 256 is a vision model that excels at image classification tasks, serving as a general-purpose backbone for both image classification and dense recognition tasks. It features a Swin Transformer v2 architecture, which builds hierarchical feature maps by merging image patches and has linear computation complexity to input image size due to local window-based self-attention computation.
Input
Output
Context
-
Max Output
-
Parameters
-
Input Modalities
Output Modalities
Data sourced from official provider APIs and documentation
Last updated: Jun 23, 2026
Automatically route workloads to the right model for every task, every time.