SourceScore

Verified claim · AI-ML · 100% confidence

Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020).

Last verified 2026-05-16 · Methodology veritas-v0.1 · d3681b0981e0b700

Structured fields

Subject
Vision Transformer (ViT)
Predicate
introduced_in_paper
Object
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)
Confidence
100%
Tags
vit · vision-transformer · foundational · dosovitskiy · 2020 · google · iclr

Sources (2)

  1. [1] preprint · arXiv (Dosovitskiy et al., Google Research) · 2020-10-22

    An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
    We show that this reliance on CNNs is not necessary and a pure transformer applied directly to sequences of image patches can perform very well on image classification tasks.
  2. [2] peer reviewed · OpenReview / ICLR · 2021-05-04

    Vision Transformer (ICLR 2021)

Cite this claim

Ready-to-paste citation (Markdown / plain text):

Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020). — SourceScore Claim d3681b0981e0b700 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json

Embed this claim

Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.

<iframe src="https://sourcescore.org/embed/claim/d3681b0981e0b700/" width="100%" height="360" frameborder="0" loading="lazy" title="Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."></iframe>

Preview: open in new tab

Related claims

Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.

Use this claim in your code

Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.

cURL

curl https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json

JavaScript / TypeScript

const r = await fetch("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json"); const envelope = await r.json(); console.log(envelope.claim.statement); // "Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."

Python

import httpx r = httpx.get("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json") envelope = r.json() print(envelope["claim"]["statement"]) # "Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."

LangChain (retrieve-then-cite)

from langchain_core.tools import tool import httpx @tool def get_vision_transformer_vit_fact() -> dict: """Fetch the verified SourceScore claim for Vision Transformer (ViT).""" r = httpx.get("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json") return r.json()