Verified claim · AI-ML · 100% confidence
Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020).
Last verified 2026-05-16 · Methodology veritas-v0.1 · d3681b0981e0b700
Structured fields
- Subject
- Vision Transformer (ViT)
- Predicate
introduced_in_paper- Object
- An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)
- Confidence
- 100%
- Tags
- vit · vision-transformer · foundational · dosovitskiy · 2020 · google · iclr
Sources (2)
[1] preprint · arXiv (Dosovitskiy et al., Google Research) · 2020-10-22
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale“We show that this reliance on CNNs is not necessary and a pure transformer applied directly to sequences of image patches can perform very well on image classification tasks.”
[2] peer reviewed · OpenReview / ICLR · 2021-05-04
Vision Transformer (ICLR 2021)
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020). — SourceScore Claim d3681b0981e0b700 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/d3681b0981e0b700.jsonEmbed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/d3681b0981e0b700/" width="100%" height="360" frameborder="0" loading="lazy" title="Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."></iframe>Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
Reformer introduced in paper: Reformer: The Efficient Transformer (Kitaev, Kaiser, Levskaya, 2020).
76f7f00e79bc18c8 · 100% confidence · shares 4 tags (foundational, 2020, iclr…)
ELECTRA introduced in paper: ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators (Clark et al., 2020).
2f9c79357e9d4da9 · 100% confidence · shares 3 tags (foundational, 2020, google)
Mixture of Experts (MoE) revival popularized in: Shazeer et al. 2017 — outrageously large neural networks via sparse gating.
f068236101568ad7 · 100% confidence · shares 3 tags (google, foundational, iclr)
Retrieval-Augmented Generation (RAG) introduced in paper: Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks (Lewis et al., 2020).
d15057ced937a103 · 100% confidence · shares 2 tags (foundational, 2020)
BERT (Bidirectional Encoder Representations from Transformers) introduced in paper: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., 2018).
4c1ee70007dc89c1 · 100% confidence · shares 2 tags (foundational, google)
Use this claim in your code
Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.
cURL
curl https://sourcescore.org/api/v1/claims/d3681b0981e0b700.jsonJavaScript / TypeScript
const r = await fetch("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json");
const envelope = await r.json();
console.log(envelope.claim.statement);
// "Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."Python
import httpx
r = httpx.get("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json")
envelope = r.json()
print(envelope["claim"]["statement"])
# "Vision Transformer (ViT) introduced in paper: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Dosovitskiy et al., 2020)."LangChain (retrieve-then-cite)
from langchain_core.tools import tool
import httpx
@tool
def get_vision_transformer_vit_fact() -> dict:
"""Fetch the verified SourceScore claim for Vision Transformer (ViT)."""
r = httpx.get("https://sourcescore.org/api/v1/claims/d3681b0981e0b700.json")
return r.json()