Verified claim · AI-ML · 100% confidence
Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research.
Last verified 2026-05-16 · Methodology veritas-v0.1 · 6cdc7730bf41bb3d
Structured fields
- Subject
- Speculative decoding
- Predicate
introduced_in- Object
- Leviathan, Kalman, Matias 2023 — Google Research
- Confidence
- 100%
- Tags
- speculative-decoding · google · inference · foundational · icml · 2022 · introduced_in
Sources (2)
[1] preprint · arXiv (Leviathan, Kalman, Matias / Google Research) · 2022-11-30
Fast Inference from Transformers via Speculative Decoding“Inference from large autoregressive models like Transformers is slow - decoding K tokens takes K serial runs of the model. In this work we introduce speculative decoding - an algorithm to sample from autoregressive models faster without any changes to the outputs, by computing several tokens in parallel.”
[2] peer reviewed · PMLR / ICML 2023 · 2023-07-23
Speculative Decoding — ICML 2023 proceedings
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research. — SourceScore Claim 6cdc7730bf41bb3d (verified 2026-05-16). https://sourcescore.org/api/v1/claims/6cdc7730bf41bb3d.jsonEmbed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/6cdc7730bf41bb3d/" width="100%" height="360" frameborder="0" loading="lazy" title="Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research."></iframe>Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
Chain-of-Thought prompting introduced in paper: Chain-of-Thought Prompting Elicits Reasoning in Large Language Models (Wei et al., 2022).
3af924da138ff84c · 100% confidence · shares 3 tags (foundational, 2022, google)
Batch Normalization introduced in paper: Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (Ioffe & Szegedy, 2015).
56c451642ab41e68 · 100% confidence · shares 3 tags (foundational, icml, google)
PaLM introduced in paper: PaLM: Scaling Language Modeling with Pathways (Chowdhery et al., 2022).
d58d505fd9d705fe · 100% confidence · shares 3 tags (google, foundational, 2022)
Imagen introduced in paper: Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding (Saharia et al., 2022).
30fdfa95f8684ca5 · 100% confidence · shares 3 tags (google, foundational, 2022)
GPTQ introduced in: Frantar et al. 2022 — accurate post-training quantization for GPT models.
a9ab1ec12062f7ae · 100% confidence · shares 3 tags (inference, 2022, introduced_in)
Use this claim in your code
Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.
cURL
curl https://sourcescore.org/api/v1/claims/6cdc7730bf41bb3d.jsonJavaScript / TypeScript
const r = await fetch("https://sourcescore.org/api/v1/claims/6cdc7730bf41bb3d.json");
const envelope = await r.json();
console.log(envelope.claim.statement);
// "Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research."Python
import httpx
r = httpx.get("https://sourcescore.org/api/v1/claims/6cdc7730bf41bb3d.json")
envelope = r.json()
print(envelope["claim"]["statement"])
# "Speculative decoding introduced in: Leviathan, Kalman, Matias 2023 — Google Research."LangChain (retrieve-then-cite)
from langchain_core.tools import tool
import httpx
@tool
def get_speculative_decoding_fact() -> dict:
"""Fetch the verified SourceScore claim for Speculative decoding."""
r = httpx.get("https://sourcescore.org/api/v1/claims/6cdc7730bf41bb3d.json")
return r.json()