Verified claim · AI-ML · 100% confidence
Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
Last verified 2026-05-16 · Methodology veritas-v0.1 · aede848e23c8de8e
Structured fields
- Subject
- Byte-Pair Encoding (BPE) for NMT
- Predicate
introduced_in_paper- Object
- Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)
- Confidence
- 100%
- Tags
- bpe · tokenization · subword · foundational · 2015 · acl
Sources (2)
[1] preprint · arXiv (Sennrich, Haddow, Birch) · 2015-08-31
Neural Machine Translation of Rare Words with Subword Units“We discuss the suitability of different word segmentation techniques, including simple character n-gram models and a segmentation based on the byte pair encoding compression algorithm, and empirically show that subword models improve over a back-off dictionary baseline.”
[2] peer reviewed · ACL Anthology · 2016-08-07
Neural Machine Translation of Rare Words with Subword Units (ACL 2016)
Cite this claim
Ready-to-paste citation (Markdown / plain text):
Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015). — SourceScore Claim aede848e23c8de8e (verified 2026-05-16). https://sourcescore.org/api/v1/claims/aede848e23c8de8e.jsonEmbed this claim
Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.
<iframe src="https://sourcescore.org/embed/claim/aede848e23c8de8e/" width="100%" height="360" frameborder="0" loading="lazy" title="Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."></iframe>Preview: open in new tab
Related claims
Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.
Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).
e942c93d70a4dab2 · 100% confidence · shares 5 tags (bpe, tokenization, foundational…)
ResNet (Residual Networks) introduced in paper: Deep Residual Learning for Image Recognition (He et al., 2015).
4f55f77c4bfb316e · 100% confidence · shares 2 tags (foundational, 2015)
SentencePiece tokenizer introduced in paper: SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing (Kudo & Richardson, 2018).
0d47bb8eb637a2e4 · 100% confidence · shares 2 tags (tokenization, foundational)
Batch Normalization introduced in paper: Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift (Ioffe & Szegedy, 2015).
56c451642ab41e68 · 100% confidence · shares 2 tags (foundational, 2015)
BLEU score introduced in paper: BLEU: a Method for Automatic Evaluation of Machine Translation (Papineni et al., 2002).
bf5bdd9756278449 · 100% confidence · shares 2 tags (foundational, acl)
Use this claim in your code
Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.
cURL
curl https://sourcescore.org/api/v1/claims/aede848e23c8de8e.jsonJavaScript / TypeScript
const r = await fetch("https://sourcescore.org/api/v1/claims/aede848e23c8de8e.json");
const envelope = await r.json();
console.log(envelope.claim.statement);
// "Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."Python
import httpx
r = httpx.get("https://sourcescore.org/api/v1/claims/aede848e23c8de8e.json")
envelope = r.json()
print(envelope["claim"]["statement"])
# "Byte-Pair Encoding (BPE) for NMT introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."LangChain (retrieve-then-cite)
from langchain_core.tools import tool
import httpx
@tool
def get_byte_pair_encoding_bpe_for_nmt_fact() -> dict:
"""Fetch the verified SourceScore claim for Byte-Pair Encoding (BPE) for NMT."""
r = httpx.get("https://sourcescore.org/api/v1/claims/aede848e23c8de8e.json")
return r.json()