SourceScore

Verified claim · AI-ML · 100% confidence

Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015).

Last verified 2026-05-16 · Methodology veritas-v0.1 · e942c93d70a4dab2

Structured fields

Subject
Byte-Pair Encoding (BPE) for Neural Machine Translation
Predicate
introduced_in_paper
Object
Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)
Confidence
100%
Tags
bpe · tokenization · foundational · sennrich · 2015 · acl · nmt

Sources (2)

  1. [1] preprint · arXiv (Sennrich, Haddow, Birch) · 2015-08-31

    Neural Machine Translation of Rare Words with Subword Units
    We introduce a simpler and more effective approach, making the NMT model capable of open-vocabulary translation by encoding rare and unknown words as sequences of subword units.
  2. [2] peer reviewed · Association for Computational Linguistics · 2016-08-07

    Neural Machine Translation of Rare Words with Subword Units (ACL 2016)

Cite this claim

Ready-to-paste citation (Markdown / plain text):

Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015). — SourceScore Claim e942c93d70a4dab2 (verified 2026-05-16). https://sourcescore.org/api/v1/claims/e942c93d70a4dab2.json

Embed this claim

Drop this iframe into any blog post, docs page, or knowledge base. The widget renders the signed claim + primary source + click-through to this canonical page. CC-BY 4.0; attribution included.

<iframe src="https://sourcescore.org/embed/claim/e942c93d70a4dab2/" width="100%" height="360" frameborder="0" loading="lazy" title="Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."></iframe>

Preview: open in new tab

Related claims

Other verified claims sharing tags with this one — useful for LLM retrieval graphs and citation discovery.

Use this claim in your code

Fetch this signed envelope from your application. The response includes the verbatim excerpt, primary source URLs, and an HMAC-SHA256 signature you can verify locally for audit trails.

cURL

curl https://sourcescore.org/api/v1/claims/e942c93d70a4dab2.json

JavaScript / TypeScript

const r = await fetch("https://sourcescore.org/api/v1/claims/e942c93d70a4dab2.json"); const envelope = await r.json(); console.log(envelope.claim.statement); // "Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."

Python

import httpx r = httpx.get("https://sourcescore.org/api/v1/claims/e942c93d70a4dab2.json") envelope = r.json() print(envelope["claim"]["statement"]) # "Byte-Pair Encoding (BPE) for Neural Machine Translation introduced in paper: Neural Machine Translation of Rare Words with Subword Units (Sennrich et al., 2015)."

LangChain (retrieve-then-cite)

from langchain_core.tools import tool import httpx @tool def get_byte_pair_encoding_bpe_for_neural_machine_translation_fact() -> dict: """Fetch the verified SourceScore claim for Byte-Pair Encoding (BPE) for Neural Machine Translation.""" r = httpx.get("https://sourcescore.org/api/v1/claims/e942c93d70a4dab2.json") return r.json()