K-BERT: Enabling Language Representation with Knowledge Graph

Liu, Weijie; Zhou, Peng; Zhao, Zhe; Wang, Zhiruo; Ju, Qi; Deng, Haotang; Wang, Ping

Computer Science > Computation and Language

arXiv:1909.07606 (cs)

[Submitted on 17 Sep 2019]

Title:K-BERT: Enabling Language Representation with Knowledge Graph

Authors:Weijie Liu, Peng Zhou, Zhe Zhao, Zhiruo Wang, Qi Ju, Haotang Deng, Ping Wang

View PDF

Abstract:Pre-trained language representation models, such as BERT, capture a general language representation from large-scale corpora, but lack domain-specific knowledge. When reading a domain text, experts make inferences with relevant knowledge. For machines to achieve this capability, we propose a knowledge-enabled language representation model (K-BERT) with knowledge graphs (KGs), in which triples are injected into the sentences as domain knowledge. However, too much knowledge incorporation may divert the sentence from its correct meaning, which is called knowledge noise (KN) issue. To overcome KN, K-BERT introduces soft-position and visible matrix to limit the impact of knowledge. K-BERT can easily inject domain knowledge into the models by equipped with a KG without pre-training by-self because it is capable of loading model parameters from the pre-trained BERT. Our investigation reveals promising results in twelve NLP tasks. Especially in domain-specific tasks (including finance, law, and medicine), K-BERT significantly outperforms BERT, which demonstrates that K-BERT is an excellent choice for solving the knowledge-driven problems that require experts.

Comments:	8 pages, 20190917
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1909.07606 [cs.CL]
	(or arXiv:1909.07606v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1909.07606

Submission history

From: Weijie Liu [view email]
[v1] Tue, 17 Sep 2019 06:16:04 UTC (276 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-09

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Peng Zhou
Zhe Zhao
Qi Ju
Ping Wang

export BibTeX citation

Computer Science > Computation and Language

Title:K-BERT: Enabling Language Representation with Knowledge Graph

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:K-BERT: Enabling Language Representation with Knowledge Graph

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators