Using Distributed Representations to Disambiguate Biomedical and Clinical Concepts

Tulkens, Stéphan; Šuster, Simon; Daelemans, Walter


arXiv:1608.05605 (cs)

[Submitted on 19 Aug 2016]


Abstract: In this paper, we report a knowledge-based method for Word Sense Disambiguation in the domains of biomedical and clinical text. We combine word representations created on large corpora with a small number of definitions from the UMLS to create concept representations, which we then compare to representations of the context of ambiguous terms. Using no relational information, we obtain performance comparable to that of previous approaches on the MSH-WSD dataset, a well-known dataset in the biomedical domain. Additionally, our method is fast and easy to set up and extend to other domains. Supplementary materials, including source code, can be found at https://github.com/clips/yarn
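
The idea in the abstract (build a concept representation from the words of its definition, build a context representation from the words around the ambiguous term, and compare the two) can be illustrated with a minimal sketch. This is not the authors' implementation. The toy word vectors, the sense labels, the choice of averaging as the composition function, and the use of cosine similarity are all assumptions made for illustration; the abstract does not specify them.

```python
import math

# Hypothetical 3-dimensional word vectors. In practice these would come from
# word representations trained on a large corpus.
WORD_VECTORS = {
    "low": [0.8, 0.2, 0.1],
    "temperature": [0.9, 0.1, 0.0],
    "weather": [0.7, 0.0, 0.2],
    "winter": [0.8, 0.1, 0.1],
    "viral": [0.1, 0.9, 0.1],
    "infection": [0.0, 1.0, 0.2],
    "nose": [0.1, 0.8, 0.3],
    "cough": [0.0, 0.9, 0.1],
}

# Hypothetical sense inventory for the ambiguous term "cold": each sense is
# described by the content words of a short definition.
DEFINITIONS = {
    "cold_temperature": ["low", "temperature", "weather"],
    "cold_illness": ["viral", "infection", "nose"],
}


def average(words):
    """Compose known word vectors by averaging (assumed composition)."""
    vectors = [WORD_VECTORS[w] for w in words if w in WORD_VECTORS]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def disambiguate(context_words, definitions):
    """Return the sense whose definition vector is closest to the context."""
    context_vector = average(context_words)
    if context_vector is None:
        return None
    best_sense, best_score = None, float("-inf")
    for sense, definition_words in definitions.items():
        concept_vector = average(definition_words)
        if concept_vector is None:
            continue
        score = cosine(context_vector, concept_vector)
        if score > best_score:
            best_sense, best_score = sense, score
    return best_sense


if __name__ == "__main__":
    print(disambiguate(["cough", "nose", "infection"], DEFINITIONS))
    print(disambiguate(["winter", "weather"], DEFINITIONS))
```

Because the sketch only needs word vectors and definition text, it uses no relational information between concepts, which mirrors the constraint stated in the abstract.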

Comments:	6 pages, 1 figure, presented at the 15th Workshop on Biomedical Natural Language Processing, Berlin 2016
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1608.05605 [cs.CL]
	(or arXiv:1608.05605v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1608.05605
Journal reference:	Proceedings of the 15th Workshop on Biomedical Natural Language Processing, Berlin, Germany, 2016, pages 77-82. Association for Computational Linguistics

Submission history

From: Stéphan Tulkens
[v1] Fri, 19 Aug 2016 14:05:03 UTC (51 KB)
