skip to main content
10.1007/978-3-030-33246-4_39guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

NotaryPedia: A Knowledge Graph of Historical Notarial Manuscripts

Published: 21 October 2019 Publication History

Abstract

The Notarial Archives in Valletta, the capital city of Malta, houses a rich and valuable collection of around twenty thousand notarial manuscripts dating back to the 15th century. The Archive wants to make the contents of this collection easily accessible and searchable to researchers and the general public. Knowledge Graphs have been successfully used to represent similar historical content. Nevertheless, building a Knowledge Graph for the archives is challenging as these documents are written in medieval Latin and currently there is a lack of information extraction tools that recognise this language. This is, furthermore, compounded with a lack of medieval Latin corpora to train and evaluate machine learning algorithms, as well as a lack of an ontological representation for the contents of notarial manuscripts. In this paper, we present NotaryPedia, a Knowledge Graph for the Notarial Archives. We extend our previous work on entity and keyphrase extraction with relation extraction to populate the Knowledge Graph using an ontological vocabulary for notarial deeds. Furthermore, we perform Knowledge Graph completeness using link-prediction and inference. Our work was evaluated using different translational distance and semantic matching models to predict relations amongst literals by promoting them to entities and to infer new knowledge from existing entities. A 49% relation prediction accuracy using TransE was achieved.

References

[1]
ISAD(G): General international standard archival description 2000, 2 edn. (2000)
[2]
Ahonen, E., Hyvonen, E.: Publishing Historical Texts on the Semantic Web –A Case Study, pp. 167–173. IEEE (2009)
[3]
Bordes, A., Usunier, N., Garcia-Duran, A., Weston, J., Yakhnenko, O.: Translating Embeddings for Modeling Multi-relational Data, pp. 2787–2795 (2013)
[4]
Debruyne C, Beyan OD, Grant R, Collins S, Decker S, and Harrower N A semantic architecture for preserving and interpreting the information contained in irish historical vital records Int. J. Digit. Libr. 2016 17 3 159-174
[5]
Efremova J, Montes García A, and Calders T Hanbury A, Kazai G, Rauber A, and Fuhr N Classification of historical notary acts with noisy labels Advances in Information Retrieval 2015 Cham Springer 49-54
[6]
Efremova J, García AM, Iriondo AB, Calders T, et al. Braslavski P et al. Who are my ancestors? Retrieving family relationships from historical texts Information Retrieval 2016 Cham Springer 121-129
[7]
Efremova, J., Montes Garcia, A., Calders, T., Zhang, J.: Towards population reconstruction: extraction of family relationships from historical documents (2015)
[8]
Efremova J et al. Bloothooft G, Christen P, Mandemakers K, Schraagen M, et al. Multi-source entity resolution for genealogical data Population Reconstruction 2015 Cham Springer 129-154
[9]
Ehrlinger, L., Wob, W.: Towards a Definition of Knowledge Graphs (2016)
[10]
Ellul, C., Abela, C., Azzopardi, J.: Extracting Information from Medieval Notarial deeds, pp. 25–28. EKAW (2018)
[11]
Erdmann, A., et al.: Challenges and solutions for latin named entity recognition. In: The COLING 2016 Organizing Committee, pp. 85–93 (2016)
[12]
Feeney KC, O’Sullivan D, Tai W, and Brennan R Improving curated web-data quality with structured harvesting and assessment Int. J. Semant. Web Inf. Syst. 2014 10 2 35-62
[13]
Fiorini, S.: Documentary Sources of Maltese History Part I Notarial Documents No 1 Notary Giacomo Zabbara. University of Malta, 1 edn. (1996)
[14]
Gonzalez, E.: Unsupervised Relation Extraction by Massive Clustering (2009)
[15]
Han, X., et al.: Openke: an open toolkit for knowledge embedding. In: Proceedings of EMNLP (2018)
[16]
Monti M et al. Pan JZ, Vetere G, Gomez-Perez JM, Wu H, et al. Construction of enterprise knowledge graphs Exploiting Linked Data and Knowledge Graphs in Large Organisations 2017 Cham Springer
[17]
Paulheim H Knowledge graph refinement: a survey of approaches and evaluation methods Semant. Web 2016 8 3 489-508
[18]
Pawar, S., Palshikar, G., Bhattacharyya, P.: Relation Extraction: A Survey (2017)
[19]
Ruddock B Linked data and the locah project Bus. Inf. Rev. 2011 28 2 105-111
[20]
Siddiqui T and Aalam P Short text clustering; challenges & solutions: a literature review Int. J. Math. Comput. Res. 2015 3 6 1025-1031
[21]
Srinivas V Link Prediction in Social Networks 2016 1 Cham Springer
[22]
Villazon-Terrazas B, Garcia-Santa N, Ren Y, Srinivas K, Rodriguez-Muro M, Alexopoulos P, and Pan JZ Construction of enterprise knowledge graphs (I) Exploiting Linked Data and Knowledge Graphs in Large Organisations 2017 Cham Springer 87-116
[23]
Wang Q, Mao Z, Wang B, and Guo L Knowledge graph embedding: a survey of approaches and applications IEEE Trans. Knowl. Data Eng. 2017 29 12 2724-2743
[24]
Winkler, W.: String comparator metrics and enhanced decision rules in the fellegi-sunter model of record linkage. In: Proceedings of the Section on Survey Research Methods (1990)
[25]
Yang Y, Lichtenwalter RN, and Chawla NV Evaluating link prediction methods Knowl. Inf. Syst. 2014 45 3 751-782

Index Terms

  1. NotaryPedia: A Knowledge Graph of Historical Notarial Manuscripts
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image Guide Proceedings
          On the Move to Meaningful Internet Systems: OTM 2019 Conferences: Confederated International Conferences: CoopIS, ODBASE, C&TC 2019, Rhodes, Greece, October 21–25, 2019, Proceedings
          Oct 2019
          780 pages
          ISBN:978-3-030-33245-7
          DOI:10.1007/978-3-030-33246-4

          Publisher

          Springer-Verlag

          Berlin, Heidelberg

          Publication History

          Published: 21 October 2019

          Author Tags

          1. Knowledge Graph
          2. Medieval latin text
          3. Notarial Ontology
          4. Relation extraction
          5. Link prediction

          Qualifiers

          • Article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 15 Sep 2024

          Other Metrics

          Citations

          View Options

          View options

          Get Access

          Login options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media