research-article

Open access

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search

Authors:

Krisztian Balog,

Fabio CrestaniAuthors Info & Claims

CHIIR '24: Proceedings of the 2024 Conference on Human Information Interaction and Retrieval

Pages 209 - 218

https://doi.org/10.1145/3627508.3638300

Published: 10 March 2024 Publication History

All formats PDF

Abstract

Conversational Information Seeking (CIS) is an emerging paradigm for knowledge acquisition and exploratory search. Traditional web search interfaces enable easy exploration of entities, but this is limited in conversational settings due to the limited-bandwidth interface. This paper explore ways to rewrite answers in CIS, so that users can understand them without having to resort to external services or sources. Specifically, we focus on salient entities—entities that are central to understanding the answer. As our first contribution, we create a dataset of conversations annotated with entities for saliency. Our analysis of the collected data reveals that the majority of answers contain salient entities. As our second contribution, we propose two answer rewriting strategies aimed at improving the overall user experience in CIS. One approach expands answers with inline definitions of salient entities, making the answer self-contained. The other approach complements answers with follow-up questions, offering users the possibility to learn more about specific entities. Results of a crowdsourcing-based study indicate that rewritten answers are clearly preferred over the original ones. We also find that inline definitions tend to be favored over follow-up questions, but this choice is highly subjective, thereby providing a promising future direction for personalization.

References

[1]

Mohammad Aliannejadi, Hamed Zamani, Fabio Crestani, and W Bruce Croft. 2019. Asking clarifying questions in open-domain information-seeking conversations. In Proceedings of the 42nd International ACM SIGIR conference on research and development in information retrieval(SIGIR ’19). 475–484.

Digital Library

[2]

James E Allen, Curry I Guinn, and Eric Horvtz. 1999. Mixed-initiative interaction. IEEE Intelligent Systems and their Applications 14, 5 (1999), 14–23.

Digital Library

[3]

Avishek Anand, Lawrence Cavedon, Hideo Joho, Mark Sanderson, and Benno Stein. 2020. Conversational search (Dagstuhl seminar 19461). In Dagstuhl Reports, Vol. 9. Schloss Dagstuhl-Leibniz-Zentrum für Informatik.

[4]

Raviteja Anantha, Svitlana Vakulenko, Zhucheng Tu, Shayne Longpre, Stephen Pulman, and Srinivas Chappidi. 2021. Open-Domain Question Answering Goes Conversational via Question Rewriting. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies(NAACL ’21). 520–534.

[5]

Krisztian Balog. 2018. Entity-oriented search. Springer.

[6]

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, 2020. Language models are few-shot learners. In Advances in Neural Information Processing Systems(NeurIPS ’20). 1877–1901.

[7]

Arthur Câmara, Nirmal Roy, David Maxwell, and Claudia Hauff. 2021. Searching to Learn with Instructional Scaffolding. In Proceedings of the 2021 Conference on Human Information Interaction and Retrieval(CHIIR ’21). 209–218.

Digital Library

[8]

Eunsol Choi, He He, Mohit Iyyer, Mark Yatskar, Wen-tau Yih, Yejin Choi, Percy Liang, and Luke Zettlemoyer. 2018. QuAC: Question answering in context. arXiv preprint arXiv:1808.07036 (2018).

[9]

Kevyn Collins-Thompson, Paul N. Bennett, Ryen W. White, Sebastian de la Chica, and David Sontag. 2011. Personalizing Web Search Results by Reading Level. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management(CIKM ’11). 403–412.

Digital Library

[10]

Jeffrey Dalton, Chenyan Xiong, and Jamie Callan. 2019. TREC CAsT 2019: The conversational assistance track overview. In The Twenty-Eighth Text REtrieval Conference Proceedings(TREC’19).

[11]

Carsten Eickhoff, Jaime Teevan, Ryen White, and Susan Dumais. 2014. Lessons from the Journey: A Query Log Analysis of within-Session Learning. In Proceedings of the 7th ACM International Conference on Web Search and Data Mining(WSDM ’14). 223–232.

Digital Library

[12]

Liana Ermakova, Patrice Bellot, Jaap Kamps, Diana Nurbakova, Irina Ovchinnikova, Eric SanJuan, Elise Mathurin, Sílvia Araújo, Radia Hannachi, Stéphane Huet, 2022. Automatic Simplification of Scientific Texts: SimpleText Lab at CLEF-2022. In European Conference on Information Retrieval(ECIR ’22). 364–373.

[13]

Joseph L Fleiss. 1971. Measuring nominal scale agreement among many raters.Psychological bulletin 76, 5 (1971), 378.

[14]

Ujwal Gadiraju, Ran Yu, Stefan Dietze, and Peter Holtz. 2018. Analyzing knowledge gain of users in informational search sessions on the web. In Proceedings of the 2018 Conference on Human Information Interaction & Retrieval(CHIIR ’18). 2–11.

Digital Library

[15]

Michael Gamon, Tae Yano, Xinying Song, Johnson Apacible, and Patrick Pantel. 2013. Identifying Salient Entities in Web Pages. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management(CIKM ’13). 2375–2380.

Digital Library

[16]

Luyu Gao, Xueguang Ma, Jimmy Lin, and Jamie Callan. 2023. Precise Zero-Shot Dense Retrieval without Relevance Labels. (2023), 1762–1777.

[17]

Yasin Ghafourian. 2022. Relevance Models Based on the Knowledge Gap. In European Conference on Information Retrieval(ECIR ’22). Springer, 488–495.

[18]

Xianpei Han, Le Sun, and Jun Zhao. 2011. Collective entity linking in web text: a graph-based method. In Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval(SIGIR ’11). 765–774.

Digital Library

[19]

Marti Hearst. 2009. Search user interfaces. Cambridge University Press.

[20]

Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Ye Jin Bang, Andrea Madotto, and Pascale Fung. 2023. Survey of hallucination in natural language generation. Comput. Surveys 55, 12 (2023), 1–38.

Digital Library

[21]

Hideaki Joko and Faegheh Hasibi. 2022. Personal Entity, Concept, and Named Entity Linking in Conversations. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management(CIKM ’22’). 4099–4103.

Digital Library

[22]

Hideaki Joko, Faegheh Hasibi, Krisztian Balog, and Arjen P de Vries. 2021. Conversational Entity Linking: Problem Definition and Datasets. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR ’21). 2390–2397.

Digital Library

[23]

Tom Kwiatkowski, Jennimaria Palomaki, Olivia Redfield, Michael Collins, Ankur Parikh, Chris Alberti, Danielle Epstein, Illia Polosukhin, Jacob Devlin, Kenton Lee, 2019. Natural Questions: A Benchmark for Question Answering Research. Transactions of the Association for Computational Linguistics 7 (2019), 452–466.

[24]

Gondy Leroy, James E Endicott, David Kauchak, Obay Mouradi, Melissa Just, 2013. User evaluation of the effects of a text simplification algorithm using term familiarity on perception, understanding, learning, and information retention. Journal of medical Internet research 15, 7 (2013), e2569.

[25]

Margaret Li, Jason Weston, and Stephen Roller. 2019. Acute-eval: Improved dialogue evaluation with optimized questions and multi-turn comparisons. arXiv preprint arXiv:1909.03087 (2019).

[26]

Gary Marchionini. 2006. Exploratory search: from finding to understanding. Commun. ACM 49, 4 (2006), 41–46.

Digital Library

[27]

Long Ouyang, Jeffrey Wu, Xu Jiang, Diogo Almeida, Carroll Wainwright, Pamela Mishkin, Chong Zhang, Sandhini Agarwal, Katarina Slama, Alex Ray, 2022. Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 27730–27744.

[28]

Deepa Paranjpe. 2009. Learning document aboutness from implicit user feedback and document structure. In Proceedings of the 18th ACM conference on Information and knowledge management(CIKM ’09). 365–374.

Digital Library

[29]

Jayr Pereira, Robson Fidalgo, Roberto Lotufo, and Rodrigo Nogueira. 2023. Visconde: Multi-document QA with GPT-3 and Neural Reranking. In Advances in Information Retrieval: 45th European Conference on Information Retrieval, ECIR 2023, Dublin, Ireland, April 2–6, 2023, Proceedings, Part II(ECIR ’23’). 534–543.

Digital Library

[30]

Francesco Piccinno and Paolo Ferragina. 2014. From TagME to WAT: A new entity annotator. In Proceedings of the first international workshop on Entity recognition & disambiguation(ERD ’14). 55–62.

Digital Library

[31]

Marco Ponza, Paolo Ferragina, and Francesco Piccinno. 2019. Swat: A system for detecting salient Wikipedia entities in texts. Computational Intelligence 35, 4 (2019), 858–890.

[32]

Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J Liu, 2020. Exploring the limits of transfer learning with a unified text-to-text transformer.J. Mach. Learn. Res. 21, 140 (2020), 1–67.

[33]

Tetsuya Sakai. 2021. Graded relevance. In Evaluating Information Retrieval and Access Tasks. 1–20.

[34]

Kim Cheng Sheang and Horacio Saggion. 2021. Controllable Sentence Simplification with a Unified Text-to-Text Transfer Transformer. In Proceedings of the 14th International Conference on Natural Language Generation(INLG ’21). 341–352.

[35]

Idan Szpektor, Deborah Cohen, Gal Elidan, Michael Fink, Avinatan Hassidim, Orgad Keller, Sayali Kulkarni, Eran Ofek, Sagie Pudinsky, Asaf Revach, 2020. Dynamic composition for conversational domain exploration. In Proceedings of The Web Conference 2020(TheWebConf ’20). 872–883.

Digital Library

[36]

Svitlana Vakulenko, Nikos Voskarides, Zhucheng Tu, and Shayne Longpre. 2021. A comparison of question rewriting methods for conversational passage retrieval. In European Conference on Information Retrieval(ECIR ’21). 418–424.

Digital Library

[37]

Anthony J Viera, Joanne M Garrett, 2005. Understanding interobserver agreement: the kappa statistic. Family Medicine 37, 5 (2005), 360–363.

[38]

Ryen W White. 2016. Interactions with search systems. Cambridge University Press.

[39]

Chuan Wu, Evangelos Kanoulas, and Maarten de Rijke. 2020. It all starts with entities: A Salient entity topic model. Natural Language Engineering 26, 5 (2020), 531–549.

[40]

Shi Yu, Jiahua Liu, Jingqin Yang, Chenyan Xiong, Paul Bennett, Jianfeng Gao, and Zhiyuan Liu. 2020. Few-shot generative conversational query rewriting. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval(SIGIR ’20). 1933–1936.

Digital Library

[41]

Shi Yu, Zhenghao Liu, Chenyan Xiong, Tao Feng, and Zhiyuan Liu. 2021. Few-shot conversational dense retrieval. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR ’21). 829–838.

Digital Library

[42]

Hamed Zamani, Johanne R Trippas, Jeff Dalton, Filip Radlinski, 2023. Conversational information seeking. Foundations and Trends® in Information Retrieval 17, 3-4 (2023), 244–456.

Index Terms

Towards Self-Contained Answers: Entity-Based Answer Rewriting in Conversational Search
1. Information systems
  1. Information retrieval

Recommendations

Towards Faceted Search for Named Entity Queries
Advances in Web and Network Technologies, and Information Management

A considerable fraction of the web queries contain named entities. This, coupled with the fact that a proper name might refer to multiple entities, imposes the ever-increasing need that search engines handle efficiently named entity queries. In this ...
Concordance-Based Entity-Oriented Search
WI '07: Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence

We consider the problem of finding the relevant named entities in response to a search query over a given text corpus. Entity search can readily be used to augment conventional web search engines for a variety of applications. To assess the significance ...
Automated News Suggestions for Populating Wikipedia Entity Pages
CIKM '15: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management

Wikipedia entity pages are a valuable source of information for direct consumption and for knowledge-base construction, update and maintenance. Facts in these entity pages are typically supported by references. Recent studies show that as much as 20% of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

CHIIR '24: Proceedings of the 2024 Conference on Human Information Interaction and Retrieval

March 2024

481 pages

ISBN:9798400704345

DOI:10.1145/3627508

Editors:
Paul Clough
University of Sheffield Information School
,
Morgan Harvey
University of Sheffield Information School
,
Frank Hopfgartner
Universität Koblenz

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 March 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

CHIIR '24

CHIIR '24: 2024 ACM SIGIR Conference on Human Information Interaction and Retrieval

March 10 - 14, 2024

Sheffield, United Kingdom

Acceptance Rates

Overall Acceptance Rate 55 of 163 submissions, 34%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
133
Total Downloads

Downloads (Last 12 months)133
Downloads (Last 6 weeks)24

Reflects downloads up to 21 Sep 2024

Other Metrics

View Author Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Media

Figures

Other

Tables

View Table of Contents