skip to main content
10.1145/1571941.1571945acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Context-aware query classification

Published: 19 July 2009 Publication History

Abstract

Understanding users'search intent expressed through their search queries is crucial to Web search and online advertisement. Web query classification (QC) has been widely studied for this purpose. Most previous QC algorithms classify individual queries without considering their context information. However, as exemplified by the well-known example on query "jaguar", many Web queries are short and ambiguous, whose real meanings are uncertain without the context information. In this paper, we incorporate context information into the problem of query classification by using conditional random field (CRF) models. In our approach, we use neighboring queries and their corresponding clicked URLs (Web pages) in search sessions as the context information. We perform extensive experiments on real world search logs and validate the effectiveness and effciency of our approach. We show that we can improve the F1 score by 52% as compared to other state-of-the-art baselines.

References

[1]
http://search.yahoo.com/dir.
[2]
http://www.google.com/dirhp?hl=en.
[3]
Angrawal, R., et al. Mining association rules between sets of items in large databases. In SIGMOD'93, pages 206--21 1993.
[4]
Beitzel, S., M., et al. Hourly analysis of a very large topically categorized web query log. In SIGIR'04, pages 321--328, 2004.
[5]
Beitzel, S., M., et al. Automatic web query classification using labeled and unlabeled training data. In SIGIR'05, pages 581--582, 2005.
[6]
Beitzel, S., M., et al. Improving automatic query classification via semi-supervised learning. In ICDM'05, pages 42--49, 2005.
[7]
Broder, A., Z. A taxonomy of web search. In SIGIR Forums pages 3---10, 2002.
[8]
Broder, A., Z., et al. Robust classification of rare queries using web Knowledge. In SIGIR'07, pages 231--238, 2007.
[9]
Cao, H., et al. Context-aware query suggestion by mining click-through and session data. In KDD'08, pages 875--88, 2008.
[10]
Cao, H., et al. Towards context-aware search by learning a very large variable length hidden markov model from search logs. To appear in WWW'09, 2009.
[11]
Cui, H., et al. Probabilistic query expansion using query logs. In WWW'02, pages 325--332, 2002.
[12]
Dai,H.,K., et al. Detecting online commercial intention (oci). In WWW'06, pages 829--837, 2006.
[13]
Fonseca, B.M., et al. Concept-based interactive query expansion. In CIKM'05, pages 696--703, 2005.
[14]
Guo, J.,F., et al. A unified and discriminative model for query refinement. In SIGIR'08, pages 379--386, 2008.
[15]
He, D., et al. Detecting session boundaries from Web user logs. In Proceedings of BCS-IRSG 22nd Annual Colloquium on Information Retrieval Research, pages 57--66, 2000.
[16]
He, Q., et al. Web query recommendation via sequential query prediction . To appear in ICDE'09, 2009.
[17]
Jones, R., et al. Beyond the session timeout: automatic hierarchical segmentation of search topics in query logs. In CIKM'08, pages 699--708, 2008.
[18]
Kang, I., et al. Query type classification for web document retrieval. In SIGIR'03, pages 64--71, 2003.
[19]
Lafferty, J.,D., et al. Conditional random fields: probabilistic models for segmenting and labeling sequence data. In ICML'01, pages 282--289, 2001.
[20]
Lee, U., et al. Automatic identification of user goals in web search. In WWW'05, pages 391--400, 2005.
[21]
Li, X., et al. Learning query intent from regularized click graphs. In SIGIR'08, pages 339--346, 2008.
[22]
Mccallum, A. Efficiently inducing features of conditional random fields. In UAI'03, pages 403--410, 2003.
[23]
Olsson, J.,S., et al. Cross-language text classification. In SIGIR'05, pages 645--646, 2005.
[24]
Rose, D.E., et al. Understanding user goals in web search. In WWW'04, pages 13--19, 2004.
[25]
Salton, G., et al. A vector space model for automatic indexing. In Communications of the ACM. vol. 18, pages 613--620, 1975.
[26]
Settles, B. Abner: an open source tool for automatically tagging genes, proteins, and other entity names in text. In Bioinformatics.21(14), pages 3191--3192, 2005.
[27]
Shen, D., et al. Building bridges for web query classification. In SIGIR'06, pages 131--138, 2006.
[28]
Sutton, C., et al. Dynamic conditional random fields: Factorized probabilistic models for labeling and segmenting sequence data. In ICML'04, 2004.
[29]
Wen, J., et al. Clustering user queries of a search engine. In WWW'01, pages 162--168, 2001.

Cited By

View all
  • (2024)Intent Identification by Semantically Analyzing the Search QueryModelling10.3390/modelling50100165:1(292-314)Online publication date: 22-Feb-2024
  • (2024)Hierarchical Query Classification in E-commerce SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648332(338-345)Online publication date: 13-May-2024
  • (2024)A Semi-supervised Multi-channel Graph Convolutional Network for Query Classification in E-commerceCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648302(56-64)Online publication date: 13-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
July 2009
896 pages
ISBN:9781605584836
DOI:10.1145/1571941
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. query classification
  2. search context

Qualifiers

  • Research-article

Conference

SIGIR '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)45
  • Downloads (Last 6 weeks)4
Reflects downloads up to 14 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Intent Identification by Semantically Analyzing the Search QueryModelling10.3390/modelling50100165:1(292-314)Online publication date: 22-Feb-2024
  • (2024)Hierarchical Query Classification in E-commerce SearchCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648332(338-345)Online publication date: 13-May-2024
  • (2024)A Semi-supervised Multi-channel Graph Convolutional Network for Query Classification in E-commerceCompanion Proceedings of the ACM Web Conference 202410.1145/3589335.3648302(56-64)Online publication date: 13-May-2024
  • (2023)Searching Online for Art and Culture: User Behavior AnalysisFuture Internet10.3390/fi1506021115:6(211)Online publication date: 11-Jun-2023
  • (2023)Investigating Online Art Search through Quantitative Behavioral Data and Machine Learning TechniquesAnalytics10.3390/analytics20200212:2(359-392)Online publication date: 26-Apr-2023
  • (2023)HierCat: Hierarchical Query Categorization from Weakly Supervised Data at Facebook MarketplaceCompanion Proceedings of the ACM Web Conference 202310.1145/3543873.3584622(331-335)Online publication date: 30-Apr-2023
  • (2023)Graph Enhanced BERT for Query UnderstandingProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval10.1145/3539618.3591845(3315-3319)Online publication date: 19-Jul-2023
  • (2023)SST: Semantic and Structural Transformers for Hierarchy-aware Language Models in E-commerce2023 IEEE International Conference on Big Data (BigData)10.1109/BigData59044.2023.10386079(838-846)Online publication date: 15-Dec-2023
  • (2023)The vision of on-demand architectural knowledge systems as a decision-making companionJournal of Systems and Software10.1016/j.jss.2022.111560198:COnline publication date: 1-Apr-2023
  • (2023)Knowing Before Seeing: Incorporating Post-retrieval Information into Pre-retrieval Query Intention ClassificationKnowledge Science, Engineering and Management10.1007/978-3-031-40286-9_1(3-15)Online publication date: 9-Aug-2023
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media