Alasiry, Areej and Levene, Mark and Poulovassilis, Alexandra (2014) Mining named entities from search engine query logs. In: Almeida, A. and Bernardino, J. and Ferreira Gomes, E. (eds.) Proceedings of the 18th International Database Engineering & Applications Symposium. IDEAS '14. Porto, Portugal: Association for Computing Machinery, pp. 46-56. ISBN 9781450326278.
|
Text
10190.pdf - Published Version of Record Download (998kB) | Preview |
Abstract
We present a seed expansion based approach to classify named entities in web search queries. Previous approaches to this classification problem relied on contextual clues in the form of keywords surrounding a named entity in the query. Here we propose an alternative approach in the form of a Bag-of-Context-Words (BoCW) that is used to represent the context words as they appear in the snippets of the top search results for the query. This is particularly useful in the case where the query consists of only the named entity without any context words, since in the previous approaches no context is discovered. In order to construct the BoCW, we employ a novel algorithm, which iteratively expands a Class Vector that is created through expansion by gradually aggregating the BoCWs of similar named entities appearing in other queries. We provide comprehensive experimental evidence using a commercial query log showing that our approach is competitive with existing approaches.
Metadata
Item Type: | Book Section |
---|---|
Additional Information: | IDEAS '14 18th International Database Engineering & Applications Symposium - Porto, Portugal — 7th-9th July 2014 |
Keyword(s) / Subject(s): | named entity recognition, query logs |
School: | Birkbeck Faculties and Schools > Faculty of Science > School of Computing and Mathematical Sciences |
Research Centres and Institutes: | Innovation Management Research, Birkbeck Centre for, Bioinformatics, Bloomsbury Centre for (Closed), Birkbeck Knowledge Lab |
Depositing User: | Miss Areej Alasiry |
Date Deposited: | 17 Sep 2014 07:55 |
Last Modified: | 09 Aug 2023 12:35 |
URI: | https://eprints.bbk.ac.uk/id/eprint/10190 |
Statistics
Additional statistics are available via IRStats2.