An evaluation of multi-probe locality sensitive hashing for computing similarities over web-scale query logs

Show simple item record

dc.contributor.author Cormode, Graham
dc.contributor.author Dasgupta, Anirban
dc.contributor.author Goyal, Amit
dc.contributor.author Lee, Chi Hoon
dc.date.accessioned 2018-02-12T11:59:00Z
dc.date.available 2018-02-12T11:59:00Z
dc.date.issued 2018-01
dc.identifier.citation Cormode, Graham; Dasgupta, Anirban; Goyal, Amit and Lee, Chi Hoon, “An evaluation of multi-probe locality sensitive hashing for computing similarities over web-scale query logs”, PLOS ONE, DOI: 10.1371/journal.pone.0191175, vol. 13, no. 1, Jan. 2018. en_US
dc.identifier.issn
dc.identifier.issn 1932-6203
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/3445
dc.identifier.uri http://dx.doi.org/10.1371/journal.pone.0191175
dc.description.abstract Many modern applications of AI such as web search, mobile browsing, image processing, and natural language processing rely on finding similar items from a large database of complex objects. Due to the very large scale of data involved (e.g., users’ queries from commercial search engines), computing such near or nearest neighbors is a non-trivial task, as the computational cost grows significantly with the number of items. To address this challenge, we adopt Locality Sensitive Hashing (a.k.a, LSH) methods and evaluate four variants in a distributed computing environment (specifically, Hadoop). We identify several optimizations which improve performance, suitable for deployment in very large scale settings. The experimental results demonstrate our variants of LSH achieve the robust performance with better recall compared with “vanilla” LSH, even when using the same amount of space. en_US
dc.description.statementofresponsibility Graham Cormode;Anirban Dasgupta;Amit Goyal;Chi Hoon Lee
dc.format.extent Vol. 13, no. 1,
dc.language.iso en en_US
dc.publisher Public Library of Science en_US
dc.title An evaluation of multi-probe locality sensitive hashing for computing similarities over web-scale query logs en_US
dc.type Article en_US
dc.relation.journal PLOS ONE


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account