Optimizing BEVA with Two-Level Indexes

???item.export.label???

Please use this identifier to cite or link to this item: https://tede.ufam.edu.br/handle/tede/7840

Full metadata record

DC Field	Value	Language
dc.creator	Ferreira, Van Den Berg da Gama	-
dc.creator.Lattes	http://lattes.cnpq.br/9433225118102294	por
dc.contributor.advisor1	Moura, Edleno Silva de Moura	-
dc.contributor.advisor1Lattes	http://lattes.cnpq.br/4737852130924504	por
dc.contributor.referee1	Silva, Altigran Soares da	-
dc.contributor.referee1Lattes	http://lattes.cnpq.br/3405503472010994	por
dc.contributor.referee2	Rosa, Thierson Couto	-
dc.contributor.referee2Lattes	http://lattes.cnpq.br/4414718560764818	por
dc.embargo.liftdate	Período de Embargo: 22/07/2020 - 09/01/2023	por
dc.date.issued	2020-07-10	-
dc.identifier.citation	FERREIRA, Van Den Berg. Optimizing BEVA with Two-Level Indexes. 2020. 89 f. Dissertação (Mestrado em Informática) - Universidade Federal do Amazonas, Manaus, 2020.	por
dc.identifier.uri	https://tede.ufam.edu.br/handle/tede/7840	-
dc.description.resumo	Indisponível	por
dc.description.abstract	Query autocompletion is an important component of modern search systems that suggests possible queries at each user keystroke to complete the query based on the prefix already typed in the search box. One of the most adopted and successful data structures for query autocompletion is the TRIE which is used to index the possible query suggestions. The TRIE is traversed based on the search prefix typed by the user in order to select suggestions that match the prefix. The use of TRIEs requires a large amount of extra memory for processing queries, which may increase the cost for processing queries and may limit the number of query suggestions indexed. In this work we propose optimized alternative implementations of BEVA algorithm, currently the state-of-the-art in the literature for autocompletion, in order to achieve a reduction in its memory consumption while keeping it efficient in query processing times. First, we propose a novel strategy to build the TRIE, named level-at-a-time (laat), and compare its performance to the way TRIEs are usually built, the key-ata-time (kaat). In the kaat strategy the index is built in depth-ward direction and in the laat strategy the index is built in breadth-ward direction. We implemented the proposed ideas and experimented them with several datasets, where we show that laat strategy allows a significant speedup in query processing of BEVA, being up to four times faster, with improvements achieved specially in queries with high number of errors, which are the most expensive ones. Second, we study the use of two-level indexing and prefix processing approaches for query autocompletion also in BEVA method. Two-level approaches combine the use of indexes with sequential search in order to reduce memory requirements in search systems. In our study, we insert in the TRIE only part of each query inserted, and the leaf nodes reference to a set of queries where a sequential search is performed. We experimented two alternative ways of selecting the portion of each key that remains indexed in the first level and compare their performance. The two-level approach has shown to significantly reduce the memory requirements for storing the index with just a small variation in query processing times	eng
dc.format	application/pdf	*
dc.thumbnail.url	https://tede.ufam.edu.br//retrieve/39514/Reprodu%c3%a7%c3%a3o%20N%c3%a3o%20Autorizada.pdf.jpg	*
dc.thumbnail.url	https://tede.ufam.edu.br/retrieve/61940/Dissertacao_VanDenBerg_PPGI.pdf.jpg	*
dc.language	por	por
dc.publisher	Universidade Federal do Amazonas	por
dc.publisher.department	Instituto de Computação	por
dc.publisher.country	Brasil	por
dc.publisher.initials	UFAM	por
dc.publisher.program	Programa de Pós-graduação em Informática	por
dc.rights	Acesso Aberto	por
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	-
dc.subject.cnpq	CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO	por
dc.title	Optimizing BEVA with Two-Level Indexes	por
dc.title.alternative	Otimizando o BEVA com índices de dois níveis	por
dc.type	Dissertação	por
dc.subject.user	Query processing	eng
dc.subject.user	Error-tolerant	eng
dc.subject.user	Autocompletion	eng
dc.subject.user	Two level	eng
dc.subject.user	Trie	eng
dc.subject.user	Programação de sistemas (Computação)	por
dc.subject.user	Programas de computador - Precisão	por
Appears in Collections:	Mestrado em Informática

Files in This Item:

File	Description	Size	Format
Dissertacao_VanDenBerg_PPGI.pdf		3.17 MB	Adobe PDF	Download/Open Preview ×

Show simple item record Recommend this item

This item is licensed under a Creative Commons License

Universidade Federal do Amazonas