???item.export.label??? ???item.export.type.endnote??? ???item.export.type.bibtex???

Please use this identifier to cite or link to this item: https://tede.ufam.edu.br/handle/tede/6831
???metadata.dc.type???: Tese
Title: Leveraging User Opinions for Product Catalog Enrichment
???metadata.dc.creator???: Melo, Tiago Eugenio de 
???metadata.dc.contributor.advisor1???: Soares, Altigran
???metadata.dc.contributor.referee1???: Soares, Altigran
???metadata.dc.contributor.referee2???: Moura, Edleno
???metadata.dc.contributor.referee3???: Calado, Pável
???metadata.dc.contributor.referee4???: Moreira, Viviane
???metadata.dc.contributor.referee5???: Barbosa, Luciano
???metadata.dc.description.resumo???: A large number of people post reviews on the products of all types, which are offered online. In these reviews, people express their opinion regarding these products and their features. Consequently, a large number of opinions are available, which can be a valuable source of knowledge for decision-making for manufacturers as well as customers. From these opinions, manufacturers can obtain immediate feedback to improve the quality of their products and customers are able to obtain assessments from reviews prior to purchasing a product. However, as it is common in many types of social media, the sheer volume of available reviews for each product normally exceeds the human processing capacity and can, thus, become a major barrier to its effective use. The question that now arises is how to structure opinions so that they can be effectively used by customers and manufacturers. Traditional methods of organizing a large number of product reviews aim at creating an opinion summary. However, these methods are inadequate to address customer queries on the most relevant product characteristics. In particular, in current methods, the opinions are arbitrarily clustered by aspect expressions, causing these clusters to not necessarily align with relevant product characteristics. We claim that the most important product characteristics for people are represented by the attributes of the product catalogs and the process of organizing opinions should be guided by the attributes of the product catalogs. Therefore, in this thesis, we formulated and investigated the following problem: enriching product catalogs with user opinions at the attribute granularity level as a new form of opinion summarization. Grouping opinions around the attributes of the product catalog also allows the catalog to be enriched with these opinions with the passage of time. To deal with this novel problem, in this thesis, we started by investigating the impacts of attributes of product catalogs on user opinions. In this investigation, we used a large collection of data. The experimental results indicate that user opinions are significantly influenced by product attributes. In addition, we presented a new approach comprising of two phases: opinion extraction and opinion mapping. Based on this approach, we developed two distinct methods. For the first method, named AspectLink, an unsupervised strategy has been adopted. For the second method, named OpinionLink, a supervised strategy has been adopted. To verify the effectiveness of the methods, an extensive experimental evaluation was conducted which demonstrated the effectiveness of the proposed methods. Furthermore, a bootstrapping strategy was proposed to train the classifiers of OpinionLink in order to reduce the dependence on training data. Finally, the supervised method was applied as a full pipeline, and the experimental results demonstrated the feasibility of using this method in real and large-scale applications. To properly evaluate the methods developed in this study, the experimental datasets, non-existent in the literature, were developed, which are available as another contribution. We also developed a practical application to showcase some proposal ideas in this thesis.
Abstract: A large number of people post reviews on the products of all types, which are offered online. In these reviews, people express their opinion regarding these products and their features. Consequently, a large number of opinions are available, which can be a valuable source of knowledge for decision-making for manufacturers as well as customers. From these opinions, manufacturers can obtain immediate feedback to improve the quality of their products and customers are able to obtain assessments from reviews prior to purchasing a product. However, as it is common in many types of social media, the sheer volume of available reviews for each product normally exceeds the human processing capacity and can, thus, become a major barrier to its effective use. The question that now arises is how to structure opinions so that they can be effectively used by customers and manufacturers. Traditional methods of organizing a large number of product reviews aim at creating an opinion summary. However, these methods are inadequate to address customer queries on the most relevant product characteristics. In particular, in current methods, the opinions are arbitrarily clustered by aspect expressions, causing these clusters to not necessarily align with relevant product characteristics. We claim that the most important product characteristics for people are represented by the attributes of the product catalogs and the process of organizing opinions should be guided by the attributes of the product catalogs. Therefore, in this thesis, we formulated and investigated the following problem: enriching product catalogs with user opinions at the attribute granularity level as a new form of opinion summarization. Grouping opinions around the attributes of the product catalog also allows the catalog to be enriched with these opinions with the passage of time. To deal with this novel problem, in this thesis, we started by investigating the impacts of attributes of product catalogs on user opinions. In this investigation, we used a large collection of data. The experimental results indicate that user opinions are significantly influenced by product attributes. In addition, we presented a new approach comprising of two phases: opinion extraction and opinion mapping. Based on this approach, we developed two distinct methods. For the first method, named AspectLink, an unsupervised strategy has been adopted. For the second method, named OpinionLink, a supervised strategy has been adopted. To verify the effectiveness of the methods, an extensive experimental evaluation was conducted which demonstrated the effectiveness of the proposed methods. Furthermore, a bootstrapping strategy was proposed to train the classifiers of OpinionLink in order to reduce the dependence on training data. Finally, the supervised method was applied as a full pipeline, and the experimental results demonstrated the feasibility of using this method in real and large-scale applications. To properly evaluate the methods developed in this study, the experimental datasets, non-existent in the literature, were developed, which are available as another contribution. We also developed a practical application to showcase some proposal ideas in this thesis.
Keywords: Sentiment Analysis
Opinion Mining
Product Catalog Enrichment
Aspect-Base Summarization
Online reviews
???metadata.dc.subject.cnpq???: CIÊNCIAS EXATAS E DA TERRA: CIÊNCIA DA COMPUTAÇÃO
Language: eng
???metadata.dc.publisher.country???: Brasil
Publisher: Universidade Federal do Amazonas
???metadata.dc.publisher.initials???: UFAM
???metadata.dc.publisher.department???: Instituto de Computação
???metadata.dc.publisher.program???: Programa de Pós-graduação em Informática
Citation: MELO, Tiago Eugênio de. Leveraging User Opinions for Product Catalog Enrichment. 2018. 88 f. Tese (Doutorado em Informática) - Universidade Federal do Amazonas, Manaus, 2018.
???metadata.dc.rights???: Acesso Aberto
URI: https://tede.ufam.edu.br/handle/tede/6831
Issue Date: 3-Dec-2018
Appears in Collections:Doutorado em Informática

Files in This Item:
File Description SizeFormat 
Tese_Tiago Melo_PPGI4.44 MBAdobe PDFThumbnail

Download/Open Preview


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.