Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining

Kauffmann, Erick; Peral, Jesús; Gil, David; Ferrández, Antonio; Sellers, Ricardo; Mora, Higinio

doi:10.3390/su11154235

Open AccessArticle

Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining

¹

School of Industrial Engineering, University of Costa Rica, San José 11501-2060, Costa Rica

²

Department of Software and Computing Systems, University of Alicante, 03690 Alicante, Spain

³

Department of Computer Technology and Computation, University of Alicante, 03690 Alicante, Spain

⁴

Department of Marketing, University of Alicante, 03690 Alicante, Spain

^*

Authors to whom correspondence should be addressed.

Sustainability 2019, 11(15), 4235; https://0-doi-org.brum.beds.ac.uk/10.3390/su11154235

Submission received: 6 June 2019 / Revised: 29 July 2019 / Accepted: 29 July 2019 / Published: 5 August 2019

(This article belongs to the Special Issue Digital Marketing for Sustainable Growth: Business Models and Online Campaigns using Sustainable Strategies)

Download

Browse Figures

Versions Notes

Abstract

:

Companies have realized the importance of “big data” in creating a sustainable competitive advantage, and user-generated content (UGC) represents one of big data’s most important sources. From blogs to social media and online reviews, consumers generate a huge amount of brand-related information that has a decisive potential business value for marketing purposes. Particularly, we focus on online reviews that could have an influence on brand image and positioning. Within this context, and using the usual quantitative star score ratings, a recent stream of research has employed sentiment analysis (SA) tools to examine the textual content of reviews and categorize buyer opinions. Although many SA tools split comments into negative or positive, a review can contain phrases with different polarities because the user can have different sentiments about each feature of the product. Finding the polarity of each feature can be interesting for product managers and brand management. In this paper, we present a general framework that uses natural language processing (NLP) techniques, including sentiment analysis, text data mining, and clustering techniques, to obtain new scores based on consumer sentiments for different product features. The main contribution of our proposal is the combination of price and the aforementioned scores to define a new global score for the product, which allows us to obtain a ranking according to product features. Furthermore, the products can be classified according to their positive, neutral, or negative features (visualized on dashboards), helping consumers with their sustainable purchasing behavior. We proved the validity of our approach in a case study using big data extracted from Amazon online reviews (specifically cell phones), obtaining satisfactory and promising results. After the experimentation, we could conclude that our work is able to improve recommender systems by using positive, neutral, and negative customer opinions and by classifying customers based on their comments.

Keywords:

big data; sentiment analysis; marketing decisions; feature selection

1. Introduction

The long-term sustainability of companies depends, to a great extent, on their ability to properly meet customer needs. In fact, the aim of satisfying customers is to create brand value, which is a key factor for a company’s sustainability [1]. Accordingly, many companies invest huge amounts of money on marketing research to gather information about consumer preferences and demands. From this information, it is crucial to understand what consumers think about the products they buy in order to develop appropriate branding and positioning strategies. Reference [2] stated that a powerful brand name can influence the consumer decision-making process and can positively impact brand sustainability. Specifically, marketing managers need to know how a brand is perceived by its target market relative to other brands in the category and in relation to the most relevant attributes defined for its category. In fact, brand image is built through consumer opinions on specific product characteristics.

With global access to the internet, a large amount of data is generated, thereby providing a promising way to discover consumer opinion about products that are bought and experienced. Organizations want to take advantage of these data and convert them into relevant information that allows them to make better decisions, and this is possible by analyzing all available data (“big data”). Internet users collaborate daily in the generation of huge amounts of data, thereby becoming one of the most important sources of big data. By writing blogs, participating in social media, or reviewing products online, internet users are constantly generating content. Consumer comments in online forums have proven to be a useful source for revealing consumer insights [3], and this user-generated content (UGC) represents a promising alternative source for potentially identifying customer needs [4]. Thus, mining this UGC and analyzing the sentiments of the comments expressed by consumers might be useful for companies. Actually, many researchers highlight the importance of factoring in UGC to aid in decision-making in the marketing field. Particularly, brand management can be one area of interest, as online reviews might have an influence on brand image and brand positioning, including design decisions. In the same line, Fan et al. [5] argued that this type of analysis might help manufacturers not only to find out what consumer demands or requirements are, but also to facilitate the design of new products and the improvement of products already available on the market.

Within this context, sentiment analysis (SA) techniques are a useful way to examine opinionated text, which contains consumer opinions toward companies, products, brands, or events. SA is a subfield in natural language processing (NLP) that automatically classifies text through valence [6], extracting information from user opinions [7]. Certain techniques split the comments into two classes (negative or positive), and others incorporate more sentiment classes [8]. Generally, SA means a classification of the given text polarity at three levels: the document level, sentence level, or aspect level [7].

A more fine-grained view of the opinions expressed by consumers requires analysis at the sentence level [8]. Consumers frequently review products that have many features or attributes, and they usually have a different opinion about each of these aspects. While consumers may find some features of products appealing, other aspects may be disappointing. Thus, when a consumer provides feedback about different features of a product, classifying a single review as either positive or negative may overlook valuable information contained within it. For example, some of the features can be positively reviewed, whereas others can be negatively reviewed. A feature-based SA might offer a more detailed view of how consumers rate a product, which in the end will drive future behavior. Using this feature-based analysis, marketing managers can obtain valuable information about different features of the product that would not be detected if the sentiment was classified only in terms of the whole review [9].

In this paper, we focus on SA techniques as well as on the application of NLP tools to marketing decision-making. First, customer preferences according to a star score usually given by users in UGC and a sentiment score were analyzed. Subsequently, the positive, neutral, and negative sections of the review were split. Finally, the main features of the products that provoked positive, neutral, and negative feelings in clients were identified. We carried out (1) a global sentiment analysis in the review (document level) that allowed us to measure whether the product was liked by people, (2) an analysis of different phrases (sentence level) to find out what buyers liked and disliked about a product, and (3) an extraction of positive/neutral/negative product features (aspect level).

Our case study also extracted big data from Amazon online reviews, as in the case of other researchers (e.g., References [10] or [11]), given that Amazon is recognized as one of the most important online marketplaces to buy products. However, our proposal can be applied to different marketplaces (e.g., Flipkart, Snapdeal). These product reviews were written by buyers and were used by new potential consumers as a source of electronic word-of-mouth to make decisions on their own purchases. In this sense, brand image is derived not only from signals sent by companies, but also by online reviews written by consumers. Specifically, we focused on the cell phone category, an interesting case study given its impact on sustainability [12]. The shortened lifecycle and first world throwaway culture that affect these types of products are concerning, given that the conditions surrounding the extraction of necessary minerals, such as coltan use in batteries, often result in conflict and involve unfair labor practices and human rights abuses [13], such as in the issue of child soldiers [14]. Furthermore, the growing demand for coltan may result in adverse environmental impacts in mining regions [15]. Addressing consumer demands based on their priorities and opinions on a product might help to mitigate these problems and increase the sustainability of the whole sector involved in the manufacturing of electronic and mobile devices.

Regarding sustainability issues, in the critical review of Kemper et al. [16], the authors responded to the need to better understand the foundations of marketing worldviews with respect to sustainability. In Reference [17], the authors developed a framework that provides services marketing managers with a systematic and transparent means of enhancing sustainability performance through marketing functions.

In previous studies [18,19], we have found that the use of sentiment scores and the search for positive and negative product features help in making decisions. However, we have not found studies that have combined product price, the quantitative star score given by users, the sentiment score given by an SA tool in a global review, and the sentiment score given for each specific extracted feature to classify the best products by brand or category shown on dashboards. Due to the complexity of the online reviews, we made use of SA and text data mining techniques to improve the marketing decision-making process through a specific extraction and analysis of positive, neutral, and negative characteristics of reviewed products.

The major contribution of our research is to present a clear and efficient architecture divided into stages in order to address this complex scenario. In addition, the strength of this general approach, with its well-defined stages, rests in its applicability to other research areas.

The novelty of our work is to give a rating based on the qualification of the features and combine them with other scores that serve to classify products, including price and the global sentiment score. Consequently, our work develops a ranking for a product that classifies its features along with other indicators, such as price and the sentiment score of the review.

To conclude, the aims of the work in this paper were the following:

To use the mined product features and the polarity of consumer opinions about each feature to obtain a product score. Then, we combined price, star score, sentiment score of the review, and sentiment score based on product features to rank each product and to assist marketing managers and consumers in their decision-making processes;
To carry out a detailed analysis of the characteristics of the consumers’ reviews;
To improve recommender systems using positive, neutral, and negative customer feedback. In this way, the objective of “efficient” (or “sustainable”) purchases could be achieved.

This paper is organized as follows: Section 1 gives an introduction of the uses of SA and the value of consumer reviews in e-commerce for branding. Section 2 briefs related work presented in the literature about big data techniques applied to marketing, SA, and product feature selection methods. In Section 3, we show the proposed architecture. Section 4 details the data collection and tool setup and the experimentation and results. Finally, Section 5 gives the conclusion and ideas for future work.

2. Background

This paper deals with automatic SA and product features selection, the identification of the main characteristics of an analyzed product (e.g., the users’ opinions about the battery or screen of the Samsung S7 phone) in product reviews, and the benefits of applying big data techniques to marketing. Thus, we will summarize previous work on these topics. This section ends with an overview of the findings extracted from related work, which justifies our contribution to this state-of-the-art work.

Obtaining consumer opinions about a product or service is not an easy task. The traditional way is by means of a simple action consisting of scoring the purchase process experience, including the product or service itself. Scoring the product from 1 to 5 is commonly referred to as a star rating [20]. Moreover, price can be used as an indicator of customer preferences, and the best way to price a product is to know what consumers would be willing to pay [21].

However, these methods do not provide enough clarity and do not help customers to purchase products based on a specific feature [22]. In this way, the quality index of product features strongly influences consumer choices [23].

Given the current state of information technology, consumers can easily make online purchases and post reviews on social media. This user-generated content may be relied upon by potential customers, thereby influencing future purchasing decisions [24]. Everybody can easily share their opinions on companies, products, and services with other internet users, and potential customers can easily access these online reviews [25] in real time. It should be remembered that one of the factors consumers consider in their decision-making processes is word-of-mouth (WOM) [26]. Thus, measures of eWOM (electronic word-of-mouth) [27] or online reviews [28] can be included in marketing-mix models to provide better explanations and predictions of consumer choices and sales. These ratings and comments summarize individual consumer evaluations and act as indicators of product quality [29,30]. Furthermore, and even more importantly, they act as a cue to help future consumers determine product or brand attributes [31]. Such a large volume of constantly generated data is increasingly a big data challenge for businesses [32].

SA classifies product reviews as positive or negative or other sentiment classes [8]: polarity classification is the basic task. A recent stream of research focused on applications that are more specialized. One such application is to use opinion mining to determine areas of a product that need to be improved by summarizing product reviews to see what parts of the product are generally considered good or bad by users [33]. The general opinion about a topic is useful, but it is also important to detect sentiments about individual aspects of the topic [34]. Furthermore, it is possible to classify customers based on their opinions or improve recommender systems using positive and negative customer feedback.

To analyze all these textual data on reviews, SA can be used. SA in product reviews is the process of exploring these reviews to determine the overall opinion or feeling about a product [35]. This information is unstructured and is not something that is “machine processable” [36]. Cambria also exposed that the challenge is huge because an understanding of the explicit and implicit, regular and irregular, and syntactical and semantic language rules is necessary. SA is an avidly researched field with a large number of papers that have been summarized in abundant surveys that have tried to present an overview of the applied techniques and algorithms (e.g., References [37,38,39,40,41]). Jandail [33] showed six types of issues in SA: (1) opposite meanings in particular domains; (2) an interrogative sentence or conditional sentence may not have positive or negative sentiments and may have other sentiment classes (e.g., a neutral sentiments); (3) sarcastic sentences may have the opposite sentiment; (4) sentiment information without using sentiment words; (5) a word can change the feeling polarization in two similar sentences, as well as the fact that for a different person, a sentence may have a different sentiment; (6) natural language semantics may change according to the geographical, cultural, or temporal context. Some researchers have focused on these particular issues [19,42,43,44,45,46,47,48].

Cambria et al. [49] classified the main existing approaches into four categories: keyword spotting, lexical affinity, statistical methods, and concept-based approaches. The keyword spotting approach classifies text by affect categories based on the presence of unambiguous affect words. The lexical affinity approach detects obvious affect words and assigns arbitrary words a probable “affinity” to particular emotions. The statistical methods include Bayesian inference and supervised vector machine (SVM), which is popular for affect text classification. It uses machine-learning algorithms that are trained with a large corpus of affectively annotated texts, and the system learns the affective valence of keywords. The concept-based approaches use web ontologies or semantic networks to accomplish semantic text analysis. All these techniques need to use a sentiment lexicon.

Many studies have focused on analyzing product reviews to get feedback on a product for decision-making purposes. García-Moya et al. [50] proposed a new methodology based on language models in order to facilitate the portability of the proposal to new domains and languages for the retrieval of product features and opinions from a collection of free-text customer reviews about a product or service. In addition, Singla et al. [51] classified text as positive, neutral, or negative, although different emotions were also considered (e.g., anger, anticipation, disgust, fear, joy, sadness, surprise, and trust, as well as the traditional positive, negative, and neutral). Paknejad [52] studied different machine learning approaches to determine the best options for sentiment classification problems for online reviews using product reviews from Amazon. Abbasi et al. [53] used SVM classifiers for SA with several univariate and multivariate methods for feature selection, reaching 85%–88% accuracy after using the chi-squared method for selecting the relevant attributes in text. A network-based feature selection method, that is, feature relation networks (FRNs), helped to improve the performance of the classifier. Saura et al. [54] identified key factors in UGC for the creation of successful start-ups by analyzing sentiments with an SVM. This method was applied to identify the start-up topics via the polarity sentiment.

There are several methods that have been used in feature selection, where some are syntactic, based on the syntactic position of the word (such as adjectives); some are univariate, based on each feature’s relation to a specific category; and some are multivariate, based on features subsets [35]. Archak et al. [55] used techniques that decompose reviews into segments that evaluate the individual characteristics of a product (e.g., the image quality and battery life of a digital camera). Then, as a major contribution, the authors adapted methods from the econometrics literature, specifically the hedonic regression concept. As mentioned by Chi et al. [56], existing feature selection techniques compute feature scores solely based on training data statistics or by modifying a specific feature metric formula to include test data information that cannot be generalized to other types of feature metrics, and they proposed combining both techniques (i.e., the training dataset and the feature metric formula). Mars and Gouider [57] proposed a big data architecture for decision-making, analyzing data and extracting customer opinions about product features. The architecture uses, among other techniques, machine learning, NLP, and big data. To detect the features, they used an ontology that covers features and characteristics of mobile phones in general and in other specific technical terms of electronic products and then extracted feature opinions based on the MapReduced programming model. The feature frequency is most widely used for feature weighting, and there have been many related studies [58,59]. Wang et al. [60] investigated the relevancy between the clustered features and the class in assigning the weights, proposing a method to reduce the size of features by removing irrelevant ones. Zhou et al. [61] proposed a feature selection approach based on the document frequency of segmented term frequency to eliminate redundant features and retain words with strong class distinguishing ability.

Currently, deep learning approaches have achieved very high performance across many diverse NLP tasks. In Reference [62], the authors reviewed meaningful deep learning-related paradigms as well as approaches used for NLP tasks. Furthermore, they also supplied an evolution rehearsal. Regarding SA, current studies [63] exist that have highlighted the importance of extensive phrases and how (in these cases) supervised training, assessment resources, and more powerful models are required.

We conclude this section by emphasizing that our contribution to this state-of-the-art work is not focused on SA techniques but on a combination of quantitative scores given by users, SA scores in a global review, and SA scores on individual characteristics extracted from product reviews (e.g., the positive, neutral, or negative user opinions expressed about product features) in order to assist marketing managers and consumers in their decision-making processes. This combination has not been found in previous works. Our work developed a ranking for the product that classifies its features along with other indicators, such as the price and the sentimental score of the review.

3. The Proposed Methodology Architecture

In this section, we present our proposal (based on the use of SA tools and product feature detection), which focuses on a detailed analysis of the characteristics of consumer reviews. Figure 1 illustrates the proposal.

As shown in the figure, we distinguished seven stages: (1) data collection, (2) review preprocessing using NLP techniques, (3) product feature selection, (4) sentiment analysis, (5) clustering features, (6) new scores, and (7) dashboards.

As previously mentioned, our architecture allows for an analysis of reviews at different levels: (1) In stage 4, SA, the score of the whole review is obtained (document level). The objective is to calculate a global score of the product that measures whether it is liked by people. (2) Moreover, in stage 4, the score for each sentence is calculated (sentence level). The goal is to have a sentence score to find out what buyers like and dislike about the product. (3) Finally, in stage 5, clustering features, the score for each feature is obtained (aspect level) in order to say which ones are positive, neutral, or negative.

Next, we will explain in detail the different stages.

3.1. Data Collection Stage

We had a corpus of product reviews as well as relevant information about each product (for example, the price, the brand, and the categories into which the product is classified). Our proposal analyzes these data and discovers new information that will help managers and users to make decisions regarding the products.

These reviews usually contain an explicit star score assigned by the reviewer, ranging from 1 (bad) to 5 (very good) and a comment in unstructured text. This numerical score is global, relating to the product or the experience of using it, even though the user may not like some specific product characteristics. For instance, the user may qualify a product with 4 or 5 stars, but he/she criticizes some aspect of it. The textual comment has positive, neutral, and negative opinions related to different aspects of the product.

3.2. Review Preprocessing Using NLP Techniques

Using the textual reviews, NLP preprocessing is done. This consists of lexical, syntactic, and semantic analyses. The result of this preprocessing is a tagged word list with part-of-speech (POS) tags (lexical information) and semantic information of the different words. The use of an NLP preprocessing stage has been common in previous works to enrich the input information of general frameworks (for instance, the approaches of Mora et al. [64] and Peral et al. [65]).

We proceed in the following way. The words are enriched with their POS tag and syntactic information obtained from the NLP tools. For instance, Freeling [66], Standford CoreNLP [67], or Treetagger [68] may be used. Furthermore, sentiment information is added to the words by means of specialized lexicons. The sentiment lexicon Afinn [69], an affective word list manually rated between −5 (more negative) and +5 (more positive), may be used to assign a sentiment value to the different words. All of this information is used to calculate the sentiment of a review or sentence in a subsequent stage (SA stage).

In addition, we select the main product features based on the product descriptions. The product descriptions highlight their main features. The NLP tools previously mentioned are used to obtain lexical, syntactic, and semantic information from the product descriptions in order to select the most frequently used nouns and adjectives of these products. With this information, we match selected product features and sentiment scores to rank the positive, neutral, and negative elements. With this ranking, we show relevant information in dashboards about the best products, based on the scored features.

3.3. Product Feature Selection Stage

The identification of the product features was partially based on the methods used by Archak et al. [55]. These authors used a part-of-speech tagger to annotate each review word with its POS tag, identifying whether the word was a noun, an adjective, a verb, and so on. Nouns and noun phrases are popular candidates for product features, though other constructs (such as verb phrases) can be used as well. Alternative techniques search for statistical patterns in the text, e.g., words and phrases that appear frequently in the reviews. In our experimentation, we used a domain ontology to detect the main product features. We built an ontology on cell phones/mobile phones. This is a list of the principal features of the topic in analysis, in this case, the topic cell phone. Each product has metadata with its product description. In this description, the seller highlights the main features of his own product. Then, we mined (from the product descriptions) the most frequent nouns. This list was filtered to remove irrelevant words. For each review, we selected the features according to the principal feature list. Other methods, such as the latent Dirichlet allocation (LDA) model presented in the work of Saura et al. [54], can be used to extract the main features.

3.4. Sentiment Analysis Stage

Two scores are calculated in this stage: (a) a global sentiment score for each review and (b) a specific sentiment score for each main feature of the product. In our experimentation, we used the abovementioned affective lexicon Afinn, with 2476 rated words.

The two mentioned scores are obtained using the textual comments of product reviews. In our approach, the following algorithm is applied, which carries out the following tasks: (a) calculating a global sentiment score for each review; (b) splitting the review into phrases and calculating a phrase sentiment score; (c) selecting the main features in all of the reviews of each product; (d) calculating the sentiment score for each main feature that is included in the sentence and selected in the previous task; (e) classifying the opinions as positive, neutral, and negative; and (f) a dashboard display for decision-making. The algorithm will be explained in detail in the case study section. With our proposal, the following analysis can be obtained: the best products of a category, the best products based on particular features, the best features of a product, or word clouds for positive and negative opinions of a product.

The sentiment score of a product is the arithmetic mean of the sentiment scores of all product reviews. The feature-based sentiment score of a product is the arithmetic mean of the sentiment scores of product features. The sentiment score is used as an additional criterion to search for the best products within a product category or within a brand. In addition, the sentiment score of the features is used to determine which product is the best according to the specific attributes of the product.

It is important to mention that our proposal allows for the use of different NLP tools (the abovementioned Freeling, CoreNLP, or Treetagger) to do the preprocessing and other tools to calculate the sentiment scores that provide a sentiment rating for a given sentence (such as CoreNLP, OpeNER [70], or the GPLSI system [71]).

3.5. Clustering Features Stage

To evaluate the sentiment polarity of a product feature, the sentiment score of each phrase in which the feature appears is evaluated, and the average of these scores is calculated (Equation (1)). This is done for each product feature. The features are classified as having a positive, neutral, or negative score:

s e n t i m e n t_s c o r e (f e a t u r e) = \frac{\sum_{p \in P} s e n t i m e n t_s c o r e (p)}{|P|},

(1)

where P =

{p | p i s a p h r a s e o f r e v i e w ⋀

feature in p}.

3.6. New Score Stage

In this stage, two new scores are calculated: the product feature-based score and the product global score. First, the feature-based score is calculated, a new sentiment score of a product averaging the sentiment score of all features of the product, as seen in Equation (2):

f e a t u r e_b a s e d_s c o r e (p r o d u c t) = \frac{\sum_{f \in F} s e n t i m e n t_s c o r e (f)}{|F|},

(2)

where F =

{f | f i s a f e a t u r e o f p r o d u c t}

.

Second, the new feature-based score for a product, which we call the feature sentiment score (FSS), is combined with the price, star score, and review sentiment score (which we call RSS) to calculate a global score for a product (Equation (3)). We assigned a weighting to each variable, considering the following: the most important element for the consumer is the product price, followed by the sentiment score, RSS, and FSS, whose weightings are greater than the star score. For this case study, after several tests, we selected the following weightings: 0.3 for price, 0.25 for the RSS and FSS, and 0.2 for the star score. Since the range of these variables varies widely, they are normalized by using the maximum and minimum score in the same category. The three normalized scores are done using the same NormalizedScore(product) formula (min–max normalization method):

GlobalScore(product) = NormalizedPrice(product) * 0.3 + NormalizedStarScore(product) * 0.2 + NormalizedSentimentScore(product) * 0.25 NormalizedFeatureSentimentScore(product) * 0.25 NormalizedPrice(product) = \frac{(M a x P r i c e - P r i c e + 1)}{M a x P r i c e} NormalizedScore(product) = \frac{S c o r e - M i n S c o r e}{(M a x S c o r e - M i n S c o r e)} .

(3)

3.7. Dashboards Stage

In the last stage, the extracted data are shown to the users. There are many possible dashboards available, such as word clouds. These dashboards are very advantageous and have indications of positive/negative features about a product or even the ranking of the top products using the global score. These dashboards are especially appealing for companies in terms of following up on the evolution of their products according to consumer reviews, which can be deeply analyzed.

4. Case Study

In this section, we show the application of our proposal in helping with marketing decision-making. In the first subsection, a data description is shown. The second subsection explains the experimentation that was carried out.

4.1. Data Description

To analyze our proposal, we used big data from amazon reviews. For this research, a dataset was used with product reviews and metadata from Amazon compiled between May 1996 and July 2014. The dataset was retrieved from “Amazon Product Data by Julian McAuley” [72], at http://jmcauley.ucsd.edu/data/amazon/ (visited on 15 March 2019). Specifically, we used the corpus about “Cell Phones and Accessories” and particularly the category “Cell Phone”. The case study addressed the need to examine the sustainability of the mobile phone sector [12]. This is due to the fact that the manufacturing of mobile phones with a short lifecycle has increased sales of these electronic goods [12], which in turn has produced a negative environmental impact due to the corresponding increase in demand for resources such as coltan, as previously mentioned.

Each review included a star score in the range [1,5] and a comment given by the user. They had the following structure, shown with the following sample:

{

“reviewerID”: “A6FGO4TBZ3QFZ”,

“asin”: “3998899561”,

“reviewerName”: “…”,

“helpful”: [1,2],

“reviewText”: “it worked for the first week then it only charge my phone to 20%. it is a waste of money.”,

“overall”: 1.0,

“summary”: “not a good Idea”,

“unixReviewTime”: 1384992000,

“reviewTime”: “11 21, 2013”

}.

We carried out a semiautomatic process for the construction of a mobile feature ontology to identify relevant product features. First, we manually analyzed some product descriptions on product metadata for all cell phone products, and we deduced that these descriptions highlighted the principal features about cell phones. Then, we automatically preprocessed these descriptions using the open source language analysis tool Freeling [66] to obtain nouns. The most frequent nouns were selected, and some nonthematic words were removed. We obtained 200 words between features and significant thematic words. In this regard, we focused specifically on a detailed analysis of the characteristics of consumer reviews. Additionally, we consulted specific dictionaries, a thesaurus, and some research where mobile ontologies were built [73,74], comparing entities and adding some words that were not found automatically. Finally, we manually identified the relationships between the discovered features.

To obtain a quantitative sentiment score expressed in the review’s text (RSS), we studied two tools. The first one used the approach adopted by other research [75,76]. It consisted of different stages: (a) splitting the review text into individual words; (b) removing the words that belonged to a stop word list; (c) searching the remaining words in the sentiment lexicon Afinn [69]; (d) if the word was in the lexicon Afinn, adding the emotional rating of this word; and (e) averaging all of these emotional ratings to get the review sentiment. Equation (4) shows the overall mathematical formula for obtaining the sentiment value of a review in the range [−5,5] using the mentioned approach. The second tool was the NLP tool Standford CoreNLP (https://stanfordnlp.github.io/CoreNLP/, visited on 15 March 2019). It provides a set of human language technology tools [67], including SA. With this tool, a sentence classification of Very Negative (=1), Negative, Neutral, Positive, or Very Positive (=5) was obtained. Equation (4) is

s e n t i m e n t (R e v i e w) = \frac{\sum_{w \in R}^{} e m o t i o n a l_r a t i n g (w)}{|R|},

(4)

where

R = R e v i e w_W o r d s \cap A f i n n_W o r d s

.

In our case study, we implemented the first approach using R Language (https://www.r-project.org/about.html, visited on 15 March 2019). Subsequently, we scored the reviews and the products, and we also deduced which were the main positive, neutral, and negative features of a product. CoreNLP is an alternative tool that may be used in our modular architecture.

Due to the fact that the feeling score was in the range of −5 to 5 and the star score was between 1 and 5, we had to normalize both scores in order to compare them. After some experimentation, we concluded that these values were adequate to map the values of sentiment analysis and star rating by the user (see Equation (5)):

n o r m (x) = {\begin{matrix} 5, & x \geq 3 \\ 4, & 3 < x \leq 1 \\ 3, & 1 < x \leq - 0.5 \\ 2, & 0.5 < x \leq - 3 \\ 1, & x < - 3 \end{matrix},

(5)

where

x = r e v i e w s e n t i m e n t i n [- 5, 5] c a l c u l a t e d i n F o r m u l a (4)

.

Here is an analysis of a sample review text of the product “B000W09N9W” that was extracted from the corpus: “… This cell phone has passed the proof of time under really really tough conditions. ….Great things: 1. Signal: Superb, I have had many cel phones before, including Nokia which I thing it has a great signal, but HTC TYTN II has much better signal. This one sustains signal in elevators while my nokia can’t…. 3. Screen: Touch screen works really well. Tilting (40 degrees) screen is nice and comfortable to work with when you are writting over the table. 4. Sliding QWERTY keyboard is the main reason to buy it for us who don’t like front keyboards, this makes the phone a little bulky but is great. 5. Plenty of buttons: Has plenty of buttons that make it easy to operate. The 360 degree 3 way jog wheel paired with oK button (left side) is fantastic, great option to operate the phone while you are driving. 6. Setting e-mail/sms accounts was really easy and fast…..Good things: 1. Processor: 400 Mhz, works oK, it is not super-fast but certainly it is not slow. Phone turns on fast (less than 1 minute to operate). 2. Platform: Windows mobile 6 is good. Until date I have had to re-start the phone 3 or 4 times due to system fail (unable to detect end call), besides this it has worked well. 3. Camera 3 mega-pixels: Has good definition, works precisely. 4. HTC Home screen is nice, very interactive. … Not so Good: 1. Camera: Does not has flash, so don’t expect to get good insides pictures. 2. Battery: Weak point, don’t expect your battery to last more than 24 hours, and much less if you use it heavily. Requires car charger, charge through USB. 3. Speaker: It is not so loud…”.

In this review, we found positive features such as signal, screen, processor, platform, keyboard, and camera. In addition, negative features such as battery were identified. The sentiment score in the phrase about the camera was 3, and the keyboard was 2.5. Both were positive sentiments. The features camera and keyboard, therefore, were classified as positive features. The sentiment score in the phrase about the battery was −2. This was a negative sentiment. The battery feature, subsequently, was classified as a negative feature.

In what follows, we apply the algorithm proposed in the SA stage of our architecture (Section 3) to our example:

(a): The RSS for each review was calculated (document level). For this sample review, the RSS was 1.6, which was a positive review. The RSS for this review was close to the average of the RSS scores for all of the reviews of the same product (the RSSes of the products). It was 1.21, which was also a positive score;
(b): The reviews were split into phrases, and a phrase sentiment score was calculated (sentence level). This particular review was separated into 44 sentences. For example, a positive sentence was, “Signal: Superb I have had many cel phones before including Nokia which I thing it has a great signal but HTC_TYTN_II has much better signal.” This had a phrase sentiment score of 3.33 using Equation (4). Another example was a negative sentence: “Battery: Weak point do not expect your battery to last more than 24 h and much less if you use it heavily.” This obtained a phrase sentiment score of −2 with Equation (4);
(c): The main features in all of the reviews of each product were selected. Finally, a total of 200 features were collected. These lists were filtered, and the most “important” were selected. The selection and filtering of the main product features were made using the abovementioned mobile feature ontology;
(d): For each product, a sentiment score for each feature was assigned based on the phrase sentiment score each feature had. In this particular product, “B000W09N9W”, a total of 51 features were collected, of which 50 were positive. The feature “battery” was classified as a negative feature. The FSS of this product was 1.24;
(e): The opinions were classified into positive and negative;
(f): Dashboards for decision-making were shown.

To help to make decisions, we generated some graphics to summarize valuable information about products and their features.

4.2. Experimentation

The original corpus was comprised of 600,000 reviews. We specifically filtered the reviews in the category cell phone, and 44,460 reviews were selected. Some of these reviews were irrelevant, as they did not mention any feature about the product itself (they were reviews with general opinions or comments about the service or the seller). These were removed, and 36,452 reviews with 330,330 sentences were kept. SA was applied to all cell phone reviews to obtain their sentiment values. Most of the reviews obtained a star score of 5 and an RSS of 4 (see left-hand side chart of Figure 2, representing the overall RSS score for all products and comparing it to the review’s star score). In addition, we calculated the RSS score by product and compared it to the star score. Most products received a star score of 4, whereas most products received an RSS of 3 (see right-hand side chart of Figure 2, representing the proximity of the sentiment score to the star score for each product).

After the experiment, we could conclude that although in general terms the phones were good products or experiences with the product were very good, there were specific details that were criticized, and therefore some negative comments were mentioned. The same occurred for those products that were given a poor RSS with some positive features. These comparisons were made to analyze the correlation between the different scores, showing that there were indeed some differences that could be explained by customer satisfaction with the product as a whole or with parts of the product.

It is important to know that there were products that had a good star score but where the consumer’s opinion was significantly different because these products probably had important features to be analyzed. We included Table 1, which shows a list of products where the star score and the RSS were very distinct. Then, we selected two specific products to see the detailed features that provoked a significant difference.

For example, regarding the product B0012SK0F4, the table shows that the star score and the RSS of the product were very different. Subsequently, we analyzed the sentiment score of the main product features to identify likes and dislikes. With respect to the product B0012SK0F4, the feature “battery” was bad. With the product B002RD07EW, the problem was the “camera”.

In addition, we compared the RSS to the FSS of products. With this comparison, we wanted to show the difference between customer global opinion and specific opinions about features of the product. In Figure 3, we show the differences between these scores.

As Figure 3 demonstrates, there was a high correlation between both scores. After analyzing in detail the results obtained, we verified that in the case of a positive RSS, the FSS score was slightly lower because it factored in negative sentiments detected for some product features. It was also proven in the experimentation that the inverse was detected in the case of a negative RSS, where the FSS score was slightly higher because it factored in positive sentiments detected for some product features.

After an analysis of the results of 2085 products, there were 425 products with different scores: 173 products had better FSSes than RSSes, and 252 products had better RSSes than FSSes.

Finally, we wanted to use together the four measures (the new product global score defined in Equation (3)) to evaluate customer satisfaction with a product: Not just the price or the general feeling about a product, but also the feeling about the characteristics of the product. With this new measure, we were able to rank the products and determine the best recommendations for cell phone products. As previously mentioned, we combined four weighted variables to obtain the global score of the product: (1) the normalized price, (2) the normalized star score, (3) the normalized RSS, and (4) the normalized FSS. Then we used the global score to specify the best products. Table 2 shows an example for the category “Cell Phone”.

It is important to highlight that the most expensive product was not necessarily the best evaluated, and the best evaluated in stars was not necessarily the best evaluated globally. In this case, the best-scored product was not the most expensive, but it had the best RSS and FSS (4.0). Furthermore, it can be seen that the fifth product was the most expensive, but its RSS and FSS were low.

The clustering of features allowed for some products to be selected, e.g., to discover the best products with a selected feature. In addition, we could find the positive and negative features of a product, or we could obtain the main features that were the most interesting to consumers.

For example, if a customer was interested in a cell phone with a very good camera, we could search the top products with the best sentiment score of the specific feature “camera” (Equation (1)). In Table 3, we show the five best-scored products based on the feature sentiment score where the specific feature was “camera”.

Furthermore, we could explore the products that had positive and negative features. This search can help managers pay attention to negative features to improve them or highlight positive features. Besides, it provides valuable information in analyzing the positioning of different products on the market. In our experimentation, an example of positive and negative features occurred with the product “B000W09N9W” after an analysis of all of the reviews for this specific product. In that case, we found positive features for “processor”, “signal”, “camera”, and “Bluetooth” and negative features for “battery” and “touchscreen”.

We mined the most-mentioned features in the cell phone category in all of the reviews. The five most-mentioned ones were the following: screen, battery, camera, keyboard, and pictures. Then, considering these main features, we could search for products with some positive and negative features. For instance, we were interested in products that received positive reviews for the camera and negative ones related to the battery. We found the following products: B000W09N9W, B000E7T7JO, B000M3S6ME, B000P6CEYE, B001IWO6IQ, B003EEME8A, B000BYGNUQ, and B001CJNHLC.

After gathering the positive and negative features, we used word cloud dashboards to clearly show the main liked and disliked features of all of the products analyzed in the case study. These dashboards showed in a graphical way the detailed analysis that obtained the main features of the consumer reviews. For example, the product “B0000AGRYX” had a lot of positive and negative features that we can show using word clouds. We can see that the main positive feature mentioned was the camera and that the main negative feature mentioned was the button (Figure 4).

5. Conclusions

A company has a sustainable competitive advantage when it creates more economic value than a marginal firm in its industry and when other firms are unable to duplicate the benefits of this strategy [77]. In this sense, Kiron et al. [78] identified big data analytics as a significant dimension to explore for achieving a sustainable competitive advantage. In the same line, Dolores et al. [79] highlighted that activities aimed at building a firm’s intangible value create business benefits and sustainability.

Within this context, the approach presented in this study focuses on the importance of managers in factoring in consumer opinions expressed on the internet to gain a better understanding of consumer needs and demands. In fact, new technologies are transforming how companies do business by improving the lifecycle management of their products [80]. Particularly, the internet offers a promising way to reveal consumer opinions on the products they buy as well as the whole consumer experience. The continuous stream of UGC over time provides a huge amount of information that can be employed to help decision-making in the marketing field.

This paper proposes a further step in the application of sentiment analysis to decision-making by adding product feature selection and calculating additional information from online reviews. By mining the text of user-generated online reviews and increasing the granularity of the analysis at a sentence level, our proposal would help managers and users to make better decisions in analyzing their products and purchases.

Our proposal focuses on SA techniques and on the application of NLP tools to marketing decision-making. Customer preferences according to star and sentiment scores are analyzed. Subsequently, the negative and positive features of the reviewed products are identified. We reviewed some algorithms to detect the features of products, and they were adapted to improve the information that is shown in the dashboards. An analysis of consumer perceptions reveals what products feature differentiated positions in the minds of consumers. Moreover, the employment of word clouds allows us to visualize the most important features mentioned by consumers.

We can carry out (1) a global sentiment analysis in the review (document level) that allows us to measure whether the product is liked by people, (2) an analysis of different phrases (sentence level) to find out what buyers like and dislike about a product, and (3) an extraction of positive/negative product features (aspect level).

We put into practice the proposal on a corpus of reviews of cell phone products extracted from Amazon. Our analysis showed that star scores represented a general user’s viewpoint about the product, but that in a review the user could highlight specific features that he/she liked or disliked. It should be recalled that customer preferences for a product are driven by different product features and the value that he/she attaches to these attributes [81]. Thus, analyzing a consumer’s opinion in relation to these specific features is of great interest for managers. It is consumer perception of a brand that determines the value the brand has in terms of these attributes.

This procedure will help managers to better understand the positioning of the products of their firm. In fact, the market structure analysis begins with a predetermined set of attributes and their underlying dimensions and assumes that consumers differ in their evaluations of product attributes [81,82]. With the methodology employed in this paper, we show the opinion of consumers in relation to the main features of a product. These consumer perceptions reflect the similarities and differences between products and brands, helping marketing managers depict the market structure and brand positioning. Furthermore, this framework could be employed to diminish the eco-footprint of the mobile sector and increase its long-term sustainability. In fact, if marketing managers understand what consumers think about the products they buy and experience, they will have the necessary information to increase customer satisfaction in relation to a product. Eventually, this could increase the lifecycle of mobile phones and reduce the consumption of the raw materials needed to produce these products.

The main contributions of our research are the following: (1) a definition of a general architecture using sentiment analysis and text data mining to identify a product’s main positive/negative features; (2) a combination of price, star score, sentiment score of a review, and sentiment score based on product features to rank each product and assist marketing managers and consumers in their decision-making processes; (3) an analysis of the characteristics of a consumer’s review using visualization techniques such as dashboards.

We propose that future work consider using other types of corpora to validate our proposal. Furthermore, we must consider n-gram features and the positive as well as negative qualifiers of features. This study had some specific limitations. First, the sentiment analysis was performed with the Afinn lexicon. Despite this lexicon being well recognized in this field, a different sentiment analysis tool could obtain different results. Second, the main features of the product were manually derived from product descriptions and processed with the open source language analysis tool Freeling. Although these features were double-checked and described the main characteristics of the product, a different procedure could obtain a different ontology for these descriptions.

Author Contributions

E.K. made the following contributions to the study: performing a literature review, conceiving of and designing the study, acquiring and preprocessing the data, analyzing and interpreting the data, and drafting the article. D.G. and J.P. made the following contributions to the study: critical revision, the conception and design of the study, analyses and interpretation of the data, drafting of the article, and approval of the submitted version. A.F. and H.M. contributed the following: the conception and design of the study, drafting of the article, and approval of the submitted version. R.S. performed a literature review, conceived of and designed the study, analyzed and interpreted the data, and drafted the article. All authors read and approved the final manuscript.

Funding

This study was partially funded by the ECLIPSE-UA (RTI2018-094283-B-C32) and RESCATA (TIN2015-65100-R) projects of the Spanish Ministry of Economy and Competitiveness (MINECO) and PROMETEO/2018/089. This work was partially funded by the Spanish Ministry of Economy and Competitiveness (MINECO) under the grant Project CloudDriver4Industry TIN2017-89266-R and by the Conselleria de Educación, Investigación, Cultura y Deporte of the Community of Valencia, Spain, within the program of support for research under project AICO/2017/134.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Kang, M.; Choi, Y.; Choi, J. The effect of celebrity endorsement on sustainable firm value: Evidence from the Korean telecommunications industry. Int. J. Advert. 2019, 38, 563–576. [Google Scholar] [CrossRef]
Whang, H.; Ko, E.; Zhang, T.; Mattila, P. Brand popularity as an advertising cue affecting consumer evaluation on sustainable brands: A comparison study of Korea, China, and Russia. Int. J. Advert. 2015, 34, 789–811. [Google Scholar] [CrossRef]
Vriens, M.; Chen, S.; Vidden, C. Mapping brand similarities: Comparing consumer online comments versus survey data. Int. J. Mark. Res. 2019, 61, 130–139. [Google Scholar] [CrossRef]
Timoshenko, A.; Hauser, J. Identifying customer needs from user-generated content. Mark. Sci. 2019, 38, 1–20. [Google Scholar] [CrossRef]
Fan, Z.; Xi, Y.; Li, Y. Supporting the purchase decisions of consumers: A comprehensive method for selecting desirable online products. Kybernetes 2018, 47, 689–715. [Google Scholar] [CrossRef]
Pang, B.; Lee, L. Opinion Mining and Sentiment Analysis. Found. Trends Inf. Retr. 2008, 2, 1–135. [Google Scholar] [CrossRef] [Green Version]
Joshi, M.; Prajapati, P.; Shaikh, A.; Vala, V. A survey on Sentiment Analysis. Int. J. Comput. Appl. 2017, 163, 34–39. [Google Scholar] [CrossRef]
Feldman, R. Techniques and applications for sentiment analysis. Commun. ACM 2013, 56, 82–89. [Google Scholar] [CrossRef]
Gandomi, A.; Haider, M. Beyond the hype: Big data concepts, methods, and analytics. Int. J. Inf. Manag. 2015, 35, 137–144. [Google Scholar] [CrossRef] [Green Version]
Rain, C. Sentiment Analysis in Amazon Reviews Using Probabilistic Machine Learning; Department of Computer Science, Swarthmore College: Swarthmore, PA, USA, 2013. [Google Scholar]
Kumar, K.; Desai, J.; Majumdar, J. Opinion mining and sentiment analysis on online customer review. In Proceedings of the IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), Tamil Nadu, India, 15–17 December 2016; pp. 1–4. [Google Scholar]
Paiano, A.; Lagioia, G.; Cataldo, A. A critical analysis of the sustainability of mobile phone use. Resour. Conserv. Recycl. 2013, 73, 162–171. [Google Scholar] [CrossRef]
Mantz, J.W. Improvisational economies: Coltan production in the eastern Congo. Soc. Anthropol. 2008, 16, 34–50. [Google Scholar] [CrossRef]
Mbembe, A. Necropolitics. Public Cult. 2003, 15, 11–40. [Google Scholar] [CrossRef]
Hayes, K.; Burge, R. Coltan Mining in the Democratic Republic of Congo: How Tantalum-Using Industries Can Commit to the Reconstruction of the DRC; Fauna & Flora International: Cambridge, UK, 2003. [Google Scholar]
Kemper, J.A.; Hall, C.M.; Ballantine, P.W. Marketing and Sustainability: Business as Usual or Changing Worldviews? Sustainability 2019, 11, 780. [Google Scholar] [CrossRef]
Pomering, A.; Johnson, L. Building Sustainability into Services Marketing: Expanding decision-making from a mix to a matrix. Sustainability 2018, 10, 2992. [Google Scholar] [CrossRef]
Piryani, R.; Madhavi, D.; Singh, V. Analytical mapping of opinion mining and sentiment analysis research during 2000–2015. Inf. Process. Manag. 2017, 53, 122–150. [Google Scholar] [CrossRef]
Hussein, D.M.E.D.M. A Survey on Sentiment Analysis Challenges. J. King Saud Univ.-Eng. Sci. 2018, 30, 330–338. [Google Scholar] [CrossRef]
Hendrawan, R.; Suryani, E.; Oktavia, R. Evaluation of E-Commerce Product Reviews Based on Structural, Metadata, and Readability Characteristics. Procedia Comput. Sci. 2017, 124, 280–286. [Google Scholar] [CrossRef]
Finkelstein, A.; Harman, M.; Jia, Y.; Martin, W.; Sarro, F.; Zhang, Y. Investigating the relationship between price, rating, and popularity in the Blackberry World App Store. Inf. Softw. Technol. 2017, 87, 119–139. [Google Scholar] [CrossRef]
Raja, D.; Pushpa, S. Feature level review table generation for E-Commerce websites to produce qualitative rating of the products. Future Comput. Inform. J. 2017, 2, 118–124. [Google Scholar] [CrossRef]
Von Helversen, B.; Abramczuk, K.; Kopeć, W.; Nielek, R. Influence of consumer reviews on online purchasing decisions in older and younger adults. Decis. Support Syst. 2018, 113, 1–10. [Google Scholar] [CrossRef]
Xu, X.; Wang, X.; Li, Y.; Haghighi, M. Business intelligence in online customer textual reviews: Understanding consumer perceptions and influential factors. Int. J. Inf. Manag. 2017, 37, 673–683. [Google Scholar] [CrossRef]
Dellarocas, C. The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Manag. Sci. 2003, 49, 1407–1424. [Google Scholar] [CrossRef]
Wang, J.; Wang, L.; Wang, M. Understanding the effects of eWOM social ties on purchase intentions: A moderated mediation investigation. Electron. Commer. Res. Appl. 2018, 28, 54–62. [Google Scholar] [CrossRef]
Godes, D.; Mayzlin, D. Using On-Line Conversations to Study Word-of-Mouth Communication. Mark. Sci. 2004, 23, 545–560. [Google Scholar] [CrossRef]
Chevalier, J.; Mayzlin, D. The Effect of Word of Mouth on Sales: Online Book Reviews. J. Mark. Res. 2006, 43, 345–354. [Google Scholar] [CrossRef] [Green Version]
Noone, B.; McGuire, K. Effects of price and user-generated content on consumers’ prepurchase evaluations of variably priced services. J. Hosp. Tour. Res. 2014, 38, 562–581. [Google Scholar] [CrossRef]
Tsang, A.; Prendergast, G. Is a star worth a thousand words? The inter-play between product-review texts and rating valences. Eur. J. Mark. 2009, 43, 1269–1280. [Google Scholar] [CrossRef]
Sun, T.; Youn, S.; Wu, G.; Kuntaraporn, M. Online word-of-mouth: An exploration of its antecedents and consequences. J. Comput. Mediat. Commun. 2006, 11, 1104–1127. [Google Scholar] [CrossRef]
Singh, J.; Irani, S.; Rana, N.; Dwivedi, Y.; Saumya, S.; Roy, P. Predicting the helpfulness of online consumer reviews. J. Bus. Res. 2017, 70, 346–355. [Google Scholar] [CrossRef]
Jandail, R. A proposed Novel Approach for Sentiment Analysis and Opinion Mining. Int. J. UbiComp 2014, 5, 1–10. [Google Scholar] [CrossRef]
Yi, J.; Nasukawa, T.; Bunescu, R.; Niblack, W. Sentiment Analyzer: Ex-tracting Sentiments about a Given Topic Using Natural Language Processing Techniques. In Proceedings of the Third IEEE International Conference on Data Mining, Melbourne, FL, USA, 19–22 November 2003; pp. 427–434. [Google Scholar]
Haddi, E.; Liu, X.; Shi, Y. The Role of Text Pre-processing in Sentiment Analysis. Procedia Comput. Sci. 2013, 17, 26–32. [Google Scholar] [CrossRef] [Green Version]
Cambria, E.; Schuller, B.; Xia, Y.; Havasi, C. New Avenues in Opinion Mining and Sentiment Analysis. IEEE Intell. Syst. 2013, 28, 15–21. [Google Scholar] [CrossRef]
Angulakshmi, G.; ManickaChezian, R. An analysis on opinion mining: Techniques and tools. Int. J. Adv. Res. Comput. Commun. Eng. 2014, 3, 1021–2278. [Google Scholar]
Medhat, W.; Hassan, A.; Korashy, H. Sentiment analysis algorithms and applications: A survey. Ain Shams Eng. J. 2014, 5, 1093–1113. [Google Scholar] [CrossRef] [Green Version]
Ravi, K.; Ravi, V. A survey on opinion mining and sentiment analysis: Tasks, approaches and applications. Knowl.-Based Syst. 2015, 89, 14–46. [Google Scholar] [CrossRef]
Devika, M.; Sunitha, C.; Ganesh, A. Sentiment Analysis: A Comparative Study on Different Approaches. Procedia Comput. Sci. 2016, 87, 44–49. [Google Scholar] [CrossRef] [Green Version]
Sneka, G.; Vidhya, C. Algorithms for Opinion Mining and Sentiment Analysis: An Overview. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2016, 6, 455–459. [Google Scholar]
Nguyen, H.; Jung, J. Statistical approach for figurative sentiment analysis on Social Networking Services: A case study on Twitter. Multimed. Tools Appl. 2016, 76, 8901–8914. [Google Scholar] [CrossRef]
Rajadesingan, A.; Zafarani, R.; Liu, H. Sarcasm detection on Twitter: A behavioral modeling approach. In Proceedings of the 8th ACM International Conference on Web Search and Data Mining, Shanghai, China, 2–6 February 2015; pp. 97–106. [Google Scholar]
Mehndiratta, P.; Sachdeva, S.; Soni, D. Detection of Sarcasm in Text Data Using Deep Convolutional Neural Networks. Scalable Comput. Pract. Exp. 2017, 18, 219–228. [Google Scholar] [CrossRef]
Cambria, E.; Poria, S.; Hazarika, D.; Kwok, K. SenticNet 5: Discovering Conceptual Primitives for Sentiment Analysis by Means of Context Embeddings. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, LA, USA, 2–7 February 2018; pp. 1795–1802. [Google Scholar]
Kiritchenko, S.; Mohammad, S. The Effect of Negators, Modals, and Degree Adverbs on Sentiment Composition. In Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, San Diego, CA, USA, 16 June 2016; pp. 43–52. [Google Scholar]
Jiménez, S.; Martín-Valdivia, M.; Martínez-Cámara, E.; Ureña, L. Studying the Scope of Negation for Spanish Sentiment Analysis on Twitter. IEEE Trans. Affect. Comput. 2017, 10, 129–141. [Google Scholar] [CrossRef]
Farooq, U.; Mansoor, H.; Nongaillard, A.; Ouzrout, Y.; Qadir, M. Negation Handling in Sentiment Analysis at Sentence Level. J. Comput. 2017, 12, 470–478. [Google Scholar] [CrossRef]
Cambria, E.; Das, D.; Bandyopadhyay, S.; Feraco, A. Affective Computing and Sentiment Analysis. IEEE Intell. Syst. 2016, 31, 102–107. [Google Scholar] [CrossRef]
García-Moya, L.; Anaya-Sánchez, H.; Berlanga-Llavori, R. Retrieving Product Features and Opinions from Customer Reviews. IEEE Intell. Syst. 2013, 28, 19–27. [Google Scholar] [CrossRef]
Singla, Z.; Randhawa, S.; Jain, S. Sentiment analysis of customer product reviews using machine learning. In Proceedings of the IEEE 2017 International Conference on Intelligent Computing and Control (I2C2), Coimbatore, India, 23–24 June 2017; pp. 1–5. [Google Scholar]
Paknejad, S. Sentiment Classification on Amazon Reviews Using Machine Learning Approaches. Ph.D. Thesis, KTH Royal Institute of Technology, School of Electrical Engineering and Computer Science, Stockholm, Sweden, 2018. [Google Scholar]
Abbasi, A.; France, S.; Zhang, Z.; Chen, H. Selecting Attributes for Sentiment Classification Using Feature Relation Networks. IEEE Trans. Knowl. Data Eng. 2011, 23, 447–462. [Google Scholar] [CrossRef]
Saura, J.; Palos-Sanchez, P.; Grilo, A. Detecting Indicators for Startup Business Success: Sentiment Analysis Using Text Data Mining. Sustainability 2019, 11, 917. [Google Scholar] [CrossRef]
Archak, N.; Ghose, A.; Ipeirotis, P. Show me the Money! Deriving the Pricing Power of Product Features by Mining Consumer Reviews. In Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, CA, USA, 12–15 August 2007; pp. 56–65. [Google Scholar]
Chi, X.; Siew, T.; Cambria, E. Adaptive two-stage feature selection for sentiment classification. In Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, Canada, 5–8 October 2017; pp. 1238–1243. [Google Scholar]
Mars, A.; Gouider, M. Big data analysis to Features Opinions Extraction of customer. Procedia Comput. Sci. 2017, 112, 906–916. [Google Scholar] [CrossRef]
Hu, W.; Gong, Z.; Guo, J. Mining product features from online reviews. In Proceedings of the IEEE 7th International Conference on E-Business Engineering, Shanghai, China, 10–12 November 2010; pp. 24–29. [Google Scholar]
Singh, P.; Sachdeva, A.; Mahajan, D.; Pande, N.; Sharma, A. An approach towards feature specific opinion mining and sentimental analysis across e-commerce websites. In Proceedings of the IEEE 5th International Conference-Confluence the Next Generation Information Technology Summit (Confluence), Noida, India, 25–26 September 2014; pp. 329–335. [Google Scholar]
Wang, Y.; Kim, K.; Lee, B.; Youn, H. Word clustering based on POS feature for efficient twitter sentiment analysis. Hum. Cent. Comput. Inf. Sci. 2018, 8, 17. [Google Scholar] [CrossRef]
Zhou, H.; Han, S.; Liu, Y. A Novel Feature Selection Approach Based on Document Frequency of Segmented Term Frequency. IEEE Access 2018, 6, 53811–53821. [Google Scholar] [CrossRef]
Young, T.; Hazarika, D.; Poria, S.; Cambria, E. Recent trends in deep learning based natural language processing. IEEE Comput. Intell. Mag. 2018, 13, 55–75. [Google Scholar] [CrossRef]
Socher, R.; Perelygin, A.; Wu, J.; Chuang, J.; Manning, C.; Ng, A.; Potts, C. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA, 19 October 2013; pp. 1631–1642. [Google Scholar]
Mora, H.; Ferrández, A.; Gil, D.; Peral, J. A computational method for enabling teaching-learning process in huge online courses and communities. Int. Rev. Res. Open Distrib. Learn. 2017, 18. [Google Scholar] [CrossRef]
Peral, J.; Ferrández, A.; Mora, H.; Gil, D.; Kauffmann, E. A Review of the Analytics Techniques for an Efficient Management of Online Forums: An Architecture Proposal. IEEE Access 2019, 7, 12220–12240. [Google Scholar] [CrossRef]
Padró, L.; Stanilovsky, E. FreeLing 3.0: Towards Wider Multilinguality. In Proceedings of the Language Resources and Evaluation Conference (LREC 2012) ELRA, Istanbul, Turkey, 21–27 May 2012; pp. 2473–2479. [Google Scholar]
Manning, C.; Surdeanu, M.; Bauer, J.; Finkel, J.; Bethard, S.; McClosky, D. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations, Baltimore, MD, USA, 22 June 2014; pp. 55–60. [Google Scholar]
Schmid, H. Improvements in Part-of-Speech Tagging with an Application to German. In Proceedings of the ACL SIGDAT-Workshop, Dublin, Ireland, 27 March 1995; pp. 1–9. [Google Scholar]
Nielsen, F. A new ANEW: Evaluation of a word list for sentiment analysis in microblogs. In Proceedings of the ESWC2011 Workshop on ‘Making Sense of Microposts’: Big Things Come in Small Packages, Heraklion, Crete, 30 May 2011; pp. 93–98. [Google Scholar]
Agerri, R.; Cuadros, M.; Gaines, S.; Rigau, G. Opener: Open polarity enhanced named entity recognition. Proces. Leng. Nat. 2013, 51, 215–218. [Google Scholar]
Fernández, J.; Gutiérrez, Y.; Gómez, J.; Martínez-Barco, P. GPLSI: Supervised sentiment analysis in twitter using skipgrams. In Proceedings of the 8th International Workshop on Semantic Evaluation SemEval, Dublin, Ireland, 23–24 August 2014; pp. 294–299. [Google Scholar]
He, R.; McAuley, J. Ups and Downs: Modeling the Visual Evolution of Fashion Trends with One-Class Collaborative Filtering. In Proceedings of the 25th International Conference on World Wide Web, Montreal, QC, Canada, 11–15 April 2016; pp. 507–517. [Google Scholar]
Junwu, Z.; Bin, L.; Fei, W.; Sicheng, W. Mobile Ontology. Int. J. Digit. Content Technol. Appl. 2010, 4, 46–54. [Google Scholar] [Green Version]
Hasni, N.; Bouallegue, R. Ontology for Mobile Phone Operating Systems. Int. J. Wirel. Mobile Netw. 2012, 4, 169–181. [Google Scholar]
Silge, J.; Robinson, D. Text Mining with R; O’Reilly Media Inc.: Sebastopol, CA, USA, 2017. [Google Scholar]
Liske, D. Tidy Sentiment Analysis in R. 2018. Available online: https://www.datacamp.com/community/tutorials/sentiment-analysis-R (accessed on 15 March 2019).
Barney, J.; Clark, D. Resource-Based Theory: Creating and Sustaining Competitive Advantage; Oxford University Press: Oxford, UK, 2007. [Google Scholar]
Kiron, D.; Prentice, P.; Ferguson, R. The analytics mandate. MIT Sloan Manag. Rev. 2014, 55, 1. [Google Scholar]
Dolores, L.; Macchiaroli, M.; De Mare, G. Sponsorship for the sustainability of historical-architectural heritage: Application of a model’s original test finalized to maximize the profitability of private investors. Sustainability 2017, 9, 1750. [Google Scholar] [CrossRef]
Zhang, Q.; Lu, X.; Peng, Z.; Ren, M. Perspective: A review of lifecycle management research on complex products in smart-connected environments. Int. J. Prod. Res. 2019, 1–22. [Google Scholar] [CrossRef]
Elrod, T.; Russell, G.; Shocker, A.; Andrews, R.; Bacon, L.; Bayus, B.; Carroll, J.; Johnson, R.; Kamakura, W.; Lenk, P.; et al. Inferring Market Structure from customer response to competing and complementary products. Mark. Lett. 2002, 13, 221–232. [Google Scholar] [CrossRef]
Lee, T.; Bradlow, E. Automated marketing research using online customer reviews. J. Mark. Res. 2011, 48, 881–894. [Google Scholar] [CrossRef]

Figure 1. The proposed architecture using sentiment analysis (SA) and text data mining to identify the main positive/negative product features.

Figure 2. Classification of reviews and products by star score and review sentiment score (RSS).

Figure 3. Comparative score graph: RSS versus feature sentiment score (FSS) of the products.

Figure 4. Positive (left) and negative (right) word clouds (features) in the dashboard.

Table 1. Products with significant differences between star score and RSS score.

#	Product	Star Score	Product RSS
1	B0012SK0F4	4	1
2	B0012U7NP2	5	2
3	B002RD07EW	5	2

Table 2. The top five products in the “Cell Phone” category based on the global score.

Product	Price (€)	Star Score	RSS	FSS	Global Score
B002ED12NA	85.47	4.5	4.00	4.0	0.95
003I86URI	79.90	5.0	2.83	2.75	0.89
B001PNFALA	67.98	4.5	3.00	3.0	0.88
B002ED5C6I	67.98	5.0	2.50	2.5	0.87
B000N4Z0SK	119.94	5.0	2.67	2.5	0.87

Table 3. The top five products with the best camera based on the feature sentiment score.

Product	Feature Sentiment Score (“Camera”)
B00192I5ZA	2.78
B000FDXU54	2.75
B001GT9GRW	2.75
B0034THXTK	2.67
B003VNKLF2	2.67

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kauffmann, E.; Peral, J.; Gil, D.; Ferrández, A.; Sellers, R.; Mora, H. Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining. Sustainability 2019, 11, 4235. https://0-doi-org.brum.beds.ac.uk/10.3390/su11154235

AMA Style

Kauffmann E, Peral J, Gil D, Ferrández A, Sellers R, Mora H. Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining. Sustainability. 2019; 11(15):4235. https://0-doi-org.brum.beds.ac.uk/10.3390/su11154235

Chicago/Turabian Style

Kauffmann, Erick, Jesús Peral, David Gil, Antonio Ferrández, Ricardo Sellers, and Higinio Mora. 2019. "Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining" Sustainability 11, no. 15: 4235. https://0-doi-org.brum.beds.ac.uk/10.3390/su11154235

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Managing Marketing Decision-Making with Sentiment Analysis: An Evaluation of the Main Product Features Using Text Data Mining

Abstract

1. Introduction

2. Background

3. The Proposed Methodology Architecture

3.1. Data Collection Stage

3.2. Review Preprocessing Using NLP Techniques

3.3. Product Feature Selection Stage

3.4. Sentiment Analysis Stage

3.5. Clustering Features Stage

3.6. New Score Stage

3.7. Dashboards Stage

4. Case Study

4.1. Data Description

4.2. Experimentation

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI