Next Article in Journal
Encounter with Bullying in Sport and Its Consequences for Youth: Amateur Athletes’ Approach
Next Article in Special Issue
More Agility to Semantic Similarities Algorithm Implementations
Previous Article in Journal
Satisfaction with the Aesthetic Effect and Quality of Life for Women after Breast Conserving Therapy (BCT)—Preliminary Research
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis

1
SKK Business School, Sungkyunkwan University, Seoul 03063, Korea
2
Airport Business Analytics, Economics Department, Airport Council International (ACI) World, 800 rue du Square Victoria, Suite 1810, Montreal, QC H4Z 1G8, Canada
3
Department of Health Sciences & Technology, Samsung Advanced Institute for Health Sciences & Technology (SAIHST), Sungkyunkwan University, Seoul 06355, Korea
4
Creativity Science Research Institute (CSRI), Sungkyunkwan University, Seoul 03063, Korea
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2019, 16(23), 4684; https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph16234684
Submission received: 4 October 2019 / Revised: 21 November 2019 / Accepted: 22 November 2019 / Published: 25 November 2019
(This article belongs to the Special Issue Computing Techniques for Environmental Research and Public Health)

Abstract

:
With the remarkable improvement in people’s socioeconomic living standards around the world, adolescent obesity has increasingly become an important public health issue that cannot be ignored. Thus, we have implemented its use in an attempt to explore the viability of scenario-based simulations through the use of a data mining approach. In doing so, we wanted to explore the merits of using a General Bayesian Network (GBN) with What-If analysis while exploring how it can be utilized in other areas of public health. We analyzed data from the 2017 Korean Youth Health Behavior Survey conducted directly by the Korea Centers for Disease Control & Prevention, including 19 attributes and 11,206 individual data points. Our simulations found that by manipulating the amount of pocket money-between $60 and $80-coupled with a low-income background, it has a high potential to increase obesity compared with other simulated factors. Additionally, when we manipulated an increase in studying time with a mediocre academic performance, it was found to potentially increase pressure on adolescents, which subsequently led to an increased obesity outcome. Lastly, we found that when we manipulated an increase in a father’s education level while manipulating a decrease in mother’s education level, this had a large effect on the potential adolescent obesity level. Although obesity was the chosen case, this paper acts more as a proof of concept in analyzing public health through GBN and What-If analysis. Therefore, it aims to guide health professionals into potentially expanding their ability to simulate certain outcomes based on predicted changes in certain factors concerning future public health issues.

1. Introduction

Due to changes in our lifestyle and eating habits, the world’s obese population has more than doubled from 0.857 billion to 2.1 billion since 1980. Additionally, according to World Health Organization (WHO), it is believed that by 2025, a total of 260 million children worldwide will be overweight, with more than 90 million categorized as obese or worse [1]. Obesity in adolescence is one of the most serious factors causing premature disease and the demise of health in later life [2]. According to the government of Korea, it was reported in 2018 that the rate of obesity among adolescents in South Korea was 10.8%, which means that at least one in every ten adolescents is obese [3].
With this rise in obesity, being able to identify potential causes can help to find preventive measures. Several studies have been conducted in order to explain the factors that cause obesity [4], citing a variety of factors such as: economic factors, such as the cost of eating; state-level prices of grocery food; restaurant prevalence; cigarettes; and alcohol [1]. However, prior research has only considered a few factors within their models, usually implementing simple statistical analysis [4], or using simple classification [5]. In this study, we propose using a more holistic approach whereby classification is integrated with a prediction model that allows the user to adjust certain factors in order to find out what the outcome could be based on this change. Thus, this paper implements the use of a General Bayesian Network (GBN) to identify and classify factors and then uses What-If analysis to then build scenarios on obesity. The use of such a tool is not only useful for obesity research but can also be directly transferred to other areas of health informatics in a way to predict future health outcomes.
Prior research has focused on the use of a GBN in health informatics [6], including its use with a Markov Blanket (MB) to understand obesity [7]. Although this prior research has shed light on obesity within health informatics, it has the following shortcomings. First, is the use of generally accepted influencing factors, such as dietary structure, physical exercise, cigarettes, and alcohol to help predict obesity. Also, although there are many factors that could be involved in the development and maintenance of obesity among a population, there are too many to be investigated in this proof of concept study. Therefore, this study’s focus was more on parameters that are less obvious, such as education and wealth of the family, smartphone usage, pocket money, time spent studying, academic performance, and parents’ education level. These factors have more profound and practical significance in the context of reality. Additionally, we take the analysis a step further compared with this prior work by implementing a What-If analysis based upon the results from the GBN.

1.1. Factors Affecting Adolescent Obesity

Adolescence is a pivotal period of one’s life, in which habits are difficult to shake off once learned. Habits such as eating and exercise are ones that adolescent adults struggle with the most [8]. In particular, unhealthy habits, such as smoking, drinking, lack of exercise, and overeating, are known to be direct causes of chronic illnesses, including obesity [9]. Obesity not only has a negative effect on physical, psychological, and emotional development in growing adolescents, but research shows that one-third of adolescent obesity cases leads to adult obesity in later life [9]. In addition, a higher score of Body Mass Index (BMI) ( 25 kg/ m 2 ) in adults is known to be deeply related to mortality and the occurrence of cardiovascular disease [10]. In addition, the socioeconomic characteristics of adolescents have been reported to have an effect on one’s health. For example, research has shown that elder elementary school students living within residential areas are at greater risk of obesity [11]. Studies have also focused on an ecological model that suggests that individual, interpersonal, organizational, and community characteristics can interact and affect individual behavior and health [12].
Lee et al. found an association between BMI and a mothers’ education level (low) and a lower age of adolescents (middle school). Also, it was found in urban areas for boys who’s fathers’ education level was low [4]. A lack of exercise, increased sitting time, and changes in dietary habits have also been seen as factors [13]. One of the main drivers of this lifestyle change is the recent prevalence in the use of information communication technologies, such as the Internet and smartphones [14]. Previous work has indicated that adolescents addicted to the Internet were more likely to be overweight [14]. Furthermore, a higher use of devices with screens, including computers, video games, smartphones, and tablets, has been seen to increase obesity risk factors in adolescents [11]. In particular, smartphone use has been shown to be related to obesity-related lifestyles, such as watching TV and bad eating habits. Furthermore, it has been shown that teenagers who have more than two hours use of a smartphone a day have higher BMI scores comparatively [11]. Park and Song found that the risk of obesity is much higher when the main reasons for engaging with a smartphone is for entertainment purposes compared with educational purposes, i.e., studying apps [1]. Conversely, their study found the risk of obesity to be lower when the smartphone is used for social purposes [1]. In addition, there has been found to be a significant moderating effect between the quality of sleep and one’s BMI. Hence, sleep quality should be considered as a factor for the prevention and management of obesity [3].

1.2. Data Mining for Predicting Obesity

Predicting obesity is a social challenge and, therefore, researchers have been studying obesity from various perspectives [15,16,17,18,19,20]. Due to its complexity, an increasing number of researchers are turning to the use of data mining and machine learning tools [21]. Bhattarai et al.’s [22] research used text classification to help predict obesity. Zhang, Tjortjis, Zeng, Qiao, Buchan, and Keane’s [5] paper provided a comparison between seven data mining techniques, including Bayesian Networks (BN). Research implementing Artificial Neural Network (ANN), Naïve Bayes Network (NBN), and Decision Tree (DT) have proven to be applicable to predicting childhood obesity [23,24]. However, the prediction of adolescent ages is slightly more complex and thus difficult for many algorithms to handle. This is because the reasons that lead to being overweight or obese are complicated, involving not only physiological but also genetic, sociological, and many psychological factors [25].

1.3. General Bayesian Network (GBN) and What-If Analysis

GBN is a directed acyclic graph in which nodes represent domain variables, and arcs between nodes represent probabilistic dependencies [26]. GBN is a popular type of probabilistic network, which has found many different applications in areas such as online learning [27], software development [28], medical informatics [24], social media analytics [29], environmental concerns [30], and in-classification tasks [31]. Finding the probability of an event given a set of prior evidence can be solved through Bayesian nets with sequential applications of Bayes Theorem. It is important to find out the inverse probability, i.e., when we know P(B|A), then we can use a Bayesian network to find out P(A|B). Overfitting often happens in GBN when the parameters of datasets reach the extents that the computer can cope with. Feature selection (FS) can be applied to solve the above problem through the removal of redundant and irrelevant features. For this reason, we implemented the use of the Markov Blanket, which has been used for this feature in previous papers [32,33,34].
P ( A | B ) = P ( B | A ) P ( A ) P ( B )
What-If analysis can be described as a data-intensive simulation that inspects the behavior of a complex system under some given hypothesis (called scenarios) [35]. More pragmatically, What-If analysis measures how changes in a set of independent variables impact a set of dependent variables with reference to a given simulation model [36]. What-If analysis can be used to solve several complicated problems, especially for highly complex research models where the number of variables would make it difficult for general classification models to find patterns [37]. Furthermore, if the What-If analysis is integrated with a GBN model, important matters can be easily perceived by researchers through the graphical presentation [38].
What-If analysis recalculates variables using substituted presumption to discover another consequence. What-If analysis can be represented by x i as the ith input of x = {x1, ..., xn}, and ( x i , x j ) is the sub-vector of x i , j , then, the What-If analysis is deduced with a decomposition of the function η(∙), and it can be fully written in the following [37]:
y = ( X ) = ( Y ) + i = 1 d z i ( x i ) + i < j z i , j ( x i , j ) + i < j < k z i , j , k ( x i , j , k ) + + z 1 , 2 , , d ( x ) where   z i ( x i ) = E ( Y | x i ) E ( Y ) ,   z i , j ( x i , j ) = E ( Y | x i , j ) z i ( x i ) z j ( x j ) E ( Y ) , z i , j , k ( x i , j , k ) = E ( Y | x i , j , k ) z i , j ( x i , j ) z i , k ( x i , k ) z j , k ( x j , k ) z i ( x i ) z j ( x j ) z k ( x k ) E ( Y ) .
In this research, GBN MB with What-If analysis was implemented in obesity to suggest the impact of various parameter changes on the incidence of obesity. Note that the parameters that we have selected for this paper are for the purposes of demonstration, rather than for the purposes of providing an overall medical solution for obesity.

2. Methodology

2.1. Procedure

First, we compared the GBN classifier with some common algorithms known for high accuracy in solving complex datasets. Next, accuracy, AUC, and F-measure were performed in order to test the performance of the GBN models against the other chosen classification models. Furthermore, in order to provide a practical implication for predicting adolescent obesity from this paper, What-If analysis was simulated using the result from the GBN models derived from the given dataset on obesity. The open source tool WEKA was used to implement classifiers on the dataset and can be found at the following URL: https://www.cs.waikato.ac.nz/ml/weka/.

2.2. Dataset

This paper uses the raw data of the 2017 Korean Youth Health Behavior Survey conducted by the Korean Centers for Disease Control & Prevention. The target collection group of the 13th Adolescent Health Form Online Survey (KCDC, 2017) had a participation of 62,276 in April 2016. The online survey of youth health behaviors was based on and approved by the government approval statistics committee (Approval No. 11758) and the National Health Promotion Act (Article 19). The 2017 Korean Youth Health Behavior Survey surveyed 123 questions in 15 areas, including smoking, drinking, and physical activity, and calculated 216 indicators. This paper used the sociological characteristics of the population, health morphological characteristics, smartphone use characteristics, obesity, and other related projects.

2.3. Data Pre-Processing

First, we deleted responses that had missing values and also removed the irrelevant attributes that could not help in evaluating obesity. Second, we removed the data for respondents that had no prior smartphone use, as well as data from respondents who answered as “don’t know” to the smartphone usage section. Third, we calculated the degree of obesity by the following steps: (1) the BMI is calculated by dividing the body weight by the square of the height (Kg/ m 2 ). (2) According to the 2007 growth chart for children and adolescents, as shown in Table 1, four categories are defined. Over the 95th percentile is obesity, the 85th percentile to 95th percentile is overweight, 5th percentile to 85th percentile is normal, and less than 5th percentile is underweight. Fourth, we selected 33 attributes based on three key articles, and the attributes are processed as follows: (1) since the classification of BMI is obtained according to gender and age, and the level of school is directly related to age, we deleted these four attributes: “AGE”, “SEX”, “SCHOOL”, and “GRADE”. (2) We combined some relevant attributes into one attribute, specifically, “F_BR”, “F_FRUIT”, “F_VEG”, and “F_MILK” were transformed into “Healthy_Eating”; “F_FASTFOOD”, “F_INSTND”, “F_CRACK”, “F_SODA”, “F_CAFFINE”, and “F_SWDRINK” were transformed into “Unhealthy_Eating”; “PA_SWD_S” and “PA_SWK_S” were transformed into “Sitting_Time_Study (min)”, and “INT_SP_WD” and “INT_SP_WK” were transformed into “Smartphone_Time (min)”.
After the data-preprocessing was complete, the revised dataset had 4 classes, 19 attributes, and 11,206 data instances (see Appendix A for a full breakdown).

3. Results

3.1. Performance Comparison of GBN with Other Classifiers

In order to confirm the suitability of our GBN-MB model, we have compared the results of Accuracy, F-measure, and AUC (area under the curve), which are commonly used to evaluate the performance of machine learning techniques [39,40,41], of the model to the other machine learning classifiers. Results indicated that for this specific set of data, the best fit was obtained with the GBN model that embedded the MB. Based on the scores obtained from accuracy, GBN-MB showed an accuracy [39] of 53.703%, an F-measure [40] of 0.535, and an AUC (area under the curve) [41] score of 0.758 (see Table 2). This result proved that the use of MB could enhance the performance of a GBN network’s classification ability. Hence, we recommend GBN-MB to predict adolescent obesity, as its performance is slightly better than all other classifiers. GBN and Bagging (BA) also were found to have comparatively reasonable scores; however, they fell short in all the tests. A Support Vector Machine (SVM) has been considered as a trustful machine learning approach among researchers [42,43,44]. However, surprisingly, for this dataset, Support Vector Machine (SVM) showed very poor results in accuracy (45.431%), together with the Naïve Bayes Network (NBN) (45.627%) compared to the other models implemented. Despite GBN-MB showing the best performance in all the tests, the results suggest that in the domain of adolescent obesity prediction, scores are fairly low, and thus, the complexity of the problem remains clear. However, moving forward, the inclusion of GBN-MB and potentially GBN and Bagging (BA) could be considered.

3.2. Chosen Factors for What-If Analysis

In order to find interesting relationships in the obesity data, we attempted to shift the lens in which previous research has viewed obesity. Although there are many common factors that could be used to explain the cause of obesity, this study focused more on unique factors that were not dealt with in previous research. Hence, we chose the following features for further analysis: “Obesity_Level”, “Pocket_Money (KRW)”, “Wealth”, “Pressure”, “Sitting_Time_Study (min)”, “Sleeping_Quality”, “Education_Mother”, “Education_Father”, “Academic_Performance”, “Smartphone_Time (min)”, and “Smartphone_Service”.
Table 3 shows the sensitivity to findings of each factor node with respect to each influencing factor. There would be a higher influence of the variable on the target nodes when the value of mutual information was higher. The same principle can be applied to entropy (%), which means the percentage of each influencing factors on the dependent variable is greater (Obesity_Level).
As Table 3 shows, Pocket_Money (KRW) was found to be the most influential factor on the level of obesity, while Sleeping_Quality, Sitting_Time_Study (min), Academic_Performance, Wealth, and Pressure follow. Through this result, it could be interpreted that the amount of pocket money which adolescents receive every week can affect the somatotype. In addition, the quality of sleeping and the time adolescents spent sitting for study ranked second and third, respectively, indicating that healthy lifestyles are significant to the bodily form during one’s adolescence. The academic performance, family wealth, and pressure level ranked fourth, fifth, and sixth, respectively. All the important influencing factors will be discussed in detail by sensitivity analysis in the following paragraphs.

3.3. What-If Analysis

What-If analysis can be used to solve several complicated problems, especially for highly complex research models. For example, to determine the factors that affect adolescent obesity, we needed to manipulate the maximum and minimum output of the models. Therefore, in this study, we have generated 13 scenarios for obesity, as shown in Table 4.
Figure 1, Figure 2, Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8 show the results of the What-If analysis for each scenario. Figure 1 shows the initial state of the Bayesian Network previously obtained in the classification section of this paper, in which the initial value of ‘Obesity’ is 22.8.
When scenario 1 is applied (see Figure 2), the medium value of ‘Pocket_Money’ (60 k x < 80   k ) is applied, and as can be seen, the value of ‘Obesity’ increases from 23.0 to 59.7. This result shows that adolescents with weekly pocket money around 60 k to 80 k Korean Won are more likely to become obese. When the level of obesity was manipulated with maximized or minimized levels of ‘Pocket_Money’, the value of ‘Obesity’ became close to 0. This result indicates that adolescents who have either a large budget or low budget for pocket money have a near to zero chance of becoming obese. Potential reasons for this kind of finding could be inferred as those adolescents who have larger budgets might pay more attention to body management or spend money on bigger ticket items compared with a small ticket item such as fast food. In contrast, if one has a smaller budget, or none at all, they cannot afford to spend money on luxuries such as fast food or snacks. Hence, our results indicate that only those who have a reasonable pocket money budget may face the large risk of being obese.
When scenario 2 is applied (see Figure 3), which sets ‘Sleeping_Quality’ to ‘Very bad’, the value of ‘Obesity’ increases from 22.8 to 29.9. Even though the change is not so dramatic, we can still find that adolescents with poor sleeping quality are more likely to be obese. At the same time, we observed that the value of ‘Pressure’ increased significantly from the initial 39.1 to 67.4, indicating that high pressure brings about bad sleeping quality. This result presumes that adolescents may manage pressure from studying and exams by eating, leading to a higher probable chance of obesity. Essentially, eating could be seen as a coping mechanism to deal with times of pressure.
When we apply scenario 3 (see Figure 4), time spent studying is set to the maximum value. This has the effect of slightly reducing smartphone usage, but more obvious is the amount of pressure on adolescents. Pressure in a state of high time spent studying increases in the high category from 39.1 to 50.0, having a knock-on effect in terms of obesity levels (22.8 > 31.0). Therefore, scenario 3 confirms what is generally deemed as common knowledge when it comes to excessive studying.
For scenario 4, we selected a medium budget for ‘Pocket_Money’ and also set the ‘Wealth’ to ‘Low’ (see Figure 5). As can be seen in the results, the value of ‘Obesity’ has increased from 22.8 to 74.7, indicating that the family economic condition compounded with the amount of money that adolescents received as pocket money can influence the adolescents’ obesity level. This finding reflects the fact that families who are on the lower end of the economic spectrum have to buy cheaper fast and processed foods. This then has a knock-on effect on the fact that this choice from parents then reflects into the adolescent’s choice in how they spend their pocket money, i.e., further fast food, or unhealthy snacks.
In Figure 6, we applied scenario 5, which maximizes the level of ‘Sitting_Time_Study’, and also implemented a middle value for ‘Academic_Performance’. The result showed that the value of obesity increased from 22.8 to 35.3. Upon testing the significance of this feature while maximizing or minimizing the level of ‘Academic_Performance’, we found no significant change to obesity. In addition, the change in obesity level is not significant when we maximize or minimize the level of ‘Academic_Performance’ in conjunction with maximized ‘Sitting_Time_Study’. We can reasonably suggest that the students that are in the medium percentile of intelligence might have to put more effort into their studying efforts, which ultimately leads to the increase of time spent sitting down studying. Therefore, the value of obesity also increased, respectively. At the same time, it is obvious that the value of high ‘Pressure’ also increased from 39.1 to 50.5, a greater increase compared with scenario 3. This suggests that those who spend more time studying who fall within the medium percentile of intelligence suffer from a higher pressure from studying.
By maximizing the level of ‘Smartphone_Time’ and setting the ‘Smartphone_Service’ configuration to “study”, we applied scenario 6 to the obesity dataset (see Figure 7). The results from this adjustment show how obesity went from 23.0 to 32.8 in probability. This scenario shows similar aspects to the results that were seen in scenario 4, albeit with different manipulation of the variables. As can be seen in Figure 7, the increase in studying time spent on a smartphone results in an increase in obesity level. As smartphones continue to be an integral part of adolescent life and grow as a source of entertainment, studying, and socializing for these young people, it is leading to a decrease in physical activity and thus, can help to explain why obesity will increase, even if the technology is used for studying.
In our final scenario, we wanted to investigate the relationship between adolescent obesity and their parents’ education level. To the best of our knowledge, this has not been explored in previous research and, thus, could shed light on an unknown area within the obesity debate. To achieve this supposition, we adjusted the value of the father’s education data point to the maximum value as well as that of the of the mother’s education data point to the minimum value to see if the educational backgrounds of parents in general could influence the obesity levels of their children, as shown in scenario 7, (see Figure 8). In doing this, we are not saying that the people questioned in this survey will change their education; instead, this allows a general overview of what potentially would happen if this trend was to continue and become more extreme. The result showed that obesity levels were likely to increase from 23.0 to 32.8. This combination proved to be the highest change in obesity levels when compared to other scenarios such as “Maximized mother and father education” and “Minimized mother and father education”. A plausible explanation of this is that the high level of a father’s education results in higher level job, one that has more demanding hours and responsibility. Consequently, this takes time away from the father’s family life and leaves the mother to rear the children in a more independent household. Coupling this with a low education level for the mother creates the potential for the mother to not correctly nourish the children with correct amounts of food or nutrition and can lead to the mother making poor choices when it comes to eating out. All these factors ultimately lead to a greater risk of adolescent obesity.

4. Discussion

With the remarkable improvement in people’s socioeconomic living standards, the obesity of adolescents has become a serious social problem in the developed world. Over the past 30 years, the rate of adolescent obesity has quadrupled, increasing health care and their associated costs, as well as increasing the risk of disease. Although obesity rates have been halting in countries like the United States [45], more recently, in developed countries, such as South Korea, obesity in adolescents is dramatically increasing. With this, the associated risks and health concerns, for example, increased chance of cancer, are now becoming a serious problem in these countries too [46]. Up until now, much research on obesity and adolescence has been based on case studies, and the analysis of data with common techniques has helped to make people more aware of the issue and increase the level of understanding in this area [47,48]; however, little research has tried to build scenarios that can help to predict the levels of obesity based on computing techniques [6]. The lack of research within this area is great, so this study attempted to fill this gap by applying computing techniques to a public health dataset obtained from the South Korean government in order to try and predict the main contributing factors of adolescent obesity.
This paper focused on predicting the influencing factors of adolescent obesity by using a GBN with an embedded MB. Chung et al. (2013) made recommendations on the use of a GBN within health informatics, and previous research done on data from the UK showed the power of using a GBN for understanding obesity [7]. The results of our study also found a GBN with an MB to be the best classifier for the causes of obesity. However, the classification of causes cannot tell us much, and therefore, we took the focus one step further by adding the power of a What-If analysis. In doing so, and to the best of our knowledge, this paper is one of the first to implement this computing technique on obesity prediction within adolescents. Implications of the results from this analysis are presented next.

4.1. Implications

Based on the seven scenarios poised in the What-If analyses, we can reach the following implications. First, in scenario one, adolescents with a reasonable pocket money budget were seen to have a higher chance of obesity. This is also cited in prior research, which suggested that children in China who had more pocket money would more likely consume sugary drinks and eat fast food [49]. Although this study focused on children, continuing a trend from childhood through to adolescence is very possible. However, it is clear that when children turn to adolescence, it is possible that those with considerably larger budgets are more likely to take an interest in activities such as going to the cinema or driving. Thus, money spent on food at childhood may turn into bigger ticket items. In Li et al.’s study [49], they found that the mother’s perception of their caregiving role, plus concern over their child’s health, played an influence on how much money they gave for pocket money. This tie is nicely seen within scenario 7.1. whereby low education of the mother resulted in greater obesity. However, interestingly, when we applied a low economic family scenario alongside a medium level of pocket money, obesity increased dramatically. Summarizing these findings would lead one to imply that low education of the mother (i.e., unawareness of child/adolescents’ health) compounded with a reasonable amount of pocket money is also tied in with a low-income home. Low-income households usually do not have the luxury of affording fresh fruit and vegetables, and thus, fast food and packaged foods are more likely on the menu at home. Thus, poor teaching on behalf of the mother (based on lack of education and funds) leads to unhealthy choices when the child/adolescence has enough pocket money to buy food/snacks.
From scenario two, it shows that as poor quality of sleep increases the chance of being obese, agreeing with prior research that also links body aches with these two factors [50]. Another key outcome seen was with the increase in pressure within adolescents, a factor that increased in quite significantly when sleep deprivation and obesity also increased. This, therefore, adds a psychological element to the obesity epidemic and has been linked in prior research [51]. Lack of sleep, especially in countries like South Korea, is directly linked to higher hours spent studying; however, no link between hours spent studying and obesity has previously been found [52]. This study, however, did find a direct link between increased study time and obesity. Excessive studying can lead to boredom as well as tiredness. From this result, it could be implied that adolescents within these states would then turn to eating in order to eliminate boredom or turn to energy drinks or coffee to relieve fatigue. In both cases, the risk of choosing high sugar snacks and drinks is high. Additionally, excessive studying leads to sleepless nights, as well as late consumption of sugary and caffeinated drinks, which helps to prevent a good night’s sleep. Therefore, our results comply with common knowledge when it comes to the risks of obesity in these particular aspects.
Finally, smartphone use was implied to increase the chances of obesity, when used for studying purposes in scenario six. Although smartphone use has been linked to obesity in prior work [53], it has not specified the usage of the smartphone. This result could imply that when Korean adolescents study, they use their phones as a tool, and therefore, with this, it removes time away from, for example, physical activity. However, contrary to common belief, this result cannot ignore the fact that phones used for entertainment and social purposes may also have a contributing effect on the potential levels of obesity.

4.2. Limitations and Future Recommendations

Korea is still a society largely driven by male dominance and a mother and father household. Therefore, some of the scenarios might be unique whereby men are the so-called ‘breadwinners’, and women are expected to be ‘stay-at-home’ mothers. Although some of the scenarios shed potentially new implications on the causes of obesity, the Korean culture may be playing a more dominant factor in these results. This implies that these results may not cross boundaries into countries whereby women play a more liberal or dominant role in the workplace and whereby women are the ‘breadwinners’, and it is instead, the father who stays at home. Furthermore, this result ignores the fact that there is growing evidence that divorced households, i.e., single mothers or fathers, have been shown to have an effect on obesity in preadolescent children [54], and it is very plausible that this effect could spill over into adolescents too.
Another limitation of this study is that we did not consider the effects of factors such as posttraumatic stress disorders, lack of exercise, and a direct reference to diet choices within our scenario building. Recent research has started to make direct links between these factors and obesity [55], and therefore, future research will need to look to include such factors in order to gain a greater insight into the causes of adolescent obesity. Another factor that is sometimes ignored within the public health sphere when it comes to preventing obesity is in the government’s role to potentially limit the number of, or heavily tax fast food restaurants, such as McDonalds and KFC, which are now rapidly growing in Asia [56]. Therefore, future research could also look into the geographical locations of fast food restaurants and the perceived choice adolescents have when it comes to selecting a restaurant.
Because our GBN-MB research model is based on cross sectional data, the relationships are not causal, while they are more associative. In addition, it is difficult to represent the characteristics of all situations within the globe as the dataset used was limited to the case of Korea. Thus, future research should look to study a selection of various data samples from several nations who are suffering from this rise in obesity. This would help to find more meaningful results that have not been derived in this study and could potentially contribute even more by providing a more fruitful solution from the perspective of public health concerns on obesity. In line with this, another limitation is in the ability to translate informatic-based models into real-world health informatics. Therefore, in order to examine the findings within this paper, we suggest that future research should look to employ longitudinal epidemiological data to help support the findings from this paper.

Author Contributions

Conceptualization, K.C.L. and C.K.; methodology, K.C.L., C.K.; software, F.J.C., C.K., C.L., Y.L.; validation, C.K., F.J.C.; formal analysis, C.K., F.J.C., Y.L., C.L.; investigation, C.K., F.J.C., K.C.L., Y.L., C.L.; data curation, C.K., Y.L., C.L.; writing—original draft preparation, C.K., F.J.C., Y.L., C.L.; writing—review and editing, K.C.L.; supervision, K.C.L.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. All attributes included within this study.
Table A1. All attributes included within this study.
Revised AttributesOriginal AttributesTypeDescriptionSource
RegionCTYPENominalCity type (County, Big city, Small city)Lee, Kang, Kim, Son, Lee and Ham [4]
Park and Song [1]
Jang, Oh, Kim and Shin [3]
Academic_PerformanceE_S_RCRDAcademic performance
PressureM_STRHow often do you feel under pressure?
Suicide_ThoughtM_SUI_CONIn the last 12 months, have you ever had suicide thought?
Sleeping_QualityM_SLP_ENIn the last 7 days, do you think that sleeping time is sufficient?
DrinkingDRINKINGHave you ever had a drink?
SmokingSMOKINGHave you ever smoked cigarettes?
Education_FatherE_EDU_FFather’s educational levelLee, Kang, Kim, Son, Lee and Ham [4]
Education_MotherE_EDU_MMother’s educational level
WealthE_SESFamily economic levelJang, Oh, Kim and Shin [3]
Pocket_Money (KRW)E_ALLWNAverage pocket money per week
Healthy_EatingF_BRNumericDuring the last 7 days, how many days did you eat breakfast?Lee, Kang, Kim, Son, Lee and Ham [4]
Park and Song [1]
Jang, Oh, Kim and Shin [3]
F_FRUITDuring the last 7 days, how often did you eat fruit?
F_VEGDuring the last 7 days, how often did you eat vegetables?
F_MILKDuring the last 7 days, how often did you drink milk?
Unhealthy_EatingF_FASTFOODDuring the last 7 days, how often did you eat fast food?
F_INSTNDDuring the last 7 days, how often did you eat instant noodles?
F_CRACKDuring the last 7 days, how often did you eat snake?
F_SODADuring the last 7 days, how often did you drink carbonated drinks?
F_CAFFEINEDuring the last 7 days, how often did you drink coffee or energy drinks?
F_SWDRINKDuring the last 7 days, how often did you drink sweet drinks?
Exercise_60 minPA_TOTIn the past 7 days, the number of days that take more than 60 min of exercise.Lee, Kang, Kim, Son, Lee and Ham [4]
Park and Song [1]
Jang, Oh, Kim and Shin [3]
Exercise_20 minPA_VIGIn the past 7 days, the number of days that take more than 20 min of strenuous exercise.
Sitting_Time_Study (min)PA_SWD_SIn the last 7 days, how many hours do you spend sitting for study per day on weekday?Lee, Kang, Kim, Son, Lee and Ham [4]
PA_SWK_SIn the last 7 days, how many hours do you spend sitting for study per day on weekend?
Smartphone_Time (min)INT_SP_WDIn the last 7 days, how many hours do you spend using smartphone per day on weekday?Park and Song [1]
INT_SP_WKIn the last 7 days, how many hours do you spend using smartphone per day on weekend?
Smartphone_ServiceINT_SP_ITEMNominalDuring the last 30 days, which kind of smartphone service is mainly used?
Obesity_LevelObesity >=95th BMI PercentileLee, Kang, Kim, Son, Lee and Ham [4]
Park and Song [1]
Overweight85th~95th BMI Percentile
Normal5th~85th BMI Percentile
Underweight<=5th BMI Percentile

References

  1. Park, M.H.; Song, H.Y. Relationship among the Korean Adolescents Obesity, Health Behavior and Smart phone Usage. J. Korean Public Health Nurs. 2019, 33, 47–58. [Google Scholar]
  2. Ebbeling, C.B.; Pawlak, D.B.; Ludwig, D.S. Childhood obesity: Public-health crisis, common sense cure. Lancet 2002, 360, 473–482. [Google Scholar] [CrossRef]
  3. Jang, M.N.; Oh, S.-M.; Kim, H.S.; Shin, H. The Relationship between Quality of Sleep and Body Mass Index of Adolescents: Focusing on Mediating Effect of Physical Activity, Dietary Habit, and Academic Stress and Moderating Effect of Gender. J. Korean Public Health Nurs. 2019, 33, 85–97. [Google Scholar]
  4. Lee, J.Y.; Kang, S.N.; Kim, S.A.; Son, D.M.; Lee, B.G.; Ham, O.K. Factors Influencing Obesity among Rural and Urban Adolescent: Analysis of 2013 Korean Youth Health Behavior Survey. J. Korean Public Health Nurs. 2019, 33, 73–84. [Google Scholar]
  5. Zhang, S.; Tjortjis, C.; Zeng, X.; Qiao, H.; Buchan, I.; Keane, J. Comparing data mining methods with logistic regression in childhood obesity prediction. Inf. Syst. Front. 2009, 11, 449–460. [Google Scholar] [CrossRef]
  6. Chung, D.; Lee, K.C.; Seong, S.C. General Bayesian network approach to health informatics prediction: Emphasis on performance comparison. Procedia Soc. Behav Sci 2013, 81, 465–468. [Google Scholar] [CrossRef]
  7. Harding, N.J. Application of Bayesian Networks to Problems within Obesity Epidemiology. Ph.D. Thesis, The University of Manchester, Manchester, UK, 2011. [Google Scholar]
  8. Kim, H.-K.; Kim, J.-H.; Jung, H.-K. A comparison of health related habits, nutrition knowledge, dietary habits, and blood composition according to gender and weight status of college students in Ulsan. Korean J. Nutr. 2012, 45, 336–346. [Google Scholar] [CrossRef]
  9. Lee, J.H.; Yi, K.H. An analysis of the determinants of adolescents’ health inequality in Korea: A view from the students’ selfrated health condition and obesity. Korea Educ. Rev. 2015, 21, 217–243. [Google Scholar]
  10. Jee, S.H.; Sull, J.W.; Park, J.; Lee, S.-Y.; Ohrr, H.; Guallar, E.; Samet, J.M. Body-mass index and mortality in Korean men and women. N. Engl. J. Med. 2006, 355, 779–787. [Google Scholar] [CrossRef]
  11. Kenney, E.L.; Gortmaker, S.L. United States adolescents’ television, computer, videogame, smartphone, and tablet use: Associations with sugary drinks, sleep, physical activity, and obesity. J. Pediatr. 2017, 182, 144–149. [Google Scholar] [CrossRef]
  12. Sallis, J.; Owen, N.; Fisher, E. Ecological models of health behavior. In Health Behavior and Health Education: Theory, Research, and Practice; Glanz, K., Rimer, B.K., Viswanath, K., Eds.; Jossey-Bass: San Francisco, CA, USA, 2008. [Google Scholar]
  13. Hill, J.O.; Peters, J.C. Environmental contributions to the obesity epidemic. Science 1998, 280, 1371–1374. [Google Scholar] [CrossRef] [PubMed]
  14. Burke, V.; Beilin, L.J.; Durkin, K.; Stritzke, W.G.; Houghton, S.; Cameron, C.A. Television, computer use, physical activity, diet and fatness in Australian adolescents. Int. J. Pediatr. Obes. 2006, 1, 248–255. [Google Scholar] [CrossRef] [PubMed]
  15. Moore, J.; Haemer, M.; Mirza, N.Z.; Weatherall, Y.; Han, J.; Mangarelli, C.; Hawkins, M.J.; Xanthakos, S.; Siegel, R. Pilot Testing of a Patient Decision Aid for Adolescents with Severe Obesity in US Pediatric Weight Management Programs within the COMPASS Network. Int. J. Environ. Res. Public Health 2019, 16, 1776. [Google Scholar] [CrossRef] [PubMed]
  16. Ievers-Landis, C.E.; Dykstra, C.; Uli, N.; O’Riordan, M.A. Weight-Related Teasing of Adolescents Who Are Primarily Obese: Roles of Sociocultural Attitudes Towards Appearance and Physical Activity Self-Efficacy. Int. J. Environ. Res. Public Health 2019, 16, 1540. [Google Scholar] [CrossRef]
  17. Zanganeh, M.; Adab, P.; Li, B.; Frew, E. A systematic review of methods, study quality, and results of economic evaluation for childhood and adolescent obesity intervention. Int. J. Environ. Res. Public Health 2019, 16, 485. [Google Scholar] [CrossRef]
  18. Galan-Lopez, P.; Ries, F.; Gisladottir, T.; Domínguez, R.; Sánchez-Oliver, A. Healthy lifestyle: Relationship between Mediterranean diet, body composition and physical fitness in 13 to 16-years old Icelandic students. Int. J. Environ. Res. Public Health 2018, 15, 2632. [Google Scholar] [CrossRef]
  19. Brewis, A.; Bruening, M. Weight shame, social connection, and depressive symptoms in late adolescence. Int. J. Environ. Res. Public Health 2018, 15, 891. [Google Scholar] [CrossRef]
  20. Gustafson, A.; Jilcott, P.S.; McDonald, J.; Ford, H.; Connelly, P.; Gillespie, R.; Liu, E.; Bush, H.; Brancato, C.; Babatande, T. Direct effects of the home, school, and consumer food environments on the association between food purchasing patterns and dietary intake among rural adolescents in Kentucky and North Carolina, 2017. Int. J. Environ. Res. Public Health 2017, 14, 1255. [Google Scholar] [CrossRef]
  21. Fayyad, U.; Piatetsky-Shapiro, G.; Smyth, P. From data mining to knowledge discovery in databases. AI Mag. 1996, 17, 37. [Google Scholar]
  22. Bhattarai, A.; Rus, V.; Dasgupta, D. Classification of clinical conditions: A case study on prediction of obesity and its co-morbidities. In Proceedings of the 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLING’09), Memphis, TN, USA, 24–28 February 2009. [Google Scholar]
  23. Adnan, M.H.B.M.; Husain, W.; Damanhoori, F. A survey on utilization of data mining for childhood obesity prediction. In Proceedings of the 8th Asia-Pacific Symposium on Information and Telecommunication Technologies, Kuching, Malaysia, 15–18 June 2010; pp. 1–6. [Google Scholar]
  24. Codecasa, D.; Stella, F. Learning continuous time Bayesian network classifiers. Int. J. Approx. Reason. 2014, 55, 1728–1746. [Google Scholar] [CrossRef]
  25. Dunstan, J.; Aguirre, M.; Bastías, M.; Nau, C.; Glass, T.A.; Tobar, F. Predicting nationwide obesity from food sales using machine learning. Health Inform. J. 2019. [Google Scholar] [CrossRef]
  26. Bouckaert, R.R. Bayesian network classifiers in weka. 1 September 2004. Available online: https://researchcommons.waikato.ac.nz/bitstream/handle/10289/85/content.pdf?sequence=1 (accessed on 14 August 2019).
  27. Nguyen, T.T.T.; Nguyen, T.T.; Sharma, R.; Liew, A.W.C. A Lossless Online Bayesian Classifier. Inf. Sci. 2019, 489, 1–17. [Google Scholar] [CrossRef]
  28. Perkusich, M.; Soares, G.; Almeida, H.; Perkusich, A. A procedure to detect problems of processes in software development projects using Bayesian networks. Expert Syst. Appl. 2015, 42, 437–450. [Google Scholar] [CrossRef]
  29. Li, G.; Yao, L.; Zhang, J.; Li, X.; Dang, S.; Zeng, K.; Zhou, Y.; Gao, F. Tumor-suppressive microRNA-34a inhibits breast cancer cell migration and invasion via targeting oncogenic TPD52. Tumor Biol. 2016, 37, 7481–7491. [Google Scholar] [CrossRef] [PubMed]
  30. Ropero, R.; Renooij, S.; Van der Gaag, L. Discretizing environmental data for learning Bayesian-network classifiers. Ecol. Model. 2018, 368, 391–403. [Google Scholar] [CrossRef]
  31. Alonso-Montesinos, J.; Martínez-Durbán, M.; del Sagrado, J.; del Águila, I.; Batlles, F. The application of Bayesian network classifiers to cloud classification in satellite images. Renew. Energy 2016, 97, 155–161. [Google Scholar] [CrossRef]
  32. Madden, M.G. On the classification performance of TAN and general Bayesian networks. In Proceedings of the International Conference on Innovative Techniques and Applications of Artificial Intelligence, Cambridge, UK, 9–11 December 2008; pp. 3–16. [Google Scholar]
  33. Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Elsevier: Amsterdam, The Netherlands, 2014. [Google Scholar]
  34. Kaufmann, D.; Parbhoo, S.; Wieczorek, A.; Keller, S.; Adametz, D.; Roth, V. Bayesian markov blanket estimation. In Proceedings of the 19th International Conference on Artificial Intelligence and Statistics, Cadiz, Spain, 9–11 May 2016; pp. 333–341. [Google Scholar]
  35. Golfarelli, M.; Rizzi, S.; Proli, A. Designing what-if analysis: Towards a methodology. In Proceedings of the 9th ACM international workshop on Data warehousing and OLAP, Arlington, VA, USA, 10 November 2006; pp. 51–58. [Google Scholar]
  36. Philippakis, A.S. Structured what if analysis in DSS models. In Proceedings of the 1988 Twenty-First Annual Hawaii International Conference on System Sciences. Volume III: Decision Support and Knowledge Based Systems Track, Kailua-Kona, HI, USA, 5–8 January 1988; pp. 366–370. [Google Scholar]
  37. Oakley, J.E.; O’Hagan, A. Probabilistic sensitivity analysis of complex models: A Bayesian approach. J. R. Stat. Soc. Ser. B 2004, 66, 751–769. [Google Scholar] [CrossRef]
  38. Nadathur, S.G.; Warren, J.R. Emergency department triaging of admitted stroke patients—A Bayesian Network analysis. Health Inf. J. 2011, 17, 294–312. [Google Scholar] [CrossRef]
  39. Sokolova, M.; Japkowicz, N.; Szpakowicz, S. Beyond accuracy, F-score and ROC: A family of discriminant measures for performance evaluation. In Proceedings of the Australasian joint conference on artificial intelligence, Hobart, Australia, 4–8 December 2006; pp. 1015–1021. [Google Scholar]
  40. Powers, D.M. Evaluation: From precision, recall and F-measure to ROC, informedness, markedness and correlation. 15 December 2011. Available online: https://dspace2.flinders.edu.au/xmlui/handle/2328/27165 (accessed on 12 August 2019).
  41. Brown, I.; Mues, C. An experimental comparison of classification algorithms for imbalanced credit scoring data sets. Expert Syst. Appl. 2012, 39, 3446–3453. [Google Scholar] [CrossRef]
  42. Sasikala, B.; Biju, V.G.; Prashanth, C. Kappa and accuracy evaluations of machine learning classifiers. In Proceedings of the 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT), Bangalore, India, 19–20 May 2017; pp. 20–23. [Google Scholar]
  43. Abdelaal, M.M.A.; Sena, H.A.; Farouq, M.W.; Salem, A.-B.M. Using data mining for assessing diagnosis of breast cancer. In Proceedings of the International Multiconference on Computer Science and Information Technology, Wisla, Poland, 18–20 October 2010; pp. 11–17. [Google Scholar]
  44. Kim, C.; Costello, F.J.; Lee, K.C. Integrating Qualitative Comparative Analysis and Support Vector Machine Methods to Reduce Passengers’ Resistance to Biometric E-Gates for Sustainable Airport Operations. Sustainability 2019, 11, 5349. [Google Scholar] [CrossRef]
  45. Golden, N.H.; Schneider, M.; Wood, C. Preventing obesity and eating disorders in adolescents. Pediatrics 2016, 138, e20161649. [Google Scholar] [CrossRef] [PubMed]
  46. Song, M.; Choi, J.-Y.; Yang, J.J.; Sung, H.; Lee, Y.; Lee, H.-W.; Kong, S.-H.; Lee, H.-J.; Kim, H.-H.; Kim, S.G. Obesity at adolescence and gastric cancer risk. Cancer Causes Control 2015, 26, 247–256. [Google Scholar] [CrossRef] [PubMed]
  47. Must, A.; Strauss, R.S. Risks and consequences of childhood and adolescent obesity. Int. J. Obes. 1999, 23, S2. [Google Scholar] [CrossRef] [PubMed]
  48. Lobstein, T.; Jackson-Leach, R.; Moodie, M.L.; Hall, K.D.; Gortmaker, S.L.; Swinburn, B.A.; James, W.P.T.; Wang, Y.; McPherson, K. Child and adolescent obesity: Part of a bigger picture. Lancet 2015, 385, 2510–2520. [Google Scholar] [CrossRef]
  49. Li, M.; Xue, H.; Jia, P.; Zhao, Y.; Wang, Z.; Xu, F.; Wang, Y. Pocket money, eating behaviors, and weight status among Chinese children: The childhood obesity study in China mega-cities. Prev. Med. 2017, 100, 208–215. [Google Scholar] [CrossRef] [PubMed]
  50. Paiva, T.; Gaspar, T.; Matos, M.G. Sleep deprivation in adolescents: Correlations with health complaints and health-related quality of life. Sleep Med. 2015, 16, 521–527. [Google Scholar] [CrossRef]
  51. Marseglia, L.; Manti, S.; D’Angelo, G.; Nicotera, A.; Parisi, E.; Di Rosa, G.; Gitto, E.; Arrigo, T. Oxidative stress in obesity: A critical component in human diseases. Int. J. Mol. Sci. 2015, 16, 378–400. [Google Scholar] [CrossRef] [Green Version]
  52. Kong, I.G.; Lee, H.-J.; Kim, S.Y.; Sim, S.; Choi, H.G. Physical activity, study sitting time, leisure sitting time, and sleep time are differently associated with obesity in Korean adolescents: A population-based study. Medicine 2015, 94, e1965. [Google Scholar] [CrossRef]
  53. Kim, S.-E.; Kim, J.-W.; Jee, Y.-S. Relationship between smartphone addiction and physical activity in Chinese international students in Korea. J. Behav. Addict. 2015, 4, 200–205. [Google Scholar] [CrossRef]
  54. Mauskopf, S.S.; O’Leary, A.K.; Banihashemi, A.; Weiner, M.; Cookston, J.T. Divorce and eating behaviors: A 5-day within-subject study of preadolescent obesity risk. Child. Obes. 2015, 11, 122–129. [Google Scholar] [CrossRef]
  55. van den Berk-Clark, C.; Secrest, S.; Walls, J.; Hallberg, E.; Lustman, P.J.; Schneider, F.D.; Scherrer, J.F. Association between posttraumatic stress disorder and lack of exercise, poor diet, obesity, and co-occuring smoking: A systematic review and meta-analysis. Health Psychol. 2018, 37, 407. [Google Scholar] [CrossRef] [PubMed]
  56. Wang, Y.; Wang, L.; Xue, H.; Qu, W. A review of the growth of the fast food industry in China and its potential impact on obesity. Int. J. Environ. Res. Public Health 2016, 13, 1112. [Google Scholar] [CrossRef] [PubMed]
Figure 1. What-If analysis: this is the initial state of the predicted data we extracted from the GBN analysis. From this canvas, we are able to manipulate certain parameters to find out what effect this has on the relationship with other nodes, especially with the dependent variable node obesity.
Figure 1. What-If analysis: this is the initial state of the predicted data we extracted from the GBN analysis. From this canvas, we are able to manipulate certain parameters to find out what effect this has on the relationship with other nodes, especially with the dependent variable node obesity.
Ijerph 16 04684 g001
Figure 2. What-If analysis when Scenario 1 is applied: note that the grey variable represents the variable that has been manipulated and, thus, through this, we can try to understand what effect this has on the other relationships in this What-If analysis graph, including the effect it has on the final outcome obesity. The graphs that follow on from this one have the same methodological approach.
Figure 2. What-If analysis when Scenario 1 is applied: note that the grey variable represents the variable that has been manipulated and, thus, through this, we can try to understand what effect this has on the other relationships in this What-If analysis graph, including the effect it has on the final outcome obesity. The graphs that follow on from this one have the same methodological approach.
Ijerph 16 04684 g002
Figure 3. What-If analysis: when scenario 2 is applied.
Figure 3. What-If analysis: when scenario 2 is applied.
Ijerph 16 04684 g003
Figure 4. What-If analysis: when scenario 3 is applied.
Figure 4. What-If analysis: when scenario 3 is applied.
Ijerph 16 04684 g004
Figure 5. What-If analysis: when scenario 4 is applied.
Figure 5. What-If analysis: when scenario 4 is applied.
Ijerph 16 04684 g005
Figure 6. What-If analysis: when scenario 5 is applied.
Figure 6. What-If analysis: when scenario 5 is applied.
Ijerph 16 04684 g006
Figure 7. What-If analysis: when scenario 6 is applied.
Figure 7. What-If analysis: when scenario 6 is applied.
Ijerph 16 04684 g007
Figure 8. What-If analysis: when scenario 7 is applied.
Figure 8. What-If analysis: when scenario 7 is applied.
Ijerph 16 04684 g008
Table 1. Growth chart for children and adolescents in 2007.
Table 1. Growth chart for children and adolescents in 2007.
AgeGenderBMI Percentile
5th85th95th
12Boy15.3523.3226.35
Girl15.2022.2224.77
13Boy15.8223.9327.02
Girl15.7122.8325.38
14Boy16.3224.4027.48
Girl16.2523.3125.83
15Boy16.8324.7427.77
Girl16.7823.6726.11
16Boy17.3324.9527.89
Girl17.2723.8926.24
17Boy17.8025.0827.89
Girl17.6823.9926.24
18Boy18.2025.1827.85
Girl17.9623.9826.15
Table 2. Performance evaluation of classifiers.
Table 2. Performance evaluation of classifiers.
AccuracyF-MeasureAUC
GBN-MB53.703%0.5350.758
GBN52.579%0.5230.743
LR46.689%0.4360.688
DT46.779%0.4670.632
SVM45.431%0.1340.661
NN50.535%0.4950.723
NB45.627%0.4510.674
BA52.588%0.5210.742
RSS51.776%0.4890.736
RF52.570%0.5180.743
Table 3. Sensitivity of ‘Obesity_Level’ to findings at other nodes.
Table 3. Sensitivity of ‘Obesity_Level’ to findings at other nodes.
NodesMutual InformationEntropy (%)Variance of Beliefs
Obesity_Level1.809761000.4895308
Pocket_Money (KRW)0.2219612.30.0246610
Sleeping_Quality0.033981.890.0025873
Sitting_Time_Study (min)0.024941.380.0027259
Academic_Performance0.024231.350.0014365
Wealth0.024131.340.0017131
Pressure0.014200.7660.0010950
Education_Mother0.007660.4250.0007854
Smartphone_Service0.006450.3570.0006391
Smartphone_Time (min)0.004780.2830.0001776
Education_Father0.003960.2210.0003561
Table 4. What-If analysis scenario of factors affecting adolescent obesity.
Table 4. What-If analysis scenario of factors affecting adolescent obesity.
What-If Analysis ScenarioObesity Result (%)
Scenario 1Select the middle level of Pocket_Money22.8 → 59.7
Scenario 2Set Sleeping_Quality to ‘Very low’22.8 → 29.9
Scenario 3Maximize Sitting_Time_Study22.8 → 31.9
Scenario 4Select the middle level of Pocket_Money and set Wealth to “Low”22.8 → 74.7
Scenario 5Maximize Sitting_Time_Study and select the middle level of Academic_Performance22.8 → 35.3
Scenario 6Maximize Smartphone _Time & set Smartphone_Service to “Study”22.8 → 32.8
Scenario 7Maximize Education_Father and minimize Education_Mother22.8 → 32.8
Note: obesity result (%) stands for the probability of causing obesity that was calculated by our General Bayesian Network (GBN) models.

Share and Cite

MDPI and ACS Style

Kim, C.; Costello, F.J.; Lee, K.C.; Li, Y.; Li, C. Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis. Int. J. Environ. Res. Public Health 2019, 16, 4684. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph16234684

AMA Style

Kim C, Costello FJ, Lee KC, Li Y, Li C. Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis. International Journal of Environmental Research and Public Health. 2019; 16(23):4684. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph16234684

Chicago/Turabian Style

Kim, Cheong, Francis Joseph Costello, Kun Chang Lee, Yuan Li, and Chenyao Li. 2019. "Predicting Factors Affecting Adolescent Obesity Using General Bayesian Network and What-If Analysis" International Journal of Environmental Research and Public Health 16, no. 23: 4684. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph16234684

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop