Computers

15 pages, 4109 KiB

Open Access Article

How Machine Learning Classification Accuracy Changes in a Happiness Dataset with Different Demographic Groups

by Colm Sweeney, Edel Ennis, Maurice Mulvenna, Raymond Bond and Siobhan O’Neill

Computers 2022, 11(5), 83; https://0-doi-org.brum.beds.ac.uk/10.3390/computers11050083 - 23 May 2022


This study explores how machine learning classification accuracy changes across demographic groups. HappyDB is a dataset of over 100,000 happy statements with accompanying demographic information: marital status, gender, age, and parenthood status. Using the happiness category field, we test different types of machine learning classifiers to predict which category of happiness each statement belongs to, for example, whether it indicates happiness relating to achievement or affection. The tests were initially conducted with three distinct classifiers. The best-performing model was a convolutional neural network (CNN), a deep learning algorithm, which achieved an F1 score of 0.897 on the complete dataset. This model was then used as the main classifier to analyze the results further and to establish any variation in performance across demographic groups. We found that prediction accuracy within this dataset declined with age, with the exception of the single parent subgroup. The results also showed improved performance for the married and parent subgroups and lower performance for the non-parent and unmarried subgroups, even when investigating a balanced sample.
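The per-group comparison described in the abstract can be illustrated with a minimal sketch. It assumes predictions are already available as records holding a true category, a predicted category, and a demographic field. The function names `macro_f1` and `f1_by_group`, the record keys, and the four toy records are hypothetical and invented for illustration; the abstract does not state which F1 averaging the authors used, so macro averaging here is an assumption.

```python
from collections import defaultdict


def macro_f1(y_true, y_pred):
    """Macro-averaged F1 over every label seen in y_true or y_pred."""
    labels = set(y_true) | set(y_pred)
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the label never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


def f1_by_group(records, group_key):
    """Split records by a demographic field and score each subgroup."""
    grouped = defaultdict(lambda: ([], []))
    for rec in records:
        true_labels, pred_labels = grouped[rec[group_key]]
        true_labels.append(rec["true"])
        pred_labels.append(rec["pred"])
    return {group: macro_f1(t, p) for group, (t, p) in grouped.items()}


# Toy records, invented for illustration only (not HappyDB data).
records = [
    {"marital": "married", "true": "affection", "pred": "affection"},
    {"marital": "married", "true": "achievement", "pred": "achievement"},
    {"marital": "single", "true": "affection", "pred": "achievement"},
    {"marital": "single", "true": "achievement", "pred": "achievement"},
]

scores = f1_by_group(records, "marital")
for group, score in sorted(scores.items()):
    print(f"{group}: {score:.3f}")
```

Scoring each subgroup separately, rather than reporting one pooled figure, is what makes differences such as the reported decline with age visible.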

(This article belongs to the Special Issue Advances of Machine and Deep Learning in the Health Domain)

Computers, Volume 11, Issue 5 (May 2022) – 26 articles
