Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods

Shin, Jungpil; Maniruzzaman, Md.; Uchida, Yuta; Hasan, Md. Al Mehedi; Megumi, Akiko; Suzuki, Akiko; Yasumura, Akira

doi:10.3390/app12105256

Open AccessArticle

Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods

¹

School of Computer Science and Engineering, The University of Aizu, Aizuwakamatsu 965-8580, Fukushima, Japan

²

Graduate School of Social and Cultural Sciences, Kumamoto University, 2-40-1 Kurokami Chuo-ku, Kumamoto City 860-8555, Kumamoto, Japan

³

Faculty of Humanities and Social Sciences, Kumamoto University, 2-40-1 Kurokami Chuo-ku, Kumamoto City 860-8555, Kumamoto, Japan

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(10), 5256; https://0-doi-org.brum.beds.ac.uk/10.3390/app12105256

Submission received: 19 April 2022 / Revised: 16 May 2022 / Accepted: 20 May 2022 / Published: 23 May 2022

(This article belongs to the Special Issue Human and Artificial Intelligence)

Download

Browse Figures

Versions Notes

Abstract

:

The classification of different age groups, such as adult and child, based on handwriting is very important due to its various applications in many different fields. In forensics, handwriting classification helps investigators focus on a certain category of writers. This paper aimed to propose a machine-learning (ML)-based approach for automatically classifying people as adults or children based on their handwritten data. This study utilized two types of handwritten databases: handwritten text and handwritten pattern, which were collected using a pen tablet. The handwritten text database had 57 subjects (adult: 26 vs. child: 31). Each subject (adult or child) wrote the same 30 words using Japanese hiragana characters. The handwritten pattern database had 81 subjects (adult: 42 and child: 39). Each subject (adult or child) drew four different lines as zigzag lines (trace condition and predict condition), and periodic lines (trace condition and predict condition) and repeated these line tasks three times. Handwriting classification of adult and child is performed in three steps: (i) feature extraction; (ii) feature selection; and (iii) classification. We extracted 30 features from both handwritten text and handwritten pattern datasets. The most efficient features were selected using sequential forward floating selection (SFFS) method and the optimal parameters were selected. Then two ML-based approaches, namely, support vector machine (SVM) and random forest (RF) were applied to classify adult and child. Our findings showed that RF produced up to 93.5% accuracy for handwritten text and 89.8% accuracy for handwritten pattern databases. We hope that this study will provide the evidence of the possibility of classifying adult and child based on handwriting text and handwriting pattern data.

Keywords:

handwritten text; handwritten pattern; adult and child classification; machine learning; sequential forward floating selection approach

1. Introduction

In recent years, the field of handwriting has attracted interest from various aspects, such as biometrics [1] and the medical field [2]. In addition, handwritten characters can be obtained from a variety of sources such as paper documents, images, touch screens, and other devices. This makes the data easy to collect and suitable for classification. Furthermore, since handwriting is something that everyone uses every day in school, it is a method that is less stressful for people. There are few studies on handwriting classification for adults and children, and most of the studies are on the classification of face recognition [3], age groups [4], age, gender, and nationality [5], gender [6,7], gender and handedness [8], detection of alcohol [9], and Parkinson’s disease (PD) [10] based on handwriting images.

There are two types of handwriting data: offline and online. The input data collected using a scanning machine are called “offline”, whereas input data obtained using a pen tip are called “online” [11]. In our research work, we used the online-based handwritten database. Moreover, a single writer’s handwriting may be unique or differ slightly, but the handwriting of a child and adult must always be different. Most forensic handwriting analysis is based on the inspection of specific character shapes, character ligatures, size, pen lift, pen pressure, speed, letter spacing, etc., to identify a suspected person. Age group detection will be a great solution before detecting the actual suspected person in forensic analysis. It will give additional evidence about the suspected person’s age. Currently, there are many applications of handwritten recognition, for example, signature authentication used in industrial applications [2], authenticating of criminal investigations in a court of justice [12,13], document examinations [14], and so on. The most difficult aspects of handwriting identification are distortions and pattern variations; feature extraction is of supreme importance. Handwritten forensic analysis or handwriting recognition using machine learning (ML) algorithms can be a great solution to classify adults and children based on their handwritten text and handwritten pattern.

Ahmad et al. (2004) proposed support vector machine (SVM) with some kernels for online handwritten recognition [15]. They showed that at the character level, the SVM recognition rate was dramatically better due to the use of maximizing boundaries in the decision function. The only problem with this algorithm was storing large support vector for a huge training character that requires a larger memory size. Babu et al. (2014) proposed k-nearest neighbors (k-NN) for recognizing handwritten digits based on structural features, which does not require thinning operation and size normalization approach [16]. Ramzan et al. (2018) implemented neural networks (NN) and their variants to recognize handwritten digits. The survey details some existing techniques implemented for handwritten digit recognition (HWDR) being carried out [17]. Baldominos et al. (2019) [18] also proposed convolutional neural networks (CNNs) to distinguish previous work to recognize handwritten characters using some data augmentation from works using the original dataset out-of-the-box [19,20]. They provided the most extensive and updated survey of the MNIIST and EMNIST datasets and achieved the lowest error rate.

Poon et al. (2019) [2] applied logistic regression to predict PD based on handwritten recognition. They utilized the publicly available PD database and extracted secondary kinematic handwriting features from the dataset. It is being studied not only for personal identification but also in the medical field. The limitation of their proposed model was that they used small sample size of the dataset and lacked control in the study design. As for all the limitations of handwriting recognition, Japanese handwritten character recognition is complex due to the various types of writing styles, characters, and confusion among similar characters. One of the major causes of the inefficient classification of Japanese characters is a large number of letters. However, many methods have been developed to recognize Japanese handwriting as text images for several applications, but there are few studies on the classification of adults and children based on Japanese handwritten recognition.

Nisimura et al. (2004) [21] suggested a discriminating strategy based on statistical learning and extracted linguistic features from speech or voice data to classify adults and children. They applied SVM and found that it performed with better classification accuracy than the Gaussian mixture model [22]. The disadvantage of this strategy is that it has a trait in common with both labels. Makihara et al. (2010) [23] proposed a method to classify gender and age using video-based gait feature analysis with a large-scale multi-view gait database. They adopted the k-NN classifier to classify gender and age. They used three databases (HumanID, Soton, and CASIA) that contained over 100 subjects. These datasets have their particular limitations, such as the small view images in the HumanID dataset; also, single view images in the Soton dataset, and maximum subjects in the CASIA dataset included in the 20’s or 30’s. Faghel-Soubeyrand classified adult and child based on faces [24]. In this study, we propose a new approach for the classification of adults and children based on their handwritten text and pattern recognition. Our proposed method can achieve more than 89% classification accuracy, implying that classification accuracy with handwritten characters can be expected.

The organization of this paper is as follows: Section 2 presents materials and methods, including proposed ML-based framework; description of datasets, feature extraction, feature selection, classifiers along with their performance evaluation metrics are discussed in this section. The experimental results and discussion are discussed in Section 3. Finally, the conclusion is discussed in Section 4.

2. Materials and Methods

In this section, we summarize the proposed ML-based framework. Next, two databases used in this research work are described. We also describe feature extraction, feature selection, and two classification methods along their performance evaluation metrics in this section.

2.1. Proposed ML-Based Framework

The goal of this work is to propose an ML-based model for predicting adult and child based on their handwritten texts and handwritten patterns. The proposed ML-based framework is presented in Figure 1. First, we divide the handwriting (text and pattern) dataset into two phases: the training phase and the testing phase. We take 80% of the dataset in the training phase and the remaining 20% of the dataset for the test phase. The second step is to preprocess the handwriting data. After preprocessing handwriting data, we extract 30 features and then select an optimal subset of the features using sequential forward floating selection (SFFS). We applied two ML-based algorithms, SVM and RF, for the classification of adult and child. We tuned the hyperparameters of the classifiers (SVM and RF) using a grid search method and trained SVM and RF-based classifiers with five-fold cross-validation protocol. After training, classifiers (SVM and RF) are used in the testing phase for the classification of adult and child. Accuracy, recall, precision, f1-score, and area under the curve (AUC) are used to evaluate the performance of the classifiers.

2.2. Dataset

2.2.1. Device for Data Collection

Handwriting data were recorded using a pen tablet system (Cintiq Pro 16, Wacom Co., Ltd., Saitama, Japan). The tablet was connected to a laptop PC running Windows 10. Figure 2 illustrates the coordinates of the parameters generated by the pen tablet.The screen size of the pen tablet was 15.6 inches, and the resolution size was

2560 \times 1440

pixels.

2.2.2. Handwritten Text

We developed a new dataset to evaluate our proposed method where adult and child handwriting-based text data were collected using a pen tablet. A total of 57 participants were taken for this work, consisting of 26 adults (aged 19–59 years) with handwriting and 31 children (aged 12–13 years). Each subject (child or adult) was asked to write the same 30 words (tasks) using hiragana characters only on the pen tablet using a dedicated stylus pen. Each word contains a minimum of 2 characters and a maximum of 7 characters. A summary of the handwritten text dataset is described in Table 1.

2.2.3. Handwritten Pattern

Handwriting-based pattern data were also collected from 81 subjects using a pen tablet system. The dataset had 39 children and 42 adults. In this study, we adopted two patterns. One was drawing a continuous zigzag line, essentially a continuous set of triangles without a base. Another was drawing a continuous periodic line pattern (PL) that was repeated squares and triangles sequentially without a base. The trace and predict conditions were used for each pattern. Each subject was asked to draw these four patterns on the pen tablet using a dedicated stylus pen and each drawing pattern was repeated 3 times. The traced over the sample zigzag lines are presented in Figure 3a, and the data are written on a blank sheet of paper after memorizing the sample. The traced over the sample PL lines are also presented in Figure 3b, and the data were derived from memorizing the sample and writing it on a blank sheet of paper. The data were collected by separating the zigzag line and the PL line, taking data for 30 s, resting for 20 s, taking data again for 30 s, and resting for 20 s, and so on, until six data were collected. The reason for the intervals was to let the brain rest. A summary of the handwritten pattern dataset is described in Table 2.

2.3. Feature Extraction

The handwriting data contained six pieces of information, including the time of writing, pen pressure, x-coordinate and y-coordinate of the writing position, angle of the horizontal component of the pen, and angle of the vertical component of the pen. To classify adults and children based on their handwriting, 30 feature parameters are evaluated for each task. These feature parameters only required the localization of primary features of handwritten text images, namely, the width, height, speed, peak, different types of grip angle, and various types of pressure, which are given in detail in Table 3.

2.4. Feature Normalization

Data normalization is a technique that minimizes redundancy and improves the efficiency of the data. Mathematically, it is defined as follows:

z = \frac{X - μ}{σ}

(1)

where X is the original feature vector;

μ

is the mean of that feature vector, and

σ

is its standard deviation. The value of z lies between 0 to 1.

2.5. Feature Selection

Feature selection is the process of removing irrelevant features to improve the efficiency of the model. We have used SFFS for feature selection, which is an extension of sequential forward selection (SFS), to reduce the initial d-dimensional feature space into a k-dimensional feature subspace

(k < d)

[25]. Let

Y = {y_{1}, y_{2}, \dots, y_{d}}

be a set of all features and

X_{k} = {x_{j} | j = 1, 2, \dots, k; x_{j} \in Y}

, where

k \in (0, 1,, 2, \dots, d)

and

X_{k}

is a subset of Y. We start the algorithm with

X_{o} = \emptyset, k = 0

. The steps of SFFS are described as follows:

Step 1:: $x^{+}$ = argmax $J (X_{k} + x)$ , where $x \in Y - X_{k}, J$ is an evaluation index and $x^{+}$ is the feature with the highest evaluation when it chooses.
Step 2:: $X_{k + 1} = X_{k} + x^{+}$ . The feature with the highest evaluation by selecting is used.
Step 3:: $k = k + 1$ .
Step 4:: Step 1 to Step 3 is repeatedly iterating. Then, $x^{+}$ when k reaches the specified number which is the set of the most appropriate features obtained.
SFFS is performed up to Step 3 of SFS, and a process for searching for features to be deleted is added. At first, Step 1 to Step 4 are performed starting from $X_{0} = \emptyset, k = 0$ , as in the SFS.
Step 5:: $x^{-}$ = argmax $J (X_{k} - x)$ , where $x \in X_{k}$ and $x^{-}$ is the feature with the best performance when the feature is deleted.
Step 6:: If $J (X_{k} - x) > J (X_{k}) :$
$X_{k - 1} = X_{k} - x^{-}$
$k = k - 1$
Go to Step 1.

In Step 1, we capture the features that best improve the performance of the feature subset from the feature space. Then, we proceed to Step 2. In Step 2, remove features only if they improve the performance of the resulting subset. In this study, the Sequential Feature Selector in mlxtend library was used and implemented [26].

2.6. Classifiers

2.6.1. Support Vector Machine

Support vector machine (SVM) [27,28] is supervised learning that is used for both classification and regression problems. In this study, we implemented SVM in Scikit-learn support vector classification (SVC) [29]. SVM is classified on the largest hyperplane up to the nearest training data point of the class. A highly accurate model can be obtained with a small amount of data, and the accuracy of identification can be kept even when the number of features increases. The main objective of SVM is to find the hyperplane in the feature space that can easily separate the classes, which needs to solve the following constraint problem:

\max α \sum_{i = 1}^{n} α_{i} - \frac{1}{2} \sum_{i = 1}^{n} \sum_{j = 1}^{n} α_{i} α_{j} y_{i} y_{j} K (x_{i}, x_{j})

(2)

Subject to

\sum_{i = 1}^{n} {y_{i}}^{T} α_{i} = 1, 0 \leq α_{i} \leq C, i = 1, \dots, n \forall i = 1, 2, 3, \dots, n

(3)

The final discriminate function takes the following form:

f (x) = \sum_{i = 1}^{n} α_{i} K (x_{i}, x_{j}) + b

(4)

where, b is the bias term.

2.6.2. Random Forest

Random forest (RF) [30] is one type of ensemble learning used for classification, regression, etc. It is a model in which decision trees are created in parallel and predictions are made by calculating the majority vote of the output results of each learning machine. Random learning enables fast learning and identification even for high-dimensional features, and the random selection of training data makes it strong against noise. Therefore, it is possible to build an overall good model. In this study, we also implemented RF with random forest classifier in Scikit-learn [29].

2.7. Performance Evaluation Metrics

To evaluate the performance of the classification model, we adopted five evaluation metrics: classification accuracy (ACC), recall (Rec), precision (Pre), f1-score, and AUC. The evaluation metrics of accuracy, recall, precision, and f1-score are computed based on true positive (

t_{p})

, false positive

(f_{p})

, true negative

(t_{n})

, and false negative

(f_{n})

, which are briefly explained as follows:

ACC (%) = \frac{t_{p} + t_{n}}{t_{p} + f_{p} + t_{n} + f_{n}} \times 100

(5)

Rec (%) = \frac{t_{p}}{t_{p} + f_{n}} \times 100

(6)

Pre (%) = \frac{t_{p}}{t_{p} + f_{p}} \times 100

(7)

f 1 - score (%) = 2 \times \frac{(Pre \times Rec)}{Pre + Rec} \times 100

(8)

3. Experimental Results and Discussion

3.1. Experimental Setup

To perform the classification of adult and child, 80% of the dataset was utilized for training sets and 20% of the dataset for testing sets. For all statistical analysis, Python version 3.9 and Scikit-learn version 1.0.2 were used. We used Windows 10 21H1 (build 19043.1151) 64-bit with an Intel (R) Core (TM) i5-10400 processor and 16 GB of RAM.

3.2. Baseline Characteristics of Adult and Child

The baseline characteristics of adults and children for the handwritten text and pattern datasets are presented in Table 4. For the handwritten text dataset, the prevalence of adult and child was 45.6% and 54.4%. Among them, 42.7% and 59.3% of adult and child were female. The average ages of adult and child for the handwritten text dataset were

27.3 \pm 10.5

and

12.5 \pm 0.3

years.

For the handwritten pattern dataset, the average ages of adult and children were

23.9 \pm 4.9

and

11.8 \pm 1.6

years. The overall prevalence of females was 59.3%. Approximately 64.6% and 35.4% of adult and child were female. It was observed that age and gender (except gender for handwritten text data) were significantly associated with adult and child for both handwritten text and pattern dataset (p-value < 0.05).

3.3. Hyperparameter Tuning

For the classification tasks, we set the following hyperparameters for SVM as cost (C) = [0.0001, 0.001, 0.01, 0.1, 1, 10, 100, 1000]; kernel = [“rbf”, “linear”, “poly”, “sigmoid”]; and gamma: [0.00001, 0.0001, 0.001, 0.01, 0.1, 1]. We also set the hyperparameters for RF as max_depth = [2, 3, 5, 10], n_estimators = [50, 100, 200, 300, 400], min_samples_split = [2, 3, 10], min_samples_leaf = [1, 3, 10], bootstrap = [True, False], and criterion = [“gini”, “entropy”]. We implemented grid search algorithms to tune these hyperparameters. We choose the hyperparameters that will provide the highest classification accuracy.

3.4. Experiment-1: Evaluation for Handwritten Text Dataset

In this experiment, we used different types of handwritten texts and then extracted various types of features from each image or task. We applied SVM and RF classifiers to classify adult and child and calculated the classification accuracy. We used 30 hiragana words and extracted 30 features which are clearly explained in Table 3. Table 5 shows the performance scores of SVM and RF for better features combination of handwritten text dataset. It was observed that SVM with RBF kernel produced the classification accuracy of 87.7% for the combination of 15 selected features out of 30 features. Moreover, SVM also produced 92.4% recall, 85.9% precision, 89.1% f1-score, and 0.919 AUC for the selected 15 features, whereas RF classifier achieved an excellent classification accuracy of 93.5% along with 95.7% recall, 92.2% precision, 93.9% f1-score, and 0.983 AUC, respectively, for the combination of 18 selected features. Therefore, RF achieved more outstanding performance than SVM.

We observed that 15 and 18 features were selected by SFFS with SVM and RF classifiers. A total of 11 common features was extracted from those two methods, which are shown in Figure 4, and the listed selected features are presented in Table 6. These 11 common features were used as input features and then we applied SVM and RF classifiers to distinguish adults from children.

The performance scores of SVM and RF classifiers for 11 common features are shown in Table 7. It was observed that SVM with RBF provided 87.4% accuracy, 90.8% recall, 86.6% precision, 88.7% f1-score, and 0.947 AUC, respectively, whereas RF gave 91.5% accuracy, 93.0% recall, 91.5% precision, 92.3% f1-score, and 0.967 AUC, receptively. Finally, we may conclude that RF had more outstanding performance scores than SVM for the prediction of the adult and child for handwritten text dataset.

3.5. Experiment-2: Evaluation for Handwritten Pattern Dataset

To evaluate our proposed model, we used a handwritten pattern dataset and obtained a classification accuracy of up to 89.8%. In this section, we performed two experiments. Firstly, the best combination of the features set was identified using SFFS-based RF and SVM classifiers. We chose the feature combination at which the classification model provides the highest classification accuracy. The classification accuracy of RF and SVM for the handwritten pattern dataset is presented in Table 8. For the trace of zigzag lines, RF produced 83.3% classification accuracy for the combination of 19 selected features, whereas SVM produced 71.4% accuracy for the combination of 26 selected features. For the prediction of the zigzag, the RF classifier obtained the highest classification accuracy of 85.7% for 13 combinations of feature sets and the prediction of the zigzag line, whereas SVM provided 75.5% classification accuracy for 3 selected features. For the prediction and trace of the PL line, RF achieved 73.5% classification accuracy for the combination of 24 selected features, whereas SVM achieved 79.6% accuracy for 7 selected features and 87.7% accuracy for 12 selected features. RF classifier provided a good classification accuracy of 85.6% for the combination of all handwritten patterns, 25 features, whereas 82.1% classification accuracy was provided by SVM for the combination of all 28 features. Therefore, RF achieved better classification accuracy (89.8%) than SVM for the prediction of PL line.

The second experiment was to take the common features from the two best combinations of feature sets and apply two classifiers for the prediction of adult and child. The number of selected common features was 18 features from the trace of zigzag line, 2 features from the prediction of zigzag line, 7 features from the trace of PL line, 9 features from the prediction of PL lines, and 23 features from all handwritten patterns (zigzag and PL lines), which are shown in Figure 5, and the corresponding list of selected common features is presented in Table 9.

The classification accuracies of RF and SVM for these common features are presented in Table 10. It was observed that the RF classifier provided a higher classification accuracy of 79.5%, 73.4%, 83.6%, and 89.8% for the trace and prediction of the zigzag line than SVM for the trace and prediction of the PL line, respectively. On the other hand, the SVM classifier provided 85.1% accuracy for all handwritten patterns, whereas RF classifier gave 84.1% accuracy.

The recall, precision, f1-score, and AUC of RF and SVM for common features of the handwritten dataset are presented in Table 11. It was observed that the RF classifier achieved comparatively better performance for all types of lines than SVM. RF classifier provided a higher recall of 87.0%, precision of 93.1%, f1-score of 90.0%, and AUC of 0.903 for the prediction of the PL line dataset, whereas SVM gave 83.8% recall, and 0.811 AUC, respectively. Table 11 shows that the highest performance scores are achieved by RF for four types of lines with all handwritten patterns. Finally, we can say that in our experiment, RF performed better than SVM.

3.6. Comparison of Our Proposed Method with the Existing Method

The comparison of the classification accuracy of our proposed method with the existing method in the literature is presented in Table 12. Guimaraes et al. (2017) [4] applied different ML algorithms such as multilayer perception (MLP), deep convolutional neural network (DCNN), decision tree (DT), RF, and SVM for the classification of adult and teenager age groups based on sentences. They collected 7000 sentences for the classification of age groups (teenager vs. adult). They showed that DCNN had a better performance and obtained 95.0% precision. Rizwan et al. (2021) [31] proposed a novel method for the classification of human age. They extracted features using interior angle formulation, anthropometric model, carnio-facial development, wrinkle detection, and heat maps. The best combination of feature sets was selected using SFS. They adopted CNN to classify human age and achieved 94.6% classification accuracy. Özkan and Turan (2018) [32] proposed a deep learning algorithm for the classification of people based on their age. They divided the people into 12 classes using age groups and collected 18,000 images. They took 10% of the images for testing and the rest of the images for training. They showed that the DL model can correctly classify people into different groups of age and achieved 78.5% classification accuracy.

Goshvarpour (2019) [33] proposed a novel Poincare feature set to classify age and gender based on ECG. They collected ECG data from 79 respondents. Among them, 37 were males aged 31.24 years and 42 were females aged 25.8 years. They applied SVM for the classification of age and gender and obtained the highest classification accuracy of 94.6%. Ilyas et al. (2020) [34] investigated a novel biometric method for the classification of human age. For classification, RF, SVM, linear regression (LR), ridge regression (RR), polynomial regression (PR), and ANN were used. They collected a total of 837 subjects aged 6–60 years to evaluate the proposed biometric system. They showed that RF produced the highest classification accuracy of 92.0%. Voice is also used for user authentication and identification. Voiceprints were used in various forensic approaches to classify age, gender, and language. Reade et al. (2015) [35] conducted a study for the classification of adult, child, and senior using face images dataset. They extracted features using HOG, local binary pattern, and active appearance model. They adopted k-NN, SVM, and GB algorithms for the classification of adult, child, and senior and achieved 82.0% classification accuracy. Tin (2012) [36] applied PCA for the classification of age using face image and produced the highest classification accuracy of 92.5%. Our proposed SFFS with RF (SFFS-RF) model produced higher accuracy compared to SFFS with SVM (SFFS-SVM) to classify adult and child based on their handwritten text and handwritten pattern.

4. Conclusions

The purpose of this study was to clarify changes in the development of handwritten text and pattern between adult and child. Online handwritten text and pattern datasets were collected using a pen tablet system. We utilized SFFS for feature selection and adopted two classification algorithms, RF and SVM, for the classification of adult and child. We selected the common features from SFFS-RF and SFFS-SVM classifiers and then also applied RF and SVM classifiers for the classification of adult and child. For the handwritten text dataset, our proposed system SFFS with RF classifier produced 93.5% accuracy for 18 features, and 89.8% accuracy for 9 features in the handwritten pattern dataset. After identifying the common features, SFFS-RF also produced 91.5% and 87.7% classification accuracy for handwritten text and handwritten pattern datasets. We hope that this study will provide evidence of the possibility of classifying adults and child based on their handwritten text and handwritten pattern data. If we can find out the age range between adult and child, that will help our model to produce an estimated performance accuracy.

Author Contributions

Conceptualization, J.S., M.A.M.H. and A.Y.; methodology, M.M. and M.A.M.H.; software, M.M., Y.U. and M.A.M.H.; validation, J.S. and M.A.M.H.; formal analysis, M.M., Y.U. and M.A.M.H.; investigation, J.S. and M.A.M.H.; resources, J.S., A.M., A.S. and A.Y.; data curation and collection, A.M., A.S. and A.Y.; writing—original draft preparation, J.S., M.M. and M.A.M.H.; writing—review and editing, M.M. and M.A.M.H.; visualization, M.M., Y.U. and M.A.M.H.; supervision, J.S. and M.A.M.H.; project administration, J.S.; funding acquisition, J.S. and A.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Japan Society for the Promotion of Science Grants-in-Aids for Scientific Research (KAKENHI), Japan (Grant Numbers JP20K11892, which was awarded to Jungpil Shin and JP21H00891, which was awarded to Akira Yasumura).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by The Ethics Committee of the Graduate School of Social and Cultural Sciences of Kumamoto University (approval number: 45, approved on: 25 May 2021).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest for this research.

References

Faundez-Zanuy, M.; Fierrez, J.; Ferrer, M.A.; Diaz, M.; Tolosana, R.; Plamondon, R. Handwriting biometrics: Applications and future trends in e-security and e-health. Cognit. Comput. 2020, 12, 940–953. [Google Scholar] [CrossRef]
Poon, C.; Gorji, N.; Latt, M.; Tsoi, K.; Choi, B.; Loy, C.; Poon, S. Derivation and analysis of dynamic handwriting features as clinical markers of Parkinson’s disease. In Proceedings of the 52nd Hawaii International Conference on System Sciences, Maui, HI, USA, 8–11 January 2019; pp. 3721–3730. [Google Scholar]
Singhal, P.; Srivastava, P.K.; Tiwari, A.K.; Shukla, R.K. A Survey: Approaches to facial detection and recognition with machine learning techniques. In Proceedings of the Second Doctoral Symposium on Computational Intelligence, Lucknow, India, 6 March 2021; Springer: Singapore, 2022; pp. 103–125. [Google Scholar]
Guimaraes, R.G.; Rosa, R.L.; De Gaetano, D.; Rodriguez, D.Z.; Bressan, G. Age groups classification in social network using deep learning. IEEE Access 2017, 5, 10805–10816. [Google Scholar] [CrossRef]
Al Maadeed, S.; Hassaine, A. Automatic prediction of age, gender, and nationality in offline handwriting. EURASIP J. Image Video Process. 2014, 2014, 10. [Google Scholar] [CrossRef] [Green Version]
Illouz, E.; Netanyahu, N.S. Handwriting-based gender classification using end-to-end deep neural networks. In Proceedings of the International Conference on Artificial Neural Networks, Rhodes, Greece, 4–7 October 2018; pp. 613–621. [Google Scholar]
Maken, P.; Gupta, A. A method for automatic classification of gender based on text-independent handwriting. Multimed. Tools Appl. 2021, 80, 24573–24602. [Google Scholar] [CrossRef]
Morera, Á.; Sánchez, Á.; Vélez, J.F.; Moreno, A.B. Gender and handedness prediction from offline handwriting using convolutional neural networks. Complexity 2018, 2018, 3891624. [Google Scholar] [CrossRef] [Green Version]
Shin, J.; Okuyama, T. Detection of alcohol intoxication via online handwritten signature verification. Pattern Recognit. Lett. 2014, 35, 101–104. [Google Scholar] [CrossRef]
Gupta, U.; Bansal, H.; Joshi, D. An improved sex-specific and age-dependent classification model for Parkinson’s diagnosis using handwriting measurement. Comput. Methods Programs Biomed. 2020, 189, 105305–105327. [Google Scholar] [CrossRef]
Wong, L.; Loh, W. Segregating Offline and Online handwriting for conditional classification analysis. Proc. IOP Conf. Ser. Mater. Sci. Eng. 2019, 530, 012058–012065. [Google Scholar] [CrossRef]
Lewis, J.A. Forensic Document Examination: Fundamentals and Current Trends; Elsevier: Amsterdam, The Netherlands, 2014. [Google Scholar]
Papaodysseus, C.; Rousopoulos, P.; Giannopoulos, F.; Zannos, S.; Arabadjis, D.; Panagopoulos, M.; Kalfa, E.; Blackwell, C.; Tracy, S. Identifying the writer of ancient inscriptions and Byzantine codices. A novel approach. Comput. Vis. Image. Underst. 2014, 121, 57–73. [Google Scholar] [CrossRef]
Ahmed, Z.E. Forensic documents examination. Asian J. Forensic Sci. 2022, 1, 2–3. [Google Scholar]
Ahmad, A.R.; Khalia, M.; Viard-Gaudin, C.; Poisson, E. Online handwriting recognition using support vector machine. In Proceedings of the 2004 IEEE Region 10 Conference TENCON 2004, Chiang Mai, Thailand, 24 November 2004; pp. 311–314. [Google Scholar]
Babu, U.R.; Venkateswarlu, Y.; Chintha, A.K. Handwritten digit recognition using K-nearest neighbour classifier. In Proceedings of the 2014 World Congress on Computing and Communication Technologies, Trichirappalli, India, 27 February–1 March 2014; pp. 60–65. [Google Scholar]
Ramzan, M.; Khan, H.U.; Akhtar, W.; Zamir, A.; Awan, S.M.; Ilyas, M.; Mahmood, A. A survey on using neural network based algorithms for hand written digit recognition. Environment 2018, 9, 519–528. [Google Scholar] [CrossRef]
Baldominos, A.; Saez, Y.; Isasi, P. A survey of handwritten character recognition with mnist and emnist. Appl. Sci. 2019, 9, 3169. [Google Scholar] [CrossRef] [Green Version]
Ashiquzzaman, A.; Tushar, A.K.; Rahman, A.; Mohsin, F. An efficient recognition method for handwritten arabic numerals using CNN with data augmentation and dropout. In Data Management, Analytics and Innovation; Springer: Berlin/Heidelberg, Germany, 2019; pp. 299–309. [Google Scholar]
Altwaijry, N.; Al-Turaiki, I. Arabic handwriting recognition system using convolutional neural network. Neural. Comput. Appl. 2021, 33, 2249–2261. [Google Scholar] [CrossRef]
Nishimura, R.; Lee, A.; Saruwatari, H.; Shikano, K. Public speech-oriented guidance system with adult and child discrimination capability. In Proceedings of the 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, QC, Canada, 17–21 May 2004; Volume 1. [Google Scholar]
Reynolds, D.; Rose, R. RC: Robust Text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. Speech Audio Process. 1995, 3, 72–83. [Google Scholar] [CrossRef] [Green Version]
Makihara, Y.; Mannami, H.; Yagi, Y. Gait analysis of gender and age using a large-scale multi-view gait database. In Proceedings of the 10th Asian Conference on Computer Vision, Queenstown, New Zealand, 8–12 November 2010; pp. 440–451. [Google Scholar]
Faghel-Soubeyrand, S.; Kloess, J.A.; Gosselin, F.; Charest, I.; Woodhams, J. Diagnostic features for human categorisation of adult and child faces. Front. Psychol. 2021, 12, 775338. [Google Scholar] [CrossRef] [PubMed]
Pudil, P.; Novovičová, J.; Kittler, J. Floating search methods in feature selection. Pattern Recognit. Lett. 1994, 15, 1119–1125. [Google Scholar] [CrossRef]
Raschka, S. MLxtend: Providing machine learning and data science utilities and extensions to Python’s scientific computing stack. J. Open Source Softw. 2018, 3, 638–640. [Google Scholar] [CrossRef]
Jan, S.U.; Lee, Y.D.; Shin, J.; Koo, I. Sensor fault classification based on support vector machine and statistical time-domain features. IEEE Access 2017, 5, 8682–8690. [Google Scholar] [CrossRef]
Hasan, M.A.M.; Nasser, M.; Pal, B.; Ahmad, S. Support vector machine and random forest modeling for intrusion detection system (IDS). J. Intell. Learn. Syst. Appl. 2014, 2014, 45–52. [Google Scholar] [CrossRef] [Green Version]
Nelli, F. Machine Learning with scikit-learn. In Python Data Analytics; Springer: Berlin/Heidelberg, Germany, 2018; pp. 313–347. [Google Scholar]
Hasan, M.A.M.; Nasser, M.; Ahmad, S.; Molla, K.I. Feature selection for intrusion detection using random forest. J. Inform. Secur. 2016, 7, 129–140. [Google Scholar] [CrossRef] [Green Version]
Rizwan, S.A.; Jalal, A.; Gochoo, M.; Kim, K. Robust active shape model via hierarchical feature extraction with SFS-optimized convolution neural network for invariant human age classification. Electronics 2021, 10, 465. [Google Scholar] [CrossRef]
Özkan, İ.; Turan, B. Classification of different age groups of people by using deep learning. J. New Res. Sci. 2018, 7, 9–16. [Google Scholar]
Goshvarpour, A.; Goshvarpour, A. Gender and age classification using a new Poincare section-based feature set of ECG. Signal Image Video Process. 2019, 13, 531–539. [Google Scholar] [CrossRef]
Ilyas, M.; Othmani, A.; Naït-Ali, A. Auditory perception based system for age classification and estimation using dynamic frequency sound. Multimed. Tools Appl. 2020, 79, 21603–21626. [Google Scholar] [CrossRef]
Reade, S.; Viriri, S. Hybrid age estimation using facial images. In Proceedings of the 12th International Conference, ICIAR 2015, Niagara Falls, ON, Canada, 22–24 July 2015; pp. 239–246. [Google Scholar]
Tin, H.H.K. Subjective age prediction of face images using PCA. Int. J. Eng. Res. Appl. 2012, 2, 296–299. [Google Scholar] [CrossRef]

Figure 1. Proposed ML-based classification model.

Figure 2. Pen tablet device.

Figure 3. Handwritten pattern dataset for (a) zigzag line sample and painted; (b) PL line sample and painted.

Figure 4. Identification of common features from SFFS-RF and SFFS-SVM for handwritten text dataset.

Figure 5. Identification of common features from SFFS-RF and SFFS-SVM for different lines of handwritten pattern dataset: (a) Feature from trace of zigzag line, (b) feature from the prediction of zigzag line, (c) feature from trace of PL line, (d) feature from prediction of PL line, and (e) feature from all handwritten patterns.

Table 1. Summary of the handwritten text dataset.

Group Name	Age (Years)	No. of Subjects	No. of Words	Total Samples
Adult	19–59	26	30	780 (26 × 30)
Child	12–13	31	30	930 (31 × 30)

Table 2. Summary of the handwritten pattern dataset.

Group Name	Age (Years)	No. of Subjects	No. of Task	No. of Repeat	Total Samples
Adult	19–43	42	4	3	504 (42 × 4 ×3)
Child	8–13	39	4	3	468 (39 × 4 ×3)

Table 3. Extracted feature names and their description.

SN	Feature	Description
1	Width	Max (X)–Min (X)
2	Height	Max (Y)–Min (Y)
3	Length	The total length of the drawing
4	Velocity	(Length)/(total) drawing time
5	PIVH	The maximum speed recorded at any time point
6	PIVL	Minimum speed recorded at any time point (PMS > 0)
7	PIAH	Maximum acceleration recorded at any time point
8	PIAL	Minimum acceleration recorded at any time point (PMA > 0)
9	GripAngleMeanW	Mean of grip angle values for the entire drawing task (Horizontal)
10	GripAngleMeanL	Mean of grip angle values for the entire drawing task (Vertical)
11	GripAngleSDW	SD of grip angle values for the entire drawing task (Horizontal)
12	GripAngleSDL	SD of grip angle values for the entire drawing task (Vertical)
13	PressureMean	Mean of recorded pressure values for the entire task
14	PressureSD	SD of recorded pressure values for the entire task
15	PCAvgPos	Mean increase in pressure between two-time points
16	PCSDPos	SD of increase in pressure between two-time points
17	PCMax	The maximum increase in handwriting pressure between two-time points
18	PCAvgNeg	Mean decrease in pressure between two-time points
19	PCSDNeg	SD of decrease in pressure between two-time points
20	PCMin	Maximum reduction in handwriting pressure between two-time points
21	Error	Number of outliers and triangle square errors based on angles
22	PeakPresMean	Mean pressure at minima
23	ErrorStopTime	Mean stuck time at the starting minima point just before the error
24	AngleMean	Mean of angles at maxima and minima
25	AngleVar	The variance of angles at maxima and minima
26	ReglineSlope	The slope of the regression line
27	ReglineIntercept	The intercept of the regression line
28	LoopCount	Time spent writing divided by the number of peaks
29	AngleSpeed	Mean of velocities at the edge of the peaks and valleys
30	ErrorRate	(Error)/(All Peaks) Error rate

Table 4. Baseline characteristics of adult and child.

Variables	Overall	Adult	Child	p-Value ¹
Handwritten text dataset
Total, n (%)	57	26 (45.6)	31 (54.4)	–
Gender, Female, n (%)	27 (47.4)	11 (42.7)	16 (59.3)	0.483
Age (year), Mean ± SD	19.2 ± 10.2	27.3 ± 10.5	12.5 ± 0.3	0.001
Handwritten pattern dataset
Total, n (%)	81	42 (51.9)	39 (48.1)	–
Gender, Female, n (%)	48 (59.3)	31 (64.6)	17 (35.4)	0.0025
Age (year), Mean ± SD	17.8 ± 7.9	23.9 ± 4.9	11.8 ± 1.6	0.001

n is the total number of subjects; ¹ p-value is obtained from t-test for age and chi-square test for gender.

Table 5. Performance scores of SVM and RF for handwritten text dataset.

CT	# of Features	ACC	Rec	Prec	f1-Score	AUC
SVM	15	87.7	92.4	85.9	89.1	0.919
RF	18	93.5	95.7	92.2	93.9	0.983

CT: Classifier types.

Table 6. List of common features selected for handwritten text dataset.

SN	Feature Names	SN	Feature Names
1	Height	7	GripAngleSDW
2	Velocity	8	GripAngleSDL
3	PIVL	9	PressureMean
4	PIAL	10	PressureSD
5	GripAngleMeanW	11	PCMin
6	GripAngleMeanL	–	–

Table 7. Performance scores of SVM and RF for common features for handwritten text dataset.

CT	ACC	Rec	Prec	f1-Score	AUC
SVM	87.4	90.8	86.6	88.7	0.947
RF	91.5	93.0	91.5	92.3	0.967

Table 8. Classification accuracy (in %) of RF and SVM for handwritten pattern dataset.

Line Types	No. of Features	RF	No. of Features	SVM
Zigzag trace	19	83.7	26	71.4
Zigzag predict	13	85.7	3	75.5
PL trace	24	73.5	7	79.6
PL predict	9	89.8	12	87.7
All patterns	25	85.6	28	82.1

Table 9. List of common features of handwritten pattern dataset.

SN	ZigZag Trace	Zigzag Predict	PL Trace	PL Predict	All Patterns
1	Height	PIAH	Length	Velocity	Height
2	Length	AngleMean	PIVL	PIAH	Length
3	Velocity	–	PIAH	PIAL	Velocity
4	PIVH	–	PIAL	PCSDNeg	PIAH
5	PIAH	–	PeakpresMean	PeakpresMean	PIAL
6	PIAL	–	ReglineSlope	AngleMean	GripAngleMeanW
7	GripAngleMeanW	–	ReglineIntercept	AngleVar	GripAngleMeanL
8	GripAngleMeanL	–	–	ReglineIntercept	GripAngleSDW
9	PressureMean	–	–	ErrorRate	PressureMean
10	PCMax	–	–	–	PressureSD
11	Error	–	–	–	PCAvgPos
12	PeakpresMean	–	–	–	PCSDPos
13	ErrorStopTime	–	–	–	PCMax
14	AngleMean	–	–	–	PCAvgNeg
15	ReglineSlope	–	–	–	PCSDNeg
16	ReglineIntercept	–	–	–	PCMin
17	AngleSpeed	–	–	–	Error
18	ErrorRate	–	–	–	PeakpresMean
19	–	–	–	–	ErrorStopTime
20	–	–	–	–	AngleVar
21	–	–	–	–	ReglineSlope
22	–	–	–	–	ReglineIntercept
23	–	–	–	–	LoopCount

Table 10. Classification accuracy (in %) of RF and SVM for common features of handwritten pattern dataset.

Line Types	CF	RF	SVM
Zigzag trace	18	79.5	71.4
Zigzag predict	2	73.4	69.3
PL trace	7	83.6	79.5
PL predict	9	89.8	79.5
All patterns	25	84.1	85.1

CF: Common features.

Table 11. Performance scores of RF and SVM for common features of handwritten pattern dataset.

Line Types	CF	RF				SVM
Line Types	CF	Rec	Prec	f1-Score	AUC	Rec	Prec	f1-Score	AUC
Zigzag trace	18	80.6	86.2	83.3	0.870	67.7	84.0	75.0	0.820
Zigzag predict	2	83.8	76.4	80.0	0.784	77.4	75.0	76.1	0.732
PL trace	7	83.8	89.6	86.6	0.926	74.1	92.0	82.1	0.872
PL predict	3	87.0	93.1	90.0	0.903	83.8	83.8	83.8	0.811
All patterns	23	85.1	84.3	84.7	0.923	83.1	87.5	85.2	0.919

CF: Common features.

Table 12. Accuracy comparison with the methods in the literature.

Authors	Year	Data Types	ACC (%)
Guimaraes et al. [4]	2017	Sentences	Prec: 95.0
Rizwan et al. [31]	2021	FG-NET	94.6
Özkan and Turan [32]	2018	FG-NET	78.5
Goshvarpour [33]	2019	ECG	94.6
Ilyas [34]	2020	Auditory perception	92.0
Reade [35]	2015	FG-Net	82.0
Tin [36]	2012	FG-Net	92.5
		Handwritten text	93.5
Our proposed	2022	Handwritten pattern	89.8

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shin, J.; Maniruzzaman, M.; Uchida, Y.; Hasan, M.A.M.; Megumi, A.; Suzuki, A.; Yasumura, A. Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods. Appl. Sci. 2022, 12, 5256. https://0-doi-org.brum.beds.ac.uk/10.3390/app12105256

AMA Style

Shin J, Maniruzzaman M, Uchida Y, Hasan MAM, Megumi A, Suzuki A, Yasumura A. Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods. Applied Sciences. 2022; 12(10):5256. https://0-doi-org.brum.beds.ac.uk/10.3390/app12105256

Chicago/Turabian Style

Shin, Jungpil, Md. Maniruzzaman, Yuta Uchida, Md. Al Mehedi Hasan, Akiko Megumi, Akiko Suzuki, and Akira Yasumura. 2022. "Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods" Applied Sciences 12, no. 10: 5256. https://0-doi-org.brum.beds.ac.uk/10.3390/app12105256

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Important Features Selection and Classification of Adult and Child from Handwriting Using Machine Learning Methods

Abstract

1. Introduction

2. Materials and Methods

2.1. Proposed ML-Based Framework

2.2. Dataset

2.2.1. Device for Data Collection

2.2.2. Handwritten Text

2.2.3. Handwritten Pattern

2.3. Feature Extraction

2.4. Feature Normalization

2.5. Feature Selection

2.6. Classifiers

2.6.1. Support Vector Machine

2.6.2. Random Forest

2.7. Performance Evaluation Metrics

3. Experimental Results and Discussion

3.1. Experimental Setup

3.2. Baseline Characteristics of Adult and Child

3.3. Hyperparameter Tuning

3.4. Experiment-1: Evaluation for Handwritten Text Dataset

3.5. Experiment-2: Evaluation for Handwritten Pattern Dataset

3.6. Comparison of Our Proposed Method with the Existing Method

4. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI