Next Article in Journal
Ensemble Approach to Combining Episode Prediction Models Using Sequential Circadian Rhythm Sensor Data from Mental Health Patients
Next Article in Special Issue
Sensor-Based Wearable Systems for Monitoring Human Motion and Posture: A Review
Previous Article in Journal
A Real-Time Inspection System for Industrial Helical Gears
Previous Article in Special Issue
The Use of Embedded IMU Insoles to Assess Gait Parameters: A Validation and Test-Retest Reliability Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Can Gait Characteristics Be Represented by Physical Activity Measured with Wrist-Worn Accelerometers?

Pfizer Inc., Cambridge, MA 02139, USA
*
Author to whom correspondence should be addressed.
Submission received: 10 September 2023 / Revised: 11 October 2023 / Accepted: 12 October 2023 / Published: 18 October 2023
(This article belongs to the Special Issue Wearable or Markerless Sensors for Gait and Movement Analysis)

Abstract

:
Wearable accelerometers allow for continuous monitoring of function and behaviors in the participant’s naturalistic environment. Devices are typically worn in different body locations depending on the concept of interest and endpoint under investigation. The lumbar and wrist are commonly used locations: devices placed at the lumbar region enable the derivation of spatio-temporal characteristics of gait, while wrist-worn devices provide measurements of overall physical activity (PA). Deploying multiple devices in clinical trial settings leads to higher patient burden negatively impacting compliance and data quality and increases the operational complexity of the trial. In this work, we evaluated the joint information shared by features derived from the lumbar and wrist devices to assess whether gait characteristics can be adequately represented by PA measured with wrist-worn devices. Data collected at the Pfizer Innovation Research (PfIRe) Lab were used as a real data example, which had around 7 days of continuous at-home data from wrist- and lumbar-worn devices (GENEActiv) obtained from a group of healthy participants. The relationship between wrist- and lumbar-derived features was estimated using multiple statistical methods, including penalized regression, principal component regression, partial least square regression, and joint and individual variation explained (JIVE). By considering multilevel models, both between- and within-subject effects were taken into account. This work demonstrated that selected gait features, which are typically measured with lumbar-worn devices, can be represented by PA features measured with wrist-worn devices, which provides preliminary evidence to reduce the number of devices needed in clinical trials and to increase patients’ comfort. Moreover, the statistical methods used in this work provided an analytic framework to compare repeated measures collected from multiple data modalities.

1. Introduction

In recent years, due to advancements in sensor technologies, there has been a rapid growth in the use of wearable devices to obtain high-resolution and objective measures of physical function and behavior in participants’ free-living environments, for both public health research and clinical trials [1]. Wearable devices provide opportunities to characterize human health beyond conventional snapshots of in-lab measurements by capturing a holistic picture of patients’ daily life, including sitting, walking, climbing, cycling, sleep, etc. [2].
Among wearable devices, accelerometers, typically a key component of most wearable activity trackers and smartphones, are commonly used to track 24-h motor activity such as walking, climbing, cycling, and sleep [1,3,4]. Walking gait characteristics are important functional measurements and have been shown to be associated with aging-related functional and cognitive decline, falls, and mortality and can be used as clinical endpoints for multiple conditions [5,6,7,8,9,10]. Research on physical activity (PA) focuses on summarizing overall activity volumes and categorizing the waking period into different activity intensities, such as sedentary, light, moderate, and vigorous [1,11], and studying the distributions and transitions of different intensities. A wealth of evidence shows that time spent in sedentary behaviors is associated with adverse health outcomes and increased time spent in sedentary behaviors increases the severity and complications associated with those health outcomes [12,13,14]. In recent years, accelerometry-measured PA has been gradually accepted to be used as primary or secondary endpoint for pivotal clinical studies [15,16].
The location of the accelerometer on the body can impact the prediction of activity types [3]. Lumbar-worn and wrist-worn devices are two commonly used types of devices. There has been extensive literature demonstrating that the lumbar location can provide a reliable estimation of temporal and spatial gait parameters [17,18,19,20,21]. Multiple algorithms were developed to extract gait features based on signals from lumbar-worn accelerometers [17,22,23]. On the other hand, wrist-worn devices are typically accurate in quantifying PA in general and number of steps per day [24]. The underlying assumption of employing accelerometers to quantify physical activity hinges on establishing a correlation between the acceleration signal and energy expenditure, where this correlation is established through controlled laboratory-based activity tasks, during which oxygen consumption is measured. Numerous studies have been dedicated to constructing and validating associations for difference cohorts. Below, we provide a selection of noteworthy studies that have contributed significantly to this field of research [3,25,26,27]. More importantly, due to the high comfort and user acceptability of a wrist-worn device (i.e., nowadays people are already used to wearing smart watches), it has been used in a number of large cohort studies to objectively quantify daily PA for multiple days [28,29,30,31].
In drug development, it is of interest to select reliable and sensitive endpoints collected from the most appropriate body location. For example, to evaluate the treatment effect for Duchenne muscular dystrophy, stride velocity 95th percentile is considered a regulatory acceptable clinical endpoint [32], which can be measured by a lumbar-worn accelerometer [33]. From an operational point of view, it is important to consider patient burdens and preferences when conducting a long-term clinical trial since this impacts compliance and data completeness. Having to wear a device in the lumbar region for weeks or months may not be ideal for patients. On the other hand, wrist-worn devices have been shown to offer better compliance when compared with lumbar-worn devices due to ease and comfort of wear resulting in lower patient burden [34,35], and therefore may be preferred by patients [36]. Choosing the right sensor location based on cohort and study design to obtain the endpoints of interest while maintaining operational simplicity is critical for successful deployment in clinical trials.
To evaluate whether it is possible to avoid having to deploy both lumbar- and wrist-worn devices in a clinical study, we need to explore the association between gait and PA features measured at the two locations. Studies have demonstrated that specific gait features such as gait speed are associated with PA [1,31,37]. Those studies typically only focus on a small number of PA or gait features and are hard to be generalized to the two domains of gait and activity with a full spectrum of features from each domain. Moreover, results were based on accelerometry-measured PA and lab-measured gait speed. There is still a lack of systematic evaluation of the association and joint information between a comprehensive list of gait and PA features in free-living environments. It calls for statistical innovation to explore their correlation while incorporating the totality of PA and gait and also the repeated-measurement nature of the data collection. And, it has not been established if gait features measured by lumbar-worn devices can be represented by PA features measured by wrist-worn devices.
The purpose of this paper is to fill the gap by evaluating if gait features measured by lumbar-worn devices can be represented by PA features measured by wrist-worn devices. In the remainder of the paper, we first review multiple statistical models to evaluate the univariate and multivariate association between measurements from the gait domain and the PA domain. These methods range from simple feature-to-feature correlations, including linear regression analysis and principal component regression, to more complex multivariate dimension reduction methods, such as partial least square (PLS) [38] and joint and individual variation explained (JIVE) [39]. Since the features from lumbar and wrist devices are generally collected for multiple days, we evaluated the association between gait and PA based on both summary statistics (one-level) across seven days and repeated measures recorded daily (multilevel). We then demonstrate how these methods can be leveraged in a real data application where both wrist-worn and lumbar-worn devices were deployed with the same participants simultaneously.

2. Evaluate the Association and Joint Information from the Two Placements

A large number of algorithms have been proposed in the literature to evaluate the association or the joint information of data from different domains. In this section, we review and summarize commonly used methods in three parts: (1) feature-on-domain models, (2) domain-on-domain models, and (3) multilevel models. Figure 1 summarizes statistical models implemented for one-level and multilevel data. In this work, one-level methods refer to models applied to features averaged over days. Multilevel models are extended from one-level models, which incorporate repeated daily measurements of each feature for each participant. Feature-on-domain models are implemented only on one-level data, including pairwise correlation analysis, penalized linear regression [40], and principal component regression (PCR) [41]. Domain-on-domain models involve multivariate and multidomain analysis for both one-level and multilevel data, such as partial least square (PLS) regression [38] and joint and individual variation explained (JIVE) [39].

2.1. Notation

We use the P dimension vector x i j = ( x i j 1 , , x i j P ) and Q dimensional vector y i j = ( y i j 1 , , y i j Q ) to denote the P features in the PA domain and Q features in the gait domain, respectively, for subject i , i = 1 , , n at day j , j = 1 , , n i . Let x p = { x i j p } i = 1 , , n , j = 1 , , n i be the pth feature vector in PA domain and y q = { y i j q } i = 1 , , n , j = 1 , , n i be the qth feature vector in PA domain. These vectors are stacked to form the PA feature domain X = ( x 1 , , x P ) and the gait feature domain Y = ( y 1 , , y Q ) . We use N = i n i to denote the total number of observations. In our proposed one-level models (Section 2.2 and Section 2.3), the mean values across days, i.e., x i . p = 1 n i j = 1 n i x i j p and y i . q = 1 n i j = 1 n i y i j q , are incorporated. In these models, x i p and y i q are used for notation simplicity. We use (.)’ to denote the transpose of a vector or a matrix.

2.2. Feature-on-Domain Models

First, correlation and regression analysis are implemented to explore each feature individually from both gait and PA domains and its association with the other domain. The Pearson correlation coefficient is a measure of linear correlation between two sets of data, with the form of the covariance of the two variables divided by the product of their standard deviations. For each pair of features ( x p , y q ) , p = 1 , , P , q = 1 , , Q , from wrist- and lumbar-worn devices, respectively, we compute their corresponding correlation coefficients ρ x p , y q .
In addition to pairwise correlation evaluation, linear multiple regression models are included to further explore the linear associations between two domains, via regressing each univariate variable from the gait domain on the PA domain. Multicollinearity within the PA feature domain can be addressed with enforced sparsity via Lasso regressions, and the model selection is based on a five-fold cross-validation procedure incorporating ’one-standard-error’ rule, that is, choosing the model with the fewest coefficients that is less than one standard error away from the sub-model with the lowest error. Therefore, the penalized linear regression is utilized to explore the linear association between a single feature from the one measurement domain with multiple features from the other measurement domain, extending the one-on-one correlation analysis.
Another common approach to address multicollinearity when analyzing multiple regression data is principal component regression (PCR) [41]. Specifically, principal component analysis (PCA) was applied on the PA domain to transform it into an orthogonal predictor space with a lower dimension. In our implementation, the number of principal components (PCs) is selected based on finding the “elbow” point in the scree plot. PCR is implemented to find the features in the PA domain which explain the most variance, as well as to find the features in the gait domain which have the highest association with the PCs.

2.3. Domain-on-Domain Models

We expand our modeling strategy to quantify the associations between the gait and PA domains using multivariate models. These models, referred to as domain-on-domain models, provide a more comprehensive analysis of the relationship between gait and PA features.

2.3.1. Partial Least Squares Regression

The partial least squares (PLS) regression [38] is a technique that combines features from PCA and multiple linear regression, which seeks to find the multidimensional direction in the PA domain X that explains the maximum multidimensional variance direction in the gait domain Y . In essence, the goal of PLS regression is to find a pair of unit vectors μ and ν so that the expression
μ X Y ν ,
is maximized. The process is implemented with R package pls. The resulting relation between gait and PA domains can be explained with the correlation and loading plots. Compared with the PCR model introduced in Section 2.2, the components obtained from the PLS regression are based on covariance from both X and Y domains so that they can explain the mutual association.

2.3.2. Joint and Individual Variation Explained

JIVE has been proposed to deal with scenarios where different sources or views of the data are simultaneously available for the same set of samples [39]. JIVE decomposes the original multiblock data into a sum of three components: a low-rank approximation capturing joint variation of the domains, low-rank approximations capturing individual variation in each domain, and residual noise. The imposed rank and orthogonality constraints can be considered as extensions of PCA. Let Z = X Y , represent the combined feature matrix with dimension ( P + Q ) × N . The JIVE can be formulated as follows
X = J 1 + A 1 + ϵ 1 Y = J 2 + A 2 + ϵ 2 ,
where ϵ i is the error matrix of independent entries with E ( ϵ i ) = 0 , for i = 1 , 2 . J = J 1 J 2 is the joint structure matrix with rank r and A l denotes the individual structure with respect to X and Y with lower dimensional ranks r 1 and r 2 , respectively. Row orthogonality is enforced between both joint and individual components. JIVE was fitted using the R package r.jive.

2.4. Multilevel Models

The above mentioned feature-on-domain and domain-on-domain models are all considered one level because they are implemented on the mean feature values recorded at subject level. They can be further extended to multilevel models which include repeated measures for each subject, since both lumbar-worn and wrist-worn devices are normally worn continuously for multiple days.

2.4.1. Multilevel Partial Least Square Regression

The multilevel PLS regression [42] borrows the idea from the two-way ANOVA model, which decomposes the original process into the sum of three parts: overall mean, between-subject effect, and within-subject effect; i.e., for the observed feature vector x i j (or y i j ), it can be decomposed into
x i j = μ + ( x i . μ ) + ( x i j x i . ) ,
where μ is the mean term with the form 1 n i i , j x i j . ( x i j x i . ) is the within-subject matrix with x i . = 1 n i j x i j , and ( x i . μ ) represents the between-subject matrix. The same format of transformation can be implemented on y i j . For both within- and between-subject matrices, the proposed PLS regression in Section 2.3.1 is performed. Therefore, both within-subject and between-subject relations between gait and PA domains can be explained via the multilevel PLS regression model.

2.4.2. Multilevel JIVE

To incorporate repeated measurements from multidomain data, Di et al. (2019) proposed multilevel JIVE [43] that combines multilevel PCA (MPCA) [44] and JIVE.
Multilevel JIVE utilizes a two-step procedure. First, a two-way ANOVA like decomposition is performed via MPCA to separate the joint variation into random between- and within-subject effects ( u i , w i j , respectively) with lower dimensional presentation, as shown in
z i j = μ + u i + w i j ,
where μ is the overall mean, and z i j = ( x i j , y i j ) is the P + Q dimensional vector for all features. The number of retained principal components is based on a pre-specified percentage of explained variation.
As the second step, JIVE is applied to both the between- and within-subject effects ( u i , w i j , respectively) to extract the joint and individual structure, as shown in
u i = J u i + A u i + ϵ u i w i j = J w i j + A w i j + ϵ w i j .
Therefore, the proposed multilevel JIVE is able to explain the joint and individual effects at both subject and day levels.

3. Real Data Application: The STRYDE Study

3.1. Subjects and Instrumentation

The Sensors to Record Your Daily Exercise (STRYDE) study was conducted at the Pfizer Innovation Research (PfIRe) Lab in Cambridge, Massachusetts. The study recruited 65 healthy participants in total, with 33 of them in the younger (18–40) cohort (age = 29.2 ± 4.6 years, 17 females) and 32 in the older (65–85) cohort (age = 72.3 ± 5.8  years, 16 females). Details of the study were previously published [33]. In general, the study was designed to have participants perform the same walking tasks during two in-clinic visits and be continuously monitored in their free-living environment for 7–14 days.
Specfically for this study, we only focused on the gait and PA measurements in the free-living enviroments. Continuous assessment of functional activities at home was collected using two Activinsights GENEActiv devices [45], with one placed at the lumbar, and the other on the non-dominant wrist. Devices recorded tri-axial accelerometer data (range: ±8 g, sampling rate: 50 Hz, resolution: 12 bits/3.9 mg) and were attached to the body using straps. A schematic representation of a participant wearing the device and participating in the gait analysis is provided in Figure 2. Data were stored locally on the device and downloaded for offline processing following the return of the device to the study site.

3.2. Data Processing

Raw acceleration signals from lumbar-worn GENEActiv devices were used to derive gait features. Gait features were generated using the implementation in the open-source SciKit Digital Health (SKDH) (https://github.com/PfizerRD/scikit-digital-health (accessed on 30 August 2023)) Python library [22]. The gait algorithm first detects gait bouts from continuous free-living data using a gradient-boosted tree classifier and then applies a continuous wavelet transformation [46] and inverted pendulum model [20] to derive temporal and spatial gait features for steps in each gait bout. We followed previous practice to first remove bouts that lasted <10 s or >3000 s, then take the median of gait features across all steps within each bout, and eventually take the mean (and/or 95th percentile) across all gait bouts per day [33]. Step counts and bout lengths were summed up across all bouts within each day. Eleven gait features were derived, namely total step counts per day, total bout length per day, gait speed, cadence, stance time, swing time, stride duration, double support, single limb support, stride length, and 95th percentile of gait speed.
Raw signals from wrist-worn GENEActiv devices were used to derive PA features. The Microsoft Excel Macros accompanying the GENEActiv device were used to generate the PA features [47]. The algorithm generated minute-by-minute epochs in each 24-h period and classified them into sleep, wear, bed, no wear, wear time (excluding bed time), sedentary activity, light activity, moderate activity, and vigorous activity time and eventually generated the relevant activity features. The activity features were extracted based on summarizing and thresholding the signal vector magnitude (SVMg). A total of thirteen PA features were presented in this study, including: total duration and numbers of sedentary, light, moderate, and vigorous PA bouts, total daily sleep time, and non-thresholded features, such as mean and 95th percentile of SVMg and mean activity level during the most active 6/15/60-min-window per day.
For both devices, we followed the convention to keep only valid days for each participant, defined as having 10 h or more of wearing time of wrist-worn devices [48,49]. Table 1 provided a list of gait and PA features along with their corresponding abbreviations. Summary metrics of these features for all participants, as well as for younger and older cohorts, were included in Table 2. On average, participants in the study had 8.78 (±1.86) valid days. To deal with the large discrepancy in the scales between different gait and PA features, all features were centered and scaled. For the one-level model, for each gait and PA feature, averages across days were taken. Sensitivity analysis showed that the model performance was similar if the median or quantile value across days was used.

3.3. Results—One-Level Models

The linear correlation between the gait and PA features was analyzed using pairwise Pearson correlation coefficients and is displayed in Figure 3. The within-domain correlations for gait features revealed patterns that potentially group gait features into three clusters, i.e., spatial features related to gait quality (including stride duration, cadence, double support, single limb support, stance, and swing times), spatial and spatio-temporal features (stride length, gait speed, 95th percentile of gait speed total steps), and features related to gait quantity (total steps and total bout length). The results also indicated that gait quantity measures, i.e., total steps and total bout length, have a stronger correlation with PA features. In PA domain, features related to peak activities during the day were grouped, such as maximum activity in the most active 6/15/60 window, moderate/vigorous activity, and mean/95th percentile of SVMg, while sedentary to light activities formed another group.
We applied linear multiple regression models with a Lasso penalty to evaluate the relationship between the two domains. Figure 4 displays the coefficients from the Lasso regression for the selected predictors in each regression model. The results showed that gait quantity features, i.e., total steps and total bout length, are positively associated with time spent in moderate activity and mean SVMg.
PCA was applied to the PA features and the first four components explained over 90% of total variation in the PA domain. Figure 5 displays the estimated eigenvectors of the top two PCs. Specifically, the first PC (PC1) was heavily loaded on mean SVMg, 95% percentile SVMg, and moderate activity, while the second PC (PC2) was heavily loaded on light activity and number of light and sedentary activity bouts. We fitted the PCR with the top four PCs as the predictors and each gait feature as the response. The estimated coefficients are displayed in Figure 6, which further supported the strong association between total bout length and total steps per day, and the activity features, especially in the first and fourth PCs.
In the domain-on-domain models, we aimed to uncover the association between two domains including all features. Figure 7 displays the contribution of gait and PA features on the top two components estimated from PLS. It first showed that gait features, including total bout length, total steps per day, 95th percentile of gait speed, mean gait speed, and stride length heavily contributed to the first component in the same direction, while double support, single limb support, stance, and swing times contributed to the first component in the opposite direction. This suggests that within all gait features, there are two latent clusters. The first cluster includes spatial and spatio-temporal features (e.g., gait speed and stride length) and gait quantity features (e.g., total steps per day and total bout length). And another cluster consists of gait quality features, such as double support and stride duration. Moreover, total steps per day, total bout length, mean and 95% percentile gait speed, and stride length contributed to the first component in a similar fashion to almost all PA features (i.e., negative loadings on the first component) except for sleep and sedentary time. Interestingly, PA features that summarized peak activity during the day (e.g., maximum activity in the most active 6/15/60 window, vigorous activity, and 95th percentile of SVMg) and the gait quantity features contributed to both components in a similar fashion, which suggests that they are highly correlated.
We further applied JIVE to explore the joint variation shared by the two domains. We identified a rank-1 representation of the joint structure and rank-2 representations of the gait and PA individual structure. Table 3 provides the contribution of both gait and PA features to the variation explained by the joint and individual components. Overall, the joint component explained 30% of the total variation of the combined gait and PA domain and the individual components contributed slightly more, which explained 56% of the total variation. In total, the joint and individual components explained over 85% of the total variation in two domains. Fifty eight percent of the joint variation was contributed by gait and the remaining forty-two percent by PA. These results suggest that the domains shared a great amount of joint information, with the gait domain contributing to the joint information slightly more than the PA domain. On the contrary, the PA domain explained more variation in individual components than the gait domain.

3.4. Results—Multilevel Models

In the multilevel models, the day-to-day variations in PA and gait features were taken into account. The results from the multilevel PLS regression demonstrated that the between- and within-subject effects were dominated by total steps per day and total bout length in the gait domain. Figure 8 shows the estimated first component for both between and within-subject effects. Particularly, total bout length and total steps per day had the highest contributions from the gait domain, followed by moderate time, mean, and 95th percentile of SVMg from the activity domain, within both between- and within-subject effects. These findings strengthen the conclusion that total steps per day and total bout length were closely related to the PA domain and played a significant role in the association between gait and PA domains.
Furthermore, the two-step multilevel JIVE was implemented. In the first step, MPCA was applied to all the features to separate between-subject and within-subject effects. The estimated between-subject and within-subject effects both explained around 45% of the total variation, which demonstrated the importance of not overlooking the within-subject (day-to-day) variation (Table 4). In the second step, JIVE was applied on between-subject effects and within-subject effects with main results shown in the second part of Table 4, which indicated that around 50% joint variation was shared between the two domains for both between- and within-subject effects. This demonstrated the equal importance of two domains in explaining the joint variance. The individual components explained 42% and 49% of the between- and within-subject effects, respectively. Therefore, for both between- and within-subject effects, over 95% of the total variation was covered by the joint and individual components. Figure 9 and Figure 10 illustrate the cross-correlation between features from the lower-rank representation of the between-subject effect u i and within-subject effect w i j and the extracted JIVE scores from the corresponding multilevel JIVE models. Similar grouping patterns of features were found in between-subject and within-subject effects. Gait quantity features (i.e., total bout length, total steps) and gait speed contributed to the joint component (first column) in the same direction as most of features from the PA domain (except for sedentary and sleep features). Furthermore, these gait features exhibited the highest correlation with the joint component, indicating that they are more likely to be represented by the PA features. Regarding the gait individual component in within-subject effects (as shown in the second column in Figure 10), it was observed that PA features related with more vigorous activities, such as maximum activity in the most active 6/15/60 window and 95th percentile of SVMg, contributed to this component in the same direction as gait speed, stride length, and cadence.

3.5. Comparison between One-/Multilevel PLS/JIVE

We then compared the results from the one-level and multilevel models. The estimated lower dimension components from the one- and the multilevel PLS and JIVE are shown in Figure 11. The first component in each model, which explains the largest amount of variance, was always highly correlated with the first components from the other models, indicating that the results are consistent across models. In particular, the strongest correlations were observed between the first principal components of the two one-level models (PLS.PC1 vs. JIVE.JT.PC1) and between the between-subject and within-subject effects of the multilevel models (PLS.B.PC1 vs. PLS.W.PC1 and JIVE.B.JT.PC1 vs. JIVE.W.JT.PC1).

4. Discussion

In this paper, we systematically investigated the association between gait features measured with lumbar-worn accelerometers and PA features measured with wrist-worn accelerometers, which, to the best of our knowledge, is the first study of its kind. We applied multiple univariate and multivariate statistical models to data collected from the STRYDE study to evaluate the association based on the average feature values across days and repeated measurements across days. Our results showed that gait and PA are interrelated and share joint information. Specifically, gait quantity measures, such as total steps and total bout length, and gait speed show a higher correlation with PA features when compared to other gait features. As a result, they are more likely to be represented by a combination of PA features, including those summarizing peak activity during the day, such as maximum activity within the most active 6-, 15-, and 60-min windows, vigorous activity, and the 95th percentile of SVMg. This result has been corroborated by multiple univariate and multivariate methods, such as multilevel JIVE and multilevel PLS.
We implemented multiple statistical models, including both univariate and multivariate methods which can be categorized into feature-on-domain analyses and domain-on-domain analyses. Specifically, the feature-on-domain analyses aim to reveal the association between each individual gait feature with all PA features. The domain-on-domain analyses aim to reveal the overall covariance and shared variation between the two domains of gait and PA. We specifically highlighted the use of JIVE, which has been previously used to explore the joint association among different physiological domains of PA, sleep, and circadian rhythms [1]. JIVE can be further applied in multimodal or multidomain data analyses. The day-to-day variation in continuous measurements from wearable devices is a well-known fact [50] and is associated with multiple health-related phenomena, such as social jetlag [51] and weekend warrior [52]. Therefore, in this work, we also assessed the importance of using multilevel analyses to take into account the within-subject correlation due to the repeated daily measurements, which are often ignored by only considering average values across days. Both one-level and multilevel models yielded consistent results, demonstrating that gait quantity features can be represented by PA features. The multilevel models provided further insights—it showed that both between-subject and within-subject effects explain 50% of the total variation in the data, which indicates that individual characteristics and day-to-day variability play important roles in understanding the associations between the gait and activity domains.
Previous research has indeed established connections between specific gait characteristics and PA features. For instance, Wanigatunga et al. identified associations between faster gait speed and several PA metrics, including increased activity counts, daily active minutes, and reduced activity fragmentation [31]. Similarly, Schrack et al. also observed a correlation between gait speed and activity fragmentation [37]. However, these studies primarily focused on gait speed as a singular gait characteristic and relied on lab-measured gait speed rather than accelerometer-based gait characterization. Moreover, Dawe et al. [53] discovered that three gait and balance measures were independently associated with total daily physical activity, assessed through total activity counts. However, it is important to highlight that the “gait and balance measures” in this research encompassed a combination of activities such as walking, sit-to-stand, and turning, extending beyond a pure gait characterization. These measurements were derived from a structured mobility testing protocol conducted in a controlled laboratory environment, which differs from the accelerometer-based gait characterization in free-living environment used in our study. Nevertheless, our study reinforces the consistent conclusion that gait speed is indeed correlated with various PA features. Importantly, we achieved this through a comprehensive analytical framework that encompasses a wide range of gait and PA characteristics. This approach not only reaffirms the relationship between gait and PA but also underscores the potential clinical significance of these correlations, particularly in understanding patients’ prognoses. Future research endeavors should explore these connections further, delving into longitudinal trends to provide valuable insights into the evolving relationship between PA and gait in clinical contexts.
The association between gait features and PA features identified in this work provides preliminary evidence that a single wrist-worn device can be deployed in a clinical trial. Specifically, as our results suggest, total gait quantity and gait speed exhibit a strong association with PA features, which indicates that they can be analytically represented by PA features. Furthermore, in scenarios where the primary interest lies in assessing total gait quantity or gait speed, PA features hold potential as surrogate endpoints. From an operational point of view, deploying a single wrist-worn, watch-like device will increase compliance and reduce data loss due to difficulty in wearing the devices. This will be a good step further toward decentralized clinical trials and more patient-centric clinical trials in general [11,54]. However, for therapeutic areas where specific spatial and spatio-temporal features are of clinical interest, such as stride duration, step symmetry, or gait variability, it may still be preferable to use the more fit-for-purpose devices, such as the lumbar-worn devices [55].
There has been recent research attempting to directly quantify walking gaits from wrist-worn accelerometers. However, it is worth noting that challenges persist when utilizing a single wrist-worn accelerometer for comprehensive and reliable gait characterization. For instance, Trost et al. [56] employed logistic regressions to classify various activity types, including walking, using both wrist and hip-worn devices. While their walking detection achieved reasonable accuracy, extracting precise gait metrics from wrist-accelerometry signals during walking bouts remains non-trivial. Similarly, Sokas et al. [57] detected 6-min windows of fast walking in daily activities using wrist-worn accelerometers. They counted steps in these windows and used piece-wise linear models with covariates like age and height to estimate walking distance which potentially can be hard to scale. While valuable for estimating distance in free-living 6-min walk task, this approach did not delve into detailed spatial and spatio-temporal gait characteristics. Brand [58] explored machine learning and deep learning techniques to classify gait based on free-living data. However, their summarization of walking bouts was limited to total daily walking time and a few time and frequency domain features, lacking the comprehensive gait characterization which can be typically obtained from lumbar-worn devices. Soltani et al. [59] proposed a personalized machine learning method for gait speed estimation using accelerometers but relied on Global Navigation Satellite System’s calibration and correction, limiting its generalizability. Chan et al. [60] employed support vector machine (SVM) models for gait bout classification and speed estimation but faced challenges in characterizing a wide range of gait characteristics. Additionally, the non-physical nature of the SVM model may not fully capture the underlying mechanics of walking. Therefore, the need for further investigation into wrist-worn-accelerometer-based gait characterization is evident, and lumbar-worn accelerometers remain the preferred choice for reliable and comprehensive gait characterization, surpassing the mere detection of walking bouts. Future research may leverage this framework to examine cross-domain associations between wrist-based activity, lumbar-based gait, and wrist-based gait once more reliable wrist-based gait estimation becomes available.
A significant contribution of this work is to provide a comprehensive analytic framework to study the inter-correlation and association between multiple data modalities with repeated measurements. Using this framework, we comprehensively evaluate the relationship between PA and gait, taking into account the repeated measurement design and a wide spectrum of PA and gait characteristics. This approach distinguishes our research from previous studies with similar objectives. We have implemented our methods within a hierarchical fashion, making it more accessible for researchers with less statistical background in this field. As demonstrated in this work, while one could consider a subset of the introduced methods to assess the association between features from different domains, our comprehensive implementation has provided new insights and interpretations from various perspectives. Additionally, cross-checking between methods has been performed to validate the results obtained from them. In this paper, even though the two modalities are gait and PA features from lumbar- and wrist-worn devices, respectively, such an analytic framework can be generalized to other comparisons involving repeated measurements to explore the associations, interactions, and shared information across different data modalities, domains, and/or time points.
We acknowledge that the biggest limitation of this work is the relatively small sample size. One important future work is to validate our findings in a separate study with a larger sample size and with both lumbar- and wrist-worn devices deployed, which may be challenging. Another direction to pursue validation is to pool multiple studies with small sample sizes which utilize both device locations. If the studies use different brands of devices, a device-agnostic data processing pipeline, such as Scikit Digital Health (SKDH) [22], could be used to generate gait and PA features across studies. With lager sample sizes, it is possible to evaluate the age effect on the association between gait and PA features, which may provide additional clinical insights. Furthermore, it is important to note that the STRYDE study primarily relied on data collected from a sample of healthy adult participants. While the findings from this study offer valuable insights, it is essential to exercise caution when extrapolating these results to individuals with pathological conditions. To establish a broader applicability of the conclusions drawn from the STRYDE study, additional research involving participants who have specific medical conditions or pathology may be necessary. In those cases, our proposed analytic framework can be directly applied.

Author Contributions

Conceptualization, J.D., F.I.K. and C.D.; methodology, J.D., W.L. and F.I.K.; formal analysis, W.L. and J.D.; data curation, L.A. and D.P.; study design and data collection, M.S., X.C., C.D. and F.I.K.; writing—original draft preparation, W.L. and J.D.; writing—review and editing, all authors; supervision, J.D. and F.I.K. All authors have read and agreed to the published version of the manuscript.

Funding

This study was sponsored by Pfizer Inc.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Advarra Institutional Review Board (protocol code: Pro00029419 initial date of approval 12Oct2018).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Upon request, and subject to review, Pfizer will provide the data that support the findings of this study. Subject to certain criteria, conditions and exceptions, Pfizer may also provide access to the related individual de-identified participant data. See https://www.pfizer.com/science/clinical-trials/data-and-results (accessed on 30 August 2023) for more information.

Acknowledgments

We thank all the staff members of Pfizer Innovation Research Lab for conducting the study and collecting the data, including Hao Zhang, Tomasz Adamusiak, Monica Calicchio, Amey Kelekar, Andrew Messere, Koene R. A. Van Dijk, Vesper Ramos, and Steve Amatto.

Conflicts of Interest

All authors are employees and stockholders of Pfizer Inc.

Abbreviations

The following abbreviations are used in this manuscript:
PAPhysical activity
JIVEJoint and individual variation explained
PCPrincipal component
PCAPrincipal component analysis
PCRPrincipal component regression
PLSPartial Least Square
MPCAMultilevel principal component analysis
STRYDESensors to Record Your Daily Exercise
PfIRePfizer Innovation Research
SVMgSignal vector magnitude
ANOVAAnalysis of variance
SKDHSciKit Digital Health

References

  1. Di, J.; Spira, A.; Bai, J.; Urbanek, J.; Leroux, A.; Wu, M.; Resnick, S.; Simonsick, E.; Ferrucci, L.; Schrack, J.; et al. Joint and individual representation of domains of physical activity, sleep, and circadian rhythmicity. Stat. Biosci. 2019, 11, 371–402. [Google Scholar] [CrossRef] [PubMed]
  2. Rose, M.J.; Neogi, T.; Friscia, B.; Torabian, K.A.; LaValley, M.P.; Gheller, M.; Adamowicz, L.; Georgiev, P.; Viktrup, L.; Demanuele, C.; et al. Reliability of wearable sensors for assessing gait and chair stand function at home in people with knee osteoarthritis. Arthritis Care Res. 2023, 75, 1939–1948. [Google Scholar] [CrossRef]
  3. Karas, M.; Bai, J.; Strączkiewicz, M.; Harezlak, J.; Glynn, N.W.; Harris, T.; Zipunnikov, V.; Crainiceanu, C.; Urbanek, J.K. Accelerometry data in health research: Challenges and opportunities. Stat. Biosci. 2019, 11, 210–237. [Google Scholar] [CrossRef] [PubMed]
  4. Lin, W.; Zou, J.; Di, C.; Sears, D.D.; Rock, C.L.; Natarajan, L. Longitudinal Associations Between Timing of Physical Activity Accumulation and Health: Application of Functional Data Methods. Stat. Biosci. 2023, 15, 309–329. [Google Scholar] [CrossRef] [PubMed]
  5. Hornyak, V.; VanSwearingen, J.M.; Brach, J.S. Measurement of gait speed. Top. Geriatr. Rehabil. 2012, 28, 27–32. [Google Scholar] [CrossRef]
  6. Peel, N.M.; Kuys, S.S.; Klein, K. Gait speed as a measure in geriatric assessment in clinical settings: A systematic review. J. Gerontol. Ser. A Biomed. Sci. Med. Sci. 2013, 68, 39–46. [Google Scholar] [CrossRef]
  7. Pirker, W.; Katzenschlager, R. Gait disorders in adults and the elderly: A clinical guide. Wien. Klin. Wochenschr. 2017, 129, 81–95. [Google Scholar] [CrossRef]
  8. Polhemus, A.M.; Bergquist, R.; de Basea, M.B.; Brittain, G.; Buttery, S.C.; Chynkiamis, N.; Dalla Costa, G.; Ortiz, L.D.; Demeyer, H.; Emmert, K.; et al. Walking-related digital mobility outcomes as clinical trial endpoint measures: Protocol for a scoping review. BMJ Open 2020, 10, e038704. [Google Scholar] [CrossRef]
  9. Jung, H.K.; Chung, E.; Lee, B.H. A comparison of the balance and gait function between children with Down syndrome and typically developing children. J. Phys. Ther. Sci. 2017, 29, 123–127. [Google Scholar] [CrossRef]
  10. Kennedy, R.A.; Carroll, K.; McGinley, J.L.; Paterson, K.L. Walking and weakness in children: A narrative review of gait and functional ambulation in paediatric neuromuscular disease. J. Foot Ankle Res. 2020, 13, 1–15. [Google Scholar] [CrossRef]
  11. Di, J.; Demanuele, C.; Kettermann, A.; Karahanoglu, F.I.; Cappelleri, J.C.; Potter, A.; Bury, D.; Cedarbaum, J.M.; Byrom, B. Considerations to address missing data when deriving clinical trial endpoints from digital health technologies. Contemp. Clin. Trials 2022, 113, 106661. [Google Scholar] [CrossRef] [PubMed]
  12. Dunstan, D.W.; Kingwell, B.A.; Larsen, R.; Healy, G.N.; Cerin, E.; Hamilton, M.T.; Shaw, J.E.; Bertovic, D.A.; Zimmet, P.Z.; Salmon, J.; et al. Breaking up prolonged sitting reduces postprandial glucose and insulin responses. Diabetes Care 2012, 35, 976–983. [Google Scholar] [CrossRef] [PubMed]
  13. Healy, G.N.; Dunstan, D.W.; Salmon, J.; Cerin, E.; Shaw, J.E.; Zimmet, P.Z.; Owen, N. Breaks in sedentary time: Beneficial associations with metabolic risk. Diabetes Care 2008, 31, 661–666. [Google Scholar] [CrossRef] [PubMed]
  14. Carson, V.; Wong, S.L.; Winkler, E.; Healy, G.N.; Colley, R.C.; Tremblay, M.S. Patterns of sedentary time and cardiometabolic risk among Canadian adults. Prev. Med. 2014, 65, 23–27. [Google Scholar] [CrossRef]
  15. Nathan, S.D.; Flaherty, K.R.; Glassberg, M.K.; Raghu, G.; Swigris, J.; Alvarez, R.; Ettinger, N.; Loyd, J.; Fernandes, P.; Gillies, H.; et al. A randomized, double-blind, placebo-controlled study of pulsed, inhaled nitric oxide in subjects at risk of pulmonary hypertension associated with pulmonary fibrosis. Chest 2020, 158, 637–645. [Google Scholar] [CrossRef]
  16. Bellerophon Announces Top-Line Data from Phase 3 REBUILD Clinical Trial of INOpulse® for Treatment of Fibrotic Interstitial Lung Disease. Available online: https://bellerophon.gcs-web.com/news-releases/news-release-details/bellerophon-announces-top-line-data-phase-3-rebuild-clinical (accessed on 30 August 2023).
  17. Czech, M.D.; Patel, S. GaitPy: An open-source python package for gait analysis using an accelerometer on the lower back. J. Open Source Softw. 2019, 4, 1778. [Google Scholar] [CrossRef]
  18. Trojaniello, D.; Cereatti, A.; Della Croce, U. Accuracy, sensitivity and robustness of five different methods for the estimation of gait temporal parameters using a single inertial sensor mounted on the lower trunk. Gait Posture 2014, 40, 487–492. [Google Scholar] [CrossRef]
  19. Menz, H.B.; Lord, S.R.; Fitzpatrick, R.C. Acceleration patterns of the head and pelvis when walking on level and irregular surfaces. Gait Posture 2003, 18, 35–46. [Google Scholar] [CrossRef]
  20. Zijlstra, W.; Hof, A.L. Assessment of spatio-temporal gait parameters from trunk accelerations during human walking. Gait Posture 2003, 18, 1–10. [Google Scholar] [CrossRef]
  21. Brandes, M.; Zijlstra, W.; Heikens, S.; van Lummel, R.; Rosenbaum, D. Accelerometry based assessment of gait parameters in children. Gait Posture 2006, 24, 482–486. [Google Scholar] [CrossRef]
  22. Adamowicz, L.; Christakis, Y.; Czech, M.D.; Adamusiak, T. Scikit digital health: Python package for streamlined wearable inertial sensor data processing. JMIR Mhealth Uhealth 2022, 10, e36762. [Google Scholar] [CrossRef] [PubMed]
  23. Chakravorty, A.; Mobbs, R.J.; Anderson, D.B.; Rooke, K.; Phan, K.; Yoong, N.; Maharaj, M.; Choy, W.J. The role of wearable devices and objective gait analysis for the assessment and monitoring of patients with lumbar spinal stenosis: Systematic review. BMC Musculoskelet. Disord. 2019, 20, 1–9. [Google Scholar] [CrossRef] [PubMed]
  24. Benson, L.C.; Clermont, C.A.; Bošnjak, E.; Ferber, R. The use of wearable devices for walking and running gait analysis outside of the lab: A systematic review. Gait Posture 2018, 63, 124–138. [Google Scholar] [CrossRef] [PubMed]
  25. Zhang, Y.; Li, H.; Keadle, S.K.; Matthews, C.E.; Carroll, R.J. A review of statistical analyses on physical activity data collected from accelerometers. Stat. Biosci. 2019, 11, 465–476. [Google Scholar] [CrossRef]
  26. Hildebrand, M.; Van Hees, V.T.; Hansen, B.H.; Ekelund, U. Age group comparability of raw accelerometer output from wrist-and hip-worn monitors. Med. Sci. Sport. Exerc. 2014, 46, 1816–1824. [Google Scholar] [CrossRef]
  27. Staudenmayer, J.; He, S.; Hickey, A.; Sasaki, J.; Freedson, P. Methods to estimate aspects of physical activity and sedentary behavior from high-frequency wrist accelerometer measurements. J. Appl. Physiol. 2015, 119, 396–403. [Google Scholar] [CrossRef]
  28. Troiano, R.P.; Berrigan, D.; Dodd, K.W.; Masse, L.C.; Tilert, T.; McDowell, M. Physical activity in the United States measured by accelerometer. Med. Sci. Sport. Exerc. 2008, 40, 181. [Google Scholar] [CrossRef]
  29. Berkemeyer, K.; Wijndaele, K.; White, T.; Cooper, A.; Luben, R.; Westgate, K.; Griffin, S.; Khaw, K.; Wareham, N.; Brage, S. The descriptive epidemiology of accelerometer-measured physical activity in older adults. Int. J. Behav. Nutr. Phys. Act. 2016, 13, 1–10. [Google Scholar] [CrossRef]
  30. Doherty, A.; Jackson, D.; Hammerla, N.; Plötz, T.; Olivier, P.; Granat, M.H.; White, T.; Van Hees, V.T.; Trenell, M.I.; Owen, C.G.; et al. Large scale population assessment of physical activity using wrist worn accelerometers: The UK biobank study. PLoS ONE 2017, 12, e0169649. [Google Scholar] [CrossRef]
  31. Wanigatunga, A.A.; Liu, F.; Urbanek, J.K.; Wang, H.; Di, J.; Zipunnikov, V.; Cai, Y.; Dougherty, R.J.; Simonsick, E.M.; Ferrucci, L.; et al. Wrist-Worn Accelerometry, Aging, and Gait Speed in the Baltimore Longitudinal Study of Aging. J. Aging Phys. Act. 2022, 31, 408–416. [Google Scholar] [CrossRef]
  32. Servais, L.; Yen, K.; Guridi, M.; Lukawy, J.; Vissière, D.; Strijbos, P. Stride velocity 95th centile: Insights into gaining regulatory qualification of the first wearable-derived digital endpoint for use in Duchenne muscular dystrophy trials. J. Neuromuscul. Dis. 2022, 9, 335–346. [Google Scholar] [CrossRef] [PubMed]
  33. Czech, M.D.; Psaltos, D.; Zhang, H.; Adamusiak, T.; Calicchio, M.; Kelekar, A.; Messere, A.; Van Dijk, K.R.; Ramos, V.; Demanuele, C.; et al. Age and environment-related differences in gait in healthy adults using wearables. NPJ Digit. Med. 2020, 3, 127. [Google Scholar] [CrossRef] [PubMed]
  34. Duncan, S.; Stewart, T.; Mackay, L.; Neville, J.; Narayanan, A.; Walker, C.; Berry, S.; Morton, S. Wear-time compliance with a dual-accelerometer system for capturing 24-h behavioural profiles in children and adults. Int. J. Environ. Res. Public Health 2018, 15, 1296. [Google Scholar] [CrossRef] [PubMed]
  35. Straczkiewicz, M.; Glynn, N.W.; Harezlak, J. On placement, location and orientation of wrist-worn tri-axial accelerometers during free-living measurements. Sensors 2019, 19, 2095. [Google Scholar] [CrossRef]
  36. Tarachandani, A.; Karahanoglu, F.I.; Messere, A.; Tarasenko, L.; LaRonde-Richard, A.M.; Kessler, N.; Rossulek, M.; Plate, H.; Mahoney, K.; Santamaria, M. Patient Willingness to Use Digital Health Technologies: A Quantitative and Qualitative Survey in Patients with Cancer Cachexia. Patient Prefer. Adherence 2023, 17, 1143–1157. [Google Scholar] [CrossRef]
  37. Schrack, J.A.; Kuo, P.L.; Wanigatunga, A.A.; Di, J.; Simonsick, E.M.; Spira, A.P.; Ferrucci, L.; Zipunnikov, V. Active-to-sedentary behavior transitions, fatigability, and physical functioning in older adults. J. Gerontol. Ser. A 2019, 74, 560–567. [Google Scholar] [CrossRef]
  38. Abdi, H. Partial least squares regression and projection on latent structure regression (PLS Regression). Wiley Interdiscip. Rev. Comput. Stat. 2010, 2, 97–106. [Google Scholar] [CrossRef]
  39. Lock, E.F.; Hoadley, K.A.; Marron, J.S.; Nobel, A.B. Joint and individual variation explained (JIVE) for integrated analysis of multiple data types. Ann. Appl. Stat. 2013, 7, 523. [Google Scholar] [CrossRef]
  40. Tibshirani, R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
  41. Jolliffe, I.T. A note on the use of principal components in regression. J. R. Stat. Soc. Ser. C Appl. Stat. 1982, 31, 300–303. [Google Scholar] [CrossRef]
  42. Liquet, B.; Lê Cao, K.A.; Hocini, H.; Thiébaut, R. A novel approach for biomarker selection and the integration of repeated measures experiments from two assays. BMC Bioinform. 2012, 13, 1–14. [Google Scholar] [CrossRef] [PubMed]
  43. Di, J. Feature Engineering and Statistical Modeling of Wearable Devices Data. Ph.D. Thesis, Johns Hopkins University, Baltimore, MD, USA, 2019. [Google Scholar]
  44. Di, C.Z.; Crainiceanu, C.M.; Caffo, B.S.; Punjabi, N.M. Multilevel functional principal component analysis. Ann. Appl. Stat. 2009, 3, 458. [Google Scholar] [CrossRef] [PubMed]
  45. GENEActiv: Raw Data Accelerometry. Available online: https://activinsights.com/technology/geneactiv/ (accessed on 30 August 2023).
  46. McCamley, J.; Donati, M.; Grimpampi, E.; Mazza, C. An enhanced estimate of initial contact and final contact instants of time using lower trunk inertial sensor data. Gait Posture 2012, 36, 316–318. [Google Scholar] [CrossRef] [PubMed]
  47. GENEActiv Support. Available online: https://activinsights.com/support/geneactiv-support/ (accessed on 30 August 2023).
  48. Troiano, R.P. Large-scale applications of accelerometers: New frontiers and new questions. Med. Sci. Sport. Exerc. 2007, 39, 1501. [Google Scholar] [CrossRef] [PubMed]
  49. Tudor-Locke, C.; Camhi, S.M.; Troiano, R.P. Peer reviewed: A catalog of rules, variables, and definitions applied to accelerometer data in the National Health and Nutrition Examination Survey, 2003–2006. Prev. Chronic Dis. 2012, 9, E113. [Google Scholar] [PubMed]
  50. Dillon, C.B.; Fitzgerald, A.P.; Kearney, P.M.; Perry, I.J.; Rennie, K.L.; Kozarski, R.; Phillips, C.M. Number of days required to estimate habitual activity using wrist-worn GENEActiv accelerometer: A cross-sectional study. PLoS ONE 2016, 11, e0109913. [Google Scholar] [CrossRef]
  51. Urbanek, J.K.; Spira, A.P.; Di, J.; Leroux, A.; Crainiceanu, C.; Zipunnikov, V. Epidemiology of objectively measured bedtime and chronotype in US adolescents and adults: NHANES 2003–2006. Chronobiol. Int. 2018, 35, 416–434. [Google Scholar] [CrossRef]
  52. O’Donovan, G.; Lee, I.M.; Hamer, M.; Stamatakis, E. Association of “weekend warrior” and other leisure time physical activity patterns with risks for all-cause, cardiovascular disease, and cancer mortality. JAMA Intern. Med. 2017, 177, 335–342. [Google Scholar] [CrossRef]
  53. Dawe, R.J.; Leurgans, S.E.; Yang, J.; Bennett, J.M.; Hausdorff, J.M.; Lim, A.S.; Gaiteri, C.; Bennett, D.A.; Buchman, A.S. Association between quantitative gait and balance measures and total daily physical activity in community-dwelling older adults. J. Gerontol. Ser. A 2018, 73, 636–642. [Google Scholar] [CrossRef]
  54. Ghadessi, M.; Di, J.; Wang, C.; Toyoizumi, K.; Shao, N.; Mei, C.; Demanuele, C.; Tang, R.; McMillan, G.; Beckman, R.A. Decentralized clinical trials and rare diseases: A Drug Information Association Innovative Design Scientific Working Group (DIA-IDSWG) perspective. Orphanet J. Rare Dis. 2023, 18, 79. [Google Scholar] [CrossRef]
  55. Micó-Amigo, M.E.; Bonci, T.; Paraschiv-Ionescu, A.; Ullrich, M.; Kirk, C.; Soltani, A.; Küderle, A.; Gazit, E.; Salis, F.; Alcock, L.; et al. Assessing real-world gait with digital technology? Validation, insights and recommendations from the Mobilise-D consortium. J. Neuroeng. Rehabil. 2023, 20, 1–26. [Google Scholar] [CrossRef] [PubMed]
  56. Trost, S.G.; Zheng, Y.; Wong, W.K. Machine learning for activity recognition: Hip versus wrist data. Physiol. Meas. 2014, 35, 2183. [Google Scholar] [CrossRef]
  57. Sokas, D.; Paliakaitė, B.; Rapalis, A.; Marozas, V.; Bailón, R.; Petrėnas, A. Detection of walk tests in free-living activities using a wrist-worn device. Front. Physiol. 2021, 12, 706545. [Google Scholar] [CrossRef] [PubMed]
  58. Brand, Y.E.; Schwartz, D.; Gazit, E.; Buchman, A.S.; Gilad-Bachrach, R.; Hausdorff, J.M. Gait detection from a wrist-worn sensor using machine learning methods: A daily living study in older adults and people with Parkinson’s disease. Sensors 2022, 22, 7094. [Google Scholar] [CrossRef] [PubMed]
  59. Soltani, A.; Dejnabadi, H.; Savary, M.; Aminian, K. Real-world gait speed estimation using wrist sensor: A personalized approach. IEEE J. Biomed. Health Inform. 2019, 24, 658–668. [Google Scholar] [CrossRef] [PubMed]
  60. Chan, L.L.; Lord, S.R.; Brodie, M.A. Is Maximal or Usual Walking Speed from Large Scale Wrist Sensor Data Better at Predicting Dementia, Depression and Death? Sci. Rep. 2022, 12, 16211. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Overview of the analytical framework implemented in this paper for comparing multivariate, multilevel, and multidomain data.
Figure 1. Overview of the analytical framework implemented in this paper for comparing multivariate, multilevel, and multidomain data.
Sensors 23 08542 g001
Figure 2. A schematic representation of devices placement of a participant (left) and gait characteristics (right).
Figure 2. A schematic representation of devices placement of a participant (left) and gait characteristics (right).
Sensors 23 08542 g002
Figure 3. Pairwise cross-correlation matrix for gait features (colored in red) and physical activity (colored in blue) at significance level 0.1. The ‘X’ represents non-significant correlation. Three groups in gait domain (red box) and two groups in PA domain (blue box) were identified.
Figure 3. Pairwise cross-correlation matrix for gait features (colored in red) and physical activity (colored in blue) at significance level 0.1. The ‘X’ represents non-significant correlation. Three groups in gait domain (red box) and two groups in PA domain (blue box) were identified.
Sensors 23 08542 g003
Figure 4. Regression coefficient (numbers in the circles) from Lasso regression, with one feature from one domain (rows) regressed on the multivariate feature domain of the other one (columns). The place was left blank if the variable was not selected in the corresponding penalized regression model. From the regression models, total bout length and total steps per day are associated with more features from the PA domain and the corresponding estimated coefficients tend to be of large value.
Figure 4. Regression coefficient (numbers in the circles) from Lasso regression, with one feature from one domain (rows) regressed on the multivariate feature domain of the other one (columns). The place was left blank if the variable was not selected in the corresponding penalized regression model. From the regression models, total bout length and total steps per day are associated with more features from the PA domain and the corresponding estimated coefficients tend to be of large value.
Sensors 23 08542 g004
Figure 5. Loading vectors of the first two principal components from PCA applied to all PA features. Features related with peak activity (e.g., mean/95th percentile SVMg and moderate activity) contributed more to the first component. Features related to low-intensity activity (e.g., sedentary to light activities) contributed more to the second component.
Figure 5. Loading vectors of the first two principal components from PCA applied to all PA features. Features related with peak activity (e.g., mean/95th percentile SVMg and moderate activity) contributed more to the first component. Features related to low-intensity activity (e.g., sedentary to light activities) contributed more to the second component.
Sensors 23 08542 g005
Figure 6. Estimated coefficients (numbers in circles) from PCR using the first four principal components from the PCA decomposition as the regression predictors. The blank box represents non-significant (p-value < 0.05) coefficients. PC 1 and PC 4 from the PA domain have the highest correlation with total steps and total bout length.
Figure 6. Estimated coefficients (numbers in circles) from PCR using the first four principal components from the PCA decomposition as the regression predictors. The blank box represents non-significant (p-value < 0.05) coefficients. PC 1 and PC 4 from the PA domain have the highest correlation with total steps and total bout length.
Sensors 23 08542 g006
Figure 7. Variable correlation circle plot between gait (red) and PA (blue) domains for the first two PLS components. Two latent clusters were identified here. The first cluster included spatial and spatio-temporal features (e.g., gait speed and stride length) and gait quantity features (e.g., total steps per day and total bout length), contributing to the first component in the same direction as PA features. Another cluster included gait quality features, such as double support and stride duration. They contributed to the first component in the opposite direction of other gait features, together with sedentary activity and sleep time features from PA domain.
Figure 7. Variable correlation circle plot between gait (red) and PA (blue) domains for the first two PLS components. Two latent clusters were identified here. The first cluster included spatial and spatio-temporal features (e.g., gait speed and stride length) and gait quantity features (e.g., total steps per day and total bout length), contributing to the first component in the same direction as PA features. Another cluster included gait quality features, such as double support and stride duration. They contributed to the first component in the opposite direction of other gait features, together with sedentary activity and sleep time features from PA domain.
Sensors 23 08542 g007
Figure 8. Loading plot of 1st component in multilevel PLS for between-subject (left) and within-subject (right) effects for both gait (red) and PA (blue) features. Total bout length and total steps per day from the gait domain and moderate time, mean and 95th percentile of SVMg from the PA domain had the highest contributions to the first component for both between and within-subject effects.
Figure 8. Loading plot of 1st component in multilevel PLS for between-subject (left) and within-subject (right) effects for both gait (red) and PA (blue) features. Total bout length and total steps per day from the gait domain and moderate time, mean and 95th percentile of SVMg from the PA domain had the highest contributions to the first component for both between and within-subject effects.
Sensors 23 08542 g008
Figure 9. Cross-correlations between eleven gait features and thirteen PA features and the first joint score, first gait individual score, and two PA individual scores for between-subject effect from multilevel JIVE (* indicates p-values < 0.05). Total bout length and total steps per day from the gait domain and mean/95th percentile SVMg and moderate activity from the PA domain had the biggest contribution to the joint component in the same direction (blue dots).
Figure 9. Cross-correlations between eleven gait features and thirteen PA features and the first joint score, first gait individual score, and two PA individual scores for between-subject effect from multilevel JIVE (* indicates p-values < 0.05). Total bout length and total steps per day from the gait domain and mean/95th percentile SVMg and moderate activity from the PA domain had the biggest contribution to the joint component in the same direction (blue dots).
Sensors 23 08542 g009
Figure 10. Cross-correlations between eleven gait features and thirteen PA features and the first joint score, first gait individual score, and two PA individual scores for within-subject effect from multilevel JIVE (* indicates p-values < 0.05 ). Total bout length and total steps per day from the gait domain and mean/95th percentile SVMg and moderate activity from the PA domain had the biggest contribution to the joint component in the same direction (red dots).
Figure 10. Cross-correlations between eleven gait features and thirteen PA features and the first joint score, first gait individual score, and two PA individual scores for within-subject effect from multilevel JIVE (* indicates p-values < 0.05 ). Total bout length and total steps per day from the gait domain and mean/95th percentile SVMg and moderate activity from the PA domain had the biggest contribution to the joint component in the same direction (red dots).
Sensors 23 08542 g010
Figure 11. Correlations between first two components from one-level and multilevel PLS and JIVE models. First components from all models are correlated with each other, indicating the results are consistent across models. Label notations: PC1/2: 1st/2nd principal component; B.: between-subject effect; W.: within-subject effect; JT.: joint component in JIVE; ind.: individual component in JIVE.
Figure 11. Correlations between first two components from one-level and multilevel PLS and JIVE models. First components from all models are correlated with each other, indicating the results are consistent across models. Label notations: PC1/2: 1st/2nd principal component; B.: between-subject effect; W.: within-subject effect; JT.: joint component in JIVE; ind.: individual component in JIVE.
Sensors 23 08542 g011
Table 1. List of features and corresponding abbreviations in figures and tables.
Table 1. List of features and corresponding abbreviations in figures and tables.
Features in the Gait DomainAbbreviations
total step counts per day (#)total_step_per_day
total bout length per day (s)total_bout_length
gait speed (m/s)gait_speed
cadence (# steps/min)cadence
stance time (s)stance
swing time (s)swing
stride duration (s)stride_duration
double support (s)double_support
single limb support (s)single_limb_support
stride length (m)stride_length
95th percentile of gait speed (m/s)gait_speed_95perc
Features in the PA domainAbbreviations
total duration of sedentary activity bouts (h)sedentary
total duration of light activity bouts (h)light
total duration of moderate activity bouts (h)moderate
total duration of vigorous activity bouts (h)vigorous
total daily sleep time (h)sleep_time
total number of sedentary activity bouts (#)sed_period
total number of light activity bouts (#)light_period
total number of moderate activity bouts (#)mod_period
mean SVMgmean_SVMg
95th percentile of SVMgperc95_SVMg
mean activity level during the most active 6-min windowmax_6min
mean activity level during the most active 15-min windowmax_15min
mean activity level during the most active 60-min windowmax_60min
Table 2. Descriptive statistics (mean ± sd) of gait and PA features for all participants and younger and older cohorts separately.
Table 2. Descriptive statistics (mean ± sd) of gait and PA features for all participants and younger and older cohorts separately.
FeaturesOverall (Mean ± Sd)Younger (Mean ± Sd)Older (Mean ± Sd)
total_step_per_day7361.59 ± 3249.697924.53 ± 3422.656781.05 ± 3004.10
total_bout_length81.71 ± 34.0186.47 ± 36.4376.80 ± 31.14
gait_speed0.86 ± 0.100.91 ± 0.090.81 ± 0.09
cadence99.21 ± 5.82101.11 ± 5.8797.25 ± 0.16
stance0.76 ± 0.040.75 ± 0.040.78 ± 0.04
swing0.47 ± 0.030.46 ± 0.030.48 ± 0.03
stride_duration1.25 ± 0.071.23 ± 0.071.27 ± 0.07
double_support0.29 ± 0.020.29 ± 0.010.30 ± 0.02
single_limb_support0.49 ± 0.030.48 ± 0.030.50 ± 0.03
stride_length1.04 ± 0.111.07 ± 0.111.00 ± 0.10
gait_speed_95perc1.27 ± 0.171.35 ± 0.141.18 ± 0.16
sedentary11.22 ± 1.3610.97 ± 1.3111.47 ± 1.40
light1.30 ± 0.461.29 ± 0.311.32 ± 0.58
moderate2.06 ± 0.902.47 ± 0.781.64 ± 0.83
vigorous0.11 ± 0.140.16 ± 0.160.07 ± 0.11
sleep_time5.79 ± 1.485.64 ± 1.595.94 ± 1.38
sed_period60.25 ± 15.7364.41 ± 10.1055.96 ± 19.18
light_period56.24 ± 17.6657.37 ± 12.1155.07 ± 22.13
mod_period45.05 ± 17.0653.10 ± 11.2736.75 ± 18.16
mean_SVMg131.22 ± 33.95146.59 ± 29.82115.36 ± 30.85
perc95_SVMg496.51 ± 139.47558.60 ± 117.45432.49 ± 132.61
max_6min921.76 ± 362.271100.49 ± 344.17737.44 ± 282.13
max_15min726.55 ± 305.06851.77 ± 302.87597.42 ± 252.05
max_60min441.38 ± 170.51502.57 ± 169.68378.28 ± 149.03
Table 3. Percentages of joint and individual variation explained by each domain in JIVE.
Table 3. Percentages of joint and individual variation explained by each domain in JIVE.
Joint ComponentIndividual Components
Explained Variation0.300.56
Gait0.580.47
PA0.420.53
Table 4. Results from multilevel JIVE. 1. Between- and within-subject effects (percentages of explained variance) from MPCA decomposition. 2. Percentages of joint and individual variation explained by each domain for between- and within-subject effect in multilevel JIVE.
Table 4. Results from multilevel JIVE. 1. Between- and within-subject effects (percentages of explained variance) from MPCA decomposition. 2. Percentages of joint and individual variation explained by each domain for between- and within-subject effect in multilevel JIVE.
Between-Subject Effect Within-Subject Effect
1. MPCA decomposition0.46 0.44
JointIndividualJointIndividual
2. JIVE decomposition0.530.420.450.49
Gait0.470.410.540.27
PA0.530.590.460.73
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lin, W.; Karahanoglu, F.I.; Psaltos, D.; Adamowicz, L.; Santamaria, M.; Cai, X.; Demanuele, C.; Di, J. Can Gait Characteristics Be Represented by Physical Activity Measured with Wrist-Worn Accelerometers? Sensors 2023, 23, 8542. https://0-doi-org.brum.beds.ac.uk/10.3390/s23208542

AMA Style

Lin W, Karahanoglu FI, Psaltos D, Adamowicz L, Santamaria M, Cai X, Demanuele C, Di J. Can Gait Characteristics Be Represented by Physical Activity Measured with Wrist-Worn Accelerometers? Sensors. 2023; 23(20):8542. https://0-doi-org.brum.beds.ac.uk/10.3390/s23208542

Chicago/Turabian Style

Lin, Wenyi, Fikret Isik Karahanoglu, Dimitrios Psaltos, Lukas Adamowicz, Mar Santamaria, Xuemei Cai, Charmaine Demanuele, and Junrui Di. 2023. "Can Gait Characteristics Be Represented by Physical Activity Measured with Wrist-Worn Accelerometers?" Sensors 23, no. 20: 8542. https://0-doi-org.brum.beds.ac.uk/10.3390/s23208542

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop