Novel Method of Classification in Knee Osteoarthritis: Machine Learning Application Versus Logistic Regression Model

Article information

Ann Rehabil Med. 2020;44(6):415-427
Publication date (electronic) : 2020 December 31
doi : https://doi.org/10.5535/arm.20071
1Department of Rehabilitation Medicine, Hanyang University College of Medicine, Seoul, Korea
2Department of Rehabilitation Medicine, Hanyang University Guri Hospital, Hanyang University College of Medicine, Guri, Korea
3Korea Orthopedics & Rehabilitation Engineering Center, Incheon, Korea
Corresponding author: Jaesung Cho Korea Orthopedics & Rehabilitation Engineering Center, 26 Gyeongin-ro 10beon-gil, Bupyeong-gu, Incheon 21417, Korea. Tel: +82-32-509-5200, Fax: +82-32-503-5017, E-mail: jscho@kcomwel.or.kr
Received 2020 April 10; Revised 2020 May 28; Accepted 2020 July 1.

Abstract

Objective

To present new classification methods of knee osteoarthritis (KOA) using machine learning and compare its performance with conventional statistical methods as classification techniques using machine learning have recently been developed.

Methods

A total of 84 KOA patients and 97 normal participants were recruited. KOA patients were clustered into three groups according to the Kellgren-Lawrence (K-L) grading system. All subjects completed gait trials under the same experimental conditions. Machine learning-based classification using the support vector machine (SVM) classifier was performed to classify KOA patients and the severity of KOA. Logistic regression analysis was also performed to compare the results in classifying KOA patients with machine learning method.

Results

In the classification between KOA patients and normal subjects, the accuracy of classification was higher in machine learning method than in logistic regression analysis. In the classification of KOA severity, accuracy was enhanced through the feature selection process in the machine learning method. The most significant gait feature for classification was flexion and extension of the knee in the swing phase in the machine learning method.

Conclusion

The machine learning method is thought to be a new approach to complement conventional logistic regression analysis in the classification of KOA patients. It can be clinically used for diagnosis and gait correction of KOA patients.

INTRODUCTION

Osteoarthritis (OA), which is defined as the degeneration of joint cartilage and the underlying bone, is the most common joint disease. It can also be described by joint symptoms including pain and stiffness and/or structural pathology affecting many joint tissues [1]. It is clinically known that the knee joint is the most common site afflicted by OA despite the potential for any joint involvement [2]. Knee osteoarthritis (KOA) is the most common source of pain, disability, and functional loss in adults, consequently leading to abnormal gait patterns. As KOA seems to be much more prevalent today due to the increased life expectancy and body mass index (BMI) in modern times, early intervention is required through accurate early diagnosis of KOA [3].

There are various tools to classify KOA including radiological, arthroscopic, and functional scales [4]. The mainstay for diagnosing KOA is plain radiography and the Kellgren-Lawrence (K-L) grading system, which assigns a grade from 0 (no presence of OA) to 4 (severe OA), is the most commonly used [5].

Gait analysis systems can provide information on the biomechanical response of the human body to musculoskeletal disorders like KOA with non-invasive methods [6]. Some studies investigated the kinematic changes during the gait cycle and reported decreased knee joint motion during flexion, a decreased knee flexion angle during the stance phase, and an increased knee flexion angle at heel strike [7-10]. Others showed kinetic changes in KOA patients including a reduced internal knee extensor moment to reduce the loading on the knee joint, increased first peak knee external adduction moment, and initial contact with the ground with a more extended knee [10,11].

Previous studies on gait analysis mainly used spatiotemporal parameters such as step length and gait speed; kinematic parameters such as joint angle; and kinetic parameters such as ground reaction force and moment. In terms of the statistical techniques used in the study, conventional statistics such as a t-test or Mann-Whitney U test were used [10,12]. In statistically more precise studies, principle component analysis or multiple regression analysis were used [6,13,14].

We can obtain dynamic and segmented gait data, which is difficult to obtain through plain radiography, by gait analysis. When this is applied to a clinical trial, there is the advantage that rehabilitative interventions for the gait patterns of KOA patients can be performed together. However, a vast amount of gait data from gait analysis still remains limited for clinical use, and machine learning techniques have been introduced to compensate for this limitation and improve the classification accuracy.

Recently, there have been many attempts to classify and distinguish abnormal pathologic gait patterns via combining machine learning-based techniques with gait analysis system in the medical field [15-18]. When analyzing gait patterns through machine learning-based techniques, a machine classifier is necessary and the support vector machine (SVM) has been established as a successful technique for pattern recognition [19].

Therefore, this study aimed to present anew classification method of KOA patients using machine learning and evaluate the accuracy of classification by comparing it to conventional statistical analysis methods.

MATERIALS AND METHODS

Subjects

This study was performed on age-matched KOA patients (n=84) and normal participants (n=97). KOA was diagnosed through plain radiography and graded according to the K-L grading system. Subjects without knee pain, joint stiffness, gait problems, or a history of being diagnosed with KOA were considered normal participants. Before all subjects decided to participate in this study, the purpose and methods of the study was fully explained to them. Additionally, participants provided informed consent.

Prior to beginning the study, we asked subjects to maintain their usual lifestyle. In preparation for any problems, we contacted them over the phone during the experiment to check their condition or whether they had any health problems. When subjects visited our institution, they provided their own elementary demographic information including sex and age, and KOA patients were assessed for the severity of their disease through plain radiography. All demographic characteristics including age, height, weight, and BMI are described as the mean values in both the control and KOA groups (Table 1). Also, KOA patients were clustered into three groups according to the K-L grade of their affected limb via plain radiography: mild, as definite osteophyte and possible joint space narrowing on the anteroposterior weightbearing radiograph; moderate, as multiple osteophytes, definite joint space narrowing, sclerosis, and possible bony deformation; and severe, as large osteophytes, marked joint space narrowing, severe sclerosis and definite bony deformation. The affected limb was defined as a more symptomatic side in bilateral KOA patients and an ipsilateral side in unilateral KOA patients. This study was approved by Hanyang University Guri Hospital Institutional Review Board (No. 2016-06-014).

Characteristics of patients with knee osteoarthritis and control subjects

Gait analysis and experimental equipment

All subjects completed ten walking trials with a 10-m gait course. Gait trials were done in a quiet environment with proper speed for participants. We subdivided each gait cycle to easily understand and utilize it for research. Gait cycles were divided into the stance phase (SP) and swing phase (SW), and then, the SP into the initial double support phase (IDS), single support phase (SS), and terminal double support phase (TDS). Three-dimensional (3D) kinematic data from the hip, knee, and ankle joints were also gathered. All subjects performed all components of the experiments in the same environments and conditions.

In this study, a major piece of equipment for analyzing gait patterns was the inertial measurement unit (IMU)- based system (Human Track; RBiotech Co. Ltd., Seoul, Korea). Its main components consisted of a gyroscope, magnetic sensor, and accelerometer. For measuring angular velocity and proper acceleration, the triaxial gyroscope and accelerometer were used, respectively. We obtained information on the azimuth angle through a magnetic sensor. The 3D motion information was obtained using the gyroscope, accelerometer, and magnetic sensor in all three axes. The wearable IMU used in this study was wireless and attached to the lower abdomen, both sides of mid-thigh level, tibial shaft, and feet dorsum. According to Aminian et al. [20], gait analysis using a gyroscope was able to effectively estimate spatiotemporal parameters. In a previous study, the validity of applying the IMU system consisting of a gyroscope, accelerometer, and magnetometer to gait analysis was also verified [21]. Because temporal gait parameters are used to compute higher-level gait features and the accuracy of spatiotemporal parameter measurement using IMU was evaluated as appropriate, our study could also measure the gait cycle and divide it into four phases using IMU [22].

The hardware of the gait analysis system used in this study consisted of PC and wireless IMU sensors (Fig. 1). Using a Bluetooth module with 100 Hz serial peripheral interface (SPI) communication, the PC received signals from the sensors without direct connection.

Fig. 1.

A hardware of the gait analysis system consisting of PC and wireless IMU sensors (A), and gait trial of knee osteoarthritis patient (B) and normal subject (C). IMU, inertial measurement unit.

Extraction of kinematic parameters and segmentation of the gait cycle

Utilizing the motion information gathered from all sensors attached to the patients, the 3D joint angles were calibrated. In this study, the gait data on both sides of the limbs were averaged based on the gait cycle. This is because we believed that if we analyzed the left and right sides separately, the number of cases would be too large to lead to a meaningful performance of the machine learning method. Based on the data, we were able to show how the joint angle changed throughout the whole gait cycle (Fig. 2). And then, 72 gait features could be listed to classify KOA patients, which described the kinematic components of the gait patterns (Table 2). Through the process of subdividing joint angles during the gait cycle, abnormal gait patterns and their cause could be explained.

Fig. 2.

The gait waveform data of one representative normal subject’s lower limb was divided and expressed by each segmented gait phase.

Kinematic gait parameters used to classify osteoarthritis patients

Machine learning application for the classification of knee osteoarthritis

To classify KOA patients and severity, a machine learning algorithm was used in this study. The SVM is a supervised learning model with associated learning algorithms used to find a support vector. A support vector is an optimal hyperplane that separates two data classes. The SVM figures out the hyperplane with a maximum-margin via process finding two data in two groups with the shortest distance. It is a brief overview of the SVM (Fig. 3), and if it cannot draw a dividing line in two-dimensional space, the kernel function can be applied in a higher dimensional space to classify data.

Fig. 3.

Conceptualization of the support vector machine (SVM). The purpose of the SVM is to identify a hyperplane that precisely divides the data points. A hyperplane is a decision boundary that segregates data into classes. The data points close to the hyperplane are support vectors, and these affect the position of the hyperplane.

With the z-scores calculated, all gait features were normalized and then applied to the classifiers. In this study, a 5-fold cross-validation scheme was performed to assess the possibility of generalization of the classifier, and the results were attained through the accuracy result. After splitting data set into five subsets, one subset was used as a test set and the remaining four subsets were used as training sets.

From all gait analyses, we collected the feature set data and operated it using the MATLAB Classification Learner Tool. Using the SVM classifier, we calculated the distance between data from KOA patients and normal subjects and generated a model capable of classifying each cluster by distance. A flow chart of our classification algorithm is shown in Fig. 4. By using this algorithm, we generated a model that can classify KOA patients through a machine learning technique. For successful feature selection that is notably sensitive to feature interactions, the ReliefF algorithm was used. A feature score for each feature was calculated and applied to find the highest scoring features for feature selection. A flow diagram of this algorithm is shown in Fig. 5.

Fig. 4.

Flow diagram for the classification algorithm of knee osteoarthritis patients. SVM, support vector machine.

Fig. 5.

Flow diagram of feature selection for improving the SVM classification of knee osteoarthritis severity. SVM, support vector machine; OA, osteoarthritis.

Statistical analysis

We also performed a logistic regression analysis to compare the results of conventional medical statistical methods and machine learning in classifying KOA patients and normal subjects. Age, gender, height, weight, duration of one gait cycle, gait speed, cadence, and the maximum knee flexion angle were used as covariates and the presence of KOA (KOA patients group vs. normal subjects group) was used as dependent variable in the logistic regression model. Odds ratios (ORs) and 95% confidence intervals (CIs) were used to show the associations in the logistic regression analysis, and Nagelkerke’s R2 statistic was calculated to determine the variance in classification of KOA patients and normal subjects. Statistical analysis was performed with SPSS Statistics software (IBM SPSS, First Armonk, NY, USA).

RESULTS

Primarily, the result values of basic gait parameters for each group are shown in Supplementary Table S1. The mean and standard deviation values for the range of motion (ROM) of the aforementioned 72 gait features are shown in Supplementary Table S2. All subjects were divided into two groups (KOA and control) based on the 72 gait features. We used the MATLAB Classification Learner to assess the accuracy of the SVM classifier, which was 92.8% (Table 3). Then, machine learning-based classification through the SVM classifier was shown as a scatter plot with two-dimensions. We constructed a 2×2 confusion matrix with actual and predicted values, and visualized the performance of machine learning-based classification (Fig. 6).

Accuracy results in classification of knee osteoarthritis and normal subjects using SVM

Fig. 6.

The results of classification of knee osteoarthritis (OA) patients and normal subjects using an SVM classifier as a confusion matrix. The number of correct predictions is 75 and 93 for true positive and negative cases, respectively; and the number of incorrect predictions is 4 and 9 for false positive and negative cases, respectively. SVM, support vector machine.

We also classified the severity of KOA patients by the 72 gait features. Calculated in the same way as mentioned above, the accuracy of the SVM classifier was 81.2%. To know how the number of the features affects our classification, a forward feature selection algorithm was used in this study. By using the ReliefF algorithm, we found the best combination of kinematic features among all 72 features. As a result, we found the 10 features which showed the highest accuracy of 85.6% and were superior to classification with all 72 features (Table 4).

Accuracy results in classification of severity of knee osteoarthritis using SVM and SVM with feature selection algorithm classifier

As the same as above, we displayed machine learningbased classification through an SVM classifier in scatter plot form with two-dimensions. We also constructed a 4×4 confusion matrix with actual and predicted values, and visualized the performance of machine learningbased classification (Fig. 7).

Fig. 7.

The results of knee osteoarthritis severity classification using the SVM classifier as a confusion matrix. The number of correct predictions is colored green in 4×4 confusion matrix. SVM, support vector machine.

Regarding the feature selection process using the ReliefF algorithm, Fig. 8 shows the accuracy of each feature set and the highest accuracy of classification was found to be 85.6% when optimal 10 features were used. It is noteworthy that classifiers trained with a suitable number of features showed better performance in classification.

Fig. 8.

The accuracy of knee osteoarthritis severity classification according to the number of features selected by the ReliefF algorithm.

In addition, among the 72 gait features representing the kinematic data of subjects, the three most critical features in abnormal gait were listed by the level of their contributions. When classifying KOA patients and normal subjects, the primary contributions were left knee flexion and extension in the SW, secondary contributions were right knee flexion and extension in SW, and the tertiary contributions were ankle dorsiflexion and plantarflexion in the SS. When classifying mild, moderate, and severe KOA patients and normal subjects, the primary and secondary contributions were the same as those from the previous classification, and the tertiary contributions were internal and external knee rotation in the SS.

In the classification of KOA severity in this study, Fig. 9 presents the most crucial features in the sagittal plane of the knee.

Fig. 9.

The difference in waveform data according to the severity of knee osteoarthritis in the sagittal plane of the knee joint from one representative subject in each group.

In the logistic regression analysis for the classification of KOA patients and normal subjects, the final model included the maximum knee flexion angle of the right (OR=1.106; 95% CI, 1.038–1.179; p=0.002) and left (OR=1.110; 95% CI, 1.039–1.186; p=0.002) knee and cadence (OR=1.076; 95% CI, 1.017–1.138; p=0.010). The final logistic regression model explained 58.0% (Nagelkerke’s R2) of the variance in the classification of KOA patients and normal subjects and correctly classified 83.9% of cases (Table 5).

Logistic regression analysis with gait features and clinical factors

DISCUSSION

Today, as physicians have begun to recognize that KOA is caused by various factors including genetic, inflammatory, and mechanical processes, KOA, to a certain extent, is thought to be a preventable disease. Therefore, early intervention is required through accurate early diagnosis of KOA.

The severity of KOA can be assessed by radiologically (K-L grading system), arthroscopically (Collins system), and functionally (Western Ontario and McMaster Universities Osteoarthritis index) [23,24].

The most widely used method of radiologic grading and KOA identification is described by Kellgren and Lawrence as the K-L grading system [5,23]. Although it has been used for decades, debates regarding its limitations are ongoing. The K-L grading system has difficulty in evaluating the progress of disease and can be harmful due to the repetitive radiation exposure. Since many studies have adapted this classification in various ways, it can also be inconsistent with its original description [5,25]. Simultaneously, some previous studies have criticized the consistency of the pre-standardized classification of KOA. Schiphof et al. [26] performed an epidemiological cohort study from 1966 to 2005 to summarize the different descriptions of KOA classifications from previous studies. Remarkably, disagreement on the definition and grading system of KOA between some major OA cohort studies was shown despite their accordance with following the original K-L grading system. Some studies even had inconsistencies within themselves.

In the past few decades, the technology related to human motion analysis has shown remarkable advancements and as a result, our understanding of human body motion and the accuracy of gait analysis have significantly improved. Consequently, gait analysis can be utilized in the medical field and considered a clinically useful technique.

Recently, the effort to use gait analysis as a methodology for KOA research and actual clinical application has increased significantly [15]. Many studies on KOA using gait analysis investigated representative gait patterns of KOA patients and reported differences in gait parameters compared to normal subjects [10,11,27-29].

Mills et al. [30] conducted a systematic review and meta-analysis on the biomechanical deviations of KOA patients. It revealed spatiotemporal parameters as a great scale to determine the severity of KOA. Astephen et al. [31] implied that as the disease progressed, the role which the biomechanical factors played depended on the different stages of severity. Previous studies have reported the kinetics and kinematics for different stages of KOA. In terms of kinetics, Mundermann et al. [10] reported a sufficient hip adduction moment and reduced second peak knee adduction moment in less severe KOA patients, and reduced hip adduction moment and increased first peak knee adduction moment in more severe KOA patients considering the gait compensatory mechanism. Similarly, Chang et al. [32] reported that a greater hip abduction moment occurring during gait reduced the progression of ipsilateral medial KOA in relatively less advanced KOA patients. Meissier et al. [33] and Al-Zahrani and Bakheit [12] conducted studies in more severe KOA patients and reported significantly reduced ROM in the hip, knee, and ankle joints. They reported that these kinematic gait abnormalities observed in severe KOA patients were due to the instability of the knee joint, and rehabilitative interventions that improved knee stability enhanced the walking ability of severe KOA patients. KOA is a joint disease, which inevitably affects all lower limb joints including both sides of the hip, knee, and ankle. In addition, through analysis of the gait of KOA patients in a number of previous studies, the most significantly changed kinematic parameter was determined to be the knee flexion ROM [6,34]. Therefore, in this study, we proceeded using ROM data in the kinematic parameters.

However, previously used camera-based gait analysis cannot be easily performed due to challenges such as appropriate space, costs, and time required for testing [35]. It also requires complicated image processing and appropriate lighting conditions to operate normally [36]. To compensate for the limitations shown in camera-based methods, this study used a wireless form of wearable IMU. The strength of wearable IMU systems is that they are radiation-free, small-sized, and cost effective, and it made them to be widely used. Beange et al. [37] reported that the IMU system not only had a higher accuracy compared to Vicon but also was sufficient to be used in clinical settings.

Today, almost every part of our daily lives is affected by machine learning and big data. In general, clinicians can feel unfamiliar with big data and machine learning. However, machine learning is a similar method in principle to traditional statistics. Machine learning has emerged as an important and powerful technique in the medical field. By applying machine learning to clinical settings, we can expect advances at the level of global health. By developing a prognostic algorithm, better assessments can improve the care for patients with various diseases and thereby improve the prognosis. A large part of the role of anatomical pathologists and radiologists will be replaced by machine learning. Finally, through machine learning, diagnosis will become more consistent and accurate [38].

Shetty et al. [17] reported the successful classification of Parkinson’s disease from amyotrophic lateral sclerosis, Huntington’s disease, and healthy controls using a machine learning method. This study used a Gaussian radial basis function kernel based SVM classifier and achieved an overall classification accuracy of 83.33% after feature vector selection. Mezghani et al. [18] classified asymptomatic and KOA subjects. They analyzed gait patterns by measuring the 3D ground reaction force, and the data was classified using the nearest neighbor rule. Finally, with the wavelet decomposition using force components in the frontal and transverse planes, the accuracy of classification between the two groups was 94%.

In this study, using all 72 gait features for machine learning-based classification, the results revealed a classification accuracy of 92.8% between the KOA and control groups, and 81.2% among three groups according to the severity of KOA. Then, the process of feature selection needed to classify KOA patients according to the severity was performed by ReliefF algorithm, and we found that not all features were necessary to classify the severity of KOA. Consequently, when 10 optimal features were used in classification, the accuracy showed the highest value of 85.6%. These results suggest that the process of feature selection increased efficiency in data processing and the accuracy of classification by excluding unnecessary gait features. Therefore, the eventual reduction in the number of used features is helpful for optimizing machine learning algorithms and helping clinicians to easily understand the process. This improvement of accuracy via feature selection was supported by previous studies about classification of scoliosis and diagnosis of glaucoma [16,39].

From the gait analysis in this study, the data on kinematic and spatiotemporal parameters were gathered from the sensors attached to the patients. Based on the data, dynamic changes of the joint angle according to the four segmented stages (IDS, SS, TDS, and SW) are described in Fig. 2.

As previous studies that incorporated gait analysis into KOA assessment have mainly used conventional statistics such as t-tests and regression analyses [40], we also conducted a logistic regression analysis as a conventional medical statistical method to compare the results with the machine learning-based method for the classification of KOA patients and normal subjects. From the logistic regression analysis using both the demographic characteristics and spatiotemporal parameters, we found the final model with the maximum knee flexion angle and cadence. This final logistic regression model correctly classified 83.9% of cases, which was lower than the machine learning-based method which correctly classified 92.8% of cases. Even though the independent variables used in the two methods were not identical to each other, we could acquire the additional evidence of methodological validity in the machine learning-based classification method.

There are several limitations in our study. First, although gait speed is known to affect gait kinematics, we conducted this study without controlling for gait speed. We thought that instead of controlling gait speed by performing gait trials on a treadmill, it was more important to analyze the subjects’ gait patterns in their daily lives under the same experimental conditions. Second, this study failed to control for the effects of pain which are one of the most common symptoms of KOA patients and can also affect the kinematics of gait. Third, we did not perform radiographic evaluation of the normal control group because of the risk of radiation exposure. Thus, we could not compare radiologic findings between KOA patients and normal subjects. Fourth, in this study, we conducted the study based only on the K-L grade of the affected limb which was defined as the more symptomatic side, and patients with symmetric severity of KOA were excluded from the study. In the future, it would be a good idea to collect enough study subjects and conduct research on patients with symmetric severity of KOA. Fifth, although we were able to achieve considerable reliability in the machine learning-based classification of KOA patients, research on either modification of abnormal gait patterns or future treatment was not included in this study. Therefore, the study on these aspects needs to be conducted afterwards.

In conclusion, this study proposed a new KOA classification method using machine learning for KOA patients and normal subjects and compared its performance with the conventional statistical processing technique. We recognized that the machine learning-based method could complement conventional logistic regression analysis in the classification of KOA patients and normal subjects. Furthermore, the machine learning technique is thought to be able to overcome a clinician’s lack of access due to limitations on the complexity and massive amount of data presented in gait analysis. In addition, we expect that the results of this study can be used for the diagnosis of KOA patients and rehabilitative intervention through gait correction.

Acknowledgements

This work was supported by the Medical Device Technology Development Program (No. 201900000002517, Development of patient custom active rehabilitation total solution) funded by the Ministry of Trade, Industry and Energy (MOTIE) and by Institute for Information & communications Technology Promotion (IITP) grant funded by the Korean government (MSIT) (No. 2017-0-01800, Development of AR sports training platform based recognition technology on smart glass).

Notes

No potential conflict of interest relevant to this article was reported.

Conceptualization: Jang SH, Park JH, Cho J. Methodology: Jang SH, Cho J. Formal analysis: Yang JH, Park JH, Cho J. Project administration: Yang JH, Jang SH, Cho J. Visualization: Yang JH, Cho J. Writing – original draft: Yang JH. Writing – review and editing: Yang JH, Jang SH, Cho J. Approval of final manuscript: all authors.

Supplementary Materials

Supplementary materials can be found via http://doi.org/10.5535/arm.20071.

Table S1.

Results of the basic gait parameters for each group

arm-20071-suppl1.pdf
Table S2.

The ROM measurements of 72 gait features for each group (unit: º)

arm-20071-suppl2.pdf

References

1. Arden N, Nevitt MC. Osteoarthritis: epidemiology. Best Pract Res Clin Rheumatol 2006;20:3–25.
2. Hunter DJ, Bierma-Zeinstra S. Osteoarthritis. Lancet 2019;393:1745–59.
3. Wallace IJ, Worthington S, Felson DT, Jurmain RD, Wren KT, Maijanen H, et al. Knee osteoarthritis has doubled in prevalence since the mid-20th century. Proc Natl Acad Sci U S A 2017;114:9332–6.
4. Samuel AJ, Kanimozhi D. Outcome measures used in patient with knee osteoarthritis: with special importance on functional outcome measures. Int J Health Sci (Qassim) 2019;13:52–60.
5. Kohn MD, Sassoon AA, Fernando ND. Classifications in brief: Kellgren-Lawrence classification of osteoarthritis. Clin Orthop Relat Res 2016;474:1886–93.
6. Deluzio KJ, Astephen JL. Biomechanical features of gait waveform data associated with knee osteoarthritis: an application of principal component analysis. Gait Posture 2007;25:86–93.
7. Astephen JL, Deluzio KJ, Caldwell GE, Dunbar MJ. Biomechanical changes at the hip, knee, and ankle joints during gait are associated with knee osteoarthritis severity. J Orthop Res 2008;26:332–41.
8. Zeni JA Jr, Higginson JS. Differences in gait parameters between healthy subjects and persons with moderate and severe knee osteoarthritis: a result of altered walking speed? Clin Biomech (Bristol, Avon) 2009;24:372–8.
9. Favre J, Jolles BM. Gait analysis of patients with knee osteoarthritis highlights a pathological mechanical pathway and provides a basis for therapeutic interventions. EFORT Open Rev 2017;1:368–74.
10. Mundermann A, Dyrby CO, Andriacchi TP. Secondary gait changes in patients with medial compartment knee osteoarthritis: increased load at the ankle, knee, and hip during walking. Arthritis Rheum 2005;52:2835–44.
11. Kaufman KR, Hughes C, Morrey BF, Morrey M, An KN. Gait characteristics of patients with knee osteoarthritis. J Biomech 2001;34:907–15.
12. Al-Zahrani KS, Bakheit AM. A study of the gait characteristics of patients with chronic osteoarthritis of the knee. Disabil Rehabil 2002;24:275–80.
13. Lewek MD, Scholz J, Rudolph KS, Snyder-Mackler L. Stride-to-stride variability of knee motion in patients with knee osteoarthritis. Gait Posture 2006;23:505–11.
14. Astephen Wilson JL, Deluzio KJ, Dunbar MJ, Caldwell GE, Hubley-Kozey CL. The association between knee joint biomechanics and neuromuscular control and moderate knee osteoarthritis radiographic and pain severity. Osteoarthritis Cartilage 2011;19:186–93.
15. Elbaz A, Mor A, Segal G, Debi R, Shazar N, Herman A. Novel classification of knee osteoarthritis severity based on spatiotemporal gait analysis. Osteoarthritis Cartilage 2014;22:457–63.
16. Cho JS, Cho YS, Moon SB, Kim MJ, Lee HD, Lee SY, et al. Scoliosis screening through a machine learning based gait analysis test. Int J Precis Eng Manuf 2018;19:1861–72.
17. Shetty S, Rao YS. SVM based machine learning approach to identify Parkinson’s disease using gait analysis. In : Proceedings of 2016 International Conference on Inventive Computation Technologies (ICICT); 2016 Aug 26-27; Coimbatore, India.
18. Mezghani N, Husse S, Boivin K, Turcot K, Aissaoui R, Hagemeister N, et al. Automatic classification of asymptomatic and osteoarthritis knee gait patterns using kinematic data features and the nearest neighbor classifier. IEEE Trans Biomed Eng 2008;55:1230–2.
19. Burges CJ. A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 1998;2:121–67.
20. Aminian K, Najafi B, Bula C, Leyvraz PF, Robert P. Spatio-temporal parameters of gait measured by an ambulatory system using miniature gyroscopes. J Biomech 2002;35:689–99.
21. Cho YS, Jang SH, Cho JS, Kim MJ, Lee HD, Lee SY, et al. Evaluation of validity and reliability of inertial measurement unit-based gait analysis systems. Ann Rehabil Med 2018;42:872–83.
22. Allseits E, Lucarevic J, Gailey R, Agrawal V, Gaunaurd I, Bennett C. The development and concurrent validity of a real-time algorithm for temporal gait analysis using inertial measurement units. J Biomech 2017;55:27–33.
23. Lespasio MJ, Piuzzi NS, Husni ME, Muschler GF, Guarino A, Mont MA. Knee osteoarthritis: a primer. Perm J 2017;21:16–183.
24. Pritzker KP, Gay S, Jimenez SA, Ostergaard K, Pelletier JP, Revell PA, et al. Osteoarthritis cartilage histopathology: grading and staging. Osteoarthritis Cartilage 2006;14:13–29.
25. Spector TD, Cooper C. Radiographic assessment of osteoarthritis in population studies: whither Kellgren and Lawrence? Osteoarthritis Cartilage 1993;1:203–6.
26. Schiphof D, Boers M, Bierma-Zeinstra SM. Differences in descriptions of Kellgren and Lawrence grades of knee osteoarthritis. Ann Rheum Dis 2008;67:1034–6.
27. Baliunas AJ, Hurwitz DE, Ryals AB, Karrar A, Case JP, Block JA, et al. Increased knee joint loads during walking are present in subjects with knee osteoarthritis. Osteoarthritis Cartilage 2002;10:573–9.
28. McKean KA, Landry SC, Hubley-Kozey CL, Dunbar MJ, Stanish WD, Deluzio KJ. Gender differences exist in osteoarthritic gait. Clin Biomech (Bristol, Avon) 2007;22:400–9.
29. Kiss RM. Effect of severity of knee osteoarthritis on the variability of gait parameters. J Electromyogr Kinesiol 2011;21:695–703.
30. Mills K, Hunt MA, Ferber R. Biomechanical deviations during level walking associated with knee osteoarthritis: a systematic review and meta-analysis. Arthritis Care Res (Hoboken) 2013;65:1643–65.
31. Astephen JL, Deluzio KJ, Caldwell GE, Dunbar MJ, Hubley-Kozey CL. Gait and neuromuscular pattern changes are associated with differences in knee osteoarthritis severity levels. J Biomech 2008;41:868–76.
32. Chang A, Hayes K, Dunlop D, Song J, Hurwitz D, Cahue S, et al. Hip abduction moment and protection against medial tibiofemoral osteoarthritis progression. Arthritis Rheum 2005;52:3515–9.
33. Messier SP, Loeser RF, Hoover JL, Semble EL, Wise CM. Osteoarthritis of the knee: effects on gait, strength, and flexibility. Arch Phys Med Rehabil 1992;73:29–36.
34. McCarthy I, Hodgins D, Mor A, Elbaz A, Segal G. Analysis of knee flexion characteristics and how they alter with the onset of knee osteoarthritis: a case control study. BMC Musculoskelet Disord 2013;14:169.
35. Simon SR. Quantification of human motion: gait analysis-benefits and limitations to its application to clinical problems. J Biomech 2004;37:1869–80.
36. Ahmed H, Tahir M. Improving the accuracy of human body orientation estimation with wearable IMU sensors. IEEE Trans Instrum Meas 2017;66:535–42.
37. Beange KH, Chan AD, Graham RB. Evaluation of wearable IMU performance for orientation estimation and motion tracking. In : Proceedings of 2018 IEEE International Symposium on Medical Measurements and Applications (MeMeA); 2018 Jun 11-13; Rome, Italy.
38. Obermeyer Z, Emanuel EJ. Predicting the future: big data, machine learning, and clinical medicine. N Engl J Med 2016;375:1216–9.
39. Chan K, Lee TW, Sample PA, Goldbaum MH, Weinreb RN, Sejnowski TJ. Comparison of machine learning and traditional classifiers in glaucoma diagnosis. IEEE Trans Biomed Eng 2002;49:963–74.
40. Marriott KA, Birmingham TB, Leitch KM, Pinto R, Giffin JR. Strong independent associations between gait biomechanics and pain in patients with knee osteoarthritis. J Biomech 2019;94:123–9.

Article information Continued

Fig. 1.

A hardware of the gait analysis system consisting of PC and wireless IMU sensors (A), and gait trial of knee osteoarthritis patient (B) and normal subject (C). IMU, inertial measurement unit.

Fig. 2.

The gait waveform data of one representative normal subject’s lower limb was divided and expressed by each segmented gait phase.

Fig. 3.

Conceptualization of the support vector machine (SVM). The purpose of the SVM is to identify a hyperplane that precisely divides the data points. A hyperplane is a decision boundary that segregates data into classes. The data points close to the hyperplane are support vectors, and these affect the position of the hyperplane.

Fig. 4.

Flow diagram for the classification algorithm of knee osteoarthritis patients. SVM, support vector machine.

Fig. 5.

Flow diagram of feature selection for improving the SVM classification of knee osteoarthritis severity. SVM, support vector machine; OA, osteoarthritis.

Fig. 6.

The results of classification of knee osteoarthritis (OA) patients and normal subjects using an SVM classifier as a confusion matrix. The number of correct predictions is 75 and 93 for true positive and negative cases, respectively; and the number of incorrect predictions is 4 and 9 for false positive and negative cases, respectively. SVM, support vector machine.

Fig. 7.

The results of knee osteoarthritis severity classification using the SVM classifier as a confusion matrix. The number of correct predictions is colored green in 4×4 confusion matrix. SVM, support vector machine.

Fig. 8.

The accuracy of knee osteoarthritis severity classification according to the number of features selected by the ReliefF algorithm.

Fig. 9.

The difference in waveform data according to the severity of knee osteoarthritis in the sagittal plane of the knee joint from one representative subject in each group.

Table 1.

Characteristics of patients with knee osteoarthritis and control subjects

Control group (n=97) Osteoarthritis group
Mild (n=28) Moderate (n=31) Severe (n=25)
Age (yr) 62.1±7.7 64.7±7.5 69.3±9.5 71.1±6.4
Height (cm) 158.3±12.2 157.5±7.0 159.3±7.2 159.5±9.4
Weight (kg) 59.40±9.37 61.10±10.30 62.20±10.90 60.50±9.50
BMI (kg/m2) 22.75±3.70 24.50±0.70 24.30±8.30 23.10±2.70

Values are presented as mean±standard deviation.

BMI, body mass index.

Table 2.

Kinematic gait parameters used to classify osteoarthritis patients

Feature ID Feature name (left limb) Feature ID Feature name (right limb)
1 Hip flexion extension in IDS 37 Hip flexion extension in IDS
2 Hip flexion extension in SS 38 Hip flexion extension in SS
3 Hip flexion extension in TDS 39 Hip flexion extension in TDS
4 Hip flexion extension in SW 40 Hip flexion extension in SW
5 Hip adduction abduction in IDS 41 Hip adduction abduction in IDS
6 Hip adduction abduction in SS 42 Hip adduction abduction in SS
7 Hip adduction abduction in in TDS 43 Hip adduction abduction in in TDS
8 Hip adduction abduction in SW 44 Hip adduction abduction in SW
9 Hip internal external rotation in IDS 45 Hip internal external rotation in IDS
10 Hip internal external rotation in SS 46 Hip internal external rotation in SS
11 Hip internal external rotation in in TDS 47 Hip internal external rotation in in TDS
12 Hip internal external rotation in SW 48 Hip internal external rotation in SW
13 Knee flexion extension in IDS 49 Knee flexion extension in IDS
14 Knee flexion extension in SS 50 Knee flexion extension in SS
15 Knee flexion extension in TDS 51 Knee flexion extension in TDS
16 Knee flexion extension in SW 52 Knee flexion extension in SW
17 Knee varus valgus in IDS 53 Knee varus valgus in IDS
18 Knee varus valgus in SS 54 Knee varus valgus in SS
19 Knee varus valgus in in TDS 55 Knee varus valgus in in TDS
20 Knee varus valgus in SW 56 Knee varus valgus in SW
21 Knee internal external rotation in IDS 57 Knee internal external rotation in IDS
22 Knee internal external rotation in SS 58 Knee internal external rotation in SS
23 Knee internal external rotation in in TDS 59 Knee internal external rotation in in TDS
24 Knee internal external rotation in SW 60 Knee internal external rotation in SW
25 Ankle dorsiflexion plantarflexion in IDS 61 Ankle dorsiflexion plantarflexion in IDS
26 Ankle dorsiflexion plantarflexion in SS 62 Ankle dorsiflexion plantarflexion in SS
27 Ankle dorsiflexion plantarflexion in TDS 63 Ankle dorsiflexion plantarflexion in TDS
28 Ankle dorsiflexion plantarflexion in SW 64 Ankle dorsiflexion plantarflexion in SW
29 Ankle inversion eversion in IDS 65 Ankle inversion eversion in IDS
30 Ankle inversion eversion in SS 66 Ankle inversion eversion in SS
31 Ankle inversion eversion in in TDS 67 Ankle inversion eversion in in TDS
32 Ankle inversion eversion in SW 68 Ankle inversion eversion in SW
33 Ankle internal external rotation in IDS 69 Ankle internal external rotation in IDS
34 Ankle internal external rotation in SS 70 Ankle internal external rotation in SS
35 Ankle internal external rotation in in TDS 71 Ankle internal external rotation in in TDS
36 Ankle internal external rotation in SW 72 Ankle internal external rotation in SW

IDS, initial double support phase; SS, single support phase; TDS, terminal double support phase; SW, swing phase.

Table 3.

Accuracy results in classification of knee osteoarthritis and normal subjects using SVM

Classifier Accuracy (%) Specificity (%) Sensitivity (%)
SVM 92.8 94.9 91.2

SVM, support vector machine.

Table 4.

Accuracy results in classification of severity of knee osteoarthritis using SVM and SVM with feature selection algorithm classifier

Classifier Accuracy (%) PPV (%) Sensitivity (%)
SVM 81.2 79.7 70.4
SVM with feature selection algorithm 85.6 84.7 81.7

SVM, support vector machine; PPV, positive predictive value.

Table 5.

Logistic regression analysis with gait features and clinical factors

Regression coefficient Standard error p-value OR 95% CI of OR
Lower Upper
Maximum flexion angle
 Right knee 0.101 0.032 0.002* 1.106 1.038 1.179
 Left knee 0.104 0.034 0.002* 1.110 1.039 1.186
Cadence 0.073 0.029 0.010* 1.076 1.017 1.138
Constant -17.911 3.626 0.000* 0.000 - -

OR, odds ratio; CI, confidence interval.

*

p<0.05.