INTRODUCTION
Preterm infants (born before 37 weeks gestational age) are known to be at high risk for neurodevelopmental delay [
1]. These infants have significantly less developed executive functioning, smaller receptive and expressive language lexicons, and weaker motor skills as compared with full-term infants [
2]. The importance of early neurodevelopmental assessment for preterm infants has been highlighted because delayed intervention for neurodevelopmental delay may result in high rates of cognitive impairment, motor impairment, learning disabilities, and the need for special education placement [
3].
The most widely used developmental assessment scheme is the Bayley Scales of Infant Development (BSID). The first two editions of the scales produce 2 scores: the Mental Developmental Index (MDI) was used to assess cognitive and language development, and the Psychomotor Developmental Index (PDI) to assess motor skills. The revised Bayley Scales of Infant and Toddler Development, third edition (Bayley-III, 2006) separated the MDI into distinct cognitive, receptive language, and expressive language scales, and the PDI into fine motor and gross motor scales [
4].
Although the new edition provides indices of performance over separable domains, concerns have arisen regarding the interpretation of scores. Recent studies in the United States, United Kingdom, and Australia have reported that BSID-III assessments significantly underestimate the rate of developmental delay in preterm and full-term infants [
5,
6,
7].
This study aims to investigate neurodevelopmental assessment of Korean preterm infants using Bayley-III scales. We also explore within-group patterns of performance and correlation between the perinatal risk factors and the Bayley-III scores.
MATERIALS AND METHODS
Study design
The study is a retrospective chart review of inpatient and outpatient medical records.
Study population
We identified all preterm infants who visited the Department of Physical Medicine and Rehabilitation and the Department of Pediatrics, Soonchunhyang University Hospital for assessment of developmental delay from July 1, 2014 to June 30, 2016. A total of 120 preterm infants screened for developmental status using Bayley-III scales. The study population consisted of 18 extremely preterm (gestational age [GA] <28 weeks), 29 very preterm (28 weeks ≤ GA < 32 weeks), and 73 moderate to late preterm (32 weeks ≤ GA < 37 weeks) infants. We divided preterm infants into extremely preterm group (n=18) and very/moderate to late preterm group (n=102). Although there was no exclusion based on perinatal characteristics, infants lacking medical records were excluded. This study was approved by the Soonchunhyang University Cheonan Hospital Institutional Review Board (No. 2016-08-009).
Perinatal and demographic data collection
Inpatient and outpatient charts were reviewed to determine demographic and perinatal characteristics, such as GA, corrected age at test, birthweight, sex, neonatal medical problems, radiologic findings, and prenatal maternal problem. Infants with very low birth weight less than 1,500 g were separated from the others. Neonatal medical problems were classified as pulmonary disease (respiratory distress syndrome, hyaline membrane disease, meconium aspiration, bronchopulmonary dysplasia, pneumonia, etc), cardiac disease (tetralogy of Fallot, atrial septal defect, ventricular septal defect, patent foramen ovale, etc.), seizure, and others. Prenatal maternal problems were classified as intrauterine problem (premature rupture of membranes, placenta previa, incompetent internal os of cervix, etc), maternal medical disease (hypertension, diabetes mellitus, hypothyroidism, etc.) and others. Radiologic abnormalities in brain magnetic resonance imaging (MRI) such as periventricular leukomalacia, intraventricular hemorrhage, or cerebral infarction were considered as perinatal risk factors. We then analyzed the relationship between perinatal risk factors and Bayley-III scores.
Outcome measures
Neurodevelopmental outcome was assessed at 1–48 months of corrected GA using the Bayley-III scales administered by a licensed occupational therapist. The examiner was rigorously trained prior to conducting the study as recommended in the guidelines of Bayley-III. The assessment consists of five scales: cognition, receptive language, expressive language, gross motor, and fine motor. The language composite scores consist of the combination of expressive and receptive language subscales; motor composite scores consist of gross and fine motor subscales [
4]. The five subscales were used independently as they identify differences in the language and motor scales which are not evident with the composite scores. The rates of developmental delay were determined using the Bayley-III test norms, indicating mild, moderate and severe delay, respectively [
4]. We compared the Bayley-III mean scores and rate of infants showing delay for extremely preterm and very/moderate to late preterm groups.
Statistical analysis
All statistical analyses were performed using SPSS ver. 14.0 (SPSS Inc., Chicago, IL, USA) and R ver. 3.1.3 (The R Foundation for Statistical Computing, Vienna, Austria). Demographic characteristics were described by mean±standard deviation for continuous variables using an independent t-test and frequency (percentage, %) for categorical variables using a chi-square test or Fisher exact test. A Mann-Whitney U-test was used to compare the Bayley-III scores for the extremely preterm and very/moderate to late preterm groups. Fisher exact test was used to compare the rates of mild, moderate, and severe delay for both groups. The relationship between perinatal risk factors and Bayley-III scores was calculated. We used an independent t-test for comparing birthweight groups and a Kruskal-Wallis test for comparing radiologic findings and medical problems. A p-value of less than 0.05 was considered statistically significant.
DISCUSSION
Understanding neurodevelopmental patterns is necessary since sensitive identification of cognitive, language, and motor delay is required for enrollment in early. Previous research has revealed that preterm infants demonstrate cognitive, language, and motor delays during the first postnatal year and into adolescence [
2,
8,
9,
10]. Our study examined neurodevelopmental patterns of preterm infants using the Bayley-III scales to quantify differences between extremely preterm and very/moderate to late preterm infants in a Korean-based population.
Previous studies show that extremely preterm infants have significantly lower function levels in cognitive, language, and motor areas compared with full-term infants [
11,
12]. We subdivided the preterm group into an extremely preterm group and a very/moderate to late preterm group. We investigated differences between the groups, unlike previous studies which analyzed extremely preterm or preterm infants separately while comparing with a full-term or control group. In the present study there is no significant difference of mean scores for all domains between the extremely preterm group and the very/moderate to late preterm group, although the frequency of infants showing developmental delay in motor composite scores was significantly higher in the extremely preterm group. However, this is less meaningful due to the small sample size of the extremely preterm group (n=18).
The Bayley-III scales were developed and standardized in 2006 using a random sample of 1,700 American children. The sample was reported to be representative of the 2000 US Bureau of the Census population survey data in terms of parent education, ethnicity, and geographic region [
4]. It seems doubtful that US norms would be adequate in populations with different characteristics and languages, leading to potential inaccuracies in identification of referral thresholds. Therefore, we compared the Bayley-III performance of Korean preterm infants with that of US preterm infants. In a retrospective cohort of 85 preterm infants in United States, mean Bayley-III cognitive, language, and motor composite scores were 102±15, 91±15 and 94±17. In patterns of Bayley-III performance, mean language index and motor index were significantly lower than the cognitive index. For the majority (53%) of preterm infants, language development was the weakest domain; for another 39%, motor skill was the weakest area of development [
13]. In our study, mean cognitive, language, and motor composite scores were 88.8±18.6, 92.0±17.5, and 91.8±23.0. The proportions of cognitive, language, and motor delay in the preterm group were 38.3%, 26.7%, and 35.0%, respectively. Cognitive development was the most frequently delayed domain. Although we could not statistically analyze the Bayley-III scales difference among US preterm infants and the preterm infants enrolled in our study, cognitive composite scores seemed to be relatively lower in our study, compared to the US study. Previous studies in other countries showed some differences between children of their country and US children in Bayley-III performance. For example, Taiwanese children had lower scores on the Bayley cognitive and motor scales when compared to the US norms at 6 and 24 months [
14]. A Dutch cohort study revealed that the Dutch norms differ from the US norms for all subtests, therefore population specific norms appear necessary [
15]. Hence it is important to establish normative data for Korean children and further large cohort study is needed to compare the Bayley-III scales of Korean preterm or full-term infants with those of US infants in order to establish appropriate norms of the Bayley-III scales in Korea.
The Bayley-III scales assessment has been reported to underestimate rates of delay [
6]. Recently published cohort studies reporting preterm outcomes confirmed that if the performance of the preterm infants had been compared with the normative means, there would have been misleading frequencies of delay [
5,
7]. Although we could not enroll control group participants due to the characteristics of a retrospective study, doing so might have contributed no significant effect on the interpretation of our results. Most previous studies in Korea examining developmental outcomes have used the Bayley-II scales until Kim and Shin [
16] recently investigated developmental assessment using the Bayley-III scales. The Bayley-III scales demonstrated strong correlations with the Korean Infant and Child Development Test (KICDT), except in the social-emotional category. Further study is needed to assess the validity of the Bayley-III scales for Korean infants.
The present study also examined correlation between the perinatal risk factors and the Bayley-III subtest scores. Previous research has revealed that very low birth weight [
5], low Apgar score [
17], abnormal spike waves in electroencephalography [
18], and abnormal findings in brain MRI [
19,
20,
21] were related to poor outcomes in neurodevelopment. In addition to the severity of periventricular leukomalacia in brain MRI [
20], the degree of intraventricular hemorrhage [
21] was consistently related to neurodevelopmental outcome. Our results showed that very low birthweight (<1,500 g) was significantly related to lower cognitive, language, and motor composite scores, and abnormal findings in brain MRI significantly predicted lower motor composite scores. Medical problems such as pulmonary disease, seizure, cardiac disease, and others were not significantly related to Bayley-III scales (
Table 4). Future investigations should include detailed socio-demographic variables including parental education and economic status.
This study has several limitations. First, the subjects were recruited from a tertiary care hospital in a neonatal clinical setting, meaning our results may not be appropriate to the general population. Second, instead of control group performance values we used the test norms as reference values since this retrospective study had no inclusion of a contemporary control group. Third, the sample of the extremely preterm group was too small to analyze group. However, there are some strengths of this study. Our study had a relatively large sample size in comparison to a previous study on Korean infants [
16]. Also, this study targeted preterm infants to evaluate using the Bayley-III scales in Korea.
In conclusion, we assessed neurodevelopmental patterns of both extremely preterm and very/moderate to late preterm infants using the Bayley-III scales. Contrary to US studies, cognitive development was the most frequently delayed domain in our study. No significant difference in mean scores was observed between the extremely preterm group and the very/moderate to late preterm group. However, the frequency of infants showing developmental delay in motor composite scores was significantly higher in the extremely preterm group, but this is less meaningful due to the small sample size. Very low birthweight (<1,500 g) was significantly related to lower cognitive, language, and motor composite scores, and abnormal findings in brain MRI significantly predicted lower motor composite scores. Further prospective studies including large enough sample size and a control group are needed.