What is the clinical evidence from this study?

Study design: Cross-Sectional. Population: Healthy volunteers (n=100). Intervention: COMPASS-31 questionnaire. Primary outcome: Rasch model fit and person reliability.

What does this research mean for the field?

The Dutch version of the COMPASS-31 demonstrates low person reliability and multidimensionality in an online volunteer population, indicating it is not a unidimensional measure and should not be used to produce a single total score. Novelty: ClaimNovelty.CONFIRMATORY. Consensus alignment: ConsensusAlignment.NEUTRAL.

What question did this study set out to answer?

To analyze the Dutch version of the COMPASS-31 using classical test theory and Rasch analysis in a different population compared to previous studies.

May 15, 2026Open Access

Rasch Analysis of the Composite Autonomic Symptom Score (COMPASS-31) From Data Collected in an Online Population

Key Result

The Dutch version of the COMPASS-31 demonstrated low person reliability (below 0.8) and multidimensionality in an online volunteer population, indicating it is not a unidimensional measure.

Key Points

To analyze the Dutch version of the COMPASS-31 using classical test theory and Rasch analysis in a different population compared to previous studies.
Transversal observational study with 100 volunteers completing the COMPASS-31 online.
Data analyzed using IBM SPSS Statistics and Winsteps for Rasch analysis.
Assessment of psychometric properties including reliability across COMPASS-31 domains.
Cronbach's alpha below 0.65 for most domains, with only three domains above this threshold.
Person reliability below 0.8, indicating insufficient reliability for decision-making.
Only the Pupillomotor domain differentiated between the general population and clinical groups.

Study Design

Type

Cross-Sectional (n=100)

Structured PICO

Population

100 volunteers from a Dutch internet panel (general population), 42 females, 58 males, age groups: 18-29 years (37%), 30-44 years (22%), 45-60 years (31%), and over 60 years (10%).

Intervention

Completion of the Dutch version of the COMPASS-31 online

Outcome

Psychometric properties of the COMPASS-31 using classical test theory and Rasch analysispatient reported

The COMPASS-31 questionnaire functions better as a descriptive screening tool rather than a unidimensional measure, and its total score should not be used as a single metric in the general population.

Limitations

Moderate sample size (N=100) is at the lower end of acceptability for domains with many items
Online volunteer population may not accurately represent the general or clinical populations
Absence of clinical data on participants' health status to confirm dysautonomia
Low symptom prevalence and limited variability may have influenced reliability and dimensionality findings

Abstract

Background: A Dutch internet panel with volunteers was assessed using the patient-reported outcome measure known as the Composite Autonomic Symptom Score (COMPASS-31). Objectives: This study aimed to analyze the Dutch version of the COMPASS-31 using classical test theory and Rasch analysis, or one-parameter item response theory, to examine the questionnaire's structure for the second time in a different population. The first group consisted of clinical patients, while the second group consisted of the general population. The choice of the general population contrasts with most validation studies, which focus on clinical groups, as in our first study. Methods: A transversal observational study of a cohort of 100 people voluntarily completed the Dutch version of the COMPASS-31 online. The sample population comprised 100 persons (42 females, 58 males). The age groups and their percentages were as follows: 18-29 years, 37%; 30-44 years, 22%; 45-60 years, 31%; and over 60 years, 10%. Classical test theory was performed after importing the data into IBM SPSS Statistics for Windows, Version 31.0 (IBM Corp., Armonk, New York, United States). Six Rasch analyses were completed for each domain of the COMPASS-31 using the Winsteps measurement software (Winsteps Rasch Measurement Version 5.10.0). Results: The COMPASS-31 domains performed moderately in a group of adult volunteers. Cronbach's alpha across the domains was below 0.65, except for the Gastrointestinal, Bladder, and Pupillomotor domains, which were 0.65 or above. The person's reliability is below 0.8, the minimum level required for serious decision-making. Item reliability was good, except for the Bladder domain, which had a reliability of zero. Only the Pupillomotor domain can differentiate between the two population groups. Not all items in the Vasomotor and Gastrointestinal domains measure a single unidimensional variable, one of the prerequisites for a Rasch model. The probability curve for different domains performed adequately. Patients were able to distinguish between four levels of items. Some domains included categories that were too broad. Conclusion: This study, conducted with a different group of internet volunteers, offered several valuable insights into the psychometric properties of the COMPASS-31 and its domains. Our data again did not align with the Rasch model. The analysis failed to confirm the scale's reliability, as the domains could differentiate only one or two levels of autonomic symptoms in adults. The person's reliability is below 0.8, which is the minimum threshold needed for meaningful decision-making. Only the Pupillomotor domain can distinguish between the two groups, while the other domains identify only one of them. The COMPASS-31 is a screening tool or descriptive profile, not a unidimensional measure designed to produce a single, interval-level score for any domain. Based on our findings, we recommend against using the total score as a single metric. The domains should serve as independent, ordinal indices. Item banks for computer adaptive testing (CAT) should be developed specifically for each domain.

Bookmark

View Full Paper