901 Observations: What the DLI Dataset Shows at Scale
The Decision Load Index dataset started with 47 participants in the 7-day retest cohort. That sample was sufficient to test instrument stability and explore early patterns. It was not sufficient to say anything reliable about the shape of the distribution — how common high-load states are relative to low-load states, which occupational categories cluster at the extremes, or whether the retest findings would hold at larger N.
At 901 participants, some of those questions have clearer answers. This field note describes what the additional scale reveals. It also describes what it does not resolve.
What 47 Participants Showed
At 47 participants, the DLI distribution appeared roughly normal. No strong skew was visible. The 7-day retest finding was the most reliable result from that cohort: 81% of participants showed stable scores (within one score band) across the two administrations. The 19% who showed meaningful change shared three behavioral patterns — externalizing open loops into a trusted capture system, reducing context-switch frequency, and reducing unresolved communication threads.
The ADHD subgroup finding from the early cohort was unexpected: ADHD-identified participants showed both higher baseline scores and higher retest stability than the general population, suggesting the DLI captures something more durable than mood.
What 901 Participants Add
Distribution shape
The right-skewed distribution is the most significant observation. A normal distribution of decision load scores would suggest that high-load states are unusual — as many people in the low range as in the high range, with most clustering near the center. A right-skewed distribution suggests the opposite: extreme high-load states are disproportionately represented relative to extreme low-load states.
This is consistent with what asymmetric work environments would predict. Knowledge work creates decision load through accumulation — open loops multiply faster than they resolve when context-switching is high. Low-load states are achievable but require deliberate structural choices. High-load states emerge by default in most knowledge work environments.
Occupational clustering at the high end
The top 10% of DLI scores in the dataset cluster in three occupational categories: managers with more than five direct reports, solo founders, and doctoral students. This is consistent with earlier field notes examining each group separately. What the larger dataset adds is the relative concentration: these three categories appear disproportionately in the high-load tail compared to their representation in the overall dataset.
This is observational and self-reported. Occupational category is coarse-grained. The finding is a pattern worth tracking, not a reliable effect size.
Retest stability at scale
The 81% stability finding from 47 participants held in the extended dataset: approximately 80% of participants who completed a second assessment showed scores within one band of their initial score. This consistency across a larger and more diverse sample provides moderate confidence that the DLI is measuring something structural rather than a transient state.
If 7-day stability were primarily driven by mood, we would expect much lower consistency — mood varies substantially day-to-day and week-to-week. The stability pattern is more consistent with measurement of a behavioral or environmental characteristic that changes slowly.
The 22% improver pattern
In the early cohort, 19% of retested participants showed meaningful improvement. In the extended dataset, approximately 22% of those who retested showed score reduction that exceeded the measurement noise threshold. This is a small increase, but it is directionally consistent.
The behavioral pattern among improvers remains the same across both cohort sizes: externalization of open loops into a trusted capture system appears most frequently in participant-reported behavior changes between assessments. This is not a controlled finding — participants self-reported behavior change, and the capture system was not specified or verified. But the consistency of the pattern across sample sizes is notable.
| Finding | N=47 (7-day retest cohort) | N=901 (full dataset) |
|---|---|---|
| Distribution shape | Appeared roughly normal | Right-skewed (high-load tail) |
| 7-day retest stability | 81% within one band | ~80% (extended dataset) |
| Improver rate (retest) | 19% | ~22% |
| High-load tail occupations | Too small to identify | Managers (>5 reports), founders, PhD students |
| High:low outlier ratio | Not estimable | ~3:1 (high-load more frequent) |
What This Does Not Resolve
Dataset limitations
The dataset is observational. Participants self-selected to complete the DLI, which introduces selection bias toward people who are curious about their cognitive load — likely not a representative sample of the general knowledge worker population.
Occupational categories are self-reported and coarse-grained. The clustering at the high-load tail may reflect selection bias as much as occupational reality.
The dataset does not include longitudinal tracking of the same individuals over time (beyond the 7-day retest). Whether high-load states persist, resolve, or worsen over months is unknown.
The dataset is not clinical. DLI scores do not diagnose anything. High scores indicate reported cognitive burden, not burnout, psychological disorder, or performance impairment.
The right-skewed distribution is a descriptive finding about this dataset. It does not establish that knowledge work in general produces more high-load than low-load states. The sample is not population-representative.
What the Scale Adds to the Instrument
At 901 observations, the DLI has sufficient data to begin examining subgroup patterns that were not visible at smaller N. The occupational clustering and distribution shape are findings that emerge from scale, not from the instrument design. They inform where to look next rather than confirming mechanistic claims.
The retest stability holding across a larger and more diverse sample is the most methodologically meaningful finding. It increases confidence that the instrument is measuring a stable characteristic rather than noise.
Add your observation to the dataset
Every DLI completion improves the research. 5 minutes, free, immediate results. The full preprint is available at Zenodo.
Take the AssessmentRead the DLI preprint (DOI: 10.5281/zenodo.18217577) →
Research Context
CTE Research Initiative. (2026). Decision Load Index: 901-participant knowledge worker dataset. Self-selected, observational. DOI: 10.5281/zenodo.18217577
CTE Research Initiative. (2026). 7-day retest cohort. N=47 participants, repeated measures. Reported in: Decision Load Index preprint, Section 4.
Baumeister, R. F. et al. (1998). “Ego Depletion: Is the Active Self a Limited Resource?” JPSP. (Foundational decision fatigue research.)
Hagger, M. S. et al. (2016). “A Multilab Preregistered Replication of the Ego-Depletion Effect.” Perspectives on Psychological Science. (Reliability context for decision fatigue findings.)
This is a research field note based on observational, self-selected data. N=901 reflects total DLI completions; retest data is a subset. Distribution shape and occupational clustering are descriptive findings, not causal claims. This dataset has not undergone peer review. This content is educational and does not constitute professional or clinical advice.