Survey Fatigue Is a Study Design Problem, Not a Patient Commitment Problem
Long-running patient registries lose participants to survey fatigue. It's among the leading modifiable causes of dropout — and it's addressable with design changes, not patient coaching.
Rare disease natural history studies and patient registries share a structural problem: they need patients to stay engaged for years, and a meaningful fraction drop out well before the study ends. Exact dropout rates vary a lot by disease, design, and platform — published registry and longitudinal observational studies commonly report attrition in the range of 20% to 50% within the first couple of years, and higher over longer horizons. I won't quote a single number because the honest answer is that it depends heavily on the study, and the "rare disease registry has 60% dropout at year two" figure that floats around industry decks isn't a literature estimate I can substantiate.
What is well-documented is that when investigators ask why patients stop participating, questionnaire burden — survey fatigue — is consistently among the top modifiable reasons. That part isn't really in dispute.
And importantly, this isn't a patient commitment problem. Rare disease communities are famously motivated research participants. When a registry loses them, the design is usually doing something the patient can't sustain.
Two different things called "survey fatigue"
The term gets used to cover two related but distinct phenomena.
Within-survey fatigue is the decline in response quality inside a single questionnaire — straight-lining down a Likert column, satisficing on a neutral midpoint, terse open-ended answers. It's a function of instrument length and cognitive load, and it's been studied for decades in survey methodology. Longer instruments produce worse back-half data; this is not controversial.
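One reason methodologists are so confident about this: within-survey fatigue leaves measurable fingerprints in the data. Here's a minimal sketch of one common data-quality check, flagging respondents who give an identical answer to every item in a Likert battery. It's purely illustrative — real checks also look at completion speed and per-item variance:

```python
def straight_lining_rate(responses: list[list[int]]) -> float:
    """Share of respondents who gave the same answer to every item
    in a Likert battery -- a crude within-survey fatigue signal."""
    if not responses:
        return 0.0
    flat = sum(1 for r in responses if len(set(r)) == 1)
    return flat / len(responses)

# Respondents 1, 2, and 4 straight-lined this 5-item battery.
batch = [[3, 3, 3, 3, 3], [1, 1, 1, 1, 1], [2, 4, 3, 5, 1], [4, 4, 4, 4, 4]]
print(f"{straight_lining_rate(batch):.0%} straight-lined")  # 75% straight-lined
```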
Between-survey fatigue is the declining engagement and completion rate over the life of a study. This is the retention problem. It's driven by cumulative burden — total time, cognitive load, disruption to daily life — weighed against whatever value the patient perceives they're getting back from participating.
The distinction matters because the interventions are different. Within-survey fatigue is reduced by shortening instruments. Between-survey fatigue usually requires rethinking the data collection model, not just trimming items.
What the EMA and survey-methodology literature actually tells us
A few findings from the literature that I'd trust to generalize:
Burden is perceived moment to moment, not as a weekly total. Shiffman, Stone & Hufford (2008), the foundational review on ecological momentary assessment (EMA), makes this point repeatedly: participants in EMA protocols with many short daily prompts often report lower subjective burden than participants in protocols with a single long weekly assessment, even when the total time is similar. Contact frequency is a more salient signal of commitment than total time, yet small interactions at the right moments tend to fit into a day in a way that a 30-minute obligation doesn't.
Recall degrades quickly for episodic events. This is why EMA exists as a methodology at all. For patients asked to reconstruct a week of symptoms on Sunday night, the data is a reconstruction, not a record. Real-time or near-real-time capture is closer to ground truth for anything episodic — seizures, migraines, pain flares, flares of inflammatory conditions — and the literature on this is extensive.
Delayed entry and "parking-lot" completion are detectable and common in paper-diary studies. The pattern of patients completing a week of diary entries minutes before a clinic visit was documented memorably by Stone et al. (2002) in BMJ — compliance with paper diaries was far lower than participants self-reported, and a significant share of entries were "backfilled" rather than recorded contemporaneously. Electronic diaries with timestamps make this visible; paper diaries hide it.
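With timestamps, the parking-lot pattern is easy to surface. A minimal sketch, assuming each entry carries a device capture timestamp and the visit time is known (the function and field names here are illustrative, not any platform's API):

```python
from datetime import datetime, timedelta

def likely_backfilled(entry_times: list[datetime],
                      visit_time: datetime,
                      window: timedelta = timedelta(hours=2)) -> int:
    """Count diary entries captured inside a short window before a
    clinic visit -- the classic 'parking-lot completion' signature."""
    return sum(1 for t in entry_times if visit_time - window <= t <= visit_time)

# A week of "daily" entries, all stamped the morning of the visit.
visit = datetime(2024, 3, 15, 9, 0)
stamps = [datetime(2024, 3, 15, 8, m) for m in range(5, 40, 7)]
print(likely_backfilled(stamps, visit))  # 5 -- every entry looks backfilled
```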
Reciprocity affects retention. Patients who receive something back from the study — their own data visualized, periodic summaries, community updates — tend to stay engaged longer than those who only give data and get nothing. The exact effect size varies by study, but the direction is consistent across the literature on participant engagement in long-term studies.
What I would not claim from the literature: precise dropout percentages for "rare disease registries" as a category, or the idea that any single intervention cuts dropout by a specific amount. Rare disease is a heterogeneous category, and most retention interventions are studied in single-disease or single-platform contexts. Be suspicious of industry claims that aren't grounded in a specific citation.
What seems to help
With that caveat, here's what the evidence supports as directional guidance for registry and natural history study design:
Shorter, more frequent over longer, infrequent
The EMA literature has consistently favored small, frequent prompts over single long sessions. The practical implication for a registry: a 60–90 second micro-assessment several times a week tends to be better tolerated than a 30–45 minute monthly battery, even at comparable total annual time. This is one of the more robust findings in the EMA tradition.
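The arithmetic behind "comparable total annual time" is worth spelling out. A back-of-the-envelope sketch with assumed schedules (illustrative numbers, not a recommended design):

```python
# Micro-assessments: ~90 seconds, 4x per week.
micro_minutes_per_year = (90 / 60) * 4 * 52    # 312 min/year

# Monthly battery: ~30 minutes, 12x per year.
battery_minutes_per_year = 30 * 12             # 360 min/year

print(f"micro:   {micro_minutes_per_year:.0f} min/year")
print(f"battery: {battery_minutes_per_year:.0f} min/year")
# Comparable annual totals -- but the EMA literature suggests the
# first schedule is perceived as far lighter than the second.
```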
Event-triggered over calendar-triggered, for episodic conditions
For episodic conditions, asking at the time of the event produces more accurate data than asking on a fixed schedule, and patients understand why they're being asked. A prompt right after a migraine feels relevant. A scheduled Tuesday-morning questionnaire about whether you had any migraines last week feels like homework. Relevance and perceived value both drive retention.
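In scheduling terms, event-triggered capture just swaps the calendar for the patient's own event report as the trigger. A minimal sketch of that rule — the event names and follow-up delays are assumptions, not a validated protocol:

```python
from datetime import datetime, timedelta

# Illustrative rules: which patient-reported events trigger a short
# follow-up micro-assessment, and how soon after the event it fires.
TRIGGER_RULES = {
    "migraine_ended": timedelta(minutes=30),
    "seizure_logged": timedelta(hours=2),
}

def schedule_followup(event: str, occurred_at: datetime) -> datetime | None:
    """Return when to prompt the patient, or None for untracked events."""
    delay = TRIGGER_RULES.get(event)
    return occurred_at + delay if delay else None

when = schedule_followup("migraine_ended", datetime(2024, 3, 4, 14, 10))
print(when)  # 2024-03-04 14:40:00 -- asked while the episode is still fresh
```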
Mobile-first capture
Patient communities skew younger in many rare diseases — particularly pediatric and genetic conditions — and more broadly, smartphones are the default device for most patients under 60. Desktop-portal-only ePRO platforms make participation harder than it needs to be. This isn't controversial; it's just a design default worth stating explicitly.
Give something back
Even a simple longitudinal view of a patient's own data — a sparkline of their symptoms, a summary of their reported medication use — materially changes how the study feels to participate in. This is both ethically right and practically useful for retention. If you're designing a registry, write the "what participants get back" policy at the same time you're writing the data collection plan, not after IRB review.
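Giving something back can start very small. A sketch of the simplest possible version — a text sparkline over a patient's own 0–10 weekly symptom scores (purely illustrative):

```python
BARS = "▁▂▃▄▅▆▇█"

def sparkline(scores: list[int], lo: int = 0, hi: int = 10) -> str:
    """Render 0-10 symptom scores as a one-line text sparkline."""
    span = hi - lo
    return "".join(BARS[min(len(BARS) - 1, (s - lo) * len(BARS) // span)]
                   for s in scores)

print(sparkline([2, 3, 3, 5, 8, 6, 4, 2]))  # ▂▃▃▅▇▅▄▂
```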
Front-load less, not more
Early dropout is typically higher than steady-state dropout — the first month or two is where habits form or fail to form. Heavy baseline batteries (which are common because everyone wants their instrument included at enrollment) can suppress early engagement. A staged approach, where the full instrument battery rolls out progressively over the first few weeks, tends to establish the habit before testing its limits.
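A staged rollout is easy to express as an enrollment-relative schedule rather than a single baseline battery. A sketch, with illustrative instrument names and timing:

```python
# Weeks since enrollment -> instruments that unlock that week.
ONBOARDING_SCHEDULE = {
    0: ["core_demographics", "primary_symptom_diary"],  # enrollment day
    2: ["validated_qol_instrument"],                    # habit first...
    4: ["medication_history", "exploratory_module"],    # ...then depth
}

def instruments_due(weeks_enrolled: int) -> list[str]:
    """Everything unlocked by this point in the participant's timeline."""
    return [inst for week, batch in sorted(ONBOARDING_SCHEDULE.items())
            if week <= weeks_enrolled for inst in batch]

print(instruments_due(3))
# ['core_demographics', 'primary_symptom_diary', 'validated_qol_instrument']
```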
Caregivers as first-class reporters, for pediatric and post-ictal populations
For pediatric rare disease, and for adult populations with significant cognitive or post-event impairment, designing for caregiver-reported data from day one outperforms bolting caregiver workflows on later. The data structure and consent model should treat caregiver reports as a formal stream, not a fallback.
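Concretely, "first-class" mostly means that who reported is a structured field on every record from day one, not a note in free text. A minimal schema sketch (the field names are assumptions):

```python
from dataclasses import dataclass
from datetime import datetime
from enum import Enum

class Reporter(str, Enum):
    PATIENT = "patient"
    CAREGIVER = "caregiver"   # first-class, not a free-text afterthought

@dataclass
class ObservationRecord:
    participant_id: str
    reporter: Reporter        # every record says who observed it
    reporter_id: str          # which caregiver, if there are several
    captured_at: datetime
    instrument: str
    responses: dict[str, int]
```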
If you're drafting a protocol
A few concrete suggestions from working with teams on this:
Audit your total annual respondent time before you lock the protocol. Every optional module, every validated instrument, every exploratory questionnaire. If the total comes in above three or four hours a year, expect meaningful attrition from burden alone, and consider whether every item is actually load-bearing for your analysis.
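The audit itself fits in a few lines of spreadsheet or script. A sketch with made-up instrument times:

```python
# (minutes per administration, administrations per year) -- illustrative.
PROTOCOL = {
    "primary_symptom_diary":    (1.5, 156),  # ~90s, 3x/week
    "validated_qol_instrument": (20,  4),    # quarterly
    "medication_history":       (10,  2),    # twice a year
    "exploratory_module":       (25,  1),    # annual
}

total_min = sum(m * n for m, n in PROTOCOL.values())
print(f"total respondent burden: {total_min:.0f} min/year "
      f"(~{total_min / 60:.1f} h)")
# total respondent burden: 359 min/year (~6.0 h) -- over the 3-4 hour
# line, so something here needs to justify its place in the analysis.
```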
Pick your data collection platform early. The platform determines whether you can do event-triggered capture, whether you can verify entries with timestamps, and whether participants can complete assessments from their phone. These are measurement-design decisions, not IT procurement decisions.
Write your reciprocity plan. Even something simple — "participants can download their own data at any time; we send a quarterly community summary" — is meaningful.
Pre-specify a missing data model. No multi-year registry hits 100% completion. Specifying up front how missing data will be handled, and what completion rate your primary analysis needs, makes the trade-off between burden and completeness explicit rather than implicit.
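Even the simplest version of this makes the trade-off visible: state the completion rate the primary analysis assumes, and what that implies for enrollment. A rough complete-case sketch — a real protocol would specify the full missingness model, e.g. its multiple-imputation assumptions:

```python
import math

def required_enrollment(n_complete_needed: int, expected_completion: float) -> int:
    """Enrollment needed so that, at the assumed completion rate, enough
    participants remain for a complete-case primary analysis."""
    return math.ceil(n_complete_needed / expected_completion)

# Analysis needs 120 completers; protocol assumes 70% completion at year 2.
print(required_enrollment(120, 0.70))  # 172
```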
Rare disease research depends on patients staying engaged across years of data collection. Survey fatigue is among the largest modifiable threats to that engagement, and it's mostly addressable through design choices made at the protocol stage. The tools exist. The decision is to prioritize retention when there's still time to change the plan.
See Forma Health in action
Walk through a custom configuration for your condition, endpoints, and data needs — set up in under an hour.