Design of Treatment Trials for Functional GastrointestinalDisorders
Design of Treatment Trials Committee: E. JAN IRVINE,* WILLIAM E. WHITEHEAD,‡WILLIAM D. CHEY,§ KEI MATSUEDA,ʈ MICHAEL SHAW,¶ NICHOLAS J. TALLEY,#,** andSANDER J. O. VELDHUYZEN VAN ZANTEN‡‡*Division of Gastroenterology, St. Michael’s Hospital and University of Toronto, Toronto, Ontario, Canada; ‡Division of Gastroenterology,University of North Carolina–Chapel Hill, Chapel Hill, North Carolina; §Division of Gastroenterology, University of Michigan, Ann Arbor,Michigan; ʈDivision of Gastroenterology, NCNP, Ichakawa City, Japan; ¶Division of Gastroenterology, Park Nicollet Clinic and University ofMinnesota, Minneapolis, Minnesota; #Division of Gastroenterology, Mayo Clinic College of Medicine and Division of Gastroenterology andHepatology, Rochester, Minnesota; **Department of Medicine, University of Sydney, Sydney, Australia; and ‡‡Division of GastroenterologyDalhousie University, Halifax, Nova Scotia, Canada
This document addresses the design of trials to assess the
provide standards to help explain the mechanisms of
efficacy of new treatments for functional gastrointestinal
therapeutic success and enable regulatory agencies, re-
disorders (FGID), emphasizing trials in irritable bowel syn-
searchers, and providers to better evaluate the quality of
drome and dyspepsia, because most research has been
published studies. This report focuses largely on designs
undertaken in these conditions. The double-blind, random-
that evaluate treatment efficacy, with emphasis on irri-
ized, placebo-controlled, parallel group trial remains the
table bowel syndrome (IBS) and functional dyspepsia
preferred design. Randomized withdrawal designs, al-
(FD), because they have been studied most extensively.
though encouraged by the European Agency for the Eval-
Studies that address pathophysiology or mechanism of
uation of Medicinal Products, have the same potentialdisadvantages as a crossover design, including carryover
treatment effects are not included in this review because
effects, unmasking (unblinding), and overestimation of the
they require quite different and diverse study designs.
potential benefit for clinical practice. Innovative trial de-
Recommendations in this article are based largely on
signs that evaluate intermittent (on demand) treatment
consensus of the literature, except where specifically in-
are likely to become more common in the future. Investi-
dicated. We refer readers to the corresponding chapter in
gators should include as broad a spectrum of patients as
the Rome III book for a more detailed discussion with
possible and should report recruitment strategies, inclu-
sion/exclusion criteria, and attrition data. The primaryanalysis should be based on the proportion of patients in
each treatment arm who satisfy an a priori treatment
responder definition, or a prespecified clinically meaningfulchange in a patient-reported symptom improvement mea-
The goals of most treatment trials are to ascertain
sure. Such measures of improvement are psychometrically
the impact of the intervention(s) on (1) the frequency and
validated subjective global assessments or a change from
severity of symptoms, (2) health status and quality of life,
baseline in a validated symptom severity questionnaire. It
(3) the patient’s ability to cope with symptoms, and/or
is unethical to change the responder definition after a trial
(4) the use of health care resources. Generally, a single
begins. Data analysis should address all patients enrolled,
using an intention-to-treat principle. Reporting of results
Investigators should select their most important research ques-
should follow the Consolidated Standards for ReportingTrials guidelines and include an analysis of harms data and
tion(s), pertinent to the specific FGID, develop a hypothesis
secondary outcome measures to support or explain the
based on available evidence, and design a study that will most
primary outcome. Trials should be registered in a public
effectively answer the proposed research question.
location, prior to initiation, and should be published even ifthe results are negative or inconclusive. Abbreviations used in this paper: CONSORT, Consolidated Standards
for Reporting Trials; EMEA, European Agency for the Evaluation ofMedicinal Products; FD, functional dyspepsia; FGID, functional gastro-
Thecommittee’saimsweretoreviewtheliteratureon intestinal disorder; ITT, intention to treat; NNH, number needed to
trial design for the functional gastrointestinal dis-
harm; NNT, number needed to treat; IBS, irritable bowel syndrome.
2006 by the American Gastroenterological Association Institute
orders (FGIDs), to further develop to assist
researchers in conducting treatment trials for the FGIDs,
Special recruitment strategies such as advertising have
been accepted in some countries to accelerate recruit-
A broad spectrum of patients should be included to
ment. A recent IBS study observed that patients re-
support the generalizability of the trial findings to patients
cruited by newspaper advertisement, in comparison to
outside of the trial. In pharmaceutical research, particularly,
patients enrolled by gastroenterologists, were older, more
regulatory agencies may limit licensed drug indications to
highly educated, more often depressed but less anxious,
the trial population. The study population should be se-
and had less severe IBS symptoms; primary care patients
lected based on the question, treatment (including possible
were also anxious but had symptom severity that was
side effects), expected results, and empirical A screen-
intermediate between patients recruited by advertise-
ing log, summarizing the most important demographic
ment and patients recruited from gastroenterology clin-
variables in patients entered or excluded and the reasons for
Recruitment strategies should be clearly identified to allowA screening log provides support for the generalizability of theexploration of different patterns of treatment response.
Specified inclusion and exclusion criteria are manda-
tory for all studies and should include the FGID case
Important patient characteristics to report include
definition. If enrollment is targeted to a special popula-
age, gender, race, symptom severity, duration of disease,
tion to maximize treatment efficacy or minimize side
prior treatments for the study condition (and response),
and the use of coexisting medications, including over-
It is advisable to include as broad a spectrum of patients as
the-counter drugs and vitamins. These may impact out-
possible, defined by the ROME-specific FGID criteria. Restrict-
comes and should be tested as possible disease or effect
ing the study population must be justified and inclusion and
modifiers. For example, gender differences in drug re-
exclusion criteria must be specified.
sponse have become evident in clinical trials of certain
In clinical practice, many physicians avoid formal
serotonergic drugs in patients with Depending on
investigation in favor of a positive diagnosis, reassurance,
the hypothesis, investigators may choose to enroll only
and lifestyle modification. However, entry criteria for
one gender. However, if both women and men are to be
treatment trials must be more specific. The consensus
included, there should be sufficient numbers of both to
view is that the minimum evaluation should include a
allow meaningful subgroup analyses. As data accumulate
complete blood count, imaging of the relevant part of the
describing the genetics of the FGIDs in relation to drug
gastrointestinal tract within the previous 5 years, and
responsiveness, it may become relevant to assess these
other investigations determined by symptoms and family
parameters during clinical It is also advisable to
Emerging evidence suggests that screening IBS
assess for psychological distress or prior mental health
patients for gluten enteropathy may also be
problems; in trials of psychological interventions or psy-
When testing is required for study inclusion, it should
choactive drugs, these may be important effect modifiers,
be consistent across all study arms, and the timing
and in trials of nonpsychological treatments, they are
potential confounders that could influence both baseline
The minimum screening investigation for eligibility should bePotential disease modifiers/confounders that might affect re-
Most trials of FGIDs have been conducted in academic
sponse to therapy should be assessed.
centers specifically interested in the FGIDs, creatingconcerns of selection bias that favor inclusion of patientswith more severe symptoms and/or with psychosocial
Two large trials showed significant differences in
Clinical trials differ from usual practice in several
treatment response between primary and referred pa-
ways, including the application of strict eligibility cri-
tients with Thus, researchers should consider re-
teria, use of a placebo, a standardized intervention, fre-
cruiting broadly, noting if subjects are from primary,
quent follow-up visits with extensive data recording, and
secondary, or tertiary care. Differences in baseline sever-
the use of study coordinators. Nonetheless, standard
ity and treatment response by site or type of recruitment
aspects of diagnosis and management, especially an ad-
equate explanation and reassurance about the disease, are
Patient characteristics should be documented sufficiently to
part of standard care and should be provided to all
examine the comparability of patients among centers and allow
patients in the trial. Novel interventions must show
comparisons with other populations.
promise of a benefit over standard care.
or drug trials in which the active drug causes predictable
side effects or rapid symptom change, are difficult tomask from patients or investigators, but possible solu-
tions to maintain an investigator-masked outcome assess-
ment include using independent assessors who are un-
interviewer or self-administered questionnaire, or per-
forming laboratory tests (eg, anal manometry in fecal
incontinence) that are interpreted by individuals not
It is mandatory to undertake the maximum masking possible,determined by the type of intervention and study design.
Randomization. Randomization is a process (equiv-
alent to the flip of a coin) used to assign patients to
treatment arms in an unbiased fashion. The allocation se-
quence should be concealed from investigators and research
personnel should be unaware of the treatment to which a
patient will be assigned until after the patient has beendeemed eligible and has consented to Stratifiedrandomization, whereby the most important prognostic fac-
Every trial should incorporate the principles of good clinical
tors (eg, gender and usual bowel habit) are identified be-
practice to ensure that the study results are relevant to real
forehand, uses a separate randomization sequence for each
stratum (eg, male versus female or constipation-predomi-nant versus diarrhea-predominant IBS) to balance these
factors among treatment Stratification should be
limited to 1 or 2 fParticularly in multicenter
Study designs for treatment trials in FGID face
trials, in which sites may enroll only a few subjects, ran-
several important challenges: (1) a high placebo response
domization can be performed in blocks. A block refers to the
number of subjects within which the group assignments
for multimodal therapy owing to weak effects of available
have to be balanced. A permutated block design (variable block
treatments or multiple etiologic mechanisms interacting
size) ensures that the sequence of assignments is unpredict-
in the disease (4) difficulty of masking (blind-
able to the investigator. When reporting the trial, the
ing) patients and investigators, particularly in trials of
randomization procedure should be explicitly described be-
cause it is a potential source of bias.
the-counter treatments or drugs taken for other condi-
Investigators must include a detailed description of their
tions (eg, antidepressants); and (6) avoidance of harms in
randomization scheme in the report of the study.
Selecting the control group. A placebo control
Bias, defined as systematic error that leads to a devi-
group is essential to establish the efficacy of a new
ation of the estimated treatment effect from its true
treatment. When a proven efficacious treatment exists,
value, may enter a clinical trial at any stage from patient
comparison against this active treatment may be consid-
enrollment to publication of the results. The major
but inclusion of a placebo is still recommended to
avoid an inconclusive trial, in which the active treat-
Masking. Double masking (of both patients and
researchers) to the intervention ensures the validity of the
Behavioral therapy pose particular challenges
outcome assessment. “Triple masking” is desirable and
to identify inactive comparison treatments that generate
extends masking to all investigators, including data
expectancy comparable to the active intervention. Un-
managers and In drug trials, investigators
treated patients are poor control subjects because they
are encouraged to ask both the patient and the interven-
tionist who interacts with the patient at the end of the
result in an overestimate of the impact of the interven-
trial whether they believe the active treatment was ad-
tion. Options to assess the integrity of behavioral trials
ministered and to report these data. Certain interven-
include (1) testing the credibility of both active and
tions, such as psychotherapy, hypnosis, sphincterotomy,
control interventions after initial exposure (eg, by using
the Credibility or (2) using a process measure
The placebo response rate in treatment trials of FGIDs is
to ensure that the active treatment is producing the
substantial and largely unavoidable.
intended effects on physiology or cognitions while the
Baseline observation versus placebo run-in. A
control treatment does not (eg, does biofeedback for fecal
period of prospective baseline measurement before treat-
incontinence change anal sphincter squeeze pressure
ment is useful to evaluate patient eligibility. This also
more than the control condition, or does cognitive–
limits recall and reporting biases by ensuring that pa-
behavioral therapy alter the patients dysfunctional atti-
tients are currently symptomatic. It allows comparison of
tudes to a greater extent than an education control
patients in the active and placebo groups, as well as
evaluation of a clinically important change in health
sources of bias and their potential impact on study
findings in the discussion section of the report.
Older studies have used a placebo run-in period where
A placebo control group is essential. In behavioral treatment
all patients received placebo for a specified period and
trials, confirming that the control condition produces a similar
their response was assessed, using the study outcome
expectation of benefit, but does not act on the same physiologic or
measures. Patients who significantly improved were ex-
psychological principles, is recommended.
cluded from further participation to reduce the propor-tion of placebo responders and to exclude patients with
poor adherence. This has been used in several trials of
A placebo is an intervention believed to lack any
allergic rhinitis and, although acceptable to regulatory
specific effect to change a particular Placebo
agencies, may underestimate the overall effect It
effects range from 10% to 70% for and 0% to 84%
is also difficult to predict whether (1) the placebo re-
for This substantial placebo response rate makes it
sponse increases, plateaus, or decays after the run-in
more difficult to demonstrate superior efficacy of new
(2) a differential dropout occurs, and (3) patients
treatments. Of note, a placebo administered by a physi-
removed from a trial have a different response to those
cian appears to be more powerful than one given by other
who continue. Exclusion of patients for placebo response
may also disrupt the doctor–patient relationship for fu-
an order effect, in which an effective drug has a lesser
benefit when given after a placebo. This is especially
The disadvantages of a placebo run-in appear to outweigh the
important if a placebo run-in period is implemented to
benefits and it is best avoided. However, baseline observations
exclude placebo responders or in studies with a crossover
design, because approximately half of patients in a cross-
External factors may also contribute to changes in
The double-masked, randomized, placebo-con-
health status making it difficult to detect a treatment
trolled trial is the gold standard method to test the
effect, including (1) a natural variation in symptoms, (2)
efficacy of a new treatment. A parallel group study
regression toward the mean, and (3) unidentified or
design requires that patients be randomized to receive
unintended cointerventions. Regression to the mean is the
only one treatment assignment throughout the trial (af-
likelihood that patients consult when symptoms are par-
ter a period of baseline assessment without treatment).
ticularly severe and improve with time owing to the
Dose-ranging studies (different groups receive different
natural variation in symptom severity and irrespective of
doses) and multiple control treatments, with a baseline
observation of no treatment or a washout period after
changes in diet or using over-the-counter remedies could
treatment, are different variants of a parallel group de-
also lead to a false interpretation that an intervention was
effective, as could the extra attention given patients by
Crossover designs, in which subjects receive both treat-
researchers during clinical trials (Hawthorn The
ments during distinct time periods, separated by a wash-
magnitude of the placebo response may also be influ-
out phase have been popular in some FTheoret-
enced by the wording of the question used to define
ically, lesser variability in outcomes within subjects
treatment response or by the use of a compound ques-
could require a smaller sample size for the desired sta-
tistical power. However, patient dropout rates and miss-
the placebo response rate is larger when a responder is
ing data have a greater impact than in a parallel design,
defined by a global improvement in IBS symptoms com-
because patients are omitted from both study arms when
pared to defining a responder by reduction in abdominal
data are missing. The greatest disadvantages of crossover
pain (average placebo responses of 36% versus 28%).
designs are (1) the carry-over (period-by-treatment) ef-
fects that occur when the first treatment influences the
There is a growing interest in developing drugs for
response to the second treatment or when symptoms
intermittent treatment (short-course administration for a
change with and (2) the high likelihood of un-
predetermined time period after symptom recurrence) or
masking owing to side The European Agency
on-demand treatment (medication is taken only during
for the Evaluation of Medicinal Products (EMEA) may
symptoms). These issues have been addressed in gastro-
accept a crossover design for a Phase III trial, yet has
esophageal reflux IBS and FD trials have fo-
highlighted problems that could invalidate study re-
cused on continuous administration of drugs to moderate
and does not provide guidance for analysis. If
period and sequence effects occur, only the first treat-
ment period data should be used to determine efficacy.
believe that patients often take medications only as
Although crossover designs are not recommended for
needed. Trial designs and outcome measures required for
treatment trials with subjective end points, they may be
testing the efficacy of intermittent therapy differ from
used in physiologic studies, where the end points are
those used to test continuously administered treatments.
After establishing efficacy during continuous administra-
A factorial design can be undertaken to evaluate com-
tion, intermittent or on-demand studies can be con-
bined For example, to test the effects of
ducted. Guidelines for intermittent treatment of mi-
combining two treatments, A and B, subjects are ran-
domly assigned to 4 groups: no A and no B; A and no B;
B and no A; or both A and B. Investigators might
The parallel group design is the accepted standard for
consider such a design either (1) to save money by testing
evaluation of efficacy for most treatments and is applicable to
2 treatments at once with fewer subjects overall, or (2) to
most experimental situations. The crossover design is best
test for synergistic effects of combined treatments. Im-
portantly, the 2 treatments should have distinct mecha-
Types of trials. The strongest case supporting the
nisms of action to be able to interpret the simple effects
efficacy of a new medication is made by demonstrating
(ie, the comparison of all patients receiving treatment A
clinical and statistical superiority to placebo or an active
to all patients not receiving treatment A, and the com-
control treatment. An equivalence study or noninferior-
parison of all patients receiving treatment B to all pa-
ity trial can also be considered if (1) a known, effective
tients not receiving treatment B), or to detect whether
treatment is available and it would be unethical to
there is added benefit from combining treatments. Also,
administer a placebo (eg, cancer or inflammatory bowel
a control is required for each intervention. Potential cost
disease, not FGID), or (2) a new treatment might be less
savings are frequently offset by the complexity of inter-
costly, safer, or just as good as standard Such
preting the data, except when testing for synergistic
trials are usually more costly than superiority trials,
because much larger sample sizes are required. Investi-
The withdrawal trial is an enrichment design, in which
gators must first estimate the expected difference be-
all subjects receive the active treatment. At a predefined
tween standard treatment and placebo from a meta-
time point, they are classed as responders or nonre-
sponders and the latter are excluded. Responders are then
“equivalence margins” that are smaller than the expected
randomly assigned to receive active treatment or placebo
difference. The trial is judged to be positive only if the
and efficacy is based on the second part of the trial.
95% confidence interval for the observed difference be-
Potential carry-over effects from the first treatment, how-
tween the new and standard treatments falls within the
ever, can prevent an accurate estimate of the drug benefit.
equivalence margins. For a noninferiority trial, only the
lower 95% confidence limit must fall within the
drugs for short-term efficacy in IBS, and require 2 or
more treatment cycles to demonstrate efficacy. The
In trials that compare the investigational treatment to
a different active treatment, the investigator is obliged to
placebo at an unpredictable time) be undertaken after
show that the treatment arms are in equipoise; it is
active drug, but does not address how to perform the
unethical, for example, to compare the investigational
complex statistical analysis. Like the placebo run-in, this
drug to an ineffective dose of an alternative compound.
design can overestimate the effect One completed
Superiority trials (not equivalence or non-inferiority trials)
followed the EMEA guidelines (with minor vari-
ations) and provided data supporting the efficacy of
Duration of treatment. Treatment duration for
tegaserod for IBS on repeated dosing cycles.
specific FGIDs should be based on natural history data
describing the frequency and duration of episodes. For
IBS, this is highly but for the majority of
Diaries have been used to measure primary or
patients both flares and remissions appear to last less than
secondary end points and minimize recall bias. Relatively
1 wfor dyspepsia, there is a high symptom
few symptoms are recorded and ratings can be performed
at a fixed time (eg, bedtime) or when symptoms actually
dations for trials of 8 –12 weeks were based on experience
occur. The former method is simpler for data analysis. A
and on concerns for cost and ability to retain patients.
major problem of is poor adherence; patients
EMEA differentiate trials of short-term ef-
often complete them retrospectively or just before a
ficacy, for which they would accept 4-week trials, from
Hand-held electronic devices with reminder
long-term efficacy trials, for which 6-month trials are
required. Although both types of trials require patients
been shown to improve adherence to 80%–90%, and
with active symptoms at randomization, long term-stud-
patient satisfaction is Diary symptoms can also
ies could include patients with intermittent symptoms.
be recorded on secure Web sites, which can accurately
Further research on the natural history of individual
FGIDs should be a high priority, to allow clearer recom-
Retrospective questionnaires are an acceptable method for
mendations for trial duration. Extended follow-up
assessing symptoms provided the recall interval is limited to 3
should be considered to determine treatment durability
months. Patients should receive clear instructions on the use of a
and should relate to symptom periodicity and presumed
diary, including the directive to leave it blank if they forget torecord information. Electronic diaries are preferred over paperA minimum treatment duration of 4 weeks that reflects thediaries. Methods to ensure adherence to recording methods shouldsymptom periodicity and anticipated treatment mechanism isrecommended. If chronic use is anticipated, trials of at least 6months should be undertaken to establish long-term efficacy.
Adherence to treatment and study protocol. Stan-
The primary outcome variable(s) provides the ba-
dard methods to assess adherence include interviewing
sis for judging the success or failure of an intervention.
Only 1 or at most 2 variables should be selected and this
blood levels of and may be especially im-
should be done before the trial begins. The Food and
portant when interpreting studies of long duration. The
Drug Administration and EMEA have recommended
frequency of missed or late appointments and missing
that investigators provide rules, a priori, that allow clas-
data from diaries or questionnaires should be reported for
sification of each participant as a responder or nonre-
Adherence to the protocol and treatment should be measured.
include secondary outcome variables to (1) strengthenthe results by showing concordance between individualsymptoms and the primary outcome measure, (2) address
the mechanism of the intervention, (3) assess the safety or
(4) cost effectiveness of the treatment, and (5) identify
variables that predict which patients are most or leastlikely to benefit.
Efficient symptom assessment can be achieved by
The definition of a responder should reflect a clinically
having patients complete questionnaires before treat-
meaningful symptom improvement for each patient. For
ment and at follow-up visits. However, concerns about
IBS and other FGIDs, there is no consensus on what
the accuracy of retrospective questionnaires include
constitutes a clinically meaningful improvement. Some
whether (1) symptoms present on the day they complete
studies accept as little as a 10% reduction in a visual
the questionnaire influence reporting; (2) poor recall
analog scale rating of symptom or 1 step on a
affects the accuracy of a retrospective report; and (3)
7-step ordinal as clinically meaningful, whereas
patients feel pressured to give a more positive report if
other studies require a 50% reduction in an aggregate
questionnaires are completed in the presence of the in-
vestigator. Although data support the presence of these
the most commonly employed definition of clinically
biases, they do not appear to be Recall of
meaningful improvement in IBS has been a patient’s
health-related events appears to be reasonably accurate
report (yes or no) of “adequate relief of abdominal pain
and or “satisfactory relief of IBS symp-
These definitions are assumed to have face
Adequate relief or satisfactory relief as a primary
validity. However, empirical data are needed for each
outcome measure. Since 1999, most published pharma-
outcome measure to assess the clinical significance of
ceutical trials for IBS have used “adequate relief of ab-
different degrees of change from both the patient’s and
dominal pain and or ”satisfactory relief
of IBS as their primary outcome measure. One or at most 2 primary outcome measures should be specified
Responders were defined as patients who reported “yes”
in advance. Investigators should list criteria to classify each
to adequate relief or satisfactory relief on at least half of
patient as a responder or nonresponder based on a clinically
the weeks in the treatment trial. These studies demon-
strated statistically significantly higher responder ratesfor active drug relative to placebo and led to approvals
for alosetron and tegaserod by the Food and Drug
In selecting a primary outcome, investigators
should examine the trial objectives, population, and
Mangel et assessed the validity of the adequate
mechanism of action of the proposed treatment and
relief measure in diarrhea-predominant IBS patients and
should choose either a global measure, which integrates
showed that responders differed significantly from non-
the symptoms into a single numerical index, or the
responders regarding pain-free days, pain severity, ur-
summary score of a validated symptom severity and/or
gency, stool frequency, and 6 of 8 SF-36 quality of life
subscales plus 8 of 9 scales on a disease-specific quality of
Attention should be paid to the suitability of the
life measure. However, correlations among measure-
measurement scale used for each outcome measure. A
ments (convergent validity), test–retest reliability, and
detailed discussion of measurement scales and their prop-
internal consistency were not reported. Similar validation
erties is beyond the scope of this report, but is more
data have been reported for satisfactory
thoroughly addressed in the Rome III book and else-
Integrative symptom questionnaires. An alterna-
tive method for defining a responder in an IBS treatment
Physician-reported assessments have been accepted in
trial is to ask patients to report the frequency or severity
some but are subject to greater measurement
of all (or a representative group) of IBS symptoms prior
error than patient Therefore, patient-reported
to and again following treatment, and to define a re-
measures are endorsed. Only fully validated instruments
sponder as a patient who reports at least a 50% decrease
are recommended as primary outcome assessment tools,
and secondary outcome measures should also be assessed
tionnaires that examine the severity of IBS, such as the
for robustness. Psychometric validation requires that (1)
Gastrointestinal Symptom Rating Scale for and the
the assessment instrument includes symptoms relevant
Functional Bowel Disorder Severity However,
to and fully representative of the disorder (face validity);(2) it show a predictable relationship with other measures
the Irritable Bowel Syndrome Symptom Severity
(construct validity); (3) the assessment produces similar
is the only IBS symptom severity scale that has been
results when readministered to patients whose health
status has not changed (reliability); (4) it can detect
Whitehead et compared different outcome mea-
clinically meaningful change in health status when such
sures including satisfactory relief and a 50% reduction in
a change has occurred (responsive); and (5) changes in score
the Irritable Bowel Syndrome Symptom Severity Scale
can be related to clinical indicators that are meaningful
questionnaire, in an observational study of patients’ re-
to clinicians (criterion validity).
sponse to usual medical care for IBS. They reported that
Validation of a new outcome measure is best estab-
the response rate on satisfactory relief was influenced by
lished in a separate The frequency of data re-
pretreatment symptom severity: patients with initially
cording for each outcome should also be specified before
mild IBS symptoms showed the highest responder rate
the trial begins, as should the time frame defining the
but the smallest change in symptom severity, whereas
patient response (whether at the end of the trial, during
patients with initially severe IBS symptoms showed the
a prespecified proportion of weeks or months that re-
lowest responder rate but the largest decrease in severity.
sponder criteria have been fulfilled, or for all time points
In contrast, when defining a responder as a patient who
reported at least a 50% decrease in symptom severity,
A patient-reported outcome assessment is recommended. Psy-
pretreatment symptom severity had no impact on the
chometric validation of each outcome measure is recommended
responder rate. Defining a responder based on a 50%
before it is used in clinical trials.
reduction in symptoms has been used in several stud-
how they were measured. Investigators should attempt to
ever, like satisfactory relief and adequate relief, it re-
place benefits and harms for any intervention into
Subjects can be classed as responders and nonre-
Anticipated and unanticipated adverse events should be
sponders at different time points during a trial. In pub-
lished trials, patients were classified as responders if theyreported adequate relief or satisfactory relief on at least
The reasons for including each secondary outcome
However, this loses important information; the most
and the plan for analysis should clearly be identified
persuasive evidence for efficacy would be to show that
before the trial begins. Health economic outcomes are
patients in the active treatment had a sustained response
becoming an important class of secondary
once they reported satisfactory or adequate relief. Inves-
Secondary outcomes should be selected based on the study
tigators are encouraged to use more sophisticated statis-
question and should be validated measures that support or
tical models that address the longitudinal trajectory of
explain the results. Integrating health economic outcomes is
report the proportion of patients responding at each time
Quality of life assessment. FGIDs significantly
impact quality of Generic and disease-specific
Several well-validated outcome measures have been
quality of life instruments are Generic in-
used in FD These use a single global outcome of
struments can assess quality of life in large populations
a specific symptom (eg, Glasgow Dyspepsia Severity
and across a wide spectrum of disorders, but may not
a global overall assessment of dyspepsia symp-
reflect all important aspects of health status for specific
toms (eg, the Canadian Dyspepsia the LEEDS
disorders. They may be less sensitive to detect important
treatment effects, but they permit comparisons with
important dyspepsia and quality of life outcomes (eg, the
other diseases and help to detect unexpected changes in
health status after treatment. Examples of validated in-
struments include the Sickness Impact the
Nottingham Health Profile, the SF-36 (Short Form of
outcome measures are yet to be developed.
General Well-Being Disease-specific quality of
Pain or discomfort is a key feature of many FGIDs and
is typically either the primary outcome variable or an
FGID (eg, the fear of fecal incontinence in IBS). Theo-
important secondary outcome variable in clinical trials.
retically, they can detect smaller and more specifically
Pain has 3 dimensions—intensity, duration, and fre-
relevant changes in health status, which may be missed
quency—that can be considered separately or integrated
by generic instruments. Quality of life measures have not
in a global assessment of pain or can be incorporated into
been used as the primary outcomes in pharmaceutical
a quality of life measure. Different rating scales can be
clinical trials because they were believed to be insuffi-
used that are reproducible and sensitive to If
ciently responsive to treatment, but have been strongly
pain is chosen as the primary outcome, a meaningful
recommended as secondary outcome variables. One re-
clinical response should be defined beforehand, and the
port focusing on the health-related quality of life data
proportion of patients reaching this end point reported.
from two previously reported trials of alosetron found a
Adequate relief and satisfactory relief are the current stan-
significantly greater improvement on active drug com-
dards for primary outcome assessment in treatment trials in
pared to challenging the belief that these
FGIDs. Alternative outcome measures such as integrative symp-
measures are not responsive enough to be employed as
tom questionnaires are also acceptable. All of these measuresQuality of life assessments are important secondary outcomes.
Safety issues and absence of harms. Every trial
Investigators are encouraged to include both a baseline generic
should document and report adverse events. Recent at-
and a pre–post disease-specific quality of life instrument.
tention has focused on the appropriate reporting ofharms-related issues in randomized clinical
When collecting harms data is a trial objective, it shouldbe reflected in the manuscript reporting the study results
The type of statistical analysis is determined by
and the report should clearly define adverse events and
the particular study design and primary outcome mea-
sure(s). The Consolidated Standards for Reporting Trials
efficacy trial, can also allow computation of the number
(CONSORT) statement was developed by scientists and
of patients who need to be treated (NNT) to encounter a
editors to improve the quality of reporting parallel
patient who will experience a clinical benefit. Although
group, randomized, controlled It emphasizes the
the NNT is reported infrequently in randomized, con-
importance of transparently reporting the study objective
trolled trials, its inclusion can convey the clinical impor-
and how the study was conducted and analyzed. Evidence
tance of a study Similarly, harms data can be
supports improved quality of methodology and data
used to estimate the number of patients that would need
to be treated with a drug to see an adverse event (number
journals now require that manuscripts describing clinical
needed to harm [NNH]). Calculation of the NNT and
trials conform to the CONSORT guidelines, found on
NNH allows the researcher or clinician to more quanti-
tatively assess the benefits and risks of any given therapy.
tions have made similar recommendations for studies
When reporting P values, actual values and not thresholds
evaluating diagnostic testing (Standards for Reporting of
should be provided. An NNH can be calculated based on therisk of adverse effects and can be weighed against the NNT.
analyses (Quality of Reports of Meta-Analyses state-
The statistical analysis should be based on an inten-
tion-to-treat (ITT) with a plan for handling
Investigators should adhere to the CONSORT statement on
dropouts. The trial can either be analyzed as the propor-
tion of responders in each group, treating all dropouts as
The main analysis for FGID trials should focus on the
nonresponders, or by carrying forward the last observa-
primary outcome measure(s) to determine whether or not
tion available for the primary outcome. A dual analysis,
the study results support a new treatment. Although the
examining for differences in results using the 2 different
main outcome often compares the end of treatment and
methods should be performed. Many studies also report
baseline observations, data should also describe how pa-
a per protocol (all patients who followed the protocol) or
tients changed during the study; the results of a trial are
an all-patients-treated (all patients who received treat-
far more compelling if patients have had a sustained
ment following randomization) analysis. These analyses
response to the intervention. When 2 primary outcome
may provide insight as to whether a treatment works
variables are included in a trial, investigators should
under optimal conditions, but cannot replace the ITT
specify in advance whether the trial will be considered
analysis. When there is a discrepancy between the ITT
positive if only 1 outcome measure is significant, or if
(negative) and per protocol (positive) analyses, the results
both are required. If significance on any primary outcome
should be interpreted as inconclusive. The effect of po-
suffices, the analysis should adjust for multiple compar-
tential modifiers such as gender, age, duration or severity
isons, for example, using the Bonferroni
of disease, and presence of psychological stress can be
The committee suggests that the EMEA recommenda-
assessed using a logistic regression analysis, where the
tion requiring 2 positive primary outcomes for trials in
binary dependent variable represents the a priori speci-
IBS may be overly conservative. The primary outcome
fied definition of a Such covariates should
results should be stated in absolute numbers to include
both a numerator and denominator; it is not sufficient to
The primary analysis should be the ITT analysis and must
list only percentages of (non)-responders (eg, not 20%
but rather 10/50, 20%). For all outcome measures, theestimated effect of the intervention (difference between
active and placebo treatment) and a 95% 2-sided confi-
The main result of the study must be based on the evaluation
Results should be reported for all prespecified
of the primary outcome measure as stated in the protocol before
outcomes. Score changes should be reported for each
the study begins. The primary outcome should be stated in
cardinal symptom of the entry criteria. Secondary
absolute numbers and should include a 95% confidence interval.
outcomes that are used to support or refute the pri-
Statistically significant differences between study
mary analysis should be analyzed by ITT and not per
groups can also be expressed using a P value. Actual
protocol. Adjustment for multiple comparisons is gen-
values and not thresholds (ie, not P Ͻ .05) should be
erally unnecessary when analyzing secondary outcome
provided and should be complementary to confidence
measures because the efficacy of the treatment is
intervals. The reciprocal of the absolute risk reduction, in
judged on the basis of the analysis of the primary
a risk reduction trial, or therapeutic gain, in a treatment
outcome variable, not the secondary outcomes. Sec-
ondary outcome measures are examined to support the
primary outcome analysis. When many secondary vari-
There is no compelling reason to incorporate in-
ables are included to identify predictors of response or
terim analyses in trials to determine efficacy because
explore for other benefits, the type I error rate may be
FGIDs are not life threatening. Moreover, because the
incidence of serious adverse events is expected to be low,
any occurrence of a serious adverse event is likely to
increase the likelihood of a type II statistical error,
prompt the safety committee to reevaluate the trial with-
rendering truly important differences nonsignificant.
out carrying out an interim analysis. Thus, interim anal-
Using descriptive rather than inferential statistics (eg,
yses in trials of FGIDs are normally only done to assess
means and confidence intervals) or reporting actual P
the futility of continuing the trial. Plans for interim
analyses should be clearly prespecified in the study pro-
Specific plans to present and analyze harms data
tocol and appropriate statistical methods to adjust for
should be clearly described and withdrawals from each
arm of the trial should be detailed. ITT is the preferred
mon method is to partition the ␣ level for the trial by
subtracting the ␣ level for the interim analysis from the
Exploratory subgroup analyses are commonly per-
␣ level intended for the final analysis. Consequently,
formed in trials of FGIDs, although their validity is
most investigators use a conservative ␣ level, such as
.001, for the interim analysis so that sufficient power is
evaluates for differences in treatment effects between
reserved for the final analysis. If an interim analysis is
complementary subgroups (eg, older and younger sub-
preplanned, ␣ sharing can be incorporated when calcu-
jects), rather than simply comparing P values for each
lating the sample size. Unplanned preliminary analysesshould be avoided; premature presentation of results may
subgroup, thereby maintaining statistical power.
affect the further conduct of the trial and can lead to the
Secondary analyses used to support an efficacy claim should be
reporting of inaccurate observations. ITT analyses. Harms data should be analyzed by ITT when
There are few guidelines for conducting interim anal-
possible, but absolute incidence rates and 95% confidence inter-
yses to assess the futility of continuing a trial. However,
to preserve the credibility of the investigators (a) suchanalyses should be overseen by a Data and Safety Moni-
toring Board that is independent from the investigators,
The protocol should present and clearly specify
(b) the analysis should test for equivalence rather than
the assumptions underlying the sample size calculation.
superiority of 1 treatment relative to the other, and (c)
These elements include the minimum effect size (differ-
liberal equivalence margins for the effect size should be
ence in primary outcome between groups) that the trial
defined a priori and will likely be wider than those
is designed to detect, the ␣ (type I) error level, the
statistical power or  (type II) error level, and when
Interim analyses are not recommended because they may
evaluating continuous outcomes (eg, difference in sever-
jeopardize the trial integrity unless there is reason to believe
ity scores), the standard deviation of the difference. Re-
participation in the trial (either in the active treatment or
cent trials have been powered to detect differences as
control group) places the patient at risk.
small as or Often, a power of80% is used ( error or type II error of 20%) and ␣ (type
I) error of 5% using a 2-sided test. An allowance for
The main result of a trial must be presented
dropouts should also be made in determining the appro-
according to the predetermined primary outcome mea-
priate sample size, but efforts should be made to keep the
sure(s). Selecting a primary outcome measure after the
dropout rate below 10%–20%. It is inappropriate for an
trial is concluded inflates the type I error rate and is
investigator to conclude, from an inadequately powered
misleading. Unexpected results that were not part of the
study that fails to find a statistically significant difference
between interventions, that the 2 interventions are
exploratory, for testing in future studies. Adherence to
study goals is strengthened when an independent advi-
A sample size calculation should be routinely performed andshould be based on the expected behavior of the primary outcomeChanging the primary outcome measure(s) in the analysisphase of a study should not be done; it invalidates the statistical
Table 2. Recommendations for Future Research
2. Veldhuyzen van Zanten SJ, Talley NJ, Bytzer P, Klein KB, Whor-
well PJ, Zinsmeister AR. Design of treatment trials for functional
1. Examine the periodicity and severity of symptoms in natural
gastrointestinal disorders. Gut 1999;45(Suppl 2):II69 –II77.
3. Talley NJ, Nyren O, Drossman DA, Heaton KW, Veldhuyzen van
2. Evaluate the multidimensional construct of symptom severity
Zanten SJO, Koch MM, Ransohoff DF. The irritable bowel syn-
(eg, frequency, number present, clustering, severity,
drome: toward optimal design of controlled treatment trials.
contribution to “global severity,” and changes in primary
Gastroenterology International 1993;189 –211.
4. Camilleri M, Northcutt AR, Kong S, Dukes GE, McSorley D,
3. Examine the influence of disease modifiers (predictors) such as
Mangel AW. Efficacy and safety of alosetron in women with
disease duration, baseline severity, psychological status,
irritable bowel syndrome: a randomised, placebo-controlled trial.
comorbidity, surgeries, and response to prior treatments.
4. Investigate what contributes to the placebo response in
5. Muller-Lissner SA, Fumagalli I, Bardhan KD, Pace F, Pecher E,
different FGIDs and how to minimize its impact on efficacy
Nault B, Ruegg P. Tegaserod, a 5-HT(4) receptor partial agonist,
relieves symptoms in irritable bowel syndrome patients with
5. Evaluate the impact of baseline observations and diagnostic
abdominal pain, bloating and constipation. Aliment Pharmacol
testing on symptoms, data quality, and treatment response.
6. Further validate adequate and satisfactory relief during clinical
6. Fass R, Longstreth GF, Pimentel M, Fullerton S, Russak SM,
Chiou CF, Reyes E, Crane P, Eisen G, McCarberg B, Ofman J.
7. Develop, validate fully, and determine minimal clinically
Evidence- and consensus-based practice guidelines for the di-
important differences for new outcome measures and disease-
agnosis of irritable bowel syndrome. Arch Intern Med 2001;161:
specific quality of life instruments. Catalog and critically
7. Cash BD, Schoenfeld P, Chey WD. The utility of diagnostic tests
8. Further evaluate and validate definitions of the treatment
responder measure(s) including a 50% reduction in symptom
in irritable bowel syndrome patients: a systematic review. Am J
severity and ensure that the definitions are clinically relevant.
9. Develop and validate trial designs for testing on-demand
8. Jones R. Likely impacts of recruitment site and methodology on
treatments for intermittent symptoms.
characteristics of enrolled patient population: irritable bowel
10. Examine the impact of CONSORT, EMEA, and Food and Drug
syndrome clinical trial design. Am J Med 1999;107:85S–90S.
Administration guidelines on study quality.
9. Talley NJ, Meineche-Schmidt V, Pare P, Duckworth M, Raisanen
P, Pap A, Kordecki H, Schmid V. Efficacy of omeprazole infunctional dyspepsia: double-blind, randomized, placebo-con-trolled trials (the Bond and Opera studies). Aliment Pharmacol
analysis and renders the conclusions of uncertain value byinflating the chances of a type I error.
10. Longstreth GF, Hawkey CJ, Mayer EA, Jones RH, Naesdal J,
Wilson IK, Peacock RA, Wiklund IK. Characteristics of patients
Concern has been raised that several negative FGID
with irritable bowel syndrome recruited from three sources:
treatment trials have not been published, overestimating
implications for clinical trials. Aliment Pharmacol Ther 2001;15:
the efficacy of some treatments and/or diminishing safety
concerns. Investigators are ethically obliged to publish
11. Camilleri M, Atanasova E, Carlson PJ, Ahmad U, Kim HJ, Vi-
ramontes BE, McKinzie S, Urrutia R. Serotonin-transporter poly-
the results of all completed studies, and journal editors
morphism pharmacogenetics in diarrhea-predominant irritable
should publish methodologically sound studies, whether
bowel syndrome. Gastroenterology 2002;123:425– 432.
results are negative or positive. Some journals now re-
12. Guthrie E, Barlow J, Fernandes L, Ratcliffe J, Read N, Thompson
quire investigators to register clinical trials before initi-
DG, Tomenson B, Creed F. Changes in tolerance to rectal dis-tension correlate with changes in psychological state in patients
ation, and failure to do so bars their publication by
with severe irritable bowel syndrome. Psychosom Med 2004;
subscribing The Cochrane Collaboration sys-
tematic reviews also underscore the need for publication
13. Spiller RC. Problems and challenges in the design of irritable
bowel syndrome clinical trials: experience from published trials. It is unethical to withhold publishing the results of a
14. Veldhuyzen van Zanten SJ, Cleary C, Talley NJ, Peterson TC,
Nyren O, Bradley LA, Verlinden M, Tytgat GN. Drug treatment of
In reviewing the relevant literature for this report, the
functional dyspepsia: a systematic analysis of trial methodologywith recommendations for design of future trials. Am J Gastro-
committee identified a number of areas that require
additional evaluation. These recommendations for future
15. Hahn B, Watson M, Yan S, Gunput D, Heuijerjans J. Irritable
bowel syndrome symptom patterns: frequency, duration, andseverity. Dig Dis Sci 1998;43:2715–2718.
16. Drossman DA, Thompson WG. The irritable bowel syndrome:
review and a graduated multicomponent treatment approach.
1. Irvine EJ, Whitehead WE, Chey WD, Matsueda K, Talley NJ, Shaw
Ann Intern Med 1992;116:1009 –1016.
M, Veldhuyzen van Zanten SJO. Design of treatment trials for
17. Whitehead WE. Control groups appropriate for behavioral inter-
functional gastrointestinal disorders. In: Drossman DA, Corazzi-
ventions. Gastroenterol 2004;126:S159 –S163.
ari E, Delvaux M, Talley NJ, Thompson WG, Spiller RC, White-
18. FDA updates warnings for cisapride. FDA Talk Paper T00-6.
head WE, eds. The functional gastrointestinal disorders: diag-
19. Camilleri M. Safety concerns about alosetron. Arch Intern Med
consensus. 3rd ed. McLean, VA: Degnon Associates, 2006.
20. Sackett DL. Bias in analytic research. J Chronic Dis 1979;32:
Thompson WG, Whitehead WE, eds. Rome II: the functional
gastrointestinal disorders. 2nd ed. McLean, VA: Degnon
21. Spilker B. Choosing and validating the clinical trial’s blind. Guide
to clinical trials. New York: Raven Press, 1991:15–20.
42. Tfelt-Hansen P, Block G, Dahlof C, Diener HC, Ferrari MD,
22. Altman DG. Randomisation. Br Med J 1991;302:1481–1482.
Goadsby PJ, Guidetti V, Jones B, Lipton RB, Massiou H, Meinert
23. Altman DG. Comparability of randomised groups. The Statisti-
C, Sandrini G, Steiner T, Winter PB. Guidelines for controlled
trials of drugs in migraine. 2nd ed. Cephalalgia 2000;20:765–
24. Spilker BI. Randomization procedures. Guide to clinical trials.
New York: Raven Press, 1991:69 –73.
43. Tytgat GN, Heading RC, Muller-Lissner S, Kamm MA, Scholmer-
25. Temple RJ. When are clinical trials of a given agent vs. placebo
ich J, Berstad A, Fried M, Chaussade S, Jewell D, Briggs A.
no longer appropriate or feasible? Control Clin Trials 1997;18:
Contemporary understanding and management of reflux and
constipation in the general population and pregnancy: a consen-
26. Guthrie E, Creed F, Dawson D, Tomenson B. A controlled trial of
sus meeting. Aliment Pharmacol Ther 2003;18:291–301.
psychological treatment for the irritable bowel syndrome. Gas-
44. Tinmouth JM, Steele LS, Tomlinson G, Glazier RH. Are claims of
equivalency in digestive diseases trials supported by the evi-
27. Borkovec TD, Nau SD. Credibility of analogue therapy rationales.
dence? Gastroenterology 2004;126:1700 –1710.
J Behav Ther Exp Psychiatry 1972;3:257–260.
45. Tillisch K, Labus JS, Naliboff BD, Bolus R, Shetzline M, Mayer
28. Drossman DA, Toner BB, Whitehead WE, Diamant NE, Dalton
EA, Chang L. Characterization of the alternating bowel habit
CB, Duncan S, Emmott S, Proffitt V, Akman D, Frusciante K, Le
subtype in patients with irritable bowel syndrome. Am J Gastro-
T, Meyer K, Bradshaw B, Mikula K, Morris CB, Blackman CJ, Hu
Y, Jia H, Li JZ, Koch GG, Bangdiwala SI. Cognitive-behavioral
46. Talley NJ, Weaver AL, Zinsmeister AR, Melton LJ III. Onset and
therapy versus education and desipramine versus placebo for
disappearance of gastrointestinal symptoms and functional
moderate to severe functional bowel disorders. Gastroenterol-
gastrointestinal disorders. Am J Epidemiol 1992;136:165–177.
47. Compliance in health care. Baltimore, MD: The Johns Hopkins
29. Thompson WG. Placebos: a review of the placebo response.
Am J Gastroenterol 2000;95:1637–1643.
48. Von KM, Moore JC. Stepped care for back pain: activating
30. Bland JM, Altman DG. Some examples of regression towards
approaches for primary care. Ann Intern Med 2001;134:911–
31. Pitz M, Cheang M, Bernstein CN. Defining the predictors of the
49. Means B, Nigam A, Zarrow M, Loftus EF, Donaldson MS. Auto-
placebo response in irritable bowel syndrome. Clin Gastroen-
biographical memory for health-related events. DHHS Publica-
tion No. PHS 89-1077. Vital and Health Statistics Series 6.
32. Howarth PH, Stern MA, Roi L, Reynolds R, Bousquet J. Double-
Cognitive and Survey Measurement. Washington, DC: US Gov-
blind, placebo-controlled study comparing the efficacy and
safety of fexofenadine hydrochloride (120 and 180 mg once
50. Sandha GS, Hunt RH, Veldhuyzen van Zanten SJ. A systematic
daily) and cetirizine in seasonal allergic rhinitis. J Allergy Clin
overview of the use of diary cards, quality-of-life questionnaires,
and psychometric tests in treatment trials of Helicobacter pylori-
33. Bachert C, Brostoff J, Scadding GK, Tasman J, Stalla-Bourdillon
positive and -negative non-ulcer dyspepsia. Scand J Gastroen-
A, Murrieta M. Mizolastine therapy also has an effect on nasal
blockade in perennial allergic rhinoconjunctivitis. RIPERAN
51. Stone AA, Shiffman S, Schwartz JE, Broderick JE, Hufford MR.
Study Group. Allergy 1998;53:969 –975.
Patient non-compliance with paper diaries. BMJ 2002;324:
34. Berger VW, Rezvani A, Makarewicz VA. Direct effect on validity of
response run-in selection in clinical trials. Control Clin Trials
52. Harding JP, Hamm LR, Ehsanullah RS, Heath AT, Sorrells SC,
Haw J, Dukes GE, Wolfe SG, Mangel AW, Northcutt AR. Use of a
35. Hills M, Armitage P. The two-period cross-over clinical trial. Br J
novel electronic data collection system in multicenter studies of
irritable bowel syndrome. Aliment Pharmacol Ther 1997;11:
36. Committee for Proprietary Medicinal Products (CPMP). Notes for
guidance on statistical principles for clinical trials. ICH/363/
53. Bardhan KD, Bodemar G, Geldof H, Schutz E, Heath A, Mills JG,
96. London, UK: European Agency for Evaluation of Medicinal
Jacques LA. A double-blind, randomized, placebo-controlled
dose-ranging study to evaluate the efficacy of alosetron in the
37. Cleophas TJ, Zwinderman AH. Limitations of randomized clinical
treatment of irritable bowel syndrome. Aliment Pharmacol Ther
trials. Proposed alternative designs. Clin Chem Lab Med 2000;
54. Payne A, Blanchard EB. A controlled comparison of cognitive
38. Committee for Proprietary Medicinal Products (CPMP). CPMP/
therapy and self-help support groups in the treatment of irritable
EWP/785/97. Points to consider on the evaluation of medicinal
bowel syndrome. J Consult Clin Psychol 1995;63:779 –786.
products for the treatment of IBS. 785/97. European Agency for
55. Whitehead WE, Palsson OS, Levy RL, Feld AD, Von Korff M,
the Evaluation of Medicinal Products, London, England 2003.
Turner M. Reports of “satisfactory relief” by IBS patients receiv-
39. Tack J, Muller-Lissner S, Bytzer P, Corinaldesi R, Chang L,
ing usual medical care are confounded by baseline symptom
Viegas A, Schnekenbuehl S, Dunger-Baldauf C, Rueegg P. A
severity and do not accurately reflect symptom improvement.
randomised controlled trial assessing the efficacy and safety of
repeated tegaserod therapy in women with irritable bowel syn-
56. Camilleri M, Mayer EA, Drossman DA, Heath A, Dukes GE,
drome with constipation (IBS-C). Gut 2005;54:1707-1713.
McSorley D, Kong S, Mangel AW, Northcutt AR. Improvement in
40. Bardhan KD. Intermittent and on-demand use of proton pump
pain and bowel function in female irritable bowel patients with
inhibitors in the management of symptomatic gastroesophageal
alosetron, a 5-HT3 receptor antagonist. Aliment Pharmacol Ther
reflux disease. Am J Gastroenterol 2003;98:S40 –S48.
41. Thompson WG, Longstreth G, Drossman DA, Heaton K, Irvine
57. Camilleri M, Chey WY, Mayer EA, Northcutt AR, Heath A, Dukes
EJ, Muller-Lissner S. Functional bowel disorders and functional
GE, McSorley D, Mangel AM. A randomized controlled clinical
abdominal pain. In: Drossman DA, Corazziari E, Talley NJ,
trial of the serotonin type 3 receptor antagonist alosetron in
women with diarrhea-predominant irritable bowel syndrome.
75. Fraser A, Delaney B, Moayyedi P. Symptom-based outcome
Arch Intern Med 2001;161:1733–1740.
measures for dyspepsia and GERD trials: a systematic review.
58. Chey WD, Chey WY, Heath AT, Dukes GE, Carter EG, Northcutt A,
Am J Gastroenterol 2005;100:442– 452.
Ameen VZ. Long-term safety and efficacy of alosetron in women
76. El-Omar EM, Banerjee S, Wirz A, McColl KE. The Glasgow Dys-
with severe diarrhea-predominant irritable bowel syndrome.
pepsia Severity Score—a tool for the global measurement of
Am J Gastroenterol 2004;99:2195–2203.
dyspepsia. Eur J Gastroenterol Hepatol 1996;8:967–971.
59. Kellow J, Lee OY, Chang FY, Thongsawat S, Mazlam MZ, Yuen H,
77. Veldhuyzen van Zanten SJ, Tytgat KM, Pollak PT, Goldie J,
Gwee KA, Bak YT, Jones J, Wagner A. An Asia-Pacific, double
Goodacre RL, Riddell RH, Hunt RH. Can severity of symptoms be
blind, placebo controlled, randomised study to evaluate the
used as an outcome measure in trials of non-ulcer dyspepsia
efficacy, safety, and tolerability of tegaserod in patients with
and Helicobacter pylori associated gastritis? J Clin Epidemiol
irritable bowel syndrome. Gut 2003;52:671– 676.
60. Nyhlin H, Bang C, Elsborg L, Silvennoinen J, Holme I, Ruegg P,
78. Moayyedi P, Duffett S, Braunholtz D, Mason S, Richards ID,
Jones J, Wagner A. A double-blind, placebo-controlled, random-
Dowell AC, Axon AT. The Leeds Dyspepsia Questionnaire: a valid
ized study to evaluate the efficacy, safety and tolerability of
tool for measuring the presence and severity of dyspepsia.
tegaserod in patients with irritable bowel syndrome. Scand J
Aliment Pharmacol Ther 1998;12:1257–1262.
79. Rabeneck L, Wristers K, Goldstein JL, Eisen G, Dedhiya SD,
61. Wyrwich KW, Tardino VM. A blueprint for symptom scales and
Burke TA. Reliability, validity, and responsiveness of severity of
responses: measurement and reporting. Gut 2004;53(Suppl
dyspepsia assessment (SODA) in a randomized clinical trial of a
COX-2-specific inhibitor and traditional NSAID therapy. Am J
62. Fallone CA, Guyatt GH, Armstrong D, Wiklund I, Degl’Innocenti A,
Heels-Ansdell D, Barkun AN, Chiba N, Zanten SJ, El Dika S,
80. Talley NJ, Verlinden M, Jones M. Validity of a new quality of life
Austin P, Tanser L, Schunemann HJ. Do physicians correctly
scale for functional dyspepsia: a United States multicenter trial
assess patient symptom severity in gastro-oesophageal reflux
of the Nepean Dyspepsia Index. Am J Gastroenterol 1999;94:
disease? Aliment Pharmacol Ther 2004;20:1161–1169.
63. Guyatt GH, Feeny DH, Patrick DL. Measuring health-related qual-
81. Guyatt G, Walter S, Norman G. Measuring change over time:
ity of life. Ann Intern Med 1993;118:622– 629.
assessing the usefulness of evaluative instruments. J Chronic
64. Mangel AW, Hahn BA, Heath AT, Northcutt AR, Kong S, Dukes
GE, McSorley D. Adequate relief as an endpoint in clinical trials
82. Ioannidis JP, Evans SJ, Gotzsche PC, O’Neill RT, Altman DG,
in irritable bowel syndrome. J Int Med Res 1998;26:76 – 81.
Schulz K, Moher D. Better reporting of harms in randomized
65. Dunger-Baldauf C, Nyhlin H, Rueegg P, Wagner A. Subject’s
trials: an extension of the CONSORT statement. Ann Intern Med
global assessment of satisfactory relief as a measure to assess
treatment effect in clinical trials in irritable bowel syndrome
83. Creed F, Fernandes L, Guthrie E, Palmer S, Ratcliffe J, Read N,
(IBS). Am J Gastroenterol 2003;98(Suppl 1):S269.
Rigby C, Thompson D, Tomenson B. The cost-effectiveness of
66. Blanchard EB, Scharff L, Payne A, Schwarz SP, Suls JM,
psychotherapy and paroxetine for severe irritable bowel syn-
Malamood H. Prediction of outcome from cognitive-behavioral
drome. Gastroenterology 2003;124:303–317.
treatment of irritable bowel syndrome. Behav Res Ther 1992;
84. Calvert EL, Houghton LA, Cooper P, Morris J, Whorwell PJ.
Long-term improvement in functional dyspepsia using hypno-
67. Wiklund IK, Junghard O, Grace E, Talley NJ, Kamm M,
therapy. Gastroenterology 2002;123:1778 –1785.
Veldhuyzen van Santen SJ, Pare P, Chiba N, Leddin DS, Bigard
85. Whitehead WE, Burnett CK, Cook EW III, Taub E. Impact of
MA, Colin R, Schoenfeld P. Quality of Life in Reflux and Dyspep-
irritable bowel syndrome on quality of life. Dig Dis Sci 1996;41:
sia patients. Psychometric documentation of a new disease-
specific questionnaire (QOLRAD) Eur J Surg Suppl 1998;583:
86. Talley NJ, Weaver AL, Zinsmeister AR. Impact of functional
dyspepsia on quality of life. Dig Dis Sci 1995;40:584 –589.
68. Drossman DA, Li Z, Toner BB, Diamant NE, Creed FH, Thompson
87. Bergner M, Bobbitt RA, Carter WB, Gilson BS. The Sickness
D, Read NW, Babbs C, Barreiro M, Bank L. Functional bowel
Impact Profile: development and final revision of a health status
disorders. A multicenter comparison of health status and devel-
measure. Med Care 1981;19:787– 805.
opment of illness severity index. Dig Dis Sci 1995;40:986 –
88. Stewart AL, Hays RD, Ware JE Jr. The MOS short-form general
health survey. Reliability and validity in a patient population.
69. Francis CY, Morris J, Whorwell PJ. The irritable bowel severity
scoring system: a simple method of monitoring irritable bowel
89. Dimenas E, Glise H, Hallerback B, Hernqvist H, Svedlund J,
syndrome and its progress. Aliment Pharmacol Ther 1997;11:
Wiklund I. Well-being and gastrointestinal symptoms among
patients referred to endoscopy owing to suspected duodenal
70. Gonsalkorale WM, Miller V, Afzal A, Whorwell PJ. Long term
ulcer. Scand J Gastroenterol 1995;30:1046 –1052.
benefits of hypnotherapy for irritable bowel syndrome. Gut
90. Patrick DL, Drossman DA, Frederick IO, Dicesare J, Puder KL.
Quality of life in persons with irritable bowel syndrome: devel-
71. Whitehead WE, Levy RL, Von Korff M, Feld AD, Palsson OS,
opment and validation of a new measure. Dig Dis Sci 1998;43:
Turner MJ, Drossman DA. Usual medical care for irritable bowel
syndrome. Aliment Pharmacol Ther 2004;20:1305–1315.
91. Borgaonkar MR, Irvine EJ. Quality of life measurement in gas-
72. Corazziari E, Bytzer P, Delvaux M, Holtmann G, Malagelada JR,
trointestinal and liver disorders. Gut 2000;47:444 – 454.
Morris J, Muller-Lissner S, Spiller RC, Tack J, Whorwell PJ.
92. Watson ME, Lacey L, Kong S, Northcutt AR, McSorley D, Hahn B,
Clinical trial guidelines for pharmacological treatment of irritable
Mangel AW. Alosetron improves quality of life in women with
bowel syndrome. Aliment Pharmacol Ther 2003;18:569 –580.
diarrhea-predominant irritable bowel syndrome. Am J Gastroen-
73. Twisk JWR. Applied longitudinal data analysis for epidemiology.
Cambridge, UK: Cambridge University Press, 2003.
93. Moher D, Schulz KF, Altman D. The CONSORT statement: re-
74. Snijders TAB, Bosler RJ. Multilevel analysis: an introduction to
vised recommendations for improving the quality of reports of
basic and advanced multilevel modeling. London: Sage, 1999.
parallel-group randomized trials. JAMA 2001;285:1987–1991.
94. Bossuyt PM, Reitsma JB, Bruns DE, Gatsonis CA, Glasziou PP,
reporting randomized trials: explanation and elaboration. Ann
Irwig LM, Lijmer JG, Moher D, Rennie D, de Vet HC. Towards
complete and accurate reporting of studies of diagnostic accu-
102. DeMets DL, Pocock SJ, Julian DG. The agonising negative trend
racy: the STARD initiative. BMJ 2003;326:41– 44.
in monitoring of clinical trials. Lancet 1999;354:1983–1988.
95. Moher D, Cook DJ, Eastwood S, Olkin I, Rennie D, Stroup DF.
103. Campbell MJ, Julious SA, Altman DG. Estimating sample sizes
Improving the quality of reports of meta-analyses of randomised
for binary, ordered categorical, and continuous outcomes in two
controlled trials: the QUOROM statement. Quality of Reporting
group comparisons. BMJ 1995;311:1145–1148.
of Meta-analyses. Lancet 1999;354:1896 –1900.
104. De Angelis C, Drazen JM, Frizelle FA, Haug C, Hoey J, Horton R,
96. Perneger TV. What’s wrong with Bonferroni adjustments. BMJ
Kotzin S, Laine C, Marusic A, Overbeke AJ, Schroeder TV, Sox
HC, Van Der Weyden MB. Clinical trial registration: a statement
97. Guyatt G, Jaeschke R, Heddle N, Cook D, Shannon H, Walter S.
from the International Committee of Medical Journal Editors.
Basic statistics for clinicians: 2. Interpreting study results: con-
fidence intervals. CMAJ 1995;152:169 –173.
105. Bero L, Rennie D. The Cochrane Collaboration. Preparing, main-
98. Nuovo J, Melnikow J, Chang D. Reporting number needed to
taining, and disseminating systematic reviews of the effects of
treat and absolute risk reduction in randomized controlled trials.
health care. JAMA 1995;274:1935–1938.
99. Gore SM, Jones G, Thompson SG. The Lancet’s statistical
review process: areas for improvement by authors. Lancet1992;340:100 –102.
Received March 2, 2005. Accepted November 3, 2005.
100. Katz MH. Multivariable analysis: a primer for readers of medical
Address requests for reprints to: E. Jan Irvine, MD, Professor of
research. Ann Intern Med 2003;138:644 – 650.
Medicine, University of Toronto, Head, Division of Gastroenterology,
101. Altman DG, Schulz KF, Moher D, Egger M, Davidoff F, Elbourne
16-054 CC Wing, Saint Michael’s Hospital, 30 Bond Street, Toronto,
D, Gotzsche PC, Lang T. The revised CONSORT statement for
Donald W. Reynolds Foundation, the Arizona Geriatric Education Center, and the Arizona Center on Aging A Resource for Providers Hyperlipidemia in Older Adults: To Treat or Not to Treat? Carol L. Howe, MD, MLS, College of Medicine, University of Arizona Barry D. Weiss, MD, College of Medicine, University of Arizona Treatment of hyperlipidemia has well-known benefits for or more pas
Ranbaxy Un indien à l'assaut des géants de la pharmacie Recherche d'articles ou de dossiers entreprises et marchés Ranbaxy Un indien à l'assaut des géants de la pharmacie | à lire aussi | Le champion indien des médicaments génériques mise sur l'innovation pour développer des produits à plus haute valeur ajoutée. Avec ses scientifiques de