Strengthening the Reporting of Observational Studies in Epidemiology(STROBE):Explanation and Elaboration
Jan P.Vandenbroucke1,Erik von Elm2,3,Douglas G.Altman4,Peter C.Gøtzsche5,Cynthia D.Mulrow6,Stuart J.Pocock7, Charles Poole8,James J.Schlesselman9,Matthias Egger2,10*for the STROBE Initiative
1Department of Clinical Epidemiology,Leiden University Medical Center,Leiden,The Netherlands,2Institute of Social&Preventive Medicine(ISPM),University of Bern,Bern, Switzerland,3Department of Medical Biometry and Medical Informatics,University Medical Centre,Freiburg,Germany,4Cancer Research UK/NHS Centre for Statistics in Medicine,Oxford,United Kingdom,5Nordic Cochrane Centre,Rigshospitalet,Copenhagen,Denmark,6University of Texas Health Science Center,San Antonio,United States of America,7Medical Statistics Unit,London School of Hygiene and Tropical Medicine,London,United Kingdom,8Department of Epidemiology,University of North Carolina School of Public Health,Chapel Hill,United States of America,9Department of Biostatistics,University of Pittsburgh Graduate School of Public Health,and University of Pittsburgh Cancer Institute,Pittsburgh,United States of America,10Department of Social Medicine,University of Bristol,Bristol,United Kingdom
Funding:The initial STROBE
workshop was funded by the
European Science Foundation(ESF).
Additional funding was received from the Medical Research Council Health Services Research Collaboration and the National Health Services Research& Development Methodology Programme.The funders had no role in study design,data collection and analysis,decision to publish,or preparation of the manuscript. Competing Interests:The authors have declared that no competing interests exist.
Citation:Vandenbroucke JP,von Elm E,Altman DG,Gøtzsche PC,Mulrow CD,et al.(2007)Strengthening the Reporting of Observational Studies in Epidemiology(STROBE):Explanation and Elaboration.PLoS Med4(10):
e297.doi:10.1371/journal.pmed. 0040297
Received:July20,2007 Accepted:August30,2007 Published:October16,2007 Copyright:Ó2007Vandenbroucke et al.This is an open-access article distributed under the terms of the
Creative Commons Attribution License,which permits unrestricted use,distribution,and reproduction in any medium,provided the original author and source are credited.In order to encourage dissemination of the STROBE Statement,this article is freely available on the Web site of PLoS Medicine,and will also be published and made freely available by Epidemiology and Annals of Internal Medicine.The authors jointly hold the copyright of this article.For details on further use,see STROBE Web site(www.
*To whom correspondence should be addressed.E-mail:strobe@ispm. unibe.ch
Much medical research is observational.The reporting of observational studies is often of insufficient quality.Poor reporting hampers the assessment of the strengths and weaknesses of a study and the generalisability of its results.Taking into account empirical evidence and theoretical considerations,a group of methodologists,researchers,and editors developed the Strengthening the Reporting of Observational Studies in Epidemiology(STROBE)recommen-dations to improve the quality of reporting
of observational studies.The STROBE Statement consists of a checklist of22items,which relate to the title,abstract,introduction,methods, results and discussion sections of articles.Eighteen items are common to cohort studies,case-control studies and cross-sectional studies and four are specific to each of the three study designs.The STROBE Statement provides guidance to authors about how to improve the reporting of observational studies and facilitates critical appraisal and interpretation of studies by reviewers,journal editors and readers.This explanatory and elaboration document is intended to enhance the use,understanding,and dissemination of the STROBE Statement.The meaning and rationale for each checklist item are presented.For each item,one or several published examples and,where possible,references to relevant empirical studies and methodological literature are provided.Examples of useful flow diagrams are also included. The STROBE Statement,this document,and the associated Web site(www. /)should be helpful resources to improve reporting of observational research.
P L o S MEDICINE
Introduction
Rational health care practices require knowledge about the aetiology and pathogenesis,diagnosis,prog
nosis and treat-ment of diseases.Randomised trials provide valuable evi-dence about treatments and other interventions.However, much of clinical or public health knowledge comes from observational research[1].About nine of ten research papers published in clinical speciality journals describe observatio-nal research[2,3].
The STROBE Statement
Reporting of observational research is often not detailed and clear enough to assess the strengths and weaknesses of the investigation[4,5].To improve the reporting of obser-vational research,we developed a checklist of items that should be addressed:the Strengthening the Reporting of Observational Studies in Epidemiology(STROBE)Statement (Table1).Items relate to title,abstract,introduction, methods,results and discussion sections of articles.The STROBE Statement has recently been published in several journals[6].Our aim is to ensure clear presentation of what was planned,done,and found in an observational study.We stress that the recommendations are not prescriptions for setting up or conducting studies,nor do they dictate methodology or mandate a uniform presentation. STROBE provides general reporting recommendations for descriptive observational studies and studies that investigate associations between exposures and health outcomes. STROBE addresses the three main types of observational studies:cohort,case-control and cross-sectional studies. Authors
use diverse terminology to describe these study designs.For instance,‘follow-up study’and‘longitudinal study’are used as synonyms for‘cohort study’,and ‘prevalence study’as synonymous with‘cross-sectional study’. We chose the present terminology because it is in common use.Unfortunately,terminology is often used incorrectly[7] or imprecisely[8].In Box1we describe the hallmarks of the three study designs.
The Scope of Observational Research
Observational studies serve a wide range of purposes:from reporting afirst hint of a potential cause of a disease,to verifying the magnitude of previously reported associations. Ideas for studies may arise from clinical observations or from biologic insight.Ideas may also arise from informal looks at data that lead to further explorations.Like a clinician who has seen thousands of patients,and notes one that strikes her attention,the researcher may note something special in the data.Adjusting for multiple looks at the data may not be possible or desirable[9],but further studies to confirm or refute initial observations are often needed[10].Existing data may be used to examine new ideas about potential causal factors,and may be sufficient for rejection or confirmation. In other instances,studies follow that are specifically designed to overcome potential problems with previous reports.The latter studies will gather new data and will be planned for that purpose,in contrast to analyses of existing data.This leads
to diverse ,on the merits of looking at subgroups or the importance of a predetermined sample size.STROBE tries to accommodate these diverse uses of observational research-from discovery to refutation or confirmation.Where necessary we will indicate in what circumstances specific recommendations apply.
How to Use This Paper
This paper is linked to the shorter STROBE paper that introduced the items of the checklist in several journals[6], and forms an integral part of the STROBE Statement.Our intention is to explain how to report research well,not how research should be done.We offer a detailed explanation for each checklist item.Each explanation is preceded by an example of what we consider transparent reporting.This does not mean that the study from which the example was taken was uniformly well reported or well done;nor does it mean that itsfindings were reliable,in the sense that they were later confirmed by others:it only means that this particular item was well reported in that study.In addition to explanations and examples we included Boxes1–8with supplementary information.These are intended for readers who want to refresh their memories about some theoretical points,or be quickly informed about technical background details.A full understanding of these points may require studying the textbooks or methodological papers that are cited.
STROBE recommendations do not specifically address topics such as genetic linkage studies,infectious disease modelling or case reports and case series[11,12].As many of the key elements in STROBE apply to these designs,authors who report such studies may neverthelessfind our recom-mendations useful.For authors of observational studies that specifically address diagnostic tests,tumour markers and genetic associations,STARD[13],REMARK[14],and STRE-GA[15]recommendations may be particularly useful.
The Items in the STROBE Checklist
We now discuss and explain the22items in the STROBE checklist(Table1),and give published examples for each item.Some examples have been edited by removing citations or spelling out abbreviations.Eighteen items apply to all three study designs whereas four are design-specific.Starred items(for example item8*)indicate that the information should be given separately for cases and controls in case-control studies,or exposed and unexposed groups in cohort and cross-sectional studies.We advise authors to address all items somewhere in their paper,but we do not prescribe a precise location or order.For instance,we discuss the reporting of results under a number of separate items,while recognizing that authors might address several items within a single section of text or in a table.
The Items
TITLE AND ABSTRACT
1(a).Indicate the study’s design with a commonly used term in the title or the abstract.
Example
‘‘Leukaemia incidence among workers in the shoe and boot manufacturing industry:a case-control study’’[18]. Explanation
Readers should be able to easily identify the design that was used from the title or abstract.An explicit,commonly used term for the study design also helps ensure correct indexing of articles in electronic databases[19,20].
Table1.The STROBE Statement—Checklist of Items That Should Be Addressed in Reports of Observational Studies
Item
number
Recommendation
TITLE and ABSTRACT1(a)Indicate the study’s design with a commonly used term in the title or the abstract
(b)Provide in the abstract an informative and balanced summary of what was done and what was found
INTRODUCTION
Background/
rationale
2Explain the scientific background and rationale for the investigation being reported
Objectives3State specific objectives,including any prespecified hypotheses
METHODS
Study design4Present key elements of study design early in the paper
Setting5Describe the setting,locations,and relevant dates,including periods of recruitment,exposure,follow-up,and data collection Participants6(a)Cohort study—Give the eligibility criteria,and the sources and methods of selection of participants.Describe methods of
follow-up
Case-control study—Give the eligibility criteria,and the sources and methods of case ascertainment and control selection.Give
the rationale for the choice of cases and controls
Cross-sectional study—Give the eligibility criteria,and the sources and methods of selection of participants
(b)Cohort study—For matched studies,give matching criteria and number of exposed and unexposed
Case-control study—For matched studies,give matching criteria and the number of controls per case
Variables7Clearly define all outcomes,exposures,predictors,potential confounders,and effect modifiers.Give diagnostic criteria,if applicable
Data sources/ measurement 8*For each variable of interest,give sources of data and details of methods of assessment(measurement).
Describe comparability of assessment methods if there is more than one group
Bias9Describe any efforts to address potential sources of bias
Study size10Explain how the study size was arrived at
Quantitative
variables
11Explain how quantitative variables were handled in the analyses.If applicable,describe which groupings were chosen,and why
Statistical methods 12(a)Describe all statistical methods,including those used to control for confounding
(b)Describe any methods used to examine subgroups and interactions
(c)Explain how missing data were addressed
(d)Cohort study—If applicable,explain how loss to follow-up was addressed
Case-control study—If applicable,explain how matching of cases and controls was addressed
Cross-sectional study—If applicable,describe analytical methods taking account of sampling strategy
(e)Describe any sensitivity analyses
RESULTS
Participants13*(a)Report the numbers of individuals at each stage of the study—e.g.,numbers potentially eligible,examined for eligibility,confirmed eligible,included in the study,completing follow-up,and analysed
(b)Give reasons for non-participation at each stage
(c)Consider use of a flow diagram
Descriptive data 14*(a)Give characteristics of study ,demographic,clinical,social)and in
formation on exposures and potential confounders
(b)Indicate the number of participants with missing data for each variable of interest
(c)Cohort study—Summarise follow-up ,average and total amount)
Outcome data15*Cohort study—Report numbers of outcome events or summary measures over time
Case-control study—Report numbers in each exposure category,or summary measures of exposure
Cross-sectional study—Report numbers of outcome events or summary measures
Main results16(a)Give unadjusted estimates and,if applicable,confounder-adjusted estimates and their ,95%confidence interval).
Make clear which confounders were adjusted for and why they were included
(b)Report category boundaries when continuous variables were categorized
(c)If relevant,consider translating estimates of relative risk into absolute risk for a meaningful time period
Other
analyses
17Report other analyses done—e.g.,analyses of subgroups and interactions,and sensitivity analyses
DISCUSSION
Key results18Summarise key results with reference to study objectives
Limitations19Discuss limitations of the study,taking into account sources of potential bias or imprecision.Discuss both direction and magnitude of any potential bias
Interpretation20Give a cautious overall interpretation of results considering objectives,limitations,multiplicity of analyses,results from similar stu-dies,and other relevant evidence
Generalisability21Discuss the generalisability(external validity)of the study results
OTHER INFORMATION
Funding22Give the source of funding and the role of the funders for the present study and,if applicable,
for the original study on which the present article is based
*Give such information separately for cases and controls in case-control studies,and,if applicable,for exposed and unexposed groups in cohort and cross-sectional studies.
Note:An Explanation and Elaboration article discusses each checklist item and gives methodological background and published examples of transparent reporting.The STROBE checklist is best used in conjunction with this article(freely available on the Web sites of PLoS Medicine at /,Annals of Internal Medicine at /,and Epidemiology at www.epidem/).Separate versions of the checklist for cohort,case-control,and cross-sectional studies are available on the STROBE Web site at www. /.
doi:10.1371/journal.pmed.0040297.t001
1(b).Provide in the abstract an informative and balanced summary of what was done and what was found. Example
‘‘Background:The expected survival of HIV-infected patients is of major public health interest.
Objective:To estimate survival time and age-specific mortal-ity rates of an HIV-infected population com
pared with that of the general population.
Design:Population-based cohort study.
Setting:All HIV-infected persons receiving care in Denmark from1995to2005.Patients:Each member of the nationwide Danish HIV Cohort Study was matched with as many as99persons from the general population according to sex,date of birth,and municipality of residence.
Measurements:The authors computed Kaplan–Meier life tables with age as the time scale to estimate survival from age 25years.Patients with HIV infection and corresponding persons from the general population were observed from the date of the patient’s HIV diagnosis until death,emigration,or 1May2005.
Results:3990HIV-infected patients and379872persons from the general population were included in the study, yielding22744(median,5.8y/person)and2689287(median, 8.4years/person)person-years of observation.Three percent of participants were lost to follow-up.From age25years,the median survival was19.9years(95%CI,18.5to21.3)among patients with HIV infection and51.1years(CI,50.9to51.5) among the general population.For HIV-infected patients, survival increased to32.5years(CI,29.4to34.7)during the 2000to2005period.In the subgroup that excluded persons with kn
own hepatitis C coinfection(16%),median survival was38.9years(CI,35.4to40.1)during this same period.The relative mortality rates for patients with HIV infection compared with those for the general population decreased with increasing age,whereas the excess mortality rate increased with increasing age.
Limitations:The observed mortality rates are assumed to apply beyond the current maximum observation time of10 years.
Conclusions:The estimated median survival is more than35 years for a young person diagnosed with HIV infection in the late highly active antiretroviral therapy era.However,an ongoing effort is still needed to further reduce mortality rates for these persons compared with the general population’’[21].
Explanation
The abstract provides key information that enables readers to understand a study and decide whether to read the article.Typical components include a statement of the research question,a short description of methods and results,and a conclusion[22].Abstracts should summarize key details of studies and should only present information that is provided in the article.We advise presenting key results in a numerical form that includes numbers of participants,estimates of associations and appropriate measu
res of variability and ,odds ratios with confidence intervals).We regard it insufficient to state only that an exposure is or is not significantly associated with an outcome.
A series of headings pertaining to the background,design, conduct,and analysis of a study may help readers acquire the essential information rapidly[23].Many journals require such structured abstracts,which tend to be of higher quality and more readily informative than unstructured summaries [24,25].
INTRODUCTION
The Introduction section should describe why the study was done and what questions and hypotheses it addresses.It should allow others to understand the study’s context and judge its potential contribution to current knowledge.
Box1.Main study designs covered by STROBE
Cohort,case-control,and cross-sectional designs represent different approaches of investigating the occurrence of health-related events in a given population and time period.These studies may address many types of health-related events,including disease or disease remission, disability or complications,death or survival,and the occurrence of risk factors.
In cohort studies,the investigators follow people over time.They obtain information about people and their exposures at baseline,let time pass, and then assess the occurrence of outcomes.Investigators commonly make contrasts between individuals who are exposed and not exposed or among groups of individuals with different categories of exposure. Investigators may assess several different outcomes,and examine exposure and outcome variables at multiple points during follow-up. Closed cohorts(for example birth cohorts)enrol a defined number of participants at study onset and follow them from that time forward, often at set intervals up to a fixed end date.In open cohorts the study population is dynamic:people enter and leave the population at different points in time(for example inhabitants of a town).Open cohorts change due to deaths,births,and migration,but the composition of the population with regard to variables such as age and gender may remain approximately constant,especially over a short period of time.In a closed cohort cumulative incidences(risks)and incidence rates can be estimated;when exposed and unexposed groups are compared,this leads to risk ratio or rate ratio estimates.Open cohorts estimate incidence rates and rate ratios.
In case-control studies,investigators compare exposures between people with a particular disease outcome(cases)and people without that outcome(controls).Investigators aim to collect cases and controls that are representative of an underlying cohort or a cross-section of a population.That populati
on can be defined geographically,but also more loosely as the catchment area of health care facilities.The case sample may be100%or a large fraction of available cases,while the control sample usually is only a small fraction of the people who do not have the pertinent outcome.Controls represent the cohort or population of people from which the cases arose.Investigators calculate the ratio of the odds of exposures to putative causes of the disease among cases and controls(see Box7).Depending on the sampling strategy for cases and controls and the nature of the population studied,the odds ratio obtained in a case-control study is interpreted as the risk ratio,rate ratio or(prevalence)odds ratio[16,17].The majority of published case-control studies sample open cohorts and so allow direct estimations of rate ratios.
In cross-sectional studies,investigators assess all individuals in a sample at the same point in time,often to examine the prevalence of exposures, risk factors or disease.Some cross-sectional studies are analytical and aim to quantify potential causal associations between exposures and disease. Such studies may be analysed like a cohort study by comparing disease prevalence between exposure groups.They may also be analysed like a case-control study by comparing the odds of exposure between groups with and without disease.A difficulty that can occur in any design but is particularly clear in cross-sectional studies is to establish that an exposure preceded the disease,altho
ugh the time order of exposure and outcome may sometimes be clear.In a study in which the exposure variable is congenital or genetic,for example,we can be confident that the exposure preceded the disease,even if we are measuring both at the same time.
2.Background/rationale:Explain the scientific background and rationale for the investigation being reported. Example
‘‘Concerns about the rising prevalence of obesity in children and adolescents have focused on the well docu-mented associations between childhood obesity and in-creased cardiovascular risk and mortality in adulthood. Childhood obesity has considerable social and psychological consequences within childhood and adolescence,yet little is known about social,socioeconomic,and psychological con-sequences in adult life.A recent systematic review found no longitudinal studies on the outcomes of childhood obesity other than physical health outcomes and only two longitu-dinal studies of the socioeconomic effects of obesity in adolescence.Gortmaker et al.found that US women who had been obese in late adolescence in1981were less likely to be married and had lower incomes seven years later than women who had not been overweight,while men who had been overweight were less likely to be married.Sargent et al.found that UK women,but not men,who had been obese at16years in1974earned7.4%less than their non-obese peers at age23. (...)We used longitudinal data from the19
70British birth cohort to examine the adult socioeconomic,educational, social,and psychological outcomes of childhood obesity’’[26]. Explanation
The scientific background of the study provides important context for readers.It sets the stage for the study and describes its focus.It gives an overview of what is known on a topic and what gaps in current knowledge are addressed by the study.Background material should note recent pertinent studies and any systematic reviews of pertinent studies.
3.Objectives:State specific objectives,including any prespecified hypotheses.
Example
‘‘Our primary objectives were to1)determine the prevalence of domestic violence among female patients presenting to four community-based,primary care,adult medicine practices that serve patients of diverse socio-economic background and2)identify demographic and clinical differences between currently abused patients and patients not currently being abused’’[27].
Explanation
Objectives are the detailed aims of the study.Well crafted objectives specify populations,exposures an
d outcomes,and parameters that will be estimated.They may be formulated as specific hypotheses or as questions that the study was designed to address.In some situations objectives may be less specific,for example,in early discovery phases.Regard-less,the report should clearly reflect the investigators’intentions.For example,if important subgroups or addi-tional analyses were not the original aim of the study but arose during data analysis,they should be described accord-ingly(see also items4,17and20).
METHODS
The Methods section should describe what was planned and what was done in sufficient detail to allow others to understand the essential aspects of the study,to judge whether the methods were adequate to provide reliable and valid answers,and to assess whether any deviations from the original plan were reasonable.
4.Study design:Present key elements of study design early in the paper.
s.b.poloExample
‘‘We used a case-crossover design,a variation of a case-control design that is appropriate when a brief
exposure (driver’s phone use)causes a transient rise in the risk of a rare outcome(a crash).We compared a driver’s use of a mobile phone at the estimated time of a crash with the same driver’s use during another suitable time period.Because drivers are their own controls,the design controls for characteristics of the driver that may affect the risk of a crash but do not change over a short period of time.As it is important that risks during control periods and crash trips are similar,we compared phone activity during the hazard interval(time immediately before the crash)with phone activity during control intervals(equivalent times during which participants were driving but did not crash)in the previous week’’[28]. Explanation
We advise presenting key elements of study design early in the methods section(or at the end of the introduction)so that readers can understand the basics of the study.For example,authors should indicate that the study was a cohort study,which followed people over a particular time period, and describe the group of persons that comprised the cohort and their exposure status.Similarly,if the investigation used a case-control design,the cases and controls and their source population should be described.If the study was a cross-sectional survey,the population and the point in time at which the cross-section was taken should be mentioned. When a study is a variant of the three main study types,there is an additional need for clarity.For instance,for a case-crossover study,one of the variants of the case-control design, a succinct description of the principles was given in the example above[28].
We recommend that authors refrain from simply calling a study‘prospective’or‘retrospective’because these terms are ill defined[29].One usage sees cohort and prospective as synonymous and reserves the word retrospective for case-control studies[30].A second usage distinguishes prospective and retrospective cohort studies according to the timing of data collection relative to when the idea for the study was developed[31].A third usage distinguishes prospective and retrospective case-control studies depending on whether the data about the exposure of interest existed when cases were selected[32].Some advise against using these terms[33],or adopting the alternatives‘concurrent’and‘historical’for describing cohort studies[34].In STROBE,we do not use the words prospective and retrospective,nor alternatives such as concurrent and historical.We recommend that,whenever authors use these words,they define what they mean.Most importantly,we recommend that authors describe exactly how and when data collection took place.
Thefirst part of the methods section might also be the place to mention whether the report is one of several from a study.If a new report is in line with the original aims of the study,this is usually indicated by referring to an earlier publication and by briefly restating the salient features of the study.However,the aims of a study may also evolve over time.