Presented at the Scientific Writing Workshop held at the Kulliyah of Medicine UIA Kuantan on 08-09th March 2008 by Professor Omar Hasan Kasule MB ChB (MUK), MPH (Harvard), DrPH (Harvard) Professor of Epidemiology at the Institute of Medicine University of Brunei and Visiting Professor of Epidemiology at the University of Malaya
1.0 INTRODUCTION TO META ANALYSIS
1.1 DEFINITION
Meta analysis refers to quantitative statistical methods used to combine data from several independent studies in order to produce a quantitative summary statistic. It is a form of integrative systematic review of several research papers to derive a general conclusion on which policy or public health decisions can be based. It is used for studies that are similar. It is meaningless for studies that have much diversity.
1.2 HISTORICAL BACKGROUND
Review articles written by discipline leaders were the most popular method of combining findings from various studies. It was realized that they gave poor and unsatisfactory results. This is because of three serious draw-backs: (a) they were subjective and therefore prone to error and bias and (b) the reviewer was free to make decisions on what data and conclusions to emphasize. (c) Secondly reviews dealt with summarization of conclusions without looking at the data on which the conclusions were based.
A systematic meta-analysis review leaves little room for the author to impose his views or biases because of its rigorous methodology that derives final conclusions only from the data available.
Meta-analyis was first developed for randomized clinical trials. Its use in observational studies is more problematic mainly because of the difficulty of knowing and adjusting for confounders. Randomized clinical trials by their very nature have confounders balanced by the randomization process and have a lot of high quality information that can be used to adjust for any residual confounders.
1.3 ADVANTAGES OF META ANALYSIS
Meta-analysis has become popular with the proliferation of epidemiological studies on particular subjects. Writers of review articles and practicing epidemiologists would like to have some form of consensus or summary of the findings of various studies. Meta analysis enables computation of an effect estimate for a larger number of study subjects thus enabling picking up statistical significance that would be missed if analysis was based on small individual studies. Many clinical trials especially with invasive intervention cannot recruit enough patients in one center to reach statistical significance. Meta analysis provides more power than would be found in the individual studies.
Meta analysis can provide results that are more precise than the individual primary studies. Meta analysis can also help reach consensus in situations in which the primary studies have contradictory results. Meta analysis also enables study of variation across several population subgroups since it involves several individual studies carried out in various countries and populations. Meta analysis makes the process of reviewing several studies with view to reaching a general conclusion very transparent because it is based on quantitative assessments.
1.4 DIFFICULTIES OF META ANALYSIS
Meta analysis is methodologically complex because of different study designs, study analyses, and even different data quality. The major problems are: over-conclusion and bias (publication bias and selection bias) and use of wrong methods. Over-conclusion arises when a conclusion is artifactual and is not supported by the aggregated data. The results of meta-analysis based on published sources may not reflect the true situation because of existence of publication bias. Positive findings are more likely to be published than negative ones. Studies carried out in academic or government institutions are thought to be more credible and are therefore more likely to be published whereas studies by pharmaceutical firms have lower publication rates. Inadequate search for reports may lead to bias just as multiple publications of the same study data leads to bias. Bias, conscious or unconscious, may occur in the selection of studies for analysis. In some cases the methods used for meta-analysis are wrong or inappropriate
2.0 SELECTING STUDIES FOR META-ANALYSIS
2.1 PROBLEM IDENTIFICATION AND DEFINITION
Meta-analysis starts with identifying a specific health or clinical problem that requires a solution. It cannot be undertaken as a fishing trip in the hope of finding something worthwhile. The problem should be of practical importance that several researchers have undertaken research on it. Stating the problem clearly and unambiguously helps focus the meta-analytic review process. The framing of the question should include the type of process studied, the study population, the outcome(s), and the study design. The framing of the question is so important because it provides guidelines for the rest of the review process for example the search for research reports, data collection, and the analytic methods depend on the type of question. It is preferable that the question be narrowly focused. It is more efficient to carry out several focused valid analyses that to carry out one large and broad analysis. The problem can be refined or even modified as the search proceeds when the reviewer becomes aware of issues or information that was not available initially.
2.2 SEARCHING FOR APPROPRIATE STUDIES
A meta-analysis protocol has to be written in advance to avoid having to make adhoc decisions that may end up biasing the study. The protocol should set out in detail criteria for selecting studies to be analyzed and the statistical methods. Details about the studies should be provided to guide the selection of what research to include in the meta-analysis. These details should cover the study design, the source population, the type of data collected, the methodology of analysis, and the outcome measures used. These details help identify a fairly homogenous list of studies that will enable a successful meta-analysis exercise.
The search methods for research reports must be thorough and unbiased. These must be defined explicitly in the protocol. The search should be wide ranging because some research reports are found in some data bases and not others. Many research reports are not yet published or will never be published.
The search for research reports goes through various stages. A search strategy / methodology should be defined. Then the search should be carried out systematically according to the protocol. Some research reports that are retrieved are not appropriate. It is therefore necessary to revisit the list of retrieved papers using pre-fixed inclusion criteria. The process of selecting what studies to use may involve writing to authors for additional information. In some cases authors of unpublished papers have to be identified and to obtain their research results.
Electronic data bases are generally used as sources of research reports. MEDLINE (Index Medicus accessed for free as pubmed) is the most popular data base but misses many important research reports. Other data bases that can be used are: EMBASE (Excerpta Medica), Scisearch, CENTRAL, the Chinese Biomedical Literature Data Base, the Latin American Caribbean Health Sciences Literature, the Japan Information Center of Science and Technology File on Science, Technology and Medicine (JIFCS-E), The Cumulative Index of Nursing and Allied Health, and the Cochrane Collaboration.
Hand searching is manual searching of a whole journal for relevant material. It may pick up relevant studies not indexed by the electronic data bases. Hand searching is also useful for conference proceedings that are not usually included in the electronic data bases. Useful research reports can also be identified by going through reference lists of articles. Previous reviews of the same problem may yield research reports otherwise unobtainable.
Unpublished studies can provide useful information. Usually negative studies do not get published. There is no formal way of identifying unpublished studies. Attempts can also be made to identify on-going studies.
In selecting key words and formulating search and article retrieval queries we have to balance two opposing requirements: comprehensiveness and precision. A comprehensive search will pick all what is needed (high sensitivity) but it will also pick up much that is not relevant (low specificity). A more precise search will miss some relevant material (low sensitivity) but will not pick up much that is irrelevant (high specificity). More weight is given to sensitivity than to specificity.
The start of the search should be developing key words to use. The key words can be combined using Boolean logic for more precise searches. The search should be fully documented: title of data base, date and time of search, period of years searched, the key words, and queries used.
Selection of articles starts with looking at the title and the abstract to enable preliminary elimination of what is clearly irrelevant. Reading the full text may reveal that the article is not relevant even it seemed relevant from the title and the abstract.
2.3 ASSESSING THE QUALITY OF THE STUDIES
Not all papers that fulfill the inclusion criteria fulfill the quality requirements of a particular meta-analytic exercise. The quality must therefore be reviewed systematically using pre-fixed quality criteria and papers found wanting should be eliminated from the final review.
Each research report should be assessed for validity (i.e. absence of systematic confounding, selection, misclassification, detection, and follow up bias). Check lists of validity quality criteria can be used to make objective and systematic assessment of validity. Some of these scales have a total score with a cut-off score for describing studies as valid. Where check lists are used it is advised to run a pilot test.
2.4 DATA ABSTRACTION:
A data collection form is used to abstract data from the selected studies. The data form can be paper or electronic. Paper forms enable rechecking of data entered for data entry errors. Electronic forms remove the need for data entry but their errors are difficult to detect later. The name of the person abstracting the information should be recorded. Each study abstracted must be given a specific identifying number. The data collection form must be designed to allow writing notes and updates. There must also have an indication whether the study has selection eligibility and its quality was checked. Data is entered on methods of the study, description of the participants, the interventions or treatments, and outcome measures. Coding instructions should be comprehensive and clear. To avoid mistakes the form should be piloted before use.
If abstraction of the data is carried out by more than one person a system must be developed for resolving differences and reaching a consensus. The abstractors may be blinded to several aspects of the study to prevent bias. In some cases it may be necessary to contact the investigators to obtain data that is not available in the published report.
3.0 DATA ANALYSIS
Meta analysis proceeds in several stages. The first stage described is to determine for each study the effect estimate with its standard error. Then the summary statistic is computed as the weighted sum of the effect measures thus weighted average = ∑ Ei Wi / ∑ Wi where Ei = Effect estimate and Wi = weight given to the study. The Mantel-Haenszel method uses its own weights derived from knowledge of cell numbers that are often not available in published papers. In the inverse variance methods, the weight given to each study is the inverse of the variance of its effect estimate in other words Wi = 1/ Si..
The analysis may assume a random effect or a fixed effect. A random effect analysis assumes that the different studies are estimating different effects that are distributed all over the studies. The fixed effect analysis assumes that the different studies are estimating the same effect.
The standard error of the pooled effect estimate can be computed using specific formulas and can be used to get a confidence interval around the effect estimate. Meta analysis also incorporates a test of heterogeneity across studies. If there is heterogeneity there is no need for a combined effect measure.
The next step is to assess heterogeneity. A simple way is to see whether confidence intervals of effect measures for the various studies overlap or not. Overlap indicates homogeneity and no overlap indicates heterogeneity. A chi square test can in addition be used to test for heterogeneity. If heterogeneity is found the data should be rechecked for accurate abstracting. Fixed effect analysis ignores heterogeneity while random effects analysis incorporates heterogeneity. Other measures of dealing with heterogeneity are: changing the effect measure, excluding some studies, carrying out subgroup analyses, or meta-regression (the effect measure is the dependent variable and various study characteristics are the independent variables).