0 Voturi pozitive0 Voturi negative

3 (de) vizualizări2 paginiHow to choose the correct statistical test

Jan 22, 2018

© © All Rights Reserved

PDF, TXT sau citiți online pe Scribd

How to choose the correct statistical test

© All Rights Reserved

3 (de) vizualizări

How to choose the correct statistical test

© All Rights Reserved

- A Short List of the Most Useful R Commands
- L1 - Introduction to Industrial Economics
- biology_mathematical_requirements.pdf
- untitleddocument 2
- Influence of Curing Age and Mix Composition on Compressive Strength of Volcanic Ash Blended Cement Laterized Concrete
- Factorial Anova
- mco3-1
- ATSWA-QA
- Cochin Port Employee Engagement
- 5702
- 04 Chapter 3
- RM
- Desarrollo de un alimentador de bagazo
- Correlations Per An
- Preliminary Exam Reviewer ECOLAB
- Statistik Ayun Ruth
- pkging.pdf
- HUBUNGAN STATUS GIZI DAN AKTIVITAS OLAHRAGA DENGAN TINGKAT KEBUGARAN JASMANI
- Interorganisational Relations and Goal Consensus.pdf
- Statistics Made Easy

Sunteți pe pagina 1din 2

178] || Click here to download free Android application for this journal

Editorial

Today statistics provides the basis for inference in most medical research. Yet, for want of exposure to statistical theory and

practice, it continues to be regarded as the Achilles heel by all concerned in the loop of research and publication – the researchers

(authors), reviewers, editors and readers.

Most of us are familiar to some degree with descriptive statistical measures such as those of central tendency and those of

dispersion. However, we falter at inferential statistics. This need not be the case, particularly with the widespread availability of

powerful and at the same time user-friendly statistical software. As we have outlined below, a few fundamental considerations

will lead one to select the appropriate statistical test for hypothesis testing. However, it is important that the appropriate statistical

analysis is decided before starting the study, at the stage of planning itself, and the sample size chosen is optimum. These cannot

be decided arbitrarily after the study is over and data have already been collected.

The great majority of studies can be tackled through a basket of some 30 tests from over a 100 that are in use. The test to

be used depends upon the type of the research question being asked. The other determining factors are the type of data being

analyzed and the number of groups or data sets involved in the study. The following schemes, based on five generic research

questions, should help.[1]

Question 1: Is there a difference between groups that are unpaired? Groups or data sets are regarded as unpaired if there is

no possibility of the values in one data set being related to or being influenced by the values in the other data sets. Different tests

are required for quantitative or numerical data and qualitative or categorical data as shown in Fig. 1. For numerical data, it is

important to decide if they follow the parameters of the normal distribution curve (Gaussian curve), in which case parametric tests

are applied. If distribution of the data is not normal or if one is not sure about the distribution, it is safer to use non-parametric

tests. When comparing more than two sets of numerical data, a multiple group comparison test such as one-way analysis of

variance (ANOVA) or Kruskal-Wallis test should be used first. If they return a statistically significant p value (usually meaning

p < 0.05) then only they should be followed by a post hoc test to determine between exactly which two data sets the difference

lies. Repeatedly applying the t test or its non-parametric counterpart, the Mann-Whitney U test, to a multiple group situation

increases the possibility of incorrectly rejecting the null hypothesis.

Question 2: Is there a difference between groups which are paired? Pairing signifies that data sets are derived by repeated

measurements (e.g. before-after measurements or multiple measurements across time) on the same set of subjects. Pairing will

also occur if subject groups are different but values in one group are in some way linked or related to values in the other group

(e.g. twin studies, sibling studies, parent-offspring studies). A crossover study design also calls for the application of paired

group tests for comparing the effects of different interventions on the same subjects. Sometimes subjects are deliberately paired

to match baseline characteristics such as age, sex, severity or duration of disease. A scheme similar to Fig. 1 is followed in paired

data set testing, as outlined in Fig. 2. Once again, multiple data set comparison should be done through appropriate multiple

group tests followed by post hoc tests.

Question 3: Is there any association between variables? The various tests applicable are outlined in Fig. 3. It should be noted

that the tests meant for numerical data are for testing the association between two variables. These are correlation tests and

Numerical data Categorical data

Parametric Otherwise 2 groups > 2 groups

2 groups 2 groups Chi-square Chi-square

Unpaired Mann-Whitney [χ2] test [χ2] test

t test 2 groups 2 groups McNemar's Cochran's

U test Wilcoxon's

Fisher's Paired t test test Q test

> 2 groups Wilcoxon's rank exact test matched

> 2 groups pairs signed McNemar's

Analysis of sum test

variance Repeated rank test test

(ANOVA) > 2 groups measures exact variants

or F test Kruskall-Wallis ANOVA > 2 groups

H test (Kruskall-Wallis Friedman's ANOVA

ANOVA)

Post-hoc (multiple group comparison) tests are to be applied in the event that Post-hoc (multiple group comparison) tests to be applied in the event that

ANOVA or its non-parametric counterpart shows a significant difference (to detect Repeated measures ANOVA or its non-parametric counterpart shows a significant

between which two groups the significant difference lies). Examples of such tests are: difference (to detect between which two data sets the significant difference lies)

• Parametric data: Tukey’s Honestly Significant Difference test (Tukey-Kramer include:

test), Newman-Keuls test, Bonferroni’s test, Dunnett’s test, Scheffe’s test, etc. • Parametric data: Wilcoxon’s matched pairs signed rank test.

• Non-parametric data: Dunn’s test. • Non-parametric data: Dunn’s test

Figure 1: Tests to address the question: Is there a difference between Figure 2: Tests to address the question: Is there a difference between

groups – unpaired (parallel and independent groups) situation? groups – paired situation?

[Downloaded free from http://www.ijo.in on Monday, December 26, 2011, IP: 125.16.60.178] || Click here to download free Android application for this journal

(quantitative method) Bland-Altman

plot (graphical method)

Both variables Otherwise 2 X 2 data Otherwise

parametric Figure 4: Tests to address the question: Is there an agreement between

assessment (screening / rating / diagnostic) techniques?

Pearson's Spearman's Relative risk Chi-square

(product moment (rank correlation [Risk ratio] test for trend

coefficient) ρ they express the strength of the association as a correlation

Logistic

correlation

coefficient) r Kendall's (rank Odds ratio coefficient. An inverse correlation between two variables is

regression

correlation depicted by a minus sign. All correlation coefficients vary

coefficient) τ in magnitude from 0 (no correlation at all) to 1 (perfect

correlation). A perfect correlation may indicate but does not

Figure 3: Tests to address the question: Is there an association

necessarily mean causality. When two numerical variables

between variables?

are linearly related to each other, a linear regression analysis

can generate a mathematical equation, which can predict

the dependent variable based on a given value of the independent variable.[2] Odds ratios and relative risks are the staple of

epidemiologic studies and express the association between categorical data that can be summarized as a 2 × 2 contingency table.

Logistic regression is actually a multivariate analysis method that expresses the strength of the association between a binary

dependent variable and two or more independent variables as adjusted odds ratios.

Question 4: Is there agreement between data sets? This can be a comparison between a new screening technique against the

standard test, new diagnostic test against the available gold standard or agreement between the ratings or scores given by different

observers. As seen from Fig. 4, agreement between numerical variables may be expressed quantitatively by the intraclass correlation

coefficient or graphically by constructing a Bland-Altman plot in which the difference between two variables x and y is plotted

against the mean of x and y. In case of categorical data, the Cohen’s Kappa statistic is frequently used, with kappa (which varies

from 0 for no agreement at all to 1 for perfect agreement) indicating strong agreement when it is > 0.7. It is inappropriate to infer

agreement by showing that there is no statistically significant difference between means or by calculating a correlation coefficient.

Question 5: Is there a difference between time-to-event trends or survival plots? This question is specific to survival analysis[3]

(the endpoint for such analysis could be death or any event that can occur after a period of time) which is characterized by

censoring of data, meaning that a sizeable proportion of the original study subjects may not reach the endpoint in question by

the time the study ends. Data sets for survival trends are always considered to be non-parametric. If there are two groups then

the applicable tests are Cox-Mantel test, Gehan’s (generalized Wilcoxon) test or log-rank test. In case of more than two groups

Peto and Peto’s test or log-rank test can be applied to look for significant difference between time-to-event trends.

It can be appreciated from the above outline that distinguishing between parametric and non-parametric data is important.

Tests of normality (e.g. Kolmogorov-Smirnov test or Shapiro-Wilk goodness of fit test) may be applied rather than making

assumptions. Some of the other prerequisites of parametric tests are that samples have the same variance i.e. drawn from the same

population, observations within a group are independent and that the samples have been drawn randomly from the population.

A one-tailed test calculates the possibility of deviation from the null hypothesis in a specific direction, whereas a two-tailed

test calculates the possibility of deviation from the null hypothesis in either direction. When Intervention A is compared with

Intervention B in a clinical trail, the null hypothesis assumes there is no difference between the two interventions. Deviation

from this hypothesis can occur in favor of either intervention in a two-tailed test but in a one-tailed test it is presumed that only

one intervention can show superiority over the other. Although for a given data set, a one-tailed test will return a smaller p value

than a two-tailed test, the latter is usually preferred unless there is a watertight case for one-tailed testing.

It is obvious that we cannot refer to all statistical tests in one editorial. However, the schemes outlined will cover the hypothesis

testing demands of the majority of observational as well as interventional studies. Finally one must remember that, there is no

substitute to actually working hands-on with dummy or real data sets, and to seek the advice of a statistician, in order to learn

the nuances of statistical hypothesis testing.

P. D. Hinduja National Hospital, Veer Savarkar Marg, Mumbai - 400 016, 1Department of Pharmacololgy,

Institute of Postgraduate Medical Education & Research (IPGME&R), 244B Acharya J. C. Bose Road, Kolkata - 700020, India.

E-mail: ijo.editor@gmail.com

References

1. Hypothesis testing and choice of statistical tests. In: Parikh MN, Hazra A, Mukherjee J, Gogtay N, editors. Research methodology simplified:

Every clinician a researcher. New Delhi: Jaypee Brothers; 2010. p. 121-8.

2. The theory of linear regression and performing a linear regression analysis. In: Petrie A, Sabin C, editors. Medical statistics at a glance. 2nd

ed. London: Blackwell Publishing; 2005. p. 70-3.

3. Wang D, Clayton T, Bakhai A. Analysis of survival data. In: Wang D, Bakhai A, editors. Clinical trials: A practical guide to design, analysis

and reporting. London: Remedica; 2006. p. 235-52.

- A Short List of the Most Useful R CommandsÎncărcat decristiansolomon1754
- L1 - Introduction to Industrial EconomicsÎncărcat degamerneville
- biology_mathematical_requirements.pdfÎncărcat dekayla
- untitleddocument 2Încărcat deapi-358905877
- Influence of Curing Age and Mix Composition on Compressive Strength of Volcanic Ash Blended Cement Laterized ConcreteÎncărcat deBabafemi Adewumi John
- Factorial AnovaÎncărcat deShanmuga Nathan
- mco3-1Încărcat deAbhishek Sarawagi
- ATSWA-QAÎncărcat deMelody Okunuga
- Cochin Port Employee EngagementÎncărcat deRaheena.N
- 5702Încărcat deCrystal Murray
- 04 Chapter 3Încărcat deAce Lawrence Rañises
- RMÎncărcat deSaba Kazi
- Desarrollo de un alimentador de bagazoÎncărcat deWilliam Villarreal
- Correlations Per AnÎncărcat dehhjsjdsd
- Preliminary Exam Reviewer ECOLABÎncărcat deJowi Sal
- Statistik Ayun RuthÎncărcat dePuskesmas Bulik
- pkging.pdfÎncărcat deImran Dost
- HUBUNGAN STATUS GIZI DAN AKTIVITAS OLAHRAGA DENGAN TINGKAT KEBUGARAN JASMANIÎncărcat deAnonymous wGAc8DYl3V
- Interorganisational Relations and Goal Consensus.pdfÎncărcat deAlberth Kamus
- Statistics Made EasyÎncărcat dehclwmganand
- Statistical AdvisorÎncărcat deSarah White
- Accounting Ratios Information: An Instrument for Business Performance AnalysisÎncărcat deEditor IJTSRD
- Programme 22-4-2012(1)Încărcat deAkash Palliwal
- 2011 H1 Prelim ModifiedÎncărcat depanshanren
- better together or not.pdfÎncărcat deMarco Cifang
- Self Study 4Încărcat denancyagarwal
- 1-s2.0-S1877705815034438-mainÎncărcat degusti
- Alternative Banking Channels and Customers’ SatisfactionÎncărcat deDr. Vijay Kumbhar
- Stats Sample BookÎncărcat deapi-3857574
- Research MethodologyÎncărcat densmu838

- Probability CheatsheetÎncărcat deClases Particulares Online Matematicas Fisica Quimica
- Data TransformationÎncărcat detulkas72
- A GeneralConstruction4ParallelizingMetropolis HastingsAlgorithmsÎncărcat detulkas72
- CLANG C2 for Windows - Jim Radigan - CppCon 2015.pptxÎncărcat detulkas72
- lecture2-0operations.pptÎncărcat detulkas72
- Gaussian QuinnÎncărcat detulkas72
- Reg Ex Cheat SheetÎncărcat deIan Flores
- Pentax Af200sÎncărcat detulkas72
- ss2Încărcat detulkas72
- Cache Op Tim Ization TechniquesÎncărcat detulkas72
- FuzzyMarkovChains.pdfÎncărcat detulkas72
- Oo Imple Fuzzy Inference EngineÎncărcat detulkas72
- JEL113Încărcat detulkas72

- Nanofins Science and ApplicationsÎncărcat deAmbrish Singh
- 717SM08Încărcat dehdlozanoa
- Australia Life cycle.pdfÎncărcat deMuhammad Imran Khan
- Reinforcement Concrete Ratio EstimatesÎncărcat debrv1980
- Zenith UAV CatalogÎncărcat deMie Dts
- Bevel Gear in SolidWorks - Exercise #5 in SolidWorks - CAD ModeÎncărcat deSAM
- Expansion JointsÎncărcat deAr Dinesh Gaikwad
- Type Kc TableÎncărcat demjinspector
- Stpm Paper 3 2013t(u)Încărcat deKwongKH
- Bitcoin Moniter Form1Încărcat deBTC BYTE MINER
- a418302 (1)Încărcat degullo01
- Ppr2013.140alrÎncărcat debiles1234
- Fracture Toughness Evaluation of WC-Co Alloys by Indentation TestingÎncărcat deFjodorSerg
- atmosphere inquiry activityÎncărcat deapi-251355123
- DCFÎncărcat deChatinont Yam
- M90_ UG_10-06Încărcat deGerardo A Cárdenas B.
- Research in Electrical Property of CdxZn1-xTe Ternary SemiconductorÎncărcat deallasrivani
- H. K. Moffatt and Tadashi Tokieda- Celt reversals: a prototype of chiral dynamicsÎncărcat deVortices3443
- Onion KaryotypeÎncărcat deUpendra Sharma Sulibele
- Horz_CHS_Brace_double cover plate_Conns.xlsÎncărcat desreenivasulu chenchugari
- nov 2016Încărcat deKhooshal Yadhav Ramphul
- AleÎncărcat deSUSMITA
- Introduction to SolidEdgeÎncărcat deusama tariq
- BC337_25Încărcat deguliver
- Skema Kertas 2 Matematik Spm 2012Încărcat deArayee Jeya
- 32_SENSORS-INDUCTIVE.pdfÎncărcat deMohamed Rashed
- Garlock Metallic Gasket Catalog[1]Încărcat dePeterSP
- maths - lesson plan 3Încărcat deapi-427319867
- TPE1 1W-Series 1W Isolated Single Output DC/DC ConvertersÎncărcat detoppower
- DMM213Încărcat demauricio.gasparini6221

## Mult mai mult decât documente.

Descoperiți tot ce are Scribd de oferit, inclusiv cărți și cărți audio de la editori majori.

Anulați oricând.