search this site.

200709P - MIXED DATA (Quantitative & Qualitative) ANALYSIS

Print Friendly and PDFPrint Friendly

Presented in the Biostatistics module of the Clinical Research Coordinators Course on July 9, 2020 13.00-15.00 by Professor Omar Hasan Kasule MB ChB (MUK), MPH (Harvard), DrPH (Harvard)  Professor of Epidemiology and Bioethics King Fahad Medical City


1.0 TYPES OF STATISTICAL ANALYSIS

Univariate: compare a single mean or proportion against a fixed number

Bivariate: compare two variables

Multivariate: compare more than 2 variables


2.0 PURPOSE OF STATISTICAL ANALYSIS

Testing the null hypothesis (H0): eg there is no difference in height between males and females

Use the appropriate test statistic: Student T-test, F test, chi-square test

Get a p-value and decide

If p<0.05 reject the null hypothesis

If p>0.05 do not reject the null hypothesis 


3.0 STUDENT T-TEST (compare means in 2 groups)

Dependent variable: continuous

Independent variable: categorical

Test statistic: Student T-test

example: height (cm) by gender (male and female)


4.0 ANALYSIS OF VARIANCE, ANOVA (compare means in 3 groups or more)

Dependent variable: continuous

Independent variable: categorial

Test statistic: F statistic

Example: Weight (Kg) by color preference (blue, red, green, yellow)


5.0 CHISQUARE TEST (compare proportions in 2 groups) 

Dependent: categorical

Independent: categorical

Test statistic: Chi-square statistics

Example: wearing glasses (yes, no) by gender (male, female)

 

6.0 2 x 2 Contingency table rows (male, female) columns (glasses+ and glasses)

 

7.0 CHISQUARE TEST (compare proportions in 3 or more) 

Dependent: categorical

Independent: categorical

Test statistic: Chi-square statistic. 

Example: wearing glasses (yes, no) by region of birth (East, North, West, Central, West) 

This test tells us there is some relationship among the variables but cannot tell what is related to what?


8.0 2 x 5 contingency table rows (glasses+, glasses-)  columns (East, North, West, Central, West)


9.0 LINEAR REGRESSION (modeling data)

Dependent (y): continuous

Independent (x): continuous

Test statistic: linear regression coefficient and t-test

Example: weight by height and shoe size


10.0 LINEAR REGRESSION EQUATION

Weight = a + b (height)

  a = intercept

             b = regression coefficient


11.0 LOGISTIC REGRESSION (modeling data)

Dependent(y): dichotomous

Independent(x): all categorical, a mixture of categorical and continuous, 

Test statistic: logistic regression coefficient and t-test

Example: glasses by gender and type of primary school and color preference


12.0 LOGISTIC REGRESSION EQUATION

Equation: Weight = a + b1 (height) + b2 (school) + b3 (color)

  a = intercept

                           b1, b2 = regression coefficient


13.0 COX REGRESSION 

Dependent(y): dichotomous (varies with time)

Independent(x): categorical and continuous

Cox regression is used to analyze survival in cancer trials

Example: death (yes/no) by type of time at death(day), treatment(new/old), gender(male/female), hemoglobin level (gm/cc)


14.0 GRAPH OF COX REGRESSION SHOWING SURVIVAL BYB TIME AND TREATMENT


15.0 LOG-LINEAR ANALYSIS

Dependent: nominal

Independent: categorical

Example: gender (male/female) by wearing glasses (yes/no) and color preference (blue, red, green, yellow).