×

Warning

The RokSprocket Module needs the RokSprocket Component enabled.

Introduction to Biostatistics

Introduction to Biostatistics

Learn about key biostatistical concepts and efficient tools for summarising and plotting data as well as outlier detection. Demystify the statistical testing approach used to make decision in the presence of uncertainty: p-values, power, and so on. 

Course Details

Learning Objectives

Learning Objectives:

Upon completion of this course, participants will be able to:

  • Understand the difference between descriptive and inferential statistics
  • Appreciate the value of exploratory methods in preliminary data analysis and experimental design construction
  • Explore, characterize and identify problems and trends in data using graphical tools
  • Use descriptive statistics to summarize data
  • Understand the concepts of hypothesis testing, confidence intervals, risk and power
  • Identify the appropriate statistical test based on the study objective
  • Understand the importance of sample size calculations and the required input parameters to estimate a sample size
  • Analyze data more quickly and more accurately
  • To interpret results reliably and confidently

Target Audience

Target Audience:

This applied training session in statistics is aimed at all who collect data and who must make decisions based on that data.

Prerequisite

Prerequisite:

No formal knowledge of statistical tools is required to attend the class.

Course Outline

Course Outline:
  • Why Do we Need Statistics?
  • Descriptive or Exploratory Data Analysis "EDA"
    • Overview and Goals
    • Importance of Identifying the Type and Role of Variables in Studies
    • Visualizing and Summarizing Data: The Concept of a Distribution
      • Graphical Tools: histogram, Box-plot
      • Numerical Tools: mean, median, standard deviation, standard error, etc.
    • Exploring the relationship between two (2) variables
      • Frequency tables for categorical variables
        • Relative risks, odds ratios
      • Pearson's correlation coefficient for continuous variables
      • Plots: Scatter plots, box-plots, etc.
  • Statistical inference or hypothesis testing
    • Overview: What is statistical inference?
    • Statistical Inference with Hypothesis Testing:
      • Null and alternative hypotheses
      • One-tailed vs. two-tailed tests
      • Test statistics: t-test, F-tests, etc.
      • Observed significance level or "p-value"
      • Statistical significance and decision rules
    • Risk involved in hypothesis testing
      • Risks or type I and II errors
      • Confidence level of a test
      • Power of test
    • The importance of sample size calculations and the required input parameters to estimate a sample size
    • Statistical inference with confidence Intervals: interpretation and usage
    • Statistical Inference for a Single Sample or Group: Hypothesis Testing vs. Confidence Interval Approach
    • Statistical Inference for two Categorical Variables: The chi-square test
  • Summary

Practical Info

Practical Info:

Recommended Duration: 1 day

Course Materials:

  • Course notes on statistical methods
  • Sample datasets
  • Related Sessions

    • This module offers an easy introduction to R programming. Learn the basics of R programming and the commonly used plots and statistical tools without pain.

    • Predictive analytics (PA) is on everyone's lips. But what is it really all about? Discover its principle, implementation, typical pitfalls and good practices. Learn about data wrangling and munging, a crucial step in predictive analytics. An overview of the most commonly used models is also presented.

    • Efficient experiments must be large enough to detect meaningful scientific differences and maximize the use of available resources. Learn about sample size and power calculations.

    • Learn how to take data (consumers, genes, stores, ...) and organise them into homogeneous groups for use in many applications, such as market analysis and biomedical data analysis, or as a pre-processing step for many data mining tasks. Cluster analysis comprises a collection of powerful techniques. Learn about this very active field of research in statistics and data mining, and discover new techniques.