R For Everyone
A Kolkata R UseR Group Initiative to teach R
What is R?
R is an open-source (free) programming language and environment mainly for statistical computing and graphics. It is similar to S language which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R was initially written by Robert Gentleman and Ross Ihaka (also known as “R & R”) of the Statistics Department of the University of Auckland.
R provides a wide variety of statistical and graphical techniques and highly extensible.
The first official R version 1.0.0 was released on 29 February 2000.
Course Structure
Introduction to R and RStudio
What is R? Why use R for data analysis?
Overview of RStudio Interface
Basic R Commands (Syntax, Console, Scripts)
Data Types and Structures in R
Vectors, Matrices, Data Frames, Lists
Loading data into R (CSV, Excel, etc.)
Understanding variable types: Numeric, Character, Factor, Logical
Data Manipulation with R
Importing, exploring, and cleaning data using
{dplyr}
Selecting, filtering, and arranging data
Summarising and transforming data with use cases
Basic Statistical Concepts
Descriptive Statistics: Mean, Median, Mode, Variance, Standard Deviation
Frequency distributions, tables, and charts
Hands-on practice: Calculating basic statistics using R functions
Inferential Statistics and Hypothesis Testing
Understanding Population vs Sample
Introduction to Hypothesis Testing (t-tests, chi-square tests)
p-value concept and significance levels
Data Visualization
Introduction to
{ggplot2}
for data visualisationCreating basic plots: scatterplots, histograms, boxplots
Statistical Modelling & Linear Regression
Introduction to Regression Analysis
Building a simple linear regression model in R
Interpreting the results of the model
Sessions on Demand
- These can be taken based on the requirements of the candidates.
Course Instructor
- Samrit Pramanik - Senior Statistical Modeler