Difference between revisions of "R"

From Script | Spoken-Tutorial
Jump to: navigation, search
 
(14 intermediate revisions by 2 users not shown)
Line 5: Line 5:
 
'''R''' can be used for simple calculations, matrix calculations, differential equations, optimisation, statistical analysis, plotting graphs, etc. Also, it is useful to anybody who wishes to undertake extensive statistical computations and data visualization.  
 
'''R''' can be used for simple calculations, matrix calculations, differential equations, optimisation, statistical analysis, plotting graphs, etc. Also, it is useful to anybody who wishes to undertake extensive statistical computations and data visualization.  
  
The Spoken Tutorials for R series are contributed by Prof. Kannan Moudgalya, IIT Bombay, FOSSEE Fellows 2018 Shaik Sameer and Varshit Dubey, Vidhya Iyer, Madhuri Ganapati, Sudhakar Kumar, M.Tech student IIT Bombay and overall coordination for the series was done by Smita Wangikar from FOSSEE project, IIT Bombay.
+
The spoken tutorials (ST) for R series was initially created by '''Prof. Kannan Moudgalya, IIT Bombay'''.  Later, the domain expert for this series was '''Prof. Radhendushka Srivastava, Maths Dept. IIT Bombay'''. Content for this series was contributed by FOSSEE Fellows 2018 Shaik Sameer and Varshit Dubey and the tutorials were recorded by Sudhakar Kumar, M.Tech student IIT Bombay. Overall coordination for the series was done by Smita Wangikar from FOSSEE project, IIT Bombay. Madhuri Ganapati and Vidhya Iyer from Spoken Tutorial project, IIT Bombay, were the reviewers from ST end.
 +
 
 +
 
 +
'''Learners:''' Analysts, researchers, statistics students
  
 
__TOC__
 
__TOC__
Line 11: Line 14:
 
'''Note:''' Each numbered topic corresponds to a single spoken tutorial. Each bulleted point corresponds to a command or topic that must be covered in the given spoken tutorial.
 
'''Note:''' Each numbered topic corresponds to a single spoken tutorial. Each bulleted point corresponds to a command or topic that must be covered in the given spoken tutorial.
  
== Introduction to basics of R ==
+
#'''Overview of R and RStudio'''
 
+
#*About '''R''' programming language
* Version of '''R''' and '''RStudio''' used
+
#*Features of '''R'''
 
+
#*Applications of '''R'''
* Operating systems on which these run
+
#*Discuss the contents available on the '''R''' Project website
 
+
#*Use the command-line interface of '''R'''
* Quick intro to '''R''' and '''RStudio'''
+
#*About '''RStudio'''
 
+
#*Features of '''RStudio'''
* Resizing the font and window size
+
#*Discuss the contents available on the '''RStudio''' website
 
+
#*Explain the interface of '''RStudio'''
* Using <code>+</code>, <code>-</code>, <code>^</code>, <code>sqrt</code>
+
#*Topics covered in this series
 
+
#'''Installing R and RStudio on Linux'''
* Using <code>exp</code>, <code>log</code>, <code>sin</code>
+
#*Install '''R''' on Linux
 
+
#*Use the command-line interface of '''R'''
* Different ways of invoking <code>log</code>  
+
#*Show the value of the exponential function in R
 
+
#*Install <code>wget</code> utility
* Vectors using <code>seq</code> and <code>length</code>
+
#*Install <code>gdebi</code> utility
 
+
#*Install '''RStudio''' on Linux
* Using <code>pi</code>
+
#*Launch '''RStudio''' on Linux
 
+
#*Run a plot in '''RStudio'''
* Plotting a <code>sine</code> function
+
#*View packages in '''RStudio'''
 
+
#*Install packages
* Defining more points to get a smooth plot
+
#'''Installing R and RStudio on Windows'''
 
+
#*Install R on Windows 10
* Plotting with points and as line
+
#*Launch R
 
+
#*Use command-line-interface of R
* Introduction to '''help'''
+
#*Run an <code>exp</code> command in R
 
+
#*Install '''RStudio''' on Windows
 
+
#*Launch '''RStudio''' on Windows
== Introduction to data frames in R==
+
#*Run a plot in '''RStudio'''
 
+
#*Install packages
* Storing captaincy information in '''vectors'''
+
#'''Introduction to basics of R'''
 
+
#*Version of '''R''' and '''RStudio''' used
* Constructing a '''data frame''' using '''vectors'''
+
#*Operating systems on which these run
 
+
#*Quick intro to '''R''' and '''RStudio'''
* Plotting one vector of a '''data frame''' vs. another one
+
#*Resizing the font and window size
 
+
#*Using <code>+</code>, <code>-</code>, <code>^</code>, <code>sqrt</code>
* Adding a vector to a '''data frame'''
+
#*Using <code>exp</code>, <code>log</code>, <code>sin</code>
 
+
#*Different ways of invoking <code>log</code>  
* Saving a data frame into a '''csv''' file
+
#*Vectors using <code>seq</code> and <code>length</code>
 
+
#*Using <code>pi</code>
* Preventing the writing of row numbers into the '''csv''' file  
+
#*Plotting a <code>sine</code> function
 
+
#*Defining more points to get a smooth plot
* Changing the contents of a '''csv''' file through a text editor
+
#*Plotting with points and as line
 
+
#*Introduction to '''help'''
* Loading a '''csv''' file into a '''data frame'''
+
#'''Introduction to data frames in R'''
 
+
#*Storing captaincy information in '''vectors'''
* Accessing the data sets that come with '''R'''
+
#*Constructing a '''data frame''' using '''vectors'''
 
+
#*Plotting one vector of a '''data frame''' vs. another one
 
+
#*Adding a vector to a '''data frame'''
== Introduction to RStudio==
+
#*Saving a data frame into a '''csv''' file
 
+
#*Preventing the writing of row numbers into the '''csv''' file  
* Features of '''RStudio'''  
+
#*Changing the contents of a '''csv''' file through a text editor
                           
+
#*Loading a '''csv''' file into a '''data frame'''
* A look at the windows in '''RStudio''' interface:
+
#*Accessing the data sets that come with '''R'''
** '''Source''' and '''Console''' windows
+
#'''Introduction to RStudio'''
** '''Workspace''' window
+
#*Features of '''RStudio'''  
** '''Plots''' and '''Files''' window
+
#*A look at the windows in '''RStudio''' interface:
 
+
#**'''Source''' and '''Console''' windows
* Example to plot a simple data set       
+
#**'''Workspace''' window
 
+
#**'''Plots''' and '''Files''' window
* Introduction to packages in '''R'''               
+
#*Example to plot a simple data set       
 
+
#*Introduction to packages in '''R'''               
* How to find the list of packages installed in '''R'''         
+
#*How to find the list of packages installed in '''R'''         
 
+
#*Installation of '''R''' packages in '''RStudio'''
* Installation of '''R''' packages in '''RStudio'''
+
#*Loading and using '''R''' packages   
 
+
#'''Introduction to R script'''
* Loading and using '''R''' packages   
+
#*What is an '''R''' script
 
+
#*Features of '''R''' script
 
+
#*How to create and save an '''R''' script from the user interface (UI) of '''RStudio'''
== Introduction to R script==
+
#*Shortcut keys to create an '''R''' script  
 
+
#*How to use auto-completion of commands
* What is an '''R''' script
+
#*How to run an entire script
 
+
#*How to run a block of a script  
* Features of '''R''' script
+
#*How to add comments
 
+
#*How to comment an existing line  
* How to create and save an '''R''' script from the user interface (UI) of '''RStudio'''
+
#*How to load one script into another script
 
+
#'''Working Directories in RStudio'''
* Shortcut keys to create an '''R''' script  
+
#*What is working directory in '''R'''
+
#*How to know current working directory
* How to use auto-completion of commands
+
#*How to use <code>getwd</code> function
 
+
#*How to set a working directory from the user interface of '''RStudio'''  
* How to run an entire script
+
#*How to set a working directory from the '''Console''' window of '''RStudio'''  
 
+
#*How to use <code>setwd</code> function  
* How to run a block of a script  
+
#*How to read and store a '''csv''' file in '''R'''
 
+
#*How to use <code>read.csv</code> function
* How to add comments
+
#*How to view a stored '''csv''' file in '''R'''
 
+
#*How to use <code>View</code> function
* How to comment an existing line  
+
#'''Indexing and Slicing Data Frames'''
 
+
#*Shortcut key for assignment operator (<code><-</code>)
* How to load one script into another script
+
#*How to perform numeric indexing  
 
+
#*How to extract a row or column from a data frame
 
+
#*How to retrieve multiple rows from a data frame
== Working Directories in RStudio==
+
#*How to combine objects to form a vector  
 
+
#*How to perform logical indexing on a data frame
* What is working directory in '''R'''
+
#*How to perform name indexing on a data frame
 
+
#*How to slice a data frame using <code>subset</code> function  
* How to know current working directory
+
#*How to select required columns (by name) from a data frame  
 
+
#*How to retrieve data using double square brackets
* How to use <code>getwd</code> function
+
#'''Creating Matrices using Data Frames'''
 
+
#*Data required in a matrix format
* How to set a working directory from the user interface of '''RStudio'''  
+
#*Convert a data frame into a matrix
 
+
#*Create a matrix with known data  
* How to set a working directory from the '''Console''' window of '''RStudio'''  
+
#*Add two matrices
 
+
#*Subtract two matrices  
* How to use <code>setwd</code> function  
+
#*Multiply two matrices element wise
 
+
#*Perform true matrix multiplication
* How to read and store a '''csv''' file in '''R'''
+
#*Calculate the transpose of a matrix
 
+
#*Calculate the determinant of a matrix
* How to use <code>read.csv</code> function
+
#'''Operations on Matrices and Data Frames'''
+
#*How to find the inverse of a matrix  
* How to view a stored '''csv''' file in '''R'''
+
#*How to calculate the sum of elements in a matrix using <code>for</code> loop
 
+
#*How to calculate the sum of elements in a matrix using <code>sum</code> function  
* How to use <code>View</code> function
+
#*How to calculate the time elapsed in an operation
 
+
#*How to find out the sum of rows of a matrix
 
+
#*How to find out the sum of columns of a matrix
== Indexing and Slicing Data Frames==
+
#*How to add a new column or row to an existing data-frame  
 
+
#*How to use <code>cbind</code> and <code>rbind</code> function  
* Shortcut key for assignment operator (<code><-</code>)
+
#'''Merging and Importing Data'''
 
+
#*Use of built-in functions in '''R''' for exploring a data frame
* How to perform numeric indexing  
+
#*Access help in '''RStudio'''
 
+
#*Advantages of merging data frames  
* How to extract a row or column from a data frame
+
#*Merge two data frames  
 
+
#*Import data from the command line
* How to retrieve multiple rows from a data frame
+
#*Import <code>xml</code> file and <code>txt</code> file in R  
 
+
#*Import data from the user interface of '''RStudio'''  
* How to combine objects to form a vector  
+
#'''Data Types and Factors'''
 
+
#*What is an object in '''R'''
* How to perform logical indexing on a data frame
+
#*Types of '''R''' - objects  
 
+
#*What is an atomic vector in '''R'''
* How to perform name indexing on a data frame
+
#*Types of atomic vectors
 
+
#*How to find types of vectors
* How to slice a data frame using <code>subset</code> function  
+
#*Factors in '''R'''
+
#*Levels of a factor in '''R'''  
* How to select required columns (by name) from a data frame  
+
#*Identification of categorical variables
 
+
#*How to change the type of a vector
* How to retrieve data using double square brackets
+
#*How to change the values of levels
+
#'''Lists and its Operations'''
 
+
#*Lists in '''R'''  
== Creating Matrices using Data Frames==
+
#*Atomic vectors in '''R'''  
 
+
#*Difference between atomic vectors and lists in '''R'''  
* Data required in a matrix format
+
#*How to create a list  
 
+
#*How to assign names to the elements of a list
* Convert a data frame into a matrix
+
#*Named list in '''R'''  
+
#*How to access elements of a list by its index  
* Create a matrix with known data  
+
#*How to access an element of a list by its name
 
+
#*How to access an element of an element of a list
* Add two matrices
+
#*Combine two different lists
 
+
#'''Plotting Histograms and Pie Chart'''
* Subtract two matrices  
+
#*How to find the dimensions of a data frame  
 
+
#*Define a histogram
* Multiply two matrices element wise
+
#*Plot a histogram in '''R'''  
+
#*Add labels to the histogram  
* Perform true matrix multiplication
+
#*Add color to the bins of a histogram  
 
+
#*Change the number of breaks in the histogram  
* Calculate the transpose of a matrix
+
#*Define a pie chart  
 
+
#*Plotting a pie chart in '''R'''  
* Calculate the determinant of a matrix
+
#*Add a label to the pie chart  
 
+
#*Saving the plot as an image
 
+
#'''Plotting Bar Charts and Scatter Plot'''
== Operations on Matrices and Data Frames==
+
#*What is a bar chart
 
+
#*Draw a bar chart
* How to find the inverse of a matrix  
+
#*Use the <code>barplot</code> function  
 
+
#*Add labels to the bar chart
* How to calculate the sum of elements in a matrix using <code>for</code> loop
+
#*Adjust the labels of the bar chart
 
+
#*What is a scatter plot
* How to calculate the sum of elements in a matrix using <code>sum</code> function  
+
#*Draw a scatter plot
 
+
#*Use <code>plot</code> function with two objects   
* How to calculate time elapsed in an operation
+
#*Find the correlation coefficient   
 
+
#*Range of correlation coefficient
* How to find out the sum of rows of a matrix
+
#'''Introduction to ggplot2'''
 
+
#*Define visualization
* How to find out the sum of columns of a matrix
+
#*About grammar of graphics - '''ggplot2'''
 
+
#*Use of the <code>plot</code> function
* How to add a new column or row to an existing data-frame  
+
#*Add labels to a plot  
 
+
#*Change the color and type of plot
* How to use <code>cbind</code> and <code>rbind</code> function  
+
#*Plot two graphs in the same plot  
 
+
#*Add a legend to the plot
 
+
#*About '''ggplot2''' package  
== Merging and Importing Data==
+
#*Draw a scatter plot using <code>ggplot</code> function  
 
+
#*Save plots using <code>ggsave</code> function
*Use of built-in functions in '''R''' for exploring a data frame
+
#'''Aesthetic Mapping in ggplot2'''
 
+
#*Define aesthetic  
*Access help in '''RStudio'''
+
#*Need for aesthetic in plotting
+
#*Draw a scatter plot  
*Advantages of merging data frames  
+
#*Customize a scatter plot  
 
+
#*View the structure of an object  
*Merge two data frames  
+
#*View the levels of a categorical variable  
 
+
#*Draw a bar chart using <code>ggplot</code>
*Import data from command line
+
#*Add labels to a plot in <code>ggplot</code>
 
+
#*Use the <code>fill</code> argument in aesthetic mapping  
*Import <code>xml</code> file and <code>txt</code> file in R  
+
#*Draw a histogram using <code>ggplot</code>
 
+
#'''Data Manipulation using dplyr Package'''
*Import data from user interface of '''RStudio'''  
+
#*What is data visualization
 
+
#*Need for data manipulation  
 
+
#*What is '''dplyr''' package  
== Data Types and Factors==
+
#*Functions in '''dplyr''' package   
 
+
#*Install '''dplyr''' package  
* What is an object in '''R'''
+
#*Use <code>filter</code> function  
 
+
#*Use <code>filter</code> function with a logical operator  
* Types of '''R''' - objects  
+
#*Use '''match''' operator  
 
+
#*Use <code>arrange</code> function for ascending order
* What is an atomic vector in '''R'''
+
#*Use <code>arrange</code> function for descending order
 
+
#'''More functions in the dplyr Package'''
* Types of atomic vectors
+
#*Functions in the '''dplyr''' package
 
+
#*Select multiple variables in a data frame
* How to find types of vectors
+
#*Remove variables from a data frame
 
+
#*Use of <code>select</code> function
* Factors in '''R'''
+
#*Use of <code>starts_with</code> function
 
+
#*Change the name of a variable
* Levels of a factor in '''R'''  
+
#*Use of <code>rename</code> function
 
+
#*Create a new variable from existing variables
* Identification of categorical variables
+
#*Use of <code>mutate</code> function
 
+
#*Property of <code>mutate</code> function
* How to change the type of a vector
+
#'''Pipe Operator'''
 
+
#*About <code>summarise</code> function in '''dplyr''' package
* How to change the values of levels
+
#*About <code>group_by</code> function in '''dplyr''' package 
 
+
#*Difference between <code>summarise</code> and <code>group_by</code> functions
 
+
#*Use <code>summarise</code> and <code>group_by</code> functions together
== Lists and its Operations==
+
#*About '''pipe''' operator
 
+
#*Examples of '''pipe''' operator
* Lists in '''R'''  
+
#*Benefits of using '''pipe''' operator
 
+
#*Use '''ggplot2''' and '''dplyr''' package together using '''pipe'''
* Atomic vectors in '''R'''  
+
#*Plot '''boxplot'''
 
+
#*Use <code>count</code> in <code>summarise</code> function
* Difference between atomic vectors and lists in '''R'''  
+
#'''Conditional Statements'''
 
+
#*About conditional statements
* How to create a list  
+
#*Syntax of <code>if</code>, <code>else</code> and <code>else if</code> statements
 
+
#*Use <code>if</code>, <code>else</code> and <code>else if</code> statements
* How to assign names to the elements of a list
+
#*Use <code>ifelse</code> function
 
+
#*Arguments of <code>ifelse</code> function
* Named list in '''R'''  
+
#*Add a new column in an existing data frame
 
+
#*Read and store a <code>csv</code> file
* How to access elements of a list by its index  
+
#*View a data frame
 
+
#*Count true values in a column
* How to access an element of a list by its name
+
#*Use <code>sum</code> function
 
+
#'''Functions in R'''
* How to access an element of an element of a list
+
#*About functions
 
+
#*About '''built-in''' functions and '''user-defined''' functions
* Combine two different lists
+
#*Need for '''user-defined''' functions
 
+
#*Syntax of a function
 
+
#*Parts of a function
== Plotting Histograms and Pie Chart==
+
#*Create a '''user-defined''' function with arguments
 
+
#*Create a '''user-defined''' function without arguments
* How to find the dimensions of a data frame  
+
#*About <code>readline</code> function
 
+
#*Scope of variables
* Define a histogram
+
#*Use the <code>return</code> function
 
+
* Plot a histogram in '''R'''  
+
 
+
* Add labels to the histogram  
+
 
+
* Add color to the bins of a histogram  
+
 
+
* Change the number of breaks in the histogram  
+
 
+
* Define a pie chart  
+
 
+
* Plotting a pie chart in '''R'''  
+
 
+
* Add a label to the pie chart  
+
 
+
* Saving the plot as an image
+
 
+
 
+
== Plotting Bar Charts and Scatter Plot==
+
 
+
* What is a bar chart
+
 
+
* Draw a bar chart
+
 
+
* Use the <code>barplot</code> function  
+
 
+
* Add labels to the bar chart
+
 
+
* Adjust the labels of the bar chart
+
 
+
* What is a scatter plot
+
 
+
* Draw a scatter plot
+
 
+
* Use <code>plot</code> function with two objects   
+
 
+
* Find the correlation coefficient   
+
 
+
* Range of correlation coefficient
+
 
+
 
+
== Introduction to ggplot2==
+
 
+
* Define visualization
+
 
+
* About grammar of graphics - '''ggplot2'''
+
 
+
* Use of the <code>plot</code> function
+
 
+
* Add labels to a plot  
+
 
+
* Change the color and type of plot
+
 
+
* Plot two graphs in the same plot  
+
 
+
* Add a legend to the plot
+
 
+
* About '''ggplot2''' package  
+
 
+
* Draw a scatter plot using <code>ggplot</code> function  
+
 
+
* Save plots using <code>ggsave</code> function
+
 
+
 
+
== Aesthetic Mapping in ggplot2==
+
 
+
* Define aesthetic  
+
 
+
* Need for aesthetic in plotting
+
 
+
* Draw a scatter plot  
+
 
+
* Customize a scatter plot  
+
 
+
* View the structure of an object  
+
 
+
* View the levels of a categorical variable  
+
 
+
* Draw a bar chart using <code>ggplot</code>
+
 
+
* Add labels to a plot in <code>ggplot</code>
+
 
+
* Use the <code>fill</code> argument in aesthetic mapping  
+
 
+
* Draw a histogram using <code>ggplot</code>
+
 
+
 
+
== Data Manipulation using dplyr Package==
+
 
+
* What is data visualization
+
 
+
* Need for data manipulation  
+
 
+
* What is '''dplyr''' package  
+
 
+
* Functions in '''dplyr''' package   
+
 
+
* Install '''dplyr''' package  
+
 
+
* Use <code>filter</code> function  
+
 
+
* Use <code>filter</code> function with a logical operator  
+
 
+
* Use '''match''' operator  
+
 
+
* Use <code>arrange</code> function for ascending order
+
 
+
* Use <code>arrange</code> function for descending order
+

Latest revision as of 11:57, 11 October 2021

R ( http://www.r-project.org/) is an open source software - a well organized and sophisticated package - that facilitates data analysis, modeling, inferential testing and forecasting. It is a user friendly software which allows to create new function commands to solve statistical problems. It runs on a variety of UNIX platforms (and similar systems such as LINUX), Windows and Mac OS.

R is the most preferred open-source language for analytics and data science. At Microsoft, R is used by its data scientists, who apply machine learning to data from Bing, Azure, Office, and the Sales, Marketing, and Finance departments. Twitter has been using R for measuring user-experience. On the other hand, the cross-platform compatibility of R and its capacity to handle large and complex data sets make it an ideal tool for academicians to analyze data in their labs.

R can be used for simple calculations, matrix calculations, differential equations, optimisation, statistical analysis, plotting graphs, etc. Also, it is useful to anybody who wishes to undertake extensive statistical computations and data visualization.

The spoken tutorials (ST) for R series was initially created by Prof. Kannan Moudgalya, IIT Bombay. Later, the domain expert for this series was Prof. Radhendushka Srivastava, Maths Dept. IIT Bombay. Content for this series was contributed by FOSSEE Fellows 2018 Shaik Sameer and Varshit Dubey and the tutorials were recorded by Sudhakar Kumar, M.Tech student IIT Bombay. Overall coordination for the series was done by Smita Wangikar from FOSSEE project, IIT Bombay. Madhuri Ganapati and Vidhya Iyer from Spoken Tutorial project, IIT Bombay, were the reviewers from ST end.


Learners: Analysts, researchers, statistics students


Note: Each numbered topic corresponds to a single spoken tutorial. Each bulleted point corresponds to a command or topic that must be covered in the given spoken tutorial.

  1. Overview of R and RStudio
    • About R programming language
    • Features of R
    • Applications of R
    • Discuss the contents available on the R Project website
    • Use the command-line interface of R
    • About RStudio
    • Features of RStudio
    • Discuss the contents available on the RStudio website
    • Explain the interface of RStudio
    • Topics covered in this series
  2. Installing R and RStudio on Linux
    • Install R on Linux
    • Use the command-line interface of R
    • Show the value of the exponential function in R
    • Install wget utility
    • Install gdebi utility
    • Install RStudio on Linux
    • Launch RStudio on Linux
    • Run a plot in RStudio
    • View packages in RStudio
    • Install packages
  3. Installing R and RStudio on Windows
    • Install R on Windows 10
    • Launch R
    • Use command-line-interface of R
    • Run an exp command in R
    • Install RStudio on Windows
    • Launch RStudio on Windows
    • Run a plot in RStudio
    • Install packages
  4. Introduction to basics of R
    • Version of R and RStudio used
    • Operating systems on which these run
    • Quick intro to R and RStudio
    • Resizing the font and window size
    • Using +, -, ^, sqrt
    • Using exp, log, sin
    • Different ways of invoking log
    • Vectors using seq and length
    • Using pi
    • Plotting a sine function
    • Defining more points to get a smooth plot
    • Plotting with points and as line
    • Introduction to help
  5. Introduction to data frames in R
    • Storing captaincy information in vectors
    • Constructing a data frame using vectors
    • Plotting one vector of a data frame vs. another one
    • Adding a vector to a data frame
    • Saving a data frame into a csv file
    • Preventing the writing of row numbers into the csv file
    • Changing the contents of a csv file through a text editor
    • Loading a csv file into a data frame
    • Accessing the data sets that come with R
  6. Introduction to RStudio
    • Features of RStudio
    • A look at the windows in RStudio interface:
      • Source and Console windows
      • Workspace window
      • Plots and Files window
    • Example to plot a simple data set
    • Introduction to packages in R
    • How to find the list of packages installed in R
    • Installation of R packages in RStudio
    • Loading and using R packages
  7. Introduction to R script
    • What is an R script
    • Features of R script
    • How to create and save an R script from the user interface (UI) of RStudio
    • Shortcut keys to create an R script
    • How to use auto-completion of commands
    • How to run an entire script
    • How to run a block of a script
    • How to add comments
    • How to comment an existing line
    • How to load one script into another script
  8. Working Directories in RStudio
    • What is working directory in R
    • How to know current working directory
    • How to use getwd function
    • How to set a working directory from the user interface of RStudio
    • How to set a working directory from the Console window of RStudio
    • How to use setwd function
    • How to read and store a csv file in R
    • How to use read.csv function
    • How to view a stored csv file in R
    • How to use View function
  9. Indexing and Slicing Data Frames
    • Shortcut key for assignment operator (<-)
    • How to perform numeric indexing
    • How to extract a row or column from a data frame
    • How to retrieve multiple rows from a data frame
    • How to combine objects to form a vector
    • How to perform logical indexing on a data frame
    • How to perform name indexing on a data frame
    • How to slice a data frame using subset function
    • How to select required columns (by name) from a data frame
    • How to retrieve data using double square brackets
  10. Creating Matrices using Data Frames
    • Data required in a matrix format
    • Convert a data frame into a matrix
    • Create a matrix with known data
    • Add two matrices
    • Subtract two matrices
    • Multiply two matrices element wise
    • Perform true matrix multiplication
    • Calculate the transpose of a matrix
    • Calculate the determinant of a matrix
  11. Operations on Matrices and Data Frames
    • How to find the inverse of a matrix
    • How to calculate the sum of elements in a matrix using for loop
    • How to calculate the sum of elements in a matrix using sum function
    • How to calculate the time elapsed in an operation
    • How to find out the sum of rows of a matrix
    • How to find out the sum of columns of a matrix
    • How to add a new column or row to an existing data-frame
    • How to use cbind and rbind function
  12. Merging and Importing Data
    • Use of built-in functions in R for exploring a data frame
    • Access help in RStudio
    • Advantages of merging data frames
    • Merge two data frames
    • Import data from the command line
    • Import xml file and txt file in R
    • Import data from the user interface of RStudio
  13. Data Types and Factors
    • What is an object in R
    • Types of R - objects
    • What is an atomic vector in R
    • Types of atomic vectors
    • How to find types of vectors
    • Factors in R
    • Levels of a factor in R
    • Identification of categorical variables
    • How to change the type of a vector
    • How to change the values of levels
  14. Lists and its Operations
    • Lists in R
    • Atomic vectors in R
    • Difference between atomic vectors and lists in R
    • How to create a list
    • How to assign names to the elements of a list
    • Named list in R
    • How to access elements of a list by its index
    • How to access an element of a list by its name
    • How to access an element of an element of a list
    • Combine two different lists
  15. Plotting Histograms and Pie Chart
    • How to find the dimensions of a data frame
    • Define a histogram
    • Plot a histogram in R
    • Add labels to the histogram
    • Add color to the bins of a histogram
    • Change the number of breaks in the histogram
    • Define a pie chart
    • Plotting a pie chart in R
    • Add a label to the pie chart
    • Saving the plot as an image
  16. Plotting Bar Charts and Scatter Plot
    • What is a bar chart
    • Draw a bar chart
    • Use the barplot function
    • Add labels to the bar chart
    • Adjust the labels of the bar chart
    • What is a scatter plot
    • Draw a scatter plot
    • Use plot function with two objects
    • Find the correlation coefficient
    • Range of correlation coefficient
  17. Introduction to ggplot2
    • Define visualization
    • About grammar of graphics - ggplot2
    • Use of the plot function
    • Add labels to a plot
    • Change the color and type of plot
    • Plot two graphs in the same plot
    • Add a legend to the plot
    • About ggplot2 package
    • Draw a scatter plot using ggplot function
    • Save plots using ggsave function
  18. Aesthetic Mapping in ggplot2
    • Define aesthetic
    • Need for aesthetic in plotting
    • Draw a scatter plot
    • Customize a scatter plot
    • View the structure of an object
    • View the levels of a categorical variable
    • Draw a bar chart using ggplot
    • Add labels to a plot in ggplot
    • Use the fill argument in aesthetic mapping
    • Draw a histogram using ggplot
  19. Data Manipulation using dplyr Package
    • What is data visualization
    • Need for data manipulation
    • What is dplyr package
    • Functions in dplyr package
    • Install dplyr package
    • Use filter function
    • Use filter function with a logical operator
    • Use match operator
    • Use arrange function for ascending order
    • Use arrange function for descending order
  20. More functions in the dplyr Package
    • Functions in the dplyr package
    • Select multiple variables in a data frame
    • Remove variables from a data frame
    • Use of select function
    • Use of starts_with function
    • Change the name of a variable
    • Use of rename function
    • Create a new variable from existing variables
    • Use of mutate function
    • Property of mutate function
  21. Pipe Operator
    • About summarise function in dplyr package
    • About group_by function in dplyr package
    • Difference between summarise and group_by functions
    • Use summarise and group_by functions together
    • About pipe operator
    • Examples of pipe operator
    • Benefits of using pipe operator
    • Use ggplot2 and dplyr package together using pipe
    • Plot boxplot
    • Use count in summarise function
  22. Conditional Statements
    • About conditional statements
    • Syntax of if, else and else if statements
    • Use if, else and else if statements
    • Use ifelse function
    • Arguments of ifelse function
    • Add a new column in an existing data frame
    • Read and store a csv file
    • View a data frame
    • Count true values in a column
    • Use sum function
  23. Functions in R
    • About functions
    • About built-in functions and user-defined functions
    • Need for user-defined functions
    • Syntax of a function
    • Parts of a function
    • Create a user-defined function with arguments
    • Create a user-defined function without arguments
    • About readline function
    • Scope of variables
    • Use the return function

Contributors and Content Editors

Manivel, Nancyvarkey, Sudhakarst