Difference between revisions of "R/C2/Overview-of-R-and-RStudio/English"

From Script | Spoken-Tutorial
Jump to: navigation, search
Line 69: Line 69:
  
 
Features of R
 
Features of R
|| * '''R '''facilitates complex operations with '''vectors, arrays''', and '''data frames'''.  
+
||  
 +
* '''R '''facilitates complex operations with '''vectors, arrays''', and '''data frames'''.  
 
* '''R '''has community support of avid developers.
 
* '''R '''has community support of avid developers.
  
Line 75: Line 76:
 
|| Show Slide
 
|| Show Slide
 
Applications of R  
 
Applications of R  
|| '''R '''is extensively used in-* Fraud detection especially in the finance and banking sectors.
+
|| '''R '''is extensively used in-
 +
* Fraud detection especially in the finance and banking sectors.
 
* Social media analysis to discover potential customers in online advertising.  
 
* Social media analysis to discover potential customers in online advertising.  
  
Line 82: Line 84:
  
 
Applications of R  
 
Applications of R  
|| * Healthcare to perform pre-clinical trials and analyze the drug-safety data.
+
||  
 +
* Healthcare to perform pre-clinical trials and analyze the drug-safety data.
 
* E-Commerce companies to analyze the purchases made by customers.  
 
* E-Commerce companies to analyze the purchases made by customers.  
  
Line 92: Line 95:
  
 
Names of the companies that use '''R'''
 
Names of the companies that use '''R'''
|| ANZ Bank:* '''ANZ Bank''', the fourth largest bank in '''Australia''', uses '''R''' for '''credit risk analysis. '''
+
|| ANZ Bank:
 +
* '''ANZ Bank''', the fourth largest bank in '''Australia''', uses '''R''' for '''credit risk analysis. '''
 
* It also uses '''R''' to fit models for mortgage loss.  
 
* It also uses '''R''' to fit models for mortgage loss.  
  
Line 99: Line 103:
  
 
Names of the companies that use '''R'''
 
Names of the companies that use '''R'''
|| Facebook:* '''Facebook''' uses '''R''' for '''exploratory data analysis, experimental analysis,''' etc.  
+
|| Facebook:
 +
* '''Facebook''' uses '''R''' for '''exploratory data analysis, experimental analysis,''' etc.  
 
* It also uses '''R''' for analyzing user behavior like status updates and profile pictures.
 
* It also uses '''R''' for analyzing user behavior like status updates and profile pictures.
  
 
|-  
 
|-  
 
|| Names of the companies that use R
 
|| Names of the companies that use R
|| Google:* '''Google''' uses '''R''' to calculate return on investment on advertising campaigns.  
+
|| Google:
 +
* '''Google''' uses '''R''' to calculate return on investment on advertising campaigns.  
 
* It also uses '''R''' to improve the efficiency of online advertising.
 
* It also uses '''R''' to improve the efficiency of online advertising.
  
Line 171: Line 177:
  
 
About '''RStudio'''  
 
About '''RStudio'''  
|| * '''RStudio''' is an '''integrated development environment''' for '''R'''.  
+
||  
 +
* '''RStudio''' is an '''integrated development environment''' for '''R'''.  
 
* It includes a '''console''' and an '''editor'''.  
 
* It includes a '''console''' and an '''editor'''.  
 
* It provides tools for plotting, '''debugging''' and '''workspace''' management.
 
* It provides tools for plotting, '''debugging''' and '''workspace''' management.
Line 241: Line 248:
  
 
Installing R and RStudio  
 
Installing R and RStudio  
|| These two tutorials will teach how to install '''R''' and '''RStudio''' on '''Linux''' and '''Windows''' OS.# Installing R and RStudio on Linux.
+
|| These two tutorials will teach how to install '''R''' and '''RStudio''' on '''Linux''' and '''Windows''' OS.
 +
# Installing R and RStudio on Linux.
 
# Installing R and RStudio on Windows.
 
# Installing R and RStudio on Windows.
  
Line 248: Line 256:
  
 
Basics of R and RStudio
 
Basics of R and RStudio
|| These five tutorials will cover the basics of '''R, Data Frames''' in '''R''' and '''RStudio.'''# Introduction to basics of R.  
+
|| These five tutorials will cover the basics of '''R, Data Frames''' in '''R''' and '''RStudio.'''
 +
# Introduction to basics of R.  
 
# Introduction to Data Frames in R
 
# Introduction to Data Frames in R
 
# Introduction to RStudio
 
# Introduction to RStudio
Line 264: Line 273:
 
# Creating Matrices using Data Frames
 
# Creating Matrices using Data Frames
 
# Operations on Matrices and Data Frames
 
# Operations on Matrices and Data Frames
 
  
 
|-  
 
|-  
Line 273: Line 281:
 
# Data types and Factors
 
# Data types and Factors
 
# Lists and its Operations
 
# Lists and its Operations
 
  
 
|-  
 
|-  
Line 280: Line 287:
 
Visualizing data using plots
 
Visualizing data using plots
 
|| Next, we will learn how to visualize our data using simple plots and charts.  
 
|| Next, we will learn how to visualize our data using simple plots and charts.  
 
  
 
These two tutorials will teach how to draw '''histograms, pie charts, bar charts, '''etc.  
 
These two tutorials will teach how to draw '''histograms, pie charts, bar charts, '''etc.  
 
# Plotting Histograms and Pie Chart
 
# Plotting Histograms and Pie Chart
 
# Plotting Bar Charts and Scatter Plot
 
# Plotting Bar Charts and Scatter Plot
 
  
 
|-  
 
|-  
Line 293: Line 298:
 
|| After this, we will see how to customize our plots using the '''ggplot2 package. '''
 
|| After this, we will see how to customize our plots using the '''ggplot2 package. '''
  
 
+
These two tutorials will teach how to use the '''functions''' available in the '''ggplot2 package'''.  
These two tutorials will teach how to use the '''functions''' available in the '''ggplot2 package'''. # Introduction to ggplot2
+
# Introduction to ggplot2
 
# Aesthetic Mapping in ggplot2
 
# Aesthetic Mapping in ggplot2
  
Line 305: Line 310:
  
  
These three tutorials will teach how to use the '''dplyr package''' and '''pipe operator'''. # Data Manipulation using dplyr Package
+
These three tutorials will teach how to use the '''dplyr package''' and '''pipe operator'''.  
 +
# Data Manipulation using dplyr Package
 
# More functions in the dplyr Package
 
# More functions in the dplyr Package
 
# Pipe Operator
 
# Pipe Operator
Line 314: Line 320:
  
 
Using conditional statements  
 
Using conditional statements  
|| At last, we will learn how to use '''conditional statements''' in '''R''' and to create '''user-defined functions.'''# Conditional Statements
+
|| At last, we will learn how to use '''conditional statements''' in '''R''' and to create '''user-defined functions.'''
 +
# Conditional Statements
 
# Functions in R  
 
# Functions in R  
  
Line 322: Line 329:
  
 
Advanced Level Topics
 
Advanced Level Topics
|| * Later on, we may add a few more advanced level topics in this series.
+
||  
 +
* Later on, we may add a few more advanced level topics in this series.
 
* Please visit this series at regular intervals for the latest updates.
 
* Please visit this series at regular intervals for the latest updates.
  
Line 364: Line 372:
 
|| Do you have questions in THIS Spoken Tutorial?
 
|| Do you have questions in THIS Spoken Tutorial?
  
Choose the minute and second where you have the question.Explain your question briefly.
+
Choose the minute and second where you have the question. Explain your question briefly.
  
 
Someone from the FOSSEE team will answer them.
 
Someone from the FOSSEE team will answer them.

Revision as of 23:28, 13 May 2020

Title of the script: Overview of R and RStudio

Author: Madhuri Ganapathi and Sudhakar Kumar (IIT Bombay)

Keywords: R, RStudio, command-line interface, pie chart, programming, video tutorial

Visual Cue Narration
Show slide

Opening Slide

Welcome to this spoken tutorial on Overview of R and RStudio.
Show slide

Learning Objectives

In this tutorial, we will learn:
  • About R programming language
  • Applications of R
  • About RStudio
  • Topics covered in this series
Show slide

System Specifications

This tutorial is recorded on
  • Ubuntu Linux OS version 18.04
  • R version 3.4.4
  • RStudio version 1.2.5033

It is recommended to install R version 3.2.0 or higher.

Show slide

Prerequisites

To follow this series, the learner should have knowledge of basic Mathematics and Statistics.
Show slide

About R

Let us first see what R is.
  • R is a programming language developed by Ross Ihaka and Robert Gentleman in 1993.
  • R is used in statistics, data analysis, and machine learning algorithms.
  • It is a free/libre open source software.
Show slide

Features of R

Some of its salient features are:
  • R provides support for both procedural programming and object-oriented programming.
  • R can be integrated with other programming languages like C, C++, Python, Java, etc.
Show slide

Features of R

  • R is capable of effective data handling and storage.
  • R has more than 10,000 packages in its repository.
  • R is cross-platform compatible.
Show slide

Features of R

  • R facilitates complex operations with vectors, arrays, and data frames.
  • R has community support of avid developers.
Show Slide

Applications of R

R is extensively used in-
  • Fraud detection especially in the finance and banking sectors.
  • Social media analysis to discover potential customers in online advertising.
Show Slide

Applications of R

  • Healthcare to perform pre-clinical trials and analyze the drug-safety data.
  • E-Commerce companies to analyze the purchases made by customers.
Now, we will learn about some of the companies that use R.
Show Slide

Names of the companies that use R

ANZ Bank:
  • ANZ Bank, the fourth largest bank in Australia, uses R for credit risk analysis.
  • It also uses R to fit models for mortgage loss.
Show Slide

Names of the companies that use R

Facebook:
  • Facebook uses R for exploratory data analysis, experimental analysis, etc.
  • It also uses R for analyzing user behavior like status updates and profile pictures.
Names of the companies that use R Google:
  • Google uses R to calculate return on investment on advertising campaigns.
  • It also uses R to improve the efficiency of online advertising.
Web browser

https://www.r-project.org/

Let us now see the homepage of R.

This is the homepage of the R Project for Statistical Computing.

Highlight download R on the home page Here, we can download R.
Highlight About R on the right side of the home page Under the R Project section, click on About R.

This will tell us more about R and its environment.

I will scroll down the page.
Highlight Getting Help on the right side of the home page Under the section Help With R, click on Getting Help to know how to get help.
Highlight the Terminal on your computer Now we will see the command line interface of R.

I have already installed R on my machine. So, I will launch R from the terminal.

Press Ctrl + Alt + T keys on the keyboard.

Highlight the Terminal.

Type R in the terminal.

Press Ctrl + Alt + T keys on the keyboard to open a terminal window.

Now, in the terminal, type capital R and press Enter.

Highlight R command-line interface The command-line interface of R opens.

In the terminal, scroll up.

Notice that R version 3.4.4 is installed on my machine.

Now, scroll down.

Highlight R command-line interface Here, we can type the commands to be executed in R.

As an example, we’ll evaluate the value of 10 raised to the power of 0.1.

R console:

Type 10^0.1

Type 10 carat sign 0.1 and press Enter.
Highlight the value in the Terminal The expected value is being displayed on the R terminal window.
Now, we will learn about RStudio.
Show slide

About RStudio

  • RStudio is an integrated development environment for R.
  • It includes a console and an editor.
  • It provides tools for plotting, debugging and workspace management.
Web browser

https://rstudio.com/

Let us now see the homepage of RStudio.

There is a lot of information about RStudio on this page.

You can go through it in detail, later.

Now let’s get familiar with the RStudio interface.

I have already installed RStudio on my machine.

So, I will launch RStudio using the terminal.

Highlight the Terminal

Type rstudio and press Enter.

Press Ctrl + Alt + T keys together to open a new terminal window.

In the terminal, type rstudio and press Enter.

Highlight the Console Window RStudio is launched.

In the bottom left panel of RStudio, we can see the Console window.

In the Console, R is waiting for us to type a command.

Let us draw a simple pie chart.

RStudio

pie(1:5)

In the Console window, type p i e and in parentheses one colon five.

Press Enter.

Highlight the pie chart in the Plots window The expected pie chart is generated in the Plots window.
Highlight Environment and History window Apart from this, we have Environment and History tabs.

Here all the used variables and commands are stored.

Highlight Files window.


Point to the current working directory.

In the bottom right panel of RStudio, we have the Files tab.

Click on the Files tab.

It shows the current working directory.

We will learn about other tabs later in this series.

Let’s now have a look at the content that will be covered in the R series.
Show Slide

Installing R and RStudio

These two tutorials will teach how to install R and RStudio on Linux and Windows OS.
  1. Installing R and RStudio on Linux.
  2. Installing R and RStudio on Windows.
Show Slide

Basics of R and RStudio

These five tutorials will cover the basics of R, Data Frames in R and RStudio.
  1. Introduction to basics of R.
  2. Introduction to Data Frames in R
  3. Introduction to RStudio
  4. Introduction to R script
  5. Working directories in RStudio
Show Slide

Basics of data frames and lists

We’ll also learn how to store and analyze our Data Frames.

These three tutorials and the next three tutorials will teach the basics of using Data frames and lists.

  1. Indexing and Slicing Data Frames
  2. Creating Matrices using Data Frames
  3. Operations on Matrices and Data Frames
Show Slide

Basics of data frames and lists

# Merging and Importing Data
  1. Data types and Factors
  2. Lists and its Operations
Show Slide

Visualizing data using plots

Next, we will learn how to visualize our data using simple plots and charts.

These two tutorials will teach how to draw histograms, pie charts, bar charts, etc.

  1. Plotting Histograms and Pie Chart
  2. Plotting Bar Charts and Scatter Plot
Show Slide

Customizing plots using ggplot2

After this, we will see how to customize our plots using the ggplot2 package.

These two tutorials will teach how to use the functions available in the ggplot2 package.

  1. Introduction to ggplot2
  2. Aesthetic Mapping in ggplot2


Show Slide

Manipulating data using dplyr

Then we will learn how to manipulate our data using the dplyr package and pipe operator.


These three tutorials will teach how to use the dplyr package and pipe operator.

  1. Data Manipulation using dplyr Package
  2. More functions in the dplyr Package
  3. Pipe Operator


Show Slide

Using conditional statements

At last, we will learn how to use conditional statements in R and to create user-defined functions.
  1. Conditional Statements
  2. Functions in R


Show Slide

Advanced Level Topics

  • Later on, we may add a few more advanced level topics in this series.
  • Please visit this series at regular intervals for the latest updates.


This completes the overview of spoken tutorials on the R series.
Let us summarize.
Show slide

Summary

In this tutorial, we have learned
  • About R programming language
  • Applications of R
  • About RStudio and
  • Topics covered in this series


Show slide

About the Spoken Tutorial Project

The video at the following link summarises the Spoken Tutorial project.

Please download and watch it.

Show slide

Spoken Tutorial Workshops

We conduct workshops using Spoken Tutorials and give certificates.


Please contact us.

Show Slide

Spoken Tutorial Forum to answer questions

Do you have questions in THIS Spoken Tutorial?

Choose the minute and second where you have the question. Explain your question briefly.

Someone from the FOSSEE team will answer them.

Please visit this site.

Show Slide

Spoken Tutorial Forum for specific questions:

The Spoken Tutorial forum is for specific questions on this tutorial.

Please do not post unrelated and general questions on them.

This will help reduce the clutter.

With less clutter, we can use these discussions as instructional material.

Show Slide

Forum to answer questions

Do you have any general/technical questions?

Please visit the forum given in the link.

Show Slide

Textbook Companion

The FOSSEE team coordinates the coding of solved examples of popular books.

We give honorarium and certificates to those who do this.

For more details, please visit these sites.

Show Slide

Acknowledgment

The Spoken Tutorial project is funded by NMEICT, MHRD, Govt. of India
Show Slide

Thank You

This tutorial is contributed by Sudhakar Kumar and Madhuri Ganapathi from IIT Bombay. Thank you for watching.

Contributors and Content Editors

Madhurig, Nancyvarkey, Sudhakarst