User Tools

Site Tools


b:r_cookbook

Table of Contents

R Cookbook

Chapter 1 Getting Started and Getting Help

Introduction

Downloading and Installing R

Starting R

Entering Commands

Exiting from R

Interrupting R

Viewing the Supplied Documentation

Getting Help on a Function

Searching the Supplied Documentation

Getting Help on a Package

Searching the Web for Help

Finding Relevant Functions and Packages

Searching the Mailing Lists

Submitting Questions to the Mailing Lists

Chapter 2 Some Basics

Introduction

Printing Something

Setting Variables

Listing Variables

Deleting Variables

Creating a Vector

Computing Basic Statistics

Creating Sequences

Comparing Vectors

Selecting Vector Elements

Performing Vector Arithmetic

Getting Operator Precedence Right

Defining a Function

Typing Less and Accomplishing More

Avoiding Some Common Mistakes

Chapter 3 Navigating the Software

Introduction

Getting and Setting the Working Directory

Saving Your Workspace

Viewing Your Command History

Saving the Result of the Previous Command

Displaying the Search Path

Accessing the Functions in a Package

Accessing Built-in Datasets

Viewing the List of Installed Packages

Installing Packages from CRAN

Setting a Default CRAN Mirror

Suppressing the Startup Message

Running a Script

Running a Batch Script

Getting and Setting Environment Variables

Locating the R Home Directory

Customizing R

Chapter 4 Input and Output

Introduction

Entering Data from the Keyboard

Printing Fewer Digits (or More Digits)

Redirecting Output to a File

Listing Files

Dealing with “Cannot Open File” in Windows

Reading Fixed-Width Records

Reading Tabular Data Files

Reading from CSV Files

Writing to CSV Files

Reading Tabular or CSV Data from the Web

Reading Data from HTML Tables

Reading Files with a Complex Structure

Reading from MySQL Databases

Saving and Transporting Objects

Chapter 5 Data Structures

Introduction

Appending Data to a Vector

Inserting Data into a Vector

Understanding the Recycling Rule

Creating a Factor (Categorical Variable)

Combining Multiple Vectors into One Vector and a Factor

Creating a List

Selecting List Elements by Position

Selecting List Elements by Name

Building a Name/Value Association List

Removing an Element from a List

Flatten a List into a Vector

Removing NULL Elements from a List

Removing List Elements Using a Condition

Initializing a Matrix

Performing Matrix Operations

Giving Descriptive Names to the Rows and Columns of a Matrix

Selecting One Row or Column from a Matrix

Initializing a Data Frame from Column Data

Initializing a Data Frame from Row Data

Appending Rows to a Data Frame

Preallocating a Data Frame

Selecting Data Frame Columns by Position

Selecting Data Frame Columns by Name

Selecting Rows and Columns More Easily

Changing the Names of Data Frame Columns

Editing a Data Frame

Removing NAs from a Data Frame

Excluding Columns by Name

Combining Two Data Frames

Merging Data Frames by Common Column

Accessing Data Frame Contents More Easily

Converting One Atomic Value into Another

Converting One Structured Data Type into Another

Chapter 6 Data Transformations

Introduction

Splitting a Vector into Groups

Applying a Function to Each List Element

Applying a Function to Every Row

Applying a Function to Every Column

Applying a Function to Groups of Data

Applying a Function to Groups of Rows

Applying a Function to Parallel Vectors or Lists

Chapter 7 Strings and Dates

Introduction

Getting the Length of a String

Concatenating Strings

Extracting Substrings

Splitting a String According to a Delimiter

Replacing Substrings

Seeing the Special Characters in a String

Generating All Pairwise Combinations of Strings

Getting the Current Date

Converting a String into a Date

Converting a Date into a String

Converting Year, Month, and Day into a Date

Getting the Julian Date

Extracting the Parts of a Date

Creating a Sequence of Dates

Chapter 8 Probability

Introduction

Counting the Number of Combinations

Generating Combinations

Generating Random Numbers

Generating Reproducible Random Numbers

Generating a Random Sample

Generating Random Sequences

Randomly Permuting a Vector

Calculating Probabilities for Discrete Distributions

Calculating Probabilities for Continuous Distributions

Converting Probabilities to Quantiles

Plotting a Density Function

Chapter 9 General Statistics

Introduction

Summarizing Your Data

Calculating Relative Frequencies

Tabulating Factors and Creating Contingency Tables

Testing Categorical Variables for Independence

Calculating Quantiles (and Quartiles) of a Dataset

Inverting a Quantile

Converting Data to Z-Scores

Testing the Mean of a Sample (t Test)

Forming a Confidence Interval for a Mean

Forming a Confidence Interval for a Median

Testing a Sample Proportion

Forming a Confidence Interval for a Proportion

Testing for Normality

Testing for Runs

Comparing the Means of Two Samples

Comparing the Locations of Two Samples Nonparametrically

Testing a Correlation for Significance

Testing Groups for Equal Proportions

Performing Pairwise Comparisons Between Group Means

Testing Two Samples for the Same Distribution

Chapter 10 Graphics

Introduction

Creating a Scatter Plot

Adding a Title and Labels

Adding a Grid

Creating a Scatter Plot of Multiple Groups

Adding a Legend

Plotting the Regression Line of a Scatter Plot

Plotting All Variables Against All Other Variables

Creating One Scatter Plot for Each Factor Level

Creating a Bar Chart

Adding Confidence Intervals to a Bar Chart

Coloring a Bar Chart

Plotting a Line from x and y Points

Changing the Type, Width, or Color of a Line

Plotting Multiple Datasets

Adding Vertical or Horizontal Lines

Creating a Box Plot

Creating One Box Plot for Each Factor Level

Creating a Histogram

Adding a Density Estimate to a Histogram

Creating a Discrete Histogram

Creating a Normal Quantile-Quantile (Q-Q) Plot

Creating Other Quantile-Quantile Plots

Plotting a Variable in Multiple Colors

Graphing a Function

Pausing Between Plots

Displaying Several Figures on One Page

Opening Additional Graphics Windows

Writing Your Plot to a File

Changing Graphical Parameters

Chapter 11 Linear Regression and ANOVA

Introduction

Performing Simple Linear Regression

Performing Multiple Linear Regression

Getting Regression Statistics

Understanding the Regression Summary

Performing Linear Regression Without an Intercept

Performing Linear Regression with Interaction Terms

Selecting the Best Regression Variables

Regressing on a Subset of Your Data

Using an Expression Inside a Regression Formula

Regressing on a Polynomial

Regressing on Transformed Data

Finding the Best Power Transformation (Box?Cox Procedure)

Forming Confidence Intervals for Regression Coefficients

Plotting Regression Residuals

Diagnosing a Linear Regression

Identifying Influential Observations

Testing Residuals for Autocorrelation (Durbin?Watson Test)

Predicting New Values

Forming Prediction Intervals

Performing One-Way ANOVA

Creating an Interaction Plot

Finding Differences Between Means of Groups

Performing Robust ANOVA (Kruskal?Wallis Test)

Comparing Models by Using ANOVA

Chapter 12 Useful Tricks

Introduction

Peeking at Your Data

Widen Your Output

Printing the Result of an Assignment

Summing Rows and Columns

Printing Data in Columns

Binning Your Data

Finding the Position of a Particular Value

Selecting Every nth Element of a Vector

Finding Pairwise Minimums or Maximums

Generating All Combinations of Several Factors

Flatten a Data Frame

Sorting a Data Frame

Sorting by Two Columns

Stripping Attributes from a Variable

Revealing the Structure of an Object

Timing Your Code

Suppressing Warnings and Error Messages

Taking Function Arguments from a List

Defining Your Own Binary Operators

Chapter 13 Beyond Basic Numerics and Statistics

Introduction

Minimizing or Maximizing a Single-Parameter Function

Minimizing or Maximizing a Multiparameter Function

Calculating Eigenvalues and Eigenvectors

Performing Principal Component Analysis

Performing Simple Orthogonal Regression

Finding Clusters in Your Data

Predicting a Binary-Valued Variable (Logistic Regression)

Bootstrapping a Statistic

Factor Analysis

Chapter 14 Time Series Analysis

Introduction

Representing Time Series Data

Plotting Time Series Data

Extracting the Oldest or Newest Observations

Subsetting a Time Series

Merging Several Time Series

Filling or Padding a Time Series

Lagging a Time Series

Computing Successive Differences

Performing Calculations on Time Series

Computing a Moving Average

Applying a Function by Calendar Period

Applying a Rolling Function

Plotting the Autocorrelation Function

Testing a Time Series for Autocorrelation

Plotting the Partial Autocorrelation Function

Finding Lagged Correlations Between Two Time Series

Detrending a Time Series

Fitting an ARIMA Model

Removing Insignificant ARIMA Coefficients

Running Diagnostics on an ARIMA Model

Making Forecasts from an ARIMA Model

Testing for Mean Reversion

Smoothing a Time Series

Colophon

b/r_cookbook.txt · Last modified: 2018/02/02 02:33 by hkimscil