Mastering Scientific Computing with R

Book Image

Mastering Scientific Computing with R

Book Image

Mastering Scientific Computing with R

Overview of this book

Mastering Scientific Computing with R

Mastering Scientific Computing with R

Credits

About the Authors

About the Authors

About the Reviewers

About the Reviewers

www.PacktPub.com

www.PacktPub.com

Preface

Free Chapter

Programming with R

Programming with R

Data structures in R

Loading data into R

Basic plots and the ggplot2 package

General programming and debugging tools

Statistical Methods with R

Statistical Methods with R

Descriptive statistics

Probability distributions

Fitting distributions

Hypothesis testing

Linear Models

An overview of statistical modeling

Linear regression

Analysis of variance

Generalized linear models

Generalized additive models

Linear discriminant analysis

Principal component analysis

Nonlinear Methods

Nonlinear Methods

Nonparametric and parametric models

The adsorption and body measures datasets

Theory-driven nonlinear regression

Visually exploring nonlinear relationships

Extending the linear framework

Nonparametric nonlinear methods

Nonparametric methods with the np package

Linear Algebra

Matrices and linear algebra

The physical functioning dataset

Basic matrix operations

Triangular matrices

Matrix decomposition

Principal Component Analysis and the Common Factor Model

Principal Component Analysis and the Common Factor Model

A primer on correlation and covariance structures

Datasets used in this chapter

Principal component analysis and total variance

Formative constructs using PCA

Exploratory factor analysis and reflective constructs

Structural Equation Modeling and Confirmatory Factor Analysis

Structural Equation Modeling and Confirmatory Factor Analysis

The basic ideas of SEM

Matrix representation of SEM

SEM model fitting and estimation methods

Comparing OpenMx to lavaan

Simulations

Basic sample simulations in R

Pseudorandom numbers

Monte Carlo simulations

Monte Carlo integration

Rejection sampling

Importance sampling

Simulating physical systems

Optimization

One-dimensional optimization

Linear programming

Quadratic programming

General non-linear optimization

Other optimization packages

Advanced Data Management

Advanced Data Management

Cleaning datasets in R

String processing and pattern matching

Floating point operations and numerical data types

Memory management in R

The Amelia package

The mice package

Index

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Principal component analysis

Principal component analysis (PCA) is another exploratory method you can use to separate your samples into groups. PCA converts a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components. PCA is widely used as a dimension reduction technique to help visualize data. PCA is different from LDA because it doesn't rely on class information to explore the relationship between the variable values and the sample group numbers. For example, let's perform a PCA to explore our simulated fish.data dataset. Before performing PCA, it is important to remember that the magnitude of the variables and any skews in the data will influence the resulting principal components. So, we need to scale and transform our data.

First, we recommend you to log transform the data (if necessary). Then, run PCA using the prcomp() function as follows:

> fish.data.mx <- as.matrix(fish.data[, 1:3])
> fish.data...