Book Image

Python Data Analysis

By : Ivan Idris
Book Image

Python Data Analysis

By: Ivan Idris

Overview of this book

Table of Contents (22 chapters)
Python Data Analysis
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Key Concepts
Online Resources
Index

Interfacing with R


R provides a datasets package that contains sample datasets. The morley dataset has data from measurements of the speed of light made in 1879. The speed of light is a fundamental physical constant and its value is currently known very precisely. The data is described at http://stat.ethz.ch/R-manual/R-devel/library/datasets/html/morley.html. The speed of light value can be found in the scipy.constants module. The R data is stored in an R dataframe with three columns:

  • The experiment number from one to five

  • The run number with twenty runs per experiment, bringing the total measurements to 100

  • The measured speed of light in kilometers per second with 299,000 subtracted

The rpy2.robjects.r() function executes R code in a Python environment. Load the data as follows:

ro.r('data(morley)')

The pandas library provides an R interface via the pandas.rpy.common module. Load the data into a pandas DataFrame as follows:

df = com.load_data('morley')

Let's group the data by experiment with the...