Book Image

Hands-On Data Science with Anaconda

By : Yuxing Yan, James Yan
Book Image

Hands-On Data Science with Anaconda

By: Yuxing Yan, James Yan

Overview of this book

Anaconda is an open source platform that brings together the best tools for data science professionals with more than 100 popular packages supporting Python, Scala, and R languages. Hands-On Data Science with Anaconda gets you started with Anaconda and demonstrates how you can use it to perform data science operations in the real world. The book begins with setting up the environment for Anaconda platform in order to make it accessible for tools and frameworks such as Jupyter, pandas, matplotlib, Python, R, Julia, and more. You’ll walk through package manager Conda, through which you can automatically manage all packages including cross-language dependencies, and work across Linux, macOS, and Windows. You’ll explore all the essentials of data science and linear algebra to perform data science tasks using packages such as SciPy, contrastive, scikit-learn, Rattle, and Rmixmod. Once you’re accustomed to all this, you’ll start with operations in data science such as cleaning, sorting, and data classification. You’ll move on to learning how to perform tasks such as clustering, regression, prediction, and building machine learning models and optimizing them. In addition to this, you’ll learn how to visualize data using the packages available for Julia, Python, and R.
Table of Contents (15 chapters)

Review questions and exercises

  1. Why is understanding various packages important?
  2. What are package dependencies?
  3. For R, Python, Julia, and Octave, find out how many packages are available for each of them, today.
  4. How do we install a package in R, Python, and Julia?
  5. How do we update a package in R, Python, and Julia?
  6. What is the task view for R?
  7. How do we install all R packages included in a task view?
  8. After an R package is installed, how do you find its related directory? What is the usage to find its related directory? You could use the R package called healthcare as an example. Note that the package is about tools for healthcare machine learning.
  9. Find out more details about the task views related to the subject of Econometrics. Then install all related R packages. How many are there?
  10. How do we update one R package? How would we do it for Octave?
  11. How do we find the manual for...