Index
A
- ACID properties
- about / Understanding SQLite
- Ambari / Understanding Hadoop subprojects
- Apache Hadoop 1.0.3 / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- Apache HBase / Understanding Hadoop subprojects
- Apache Solr / Understanding Hadoop subprojects
- Apache Sqoop / Understanding Hadoop subprojects
- Apache Zookeeper / Understanding Hadoop subprojects
- Application Programming Interface (API) / Performing data operations
- architecture, HDFS / Understanding the HDFS architecture
- architecture, MapReduce / Understanding the MapReduce architecture
- architecture, RHadoop / Understanding the architecture of RHadoop
- architecture, RHIPE / Understanding the architecture of RHIPE
- artificial neural networks
- about / Clustering
B
- Bash command / Exploring HDFS data
- Big Data university
- Bulk Synchronous Parallel (BSP) / Understanding the basics of MapReduce
- business analytics
- MapReduce definitions, used / Understanding several possible MapReduce definitions to solve business problems
C
- CDH
- about / Installing Cloudera Hadoop on Ubuntu
- installing, on Ubuntu / Installing Cloudera Hadoop on Ubuntu
- installing, prerequisites / Installing Cloudera Hadoop on Ubuntu
- CentOS / Installing MongoDB
- classification technique / Performing data modeling in R
- client / Listing Hadoop MapReduce entities
- close function / Understanding basic R functions used in Hadoop MapReduce scripts
- Cloudera
- clustering
- about / Clustering
- performing, with R / Clustering with R
- performing, with RHadoop / Performing clustering with R and Hadoop
- clustering technique / Performing data modeling in R
- cmdenv option / Understanding the basics of Hadoop streaming
- combine function / Understanding how to code a MapReduce application
- Combiner function / Shuffling and sorting
- combiner option / Understanding the basics of Hadoop streaming
- command prompt
- Hadoop streaming job, executing from / Executing a Hadoop streaming job from the command prompt
- output, exploring from / Exploring an output from the command prompt
- community support, R
- increasing / Increasing community support
- Coursera
- URL, for machine learning / R + Hadoop help materials
- URL, for Data Science / R + Hadoop help materials
- CRAN
- about / Using R packages
- URL / Using R packages
D
- D&R analysis / Learning RHIPE
- Dashboard charts / Visualizing data
- data
- loading, into HDFS / Loading data into HDFS
- preprocessing / Preprocessing data
- importing, into R / Importing the data into R
- exporting, into R / Exporting the data from R
- data, Google Analytics
- extracting / Collecting data
- data analysis / Performing data operations
- data analytics
- performing / Performing analytics over data
- data analytics problems
- about / Understanding data analytics problems
- web page categorization, exploring / Exploring web pages categorization
- stock market change frequency, computing / Computing the frequency of stock market change
- case study / Predicting the sale price of blue book for bulldozers – case study
- data analytics problems, case study
- problem, identifying / Identifying the problem
- data requirement, designing / Designing data requirement
- data, preprocessing / Preprocessing data
- data analytics, performing / Performing analytics over data
- data analytics project life cycle
- about / Understanding the data analytics project life cycle
- problem, identifying / Identifying the problem
- data requirement, designing / Designing data requirement
- data, preprocessing / Preprocessing data
- data analytics, performing / Performing analytics over data
- data, visualizing / Visualizing data
- data attributes, Google Analytics / Understanding the required Google Analytics data attributes
- data cleaning / Performing data operations
- data exploration / Performing data operations
- data files
- about / Learning about data files as database
- R package, installing / Installing R packages
- data, importing into R / Importing the data into R
- data, exporting into R / Exporting the data from R
- data files, types
- data mining, techniques
- regression / Performing data modeling in R
- classification / Performing data modeling in R
- clustering / Performing data modeling in R
- recommendation / Performing data modeling in R
- data modeling
- about / Performing data modeling in R
- DataNode / Understanding HDFS components
- data operations
- performing / Performing data operations
- data processing operations
- data cleaning / Performing data operations
- data exploration / Performing data operations
- data analysis / Performing data operations
- data requirement
- designing / Designing data requirement
- data visualization
- about / Visualizing data
- dbSendQuery function / Understanding data manipulation
- Decisionstats
- URL / Popular R contributors
- deserialization / Understanding the different Java concepts used in Hadoop programming
- directory operation / The hdfs package
- dist.fun function / Performing clustering with R and Hadoop
- Distributed File System (DFS) / Understanding the basics of MapReduce
- Divide and Recombine technique / Introducing RHIPE
E
- Eclipse / Writing a Hadoop MapReduce example
- entities, Hadoop MapReduce
- listing / Listing Hadoop MapReduce entities
- environment variables
- setting up / Environment variables, Installing RHadoop
- Excel
- about / Understanding Excel
- installing / Installing Excel
- data, importing into R / Importing data into R
- data manipulation / Understanding data manipulation with R and Excel
- data, exporting to / Exporting the data to Excel
F
- file function / Understanding basic R functions used in Hadoop MapReduce scripts
- file manipulation / The hdfs package
- file option / Understanding the basics of Hadoop streaming
- file read/write / The hdfs package
- fitting, types
- under fitting / Fitting random forests with RHadoop
- normal fitting / Fitting random forests with RHadoop
- over fitting / Fitting random forests with RHadoop
- Flume / Loading data into HDFS
- Fourier Transformation / Identifying the problem
- from.dfs function / The rmr package
- full distributed mode / Understanding different Hadoop modes
G
- getwd command / HDFS
- ggplot2 package
- URL / Visualizing data
- about / Visualizing data
- glm model / Logistic regression
- Google Analytics / Exploring web pages categorization
- data attributes / Understanding the required Google Analytics data attributes
- data, extracting / Collecting data
- Google filesystem
- reference links / Understanding MapReduce
- Google MapReduce
- URL / Understanding MapReduce
- Gzip / Shuffling and sorting
H
- Hadoop
- installing / Installing Hadoop, Understanding Hadoop installation steps, Installing Hadoop
- modes / Understanding different Hadoop modes
- installing,over Ubuntu OS with pseudo mode / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- installing, on multinode cluster / Installing Hadoop on Linux, Ubuntu flavor (multinode cluster)
- features / Understanding Hadoop features
- Java concepts / Understanding the different Java concepts used in Hadoop programming
- Hadoop, features
- HDFS / Understanding HDFS
- MapReduce / Understanding MapReduce
- Hadoop Distributed File System (HDFS) / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- Hadoop installation / Installing RHadoop
- Hadoop MapReduce
- about / Introducing Hadoop MapReduce
- entities, listing / Listing Hadoop MapReduce entities
- data, loading into HDFS / Loading data into HDFS
- Map phase, executing / Executing the Map phase
- shuffling / Shuffling and sorting
- sorting / Shuffling and sorting
- phase execution, reducing / Reducing phase execution
- limitations / Understanding the limitations of MapReduce
- problem solving ability / Understanding Hadoop's ability to solve problems
- fundamentals / Understanding the Hadoop MapReduce fundamentals
- coding, in R / Learning the different ways to write Hadoop MapReduce in R
- Hadoop MapReduce, fundamentals
- MapReduce objects / Understanding MapReduce objects
- number of Maps, deciding / Deciding the number of Maps in MapReduce
- number of Reducers, deciding / Deciding the number of Reducers in MapReduce
- MapReduce dataflow / Understanding MapReduce dataflow
- Hadoop MapReduce terminologies / Taking a closer look at Hadoop MapReduce terminologies
- Hadoop MapReduce example
- coding / Writing a Hadoop MapReduce example
- Hadoop MapReduce job
- Hadoop MapReduce scripts
- R function, using / Understanding basic R functions used in Hadoop MapReduce scripts
- Hadoop MapReduce terminologies
- Hadoop MRv1 / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- Hadoop streaming
- about / Learning Hadoop streaming, Understanding the basics of Hadoop streaming
- running, with R / Understanding how to run Hadoop streaming with R
- MapReduce application / Understanding a MapReduce application
- MapReduce application, coding / Understanding how to code a MapReduce application
- MapReduce application, running / Understanding how to run a MapReduce application
- MapReduce application output, exploring / Understanding how to explore the output of MapReduce application
- R function, used in Hadoop MapReduce scripts / Understanding basic R functions used in Hadoop MapReduce scripts
- Hadoop MapReduce job, monitoring / Monitoring the Hadoop MapReduce job
- HadoopStreaming
- Hadoop streaming job
- executing, from command prompt / Executing a Hadoop streaming job from the command prompt
- executing, from R / Executing the Hadoop streaming job from R or an RStudio console
- executing, from RStudio console / Executing the Hadoop streaming job from R or an RStudio console
- running / Running a Hadoop streaming job
- executing / Executing the Hadoop streaming job
- HadoopStreaming R package
- exploring / Exploring the HadoopStreaming R package
- hsTableReader function / Understanding the hsTableReader function
- hsKeyValReader function / Understanding the hsKeyValReader function
- hsLineReader function / Understanding the hsLineReader function
- Hadoop streaming job, running / Running a Hadoop streaming job
- Hadoop subprojects / Understanding Hadoop subprojects
- HBase
- about / Understanding HBase
- features / Understanding HBase features
- installing / Installing HBase
- Thrift, installing / Installing thrift
- RHBase, installing / Installing RHBase
- data, importing into R / Importing the data into R
- data manipulation / Understanding data manipulation
- HDFS / Listing Hadoop MapReduce entities, Understanding the architecture of RHIPE, Preprocessing data
- about / Understanding HDFS, Introducing RHIPE, HDFS, Linear regression with R and Hadoop
- characteristics / Understanding the characteristics of HDFS
- architecture / Understanding the HDFS architecture
- components / Understanding HDFS components
- architecture, understanding by plot / Understanding the HDFS and MapReduce architecture by plot
- data, loading / Loading data into HDFS
- rhls command / HDFS
- getwd command / HDFS
- setwd command / HDFS
- rhput command / HDFS
- rhcp command / HDFS
- rhdel command / HDFS
- rhget command / HDFS
- rwrite command / HDFS
- HDFS, components
- NameNode / Understanding HDFS components
- DataNode / Understanding HDFS components
- Secondary NameNode / Understanding HDFS components
- hdfs.chmod function / The hdfs package
- hdfs.close function / The hdfs package
- hdfs.copy function / The hdfs package
- hdfs.defaults function / The hdfs package
- hdfs.delete function / The hdfs package
- hdfs.dircreate function / The hdfs package
- hdfs.file.info function / The hdfs package
- hdfs.file function / The hdfs package
- hdfs.init function / The hdfs package
- hdfs.ls function / The hdfs package
- hdfs.mkdir function / The hdfs package
- hdfs.move function / The hdfs package
- hdfs.put function / The hdfs package
- hdfs.read function / The hdfs package
- hdfs.rename function / The hdfs package
- hdfs.rm function / The hdfs package
- hdfs.write function / The hdfs package
- HDFS data
- exploring / Exploring HDFS data
- hdfs package
- initialization / The hdfs package
- file manipulation / The hdfs package
- file read/write / The hdfs package
- directory operation / The hdfs package
- Utility / The hdfs package
- Hive / Understanding Hadoop subprojects
- about / Understanding Hive
- features / Understanding features of Hive
- installing / Installing Hive
- configuration, setting up / Setting up Hive configurations
- RHive, installing / Installing RHive
- RHive operations / Understanding RHive operations
- Hortonworks
- hsKeyValReader function / Understanding the hsKeyValReader function
- hsLineReader function / Understanding the hsLineReader function
- hsTableReader function / Understanding the hsTableReader function
- Hypertext Transfer Protocol (HTTP) / Shuffling and sorting
I
- initialization
- about / Initialization
- rhinit command / Initialization
- inputformat option / Understanding the basics of Hadoop streaming
- input option / Understanding the basics of Hadoop streaming
- inputreader option / Understanding the basics of Hadoop streaming
- installation, CDH
- Ubuntu / Installing Cloudera Hadoop on Ubuntu
- installation, Excel / Installing Excel
- installation, Hadoop / Installing Hadoop, Understanding Hadoop installation steps, Installing Hadoop
- over Ubuntu OS / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- prerequisites / Installing Hadoop on Linux, Ubuntu flavor (single node cluster)
- on multinode cluster / Installing Hadoop on Linux, Ubuntu flavor (multinode cluster)
- installation, HBase / Installing HBase
- installation, Hive / Installing Hive
- installation, MongoDB / Installing MongoDB
- installation, MySQL
- on Linux / Installing MySQL
- installation, PostgreSQL / Installing PostgreSQL
- installation, protocol buffers / Installing protocol buffers
- installation, R / Installing R, Installing R
- installation, RHadoop / Installing RHadoop
- installation, RHBase / Installing RHBase
- installation, RHIPE / Installing RHIPE, Installing RHIPE
- installation, RHive / Installing RHive
- installation, rJava package / The rJava package installation
- installation, rmongodb / Installing rmongodb
- installation, RMySQL
- on Linux / Installing RMySQL
- installation, R package / Installing R packages
- installation, R packages / Installing RHadoop
- installation, RPostgreSQL / Installing RPostgreSQL
- installation, RSQLite / Installing RSQLite
- installation, RStudio / Installing RStudio
- installation, SQLite / Installing SQLite
- installation, Thrift / Installing thrift
- Integrated Development Environment (IDE) / Installing R
- item-based recommendations / Recommendation algorithms
J
- .jar file / Understanding the steps to run a MapReduce job
- jar option / Understanding the basics of Hadoop streaming
- Java collection / Understanding the different Java concepts used in Hadoop programming
- Java concepts
- Java concurrency / Understanding the different Java concepts used in Hadoop programming
- Java Development Kit (JDK) / Installing Cloudera Hadoop on Ubuntu
- Java generics / Understanding the different Java concepts used in Hadoop programming
- Java Virtual Machine (JVM) / Understanding different Hadoop modes, Understanding the different Java concepts used in Hadoop programming
- JobTracker / Understanding MapReduce components, Listing Hadoop MapReduce entities, Understanding the architecture of RHIPE
K
- k-means.map function / Performing clustering with R and Hadoop
- k-means.mr function / Performing clustering with R and Hadoop
- k-means.reduce function / Performing clustering with R and Hadoop
- k-means method / Clustering with R
- Kaggle / Visualizing data, Identifying the problem
- keyval function / The rmr package
L
- linear regression
- about / Linear regression
- performing, with R / Linear regression with R
- performing, with RHadoop / Linear regression with R and Hadoop
- lm() model / Linear regression with R and Hadoop
- logistic.regression MapReduce function / Logistic regression with R and Hadoop
- logistic regression
- about / Linear regression, Logistic regression
- performing, with R / Logistic regression with R
- performing, in RHadoop / Logistic regression with R and Hadoop
- lr.map Mapper function / Logistic regression with R and Hadoop
- lr.reducer Reducer function / Logistic regression with R and Hadoop
M
- 10 MapReduce Tips
- machine learning
- Wiki URL / Installing Hadoop
- about / Introduction to machine learning
- machine learning algorithms
- machine learning algorithms, types
- supervised machine-learning algorithms / Types of machine-learning algorithms
- unsupervised machine learning algorithms / Types of machine-learning algorithms, Unsupervised machine learning algorithm
- recommendation algorithms / Types of machine-learning algorithms, Recommendation algorithms
- Mahout / Understanding Hadoop subprojects
- main() method / Understanding MapReduce objects
- Map() function / Taking a closer look at Hadoop MapReduce terminologies
- mapdebug option / Understanding the basics of Hadoop streaming
- Mapper function / Understanding the hsTableReader function
- mapper option / Understanding the basics of Hadoop streaming
- Map phase / Understanding MapReduce
- executing / Executing the Map phase
- attributes / Taking a closer look at Hadoop MapReduce terminologies
- Map phase, attributes
- InputFiles / Taking a closer look at Hadoop MapReduce terminologies
- InputFormat / Taking a closer look at Hadoop MapReduce terminologies
- InputSplits / Taking a closer look at Hadoop MapReduce terminologies
- RecordReader / Taking a closer look at Hadoop MapReduce terminologies
- Mapper / Taking a closer look at Hadoop MapReduce terminologies
- MapReduce
- about / Understanding MapReduce
- Map phase / Understanding MapReduce
- Reduce phase / Understanding MapReduce
- architecture / Understanding the MapReduce architecture
- components / Understanding MapReduce components
- architecture, understanding by plot / Understanding the HDFS and MapReduce architecture by plot
- basics / Understanding the basics of MapReduce
- rhwatch command / MapReduce
- rhex command / MapReduce
- rhjoin command / MapReduce
- rhkill command / MapReduce
- rhoptions command / MapReduce
- rhstatus command / MapReduce
- MapReduce, components
- JobTracker / Understanding MapReduce components
- TaskTracker / Understanding MapReduce components
- MapReduce application
- MapReduce dataflow
- about / Understanding MapReduce dataflow
- MapReduce definitions
- used, for business analytics / Understanding several possible MapReduce definitions to solve business problems
- mapreduce function / The rmr package
- MapReduce job 1
- Mapper task status, tracking / Performing analytics over data
- Reducer task status, tracking / Performing analytics over data
- HDFS output location, tracking / Performing analytics over data
- MapReduce job1
- metadata, tracking / Performing analytics over data
- MapReduce job 2
- metadata, tracking / Performing analytics over data
- Reducer task status, tracking / Performing analytics over data
- Mapper task status, tracking / Performing analytics over data
- HDFS output location, tracking / Performing analytics over data
- MapReduce objects
- about / Understanding MapReduce objects
- Mapper / Understanding MapReduce objects
- Reducer / Understanding MapReduce objects
- Driver / Understanding MapReduce objects
- Maven / Writing a Hadoop MapReduce example
- Message Passing Interface (MPI) / Understanding the basics of MapReduce
- Michael Noll
- modes, Hadoop / Understanding different Hadoop modes
- standalone mode / Understanding different Hadoop modes
- pseudo mode / Understanding different Hadoop modes
- full distributed mode / Understanding different Hadoop modes
- MongoDB
- about / Understanding MongoDB
- features / Understanding MongoDB
- installing / Installing MongoDB
- SQL, mapping / Mapping SQL to MongoDB
- rmongodb, installing / Installing rmongodb
- data, importing into R / Importing the data into R
- data manipulation / Understanding data manipulation
- MongoQL
- SQL, mapping / Mapping SQL to MongoQL
- Myrrix
- MySQL / Understanding MySQL
- installing, on Linux / Installing MySQL
- RMySQL, installing / Installing RMySQL
- table, listing / Learning to list the tables and their structure
- table structure, listing / Learning to list the tables and their structure
- data, importing into R / Importing the data into R
- data manipulation / Understanding data manipulation
N
- NameNode / Understanding HDFS components
- normal fitting / Fitting random forests with RHadoop
- number of Maps
- deciding / Deciding the number of Maps in MapReduce
- number of Reducers
- deciding / Deciding the number of Reducers in MapReduce
- numReduceTasks option / Understanding the basics of Hadoop streaming
O
- output
- exploring, from command prompt / Exploring an output from the command prompt
- exploring, from R / Exploring an output from R or an RStudio console
- exploring, from RStudio console / Exploring an output from R or an RStudio console
- outputformat option / Understanding the basics of Hadoop streaming
- output option / Understanding the basics of Hadoop streaming
- over fitting / Fitting random forests with RHadoop
P
- parallel computing
- steps / Understanding MapReduce
- partitioner option / Understanding the basics of Hadoop streaming
- phase execution
- reducing / Reducing phase execution
- Pig / Understanding Hadoop subprojects
- Plain Old Java Objects (POJO) / Understanding the different Java concepts used in Hadoop programming
- Plots for facet scales / Visualizing data
- Poisson sampling
- about / Performing analytics over data, Understanding Poisson-approximation resampling, Fitting random forests with RHadoop
- random forest model, fitting / Fitting random forests with RHadoop
- PostgreSQL
- about / Understanding PostgreSQL
- features / Understanding features of PostgreSQL
- installing / Installing PostgreSQL
- RPostgreSQL, installing / Installing RPostgreSQL
- data, exporting from R / Exporting the data from R
- print() function / Understanding the RHIPE function reference
- print function / Understanding basic R functions used in Hadoop MapReduce scripts
- problem
- identifying / Identifying the problem
- protocol buffers
- installing / Installing protocol buffers
- pseudo mode / Understanding different Hadoop modes
Q
- quick check package / Introducing RHadoop
R
- R
- installing / Installing R, Installing R
- features / Understanding the features of R language
- data operations, performing / Performing data operations
- community support, increasing / Increasing community support
- data modeling / Performing data modeling in R
- Hadoop MapReduce, coding / Learning the different ways to write Hadoop MapReduce in R
- Hadoop streaming job, executing from / Executing the Hadoop streaming job from R or an RStudio console
- output, exploring from / Exploring an output from R or an RStudio console
- linear regression, performing / Linear regression with R
- logistic regression, performing / Logistic regression with R
- clustering, performing / Clustering with R
- recommendation algorithms, generating / Steps to generate recommendations in R
- R-Bloggers
- URL / Popular R contributors
- random access memory (RAM) / Installing Hadoop
- random forest model
- fitting, with RHadoop / Fitting random forests with RHadoop
- R bloggers
- R blogs / Increasing community support
- R books / Increasing community support
- rCharts package
- about / Visualizing data
- URL / Visualizing data
- RClient / Understanding the architecture of RHIPE
- RDataMining
- URL / Popular R contributors
- R documentation
- recommendation algorithms
- about / Types of machine-learning algorithms, Recommendation algorithms
- generating, in R / Steps to generate recommendations in R
- generating, in RHadoop / Generating recommendations with R and Hadoop
- recommendation algorithms, types
- user-based recommendations / Recommendation algorithms
- item-based recommendations / Recommendation algorithms
- recommendation technique / Performing data modeling in R
- Recommender() method / Performing data modeling in R
- Reduce() method / Taking a closer look at Hadoop MapReduce terminologies
- reducedebug option / Understanding the basics of Hadoop streaming
- Reduce phase / Understanding MapReduce
- Reducer method / Reducing phase execution
- reducer option / Understanding the basics of Hadoop streaming
- regression technique / Performing data modeling in R
- remote procedure calls / Understanding the different Java concepts used in Hadoop programming
- Revolution Analytics / Learning RHadoop, Linear regression with R and Hadoop, Recommendation algorithms
- R function
- used, in Hadoop MapReduce scripts / Understanding basic R functions used in Hadoop MapReduce scripts
- R groups / Increasing community support
- RHadoop
- about / Learning RHadoop, Introducing RHadoop
- rhdfs / Introducing RHadoop
- rmr / Introducing RHadoop
- rhbase / Introducing RHadoop
- quick check package / Introducing RHadoop
- architecture / Understanding the architecture of RHadoop
- installing / Installing RHadoop
- linear regression, performing / Linear regression with R and Hadoop
- URL / Linear regression with R and Hadoop
- logistic regression, performing / Logistic regression with R and Hadoop
- clustering, performing / Performing clustering with R and Hadoop
- recommendation algorithms, generating / Generating recommendations with R and Hadoop
- reference link / R + Hadoop help materials
- RHadoop example
- about / Understanding RHadoop examples
- word count, identifying / Word count
- RHadoop function
- rmr package / Understanding the RHadoop function reference, The rmr package
- hdfs package / Understanding the RHadoop function reference, The hdfs package
- RHadoop installation, prerequisites
- R installation / Installing RHadoop
- Hadoop installation / Installing RHadoop
- R packages, installing / Installing RHadoop
- environment variables, setting / Installing RHadoop
- RHBase
- pre-requisites / Understanding HBase features
- installing / Installing RHBase
- rhbase package
- about / Introducing RHadoop
- rhcp command / HDFS
- rhdel command / HDFS
- rhdfs package
- about / Introducing RHadoop
- rhex command / MapReduce
- rhget command / HDFS
- rhinit() method / RHIPE sample program (Map only)
- rhinit command / Initialization
- RHIPE
- about / Learning RHIPE, Introducing RHIPE
- goals / Introducing RHIPE
- installing / Installing RHIPE, Installing RHIPE
- architecture / Understanding the architecture of RHIPE
- reference link / R + Hadoop help materials
- RHIPE, components
- RClient / Understanding the architecture of RHIPE
- JobTracker / Understanding the architecture of RHIPE
- TaskTracker / Understanding the architecture of RHIPE
- HDFS / Understanding the architecture of RHIPE
- RHIPE function, category
- initialization / Understanding the RHIPE function reference
- HDFS / Understanding the RHIPE function reference
- MapReduce / Understanding the RHIPE function reference
- RHIPE installation, prerequisites
- Hadoop, installing / Installing Hadoop
- R, installing / Installing R
- protocol buffers, installing / Installing protocol buffers
- environment variables, setting up / Environment variables
- rJava package, installing / The rJava package installation
- RHIPE sample program
- about / RHIPE sample program (Map only)
- word count, identifying / Word count
- RHive
- prerequisites / Understanding features of Hive
- installing / Installing RHive
- RHive operations
- about / Understanding RHive operations
- rhjoin command / MapReduce
- rhkill command / MapReduce
- rhls command / HDFS
- rhoptions command / MapReduce
- rhput command / HDFS
- rhstatus command / MapReduce
- rhwatch() method / RHIPE sample program (Map only)
- rhwatch command / MapReduce
- R installation / Installing RHadoop
- rJava package
- installing / The rJava package installation
- R mailing list / Increasing community support
- rmongodb
- installing / Installing rmongodb
- rmr package
- about / Introducing RHadoop
- data store/retrieve / The rmr package
- MapReduce / The rmr package
- RMySQL
- installing, on Linux / Installing RMySQL
- R package
- rhdfs / Learning RHadoop
- rmr / Learning RHadoop
- rhbase / Learning RHadoop
- installing / Installing R packages
- R packages
- using / Using R packages
- about / Using R packages
- installing / Installing RHadoop
- RPostgreSQL
- installing / Installing RPostgreSQL
- RSQLite
- installing / Installing RSQLite
- data, importing into R / Importing the data into R
- data manipulation / Understanding data manipulation
- RStudio
- installing / Installing RStudio
- URL / Popular R contributors
- RStudio console
- Hadoop streaming job, executing from / Executing the Hadoop streaming job from R or an RStudio console
- output, exploring from / Exploring an output from R or an RStudio console
- rwrite command / HDFS
S
- search engine / Understanding several possible MapReduce definitions to solve business problems
- Secondary NameNode / Understanding HDFS components
- Secure Shell (SSH) / Installing Hadoop on Linux, Ubuntu flavor (multinode cluster)
- serialization / Understanding the different Java concepts used in Hadoop programming
- setwd command / HDFS
- sink function / Understanding basic R functions used in Hadoop MapReduce scripts
- sort function / Understanding the hsTableReader function
- SQL
- mapping, to MongoDB / Mapping SQL to MongoDB
- mapping, to MongoQL / Mapping SQL to MongoQL
- SQLite
- about / Understanding SQLite
- features / Understanding features of SQLite
- installing / Installing SQLite
- RSQLite, installing / Installing RSQLite
- Sqoop / Loading data into HDFS
- stack overflow / Increasing community support
- standalone mode / Understanding different Hadoop modes
- stdin function / Understanding basic R functions used in Hadoop MapReduce scripts
- stdout function / Understanding basic R functions used in Hadoop MapReduce scripts
- stock market analysis / Understanding several possible MapReduce definitions to solve business problems
- stock market change
- frequency, computing / Computing the frequency of stock market change
- problem, identifying / Identifying the problem
- data requirement, designing / Designing data requirement
- data, preprocessing / Preprocessing data
- data analytics, performing / Performing analytics over data
- data, visualizing / Visualizing data
- Sum() function / Linear regression with R and Hadoop
- summary command
- parameters / Linear regression with R
- supervised machine learning algorithms
- about / Types of machine-learning algorithms
- linear regression / Supervised machine-learning algorithms, Linear regression
- logistic regression / Linear regression, Logistic regression
- system() method / Performing analytics over data
- system command / Executing the Hadoop streaming job from R or an RStudio console
- system method / Exploring an output from R or an RStudio console
T
- TaskTracker / Understanding MapReduce components, Listing Hadoop MapReduce entities, Understanding the architecture of RHIPE
- Thrift
- installing / Installing thrift
- to.dfs function / The rmr package
- transactions
- ACID properties / Understanding SQLite
U
- Ubuntu 12.04 / Installing MongoDB
- under fitting / Fitting random forests with RHadoop
- unsupervised machine learning algorithms
- about / Types of machine-learning algorithms, Unsupervised machine learning algorithm
- clustering / Clustering
- artificial neural networks / Clustering
- vector quantization / Clustering
- user-based recommendations / Recommendation algorithms
- Utility / The hdfs package
V
- vector quantization
- about / Clustering
- verbose option / Understanding the basics of Hadoop streaming
W
- web page categorization
- exploring / Exploring web pages categorization
- problem, identifying / Identifying the problem
- data requirement, designing / Designing data requirement
- data, preprocessing / Preprocessing data
- data analytics, performing / Performing analytics over data
- data, visualizing / Visualizing data
- web server log processing / Understanding several possible MapReduce definitions to solve business problems
- website statistics / Understanding several possible MapReduce definitions to solve business problems
- write function / Understanding basic R functions used in Hadoop MapReduce scripts
X
- xlsx packages
- prerequisites / Installing Excel
Y
- Yahoo!