Index
A
- acid-code-line / How it works…
- acid-code function / How it works…
- adapters / How it works…
- agents
- program complexity, managing with / Managing program complexity with agents, Getting ready, How to do it…, How it works…
- and STM, combining / Combining agents and STM, How to do it…, How it works…
- errors, recovering / Recovering from errors in agents, How to do it…
- aggregate operators
- creating / Creating aggregate operators
- Algorithms to calculate variance
- alter function / Getting better performance with commute
- Ant
- URL / Getting ready
- Apache Ant
- URL / Getting ready
- Apache HDFS
- data, distributing with / Distributing data with Apache HDFS, Getting ready, How to do it…, How it works…
- Apriori algorithm
- associations finding, in data with / Finding associations in data with the Apriori algorithm, How to do it…, How it works…
- URL / There's more…
- ARFF file
- loading, into Weka / Loading CSV and ARFF files into Weka, How to do it…, There's more…
- associations
- finding in data, with Apriori algorithm / Finding associations in data with the Apriori algorithm, How to do it…, How it works…
- atom / Introducing safe side effects into the STM
- Attribute-Relation File Format (ARFF) / Loading CSV and ARFF files into Weka
- Avro
- URL / There's more
B
- bar charts
- non-numeric data, graphing in / Graphing non-numeric data in bar charts, Getting ready, How it works...
- creating, with NVD3 / Creating bar charts with NVD3, How to do it…, How it works…
- Bayesian
- Bayesian modeling and classifiers
- URL / There's more…, There's more…
- Benford's law
- data errors, finding with / Finding data errors with Benford's law, How to do it…, There's more…
- URL / There's more…
- binding command / How it works…
- bootstrapping
- sample statistics, validating with / Validating sample statistics with bootstrapping, Getting ready, How it works…
- about / Validating sample statistics with bootstrapping
- URL / There's more…
- Brown corpus
- URL / Getting ready
- buffer operators
- creating / Creating buffer operators
- built-in operations
- URL / There's more
C
- C2
- URL / There's more…
- Cascading
- URL / Introduction
- Cascalog
- URL / Introduction, Aggregating data with Cascalog
- initializing, for distributed processing / Initializing Cascalog and Hadoop for distributed processing, How to do it…, How it works…
- data, querying with / Querying data with Cascalog, Getting ready, How it works…
- CSV files, parsing with / Parsing CSV files with Cascalog, How to do it…, How it works…
- complex queries, executing with / Executing complex queries with Cascalog, How to do it…
- data, aggregating with / Aggregating data with Cascalog, How to do it…
- data, transforming with / Transforming data with Cascalog, How it works…
- Cascalog operators
- defining / Defining new Cascalog operators, How to do it…
- map operators, creating / Creating map operators
- map concatenation operators, creating / Creating map concatenation operators
- filter operators, creating / Creating filter operators
- buffer operators, creating / Creating buffer operators
- aggregate operators, creating / Creating aggregate operators
- parallel aggregate operators, creating / Creating parallel aggregate operators
- charts
- customizing, with JFreeChart / Customizing charts with JFreeChart, How to do it..., How it works...
- Clojars
- URL / How it works...
- Clojuratica
- URL / How to do it…, Getting ready
- Mathematica functions, calling from / Calling Mathematica functions from Clojuratica, How it works…
- matrixes, sending to Mathematica from / Sending matrixes to Mathematica from Clojuratica, How it works…
- Mathematica scripts, evaluating from / Evaluating Mathematica scripts from Clojuratica, How to do it…
- Clojure
- about / Introduction
- URL / Using type hints, Getting ready, See also…
- setting up / Setting up Clojure
- R functions, calling from / Calling R functions from Clojure
- R files, evaluating from / Evaluating R files from Clojure, How to do it…, How it works…
- R, plotting from / Plotting in R from Clojure, How to do it…, There's more…
- clojure.java.jdbc library
- URL / See also
- clojure.tools.trace
- URL / There's more...
- Clojure data structures
- loading, into datasets / Loading Clojure data structures into datasets, How to do it…, See also…
- Clojure documentation
- URL / See also
- ClojureScript
- URL / Introduction, Setting up to use ClojureScript
- setting up / Setting up to use ClojureScript, How to do it…, How it works…
- Clojure wrapper
- URL / Tokenizing text
- Cloudera
- URL / Getting ready
- clustering
- with K-means / Clustering with K-Means
- Colt library
- columns
- selecting, with $ / Selecting columns with $, How to do it…, How it works…
- deleting, in Weka / Filtering, renaming, and deleting columns in Weka datasets
- renaming, in Weka / Filtering, renaming, and deleting columns in Weka datasets
- filtering, in Weka / Filtering, renaming, and deleting columns in Weka datasets
- renaming / Renaming columns
- removing / Removing columns
- hiding / Hiding columns, How it works…
- comma-separated values (CSV) / Introduction
- commute
- better performance, obtaining with / Getting better performance with commute, How to do it…
- complex queries
- executing, with Cascalog / Executing complex queries with Cascalog, How to do it…
- Compojure
- URL / Introduction, Serving data with Ring and Compojure
- about / Introduction
- data, serving with / Serving data with Ring and Compojure, How to do it…
- Compojure API documentation
- URL / There's more…
- compute-report function / How it works…
- concurrent programming
- about / Introduction
- concurrent programs
- debugging, with watchers / Debugging concurrent programs with watchers, How to do it…
- consistency
- maintaining, with synonym maps / Maintaining consistency with synonym maps, How it works…
- maintaining, with ensure / Maintaining consistency with ensure, How to do it…
- content words
- focusing on, with stoplists / Focusing on content words with stoplists, Getting ready
- correct function / How it works…
- Coursera
- URL / Introduction
- create-chart function / How it works…
- Criterium
- URL / Getting ready
- benchmarking with / Benchmarking with Criterium, How to do it…
- Criterium library
- CSV data
- reading, into Incanter datasets / Reading CSV data into Incanter datasets, Getting ready, How it works…
- CSV file
- loading, into Weka / Loading CSV and ARFF files into Weka, How to do it…, There's more…
- CSV files
- parsing, with Cascalog / Parsing CSV files with Cascalog, How to do it…, How it works…
- currencies
- URL / Getting ready
- currency data
- custom data formats
- parsing / Parsing custom data formats, How to do it…
- custom error handler
- using / Using a custom error handler
D
- D3
- URL / Introduction, There's more…
- time series charts, creating with / Creating time series charts with D3, How to do it…, How it works…, There's more…
- interactive visualizations, creating with / Creating interactive visualizations with D3, How to do it…, How it works…, There's more…
- D3 JavaScript library
- data
- reading, from Excel with Incanter / Reading data from Excel with Incanter, How it works…
- reading, from JDBC databases / Reading data from JDBC databases, How to do it…, How it works…
- scraping, from tables in web pages / Scraping data from tables in web pages, Getting ready, How to do it…, How it works…
- aggregating, from different formats / Aggregating data from different formats, How to do it…
- triple store, creating / Creating the triple store
- exchange rates, scraping / Scraping exchange rates
- currency data, loading / Loading currency data and tying it all together, See also
- cleaning, with regular expressions / Cleaning data with regular expressions, How to do it…, How it works…
- validating, with Valip / Validating data with Valip, How to do it…
- chunking, for pmap / Chunking data for pmap
- querying, with Cascalog / Querying data with Cascalog, How to do it…, How it works…
- distributing, with Apache HDFS / Distributing data with Apache HDFS, Getting ready, How to do it…, How it works…
- aggregating, with Cascalog / Aggregating data with Cascalog, How to do it…, There's more
- transforming, with Cascalog / Transforming data with Cascalog, How it works…
- grouping, with $group-by / Grouping data with $group-by, How to do it…
- saving, as JSON / Saving data as JSON, How it works…
- classifying, with decision trees / Classifying data with decision trees, How to do it…, How it works…
- classifying, with Naive Bayesian classifier / Classifying data with the Naive Bayesian classifier, How to do it…, How it works…
- classifying, with SVMs / Classifying data with support vector machines, Getting ready, How to do it…
- serving, with Ring / Serving data with Ring and Compojure, How to do it…
- serving, with Compojure / Serving data with Ring and Compojure, How to do it…
- serving / Serving data
- data consistency
- maintaining, with validators / Maintaining data consistency with validators, How to do it…, How it works…
- data errors
- finding, with Benford's law / Finding data errors with Benford's law, Getting ready, How to do it…, There's more…
- dataset constructor / How it works…
- datasets
- Clojure data structures, loading into / Loading Clojure data structures into datasets, How to do it…, See also…
- viewing, with view / Viewing datasets interactively with view, How it works…
- converting, to matrices / Converting datasets to matrices, Getting ready, There's more…
- filtering, with $where / Filtering datasets with $where, How it works…
- saving, to CSV / Saving datasets to CSV and JSON, How it works…
- saving, to JSON / Saving datasets to CSV and JSON, How it works…
- URL / How to do it…
- dates
- parsing / Parsing dates and times, How to do it…, There's more…
- DBPedia
- decision trees
- data, classifying with / Classifying data with decision trees, How to do it…, How it works…
- delete-char function / How it works…
- Dirichlet distribution
- distributed processing
- Cascalog, initializing for / Initializing Cascalog and Hadoop for distributed processing, How to do it…, How it works…
- Hadoop, initializing for / How to do it…, How it works…
- document frequencies
- obtaining / Getting document frequencies
- scaling, by document size / Scaling document frequencies by document size, How to do it…, How it works…
- scaling, with TF-IDF / Scaling document frequencies with TF-IDF, How to do it…, How it works…
- documents
- mapping, to vector space representation / Mapping documents to a sparse vector space representation, How to do it…
- document size
- document frequencies, scaling by / Scaling document frequencies by document size, How to do it…, How it works…
- domain specific language (DSL)
- duplicate data
- dynamic charts
- creating, with Incanter / How to do it..., How it works...
E
- edit-distance parameter / How it works…
- embedded domain-specific language (EDSL) / Querying RDF data with SPARQL
- Enlive
- ensure
- consistency, maintaining with / Maintaining consistency with ensure, How to do it…
- ensure function / Maintaining consistency with ensure
- equations
- adding, to Incanter charts / Adding equations to Incanter charts, There's more...
- error handler / How to do it…
- error mode / How to do it…
- errors
- recovering, in agents / Recovering from errors in agents, How to do it…
- failing on / Failing on errors
- continuing on / Continuing on errors
- custom error handler, using / Using a custom error handler
- EuroClojure 2012
- URL / There's more...
- Excel, with Incanter
- data, reading from / Reading data from Excel with Incanter, How it works…
- exchange rates
- scraping / Scraping exchange rates
F
- <| function / How it works…
- FASTA data
- FASTA format
- filter operators
- creating / Creating filter operators
- force-directed layouts
- graphs, visualizing with / Visualizing graphs with force-directed layouts, Getting ready, How to do it…, How it works…
- function calls
- combining, with reducers / Combining function calls with reducers, How to do it…, There's more...
- function plots
- creating, with Incanter / Creating function plots with Incanter, How it works...
- functions
- creating, from Mathematica / Creating functions from Mathematica, How to do it…
- function words
- fuzzy-dist
- about / How it works…
G
- $group-by
- data, grouping with / Grouping data with $group-by, How to do it…
- get-corpus-terms function / How to do it…
- get-dataset function / How it works…
- URL / There's more...
- get-idf-cache function / How to do it…
- Google Closure library
- URL / There's more…
- Google Finance
- graphs
- visualizing, with force-directed layouts / Visualizing graphs with force-directed layouts, Getting ready, How to do it…, How it works…
- Graphviz
- URL / Getting ready
- groups of data
- discovering, K-Means clustering used / Discovering groups of data using K-Means clustering, How to do it…
H
- Hadoop
- URL / Introduction
- initializing, for distributed processing / Initializing Cascalog and Hadoop for distributed processing, How to do it…, How it works…
- hadoop command / How it works…
- Hadoop Distributed File System (HDFS) / Initializing Cascalog and Hadoop for distributed processing
- handlers
- defining / Defining routes and handlers
- Harvard
- URL / Introduction
- Heroku
- URL / Introduction
- Hiccup
- URL / Introduction, There's more…
- HTML, creating with / Creating HTML with Hiccup, How to do it…, How it works…
- hierarchical clustering
- URL / There's more…
- hierarchical clusters
- finding, in Weka / Finding hierarchical clusters in Weka, How to do it…, How it works…
- histograms
- creating, with Incanter / Creating histograms with Incanter, How it works...
- creating, with NVD3 / Creating histograms with NVD3, How to do it…, How it works…
- HTML
- creating, with Hiccup / Creating HTML with Hiccup, How to do it…, How it works…
I
- ID3 algorithm
- URL / There's more…
- Incanter
- URL / Introduction, Introduction, There's more...
- processing, parallelizing with / Parallelizing processing with Incanter
- infix formulas, using / Using infix formulas in Incanter, How to do it…, How it works…
- SOMs, clustering with / Clustering with SOMs in Incanter, How to do it…
- scatter plots, creating with / Creating scatter plots with Incanter, How to do it..., How it works...
- histograms, creating with / Creating histograms with Incanter, How it works...
- function plots, creating with / Creating function plots with Incanter, How it works...
- dynamic charts, creating with / How it works...
- incanter.core/query-dataset
- URL / There's more…
- Incanter charts
- equations, adding to / Adding equations to Incanter charts
- Incanter datasets
- CSV data, reading into / Reading CSV data into Incanter datasets, Getting ready, How it works…
- JSON data, reading into / Reading JSON data into Incanter datasets, Getting ready, How to do it…
- XML data, reading into / Reading XML data into Incanter datasets, Getting ready, How to do it…, There's more…
- Incanter documentation
- URL / How it works…, There's more…
- Incanter Zoo
- time series data, working with / Working with time series data with Incanter Zoo, Getting ready, There's more...
- infix formulas
- used, in Incanter / Using infix formulas in Incanter, How to do it…, How it works…
- Infochimps
- URL / Getting ready, Getting ready
- insert-split function / How it works…
- interactive visualizations
- creating, with D3 / Creating interactive visualizations with D3, How to do it…, How it works…, There's more…
- ionosphere dataset
- URL / There's more…
J
- Java data types, R
- URL / There's more…
- Java Development Kit
- URL / Getting ready
- JavaDocs
- URL / There's more
- JavaDocs, for Pattern class
- URL / There's more...
- JavaScript Object Notation (JSON)
- Java tutorial, on regular expressions
- URL / There's more...
- JDBC
- JDBC databases
- data, reading from / Reading data from JDBC databases, How to do it…, How it works…
- Jetty
- JFreeChart
- charts, customizing with / Customizing charts with JFreeChart, How to do it..., How it works...
- Joda Java library
- URL / Parsing dates and times
- JSON
- and XML, comparing / Comparing XML and JSON
- data, saving / Saving data as JSON, How it works…
- JSON data
- reading, into Incanter datasets / Reading JSON data into Incanter datasets, Getting ready, How to do it…
K
- K-Means clustering
- used, for discovering groups of data / Discovering groups of data using K-Means clustering, How to do it…
- results, analyzing / Analyzing the results
- macros, building / Building macros
- K-means clustering
- kr Clojure library
- URL / Reading RDF data
- kr library
- URL / How to do it…
L
- large data sets
- processing / Lazily processing very large data sets, How to do it…, How it works…
- sampling / Sampling from very large data sets
- sampling, by percentage / Sampling by percentage
- sampling, for exact count / Sampling exactly, How it works…
- large inputs
- managing, with sized queues / Managing large inputs with sized queues, How it works...
- least squares linear regression
- about / How it works…
- Leiningen
- lein new command / How to do it...
- LibSVM class
- URL / There's more…
- linear regression
- about / Modeling linear relationships
- linear relationships
- modeling / Modeling linear relationships, How to do it…, How it works…
- lines
- adding, to scatter charts / Adding lines to scatter charts, How it works...
- Linux
- Mathematica, setting up for / Setting up Mathematica to talk to Clojuratica for Mac OS X and Linux, How to do it…, There's more…
- Lisp (List Processing) / How it works…
- load-stopwords function / How to do it…
- load-table-data function / How it works…
- load-xml-data
- parameters / How it works…
M
- Mac OS X
- Mathematica, setting up for / Setting up Mathematica to talk to Clojuratica for Mac OS X and Linux, How to do it…, There's more…
- macros
- building / Building macros, See also…
- main function / How it works…
- MALLET
- topic modeling, performing with / Performing topic modeling with MALLET, How to do it…, See also…
- URL / Performing topic modeling with MALLET
- naïve Bayesian classification, performing with / Performing naïve Bayesian classification with MALLET, Getting ready, How to do it…, See also…
- map concatenation operators
- creating / Creating map concatenation operators
- map operators
- creating / Creating map operators
- MapReduce algorithm
- about / Introduction
- URL / Introduction
- Mathematica
- URL / Introduction, Getting ready, There's more…, Getting ready
- setting up, for Mac OS X / Setting up Mathematica to talk to Clojuratica for Mac OS X and Linux, How to do it…, There's more…
- setting up, for Linux / Setting up Mathematica to talk to Clojuratica for Mac OS X and Linux, How to do it…, There's more…
- setting up, for Windows / Setting up Mathematica to talk to Clojuratica for Windows, How to do it..., How it works...
- functions, creating from / Creating functions from Mathematica, How to do it…
- Mathematica functions
- calling, from Clojuratica / Calling Mathematica functions from Clojuratica, How it works…
- Mathematica scripts
- evaluating, from Clojuratica / Evaluating Mathematica scripts from Clojuratica, How to do it…
- Mathematica StackExchange
- URL / How it works…
- matrices
- datasets, converting to / Converting datasets to matrices, Getting ready, There's more…
- matrixes
- sending to Mathematica, from Clojuratica / Sending matrixes to Mathematica from Clojuratica, How it works…
- Maven
- metaheuristic
- about / There's more…
- middleware / How it works…
- Monte Carlo methods
- Monte Carlo simulations
- partitioning / Partitioning Monte Carlo simulations for better pmap performance, How to do it…, How it works…
- estimating with / Estimating with Monte Carlo simulations
- data, chunking for pmap / Chunking data for pmap
- multinomial Bayesian distributions
- multiple datasets, with $join
- projecting from / Projecting from multiple datasets with $join, How to do it…
- mushroom dataset
- URL / There's more…
N
- NaiveBayes class
- URL / There's more…
- Naive Bayesian classifier
- data, classifying with / Classifying data with the Naive Bayesian classifier, How to do it…, How it works…
- natural language processing (NLP)
- about / Tokenizing text
- Natural Language Toolkit
- URL / Getting ready
- naïve Bayesian classification
- performing, with MALLET / Performing naïve Bayesian classification with MALLET, Getting ready, How to do it…, See also…
- NER
- people, finding with / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- places, finding with / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- things, finding with / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- non-linear relationships
- non-numeric data
- graphing, in bar charts / Graphing non-numeric data in bar charts, Getting ready, How it works...
- normalize function / How to do it…
- numbers
- regularizing / Regularizing numbers, How it works…
- NVD3
- URL / Introduction, There's more…
- scatter plots, creating with / Creating scatter plots with NVD3, How to do it…, How it works…
- bar charts, creating with / Creating bar charts with NVD3, How to do it…, How it works…
- histograms, creating with / Creating histograms with NVD3, How to do it…, How it works…
- NVD3 library
- NVD3 style sheet
- URL / How to do it…
O
- online summary statistics
- generating, for data streams with reducers / Generating online summary statistics for data streams with reducers, How to do it…
- online tester, RegexPlanet
- URL / There's more...
- OpenNLP
- URL / Tokenizing text
- Open Secrets
- URL / Getting ready
- optimal partition size
- finding, with simulated annealing / Finding the optimal partition size with simulated annealing, Getting ready, How to do it…, There's more…
- output-points function / How to do it…
P
- parallel aggregate operators
- creating / Creating parallel aggregate operators
- composing / Composing Cascalog queries, Getting ready, How to do it…, How it works…
- Parallel Colt Java library
- parallelism
- about / Introduction
- parallelization
- about / Introduction
- parallel programming
- about / Introduction
- parse-ez library
- people
- finding, with NER / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- Pig / Initializing Cascalog and Hadoop for distributed processing
- pipeline processing
- about / Processing in a pipeline
- places
- finding, with NER / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- pmap
- processing, parallelizing with / Parallelizing processing with pmap, How to do it…, How it works…
- POI Factory
- URL / Getting ready
- Political Action Committee (PAC) / Getting ready
- Pretrained models
- URL / Getting ready
- processing
- monitoring, with watchers / Monitoring processing with watchers, How to do it…, How it works…
- parallelizing, with pmap / Parallelizing processing with pmap, How to do it…
- parallelizing, with Incanter / Parallelizing processing with Incanter, How to do it…
- program complexity
- managing, with STM / Managing program complexity with STM, Getting ready, How to do it…, How it works…, See also
- managing, with agents / Managing program complexity with agents, Getting ready, How to do it…, How it works…
- project
- creating / Creating a new project, How it works...
- Project Gutenberg
- URL / Getting ready
Q
- qr
- URL / There's more…
R
- $rollup
- summary statistics, generating with / Generating summary statistics with $rollup, How it works…
- $rollup function / How it works…
- URL / How it works…
- R
- URL / Introduction, Introduction, Getting ready
- setting up / Setting up R to talk to Clojure, Setting up R
- vectors, passing into / Passing vectors into R, How it works…
- plotting, from Clojure / Plotting in R from Clojure, How to do it…, There's more…
- RDF data
- reading / Reading RDF data, How to do it…, How it works…
- querying, with SPARQL / Querying RDF data with SPARQL, Getting ready, How to do it…, How it works…
- Reduce operation
- about / Introduction
- reducers
- function calls, combining with / Combining function calls with reducers, How to do it…, There's more...
- parallelizing with / Parallelizing with reducers, How to do it…, How it works…
- online summary statistics, generating for data streams with / Generating online summary statistics for data streams with reducers, How to do it…
- regular expression
- (?x) / How it works…
- (\d{3}) / How it works…
- \D{0,2} / How it works…
- \D? / How it works…
- (\d{4}) / How it works…
- regular expressions
- data, cleaning with / Cleaning data with regular expressions, How to do it…, How it works…
- relative values
- calculating / Calculating relative values, How to do it…, How it works…
- REPL
- about / Creating a new project
- replace-split function / How it works…
- Resource Description Framework (RDF) / How it works…
- results
- analyzing / Analyzing the results
- R files
- evaluating, from Clojure / Evaluating R files from Clojure, How to do it…, How it works…
- R functions
- calling, from Clojure / Calling R functions from Clojure
- R gallery
- URL / There's more…
- Ring
- URL / Introduction, Serving data with Ring and Compojure, How it works…
- about / Introduction
- data, serving with / Serving data with Ring and Compojure, How to do it…
- handlers / How it works…
- middleware / How it works…
- adapters / How it works…
- Ring's API documentation
- URL / There's more…
- routes
- defining / Defining routes and handlers
- rows
- selecting, with $ / Selecting rows with $, How it works…
- R project
- URL / Introduction
- Rserve
- URL / Setting up R
S
- sample datasets, Incanter
- loading / Loading Incanter's sample datasets, How it works…
- sample statistics
- validating, with bootstrapping / Validating sample statistics with bootstrapping, Getting ready, How it works…
- scatter charts
- lines, adding to / Adding lines to scatter charts, Getting ready, How it works...
- scatter plots
- creating, with Incanter / Creating scatter plots with Incanter, How to do it..., How it works...
- creating, with NVD3 / Creating scatter plots with NVD3, How to do it…, How it works…
- Screen
- URL / There's more…
- screen
- URL / How it works…
- self-organizing maps
- URL / There's more…
- sentences
- finding / Finding sentences, How it works…
- seque function / How it works...
- Sequence files
- URL / There's more
- server
- running / Running the server, How it works…, There's more…
- Sesame
- URL / Reading RDF data
- Sesame data store
- URL / Getting ready
- SimpleKMeans class
- URL / See also…
- simulated annealing
- optimal partition size, finding with / Finding the optimal partition size with simulated annealing, Getting ready, How to do it…, There's more…
- about / Finding the optimal partition size with simulated annealing
- sine wave function
- about / How it works...
- sized queues
- large inputs, managing with / Managing large inputs with sized queues, How it works...
- Software Transactional Memory (STM)
- about / Introduction
- URL / Introduction
- SOMs
- clustering, in Incanter / Clustering with SOMs in Incanter, How to do it…
- source tap
- URL / There's more
- SpamAssassin website
- URL / Getting ready
- SPARQL
- RDF data, querying with / Querying RDF data with SPARQL, Getting ready, How to do it…, How it works…
- sparql-select-query function / How it works…
- sparse vectors / Mapping documents to a sparse vector space representation
- spelling errors
- fixing / Fixing spelling errors, How to do it…, How it works…
- split-word function / How it works…
- SQLite database
- standard error
- State of the Union (SOTU)
- about / Getting ready
- URL / Getting ready
- statistical tokenizer
- URL / Getting ready
- Stat Trek
- about / How it works…
- STM
- program complexity, managing with / Managing program complexity with STM, Getting ready, How to do it…, How it works…, See also
- and agents, combining / Combining agents and STM, How to do it…, How it works…
- side effects, introducing into / Introducing safe side effects into the STM, How to do it…
- stoplists
- about / Focusing on content words with stoplists
- content words, focusing on with / Focusing on content words with stoplists, Getting ready
- stopwords
- structures
- navigating, with zippers / Navigating structures with zippers
- summary statistics
- generating, with $rollup / Generating summary statistics with $rollup, How it works…
- SVG
- URL / There's more…
- SVMs
- data, classifying with / Classifying data with support vector machines, Getting ready, How to do it…
- synonym maps
- consistency, maintaining with / Maintaining consistency with synonym maps, How it works…
T
- term frequency-inverse document frequency (tf-idf)
- text
- tokenizing / Tokenizing text, How it works…
- textual data
- scraping, from web pages / Scraping textual data from web pages, How to do it…, How it works…
- TF-IDF
- document frequencies, scaling with / Scaling document frequencies with TF-IDF, How to do it…, How it works…
- tf-idf-freqs function / How to do it…
- things
- finding, with NER / Finding people, places, and things with Named Entity Recognition, How to do it…, How it works…
- thread starvation / Introducing safe side effects into the STM
- thunk / How to do it…
- times
- parsing / Parsing dates and times, How to do it…, There's more…
- time series charts
- creating, with D3 / Creating time series charts with D3, How to do it…, How it works…, There's more…
- time series data
- working, with Incanter Zoo / Working with time series data with Incanter Zoo, Getting ready, There's more...
- tmux
- URL / How it works…, There's more…
- to-dataset function / How it works…
- to-matrix function / How it works…
- tokenization
- about / Tokenizing text
- topic modeling
- performing, with MALLET / Performing topic modeling with MALLET, How to do it…, See also…
- about / Performing topic modeling with MALLET
- transpose-char function / How it works…
- triple store
- creating / Creating the triple store
- type hints
- using / Using type hints, How to do it…, See also
U
- UCI datasets
- URL / Getting ready
- update-totals function / How to do it…
V
- validators
- data consistency, maintaining with / Maintaining data consistency with validators, How to do it…, See also
- Valip
- data, validating with / Validating data with Valip, How to do it…
- URL / Validating data with Valip
- values change
- working with / Working with changes in values, How to do it…, How it works…
- Variable binding
- URL / There's more
- variable relationships
- simplifying, by scaling variables / Scaling variables to simplify variable relationships, How it works…
- variables
- scaling, to simplify variable relationships / Scaling variables to simplify variable relationships, How it works…
- smoothing, to decrease variation / Smoothing variables to decrease variation, Getting ready, How to do it…, How it works…
- variation
- decreasing, by smoothing variables / Smoothing variables to decrease variation, Getting ready, How to do it…, How it works…
- vectors
- passing, into R / Passing vectors into R, How it works…
- vector space representation
- documents, mapping to / Mapping documents to a sparse vector space representation, How to do it…
- view
- datasets, viewing with / Viewing datasets interactively with view, How it works…
W
- $where
- datasets, filtering with / Filtering datasets with $where, How it works…
- watchers
- processing, monitoring with / Monitoring processing with watchers, How to do it…, How it works…
- about / Monitoring processing with watchers
- concurrent programs, debugging with / Debugging concurrent programs with watchers, How to do it…
- web application
- configuring / Configuring and setting up the web application
- setting up / Configuring and setting up the web application
- web pages
- data, scraping from tables / Scraping data from tables in web pages, Getting ready, How to do it…, How it works…
- textual data, scraping from / Scraping textual data from web pages, How to do it…, How it works…
- Weka
- URL / Introduction, See also…
- CSV file, loading into / Loading CSV and ARFF files into Weka, How to do it…, There's more…
- ARFF file, loading into / Loading CSV and ARFF files into Weka, How to do it…, See also…
- columns, filtering / Filtering, renaming, and deleting columns in Weka datasets, How to do it…
- columns, renaming / Filtering, renaming, and deleting columns in Weka datasets, How to do it…
- columns, deleting / Filtering, renaming, and deleting columns in Weka datasets, How to do it…
- hierarchical clusters, finding / Finding hierarchical clusters in Weka, How to do it…, How it works…
- Weka datasets
- URL / Getting ready
- Weka documentation
- URL / There's more…
- Weka library
- about / Introduction
- Windows
- Mathematica, setting up for / Setting up Mathematica to talk to Clojuratica for Windows, How to do it..., How it works...
- within-cluster sum of squared errors (WCSS) / Analyzing the results
- World Bank
- URL / Getting ready
X
- X-Rates
- XML
- and JSON, comparing / Comparing XML and JSON
- XML data
- reading, into Incanter datasets / Reading XML data into Incanter datasets, Getting ready, How to do it…, There's more…
- structures, navigating with zippers / Navigating structures with zippers
- processing, in pipeline / Processing in a pipeline
- XML and JSON, comparing / Comparing XML and JSON
Z
- 7-Zip
- URL / Setting up R
- zipper
- about / How it works…
- URL / Navigating structures with zippers