Factual
How does using lowercase help in analyzing text?
Why are there so many sparse entries? Does this number make sense?
Determine how to order the instructors matrix.
When, how, and why?
How would you remove the Unicode sequences from the text?
In what list of terms would you be interested in finding associations?
How could you adjust the course credits to be inclusive of the ranges of credits?
Challenges
Can you determine the benefit of using word stems in the analysis?
Can you figure out how to display the actual text words in the dendogram rather than their index point?
Is there a way to convert a non-heterogeneous XML dataset to a matrix?