Now that you have some background in genomics, let's look at how a supercomputer can help an R user investigate bacterial infection in newborn babies.
Genomic data, such as microarray gene expression data, can be used to identify sets of genes that, taken together, predict whether a new biological sample belongs to a particular class (that is, healthy or diseased). The case study presented here looks at research by the Division of Infection and Pathway Medicine at The University of Edinburgh into diagnosing bacterial infection in young infants by measuring gene expression in blood samples. We want to see how effectively R can use a supercomputer to process the large gene expression datasets involved.