Book Image

Bioinformatics with Python Cookbook

By : Tiago R Antao, Tiago Antao
Book Image

Bioinformatics with Python Cookbook

By: Tiago R Antao, Tiago Antao

Overview of this book

Table of Contents (16 chapters)
Bioinformatics with Python Cookbook
Credits
About the Author
About the Reviewers
www.PacktPub.com
Preface
Index

Aligning genetic and genomic data


Before we can perform any phylogenetic analysis, we need to align our genetic and genomic data. Here, we will use MAFFT (http://mafft.cbrc.jp/alignment/software/) to perform the genome analysis and the gene analysis will be performed using MUSCLE (http://www.drive5.com/muscle/).

Getting ready

To perform the genomic alignment, you will need to install MAFFT, and to perform the genic alignment, MUSCLE will be used. Also, we will use TrimAl (http://trimal.cgenomics.org/) to remove spurious sequences and poorly aligned regions in an automated manner. On Ubuntu and Linux, MAFFT and MUSCLE can be installed using apt-get install mafft muscle packages. TrimAl will have to be manually installed.

As usual, this information is available in the corresponding notebook at 05_Phylo/Alignment.ipynb. You will need to have run the previous notebook as it will generate files that are required here.

In this chapter, we will use Biopython.

How to do it...

Take a look at the following...