Homework Assignment #1
Due in class 3/11
Part 1: Pairwise Sequence Alignment
Using the BLOSUM 50
substitution matrix, show (a) the alignment matrices, and (b) the
returned alignments for the following three cases:
- global alignment of the sequence HEART and the sequence HIT with linear gap penalty function where g = -3
- global alignment of HEART and HIT with affine gap penalty function where g = -1 and h = -4
- local alignment of PARK and SRQ with linear gap penalty function where g = -3
Be sure to illustrate the links followed in the traceback procedure.
If there is more than one optimal alignment, return the
highroad alignment. For the affine penalty case you should
use distinct colors or symbols to indicate into which matrix each link
is pointing.
Part 2: Whole Genome Alignment
Consider using MUMmer to align the two toy genomes:
Genome A: atctactt
Genome B: ttatgta
- Show the suffix tree for these two genomes.
- What are the MUMs for the two genomes?
- Show the alignment that would be returned by MUMmer, including the
longest subsequence of MUMs, and descriptions of any other key features
of the alignment.