Large set of DNA reads for testing: Your program should take no more than 4 minutes to assemble this set of reads on the machine mi1.biostat.wisc.edu.
A small subset of 454 reads from the sequencing project
for the bacterial species Enterobacter cloacae subsp. cloacae ATCC
13047. The sequencing of this species is part of the Assembling the Tree of Life:
Enterobacteriaceae project, here at UW-Madison.
Once you have assembled these reads, use BLASTX
on your assembled sequence to determine which famous gene is contained
in the region of the genome you have assembled. For the Database,
choose "Non-redundant protein sequences (nr)", and for the Genetic
Code, choose "Bacteria and Archea (11)". Hint: you have seen this
gene near the end of the Introduction to Molecular Biology lectures.