Data and Text Mining for Computational Biology
CS 6365
Fall 2009
“An Introduction to Bioinformatics Algorithms (Computational Molecular Biology)”, by Neil C. Jones and Pavel A. Pevzner, MIT Press, 2004.
ISBN
0262101068
448 pages
Available on Amazon.com or Barnes and Noble for $49
“Data Mining : Concepts and Techniques” by Jiawei Han and Micheline Kamber, Elsevier, 2nd edition, 2006.
ISBN 1558609016
800 pages
Available on Amazon.com or Barnes and Noble for $55
“Bioinformatics: The Machine Learning Approach” by Pierre Baldi and Soren Brunak, 2nd edition, 2001.
“Data mining : multimedia, soft computing, and bioinformatics” by Sushmita Mitra and Tinku Acharya, 2003.
Both of the above are available as full-text eBooks via http://library.utdallas.edu.
Biology: “Molecular Biology of the Cell”, by Bruce Alberts et al., 4th edition, 2002.
Machine learning: “Machine Learning”, by Tom Mitchell, 1997.
Statistics: “The elements of statistical learning: data mining, inference, and prediction”, by Trevor Hastie, Robert Tibshirani and Jerome Friedman, 2001.
Data structures and algorithms: “Introduction to Algorithms”, by Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, and Clifford Stein, 2nd edition, 2001.
Grading
Class participation: 20%
Homework assignments: 30% total
Midterm: 10%
Team project: 20%
Final exam: 20%
90+ gets you an “A”, 70+ a “B”
|
Topic |
Date |
Lecture
Notes |
|
Introduction |
Monday 8/24 |
|
|
Biology, Part 1: Classification, Cells, and Proteins |
Wednesday 8/26 |
|
|
Biology, Part 2: DNA, RNA, Replication, and Reproduction |
Monday 8/31 |
|
|
Biology, Part 3: Mitosis, Meiosis, Transcription, and Translation |
Wednesday 9/2 |
|
|
Biology, Part 4: Regulation, Gene Networks, and Systems Biology |
Wednesday 9/9 |
|
|
Biology, Part 5: Text Mining and DNA Microarrays |
Monday 9/14 |
|
|
Biology, Part 6: Evolution and Forensic Biology |
Wednesday 9/16 |
|
|
Biology, Part 7: Gene Amplification and Recombinant DNA |
Monday 9/21 |
|
|
Challenges in Bioinformatics |
Wednesday 9/23 |
|
|
Databases, part 1 |
Monday 9/28 |
|
|
Databases, part 2 |
Wednesday 9/30 |
|
|
Alignment, part 1 |
Monday 10/5 |
|
|
Alignment, part 2 |
Wednesday 10/7 |
|
|
Alignment, part 3 |
Monday 10/12 |
|
|
Local Alignment |
Wednesday 10/14 |
|
|
Approximate Alignment |
Monday 10/19 |
|
|
Midterm |
Wednesday 10/21 |
No slides |
|
Classifier Evaluation |
Monday 10/26 |
|
|
Motifs |
Wednesday 10/28 |
|
|
Finding Motifs |
Monday 11/2 |
|
|
Multiple Sequence Alignment |
Wednesday 11/4 |
|
|
Statistical Estimation |
Monday 11/9 |
|
|
Simulation and Classification |
Wednesday 11/11 |
|
|
Classification |
Monday 11/16 |
|
|
Classification Methods |
Wednesday 11/18 |
Lecture #24 |
Additional lecture information will be added to the table above as the course progresses.
Information about homework
Homework #1. Assigned Wednesday September 30, due Wednesday October 14.
Homework #2. Assigned Monday October 26, due Monday November 16.
Homework #3. Assigned Wednesday November 18, due Wednesday December 2.
The schedule of student presentations of projects will be listed here.
Presentations will take place in early December.
Students must discuss their proposed project with the instructor by early November.
Papers and additional slides that supplement the course material will be listed here.
Russel Lande. “Models of speciation by sexual selection on polygenic traits”. Proceedings of the National Academies of Sciences, 78(6):3721-25, June 1981.