Haixu Tang
Assistant Professor of Informatics
Adjunct Assistant Professor in Computer
Science
Affiliated Faculty at Center for
Genomics and Bioformatics
Executive committee member of National
Center for Glycomics and Glycoproteomics
Office: Informatics 225
Phone: (812)-856-1859
Fax: (812)-856-4764
E-mail: hatang (at) indiana.edu
Lab: Computational Omics Lab (COL)
Go to Bioinformatics programs at IU Bloomington
Mailing address:
Informatics Building
901 E. 10th St.
Bloomington, IN 47408-3912
Educaltional Background:
Ph.D. in Biochemistry, Shanghai Institute of Biochemistry, Chinese
Academy of Sciences.
B.S. in Physics, Department of Physics, Nanjing University, China. 
Research Interests:
Algorithmic and statistical problems in Bioinformatics, particularly
in
- repeats and segmental duplication in
eukaryotic genomes
- fragment assembly in DNA sequencing
- mass spectrometry data analysis for
proteomics, glycomics and glycoproteomics
- gene regulatory analysis
Teaching:
Fall 2004
I602: Capstone
project for bioinformatics master students
Spring 2005
I590:
Topics in Informatics:
Introduction to genomics (for non-biology
students)
Fall 2006
I519:
Introduction to Bioinformatics
Spring 2006
I690:
Computational techniques
in comparative genomics
I627:
Seminar in Bioinformatics
Fall 2006
I617:
Informatics in life sciences and chemistry
Spring 2007
I529:
Biological sequence analysis
Fall 2007
I690:
Advanced algorithms in bioinformatics
Spring 2008
I529:
Biological sequence analysis
Fall 2008
I201:
Mathematics foundations in Informatics
Awards
-
NSF Early Career Development (CAREER) Award, 2007.
Books and Book Chapters
-
S. Kim, H. Tang and E. Mardis, ed. Genome sequencing technology and algorithms, Artech House Publishers (2007). Amazon
-
H. Tang and S. Kim, Bioinformatics: Mining the massive data from high throughput genomics experiments, pp1-24,
in Analysis of Biological Data: A Soft Computing Approach, edited by Sanghamitra Bandyopadhyay, Ujjwal Maulik and Jason T. L. Wang, World Scientific Press (2007).
-
Y. Ye and H. Tang, Dynamic programming algorithms for sequence and structure comparison, pp9-28,
in
Bioinformatics Algorithms: Techniques and Applications
, edited by Ion Mandoiu and Alexander Zelikovsky , Wiley Press (2008).
Recent Publications:
Q. Sheng, Y. Mechref, Y. Li, M. V. Novotny, H. Tang (2008), A computational approach to characterizing bond linkage of glycan isomers using MALDI-TOF-TOF mass spectrometry, Rapid Comm. Mass Spec. in press.
C. Shen, Q. Sheng, J. Dai, Y. Li, R. Zeng, H. Tang (2008), On the estimation of false positives in peptide identifications using decoy search strategy, Proteomics, in press.
C. Yuan, Q. Sheng, H. Tang, Y. Li, R. Zeng, R. J. Solaro (2008), Quantitative comparison of Sarcomeric phosphoproteomes of neonatal and adult rat hearts, Am. J Physiol. Heart Circ. Physiol., in press.
Y. Ye, H. Tang (2008), An ORFome assembly approach to metagenomics sequences analysis. Proceedings of the 7th Annual International Conference on Computational Systems Biology (CSB'08), 3-13. CSB online
Y. F. Li, R. J. Arnold, Y. Li, P. Radivojac, Q. Sheng, H. Tang (2008), A Bayesian approach to protein inference problem in shotgun proteomics. Proceedings of the 12th Annual International Conference on Computational Molecular Biology (RECOMB08), LNBI 4955, 167-180. LNBI online
S. Saha, S. H. Harrison, C. Shen, H. Tang, P. Radivojac, R. J. Arnold, X. Zhang, J. Y. Chen (2008), HIP2: An online database of human plasma proteins from healthy individuals. BMC Med Genomics. 1:12. Pubmed
J. H. Choi, S. Kim, H. Tang, J. Andrew, D. G. Gilbert, J. K. Colbourne (2008), A machine-learning approach to combined evidence validation of genome assemblies, Bioinformatics, 24(6):744-50. Pubmed
P. Alves, R. J. Arnold, D. E. Clemmer, Y. Li, J. P. Reilly, Q. Sheng, H. Tang, Z. Xun, R. Zeng, and P. Radivojac (2008), Fast and accurate identification of semi-tryptic peptides in shotgun proteomics, Bioinformatics, 24: 102-109. Pubmed
- Z. Jiang, H. Tang, M. Ventura, M. F. Cardone, T. Marques-Bonet, X. She, P. A. Pevzner, E. E. Eichler (2007), Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution. Nat Genet. 39:1361-1368. Pubmed, Commentary on Nature Genetics.
- H. Tang (2007) Genome assembly, rearrangement and repeats, Chem Rev., 107(8):3391-3406. Pubmed.
- S. H. Bae, H. Tang, J. Wu, J. Xie and S. Kim (2007), dPattern: transcription factor binding site (TFBS) discovery in human genome using a discriminative pattern analysis. 23:2619-2621. Pubmed.
- M. Rho, J. H. Choi, S. Kim, M. Lynch and H. Tang (2007), De novo identification of LTR retrotransposons in eukaryotic genomes. BMC Genomics, 8:90. Pubmed.
- A. Sundquist, M. Ronaghi, H. Tang, P. A. Pevzner and S. Batzoglou (2007), Whole-genome sequencing and assembly with high-throughput, short-read technologies. PLoS ONE, 2:e484. Pubmed.
- Y. Wu, Y. Mechref, I. Klouckova, M. V. Novotny and H. Tang (2007), A computational approach for the identification of site-specific protein glycosylations through ion-trap mass spectrometry, The Third RECOMB Satellite meeting on Proteomics, Lecture Notes in Bioinformatics, 4532:96-107, LNCS online.
- P. Alves, R. J. Arnold, M. V. Novotny, P. Radivojac, J. P. Reilly and H. Tang (2007), Advancement in protein inference from shotgun proteomics using peptide detectability. Pacific Symposium on Biocomputing, 12:409-420. Fulltext from PSB online proceeding.
- R. Patwardhan, H. Tang, S. Kim and M. Dalkilic (2006), An approximate de Bruijn graph approach to multiple local alignment and motif discovery in protein sequences, The First International Workshop in data mining and bioinformatics, Lecture Notes in Bioinformatics, 4316:158-169.
- D. Zhi, B. Raphael, A. Price, H. Tang and P. Pevzner (2006), Identifying repeat domains in large genomes, Genome Biology, 7(1):R7, Pubmed.
- D. Zhi, R. Keich, P. Pevzner, S. Heber and H. Tang (2006), Checking for base-calling errors in repeats. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 4(1):54-64, 2007, Pubmed
- H. Tang, R. J. Arnold, P. Alves, Z. Xun, D. E. Clemmer, M. V. Novotny, J. P. Reilly and P. Radivojac (2006), A computational approach toward label-free protein quantification using predicted peptide detectability.
Bioinformatics, 22(14):e481-488, ISMB 2006. Pubmed.
- V. Bafna, H. Tang and Shaojie Zhang (2006), Consensus Folding of Unaligned RNA Sequences Revisited. J. Comp. Biol. 13(2):283-295, Pubmed.
- R. J. Arnold, N. Jayasankar, D. Aggarwal, H. Tang and P. Radivojac (2006), A machine learning approach to predicting peptide fragmentation spectra. Proceeding of Pacific Symposium on Biocomputing, 11:219-230, Fulltext from PSB online proceeding.
- H. Tang, Y. Mechref and M. Novotny (2005), Automatic Interpretation of MS/MS Spectra of Oligosaccharides. Bioinformatics, 21 Suppl 1:i431-i439, ISMB 2005, Pubmed
- V. Bafna, H. Tang and Shaojie Zhang (2005), Consensus Folding of Unaligned RNA Sequences Revisited. Proceedings of the Ninth Annual International Conference on Computational Molecular Biology (RECOMB'05), 172-187, May 2005, Boston, USA, ACM.
- B. Raphael, D. Zhi, H. Tang and P. A. Pevzner, A novel method for
multiple alignment of sequences with repeats and shuffled elements.
Genome Res. 2004, 14: 2336-2346.
Pubmed
- N. Bandeira, H. Tang, V. Bafna and P. A. Pevzner, Shotgun protein
sequencing by tandem mass assembly. Analytical Chemistry, 2004,
76:7221-33.
Pubmed
- P. A. Pevzner, H. Tang and G. P. Tesler, De novo repeat classification and
fragment assembly. Genome Res. 2004 Sep; 14(9): 1786-96.
Pubmed
- M. Chaisson M, P. A. Pevzner and H. Tang, Fragment assembly with
short reads. Bioinformatics. 2004 Sep 1; 20(13): 2067-74.
Pubmed
- P. A. Pevzner, H. Tang and G. P. Tesler, De novo repeat classification and
fragment assembly. Proceedings of the Eighth Annual International
Conference on Computational Molecular Biology (RECOMB'04), April 2004, San Diego, USA, ACM
Press. 2004 Sep; 14(9): 1786-96.
- S. Heber, M. Alekseyev M, S. H. Sze, H. Tang and P. A. Pevzner,
Splicing graphs and EST assembly problem. Bioinformatics. 2002; 18
Suppl 1 :S181-8 (ISMB 2002 issue).
Pubmed
- P. A. Pevzner and H. Tang, Fragment assembly
with double-barreled data. Bioinformatics. 2001 Jun;17 Suppl 1:S225-33
(Special ISMB 2001 issue).
Pubmed
- P. A. Pevzner, H. Tang and M. S. Waterman
(2001) A New Approach to Fragment Assembly in DNA
Sequencing. Proceedings of the Fifth Annual International Conference on
Computational Molecular Biology (RECOMB'01), April 2001, Montreal, Canada, ACM
Press.
- Q. Tu, H. Tang and D. Ding, MedBlast: searching articles related to a biological sequence.
Bioinformatics. 2004,20:75-77.
Pubmed
- P. A. Pevzner, H. Tang and M. S. Waterman
(2001), An Eulerian path approach to DNA fragment assembly. Proc. Natl.
Acad. Sci. USA, 98:9748-9753.
Pubmed
Nature News
- S. Kruglyak and H. Tang (2001) A New
Estimator of Significance of Correlation in Time Series Data, J. Comp.
Biol. 2001,8:463-470.
Pubmed
- S. Kruglyak and H. Tang (2000) Regulation of
Adjecent Yeast Genes. Trends in Genetics, 16:109-111.
Pubmed