Sequence similarity search and sequence clustering algorithms

We develop novel sequence similarity-search and sequence clustering algorithms.



SECLAF: Sequence classification using deep learning

METAHMM finds genes in metagenomes

Protein Sequence Analyzer with graphic multiple choice options

SwissAlign fast alignment with the Smith-Waterman algorithm

ProtDict: Protein cross-reference tool


Balázs Szalkai, Vince Grolmusz: MetaHMM: A Webserver for Identifying Novel Genes with Specified Functions in Metagenomic Samples;  Genomics, available online May 23, 2018,

Balázs Szalkai, Vince Grolmusz: SECLAF: A Webserver and Deep Neural Network Design Tool for Hierarchical Biological Sequence Classification,  Bioinformatics, Vol 34, No. 14, pp. 2487-2489 2018

Balázs Szalkai, Vince Grolmusz: Near Perfect Protein Multi-Label Classification with Deep Neural Networks,  Methods Vol. 132, pp. 50-56, (2018),

Gábor Iván, Vince Grolmusz: Dimension reduction of clustering results in bioinformatics; Biochimica et Biophysica Acta (BBA)- Proteins and Proteomics; 1844 (2014), pp. 2277-2283; DOI: 10.1016/j.bbapap.2014.08.015  Preprint version: arXiv preprint arXiv:1309.1892

Balázs Szalkai, Ildikó Scheer, Kinga Nagy, Beáta G Vértessy, Vince Grolmusz, The Metagenomic Telescope, PLoS One, Vol. 9, No. 7, e101605, July 2014,

Dániel Bánky, Balázs Szalkai, Vince Grolmusz: An Intuitive Graphical Webserver for Multiple-Choice Protein Sequence Search; arXiv preprint arXiv:1312.4660; also in Gene, Vol. 539, No. 1, pp. 152-153, April 2014,

Gábor Iván, Dániel Bánky, Vince Grolmusz: Fast and Exact Sequence Alignment with the Smith-Waterman Algorithm: The SwissAlign Webserver. arXiv preprint arXiv:1309.1895; Journal version appeared in Gene Reports, Vol. 4, September 2016, Pages 26-28.

Gábor Iván, Zoltán Szabadka, Vince Grolmusz: A Hybrid Clustering of Protein Binding Sites (also as a preliminary version).  FEBS Journal Vol. 277, No. 6. pp. 1494-1502  (2010).

Gabor Ivan, Vince Grolmusz: Revealing the density-based clustering structure of the SwissProt database (poster) Page 606 of Abstract book, International Congress of Mathematicians 2010 Hyderabad, India, 19th-27th August, 2010.

Gábor Ivan, Daniel Banky, Vince Grolmusz: Refining the protein interaction network generated by NASCENT using sequence alignment 18th Annual International Conference on Intelligent Systems for Molecular Biology (ISMB), Boston, MA July 11-13, 2010

Gábor Iván, Zoltán Szabadka, Vince Grolmusz: On the asymmetry of the residue compositions of the binding sites on protein surfaces Journal of Bioinformatics and Computational Biology, Vol. 7. No. 6. (2009) pp. 931-938.