Searching protein 3-D structures for optimal structure alignment using intelligent algorithms and data structures

Novosád, Tomáš; Snášel, Václav; Abraham, Ajith; Yang, Jack Y.

dc.contributor.author	Novosád, Tomáš
dc.contributor.author	Snášel, Václav
dc.contributor.author	Abraham, Ajith
dc.contributor.author	Yang, Jack Y.
dc.date.accessioned	2010-12-07T14:10:51Z
dc.date.available	2010-12-07T14:10:51Z
dc.date.issued	2010
dc.identifier.citation	IEEE Transactions on Information Technology in Biomedicine. 2010, vol. 14, no. 6, p. 1378-1386.	en
dc.identifier.issn	1089-7771
dc.identifier.uri	http://hdl.handle.net/10084/83472
dc.description.abstract	In this paper, we present a novel algorithm for measuring protein similarity based on their 3-D structure (protein tertiary structure). The algorithm used a suffix tree for discovering common parts of main chains of all proteins appearing in the current research collaboratory for structural bioinformatics protein data bank (PDB). By identifying these common parts, we build a vector model and use some classical information retrieval (IR) algorithms based on the vector model to measure the similarity between proteins - all to all protein similarity. For the calculation of protein similarity, we use term frequency inverse document frequency (tf × idf) term weighing schema and cosine similarity measure. The goal of this paper is to introduce new protein similarity metric based on suffix trees and IR methods. Whole current PDB database was used to demonstrate very good time complexity of the algorithm as well as high precision. We have chosen the structural classification of proteins (SCOP) database for verification of the precision of our algorithm because it is maintained primarily by humans. The next success of this paper would be the ability to determine SCOP categories of proteins not included in the latest version of the SCOP database (v. 1.75) with nearly 100% precision.	en
dc.language.iso	en	en
dc.publisher	IEEE Engineering in Medicine and Biology Society	en
dc.relation.ispartofseries	IEEE Transactions on Information Technology in Biomedicine	en
dc.relation.uri	https://doi.org/10.1109/TITB.2010.2079939	en
dc.subject	bioinformatics	en
dc.subject	information retrieval	en
dc.subject	pattern classification	en
dc.subject	proteins	en
dc.subject	proteomics	en
dc.subject	tree data structures	en
dc.title	Searching protein 3-D structures for optimal structure alignment using intelligent algorithms and data structures	en
dc.type	article	en
dc.identifier.location	Není ve fondu ÚK	en
dc.identifier.doi	10.1109/TITB.2010.2079939
dc.identifier.wos	000283982200008

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Publikační činnost Katedry informatiky / Publications of Department of Computer Science (460) [562]
Kolekce obsahuje bibliografické záznamy publikační činnosti (článků) akademických pracovníků Katedry informatiky (460) v časopisech a v Lecture Notes in Computer Science registrovaných ve Web of Science od roku 2003 po současnost.
Publikační činnost VŠB-TUO ve Web of Science / Publications of VŠB-TUO in Web of Science [7798]
Kolekce obsahuje bibliografické záznamy článků akademických pracovníků VŠB-TUO publikovaných v časopisech indexovaných ve Web of Science od roku 1990 po současnost.
Články z časopisů s impakt faktorem / Articles from Impact Factor Journals [6377]
Články z časopisů (od roku 2008), které v době vydání článku měly impakt faktor.

Show simple item record