Implementation of a Parallel Protein Structure Alignment Service on Cloud

Protein structure alignment has become an important strategy by which to identify evolutionary relationships between protein sequences. Several alignment tools are currently available for online comparison of protein structures. In this paper, we propose a parallel protein structure alignment servic...

Full description

Saved in:
Bibliographic Details
Main Authors: Che-Lun Hung, Yaw-Ling Lin
Format: Article
Language:English
Published: Wiley 2013-01-01
Series:International Journal of Genomics
Online Access:http://dx.doi.org/10.1155/2013/439681
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Protein structure alignment has become an important strategy by which to identify evolutionary relationships between protein sequences. Several alignment tools are currently available for online comparison of protein structures. In this paper, we propose a parallel protein structure alignment service based on the Hadoop distribution framework. This service includes a protein structure alignment algorithm, a refinement algorithm, and a MapReduce programming model. The refinement algorithm refines the result of alignment. To process vast numbers of protein structures in parallel, the alignment and refinement algorithms are implemented using MapReduce. We analyzed and compared the structure alignments produced by different methods using a dataset randomly selected from the PDB database. The experimental results verify that the proposed algorithm refines the resulting alignments more accurately than existing algorithms. Meanwhile, the computational performance of the proposed service is proportional to the number of processors used in our cloud platform.
ISSN:2314-436X
2314-4378