Header menu link for other important links
X
Analysis of DNA data using hadoop distributed file system
, P. Ilango
Published in Research Journal of Pharmaceutical, Biological and Chemical Sciences
2016
Volume: 7
   
Issue: 3
Pages: 796 - 803
Abstract
The objective of this paper is to present how data can be parallel processed using hadoop distributed file system. Here, we are implementing the DNA data i.e., its sequence to find out the number of copies of the sequence, molecular weight and the number of copies of the molecular weight in the given data. Hadoop distributed file system is used to find out the repetitions of the given data so that the amount of original data can be reduced in size. Thus saving the space for storing a huge amount of data. Moreover, the replicas of a same data can be replaced with a single data and their replicas can be tracked with a unique identification and thus providing with efficient use of storages for a large set of data. © 2010 RJPBCS.
About the journal
JournalResearch Journal of Pharmaceutical, Biological and Chemical Sciences
PublisherResearch Journal of Pharmaceutical, Biological and Chemical Sciences
ISSN09758585