Jornal Internacional de Mineração de Dados Biomédicos

Jornal Internacional de Mineração de Dados Biomédicos
Acesso livre

ISSN: 2090-4924

Abstrato

Implementation of Decision Tree Using Hadoop MapReduce

Tianyi Yang and Anne Hee Hiong Ngu

Hadoop is one of the most popular general-purpose computing platforms for the distributed processing of big data. HDFS is implementation of distributed file system by Hadoop to be able to store huge amount of data in a reliable way and serve data processing component by Hadoop at the same time. MapReduce is the main processing engine of Hadoop. In this study, we have implemented HDFS and MapReduce for a well- known learning algorithm—decision tree in a scalable fashion to large input problem size. Computational performance with node count and problem size is evaluated.

Isenção de responsabilidade: Este resumo foi traduzido com recurso a ferramentas de inteligência artificial e ainda não foi revisto ou verificado.
Top