An Introduction to Apache Kafka

What is Kafka? Kafka is an open-source distributed streaming platform by Apache software foundation and it is used as a platform for real-time data pipeline.  It is a ...
Read More

MapReduce - Distributing Your Processing Power

MapReduce (MR) is one of the core features of the Hadoop ecosystem which works in accordance with YARN (Yet Another Resource Negotiator). This is an out-of-the-box solution ...
Read More

Hadoop Distributed File System - An Overview

Hadoop Distributed File System (HDFS) is the file system on which  Hadoop stores its data. This is the underlying technology that helps the data to be stored in the distributed ...
Read More

Introducing Apache Ambari

Apache Ambari is a web-based tool for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides Restful APIs and a web-based management interface.
Read More

Hadoop - Handling Big Data

Apache Hadoop is all about handling Big Data especially unstructured data. It helps in streamlining data for any distributed processing system across clusters of computers. ...
Read More