Apache HBase is a non-relational database modeled after Google’s
Kafka is comparable to traditional messaging systems such as
Spark powers a stack of high-level tools including Spark SQL, MLlib for _________
Differentiate between SQL and NoSQL.
Define Spark MLlib
List a few applications that are implemented using Spark.
Name a few applications that handle streaming data, and explain why streaming-data handling is required.
______________ and ___________ are the two main components of the Hadoop ecosystem.
List the features of Hadoop 2.0.
Apache Hadoop YARN stands for ___________
Which non-MapReduce application is used to analyse a social network’s graph?
Some applications that run over YARN and HDFS without using MapReduce directly are
___________ is a general-purpose computing model and runtime system for distributed data analytics.
Which of the following statements is wrong?
Hadoop is a framework that works with two parts, namely
According to analysts, traditional IT systems provide a foundation for integrating with big data technologies such as Hadoop. Specify the purpose.
The functionality of an Iterator is
The motivation behind MapReduce is
What are the important parts of MapReduce in Hadoop 2.0?
Group-by-key functionality is used by which of the following functions?
If a file name and its text are given as input to MapReduce, which one forms the key?
The term MapReduce is borrowed from the language
The MapReduce architecture consists of
Which of the following are components of a distributed file system?
Name the function that merges all intermediate values associated with the same intermediate key
Specify the stages of data flow in MapReduce?
Consider a Hadoop system with a default split size of 128 MB.
It stores 1 GB of data in 8 blocks.
How many mappers are required in this case?
What is the limitation of the original MapReduce v1 paradigm?
In MapReduce 1.0, which component is responsible for job scheduling and resource allocation?
Name the two components of MapReduce.