Big Data Interview Questions
Questions Answers Views Company eMail

How much Metadata will be created on NameNode in Hadoop?

1

What do you mean by metadata in Hadoop?

1

How to Delete directory and files recursively from HDFS?

1

How does HDFS Index Data blocks? Explain.

1

How to access HDFS?

1

Explain the process that overwrites the replication factors in HDFS?

1

Difference Between Hadoop and HDFS?

1

How would you import data from MYSQL into HDFS ?

1

Define HDFS and talk about their respective components?

1

What is the module in HDFS?

1

Tell me two most commonly used commands in HDFS?

1

What is a block in HDFS? what is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?

1

HDFS is used for applications with large data sets, not why Many small files?

1

What is impala?

29

Why we need impala hadoop?

12







Un-Answered Questions { Big Data }

How do I stop flume agent?

11


Explain about trformations and actions in the context of rdds?

24


What is a spark shuffle?

21


Explain what happens if, during the PUT operation, HDFS block is assigned a replication factor 1 instead of the default value 3?

1


Explain about the core components of a distributed Spark application?

29






Why is Hive not suitable for OLTP systems?

130


what is ODBC and JDBC connectivity in Hive?

115


Explain Apache Ambari architecture?

3


What is big data concept?

19


Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?

21


What Platforms Cassandra runs on?

12


What is faster than apache spark?

17


Is it possible to leverage real time analysis on the big data collected by flume directly? If yes, then explain how?

10


What is the best hardware configuration to run Hadoop?

684


Explain Clustering in Hive?

107