Big Data Interview Questions
Questions Answers Views Company eMail

How much metadata is created on the NameNode in Hadoop?

1

What do you mean by metadata in Hadoop?

1

How do you delete directories and files recursively from HDFS?

1




How does HDFS index data blocks? Explain.

1

How to access HDFS?

1

Explain the process that overwrites the replication factor in HDFS.

1

What is the difference between Hadoop and HDFS?

1

How would you import data from MySQL into HDFS?

1

Define HDFS and describe its components.

1

What is the module in HDFS?

1

Name the two most commonly used commands in HDFS.

1

What is a block in HDFS? What is the default block size in Hadoop 1 and Hadoop 2? Can the block size be changed?

1

Why is HDFS used for applications with large data sets and not for many small files?

1

What is Impala?

8

Why do we need Impala in Hadoop?

6







Un-Answered Questions { Big Data }

State some key points about Apache Avro.

9


What is a memtable?

3


Can the NameNode and DataNode run on commodity hardware?

530


How do you drop a database in Apache Tajo?

1


Define Spark Streaming.

84






What does Secondary NameNode mean?

3


How do you use the 'foreach' operation in Pig scripts?

5


Can you join multiple fields in Apache

4


Explain the sum(), max(), and min() operations in Apache Spark.

5


Define the role of velocity in big data.

7


What is the difference between Apache Mahout and Apache Spark’s MLlib?

1


What is Bucketing and Clustering in Hive?

24


What are the essential Hadoop tools that improve big data performance?

7


What are the debugging tools used for Apache Pig scripts?

123


Define the role of the file system in any framework.

8