Big Data Interview Questions
Questions Answers Views Company eMail

How much Metadata will be created on NameNode in Hadoop?

1

What do you mean by metadata in Hadoop?

1

How to Delete directory and files recursively from HDFS?

1




How does HDFS Index Data blocks? Explain.

1

How to access HDFS?

1

Explain the process that overwrites the replication factors in HDFS?

1

Difference Between Hadoop and HDFS?

1

How would you import data from MYSQL into HDFS ?

1

Define HDFS and talk about their respective components?

1

What is the module in HDFS?

1

Tell me two most commonly used commands in HDFS?

1

What is a block in HDFS? what is the default size in Hadoop 1 and Hadoop 2? Can we change the block size?

1

HDFS is used for applications with large data sets, not why Many small files?

1

What is impala?

27

Why we need impala hadoop?

6







Un-Answered Questions { Big Data }

How to restart NameNode or all the daemons in Hadoop HDFS?

1


Explain how can you debug hadoop code?

3


What is throughput? How does HDFS provide good throughput?

1


What is the use of illustrate in pig?

26


What happen when namenode enters in safemode in hadoop?

1






What is the purpose of DataNode block scanner?

310


How Pig programming gets converted into MapReduce jobs?

53


Can you list some useful zookeeper tools?

1


Can you define oozie?

9


Replication causes data redundancy then why is is pursued in HDFS?

1


what are views in Hive?

81


Explain HCatalog Create Table CLI along with its syntax?

1


Specify some uses of HBase?

9


Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?

1


What is parallelize in spark?

19