Big Data Interview Questions
Questions Answers Views Company eMail

Distinguish HDFS Block and Input Unit?

24

What are the difference between of the “HDFS Block” and “Input Split”?

58

What happens when two users try to access to the same file in HDFS?

47

Does the HDFS go wrong? If so, how?

34

Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?

33

What happens when two clients try to access the same file on HDFS?

105

What are the different file permissions in the HDFS for files or directory levels?

81

How to read file in HDFS?

32

When NameNode enter in Safe Mode?

25

Write command to copy a file from HDFS to linux(local).

34

How to create directory in HDFS?

38

How to Delete directory from HDFS?

26

What are the main features of hdfssite.xml?

22

Write the command to copy a file from linux to hdfs?

40

How to copy file from HDFS to local?

38


Un-Answered Questions { Big Data }

What are tokens in cassandra?

46


Can We Change settings within Hive Session? If Yes, How?

448


What are the port numbers of task tracker?

255


Which are the elements of kafka?

339


What types of costs are associated in creating index on hive tables?

492






What are the types of Apache Spark transformation?

196


Explain the difference between a MapReduce InputSplit and HDFS block?

392


What does it mean by Columnar Storage Format?

216


Explain how HDFS communicates with Linux native file system?

26


What are some of the apache pig use cases you can think of?

287


Define a combiner?

356


What is the purpose of DataNode block scanner?

663


Explain job scheduling through JobTracker

402


How to enable/configure the compression of map output data in hadoop?

414


What is the key- value pair in MapReduce?

380