Big Data Interview Questions
Questions Answers Views Company eMail

What is crontab? Explain with suitable example?

670

What is the difference between Hadoop and Traditional RDBMS?

667

What is Partioner in hadoop? Where does it run

621

Does Hadoop requires RAID?

651

What is the purpose of DataNode block scanner?

663

What is a checkpoint?

663

Did you ever ran into a lop sided job that resulted in out of memory error

926

What is the communication channel between client and namenode/datanode?

1418

What are the different operational commands in HBase at record level and table level?

309

What is the difference between Gen1 and Gen2 Hadoop with regards to the Namenode?

901

What is the Job interface in MapReduce framework?

634

What is the problem with HDFS and streaming data like logs

671

What is the Use of SSH in Hadoop ?

642

What infrastructure do we need to process 100 TB data using Hadoop?

736

What is the difference between a Hadoop and Relational Database and Nosql?

708


Un-Answered Questions { Big Data }

What are the different file permissions in the HDFS for files or directory levels?

79


Mention some use cases of apache mahout?

41


Explain some Kafka Streams real-time Use Cases?

277


What is anti-entropy and how is it associated with merkel tree?

42


Define a namenode?

369






Define Partitions?

190


Explain about the different channel types in Flume.

71


What are the different methods to run Spark over Apache Hadoop?

414


What is the functionality of Query Processor in Apache Hive?

437


Explain what are the basic parameters of a mapper?

455


Can we change the file cached by distributed cache

231


What are the various types of shared variable in apache spark?

183


Explain what happens in text format?

300


What is difference between dataset and dataframe?

214


What stored in HDFS?

615