Big Data Interview Questions
Questions Answers Views Company eMail

Which company initially developed Hive ?

1 1902

What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?

710

What are the components of Hive architecture?

782

What is HDFS Federation?

645

What are watches?

690

Explain the Job OutputFormat?

692

Virtual Box & Ubuntu Installation?

615

What is the difference between HDFS and NAS ?

954

How Mapper is instantiated in a running job?

738

Where do you specify the Mapper Implementation?

623

What is the meaning of speculative execution in Hadoop? Why is it important?

745

What is the InputFormat ?

671

Command to format the NameNode?

626

Who is a 'user' in HDFS?

650

What is the meaning of the term "non-DFS used" in Hadoop web-console?

934


Un-Answered Questions { Big Data }

What is difference between spark and hadoop?

180


How does impala process join queries for large tables?

43


Why comparison of types is important for MapReduce?

670


What is the full form of MSLAB?

146


Explain what are the various types of Transformation on DStream?

192






What are the various diagnostic operators available in Apache Pig?

449


Is the keyword 'DEFINE' like a function name?

582


Where can I find impala documentation?

62


What is the role zookeeper plays in a cluster of kafka?

274


What is the history of apache mahout? Once did it start?

37


Can you use spark to access and analyze data stored in cassandra databases?

210


What are the different methods to run Spark over Apache Hadoop?

414


What is the difference between rdbms and hadoop?

388


What counter in Hadoop MapReduce?

372


How do you check if a particular partition exists?

409