Big Data Interview Questions
Questions Answers Views Company eMail

What is the difference between Hive and HDFS?

1 712

What is skewed data in Hive?

1 790

Is Kafka an ETL tool?

1 565

What language is Apache Kafka written in?

1 621

What is a ZooKeeper server?

1 87

What is the difference between map and reduce?

1 733

What is the optimal size of a file for the distributed cache?

1 746

What can skew the mean?

1 373

What is vectorized query execution?

1 408

What is a map-side join?

1 381

What does DAG stand for?

1 377

What is a data ingestion pipeline?

1 409

What is the difference between reduceByKey and groupByKey?

1 389

What is data skew and how do you fix it?

1 413

Is Databricks a database?

1 424


Un-Answered Questions { Big Data }

When do we use queries instead of scripting?

796


How can you import only a subset of rows from a table?

73


What is the roadmap for Apache Mahout version 1.0?

95


State some DDL commands with brief descriptions.

55


Explain how RDDs work with Scala in Spark

405


Did you maintain the Hadoop cluster in-house or use Hadoop in the cloud?

1074


What is the function of the UNION and SPLIT operators? Give examples.

620


How can I delete the above index named index_bonuspay?

804


How can you set an arbitrary number of Reducers to be created for a job in Hadoop?

547


Where does the data of a Hive table get stored?

743


What is meant by Transformation? Give some examples.

382


What are the different types of tables available in Hive?

766


Which modes can Hadoop be run in? List a few features for each mode?

540


When should you use explode in Hive?

832


How do you set a property in Apache Tajo?

41