adspace




Big Data Interview Questions
Questions Answers Views Company eMail

What are shared variables in spark?

1 400

What is the future of apache spark?

1 363

How can I improve my spark performance?

1 381

What is apache spark architecture?

1 408

Why spark is faster than hive?

1 410

What happens if rdd partition is lost due to worker node failure?

1 550

What is pair rdd in spark?

1 369

What is difference between cache and persist in spark?

1 374

Is bigger than spark driver maxresultsize?

1 383

Does spark use java?

1 420

How do you process big data with spark?

1 374

What is a spark shuffle?

1 429

Why do we need apache spark?

1 380

How do I optimize my spark code?

1 401

What is the difference between client mode and cluster mode in spark?

1 412


Un-Answered Questions { Big Data }

How to set property in apache tajo?

43


How to skip header rows from a table in Hive?

906


What is a Hive variable? What for we use it?

880


How you can contact your client everyday ?

1042


Can free-form SQL queries be used with Sqoop import command? If yes, then how can they be used?

73


Explain how RDDs work with Scala in Spark

411


Where does the data of a Hive table gets stored?

750


What is meant by Transformation? Give some examples.

385


Can We Change settings within Hive Session? If Yes, How?

859


How do I set up flume agent?

144


What is the roadmap for apache driver version one.0?

80


What is the stable version of Hive ?

2695


What is the latest version of ambari that is available in the present market?

112


State some DDL Command with brief Description?

60


How can I delete the above index named index_bonuspay?

807