adspace




Big Data Interview Questions
Questions Answers Views Company eMail

How many ways we can create rdd?

1 375

What does repartition do in spark?

1 420

What is the driver program in spark?

1 382

What is spark submit?

1 376

How do I clear my spark cache?

1 359

What is a partition in spark?

1 470

What is spark vectorization?

1 411

What is off heap memory in spark?

1 398

What is a tuple in spark?

1 364

Is spark an etl?

1 379

How is rdd distributed?

1 422

What are the common transformations in apache spark?

1 379

What is the difference between dataset and dataframe in spark?

1 462

What is distributed cache in spark?

1 438

What is catalyst framework in spark?

1 386


Un-Answered Questions { Big Data }

When we are using queries instead of scripting?

797


How you can contact your client everyday ?

1039


What is the latest version of sqoop?

80


What is meant by Transformation? Give some examples.

382


Where does the data of a Hive table gets stored?

744


When to use explode in Hive?

832


How to set property in apache tajo?

41


What is the roadmap for apache mahout version 1.0?

95


How can you set an arbitrary number of Reducers to be created for a job in Hadoop?

547


Can free-form SQL queries be used with Sqoop import command? If yes, then how can they be used?

73


How job tracker schedules an assignment?

496


How to skip header rows from a table in Hive?

900


Can We Change settings within Hive Session? If Yes, How?

854


What is the stable version of Hive ?

2694


What is the latest version of ambari that is available in the present market?

111