Big Data Interview Questions
Questions Answers Views Company eMail

how Hadoop is different from other data processing tools?

376

List the configuration parameters that have to be specified when running a MapReduce job.

358

what is a sequence file in Hadoop?

403

What are the key differences between Pig vs MapReduce?

350

When is it not recommended to use MapReduce paradigm for large

360

what happens in textinformat ?

391

Explain the differences between a combiner and reducer

377

What is a TaskInstance?

393

What is the default input type in MapReduce?

381

Which directory does hadoop install to?

264

Why use hadoop?

236

What are the actions followed by hadoop?

236

What kind of hardware is best for hadoop?

225

Where are hadoop’s configuration files located and list them?

220

How jobtracker assign tasks to the tasktracker?

230


Un-Answered Questions { Big Data }

What are the data components used by Hadoop?

420


What is map in apache spark?

184


Explain the need for MapReduce while programming in Apache Pig?

544


Why is cqlsh used?

108


Tell something about the query language used in Cassandra Database?

52






What is Cassandra-CQL collection?

49


How do you integrate spark and hive?

198


Explain the lookup() operation in Spark?

149


How can we change the split size if our commodity hardware has less storage space?

377


Can you explain how it is different from doing machine learning in r or sas?

32


What is SSTable?

53


What is Flatten and what it do in PIG?

332


What is difference between hive and spark?

186


Explain Alter Table Statement in HCatalog?

5


What is Sqoop?

5