Big Data Interview Questions
Questions Answers Views Company eMail

What is the difference between Hive and HDFS?

380

What is skewed data in Hive?

426

Is Kafka an ETL tool?

259

What language is Apache Kafka written in?

280

What is a ZooKeeper server?

1

What is the difference between map and reduce?

342

What is the optimal size of a file for the distributed cache?

369

What can skew the mean?

187

What is vectorized query execution?

215

What is a map-side join?

185

What does DAG stand for?

199

What is a data ingestion pipeline?

184

What is the difference between reduceByKey and groupByKey?

201

What is data skew and how do you fix it?

211

Is Databricks a database?

212


Un-Answered Questions { Big Data }

What can I do with my M&S Sparks points?

190


Explain the partitioning, shuffle, and sort phases in MapReduce.

489


What is rack awareness?

223


Which databases are supported by Hive?

400


What is setMaster in Spark?

184






What are the prime features of Apache ZooKeeper?

1


We already have SQL, so why NoSQL?

230


What are the important differences between Apache and Hadoop?

393


Define the term column families.

63


What is Hadoop MapReduce?

535


How many types of NoSQL databases are there?

85


What are some Apache Pig use cases you can think of?

287


Describe the memtable.

53


What is the difference between SQL and NoSQL?

656


How does Hadoop’s CLASSPATH play a vital role in starting or stopping Hadoop daemons?

215