Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...



Hadoop Interview Questions
Questions Answers Views Company eMail

Explain a simple Map/Reduce problem.

Capital One,

723

Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

Twitter,

748

How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?

Twitter,

694

Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then a review of a restaurant states "Good food, bad service," your score might be 1 - 1 = 0.

LinkedIn,

687

Explain how RDDs work with Scala in Spark

Capital One,

341

Define HRegionServer in HBase

157

What is the use of shutdown command?

210

What is HBase HMaster?

360

What is the function of HMaster?

255

Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?

158

Explian the Limitations of HBase?

187

State some applications of HBase?

167

What is a Column family in hbase?

196

Discuss about the different tombstone markers used for deletion purposes in HBase.?

185

Explain the Scope operators used in hbase?

182


Un-Answered Questions { Hadoop }

What is a combiner in hadoop?

510


Difference between groupByKey vs reduceByKey in Apache Spark?

470


How is Apache Spark better than Hadoop?

300


Can spark work without hadoop?

331


What is a speculative execution in Apache Hadoop MapReduce?

793


What is hbase fsck?

214


Who is the founder of spark?

312


The difference between GROUP and COGROUP operators in Pig?

568


What is Clustring in Hive?

741


Where is spark rdd?

272


What are the various advantages of DataFrame over RDD in Apache Spark?

309


Explain Multi-tenancy?

558


Why is cqlsh used?

135


What happens to zk sessions while the cluster is down?

1


Explain the memtable in cassandra?

72