Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How to enable recycle bin in hadoop?
Explain the repartition() operation in Spark?
What is Derby database?
Explain the commit log?
Explain about the different types of trformations on dstreams?
Clarify how ordering in hdfs is finished?
Define speculative execution?
Explain how cassandra writes changed data into commitlog?
Explain write ahead log(journaling) in spark?
What is distributed copy (distcp)?
What are the limitations of Spark?
How can one write custom record reader?
What does job conf class do?
Explain the lookup() operation in Spark?
Define replication factor?