Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How do I change hive execution engine to spark?
What is the function of mapreduce partitioner?
How do you define "block" in HDFS?
Define partitioning key?
What is apache spark engine?
What do you mean by schema on reading?
What apache spark is used for?
Explain the lookup() operation in Spark?
How data or file is written into HDFS?
Can you explain accumulators in apache spark?
How to handle record boundaries in Text files or Sequence files in MapReduce InputSplits?
how is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
What is driver and executor in spark?
What is python stress test in cassandra?
How to identify that the given operation is transformation or action?