Explain BagToTuple.
What is clustering in Hive?
What is the connection between Hadoop and big data?
In a given Spark program, how will you identify whether a given operation is a transformation or an action?
What is the functionality of ObjectInspector?
What does the Spark engine do?
What is the difference between an “HDFS Block” and an “Input Split”?
Explain the different cluster managers in Apache Spark.
Explain the term paired RDD in Apache Spark.
What is the Avro format?
What is a lineage graph?
Can we run Spark without Hadoop?
How would you tackle counting words in several text documents?
What is a DataFrame in Spark?
What do you understand by Kundera?