Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is Flume?
Why is HDFS only suitable for large data sets and not the correct tool to use for many small files?
Explain Spark join() operation?
What language is apache spark?
How does master slave architecture in the hadoop?
What is the difference between dataset and dataframe in spark?
Explain the maximum size of a message that can be received by the Kafka?
When to use coalesce and repartition in spark?
While starting hadoop services, datanode service is not running?
Explain the Reducer's reduce phase?
What are barriers?
Hive new version supported Hadoop Versions ?
How to create an rdd?
Explain the usage of Context Object?
Name some companies that are already using Spark Streaming?