Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain the benefits of block transfer?
Explain REPEAT function in Hive with example?
What are the disadvantages of using Apache Spark over Hadoop MapReduce?
Why is sqoop is used?
How do I try impala out?
Do we need to place 2nd and 3rd data in rack 2 only?
Define primary key in Apache Cassandra?
Why do we need hdfs?
What operations does rdd support?
What do you mean by cassandra-cqlsh?
Which one is default?
Which is better hadoop or spark?
Explain the terms Spark Partitions and Partitioners?
What is difference between dataset and dataframe?
How does Apache Spark handles accumulated Metadata?