Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What are the Hadoop features extended to its eco-system components ?
Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?
Is it necessary to kill the topology while updating the running topology?
What do you mean by “data centre” in cassandra?
Is spark difficult to learn?
What can skew the mean?
If datanodes increase, then do we need to upgrade namenode?
What is the difference between hadoop and other data processing tools?
How to handle record boundaries in Text files or Sequence files in MapReduce InputSplits?
How does spark work with python?
Clarify how ordering in hdfs is finished?
What do you mean by Free Form Import in Sqoop?
How is the distance between two nodes defined in Hadoop?
What kind of datawarehouse application is suitable for Hive?
When should we use SORT BY instead of ORDER BY?