Why do we use HDFS for applications with large data sets, but not when there are a lot of small files?
Explain the process of spilling in Hadoop MapReduce.
Which directory does Hadoop install to?
What are the common mistakes developers make when running Spark applications?
How many types of NoSQL databases are there?
How can you use the AdminClient API?
What is the difference between cache and persist in Spark?
What are the four V's of big data?
Can Hadoop replace a relational database?
How can we see all the clusters that are available in Ambari?
What problem does Apache Flume solve?
What is Amazon Spark?
How is data split in Hadoop?
What is the use of Cassandra CQL collections?
Explain ACID transactions in Hive.
What is the default level of parallelism in Apache Spark?