Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Are spark dataframes distributed?
Clarify Memtable?
In hbase what is column families?
What are the functions of presto?
What is the difference betwaeen mapreduce engine and hdfs cluster?
What is the importance of dfs.namenode.name.dir in HDFS?
Can the name of a view be same as the name of a hive table?
Why is spark fast?
What are combiners? When should I use a combiner in my MapReduce Job?
How data or file is read in HDFS?
What problems have you faced when you are working on Hadoop code?
What is meant by streaming access?
Why big data?
Differentiate between the terms: node, a cluster, and data center in cassandra?
When to use Avro, explain?