What are the key elements in ZooKeeper architecture?
Why is Spark good?
How does Impala compare to Hive and Pig?
What is the difference between a primary key and a partition key in Cassandra?
Why do I have to use REFRESH and INVALIDATE METADATA, and what do they do?
How can an application connect to Hive when it runs as a server?
What are the benefits of Spark lazy evaluation?
What daemons run on master nodes?
Can we change the data type of a column in a Hive table?
What mode(s) can Hadoop code be run in?
What is the usefulness of the options file in Sqoop?
What is the role of the Secondary NameNode?
Why does HDFS store data on commodity hardware despite the higher chance of failures?
What do you understand by compaction?
What do you understand by composite type?