Is Kafka open source?
What are the downsides of Spark?
What is the difference between a MapReduce InputSplit and an HDFS block?
Why do we need Apache Pig?
How do you run a file system check in HDFS?
What are the main configuration parameters that a user needs to specify to run a MapReduce job?
What does ISR stand for in a Kafka environment?
What are durable writes?
What is the difference between Cassandra and Hadoop?
What does the file hadoop-metrics.properties do?
What is Thrift?
Why do we need password-less SSH in a fully distributed environment?
Why is Spark faster than Hive?
Which database does the Sqoop metastore run on?
What are the different methods to run Spark over Apache Hadoop?