What configuration parameters are required to run a MapReduce job?
How many layers of Hadoop components are supported by Apache Ambari and what are they?
What are some use cases of Apache Mahout?
What is streaming in Hadoop?
What platform and Java version are required to run Hadoop?
What are Apache Spark, Flume, Lucene, Hama, HCatalog, Mahout, Drill, Crunch and Thrift?
How do you use Hive from the command line and from Beeline?
If a Hadoop administrator needs to make a change, which configuration file needs to be changed?
What is a cluster in Cassandra?
How do you enable the trash (recycle bin) in Hadoop?
Can multiple clients write to the same HDFS file simultaneously?
What is Spark Core?
How is file splitting invoked in Hadoop?
Can you explain Spark Core?
What is the difference between Hive CLI and Beeline?