Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How to submit extra files(jars, static files) for Hadoop MapReduce job during runtime?
How to Delete file from HDFS?
Define various running modes of apache spark?
Is it possible to create cartesian join between 2 tables, using hive?
Is client the end user in HDFS?
What is Reducer in Hadoop?
What is the work of hive/hcatalog?
Wherever (Different Directory) I run the hive query, it creates new metastore_db, please explain the reason for it?
I have a row or key cache hit rate of 0.XX123456789 reported by JMX. Is that XX% or 0.XX% ?
What is active and passive NameNode in HDFS?
Why do we use ‘filters’ Pig scripts?
What is spark etl?
What is a block in Hadoop HDFS? What should be the block size to get optimum performance from the Hadoop cluster?
What do you understand by snitches?
How kafka communicate with clients and servers?