Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Describe join() operation. How is outer join supported?
Does impala performance improve as it is deployed to more hosts in a cluster in much the same way that hadoop performance does?
What are the default configuration files that are used in hadoop?
Can a spark cause a fire?
what is Speculative Execution?
What is column families? What happens if you alter the block size of ColumnFamily on an already populated database?
How is rdd distributed?
What are the stable versions of Hadoop?
What is the command for archiving a group of files in hdfs.
How to change the column data type in hive? Explain rlike in hive.
Explain the wordcount implementation via hadoop framework ?
What is unstructured data?
What is the process of changing the split size if there is limited storage space on Commodity Hardware?
Explain the different types of repairs.
How kafka communicate with clients and servers?