What operations does an RDD support?
What is the Hadoop Distributed File System (HDFS)?
What is Thrift?
Is Hadoop the future?
While writing an evaluate UDF, which method has to be overridden?
What are the main components of Spark?
Explain the use of the .media class?
What kind of data warehouse application is suitable for Hive?
How can you debug Hadoop code?
What happens if one Hadoop client renames a file, or a directory containing that file, while another client is still writing to it?
What is Apache Spark good for?
MapReduce jobs take too long. What can be done to improve the performance of the cluster?
What are the problems with Hadoop 1.0?
What are the various libraries available on top of Apache Spark?
What is erasure coding in Apache Hadoop?