Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How to copy a file into HDFS with a different block size to that of existing block size configuration?
Explain how do ‘map’ and ‘reduce’ works?
Discuss writeahead logging in Apache Spark Streaming?
What is the use of exists command?
What are file permissions in HDFS? how does HDFS check permissions for files/directory?
What operations does rdd support?
Can you use spark to access and analyze data stored in cassandra databases?
What are common spark ecosystems?
How many compaction types are in HBase?
What do you mean by Schema Resolution?
What is mapper in map reduce?
Name the most common Input Formats defined in Hadoop? Which one is default?
What is structured data?
Explain first() operation in Apache Spark?
What is a local repository and when it is useful while using ambari environment?