Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Why is spark used?
How to specify more than one path for storage in Hadoop?
what is gossip protocol?
What are the various InputFormats in Hadoop?
What are the main classes of Data Transfer API?
How many InputSplits will be made by hadoop framework?
Why would nosql be better than using a sql database? And how much better is it?
How Mapper is instantiated in a running job?
What is a difference between an input split and hdfs block?
What is pig properties?
Is spark a special attack?
What is the disadvantage of spark sql?
State the usage of 'filters', 'group' , 'orderBy', 'distinct' keywords in pig scripts?
What is the use of spark sql?
How is transformation on rdd different from action?