Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Name the language supported by apache spark for developing big data applications?
What does rdd mean?
why should we use 'group' keyword in pig scripts?
Does google use spark?
how can you debug Hadoop code?
What happens if the preferred replica is not in the isr?
What should be the HDFS Block size to get maximum performance from Hadoop cluster?
State the usage of 'filters', 'group' , 'orderBy', 'distinct' keywords in pig scripts?
How is spark sql different from hql and sql?
Explain what does hbase consists of?
What is the Cassandra Coefficient ?
Mention how hadoop is different from other data processing tools?
Is it possible to add or delete column families in a working group?
Difference between Sqoop and Cassandra?
Why is spark good?