Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are tools available to send the streaming data to hdfs?
How do I start a spark master?
What are the different catalog tables in hbase?
Define taskinstance?
What is the way of creating Avro Schemas?
How can client interact with Hive?
Explain the composite key?
What does apache mahout do?
What are the befefits of nosql over relational database?
How would you pipeline large amounts of data?
Is it possible to provide multiple inputs to hadoop? If yes, explain.
What is the history of apache mahout? When did it start?
How does spark work with python?
Give some important features of SPM?
What is the use of "order by" in Hive?