Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Which is the reliable channel in Flume to ensure that there is no data loss?
what is ODBC and JDBC connectivity in Hive?
What is a rack awareness algorithm and why is it used in hadoop?
Explain partitions?
Is sqoop an etl tool?
How to save RDD?
Explain about the partitioning, shuffle and sort phase in MapReduce?
What is Pig Statistics? What are all stats classes in the Java API package available?
What is spark vcores?
How can you connect an application
On which port does ssh work?
Can hive run without hadoop?
What is spark databricks?
Explain slot in Hadoop Map-Reduce v1?
How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?