Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is atom in pig?
Explain Features of Pig?
What is spark tool?
What is the difference between hadoop and other data processing tools?
What is Disk Balancer in Apache Hadoop?
What exactly kafka does?
Can you mention some features of spark?
How to keep files in HDFS?
How do you process big data with spark?
What is "GraphX" in Spark?
If the source data gets updated every now and then, how will you synchronize the data in hdfs that is imported by sqoop?
Elaborate kafka architecture?
hbase support syntax structure like sql. Yes or no?
Where can I get sample data to try?
How many JVMs run on a slave node?