Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What are the challenges Of Distributed Applications?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
What are the primitive data types in Pig?
What are the different operational commands in HBase at record level and table level?
How to create an rdd?
Can you join multiple fields in Apache
What is a IdentityMapper and IdentityReducer in MapReduce ?
What is data ingestion pipeline?
Explain about the indexing process in hdfs?
What is Hive query processor?
What is identity mapper and reducer? In which cases can we use them?
Define HRegionServer in HBase
What is a bookkeeper client in bookkeeper?
It can be possible that a Job has 0 reducers?
Which language is more suitable for text analytics? R or python?