Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
655Write a Hive UDF that returns a sentiment score. For example, if good = 1, bad = -1, and average = 0, then a review of a restaurant states "Good food, bad service," your score might be 1 - 1 = 0.
649Suppose that your data is stored in collections, for instance, some binary data, message data or metadata is all keyed on the same value. Will you use HBase for this?
157
What are the functionalities of jobtracer?
How to start hbase services?
What are the port numbers of task tracker?
What is the relationship between Jobs and Tasks in Hadoop?
Explain Spark saveAsTextFile() operation?
When was spark introduced?
What do you know about transformations in spark?
Can you explain hadoop streaming?
Explain InputSplit in Hadoop MapReduce?
How to do ‘map’ and ‘reduce’ works?
Big Data Engineer Can you explain what REST is?
What do you mean by the High Availability of a NameNode in Hadoop HDFS?
Explain is it possible to search for files using wildcards?
Why do we need buckets?
Why big data?