Why data cleaning plays a vital role in the analysis?
What are outlier values?
You are given two tables- friend_request and request_accepted. Friend_request contains requester_id, time and sent_to_id and request_accepted table contains time, acceptor_id and requestor_id. How will you determine the overall acceptance rate of requests?
In a given day, how many birthday posts occur on Facebook?
Can you enumerate the various differences between supervised and unsupervised learning?
What are the differences between data science, machine learning, and artificial intelligence?
What is the difference between a bagged model and a boosted model?
What are the factors used to produce "People You May Know" data product on LinkedIn?
List out the libraries in python used for data analysis and scientific computations?
What is the differences between univariate, bivariate and multivariate analysis?
why do you need to perform resampling?
What is the best programming language to use in data science?
How can you compute an inverse matrix faster by playing with some computation tricks?
What is a generating function?
How can bogus Facebook accounts be detected?