You create a monthly report on user content uploads and notice a sudden increase in the number of uploads for November, mostly in image uploads. What do you think is causing it, and how would you test your explanation for the spike?
You have a bag with 6 marbles, one of which is white. You reach into the bag 100 times, and after each draw the marble is placed back in the bag. What is the probability of drawing the white marble at least once?
Suppose that American Express has 1 million card members along with their transaction details. It also has 10,000 restaurants and 1,000 food coupons. Suggest a method for distributing the food coupons to users, given that some users have already received food coupons.
You are given a training dataset of users that contains their demographic details, the Facebook pages they have liked so far, and the results of a psychology test of their personality, i.e. their openness to liking Facebook pages. How will you predict the age, gender, and other demographics for unseen data?
You have two ropes: one takes 60 minutes to burn and the other takes 30 minutes. How can you use them to measure 45 minutes?
How can you build and test a metric to compare the ranked lists of TV shows or movies for two Netflix users?
How do you take millions of users, each with hundreds of transactions across tens of thousands of products, and group the users into meaningful segments?
Why does data cleaning play a vital role in analysis?
What is the difference between a skewed distribution and a uniform distribution?
What is the advantage of performing dimensionality reduction before fitting an SVM?
How would you collect and analyze social media data to predict weather conditions?
When working on a data set, how do you select important variables?
A die is rolled twice. What is the probability that the second roll is a 6?
How many sorting algorithms are available?
Can you explain data munging or data wrangling?
Why is data cleaning essential in data science?
What is the importance of selection bias?
What would you add to Facebook and how would you pitch it and measure its success?
What are recommender systems?
Could you explain how to define the number of clusters in a clustering algorithm?
How will you prove that the square root of 2 is irrational?
Explain the difference between Supervised and Unsupervised Learning through examples.
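
For the marble question above, the reasoning can be checked with a few lines of Python. This is only a sketch, assuming each draw is independent and every marble is equally likely to be picked; the function name `prob_at_least_one` is made up for illustration. It uses the complement rule: the chance of at least one white marble is 1 minus the chance of missing it on every draw.

```python
from fractions import Fraction


def prob_at_least_one(p_single, draws):
    """Probability of at least one success in `draws` independent trials.

    Assumes each trial succeeds with the same probability `p_single`
    (here: drawing the one white marble out of six, with replacement).
    """
    # P(at least one) = 1 - P(no success in any trial)
    return 1 - (1 - p_single) ** draws


p_white = Fraction(1, 6)   # one white marble among six
p = prob_at_least_one(p_white, 100)
print(float(p))            # very close to 1
```

With 100 draws the result is almost exactly 1, since the chance of missing the white marble every time is (5/6)^100.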