Data Science Interview Questions
Questions Answers Views Company eMail

What is the degree of freedom for lasso?

Adobe,

120

What are Type 1 and Type 2 errors ?

Citi Bank,

97

You are creating a report for user content uploads every month and observe a sudden increase in the number of upload for the month of November. The increase in uploads is particularly in image uploads. What do you think will be the cause for this and how will you test this sudden spike?

110

What are the factors used to produce "People You May Know" data product on LinkedIn?

LinkedIn,

104

You have a bag with 6 marbles. One marble is white. You reach the bag 100 times. After taking out a marble, it is placed back in the bag. What is the probability of drawing a white marble at least once?

Microsoft,

106

How will you handle missing data ?

IBM,

107

Check whether a given integer is a palindrome or not without converting it to a string.

Adobe,

151

Suppose that American Express has 1 million card members along with their transaction details. They also have 10,000 restaurants and 1000 food coupons. Suggest a method which can be used to pass the food coupons to users given that some users have already received the food coupons so far.

American Express,

95

You are given a training dataset of users that contain their demographic details, the pages on Facebook they have liked so far and results of psychology test based on their personality i.e. their openness to like FB pages or not. How will you predict the age, gender and other demographics of unseen data?

American Express,

106

Do you have some knowledge of R - analyse a given dataset in R?

Airbnb,

116

What will be your expected earnings with the two roll strategy?

260

Burn two ropes, one needs 60 minutes of time to burn and the other needs 30 minutes of time. How will you achieve this in 45 minutes of time ?

Citi Bank,

93

How can you build and test a metric to compare ranked list of TV shows or Movies for two Netflix users?

248

How do you take millions of users with 100's of transactions each, amongst 10000's of products and group the users together in a meaningful segments?

Apple,

170

How can you solve a problem that has no solution?

111


Post New Data Science Questions

Un-Answered Questions { Data Science }

What tools or devices help you succeed in your role as a data scientist?

133


Explain the method to collect and analyze data to use social media to predict the weather condition?

240


What are the various types of selection bias?

110


Can you write the formula to calculate r-square?

103


Can you define feature vector?

95






What is overfitting?

98


Create a program in a language of your choice to read a text file with various tweets. The output should be 2 text files-one that contains the list of all unique words among all tweets along with the count for repeated words and the second file should contain the medium number of unique words for all tweets.

135


How will you fix multi-colinearity in a regression model?

101


Suppose that American Express has 1 million card members along with their transaction details. They also have 10,000 restaurants and 1000 food coupons. Suggest a method which can be used to pass the food coupons to users given that some users have already received the food coupons so far.

95


Can you explain difference between data modeling and database design?

116


Explain data cleansing?

96


Name some kinds of graphs and explain how you would build them in python or r.

101


What is the difference between a bagged model and a boosted model?

129


List the differences between supervised and unsupervised learning.

112


Define data reduction?

82