Data Science Interview Questions
Questions Answers Views Company eMail

What will you do if removing missing values from a dataset cause bias?

Airbnb,

158

How can you reduce bias in a given data set?

Airbnb,

168

How will you impute missing information in a dataset?

Airbnb,

155

Estimate the probability of a disease in a particular city given that the probability of the disease on a national level is low.

Amazon,

164

How will inspect missing data and when are they important for your analysis?

Amazon,

195

How will you decide whether a customer will buy a product today or not given the income of the customer, location where the customer lives, profession and gender? Define a machine learning algorithm for this.

Amazon,

142

From a long sorted list and a short 4 element sorted list, which algorithm will you use to search the long sorted list for 4 elements.

Amazon,

160

How can you compare a neural network that has one layer, one input and output to a logistic regression model?

Amazon,

148

How do you treat colinearity?

Amazon,

144

How will you deal with unbalanced data where the ratio of negative and positive is huge?

Amazon,

143

What is the difference between Stack and Queue

Amazon,

178

What is the difference between Linkedin and Array

Amazon,

156

You are about to get on a plane to Seattle, you want to know whether you have to bring an umbrella or not. You call three of your random friends and as each one of them if it's raining. The probability that your friend is telling the truth is 2/3 and the probability that they are playing a prank on you by lying is 1/3. If all 3 of them tell that it is raining, then what is the probability that it is actually raining in Seattle.

Facebook,

151

You have been given the data on Facebook user's friending or defriending each other. How will you determine whether a given pair of Facebook users are friends or not?

Facebook,

167

Estimate the number of square feet pizza's eaten in US each year.

Goldman Sachs,

160


Post New Data Science Questions

Un-Answered Questions { Data Science }

Give some situations where you will use an SVM over a RandomForest Machine Learning algorithm and vice-versa?

281


You own a clothing enterprise and want to improve your place in the market. How will you do it from the ground level ?

141


You have a bag with 6 marbles. One marble is white. You reach the bag 100 times. After taking out a marble, it is placed back in the bag. What is the probability of drawing a white marble at least once?

158


Which companies participating in Insight would you be interested in working for?

184


How do data scientists code in r?

136






Find out K most frequent numbers from a given stream of numbers on the fly.

139


Explain the two main components of the hadoop framework?

119


What is the job of analytics?

154


Describe the various steps involved while carrying out an analytical project?

168


Explain k-mean?

132


State the difference between the expected value and mean value?

139


How would you do market basket analysis in r and python?

164


What are outlier values and how do you treat them?

135


What are the types of business decisions?

144


What is interpolation and extrapolation?

147