What is logistic and linear regression? How do you treat multicollinearity and heteroscedasticity in regression? Name some packages in r and python for building regression models.
311What is nosql? Name some examples of nosql databases. What is a key value store? What is column storage? What is a document database?
297What is the central limit theorem? How is a normal distribution different from chi square distribution?
352Post New Data Science Questions
What are the key elements of the statistical graphics?
Can you cite some examples where a false negative important than a false positive?
Explain how to define the number of clusters in a clustering algorithm?
Please explain the goal of a/b testing.
What are various steps involved in an analytics project?
A box has 12 red cards and 12 black cards. Another box has 24 red cards and 24 black cards. You want to draw two cards at random from one of the two boxes, which box has a higher probability of getting cards of same colour and why?
Can you explain cross-validation?
What is a z test?
Explain how to write a table to a file?
How would you create a taxonomy to identify key customer trends in unstructured data?
In a given day, how many birthday posts occur on Facebook?
Implement a sorting algorithm for a numerical dataset in Python.
what the aim of conducting a/b testing?
Why is DBSCAN required?
Could you draw a comparison between overfitting and underfitting?