Interviewing for a role as a data scientist or analyst? You've come to the right place! Below we've curated a list of data science interview questions from multiple sources to help make your preparation easier.

For a data science or data analyst interview, the interviewer will ask a wide range of topics covering statistics, programming (Python and SQL), data modeling (including machine learning), and overall business acumen. This guide contains 70 data science interview questions, broken out by high-level topics. If you're interested in practicing further (and having the option to receive solutions), sign up for our email newsletter, where we send a few interview questions per week.

Statistics are the guiding principles for collecting, organizing, and interpreting data. Sounds pretty core to data science, huh? Below are some data science interview questions covering statistics. Be prepared to answer conditional probability questions and Bayesian probability questions.

What is the importance of the Central Limit Theorem?
What is sampling? Can you provide an example of a sampling method? Can you provide an example of a time in the past where you needed to use sampling?
What is Type I error and how is it different from Type II error?
How would you explain a linear regression to a non-technical person?
What are the assumptions of linear regression?
How would you explain a logistic regression to a non-technical person?
What are the assumptions of a logistic regression?
What are key factors in running a successful A/B test?
What is a confidence interval and how do you interpret it?
How would you estimate the disease probability in a city given the probability is very low nationwide?
If you have a monthly collection of time series data, how can you tell if there is a "significant" difference between this month and previous month data?
In any 15-minute interval, there is a 20% probability that you will see at least one shooting star. What is the probability that you see at least one shooting star in the period of an hour?
You have a 50-50 mixture of two normal distributions with the same standard deviation. How far apart do the means need to be in order for this distribution to be bimodal?
A couple tells you that they have two children, at least one of which is a girl. What is the probability that they have two girls?
What is the probability of rolling a 4 or 7 with two 6-sided dice?
On a dating site, users can select 1 out of 10 adjectives to describe themselves. A match is declared between two users if they match on at least 4 adjectives. Given a pool of 10 independent users, what is the probability that at least two will be a match?
You have two coins, one of which is fair and comes up heads with a probability 1/2, and the other which is biased and comes up heads with probability 3/4. You randomly pick a coin and flip it twice, and get heads both times. What is the probability that you picked the fair coin?

A lot of data science is interpreting large data sets and making sense of what is happening. Modeling can play a pivotal role in efficiently making sense of large, multidimensional data sets. Understanding the fundamentals of modeling data will be important for your data science interview.

Can you explain the process of developing an ML algorithm?
How would you effectively represent data with 5 dimensions?
How is kNN different from k-means clustering?
How would you create a logistic regression model?
What is precision and recall? How do they relate to the ROC curve?
Explain the difference between L1 and L2 regularization methods.
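
Several of the probability questions above come down to a few lines of arithmetic, so it can help to check your reasoning in code while you practice. The snippet below is a minimal sketch, not an official solution from this guide. The function names are made up for illustration, and each function states the assumption it relies on: independent 15-minute intervals for the shooting star question, "a 4 or 7" meaning the sum of the two dice, and each child being equally likely to be a girl or a boy.

```python
from fractions import Fraction
from itertools import product


def posterior_fair_coin(heads_seen=2):
    """P(fair coin | every flip was heads), via Bayes' rule.

    Assumption: each of the two coins is picked with probability 1/2.
    """
    prior = Fraction(1, 2)
    like_fair = Fraction(1, 2) ** heads_seen    # fair coin: P(heads) = 1/2
    like_biased = Fraction(3, 4) ** heads_seen  # biased coin: P(heads) = 3/4
    evidence = prior * like_fair + prior * like_biased
    return prior * like_fair / evidence


def at_least_one_star(p_interval=Fraction(1, 5), intervals=4):
    """P(at least one shooting star in an hour).

    Assumption: the four 15-minute intervals are independent.
    """
    return 1 - (1 - p_interval) ** intervals


def dice_sum_in(targets=(4, 7)):
    """P(sum of two six-sided dice is one of the targets).

    Assumption: "a 4 or 7" refers to the sum of the two dice.
    """
    rolls = list(product(range(1, 7), repeat=2))
    hits = [roll for roll in rolls if sum(roll) in targets]
    return Fraction(len(hits), len(rolls))


def two_girls_given_at_least_one():
    """P(two girls | at least one girl), by enumerating families.

    Assumption: each child is independently a girl or a boy
    with probability 1/2.
    """
    families = list(product("GB", repeat=2))
    with_girl = [f for f in families if "G" in f]
    both_girls = [f for f in with_girl if f == ("G", "G")]
    return Fraction(len(both_girls), len(with_girl))


if __name__ == "__main__":
    print("fair coin given two heads:", posterior_fair_coin())
    print("star within the hour:", at_least_one_star())
    print("dice sum of 4 or 7:", dice_sum_in())
    print("two girls given at least one:", two_girls_given_at_least_one())
```

Under those assumptions the four results are 4/13, 369/625 (about 0.59), 1/4, and 1/3. Using `Fraction` keeps the answers exact, which makes them easier to compare against a pen-and-paper derivation in an interview setting.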