Join 📚 Kevin's Highlights

A batch of the best highlights from what Kevin's read.

![](https://userimg-assets.customeriomail.com/images/client-env-99697/1687330856432_4%20uality%20work_01H3EC37M3HMSBTQ3ZHY9Z2TA6.png)

The Meditations Newsletter #034

Alex from Sunsama

9. What distribution is typically used to estimate the mean of a normally distributed population when its standard deviation is unknown?

Fifteen Questions to Test Your Statistics Knowledge

Keith McNulty

Here’s what the entity resolution query looks like:

![](https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97ec307e-6b10-46fd-8b6a-773f9f163875_1212x728.png)

I’m joining the table with itself on state + zipcode to reduce the search space and using string similarity thresholds for filtering potential duplicates. In entity resolution methodology this is known as “blocking.”

Fundamental Data Engineering Concepts - Part 2

Ergest Xheblati
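
The blocking idea in the entity resolution highlight above can be sketched in a few lines of Python. This is not the author's query, which appears only as an image. It is a minimal illustration that assumes hypothetical records with `name`, `state`, and `zipcode` fields, and it uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure with an arbitrary threshold of 0.8.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations

# Hypothetical sample records; the field names are assumptions for illustration.
records = [
    {"id": 1, "name": "Acme Hardware", "state": "CA", "zipcode": "94105"},
    {"id": 2, "name": "ACME Hardware Inc", "state": "CA", "zipcode": "94105"},
    {"id": 3, "name": "Bay Bakery", "state": "CA", "zipcode": "94105"},
    {"id": 4, "name": "Acme Hardware", "state": "NY", "zipcode": "10001"},
]


def similarity(a, b):
    """Case-insensitive similarity ratio in [0, 1] (stand-in metric)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidate_duplicates(rows, threshold=0.8):
    """Return id pairs that share a block and have similar names."""
    # Blocking: group rows by (state, zipcode) so that only rows in the
    # same group are ever compared, which shrinks the search space.
    blocks = defaultdict(list)
    for row in rows:
        blocks[(row["state"], row["zipcode"])].append(row)

    pairs = []
    for block in blocks.values():
        # Equivalent to a self-join restricted to the block key.
        for left, right in combinations(block, 2):
            if similarity(left["name"], right["name"]) >= threshold:
                pairs.append((left["id"], right["id"]))
    return pairs


print(candidate_duplicates(records))
```

Records 1 and 4 have identical names but are never compared, because they fall into different blocks. That is the trade-off blocking makes: far fewer comparisons, at the cost of missing duplicates whose blocking keys disagree.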

...catch up on these, and many more highlights