Join 📚 Kevin's Highlights

A batch of the best highlights from what Kevin's read, .

Here’s what the entity resolution query looks like: ![](https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F97ec307e-6b10-46fd-8b6a-773f9f163875_1212x728.png) I’m joining the table with itself on state + zipcode to reduce the search space and using string similarity thresholds for filtering potential duplicates. In entity resolution methodology this is known as “blocking.”

Fundamental Data Engineering Concepts - Part 2

Ergest Xheblati

![](https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2262bc1-f3bb-4171-b3d1-060958f0d4a3_1412x1310.png)

Converting CSV to Excel With Python

Mike Driscoll

You don’t have to know how to code to participate, though! There are many different ways to participate, including: • Writing a blog post or social media post highlighting an open source project that you find interesting • Exploring documentation • Triaging issues • Writing issues • Attending an event that focuses on open source • Exploring open source projects to contribute to

How to Join the #100daysofOSS Challenge and Embrace the Power of Open Source

BekahHW

...catch up on these, and many more highlights