Join 📚 Kevin's Highlights
A batch of the best highlights from what Kevin's read, .
Here’s what the entity resolution query looks like:

I’m joining the table with itself on state + zipcode
to reduce the search space and
using string similarity thresholds
for filtering potential duplicates.
In entity resolution methodology this is known as “blocking.”
Fundamental Data Engineering Concepts - Part 2
Ergest Xheblati

Converting CSV to Excel With Python
Mike Driscoll
You don’t have to know how to code to participate, though! There are many different ways to participate, including:
• Writing a blog post or social media post highlighting an open source project that you find interesting
• Exploring documentation
• Triaging issues
• Writing issues
• Attending an event that focuses on open source
• Exploring open source projects to contribute to
How to Join the #100daysofOSS Challenge and Embrace the Power of Open Source
BekahHW
...catch up on these, and many more highlights