Updates on building a powerful sourcing technology

professional profile

January 03, 2026

by a professional from University of Chicago in Flagler Beach, Florida, USA

Starting with Florida, I've cleaned up the state databases to cross join information on registered corporations, emails, trademarks, fictitious names, et al. This allows me to create a matching engine from web_content -> real llcs. The matching can get quite tricky, but I've started iterating over the the highest similarity matches. I'm optimizing my systems and compute infra but can rip through about 10,000 businesses per day. There will be added data cleaning time as I move this out to other states and databases. The goal is to then generate embedding text to map all of these businesses to a high-dimensional vector space for natural semantic search. All of this work is to support the tool I'm building out at https://try-buybox.com. If you're interested in using this, feel free to shoot me an email redacted or submit the beta registration form: https://try-buybox.com/beta Cheers!
0
0
33
Replies
0
Join the discussion