The background cleaning tools grew with tools to draw straight lines (good to capture thin bird legs), and even something to make edges magnetic. I thought about switching to an established photo-editing tool, vibecoding an interface straight into the web app worked fine, and was convenient for the rest of the workflow.
Embeddings
I embedded the bird images using the image foundational models CLIP and BioCLIP (the CLIP objective trained on photos of birds and other lifeforms). This gives me a location in space that is relevant semantically.
So for example, if I took a list of bird names and embedded it with BioCLIP, and then did the same thing on these images of chipping sparrows, ideally they'd show up near each other, and I get a way to automatically label my images with the bird species. Nice!