One creative solution I can think of (outside of clay) is to vibecode a puppeteer/playwright script; then feed the image to claude API for analysis into a .csv to be uploaded back to clay. Takes time so I’d do this only if it’s mission critical. There might also be better ways I can’t think of right now