Hey, team! 👋 Looking for some help from this group for reliable ways in Clay to fetch and read web-sourced PDFs at scale in order to analyze/summarize/extract key fields. We're starting by first discovering the URL of the PDF via GPT5.1 (has been the best results so far -- would love anyone's thoughts on using a lighter-weight model for this) which works, but agents often fail to open the actual PDF from a direct URL. We much prefer Clay-first solutions, while open to minimal external helper/API if there’s a proven pattern.
More details in 🧵 . PS: please redirect me to a better channel for this if I'm barking up the wrong tree! 