Hey everyone, im trying to extract data from a website using clayagent. However i am struggling to find the right prompt. the websites are always built the same way, theoretically all information can be pulled from the same div class from the html. the target website looks like this. the expected output would be the website, e-mail and phone number from the right hand side (see screenshot). can somebody help me with the write prompt? (will be using 4o or 4o mini) 🙏
what's your current prompt?
Extract the following four pieces of information from the given webpage: 1. Website URL 2. Email Address 3. Telephone Number Rules: • Focus only on information near or within sections labeled with “Kontakt”, “Contact”, or near the company name/description. • Ignore any values found in the footer, header, or social media links (e.g., Twitter, LinkedIn). • Do not use any contact information from sections that include “Follow Us”, “Newsletter”, or generic terms like “Support”. • Prefer values located within the main body of the page rather than sidebars or peripheral elements. Return the output in this format: • Website: [URL] • Email: [Email Address] • Telefon: [Text]
_______ Actually, there might be a different issue, as the content on the target url has an age check, clayagent might not be able to access the data?
What's the current output with this?
Yes, if it has an age check it will not be able to proceed. The same happens with websites requiring logins.
it does work for some rows tho - super weird
is there a way to have clay click the ‘over 18’ cta?
Maybe the website doesn't age check prompt for the rows that have filled
So it could access the info
Try the same prompt in other websites without an age check
If it works, you know it's that. If it doesn't, you gotta refine your prompt
its alwayts the same website, its the exhibitor detail page from a conference page. so will always be the same url except for exhibitor id
Got you. Try on a different website without an age check prompt at all.
If Claygent doesn't give you a specific error, it might be because the prompt actioned that time
But it usually either returns something, or gives out an error
