Morning everyone! Could you please tell me if it is possible to scrape the contents of this page? https://www.xoprivate.com/madifushi-private-island/ All URLs following the trailing / and the content isn't available on a directory so the help guide isn't helping 🤔
Yes, you can scrape content from pages and their sub-URLs even when they're not in a directory structure. You have several options: 1. Use the "Scrape Website" action in Clay to extract data from specific URLs. You can set parameters like scrape delay and specify output fields. 2. For more complex scenarios, try the Clay Chrome Extension with URL patterns. By replacing unique parts of URLs with variables (e.g., /:parameter), you can create recipes that work across similar pages with different URLs. 3. For advanced scraping needs, consider: • ScrapeMagic integration for structured data extraction • Zenrows for handling JavaScript-heavy sites or those with bot protection Which approach would work best depends on the specific website structure and content you're trying to scrape.
Hey Amy, thanks for reaching out. They have some sort of directory under "the masters" e.g. https://www.xoprivate.com/estate-masters/ Here’s how you can pull this data into a table: 1. Download the Instant Data Scraper Chrome extension. 2. Use the extension to extract the contact details from e.g. https://www.xoprivate.com/estate-masters/ . 3. Create a new table and upload CSV to format and enrich the data properly. 4. Use Claygent to pull any relevant info or scrape the full website with zenrows or our native scrape integration Let me know if this does the trick for you :)
Have a great one, Daniel
This thread was picked up by our in-app web widget and will no longer sync to Slack. If you are the original poster, you can continue this conversation by logging into https://app.clay.com and clicking "Support" in the sidebar. If you're not the original poster and require help from support, please post in 02 Support.