Troubleshooting Web Scraping for Conference Data Extraction


I am trying to scrape this conference website to get the company name, website link, description, and booth information. I tried Clay’s Chrome extension but could never get it right: autodetect always misses data, and so does manual selector mode. I’ve also tried starting from a blank sheet and adding the Zenrows enrichment, but it does not accept the website link. Does anyone have ideas on how this might work? Here is the website: https://robobusinessdtwest2024.mapyourshow.com/8_0/explore/exhibitor-gallery.cfm?featured=false

  • Channeled (APP)

    👋 Hey there! Our support team has got your message - we'll be back in touch within 24 hours (often sooner!). If you haven't already, please include the URL of your table in this thread so that we can help you as quickly as possible!

  • Bo (.

    Hey, thanks for reaching out! The missing data may be due to companies enabling scrape-prevention measures or to how they format their HTML. Could you show me what you’ve managed to extract so far?

    If you’ve already pulled the first page with all the exhibitor booth numbers, I’d suggest adding a formula in your table to replace the booth ID. For example, given the following URL:

    https://robobusinessdtwest2024.mapyourshow.com/8_0/floorplan/?hallID=A&selectedBooth=booth~311

    you can use a prompt like this:

      • Example prompt: Replace the booth number with your booth ID column using a forward slash.

    If you’d like, feel free to send me your table, and I can assist with setting it up! 😊
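
    For anyone who wants to see the booth-ID substitution outside of a Clay formula, here is a minimal Python sketch of the same idea. It only assumes the URL pattern from the example link in this reply; the function name `build_booth_url` and the default hall ID `"A"` are hypothetical, and other halls or booth formats on the site may need different handling.

```python
from urllib.parse import quote

# Base path taken from the example floorplan URL in the thread.
FLOORPLAN_BASE = "https://robobusinessdtwest2024.mapyourshow.com/8_0/floorplan/"


def build_booth_url(booth_id, hall_id="A"):
    """Return a floorplan URL for one booth (hypothetical helper).

    Assumes every booth follows the `selectedBooth=booth~<id>` pattern
    seen in the example URL; this is not confirmed for the whole site.
    """
    booth = quote(str(booth_id).strip(), safe="")  # percent-encode unusual characters
    hall = quote(str(hall_id).strip(), safe="")
    return f"{FLOORPLAN_BASE}?hallID={hall}&selectedBooth=booth~{booth}"


# Example: booth IDs as they might appear in a table column.
booth_ids = ["311", " 402 ", 517]
for booth_id in booth_ids:
    print(build_booth_url(booth_id))
```

    The helper trims whitespace and accepts numbers or strings, since booth IDs copied from a scraped table are often inconsistently typed.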

  • Channeled (APP)

    This thread was picked up by our in-app web widget and will no longer sync to Slack. If you are the original poster, you can continue this conversation by logging into https://app.clay.com and clicking "Support" in the sidebar. If you're not the original poster and require help from support, please post in 02 Support.