Best Tools for Scraping Data from PDFs

Also, what's a good tool for scraping PDFs? I found a resource with a table listing company names, and it would be awesome if I could scrape it.

  • Bo (.

    For PDFs, you have two great options in Clay:
    1. Scrape Website tool - find it in the enrichment panel by clicking "add enrichment" at the top right.
    2. Claygent - can analyze and extract data from PDFs by answering specific questions.
    Does that help?

  • Bruce W.

    Doesn't Claygent run on each row in the table, though? In this case, I don't have any existing data yet.

  • Arturo O.

    Hey Bruce, thanks for the follow-up context! Jumping in here to help a bit. To clarify: do you have a PDF file containing a list of company names that you'd like to transfer into a table and then enrich? Or are you trying to find PDF URLs that contain specific words or sit under a specific site, so you can then scrape each one?

  • Channeled (APP)

    This thread was picked up by our in-app web widget and will no longer sync to Slack. If you are the original poster, you can continue this conversation by logging into https://app.clay.com and clicking "Support" in the sidebar. If you're not the original poster and require help from support, please post in 02 Support.

  • Bruce W.

    Yes, it's the former, Arturo.