Suggestions for Scraping PDFs to Extract Council Data

Any suggestions for scraping a PDF? (link to PDF) I'm trying to get all 317 councils as rows on Clay.

  • kushagra

    Feed it into Gemini, ask for a table, then import that into Clay?

  • Owen L.

    Thanks kushagra. I did this in ChatGPT, and it bugged out, but I will try Gemini now.

  • kushagra

    Yeah, Gemini has a very large context window and is apparently great at PDF extraction.

  • kushagra

    Here you go. Please check it for accuracy: I got this in 2 minutes and shared it without checking.

  • Owen L.

    Awesome. Thanks for this. I tried it myself and got this error. What did you do differently?

  • kushagra

    Used AI Studio. The chat version of Gemini is not great.

  • Owen L.

    Thanks kushagra - I appreciate your input and help 🙏

  • kushagra

    Glad to help, Owen.
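
For anyone following the same route (ask the model for a table, check it, then import it into Clay), here is a rough Python sketch of the checking step. It assumes the model returned a pipe-delimited markdown table and that the first column holds the council name. Both are assumptions, as are the function names and the placeholder sample rows; none of this comes from the thread. Only the expected count of 317 is taken from the original question. It does not call Gemini or Clay; it only turns the pasted table into CSV text and flags obvious problems.

```python
import csv
import io

EXPECTED_ROWS = 317  # number of councils mentioned in the original question


def markdown_table_to_rows(text):
    """Parse a pipe-delimited markdown table into a list of rows (header first)."""
    rows = []
    for line in text.splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # skip any prose the model wrote around the table
        inner = line[1:-1] if line.endswith("|") else line[1:]
        cells = [cell.strip() for cell in inner.split("|")]
        # Skip the header separator row, e.g. |---|:---:|
        if all(cell and set(cell) <= set("-:") for cell in cells):
            continue
        rows.append(cells)
    return rows


def rows_to_csv(rows):
    """Serialise rows as CSV text, ready to save and upload."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


def check_rows(rows, expected=EXPECTED_ROWS):
    """Return a list of human-readable problems found in the parsed table."""
    if not rows:
        return ["no table found"]
    problems = []
    header, data = rows[0], rows[1:]
    if len(data) != expected:
        problems.append(f"expected {expected} rows, found {len(data)}")
    seen = set()
    for number, row in enumerate(data, start=1):
        if len(row) != len(header):
            problems.append(
                f"row {number}: {len(row)} cells, header has {len(header)}"
            )
        key = row[0].lower()  # assumption: first column is the council name
        if not key:
            problems.append(f"row {number}: first column is empty")
        elif key in seen:
            problems.append(f"row {number}: duplicate '{row[0]}'")
        seen.add(key)
    return problems


if __name__ == "__main__":
    # Placeholder data for illustration only, not taken from the PDF.
    sample = """Here is the table:
| Council | Region |
|---|---|
| Council A | North |
| Council B | South |
| Council A | North |
"""
    rows = markdown_table_to_rows(sample)
    print(rows_to_csv(rows))
    print(check_rows(rows, expected=3))
```

The parser is deliberately simple and will split on pipes that appear inside a cell, so it is worth spot-checking a few rows against the PDF by hand as well.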