Clay Icon

Best Practices for Merging Data From Multiple Job Boards Efficiently

·
·

Hey there, I am looking to scrape data from 3 different jobboards into clay and then process them I wonder what is the best approach to merge data from 3 different sources ? Is this schema feels right to you ? How would you do it ? Explanations :

  • 1 source = 1 table in the workbook

  • the 3 "source tables" are pushed into one "merged table" and then my enrichment flow starts

  • WelcomeToTheJungle can only be accessed through apify, Indeed and LinkedIn can be accessed by clay or Apify, I wonder which solution is best ?

Constraints :

  • I want to get it refreshed on a weekly basis with the fewer manual actions

  • I want the whole flow that comes afterwards to be automated and streamlined

  • Avatar of Bo (.
    Bo (.
    ·
    ·

    Hi Louis! 👋 Your approach sounds solid: using one table per source and then merging them into a single table for further enrichment is a great way to organize the data flow. However, Clay’s refresh rates are set to daily, so it isn’t possible to control or schedule a weekly refresh directly within Clay. If you need more flexibility with refresh timing, I recommend using Apify data sent through a webhook. This way, you can control when Apify pushes the data to Clay. Here’s a helpful guide on setting up Apify webhooks: Apify Webhooks Documentation. Once the data is in Clay, you can automate updates and keep the entire flow streamlined using Clay’s auto-update feature. For a detailed walkthrough, you can check out this Loom video. This setup should help you automate everything with minimal manual effort. Let me know if you have any more questions or need further assistance! 😊🚀

  • Avatar of Louis M.
    Louis M.
    ·
    ·

    thanks for your answer, any tuto on how to use apify as data source via webhook and make sure everything is correctly refreshed ? also, how do i organize my "source tables" ? everything in a workbook ? 1 apify task -> 1 table, and then autoupdate to be written in another "main table" ?

  • Avatar of Bo (.
    Bo (.
    ·
    ·

    Thanks for your question! For using Apify as a data source via webhook and ensuring everything refreshes correctly, you can check out these helpful resources: 1. Apify Webhooks: Apify Webhook Guide — This will guide you on setting up webhooks with Apify. 2. Clay Webhooks: Clay Webhook and Inbound Lead Sources — This explains how to integrate and manage webhooks within Clay. Regarding how to organize your source tables: • It’s best to use one table per Apify task. This keeps things manageable and gives you better control over each dataset. • To auto-update and combine data into a main table, you can use the “Write to Table” feature in Clay: Write to Table Guide. This will allow you to transfer and organize data efficiently between tables. If you need any more help setting this up, just let me know! 😊

  • Avatar of Channeled
    Channeled
    APP
    ·
    ·

    We haven't heard back from you in a bit, so we're going to go ahead and close things out here - feel free to let us know if you still need something!

  • Avatar of Channeled
    Channeled
    APP
    ·
    ·

    Hi Louis M.! This thread was recently closed by our Support team. If you have a moment, please share your feedback: