I work on a scoring lead table. I have a weekly synchronized table with regular updates for all users. Example : xxx@xxx.com / webinar X / File X xxx@xxx.com / webinar Y / File X I want to have only one line by user (using email address) and then compute all the other columns to get a score by column for each user. Example : xxx@xxx.com / 2 webinars / 2 file X What are the steps to get there? Not sure if my request is clear ...
Here's how to consolidate your lead table by email and compute scores:
Step 1: Enable Auto-dedupe
Click the Auto-dedupe icon at the bottom right of your table, then toggle on "Enable automatic deduplication" and select your email column as the dedupe column. [1]
Step 2: Create scoring formulas
Step 3: Set up aggregation formulas
In each formula column, reference your email column using "/" and specify what you want to count. For example, "Count how many times this email appears with webinar data" or "Sum all file interactions for this email."
The auto-dedupe will keep only one row per email while your formulas calculate the totals across all original entries for that user.
Was that helpful?
If I auto-dedupe before creating formulas, isn’t it going to delete the rows before they were counted and aggregated?
You're absolutely right to be concerned! Auto-dedupe removes duplicate rows immediately, which would delete your data before you can count it.
Here's the correct approach:
First create your aggregation formulas while all duplicate rows still exist. Use formulas to count webinars, files, etc. for each email address across all rows.
Then enable auto-dedupe on the email column. This will keep only one row per email but preserve your calculated scores in the formula columns. [1]
The key is doing the counting/aggregation first, then deduplicating second so you don't lose the data you need to calculate from.
Is that what you were looking for?
This will keep only one row per email but preserve your calculated scores in the formula columns. Are you sure about that? If I calculate and then dedupe, isn’t going to recalculate all my formulas with the remaining row only?
Good question. When you enable auto-dedupe after creating your aggregation formulas, the formulas should still reference all original rows for each email, not just the deduped row. This way, the scores remain accurate. Could you share more about how your formulas are set up? That will help me understand your situation better.
If you still need help with setting up your formulas and deduplication correctly, I’m here to assist. Would you like to provide more details about your current formula setup so we can continue troubleshooting?
is it possible talking to someone?
I understand you'd like to speak with a human agent, so I’m routing you to our team now for further assistance.
Clay Support is sharing a file with you.
Our support team has got your message and we'll get back to you soon!
If you’re dealing with a specific table, drop the URL below so we can help you quicker. Otherwise, someone from our team will be in touch soon!
Hi Léa - thanks so much for reaching out and happy to help here.
I believe that a good approach here would be to aggregate first, then dedupe. You can keep all rows and use formulas like COUNTIF or SUMIF to total webinars, file downloads, or scores for each email across the entire dataset. Once those totals are displayed for every user, you can dedupe the rows.
Hope that helps.
thanks for your answer
can I please share my table so you can have a look maybe and make sure it will work?
My problem is that on clay the only way to countif or sumif is the enrichment lookup multiple rows if I’m not mistaken? Whereas I need to get the number of webinars occurrences for a same email address. It would look more like a pivot table on spreadsheet
.png)