SplitOut Automate

Used in SplitOut, this automated workflow scrapes emails from specified websites via a webhook, removes duplicates, and responds with the collected data. It streamlines email extraction, ensuring clean and unique results, enhancing efficiency in data collection processes.

7/8/2025
10 nodes
Medium
webhookmediumsplitoutremoveduplicatesrespondtowebhooksticky noteintegrationapilogicconditional
Categories:
Webhook TriggeredMedium Workflow
Integrations:
SplitOutRemoveDuplicatesRespondToWebhookSticky Note

Target Audience

This workflow is ideal for:
- Marketers looking to gather email addresses from various websites for outreach campaigns.
- Developers who need a quick solution to scrape emails from web pages using APIs.
- Businesses aiming to enhance their email lists for newsletters or promotional purposes.
- Data Analysts who require a systematic approach to collect and analyze email data from multiple sources.

Problem Solved

This workflow addresses the challenge of email scraping from websites efficiently. It automates the process of fetching email addresses from specified URLs, ensuring that users can easily gather leads without manual effort. By integrating with webhooks, it allows for real-time data collection and minimizes the risk of duplicates, thereby enhancing the quality of the email list.

Workflow Steps

  • Webhook Trigger: The workflow begins with a webhook that listens for incoming requests. Users can trigger this by sending a query with the website URL.
    2. Get Website Data: Once the webhook is triggered, it makes an HTTP request to the specified website to retrieve its content.
    3. Extract Emails: The workflow then processes the website data to extract any email addresses found using a regex pattern.
    4. Conditional Check: It checks if any emails were extracted. If no emails are found, the workflow can gracefully handle this scenario.
    5. Remove Duplicates: For any extracted emails, the workflow removes duplicates to ensure that the final list is clean.
    6. Respond to Webhook: Finally, the workflow responds back to the webhook with the results, providing either the list of unique emails or a message indicating no emails were found.
  • Customization Guide

    Users can customize this workflow in several ways:
    - Webhook URL: Change the webhook ID and path to create a unique endpoint for your application.
    - Email Extraction Logic: Modify the regex pattern in the 'Extract the emails found' step to cater to specific email formats or to include/exclude certain domains.
    - Data Processing: Add additional nodes to process the extracted emails further, such as sending them to a CRM or an email marketing tool.
    - Error Handling: Adjust the error handling settings in nodes to ensure that failures are managed according to your requirements, such as logging errors or sending notifications.