Welcome to Tomba Docs! Create a free Tomba accountand gain instant access to 400+ million contacts!
Overview

Data

When you use Tomba.io, you're tapping into one of the most advanced and comprehensive datasets on the public web. We analyze and process over 2 billion pages monthly to index fresh, relevant business data with zero reliance on third-party data providers.

You can explore how our robot works here.


🔍 How We Find Email Addresses

We use a combination of powerful web crawlers and smart algorithms to build the most accurate B2B email database.

1. Collecting Publicly Available Emails

Our crawlers scan:

  • Blog posts
  • Company websites
  • PDFs, XLS files
  • Forums and comment sections

These emails are publicly accessible, and we store only what’s relevant and verified.

2. Guessing Emails with Algorithms

We use pattern recognition like:

  • firstname.lastname@domain.com
  • f.lastname@domain.com

We iterate intelligently based on company email formats. If the guessed email is likely valid, we return it with a confidence score (0–100).


🤖 Crawlers – Our Backbone

Tomba’s crawlers don’t just find emails. They perform dozens of specialized tasks to enrich and verify every record:

🔧 Jobs Performed by Our Crawlers

  • DNS, WHOIS, and SMTP resolution
  • Detect gender from names
  • Real-time data enrichment
  • Identify and update personal vs. generic emails
  • Discover and store email patterns for each domain
  • Extract website descriptions and social media links
  • Detect and store geolocation (country, city, ZIP, state)
  • Validate and delete invalid records
  • Add phone numbers and tech stack data

We crawl and process over 2 billion URLs each month, constantly refining our database to maintain accuracy and freshness.

🧼 Data Cleanliness: We only store about 6% of crawled emails and strictly avoid personal emails (like Gmail, Yahoo, Outlook).


🚫 What We Don’t Store

At Tomba.io, data quality and privacy are our priorities:

  • No duplicate records
  • No personal webmail addresses
  • No fake or low-confidence emails

🌐 Data at Scale

Tomba.io is home to the largest professional email database:

MetricValue
B2B Emails450+ million
Unique Company Domains70+ million
URLs Crawled Monthly2+ billion

All data is unified under a global standardized schema ready to use and integrate.


🧠 Types of Data We Provide

Tomba.io provides much more than emails. Here are the categories of data we crawl, enrich, and deliver:

  • Website Data: Metadata, descriptions, contact details, social links, and company details.
  • LinkedIn Data: Find emails via LinkedIn URLs with precision.
  • Technologies Used: Identify what tools and platforms the domain is built on.
  • Similar Websites: Get competitor and look-alike sites to target similar businesses.

By combining web-scale crawling, real-time enrichment, and intelligent filtering, Tomba.io delivers the cleanest and most powerful B2B data in the market helping you connect with your ideal prospects at scale. 🚀