Architecture

Five stages.
One straight line to a signal.

bypath is a modular pipeline: scrapers → normalizer → classifier → enrichment → delivery. Each stage is interchangeable, every queue is observable, every failure has a paper trail.
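To make "interchangeable" concrete, here is a minimal sketch of stages chained as plain functions over a stream of posts. It is an illustration only, not bypath's actual code: `run_pipeline`, `normalize`, `classify`, and the raw field names `text` and `source` are all hypothetical.

```python
from typing import Callable, Iterable, Iterator

# Assumption: a stage is any callable that maps a stream of dicts to a stream of dicts.
Stage = Callable[[Iterable[dict]], Iterator[dict]]

def run_pipeline(items: Iterable[dict], stages: list[Stage]) -> list[dict]:
    """Feed items through each stage in order and collect what survives."""
    stream: Iterable[dict] = items
    for stage in stages:
        stream = stage(stream)
    return list(stream)

def normalize(posts: Iterable[dict]) -> Iterator[dict]:
    # Hypothetical raw keys ("text", "source") mapped to unified names.
    for post in posts:
        yield {"caption": post.get("text", ""), "platform": post.get("source", "unknown")}

def classify(posts: Iterable[dict]) -> Iterator[dict]:
    # Toy keyword rule standing in for a real classifier; noise is dropped.
    for post in posts:
        if "looking for" in post["caption"].lower():
            yield {**post, "label": "intent"}

raw_posts = [
    {"text": "Looking for a CRM", "source": "reddit"},
    {"text": "Nice sunset", "source": "instagram"},
]
signals = run_pipeline(raw_posts, [normalize, classify])
```

Because every stage shares one signature, swapping a stage means swapping one entry in the list.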

01
Listen

Apify-backed scrapers tap 5 social networks. Rate-limited, proxied, deduplicated by canonical URL. We respect every platform’s API ToS.

15M public posts/day
Inside this stage
  • Instagram · IG Hashtag scraper
  • TikTok · apidojo
  • LinkedIn · harvestapi
  • X · akshaynceo Twitter scraper
  • Reddit · comchat reddit-api-scraper
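Deduplication by canonical URL could look roughly like the sketch below. The canonicalization rules (lowercase host, strip `www.`, drop tracking parameters and fragments, trim trailing slashes) and the `TRACKING_PARAMS` set are assumptions for illustration; the rules bypath actually applies are not documented here.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumption: these query parameters carry no identity and can be dropped.
TRACKING_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "igshid", "ref"}

def canonical_url(url: str) -> str:
    """Reduce a post URL to a stable form so share-link variants compare equal."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    kept = sorted((k, v) for k, v in parse_qsl(parts.query) if k not in TRACKING_PARAMS)
    path = parts.path.rstrip("/") or "/"
    # The fragment is discarded by passing an empty string.
    return urlunsplit((parts.scheme.lower(), host, path, urlencode(kept), ""))

def dedupe(posts: list[dict]) -> list[dict]:
    """Keep the first post seen for each canonical URL."""
    seen: set[str] = set()
    unique = []
    for post in posts:
        key = canonical_url(post["url"])
        if key not in seen:
            seen.add(key)
            unique.append(post)
    return unique
```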
02
Normalize

Posts from any platform are reshaped into a single 14-field schema. No platform-specific logic leaks into the brain.

14-field unified schema
Inside this stage
  • url, caption, shortCaption
  • username, fullName
  • timestamp, locationName
  • likesCount, commentsCount
  • hashtags, mentions
  • displayUrl, ownerId, platform
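The 14 fields above can be pictured as a single record type. The field names come from the list; the types, the defaults, and the `from_reddit` adapter with its raw keys (`permalink`, `body`, `author`, `ups`, and so on) are assumptions made for this sketch.

```python
import re
from dataclasses import dataclass, field, fields

@dataclass
class Post:
    # Field names follow the unified schema; types are assumed.
    url: str
    caption: str
    shortCaption: str
    username: str
    fullName: str
    timestamp: str
    locationName: str
    likesCount: int
    commentsCount: int
    hashtags: list[str] = field(default_factory=list)
    mentions: list[str] = field(default_factory=list)
    displayUrl: str = ""
    ownerId: str = ""
    platform: str = ""

def from_reddit(raw: dict) -> Post:
    """Hypothetical adapter: the raw keys are illustrative, not a real scraper payload."""
    text = raw.get("body", "")
    return Post(
        url=raw.get("permalink", ""),
        caption=text,
        shortCaption=text[:80],  # assumed truncation length
        username=raw.get("author", ""),
        fullName="",
        timestamp=raw.get("created", ""),
        locationName="",
        likesCount=int(raw.get("ups", 0)),
        commentsCount=int(raw.get("num_comments", 0)),
        hashtags=re.findall(r"#(\w+)", text),
        mentions=re.findall(r"@(\w+)", text),
        displayUrl=raw.get("thumbnail", ""),
        ownerId=str(raw.get("author_id", "")),
        platform="reddit",
    )
```

One adapter per platform keeps platform-specific parsing at the edge, so later stages only ever see `Post`.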
03
Qualify

A primary GPT classifier triages every post into B2B intent / B2C intent / commerce / noise. A second-pass GPT re-checks edge cases. Noise is dropped.

<60s avg · 72% B2B precision
Inside this stage
  • Primary classifier
  • Brain second-pass review
  • Drop noise · keep intent
  • Continuous re-training from your feedback loop
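A two-pass triage can be sketched as follows. The source does not say how edge cases are picked for the second pass; this sketch assumes a confidence threshold, and both classifiers are keyword stubs standing in for the GPT calls. `triage`, `stub_primary`, `stub_second_pass`, and the label strings are hypothetical names.

```python
from typing import Callable, Optional

LABELS = ("b2b_intent", "b2c_intent", "commerce", "noise")

# Assumption: a classifier returns (label, confidence between 0 and 1).
Classifier = Callable[[str], tuple[str, float]]

def triage(caption: str, primary: Classifier, second_pass: Classifier,
           threshold: float = 0.8) -> Optional[str]:
    """Return the final label, or None when the post is noise and should be dropped."""
    label, confidence = primary(caption)
    if confidence < threshold:
        # Low confidence is treated as an edge case and re-checked.
        label, _ = second_pass(caption)
    if label not in LABELS:
        raise ValueError(f"unknown label: {label}")
    return None if label == "noise" else label

def stub_primary(caption: str) -> tuple[str, float]:
    text = caption.lower()
    if "for our team" in text:
        return "b2b_intent", 0.95
    if "recommend" in text:
        return "b2c_intent", 0.55
    return "noise", 0.9

def stub_second_pass(caption: str) -> tuple[str, float]:
    label = "b2b_intent" if "vendor" in caption.lower() else "b2c_intent"
    return label, 0.9
```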
04
Enrich

7 enrichment passes attach email, phone, LinkedIn, website, location and intent vector. Every contact is verified with a deliverability test before delivery.

6 facts · 6 of 7 success
Inside this stage
  • Email finder + verifier
  • Phone finder
  • LinkedIn person + company
  • Website discovery
  • Location resolution
  • Intent vector classification
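One way to run independent enrichment passes so that a failed lookup never blocks the others is sketched below. The pass functions are stubs with made-up return values, and `enrich` and its report format are assumptions rather than bypath's implementation.

```python
from typing import Callable, Optional

# Assumption: a pass returns new facts as a dict, or None when it finds nothing.
EnrichmentPass = Callable[[dict], Optional[dict]]

def enrich(contact: dict, passes: dict[str, EnrichmentPass]) -> tuple[dict, dict[str, bool]]:
    """Run each pass in order; return the enriched contact and a per-pass success report."""
    enriched = dict(contact)
    report: dict[str, bool] = {}
    for name, run in passes.items():
        try:
            facts = run(enriched)
        except Exception:
            facts = None  # a crashing pass counts as a miss, not a pipeline failure
        report[name] = bool(facts)
        if facts:
            enriched.update(facts)
    return enriched, report

# Stub passes for illustration only.
def find_website(contact: dict) -> Optional[dict]:
    return {"website": "https://example.com"}

def find_email(contact: dict) -> Optional[dict]:
    # Depends on an earlier pass having found a website.
    if "website" not in contact:
        return None
    return {"email": "hello@example.com"}

def find_phone(contact: dict) -> Optional[dict]:
    return None
```

The report makes a figure such as "6 of 7" easy to compute: count the `True` values and divide by the number of passes.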
05
Ship

Signals land in your inbox, Slack, CRM, or webhook, each with a one-click reply template. No dashboards to babysit, no SDR floor to staff.

Email · Slack · API · CSV · CRM
Inside this stage
  • Email digest (instant or hourly)
  • Slack DM · channel · webhook
  • REST API · OpenAPI 3
  • CSV export
  • Native HubSpot / Pipedrive sync (Engine tier)
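Fanning one signal out to several destinations, while keeping a record of any failure, might look like this sketch. The senders are in-memory stand-ins: no real Slack, email, or CRM API is called, and `deliver` is a hypothetical name.

```python
from typing import Callable

# Assumption: a sender takes the signal and raises on failure.
Sender = Callable[[dict], None]

def deliver(signal: dict, senders: dict[str, Sender]) -> dict[str, str]:
    """Try every channel; one failing channel does not stop the others."""
    results: dict[str, str] = {}
    for channel, send in senders.items():
        try:
            send(signal)
            results[channel] = "ok"
        except Exception as exc:
            # Record the failure so it can be inspected or retried later.
            results[channel] = f"failed: {exc}"
    return results

# In-memory stand-ins for real integrations.
slack_outbox: list[dict] = []

def to_slack(signal: dict) -> None:
    slack_outbox.append(signal)

def to_crm(signal: dict) -> None:
    raise RuntimeError("CRM unavailable")
```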
Why a pipeline, not a dashboard

A signal that lives in a dashboard is a signal that dies.

Most “lead intelligence” tools dump a screen at you and call it a day. bypath delivers each qualified post into the place where you already work (inbox, Slack, CRM) with the contact data already attached. It is the fastest path from “heard it” to “sent the message”.