01 / Source map
Define public sources before writing any crawler
List the exact public pages, keywords, categories, social channels, supplier directories, crowdfunding pages, or search result types that matter to the business.
This prevents blind scraping and keeps the workflow tied to a buyer problem.
02 / Field design
Design the fields that turn a page into a lead record
Fields can include product category, company name, buyer role, region, platform, visible contact route, demand clue, pain point, MOQ clue, urgency, and source URL.
Good fields make data cleaning possible later.
03 / Collection control
Collect politely with limits, logs, and retry rules
Use Playwright, Scrapling, Yingdao RPA, or browser queues with clear schedules, source logs, delay logic, error capture, and platform-rule awareness.
A stable workflow is more valuable than a large unstable scrape.
04 / AI extraction
Extract buyer meaning from messy pages and comments
AI prompts classify product, audience, pain point, stage, objection, and next action from raw text. The extraction result is stored beside the original source.
This bridges public content and practical sales language.
05 / Cleaning
Normalize, deduplicate, and mark missing fields
Clean company names, country names, emails, phone formats, platform handles, product tags, lead status, and duplicate records before scoring.
Without cleaning, sales teams waste time on repeated or misleading records.
06 / Scoring
Score the lead before it interrupts the team
Rules and Bayesian-style scoring can rank fit, urgency, buyer language, source quality, negative signals, and confidence.
Only qualified signals should create alerts.
07 / Route
Push the clean record into Feishu, Telegram, CRM, or sheet
A high-fit record becomes a concise alert with source link, summary, score, missing fields, owner, suggested follow-up, and next reminder.
The workflow becomes action, not just a report.
08 / Learn
Feed repeated questions back into SEO and GEO pages
Repeated buyer questions, product names, objections, and platform phrases become FAQ pages, answer-hub entries, case pages, and sales scripts.
Automation improves the website, not only the sales inbox.