SYSTEM all green source lever.co queue 12,408 boards p99 latency 118ms dataflirt.com · scraper/lever-co
RUN * 142 active pipelines * lever.co live

Lever job data,
at warehouse scale.

We extract job postings, department structures, workplace models, and salary bands from Lever hosted boards. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Jobs extracted
482K /day
Board updates
84K /24h
Closed roles detected
12K /run
Active pipelines
142
Uptime
99.98%
Data Dictionary

Every field we extract from lever.co

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Job Postings objects from lever.co. All fields typed and schema-versioned.

job_idtitlecompany_namedepartmentteamlocationworkplace_typedescriptionrequirementsapply_urlposted_at
job_postings
● 200 OK
"job_id": "a1b2c3d4-e5f6",
"title": "Senior Backend Engineer",
"company_name": "TechCorp",
"department": "Engineering",
"location": "Bengaluru, India",
"workplace_type": "Hybrid",
"posted_at": "2026-05-10T14:30:00Z"
# job_idtitlecompany_namedepartmentteamlocation
1
2
3

Complete list of extractable fields for Company Data objects from lever.co. All fields typed and schema-versioned.

company_idcompany_namelever_urlwebsite_urllogo_urldescriptionindustrysizehq_locationactive_jobs_count
company_data
● 200 OK
"company_name": "TechCorp",
"lever_url": "jobs.lever.co/techcorp",
"website_url": "techcorp.io",
"active_jobs_count": 42,
"hq_location": "San Francisco, CA",
"industry": "Enterprise Software"
# company_idcompany_namelever_urlwebsite_urllogo_urldescription
1
2
3

Complete list of extractable fields for Departments & Teams objects from lever.co. All fields typed and schema-versioned.

department_iddepartment_nameteam_namecompany_namejob_countlocation_distributionremote_job_countparent_departmentslug
departments_& teams
● 200 OK
"department_name": "Engineering",
"team_name": "Infrastructure",
"company_name": "TechCorp",
"job_count": 14,
"remote_job_count": 8,
"slug": "engineering-infrastructure"
# department_iddepartment_nameteam_namecompany_namejob_countlocation_distribution
1
2
3

Complete list of extractable fields for Salary & Benefits objects from lever.co. All fields typed and schema-versioned.

job_idcurrencymin_salarymax_salarypay_periodequity_offeredbenefits_listbonus_eligiblecommission_structure
salary_& benefits
● 200 OK
"job_id": "a1b2c3d4-e5f6",
"currency": "USD",
"min_salary": 140000.0,
"max_salary": 180000.0,
"pay_period": "Annually",
"equity_offered": true
# job_idcurrencymin_salarymax_salarypay_periodequity_offered
1
2
3

Complete list of extractable fields for Custom Application Fields objects from lever.co. All fields typed and schema-versioned.

job_idfield_idfield_typefield_labelrequiredoptions_listplaceholder_textvalidation_rules
custom_application fields
● 200 OK
"job_id": "a1b2c3d4-e5f6",
"field_type": "multiple_choice",
"field_label": "Will you now or in the future require sponsorship?",
"required": true,
"options_list": "['Yes', 'No']",
"placeholder_text": "Select an option"
# job_idfield_idfield_typefield_labelrequiredoptions_list
1
2
3

Capabilities

Everything you need from Lever boards

Our Lever scraper targets the underlying JSON APIs and DOM structures of hosted boards. We handle pagination, dynamic loading, and multi-location job deduplication automatically.

Full Job Data Extraction

Title, department, team, workplace type, full description text, and requirements parsed directly from the Lever board.

Closed Role Detection

Monitor active job lists and flag roles that disappear or return 404s to track hiring velocity and filled positions.

Department & Team Mapping

Extract the hierarchical structure of a company's hiring board, grouping jobs by functional area and sub-team.

Workplace Model Classification

Identify remote, hybrid, and onsite roles, including multi-location postings mapped to specific geographic coordinates.

Salary Band Extraction

Capture minimum and maximum salary ranges, currency, and equity indicators when provided in the job description or metadata.

Global Location Normalisation

Standardise inconsistent location strings into clean city, state, and country fields for downstream analytics.

Company Discovery

Input a list of domains and we automatically discover their active Lever board URLs and extract the current job counts.

Custom Application Questions

Extract the specific screening questions and dropdown options configured for each job application form.

Scheduled + Streaming Modes

Run one-off bulk exports or configure continuous pipelines at hourly, daily, or real-time cadences with change-detection diffing.

// engagement pipeline

From company list to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide company domains, Lever board URLs, or target industries. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for lever.co.

Validation & QA
d 4–6

Schema validation, null-rate checks, location normalisation, and sample records before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Lever pipeline handles the hard parts

Extracting job boards at scale requires more than a simple HTTP GET. Here is how we build resilient ATS pipelines.

pipeline-monitor · lever.co · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
API extraction
Direct JSON parsing

Lever boards often load data via underlying JSON endpoints. We target these APIs directly to extract clean, structured data without relying on brittle DOM parsing, ensuring high fidelity and low latency.

Change detection
Only re-scrape what has changed

For large company lists, we maintain a hash index of last-seen jobs. Subsequent runs only push diffs and flag closed roles, reducing compute cost and downstream processing load.

Deduplication
Handling multi-location postings

Companies often post the same job across multiple cities. We extract the parent job ID and map all associated locations to a single record, preventing inflated hiring metrics.

Discovery
Finding hidden boards

Companies frequently use custom domains or obscure Lever board names. We use automated discovery spiders to find the correct jobs.lever.co URL for any given corporate domain.

Monitoring & alerting
24/7 pipeline health

Every run emits structured logs to our observability stack. We alert on null-rate spikes, schema drift, and coverage drops. SLA uptime is contractual, not aspirational.

Applications

Who uses Lever data and how

Teams across industries use lever.co data to build competitive products and smarter operations.

01
Job Board Aggregation

Niche job boards and aggregators automatically backfill their platforms with highly targeted roles from specific companies.

02
Competitor Hiring Intelligence

Corporate strategy teams monitor competitor hiring velocity and new department creation to infer product roadmaps.

03
Labour Market Analytics

Economic researchers and investment analysts track aggregate hiring trends, remote work adoption, and salary inflation.

04
Lead Generation for B2B

Sales teams trigger outreach campaigns when target accounts open specific roles, indicating budget and immediate need.

05
Salary Benchmarking

HR teams aggregate public salary bands across thousands of Lever postings to ensure competitive compensation packages.

06
ATS Syncing

Recruiting agencies sync open roles from client Lever boards directly into their internal CRM systems.

Why DataFlirt

"Lever hosts thousands of fast-growing startup job boards, representing a massive real-time signal of company growth, strategic direction, and tech stack adoption."

Most teams underestimate the investment required to monitor thousands of Lever boards simultaneously. Reliable extraction requires board discovery, closed-role detection, daily selector maintenance, and anomaly monitoring. DataFlirt absorbs that complexity so your engineers can focus on the analysis.

Technical Spec

Lever scraper technical capabilities

Everything supported by our lever.co scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JSON API extraction
Directly targets Lever's underlying API for clean, structured data
Supported
Custom field mapping
Extracts custom questions and dropdowns from application forms
Supported
Salary band extraction
Parses min, max, and currency data from job descriptions and metadata
Supported
Closed job detection
Flags jobs that are no longer active on the board
Supported
Multi-location normalisation
Groups duplicate job postings across different geographic locations
Supported
Board discovery spidering
Automatically finds the correct Lever URL for a given domain
Supported
Change detection (diffs)
Only emits records with changed fields since the last run
Supported
Candidate application submission
Automated resume submission and candidate application via the Lever form
Partial
Internal hiring manager notes
Private ATS notes, interview scorecards, and internal candidate pipelines
Partial
Infrastructure

Infrastructure powering the Lever pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows. Combined via scrapy-playwright middleware.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across IN/US/UK/DE regions. Rotation happens per-request with sticky sessions where required. IP score monitoring prevents blacklisted pool contamination.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested - schema versioned per run
CSV
Flat file with typed columns - Excel/Sheets compatible
XLS
Standard Excel format for business analysts
Parquet
Columnar format for BigQuery, Snowflake, Athena
AWS S3
Direct bucket delivery - compatible with any data lake
Webhook
HTTP POST per record for real-time downstream processing
API
REST endpoints for on-demand data access
BigQuery
Streamed directly into your dataset with schema auto-detect
Postgres
Upsert into your existing schema with conflict resolution
Snowflake
Stage + COPY INTO workflow - incremental or full-replace
S3
Direct bucket delivery — compatible with any data lake
// faq

Common questions.

About lever.co scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Lever boards legal?

Scraping publicly available job postings is generally permissible under applicable law. DataFlirt targets only public, non-authenticated job descriptions and metadata. We do not extract personal candidate data, circumvent authentication walls, or access internal hiring manager notes. Clients should consult legal counsel for specific use cases.

How do you detect when a job is closed?

We maintain a stateful index of all active jobs per company. During each run, we compare the current board state against our index. Jobs that are no longer present or return a 404 status are flagged as closed, providing accurate hiring velocity metrics.

Can you extract salary bands?

Yes. We extract minimum and maximum salary figures, currency, and equity indicators when they are explicitly stated in the job description text or provided in the underlying structured data.

How fresh is the data?

Real-time streaming pipelines achieve sub-60-minute latency for specific target companies. Full board refreshes across thousands of companies complete within a 6-12 hour window depending on volume.

What is the minimum viable engagement?

Our smallest packages start at a defined list of 500 company domains with weekly delivery. For larger aggregations or custom schema requirements, we price based on volume and delivery frequency.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 50 company boards as part of the pre-engagement scoping process so you can validate schema fit, field completeness, and data quality before signing any contract.

$ dataflirt scope --new-project --source=lever.co ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off dump of SaaS job postings or a continuous feed of hiring signals across 10,000 companies, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →