SYSTEM all green source worknz.co.nz queue 12,419 pages p99 latency 185ms dataflirt.com · scraper/worknz-co.nz
RUN · 42 active pipelines · worknz.co.nz live

New Zealand job data,
at warehouse scale.

We extract job postings, salary bands, skill requirements, and employer profiles from worknz.co.nz. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Jobs extracted
14.2K /day
Salary updates
8.9K /24h
Employer records
4.1K /run
Active pipelines
42
Uptime
99.94%
Data Dictionary

Every field we extract from worknz.co.nz

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Job Postings objects from worknz.co.nz. All fields typed and schema-versioned.

job_idtitleemployer_nameemployer_idlocationregionsalary_minsalary_maxjob_typeposted_datedescription_htmldescription_textremote_eligibleapplication_url
job_postings
● 200 OK
"job_id": "WNZ-89421",
"title": "Senior Civil Engineer",
"employer_name": "Fletcher Construction",
"location": "Auckland CBD",
"salary_min": 120000.0,
"salary_max": 145000.0,
"job_type": "Full-time",
"posted_date": "2026-08-14T08:30:00Z",
"remote_eligible": false
# job_idtitleemployer_nameemployer_idlocationregion
1
2
3

Complete list of extractable fields for Salary Data objects from worknz.co.nz. All fields typed and schema-versioned.

job_idtitleemployer_namesalary_minsalary_maxsalary_exactcurrencypay_periodbenefits_listedequity_offeredsuperannuation_included
salary_data
● 200 OK
"job_id": "WNZ-89421",
"title": "Senior Civil Engineer",
"salary_min": 120000.0,
"salary_max": 145000.0,
"currency": "NZD",
"pay_period": "ANNUAL",
"benefits_listed": "['Health Insurance', 'Company Vehicle']",
"superannuation_included": true
# job_idtitleemployer_namesalary_minsalary_maxsalary_exact
1
2
3

Complete list of extractable fields for Employer Profiles objects from worknz.co.nz. All fields typed and schema-versioned.

employer_idcompany_nameindustrywebsite_urlcompany_sizeheadquartersactive_jobs_countdescriptionfounded_yearlogo_url
employer_profiles
● 200 OK
"employer_id": "EMP-4029",
"company_name": "Fletcher Construction",
"industry": "Construction & Engineering",
"company_size": "1000-5000",
"headquarters": "Auckland",
"active_jobs_count": 47,
"founded_year": 1909
# employer_idcompany_nameindustrywebsite_urlcompany_sizeheadquarters
1
2
3

Complete list of extractable fields for Skill Requirements objects from worknz.co.nz. All fields typed and schema-versioned.

job_idskill_nameexperience_yearsmandatorycategorynormalized_skillcontext_snippetimportance_score
skill_requirements
● 200 OK
"job_id": "WNZ-89421",
"skill_name": "AutoCAD",
"experience_years": 5,
"mandatory": true,
"category": "Software",
"normalized_skill": "autocad",
"context_snippet": "Must have at least 5 years of commercial experience using AutoCAD."
# job_idskill_nameexperience_yearsmandatorycategorynormalized_skill
1
2
3

Complete list of extractable fields for Search Results objects from worknz.co.nz. All fields typed and schema-versioned.

keywordlocation_filterpositionjob_idtitleemployer_namepromotedsnippetscraped_at
search_results
● 200 OK
"keyword": "civil engineer",
"location_filter": "Auckland",
"position": 3,
"job_id": "WNZ-89421",
"promoted": false,
"employer_name": "Fletcher Construction",
"scraped_at": "2026-08-15T10:12:44Z"
# keywordlocation_filterpositionjob_idtitleemployer_name
1
2
3

Capabilities

Everything you need from Worknz.co.nz — nothing you don't

Our scraper handles every layer of the Worknz.co.nz platform: job listings, salary bands, employer directories, and search pagination — with anti-bot circumvention built in.

Full Job Listing Extraction

Title, full HTML description, job type, location, remote eligibility, and posting dates scraped accurately from individual job pages.

Salary Band Parsing

Extract stated salary ranges, exact figures, pay periods, and currency. We normalise text like '120k-145k + Super' into structured numeric fields.

Employer Directory Mining

Capture company name, industry, size, headquarters, and total active open roles directly from employer profile pages.

Skill Normalisation

Parse unstructured job descriptions to isolate specific technical requirements, certifications, and years of experience requested.

Location & Remote Flags

Distinguish between on-site, hybrid, and fully remote roles across all New Zealand regions and specific postcodes.

SERP & Keyword Scraping

Track organic vs promoted position for any job title search, monitoring which employers are aggressively bidding for visibility.

Expired Job Tracking

Monitor active URLs to detect exactly when a job is taken down, providing accurate time-to-fill metrics for the NZ market.

Scheduled + Streaming Modes

Run one-off bulk exports of historical jobs or configure continuous pipelines at hourly cadences with change-detection diffing.

Rate Limit Evasion

Job boards deploy strict traffic shaping. We manage concurrency and proxy rotation to ensure zero IP bans and complete data extraction.

// engagement pipeline

From search criteria to warehouse record

Brief in. Clean data out.

Define Scope
d 0

Provide job categories, locations, keywords, or specific employer names. We design the extraction schema together.

Pipeline Build
d 2–4

We configure Scrapy / Playwright crawlers, proxy rotation, session management, and CAPTCHA handling for worknz.co.nz.

Validation & QA
d 4–6

Schema validation, null-rate checks, salary-outlier detection, and sample listings before full launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.

Under the hood

How our Worknz.co.nz pipeline handles the hard parts

Job boards deploy strict rate limits to protect their inventory. Here is how we stay resilient.

pipeline-monitor · worknz.co.nz · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential proxy rotation + fingerprint spoofing

Worknz.co.nz uses traffic analysis to block scrapers. Our crawlers use residential ISP proxies with realistic browser fingerprints, randomised request timing, and full cookie session management — trained on real applicant behaviour patterns.

JavaScript rendering
Full Playwright execution for dynamic content

Salary reveals and application modals are often JavaScript-rendered. We run full Playwright browser sessions with JavaScript execution and lazy-load triggering, capturing data that headless HTTP clients miss entirely.

Schema stability
Resilient selectors with fallback chains

Job boards frequently update their DOM structure. Our selector strategy uses multiple fallback chains per field — CSS selectors, XPath, text-pattern matching, and structured data extraction (LD+JSON) — so a layout change does not break your data pipeline.

Change detection
Only re-scrape what has changed

For large job catalogues, we maintain a hash index of last-seen values per field. Subsequent runs only push diffs or status updates (e.g., job expired) — reducing compute cost and downstream processing load.

Monitoring & alerting
24/7 pipeline health with anomaly detection

Every run emits structured logs to our observability stack. We alert on null-rate spikes, volume drops, schema drift, and coverage gaps — and respond before you notice. SLA uptime is contractual.

Applications

Who uses Worknz.co.nz data — and how

Teams across industries use worknz.co.nz data to build competitive products and smarter operations.

01
Labor Market Analytics

Economists and researchers track hiring volume, regional demand, and skill shifts across the New Zealand economy.

02
Salary Benchmarking

HR teams and compensation analysts aggregate salary bands to ensure their offers remain competitive in specific NZ regions.

03
Lead Generation (B2B)

Recruitment agencies and B2B software vendors monitor new job postings to identify companies actively expanding their teams.

04
Competitor Hiring Intelligence

Corporate strategy teams monitor competitor job postings to deduce strategic direction, technology stack choices, and expansion plans.

05
Job Board Aggregation

Niche job boards and career portals ingest Worknz.co.nz listings to backfill their own inventory and provide comprehensive search.

06
Economic Forecasting

Hedge funds and quantitative analysts correlate job posting velocity with corporate health and macroeconomic trends.

Why DataFlirt

"Worknz.co.nz holds the pulse of the New Zealand labor market — but building a reliable feed requires navigating rate limits and shifting DOM structures."

Most teams underestimate the investment required: reliable job board scraping requires residential proxies, full JavaScript rendering, CAPTCHA handling, daily selector maintenance, and anomaly monitoring. DataFlirt absorbs that complexity so your engineers can focus on the analysis — not the infrastructure.

Technical Spec

Worknz.co.nz scraper — technical capabilities

Everything supported by our worknz.co.nz scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Full Playwright sessions — required for dynamic salary widgets and lazy-loaded descriptions
Supported
CAPTCHA bypass
Automated 2Captcha + CapSolver integration with fallback to manual queue
Supported
Residential proxy rotation
ISP-grade residential IPs from NZ/AU pools — rotated per request
Supported
Salary extraction
Parsing of both structured salary fields and unstructured text mentions
Supported
Expired listing detection
Continuous monitoring to flag when a job URL returns 404 or inactive status
Supported
Employer profile scraping
Extraction of company metadata and aggregation of all active listings per employer
Supported
Change detection (diffs)
Hash-based diff: only emit records with changed fields since last run
Supported
Webhook delivery
HTTP POST per record or batch — useful for real-time job alerting systems
Supported
Applicant tracking / Resume data
Extraction of candidate profiles, CVs, or applicant volumes
Partial
Logged-in user saved jobs
Extraction of personal user account data, saved searches, or application history
Partial
Infrastructure

Infrastructure powering the Worknz.co.nz pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows. Combined via scrapy-playwright middleware.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across New Zealand and Australia regions. Rotation happens per-request with sticky sessions where required. IP score monitoring prevents blacklisted pool contamination.

Cloud-Native Orchestration

Pipelines run on AWS Lambda (burst) and ECS (sustained). Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested — schema versioned per run
CSV
Flat file with typed columns — Excel/Sheets compatible
Parquet
Columnar format for BigQuery, Snowflake, Athena
S3
Direct bucket delivery — compatible with any data lake
BigQuery
Streamed directly into your dataset with schema auto-detect
Webhook
HTTP POST per record for real-time downstream processing
Postgres
Upsert into your existing schema with conflict resolution
Snowflake
Stage + COPY INTO workflow — incremental or full-replace
// faq

Common questions.

About worknz.co.nz scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Worknz.co.nz legal?

Scraping publicly available job postings is generally permissible under applicable law, reinforced by rulings like hiQ v. LinkedIn. DataFlirt targets only public, non-authenticated job and employer data. We do not extract personal candidate data or circumvent authentication walls. Clients should review terms of service and consult legal counsel for specific use cases.

How do you handle anti-bot systems?

We use residential ISP proxies, full Playwright browser sessions with realistic fingerprints, and request timing modelled on human applicant behaviour. Our selectors have multi-layer fallback chains so DOM changes do not break the pipeline. We monitor for rate limits in real time and trigger pool rotation automatically.

Can you extract salary data if it is hidden in the description?

Yes. While we extract structured salary fields directly from the metadata, our parsing engine also uses regex and NLP to identify and extract numeric salary bands and pay periods mentioned within the raw HTML job description.

How fresh is the job data?

Real-time streaming pipelines achieve sub-60-minute latency for new job postings based on specific keyword or category alerts. Full site refreshes at daily cadence complete within a 4-8 hour window depending on total volume.

Do you track when a job is removed?

Yes. For ongoing pipelines, we maintain state of all active job URLs. If a URL returns a 404 or an 'expired' tag, we emit an update record marking the job as closed, allowing you to calculate precise time-to-fill metrics.

What is the minimum viable engagement?

Our smallest packages start at a defined set of categories or keywords with weekly delivery. For full-site extraction or custom schema requirements, we price based on volume and delivery frequency. Contact us with your use case for a scoped quote.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 500 job listings or 50 search result pages as part of the pre-engagement scoping process — so you can validate schema fit, field completeness, and data quality before signing any contract.

$ dataflirt scope --new-project --source=worknz.co.nz ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off regional extraction or a continuous feed of all New Zealand job postings — we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →