SYSTEM all green source jobstreet.com queue 14,892 pages p99 latency 215ms dataflirt.com · scraper/jobstreet-com
RUN · 89 active pipelines · jobstreet.com live

Jobstreet data,
at warehouse scale.

We extract job listings, salary estimates, company reviews, and skill requirements from Jobstreet. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Jobs extracted
342K /day
Salary data points
128K /day
Company profiles
45K /run
Active pipelines
89
Uptime
99.98%
Data Dictionary

Every field we extract from jobstreet.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Job Postings objects from jobstreet.com. All fields typed and schema-versioned.

job_idtitlecompany_namelocationclassificationsub_classificationsalary_minsalary_maxsalary_typework_typedate_posteddescriptionrequirementsurl
job_postings
● 200 OK
"job_id": "JS-984729",
"title": "Senior Software Engineer",
"company_name": "TechCorp Asia",
"location": "Kuala Lumpur",
"salary_min": 12000,
"salary_max": 18000,
"work_type": "Full Time",
"date_posted": "2023-10-24T08:30:00Z"
# job_idtitlecompany_namelocationclassificationsub_classification
1
2
3

Complete list of extractable fields for Company Profiles objects from jobstreet.com. All fields typed and schema-versioned.

company_idnameindustrywebsitesizeheadquartersdescriptionratingreview_countjobs_countbenefitslogo_url
company_profiles
● 200 OK
"company_id": "C-44921",
"name": "TechCorp Asia",
"industry": "Information Technology",
"size": "501-1000",
"rating": 4.2,
"review_count": 342,
"jobs_count": 15
# company_idnameindustrywebsitesizeheadquarters
1
2
3

Complete list of extractable fields for Salary Data objects from jobstreet.com. All fields typed and schema-versioned.

job_idtitlecompany_namelocationcurrencysalary_minsalary_maxsalary_typesalary_visiblemarket_estimate
salary_data
● 200 OK
"job_id": "JS-984729",
"currency": "MYR",
"salary_min": 12000,
"salary_max": 18000,
"salary_type": "Monthly",
"salary_visible": true,
"market_estimate": 15000
# job_idtitlecompany_namelocationcurrencysalary_min
1
2
3

Complete list of extractable fields for Skill Requirements objects from jobstreet.com. All fields typed and schema-versioned.

job_idtitleclassificationrequired_skillspreferred_skillsyears_experienceeducation_levelcertificationslanguages
skill_requirements
● 200 OK
"job_id": "JS-984729",
"required_skills": "['Python', 'AWS', 'PostgreSQL']",
"years_experience": 5,
"education_level": "Bachelor's Degree",
"languages": "['English', 'Malay']",
"certifications": "['AWS Certified Solutions Architect']"
# job_idtitleclassificationrequired_skillspreferred_skillsyears_experience
1
2
3

Complete list of extractable fields for Search Results objects from jobstreet.com. All fields typed and schema-versioned.

keywordlocationpage_numberpositionjob_idtitlecompany_nameis_promotedposted_timesalary_snippet
search_results
● 200 OK
"keyword": "Data Engineer",
"location": "Singapore",
"position": 3,
"job_id": "JS-11234",
"is_promoted": false,
"posted_time": "2 hours ago",
"salary_snippet": "SGD 6,000 - SGD 9,000"
# keywordlocationpage_numberpositionjob_idtitle
1
2
3

Capabilities

Everything you need from Jobstreet - nothing you don't

Our Jobstreet scraper handles every layer of the platform: job listings, company profiles, salary bands, and skill requirements - with JavaScript rendering and anti-bot circumvention built in.

Full Job Posting Extraction

Title, description, location, work type, and classification hierarchy extracted at scale.

Salary Band Visibility

Capture stated salary ranges, currency, and pay periods across all job listings.

Company Profile Mining

Extract industry classification, company size, benefits, and active job counts for employer analysis.

Skill & Requirement Parsing

Isolate required experience levels, educational qualifications, and specific technical skills.

Promoted Listing Detection

Track sponsored vs organic job placements across specific search keywords and locations.

Multi-Region Support

jobstreet.com.my, jobstreet.com.sg, jobstreet.co.id, and jobstreet.com.ph supported natively.

Historical Trend Tracking

Monitor posting velocity, duration active, and expired listings over time.

Remote Work Identification

Filter and extract roles tagged for remote, hybrid, or on-site work models.

Scheduled Diff Delivery

Run daily pipelines that output only new, updated, or closed jobs to minimise storage bloat.

// engagement pipeline

From search parameters to warehouse records

Brief in. Clean data out.

Define Scope
d 0

Provide target locations, job classifications, or company lists. We map the extraction schema.

Pipeline Build
d 2–4

We configure Scrapy crawlers, proxy rotation, and session management for jobstreet.com.

Validation & QA
d 4–6

Schema validation, null-rate checks, and data normalisation before full production launch.

Delivery
ongoing

JSON / CSV / Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage.

Under the hood

How our Jobstreet pipeline handles the hard parts

Jobstreet uses commercial bot protection and dynamic front-end frameworks. Here is how we stay resilient.

pipeline-monitor · jobstreet.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprintrandomised
User-agentrotated
IP poolresidential
Challenges blocked0
// pagination
Page coverage
48,291 pages queued running
// observability
Pipeline health
99.9%
uptime
142ms
p99 lat
0.3%
null rate
2
alerts
Anti-bot layer
Residential proxy rotation + fingerprinting

Jobstreet uses commercial bot protection. Our crawlers use residential ISP proxies with realistic TLS fingerprints and HTTP headers.

Dynamic content
JavaScript rendering for job details

Many Jobstreet elements load asynchronously via GraphQL APIs. We intercept these network requests directly or render via Playwright.

Schema stability
Resilient selectors

Jobstreet updates its front-end framework frequently. We use multi-layer fallback chains and API payload extraction to prevent breakages.

Change detection
Only re-scrape what changes

We hash job IDs and content states. Subsequent runs only emit new jobs, closed jobs, or modified listings.

Monitoring
24/7 pipeline health

Every run emits structured logs. We alert on null-rate spikes in salary fields or coverage drops across classifications.

Applications

Who uses Jobstreet data - and how

Teams across industries use jobstreet.com data to build competitive products and smarter operations.

01
Labour Market Analytics

Economists and researchers track hiring trends, skill demand, and salary inflation across Southeast Asia.

02
Competitor Intelligence

Enterprises monitor competitor hiring velocity, open roles, and department expansion signals.

03
Salary Benchmarking

HR teams aggregate salary bands by role and location to optimise compensation packages.

04
Lead Generation

B2B sales teams identify companies expanding specific departments to pitch relevant software or services.

05
Job Board Aggregation

Niche job boards and aggregators backfill their platforms with targeted Jobstreet listings.

06
Skill Gap Analysis

EdTech companies analyse required skills in job postings to design relevant training courses and curricula.

Why DataFlirt

"Jobstreet holds the most accurate pulse on Southeast Asia's labour market - but extracting that data requires navigating complex bot protection and dynamic front-ends."

Most teams underestimate the investment required: reliable Jobstreet scraping demands residential proxies, GraphQL API interception, daily selector maintenance, and anomaly monitoring. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

Jobstreet scraper - technical capabilities

Everything supported by our jobstreet.com scraper — rendered SPA elements, auth walls, rate-limit evasion and beyond.

JavaScript rendering
Playwright sessions for dynamic content loading
Supported
GraphQL API interception
Direct extraction from Jobstreet backend API responses
Supported
Residential proxy rotation
ISP-grade IPs from SG/MY/ID/PH pools
Supported
Multi-region support
Malaysia, Singapore, Indonesia, Philippines
Supported
Salary band extraction
Capture min, max, and currency values
Supported
Historical tracking
Track job posting duration and closure dates
Supported
Change detection
Hash-based diff for new and expired listings
Supported
Applicant volume metrics
Exact number of applicants per job posting
Partial
Candidate profile data
Resumes and personal details of job seekers
Partial
Infrastructure

Infrastructure powering the Jobstreet pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

ScrapyPlaywrightPython 3.12RedisPostgreSQLApache AirflowAWS LambdaS3CloudWatch2CaptchaCapSolverResidential ProxiesDockerKubernetesGrafanaPrometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration, deduplication, and retry logic. Playwright handles JavaScript rendering, cookie sessions, and interaction flows.

Residential Proxy Infrastructure

We maintain pools of residential ISP proxies across SG/MY/ID/PH regions. Rotation happens per-request with sticky sessions where required.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling, dependency management, and SLA alerting. All state stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON
Newline-delimited or nested arrays
CSV
Flat file with typed columns
Parquet
Columnar format for BigQuery, Snowflake, Athena
S3
Direct bucket delivery
Webhook
HTTP POST per record for real-time downstream processing
API
RESTful endpoints for on-demand querying
XLS
Excel compatible files for business teams
Postgres
Upsert into your existing schema
// faq

Common questions.

About jobstreet.com scraping, legality, and pipeline operations.

Ask us directly →
Is scraping Jobstreet legal?

Scraping publicly available job postings is generally permissible under applicable laws. DataFlirt targets only public, non-authenticated job and company data. We do not extract personal candidate information or circumvent authentication walls.

How do you handle Jobstreet's bot protection?

We use residential ISP proxies, realistic browser fingerprints, and request timing modelled on human behaviour. We monitor for 403 blocks in real time and trigger pool rotation automatically.

Which Jobstreet regions do you support?

We support jobstreet.com.my, jobstreet.com.sg, jobstreet.co.id, and jobstreet.com.ph, delivering data in a unified, normalised schema.

How fresh is the data?

Daily pipelines capture new job postings within 24 hours of publication. Real-time monitoring can be configured for specific high-priority search keywords.

Can you track when a job is closed or filled?

Yes. Our change detection system checks the status of previously scraped job IDs. If a listing is removed or marked inactive, we emit an updated status record.

Do you extract salary data if it is hidden?

We extract salary ranges if they are visible in the page DOM or API payload. We cannot extract salaries that are entirely withheld by the employer on the backend.

$ dataflirt scope --new-project --source=jobstreet.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a daily feed of tech jobs in Singapore or a historical archive of salary trends across Malaysia - we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h
Services

Data Extraction for Every Industry

View All Services →