
Travel reward data, at warehouse scale.

We extract credit card offers, points valuations, airline reviews, and loyalty programme data from The Points Guy. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Articles extracted: 18.4K per run
Card offer updates: 2.1K per 24h
Points valuations: 142 per month
Active pipelines: 31
Uptime: 99.98%

Data Dictionary

Every field we extract from thepointsguy.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Credit Card Offers objects from thepointsguy.com. All fields typed and schema-versioned.

Fields: card_name, issuer, network, annual_fee, intro_bonus, bonus_spend_req, bonus_timeframe, regular_apr, foreign_transaction_fee, spend_multipliers, card_type, page_url
Sample record: credit_card_offers (200 OK)
{
  "card_name": "Chase Sapphire Preferred",
  "issuer": "Chase",
  "annual_fee": 95,
  "intro_bonus": "60,000 points",
  "bonus_spend_req": 4000,
  "bonus_timeframe": "3 months",
  "regular_apr": "21.49% to 28.49% Variable",
  "foreign_transaction_fee": "None"
}

Complete list of extractable fields for Points Valuations objects from thepointsguy.com. All fields typed and schema-versioned.

Fields: programme_name, programme_type, valuation_cents, prior_valuation_cents, trend, last_updated, transfer_partners, point_expiry_policy, page_url, scraped_at
Sample record: points_valuations (200 OK)
{
  "programme_name": "Chase Ultimate Rewards",
  "programme_type": "Credit Card",
  "valuation_cents": 2.05,
  "prior_valuation_cents": 2.0,
  "trend": "up",
  "last_updated": "2023-10-01"
}

Complete list of extractable fields for Airline & Hotel Reviews objects from thepointsguy.com. All fields typed and schema-versioned.

Fields: review_title, author, publish_date, airline_hotel_name, route_location, cabin_class, rating_score, pros_list, cons_list, bottom_line, images, url
Sample record: airline_hotel_reviews (200 OK)
{
  "airline_hotel_name": "Emirates",
  "cabin_class": "First Class",
  "route_location": "JFK to DXB",
  "rating_score": 9.2,
  "pros_list": ["Shower spa", "Dom Perignon"],
  "author": "Zach Honig"
}

Complete list of extractable fields for Flight Deals objects from thepointsguy.com. All fields typed and schema-versioned.

Fields: deal_title, origin_airports, destination_airports, airline, price, cabin, travel_dates, book_by, alliance, page_url
Sample record: flight_deals (200 OK)
{
  "deal_title": "Fly to Paris for $350 round trip",
  "origin_airports": ["JFK", "EWR"],
  "destination_airports": ["CDG", "ORY"],
  "airline": "Air France",
  "price": 350,
  "cabin": "Economy"
}

Complete list of extractable fields for Editorial Articles objects from thepointsguy.com. All fields typed and schema-versioned.

Fields: article_id, title, category, tags, author, publish_date, update_date, content_body, featured_image, affiliate_links, related_articles, url
Sample record: editorial_articles (200 OK)
{
  "article_id": "tpg-98421",
  "title": "Maximizing British Airways Avios",
  "category": "Loyalty Programmes",
  "author": "Benji Stawski",
  "publish_date": "2023-09-15",
  "tags": ["Oneworld", "Avios"]
}

Capabilities

Comprehensive travel reward data extraction

Our pipeline handles dynamic content on The Points Guy, resolving complex affiliate redirects, tracking changing credit card offers, and extracting structured loyalty programme valuations.

Credit Card Offer Tracking

Extract sign-up bonuses, annual fees, spend multipliers, and introductory APR periods directly from comparison tables and review pages.

Points Valuation Extraction

Capture cents-per-point values and historical trend analysis for every airline, hotel, and credit card loyalty programme.

Affiliate Link Resolution

Unroll complex redirect chains to identify the final advertiser destination URLs for credit card applications and travel bookings.

Airline & Hotel Review Parsing

Extract rating scores, cabin classes, pros and cons lists, and bottom-line summaries from detailed travel reviews.

Flight Deal Monitoring

Monitor origin and destination airports, prices, airlines, and alliance data from published flight deal alerts.

Loyalty Programme Guides

Structure transfer partner lists, elite status tier requirements, and point expiration policies from comprehensive guide articles.

Author & Contributor Tracking

Capture article metadata including publication dates, update timestamps, author names, and category tags.

JavaScript Rendering

Execute Playwright sessions to hydrate dynamic calculators, interactive tables, and client-side rendered offer details.

Scheduled Updates

Run daily or weekly pipelines to catch offer changes, valuation updates, and new article publications within one scheduled cycle.

// engagement pipeline

From URL list to warehouse record

Brief in. Clean data out.

Define Scope
Day 0

Provide target categories, article tags, or specific credit card issuer pages. We design the extraction schema together.

Pipeline Build
Days 2–4

We configure Scrapy and Playwright crawlers, network interceptors for affiliate links, and proxy rotation.

Validation & QA
Days 4–6

Schema validation, null-rate checks, and redirect resolution testing before full launch.

Delivery
ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
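
The null-rate check from the Validation & QA step can be illustrated with a short sketch. It assumes records arrive as dictionaries and that a field counts as null when it is missing, None, or an empty string. The function names and the 5% threshold are hypothetical, chosen for illustration rather than taken from our production configuration.

```python
from collections import Counter


def null_rates(records, fields):
    """Return the share of records in which each field is missing or empty."""
    if not records:
        return {field: 0.0 for field in fields}
    nulls = Counter()
    for record in records:
        for field in fields:
            if record.get(field) in (None, ""):
                nulls[field] += 1
    return {field: nulls[field] / len(records) for field in fields}


def failing_fields(records, fields, max_null_rate=0.05):
    """List fields whose null rate exceeds the (hypothetical) threshold."""
    rates = null_rates(records, fields)
    return sorted(field for field, rate in rates.items() if rate > max_null_rate)


# Two illustrative records; the second one is deliberately incomplete.
sample = [
    {"card_name": "Chase Sapphire Preferred", "issuer": "Chase", "annual_fee": 95},
    {"card_name": "Example Card", "issuer": "", "annual_fee": None},
]
fields = ["card_name", "issuer", "annual_fee"]
rates = null_rates(sample, fields)
failing = failing_fields(sample, fields)
```

A legitimate zero, such as a card with no annual fee, is not treated as null, which matters for numeric fields like annual_fee.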

Under the hood

How our pipeline handles travel media scraping

Travel sites deploy sophisticated affiliate networks and dynamic content loading. Here is how we extract clean data.

pipeline-monitor · thepointsguy.com · live (active)
// fingerprinting
Identity rotation
TLS fingerprint: randomised
User-agent: rotated
IP pool: residential
Challenges blocked: 0
// pagination
Page coverage
48,291 pages queued (running)
// observability
Pipeline health
Uptime: 99.9%
p99 latency: 142ms
Null rate: 0.3%
Alerts: 2

Affiliate link unrolling
Following 301 and 302 redirects to final advertiser

Credit card application links are routed through multiple affiliate networks. Our network interception layer follows the redirect chain to capture the final destination URL, ensuring you know exactly which offer is being promoted.
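
As a rough illustration of following a redirect chain, the sketch below walks Location headers until it reaches a non-redirect response. It is not our interception layer: it assumes a caller-supplied `fetch` function that returns a status code and Location header without following redirects, and the hosts are placeholders. It also treats 303, 307, and 308 as redirects alongside 301 and 302.

```python
from urllib.parse import urljoin

REDIRECT_STATUSES = {301, 302, 303, 307, 308}


def resolve_chain(start_url, fetch, max_hops=10):
    """Follow Location headers and return every URL visited, final URL last.

    `fetch` is a caller-supplied function (hypothetical here) that takes a URL
    and returns (status_code, location_header_or_None) without following
    redirects itself.
    """
    chain = [start_url]
    url = start_url
    # One extra iteration so the final, non-redirect response is fetched too.
    for _ in range(max_hops + 1):
        status, location = fetch(url)
        if status not in REDIRECT_STATUSES or not location:
            return chain
        url = urljoin(url, location)  # Location may be relative
        if url in chain:
            raise ValueError(f"redirect loop at {url}")
        chain.append(url)
    raise ValueError(f"more than {max_hops} redirects from {start_url}")


# Stand-in for real HTTP calls; the hosts are placeholders.
FAKE_RESPONSES = {
    "https://publisher.example/go/card": (302, "https://tracker.example/click?id=1"),
    "https://tracker.example/click?id=1": (301, "/apply"),
    "https://tracker.example/apply": (200, None),
}

chain = resolve_chain("https://publisher.example/go/card", FAKE_RESPONSES.get)
final_url = chain[-1]
```

Keeping the whole chain, not just the last URL, is what makes it possible to audit which intermediary domains a link passes through.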

DOM structure variations
Handling different article templates

Editorial sites use varying templates for reviews, news, and guides. We deploy resilient CSS and XPath selector chains that adapt to different page layouts, ensuring consistent data extraction regardless of the article format.
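
A selector chain can be pictured as an ordered list of fallbacks, where the first selector that yields text wins. The sketch below uses the standard library's ElementTree on well-formed markup to stay self-contained; real pages would need an HTML parser, and the class names are illustrative assumptions rather than selectors taken from the site.

```python
import xml.etree.ElementTree as ET


def first_match(root, xpaths):
    """Return stripped text of the first XPath that yields non-empty text."""
    for xpath in xpaths:
        node = root.find(xpath)
        if node is not None:
            text = "".join(node.itertext()).strip()
            if text:
                return text
    return None


# Hypothetical selector chain, most specific template first.
TITLE_SELECTORS = [
    ".//h1[@class='review-title']",
    ".//h1[@class='article-title']",
    ".//h1",
]

review_page = ET.fromstring(
    "<article><h1 class='review-title'>Emirates First Class</h1></article>"
)
guide_page = ET.fromstring(
    "<article><header><h1>Maximizing British Airways Avios</h1></header></article>"
)

review_title = first_match(review_page, TITLE_SELECTORS)
guide_title = first_match(guide_page, TITLE_SELECTORS)
```

Ordering the chain from most to least specific means a template change degrades to a generic selector instead of returning nothing.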

JavaScript hydration
Playwright for dynamic calculators

Points valuation calculators and dynamic offer tables require client-side execution. We use Playwright to render the full DOM, capturing data that plain HTTP clients, which never execute JavaScript, miss entirely.
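
A minimal sketch of this rendering step, using Playwright's synchronous API, might look like the following. The helper names `rendered_text` and `parse_cents` are hypothetical, and any selector passed in would have to be worked out per page; this shows the general pattern, not our crawler code.

```python
import re


def parse_cents(text):
    """Pull the first decimal number out of rendered text such as '2.05 cents'."""
    match = re.search(r"\d+(?:\.\d+)?", text)
    return float(match.group()) if match else None


def rendered_text(url, selector, timeout_ms=15000):
    """Render a page in headless Chromium and return the text of one element."""
    # Imported lazily so parse_cents stays usable without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url, wait_until="domcontentloaded")
            # Wait until client-side code has inserted the element.
            page.wait_for_selector(selector, timeout=timeout_ms)
            return page.inner_text(selector)
        finally:
            browser.close()
```

Waiting on a specific selector, not a fixed delay, ties the extraction to the moment the hydrated element actually exists.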

Change detection
Only pushing updated card offers

We maintain a hash index of last-seen values for credit card offers and points valuations. Subsequent runs only push diffs, reducing downstream processing load and providing a clean changelog.
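
The idea can be sketched in a few lines: hash a canonical form of each record, compare it with the hash stored from the previous run, and emit only the records that differ. The in-memory dictionary below stands in for a persistent index, and the 75,000-point bonus is a made-up value used only to trigger a change.

```python
import hashlib
import json


def record_hash(record):
    """Stable SHA-256 digest of a record, independent of key order."""
    canonical = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def diff_run(records, seen, key="card_name"):
    """Return records that are new or changed, updating `seen` in place.

    `seen` maps a record key to the hash stored on the previous run.
    """
    changed = []
    for record in records:
        digest = record_hash(record)
        if seen.get(record[key]) != digest:
            changed.append(record)
            seen[record[key]] = digest
    return changed


seen = {}
run_1 = [{"card_name": "Chase Sapphire Preferred", "annual_fee": 95, "intro_bonus": "60,000 points"}]
run_2 = [{"card_name": "Chase Sapphire Preferred", "annual_fee": 95, "intro_bonus": "75,000 points"}]

first = diff_run(run_1, seen)    # new record, so it is emitted
repeat = diff_run(run_1, seen)   # unchanged, so nothing is emitted
update = diff_run(run_2, seen)   # bonus changed, so it is emitted again
```

Serialising with sorted keys keeps the hash stable when field order shifts between runs, so only real value changes show up as diffs.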

Proxy management
Avoiding geo-blocks on specific deals

Certain flight deals and credit card offers are geo-restricted. We route requests through region-specific residential proxies to ensure we capture the exact content presented to target demographics.

Applications

Who uses travel reward data and how

Teams across industries use thepointsguy.com data to build competitive products and smarter operations.

01
Competitor Intelligence

Credit card issuers track competitor sign-up bonuses, annual fee changes, and spend category multipliers to optimise their own products.

02
Affiliate Marketing Analysis

Affiliate networks audit link placements, redirect chains, and promotional terms across major publishers to ensure compliance.

03
Travel Aggregators

Flight search engines integrate points valuations to show users the true cash value of booking with miles versus currency.

04
Fintech Applications

Personal finance applications power their wallet optimisation engines using structured data on card perks and bonus categories.

05
Market Research

Analysts track historical trends in points valuations and loyalty programme devaluations to assess market health.

06
Content Strategy

Publishers identify high-traffic travel topics and review formats to inform their own editorial calendars.
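
To make the travel aggregator use case concrete, here is the arithmetic behind comparing a points price with a cash fare. The 30,000-point award price is a hypothetical figure; the 2.05 cents valuation and $350 fare reuse the sample records shown earlier on this page.

```python
def points_cash_value(points, valuation_cents):
    """Convert a points amount to dollars at a cents-per-point valuation."""
    return round(points * valuation_cents / 100, 2)


def better_to_pay_cash(cash_price, points_price, valuation_cents):
    """True when the points required are worth more than the cash fare."""
    return points_cash_value(points_price, valuation_cents) > cash_price


# 30,000 points at 2.05 cents each are worth about $615.
value = points_cash_value(30_000, 2.05)

# Spending $615 worth of points on a $350 fare is a poor redemption.
pay_cash = better_to_pay_cash(350, 30_000, 2.05)
```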

Why DataFlirt

"The Points Guy holds the industry standard for points valuations and credit card offer history, but accessing this data programmatically requires complex redirect resolution and continuous monitoring."

Credit card offers and flight deals change daily. Relying on manual data entry leads to stale information and broken affiliate links. DataFlirt automates the extraction of dynamic tables, resolves redirect chains, and delivers structured JSON directly to your warehouse, ensuring your downstream applications always reflect the latest market conditions.

Technical Spec

The Points Guy scraper technical specifications

What our thepointsguy.com scraper supports, from rendered SPA elements to rate-limit handling, and where authenticated content limits coverage.

Feature | Details | Status
JavaScript rendering | Full Playwright sessions required for interactive calculators and dynamic offer tables | Supported
Affiliate link resolution | Network interception to capture final destination URLs across redirect chains | Supported
Residential proxy rotation | ISP-grade residential IPs to bypass rate limits and geo-restrictions | Supported
Change detection (diffs) | Hash-based diff to only emit records with changed offers or valuations | Supported
Article body extraction | Clean HTML or markdown extraction of editorial content | Supported
Table parsing | Structured extraction of comparison tables and points charts | Supported
Webhook delivery | HTTP POST per record for real-time downstream processing | Supported
TPG App user wallet data | Personalised wallet tracking requires authenticated user sessions | Partial
Personalised card recommendations | CardMatch and tailored suggestions behind user quizzes and login walls | Partial

Infrastructure

Infrastructure powering the extraction pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy, Playwright, Python 3.12, Redis, PostgreSQL, Apache Airflow, AWS Lambda, S3, CloudWatch, 2Captcha, CapSolver, Residential Proxies, Docker, Kubernetes, Grafana, Prometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and deduplication. Playwright handles JavaScript rendering and interactive elements. Combined via scrapy-playwright middleware.

Redirect Resolution Engine

Custom network interception layer captures HTTP 301 and 302 redirects, logging the entire chain to extract the final affiliate destination URL.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling and dependency management. All state is stored in managed PostgreSQL.
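
For the Scrapy + Playwright stack described above, wiring in scrapy-playwright typically comes down to a few project settings. The fragment below is a minimal sketch of such a settings.py, not our production configuration; the browser type and launch options are assumed defaults.

```python
# settings.py fragment for a Scrapy project using scrapy-playwright

# Route HTTP and HTTPS downloads through the Playwright-aware handler.
DOWNLOAD_HANDLERS = {
    "http": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
    "https": "scrapy_playwright.handler.ScrapyPlaywrightDownloadHandler",
}

# scrapy-playwright requires Twisted's asyncio-based reactor.
TWISTED_REACTOR = "twisted.internet.asyncioreactor.AsyncioSelectorReactor"

# Assumed defaults for this sketch.
PLAYWRIGHT_BROWSER_TYPE = "chromium"
PLAYWRIGHT_LAUNCH_OPTIONS = {"headless": True}
```

A spider then opts in per request with `scrapy.Request(url, meta={"playwright": True})`, so static pages can still go through Scrapy's ordinary downloader and only dynamic pages pay the cost of a browser.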

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON: Newline-delimited or nested, schema versioned per run
CSV: Flat file with typed columns, Excel and Sheets compatible
XLS: Legacy spreadsheet format for business analysts
Parquet: Columnar format for BigQuery, Snowflake, Athena
AWS S3: Direct bucket delivery, compatible with any data lake
Webhook: HTTP POST per record for real-time downstream processing
API: REST endpoint to query historical snapshots and diffs
PostgreSQL: Upsert into your existing schema with conflict resolution
// faq

Common questions.

About thepointsguy.com scraping, legality, and pipeline operations.

Is scraping The Points Guy legal?

Scraping publicly available information from editorial sites is generally permissible under applicable law. DataFlirt targets only public, non-authenticated articles, reviews, and offer tables. We do not extract personal data or circumvent authentication walls.

How do you handle affiliate links?

We use network interception during the browser session to capture the full redirect chain. This allows us to extract the final destination URL of the credit card application or travel booking, bypassing intermediary tracking domains.

Can you track changes in points valuations over time?

Yes. Every pipeline run produces timestamped snapshots. We maintain a time-series record for points valuations, allowing you to track devaluation trends historically.
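
As an illustration of how timestamped snapshots turn into a trend, the sketch below compares the two most recent valuations for one programme and produces the valuation_cents, prior_valuation_cents, trend, and last_updated fields from the data dictionary. The September snapshot date is an assumption added for the example.

```python
from datetime import date


def valuation_trend(snapshots):
    """Compare the two most recent snapshots of one programme.

    `snapshots` is a list of (date, valuation_cents) tuples in any order.
    """
    if len(snapshots) < 2:
        return None
    ordered = sorted(snapshots)
    (_, prior), (last_updated, current) = ordered[-2], ordered[-1]
    if current > prior:
        trend = "up"
    elif current < prior:
        trend = "down"
    else:
        trend = "flat"
    return {
        "valuation_cents": current,
        "prior_valuation_cents": prior,
        "trend": trend,
        "last_updated": last_updated.isoformat(),
    }


history = [
    (date(2023, 10, 1), 2.05),
    (date(2023, 9, 1), 2.0),  # assumed date for the prior snapshot
]
latest = valuation_trend(history)
```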

How fresh is the credit card offer data?

Pipelines can be configured to run daily or weekly. For critical offer tracking, daily runs ensure you capture limited-time sign-up bonuses as soon as they are published.

Do you extract full article text?

Yes. We can deliver full editorial content as clean HTML or structured markdown, stripping out advertisements and navigation boilerplate.

What is the minimum viable engagement?

Our smallest package covers one defined category, such as all credit card reviews or all points valuations, with weekly delivery. Pricing depends on volume and delivery frequency.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 100 articles or credit card offers as part of the scoping process, so you can validate schema fit and data quality.

$ dataflirt scope --new-project --source=thepointsguy.com ready

Tell us what to extract. We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a historical archive of points valuations or a continuous feed of credit card offers, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h