
Gympass data,
at warehouse scale.

We extract gym locations, class timetables, amenity lists, and wellness partner catalogues from Gympass. Delivered as clean JSON, CSV, or Parquet to S3, BigQuery, or Snowflake on your cadence.

Gyms extracted: 54,291 /run
Class schedules: 892K /week
Wellness apps: 1,402 /run
Active pipelines: 42
Uptime: 99.98%
Data Dictionary

Every field we extract from gympass.com

Structured, schema-consistent data across all major object types — delivered clean, typed, and ready to query.

Complete list of extractable fields for Gyms & Studios objects from gympass.com. All fields typed and schema-versioned.

gym_id, name, category, required_tier, address_line_1, city, state, country, postal_code, latitude, longitude, rating, review_count, contact_phone, website_url, description, image_urls
gyms_& studios
● 200 OK
{
  "gym_id": "GYM-89421",
  "name": "Equinox Hudson Yards",
  "category": "Premium Fitness",
  "required_tier": "Diamond",
  "city": "New York",
  "rating": 4.8,
  "review_count": 1240,
  "latitude": 40.7538,
  "longitude": -74.0011
}
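A consumer might load a record like the sample above into a typed structure before querying it. The following is a minimal sketch, assuming the field names shown in the sample; the `GymRecord` class and `parse_gym` helper are illustrative names rather than part of any DataFlirt client library, and the Python types are inferred from the sample values.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass(frozen=True)
class GymRecord:
    """Illustrative typed view of a gyms & studios record (hypothetical class)."""
    gym_id: str
    name: str
    category: str
    required_tier: str
    city: str
    latitude: float
    longitude: float
    rating: Optional[float] = None
    review_count: Optional[int] = None


def parse_gym(raw: dict[str, Any]) -> GymRecord:
    """Coerce a raw JSON object into a GymRecord; a missing required key raises KeyError."""
    rating = raw.get("rating")
    review_count = raw.get("review_count")
    return GymRecord(
        gym_id=str(raw["gym_id"]),
        name=str(raw["name"]),
        category=str(raw["category"]),
        required_tier=str(raw["required_tier"]),
        city=str(raw["city"]),
        latitude=float(raw["latitude"]),
        longitude=float(raw["longitude"]),
        rating=None if rating is None else float(rating),
        review_count=None if review_count is None else int(review_count),
    )


# The sample record from the data dictionary above.
sample = {
    "gym_id": "GYM-89421",
    "name": "Equinox Hudson Yards",
    "category": "Premium Fitness",
    "required_tier": "Diamond",
    "city": "New York",
    "rating": 4.8,
    "review_count": 1240,
    "latitude": 40.7538,
    "longitude": -74.0011,
}
gym = parse_gym(sample)
```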

Complete list of extractable fields for Class Schedules objects from gympass.com. All fields typed and schema-versioned.

class_id, gym_id, class_name, instructor_name, start_time, end_time, duration_minutes, day_of_week, category, difficulty_level, capacity, spots_available, booking_required
class_schedules
● 200 OK
{
  "class_id": "CLS-55912",
  "gym_id": "GYM-89421",
  "class_name": "Vinyasa Flow Yoga",
  "instructor_name": "Sarah Jenkins",
  "start_time": "2026-10-14T18:00:00Z",
  "duration_minutes": 60,
  "category": "Yoga",
  "spots_available": 12,
  "booking_required": true
}
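The field list includes `end_time` and `day_of_week`, which the sample above omits. One way a consumer could derive them from `start_time` and `duration_minutes` is sketched below. The helper name `derive_schedule_fields` is hypothetical, and the weekday is computed in UTC, which can differ from the studio's local weekday.

```python
from datetime import datetime, timedelta

# Fixed English names so the result does not depend on the process locale.
WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday"]


def derive_schedule_fields(record: dict) -> dict:
    """Return a copy of the record with end_time and day_of_week filled in (UTC)."""
    # Swap the trailing "Z" for an explicit offset so older Python versions can parse it.
    start = datetime.fromisoformat(record["start_time"].replace("Z", "+00:00"))
    end = start + timedelta(minutes=record["duration_minutes"])
    return {
        **record,
        "end_time": end.isoformat().replace("+00:00", "Z"),
        "day_of_week": WEEKDAYS[start.weekday()],
    }


enriched = derive_schedule_fields(
    {"class_id": "CLS-55912", "start_time": "2026-10-14T18:00:00Z", "duration_minutes": 60}
)
```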

Complete list of extractable fields for Amenities objects from gympass.com. All fields typed and schema-versioned.

gym_id, has_pool, has_parking, has_showers, has_lockers, has_wifi, has_sauna, has_towel_service, has_cafe, accessibility_features, equipment_list, last_updated
amenities
● 200 OK
{
  "gym_id": "GYM-89421",
  "has_pool": true,
  "has_parking": false,
  "has_showers": true,
  "has_sauna": true,
  "has_towel_service": true,
  "has_wifi": true,
  "last_updated": "2026-10-12T09:00:00Z"
}

Complete list of extractable fields for Wellness Apps objects from gympass.com. All fields typed and schema-versioned.

app_id, app_name, developer, category, required_tier, description, features, supported_platforms, rating, review_count, integration_type, image_url
wellness_apps
● 200 OK
{
  "app_id": "APP-104",
  "app_name": "Calm",
  "category": "Mental Health",
  "required_tier": "Gold",
  "supported_platforms": ["iOS", "Android", "Web"],
  "rating": 4.9,
  "review_count": 45021,
  "integration_type": "SSO"
}

Complete list of extractable fields for Reviews & Ratings objects from gympass.com. All fields typed and schema-versioned.

review_id, gym_id, user_id, rating, review_text, visit_date, post_date, helpful_votes, facility_cleanliness_rating, equipment_quality_rating, staff_friendliness_rating
reviews_& ratings
● 200 OK
{
  "review_id": "REV-99210",
  "gym_id": "GYM-89421",
  "rating": 5,
  "review_text": "Incredible facilities and top tier equipment. The rooftop pool is unmatched.",
  "post_date": "2026-09-28",
  "helpful_votes": 34,
  "facility_cleanliness_rating": 5,
  "staff_friendliness_rating": 4
}

Capabilities

Everything you need from Gympass, nothing you do not

Our Gympass scraper handles geographic search variations, dynamic class schedules, and tier-gated visibility rules with JavaScript rendering and session management built in.

Gym Location Data

Extract precise addresses, latitude and longitude coordinates, contact details, and operating hours for every partner facility globally.

Dynamic Class Timetables

Capture daily schedules, instructor names, class durations, and category tags across thousands of studios simultaneously.

Gympass Tier Mapping

Identify exactly which subscription tier is required to access specific gyms, studios, or premium wellness applications.

Amenity Extraction

Categorise facilities by available amenities including pools, parking, showers, towel service, and specialised equipment.

Wellness App Catalogue

Extract the complete database of digital wellness partners, including mental health, nutrition, and personal training apps.

Instructor Profiles

Gather data on fitness professionals, including their biographies, specialties, and the specific classes they teach.

Review & Rating Mining

Extract user ratings, detailed text reviews, and specific feedback on facility cleanliness and equipment quality.

Geolocation Search

Perform radius-based scraping using coordinate grids to ensure complete coverage of urban and suburban fitness markets.

Scheduled Updates

Run continuous pipelines to capture class schedule changes, new gym additions, and tier requirement adjustments in near real time.
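The coordinate-grid approach described under Geolocation Search can be sketched as follows. This is a minimal illustration under stated assumptions, not our production crawler: the function name `grid_centres`, the 0.9 overlap factor, and the flat-earth conversion constant are all choices made for the example, and the bounding box in the usage line is an arbitrary box around Manhattan.

```python
import math

KM_PER_DEG_LAT = 111.32  # approximate length of one degree of latitude


def grid_centres(south, west, north, east, radius_km, overlap=0.9):
    """Return (lat, lon) search centres whose radius_km circles cover the box.

    A circle of radius r fully covers a square cell of side r * sqrt(2), so the
    grid step is set to that side length, shrunk by `overlap` for safety margin.
    Flat-earth approximation: fine for city-scale boxes, not for boxes that
    cross the antimeridian or approach the poles.
    """
    step_km = radius_km * math.sqrt(2) * overlap
    lat_step = step_km / KM_PER_DEG_LAT
    centres = []
    lat = south + lat_step / 2
    while lat - lat_step / 2 < north:
        # A degree of longitude shrinks with the cosine of the latitude.
        lon_step = step_km / (KM_PER_DEG_LAT * math.cos(math.radians(lat)))
        lon = west + lon_step / 2
        while lon - lon_step / 2 < east:
            centres.append((round(lat, 6), round(lon, 6)))
            lon += lon_step
        lat += lat_step
    return centres


centres = grid_centres(40.70, -74.02, 40.80, -73.93, radius_km=1.0)
```

Each centre would then be issued as one radius search. Smaller radii produce more requests but reduce the chance that a dense area hits a per-query result limit.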

// engagement pipeline

From location list to warehouse record

Brief in. Clean data out.

Define Scope · day 0

Provide target cities, coordinates, or specific gym categories. We design the extraction schema together.

Pipeline Build · days 2–4

We configure Scrapy and Playwright crawlers, geographic proxy rotation, and session management for gympass.com.

Validation & QA · days 4–6

Schema validation, null-rate checks, and coordinate verification before full launch.

Delivery · ongoing

JSON, CSV, or Parquet pushed to your S3 bucket, BigQuery dataset, or Snowflake stage on agreed cadence.
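To make the Validation & QA step concrete, here is a rough sketch of the three checks it names: type validation, null-rate measurement, and coordinate verification. The `EXPECTED_TYPES` mapping, the function names, and every record in `batch` other than the first are made up for illustration; a real schema would cover every field and be agreed during scoping.

```python
from typing import Any

# Hypothetical expected types for a few gym fields.
EXPECTED_TYPES = {
    "gym_id": str,
    "name": str,
    "latitude": (int, float),
    "longitude": (int, float),
}


def type_errors(record: dict[str, Any]) -> list[str]:
    """Return the names of non-null fields whose value has an unexpected type."""
    return [
        field
        for field, expected in EXPECTED_TYPES.items()
        if record.get(field) is not None and not isinstance(record[field], expected)
    ]


def null_rates(records: list[dict[str, Any]], fields: list[str]) -> dict[str, float]:
    """Fraction of records in which each field is missing or null."""
    total = len(records)
    if total == 0:
        return {field: 0.0 for field in fields}
    return {
        field: sum(1 for r in records if r.get(field) is None) / total
        for field in fields
    }


def has_valid_coordinates(record: dict[str, Any]) -> bool:
    """True when latitude and longitude are numeric and inside their valid ranges."""
    lat, lon = record.get("latitude"), record.get("longitude")
    if not isinstance(lat, (int, float)) or not isinstance(lon, (int, float)):
        return False
    return -90.0 <= lat <= 90.0 and -180.0 <= lon <= 180.0


# Made-up batch: one clean record, one null name, one impossible latitude,
# and one latitude delivered as a string.
batch = [
    {"gym_id": "GYM-89421", "name": "Equinox Hudson Yards", "latitude": 40.7538, "longitude": -74.0011},
    {"gym_id": "GYM-2", "name": None, "latitude": 40.7, "longitude": -74.0},
    {"gym_id": "GYM-3", "name": "Example Studio", "latitude": 140.7, "longitude": -74.0},
    {"gym_id": "GYM-4", "name": "Example Gym", "latitude": "40.7", "longitude": -74.0},
]
rates = null_rates(batch, ["name", "latitude"])
bad_coordinates = [r["gym_id"] for r in batch if not has_valid_coordinates(r)]
```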

Under the hood

How our Gympass pipeline handles the hard parts

Gympass relies on map-based dynamic loading and complex session states. Here is how we maintain reliable extraction pipelines.

pipeline-monitor · gympass.com · live ● active
// fingerprinting
Identity rotation
TLS fingerprint: randomised
User-agent: rotated
IP pool: residential
Challenges blocked: 0
// pagination
Page coverage
48,291 pages queued · running
// observability
Pipeline health
Uptime: 99.9%
p99 latency: 142ms
Null rate: 0.3%
Alerts: 2
Map interactions
Handling coordinate-based dynamic loading

Gympass surfaces locations through a map interface that queries its API with precise coordinate bounding boxes. We issue these geographic queries programmatically so that no facility is missed during extraction.
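One common way to make bounding-box queries exhaustive is to split any box that appears to hit a result limit into four quadrants and query each in turn. The sketch below illustrates that idea only. It assumes the map endpoint truncates results at some `cap`, which is an assumption rather than documented Gympass behaviour, and `fetch` is a hypothetical callable standing in for the real request; here it is replaced by a fake that serves made-up points.

```python
from typing import Callable, NamedTuple


class BBox(NamedTuple):
    south: float
    west: float
    north: float
    east: float

    def quadrants(self) -> list["BBox"]:
        """Split the box into four equal quadrants."""
        mid_lat = (self.south + self.north) / 2
        mid_lon = (self.west + self.east) / 2
        return [
            BBox(self.south, self.west, mid_lat, mid_lon),
            BBox(self.south, mid_lon, mid_lat, self.east),
            BBox(mid_lat, self.west, self.north, mid_lon),
            BBox(mid_lat, mid_lon, self.north, self.east),
        ]


def crawl_bbox(box: BBox, fetch: Callable[[BBox], list[dict]], cap: int,
               max_depth: int = 12) -> dict[str, dict]:
    """Collect records keyed by gym_id, subdividing boxes that reach the cap."""
    results = fetch(box)
    if len(results) < cap or max_depth == 0:
        return {r["gym_id"]: r for r in results}
    found: dict[str, dict] = {}
    for quadrant in box.quadrants():
        # Keying by gym_id removes duplicates that sit on shared quadrant edges.
        found.update(crawl_bbox(quadrant, fetch, cap, max_depth - 1))
    return found


# Fake data and a fake fetch that returns at most 5 results per query.
FAKE_GYMS = [
    {"gym_id": f"GYM-{i}", "latitude": 40.0 + i * 0.01, "longitude": -74.0 + (i * 7 % 50) * 0.01}
    for i in range(50)
]


def fake_fetch(box: BBox) -> list[dict]:
    inside = [
        g for g in FAKE_GYMS
        if box.south <= g["latitude"] <= box.north and box.west <= g["longitude"] <= box.east
    ]
    return inside[:5]


all_gyms = crawl_bbox(BBox(40.0, -74.0, 40.5, -73.5), fake_fetch, cap=5)
```

The `max_depth` guard stops the recursion if many records share the same coordinates and a box can never drop below the cap.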

SPA rendering
Full Playwright execution for schedule data

Class timetables and dynamic availability load via complex JavaScript frameworks. We run full Playwright browser sessions to hydrate these components properly before extraction.

Geolocation spoofing
Matching IP addresses to target regions

Gympass alters visibility based on the user's location. We route requests through residential proxies physically located in the target search radius to capture accurate regional data.

Change detection
Only re-scrape what has changed

For massive class schedules, we maintain a hash index of last-seen values per field. Subsequent runs only push diffs, reducing downstream processing load.
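A per-field hash index of this kind might look like the sketch below. It is an illustration rather than our actual implementation: `field_hashes` and `diff_run` are hypothetical names, a plain dict stands in for whatever store holds the index, and fields that disappear between runs are not reported.

```python
import hashlib
import json


def field_hashes(record: dict) -> dict[str, str]:
    """Hash each field value separately so changes can be located per field."""
    return {
        field: hashlib.sha256(json.dumps(value, sort_keys=True).encode("utf-8")).hexdigest()
        for field, value in record.items()
    }


def diff_run(records: list[dict], index: dict[str, dict[str, str]],
             key: str = "class_id") -> list[dict]:
    """Return only new or changed fields per record, and update the index in place."""
    diffs = []
    for record in records:
        new_hashes = field_hashes(record)
        old_hashes = index.get(record[key], {})
        changed = {f: record[f] for f, h in new_hashes.items() if old_hashes.get(f) != h}
        if changed:
            # Always carry the key so the consumer knows which record the diff belongs to.
            diffs.append({key: record[key], **changed})
        index[record[key]] = new_hashes
    return diffs


index: dict[str, dict[str, str]] = {}
first = diff_run([{"class_id": "CLS-55912", "class_name": "Vinyasa Flow Yoga", "spots_available": 12}], index)
second = diff_run([{"class_id": "CLS-55912", "class_name": "Vinyasa Flow Yoga", "spots_available": 12}], index)
third = diff_run([{"class_id": "CLS-55912", "class_name": "Vinyasa Flow Yoga", "spots_available": 11}], index)
```

The first run emits the full record, the second emits nothing, and the third emits only the changed `spots_available` value alongside the key.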

Monitoring
24/7 pipeline health checks

Every run emits structured logs to our observability stack. We alert on null-rate spikes or coverage drops and respond before you notice.
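As a rough illustration of the two alert conditions mentioned above, the sketch below compares a run against a baseline. The `RunStats` and `check_run` names and both thresholds are placeholders chosen for the example, not our production alerting rules; the baseline figures reuse the gym count and null rate quoted elsewhere on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RunStats:
    records: int      # records emitted by the run
    null_rate: float  # share of null values across required fields, 0.0 to 1.0


def check_run(current: RunStats, baseline: RunStats,
              max_null_rate: float = 0.01, max_coverage_drop: float = 0.05) -> list[str]:
    """Return human-readable alerts for null-rate spikes and coverage drops."""
    alerts = []
    if current.null_rate > max_null_rate:
        alerts.append(f"null rate {current.null_rate:.1%} exceeds {max_null_rate:.1%}")
    # Coverage drop: fewer records than the baseline minus the tolerated share.
    floor = baseline.records * (1 - max_coverage_drop)
    if current.records < floor:
        alerts.append(f"coverage dropped to {current.records} records from {baseline.records}")
    return alerts


baseline = RunStats(records=54291, null_rate=0.003)
healthy = check_run(RunStats(records=54100, null_rate=0.003), baseline)
degraded = check_run(RunStats(records=48000, null_rate=0.02), baseline)
```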

Applications

Who uses Gympass data and how

Teams across industries use gympass.com data to build competitive products and smarter operations.

01
Competitor Analysis

Fitness aggregators and boutique studio chains track Gympass partner growth and tier placement to optimise their own pricing strategies.

02
Corporate Wellness Benchmarking

HR departments and wellness consultants analyse tier requirements to evaluate the true value of corporate fitness subsidies.

03
Real Estate Planning

Commercial property developers use gym density and amenity data to identify underserved neighbourhoods for new fitness facility investments.

04
Fitness Market Research

Analysts track the rise of specific class categories and wellness apps to forecast broader health and fitness industry trends.

05
Aggregator Platforms

Health insurance providers integrate Gympass facility data to enrich their own member wellness portals and directories.

06
Pricing Strategy

Independent gyms monitor which competitors join the Gympass network and at which tier to position their own membership fees.

Why DataFlirt

"Gympass maps the global fitness ecosystem, but extracting structured class and tier data requires navigating complex geographic search parameters."

Most teams underestimate the investment required: reliable Gympass scraping requires geographic IP matching, full JavaScript rendering for map interfaces, and daily selector maintenance. DataFlirt absorbs that complexity so your engineers can focus on the analysis, not the infrastructure.

Technical Spec

Gympass scraper technical capabilities

Everything supported by our gympass.com scraper: rendered SPA elements, bot-protection walls, rate-limit handling, and more.

JavaScript rendering · Supported
Full Playwright sessions required for map interfaces and dynamic class schedules
CAPTCHA bypass · Supported
Automated 2Captcha and CapSolver integration for bot protection walls
Residential proxy rotation · Supported
Geo-targeted ISP residential IPs to match search coordinates
Coordinate grid search · Supported
Automated bounding box generation to scrape entire metropolitan areas
Tier requirement mapping · Supported
Extraction of specific Gympass plan levels required for entry
Class schedule extraction · Supported
Full daily and weekly timetables parsed into structured datetime formats
Change detection · Supported
Hash-based diffing to emit only new or modified class schedules
Webhook delivery · Supported
HTTP POST per record for real-time schedule updates
User check-in history · Partial
Gated data requiring authenticated user accounts and historical attendance
Corporate account pricing · Partial
B2B negotiated rates hidden behind enterprise sales portals
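For the webhook delivery entry above, a per-record HTTP POST could be assembled roughly as follows. This is a sketch using only the Python standard library: the envelope shape (`schema_version` plus `record`), the helper name `build_webhook_request`, and the example URL are assumptions, and the request is built but not sent.

```python
import json
import urllib.request


def build_webhook_request(url: str, record: dict, schema_version: str) -> urllib.request.Request:
    """Build a JSON POST request carrying one record in a small envelope."""
    body = json.dumps({"schema_version": schema_version, "record": record}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_webhook_request(
    "https://example.com/hooks/gympass",  # placeholder endpoint
    {"class_id": "CLS-55912", "spots_available": 12},
    schema_version="1",
)
# To actually deliver it: urllib.request.urlopen(req, timeout=10)
```

A receiver would typically acknowledge with a 2xx status, and a production sender would add retries and request signing, which are omitted here.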
Infrastructure

Infrastructure powering the Gympass pipeline

Open-source tooling on proven cloud infra — no vendor lock-in, full observability.

Scrapy · Playwright · Python 3.12 · Redis · PostgreSQL · Apache Airflow · AWS Lambda · S3 · CloudWatch · 2Captcha · CapSolver · Residential Proxies · Docker · Kubernetes · Grafana · Prometheus
Scrapy + Playwright Stack

Scrapy handles crawl orchestration and deduplication. Playwright handles JavaScript rendering, map interactions, and complex session states.

Geographic Proxy Infrastructure

We maintain pools of residential ISP proxies mapped to specific global regions to ensure accurate local search results and bypass geo-blocks.

Cloud-Native Orchestration

Pipelines run on AWS Lambda and ECS. Airflow handles scheduling and dependency management. All state is stored in managed Postgres.

Output & Delivery

Your data, your destination

Data delivered to where your team already works — no new tooling required.

JSON: Newline-delimited or nested, schema-versioned per run
CSV: Flat file with typed columns for spreadsheet analysis
XLS: Excel format for direct business user consumption
Parquet: Columnar format for BigQuery, Snowflake, and Athena
AWS S3: Direct bucket delivery compatible with any data lake
Webhook: HTTP POST per record for real-time downstream processing
API: REST endpoints to query your extracted dataset programmatically
PostgreSQL: Upsert into your existing schema with conflict resolution
BigQuery: Streamed directly into your dataset with schema auto-detect
Snowflake: Stage and copy workflow for incremental or full replacement
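The PostgreSQL option above relies on upserts with conflict resolution. The sketch below shows the general pattern using Python's built-in `sqlite3` module as a stand-in so that it runs without a database server; PostgreSQL supports the same `INSERT ... ON CONFLICT ... DO UPDATE` form. The `gyms` table, its columns, and the `upsert_gyms` helper are hypothetical, and the second batch's rating is made up to show an update.

```python
import sqlite3

# Insert new gyms; when gym_id already exists, overwrite the mutable columns.
UPSERT_SQL = """
INSERT INTO gyms (gym_id, name, rating)
VALUES (:gym_id, :name, :rating)
ON CONFLICT (gym_id) DO UPDATE SET
    name = excluded.name,
    rating = excluded.rating
"""


def upsert_gyms(conn: sqlite3.Connection, records: list[dict]) -> None:
    """Apply a batch of records in one transaction."""
    with conn:  # commits on success, rolls back on error
        conn.executemany(UPSERT_SQL, records)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE gyms (gym_id TEXT PRIMARY KEY, name TEXT, rating REAL)")

upsert_gyms(conn, [{"gym_id": "GYM-89421", "name": "Equinox Hudson Yards", "rating": 4.8}])
upsert_gyms(conn, [{"gym_id": "GYM-89421", "name": "Equinox Hudson Yards", "rating": 4.7}])

rows = conn.execute("SELECT gym_id, name, rating FROM gyms").fetchall()
```

After both batches the table holds a single row for the gym, carrying the rating from the later batch.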
// faq

Common questions.

About gympass.com scraping, legality, and pipeline operations.

Is scraping Gympass legal?

Scraping publicly available information from Gympass is generally permissible under applicable law. DataFlirt targets only public, non-authenticated facility, class, and tier data. We do not extract personal user data or circumvent authentication walls. Clients should review the Gympass terms of service and consult legal counsel for specific use cases.

How do you handle location-based search results?

We use coordinate bounding boxes and route requests through residential ISP proxies located in the target region. This ensures we capture the exact gyms and classes visible to a user in that specific geographic area.

Can you extract class schedules for specific gyms?

Yes. We can extract daily and weekly timetables, including instructor names, class categories, and durations, for any facility listed on the platform.

How fresh is the data?

Streaming pipelines deliver schedule updates with under 60 minutes of latency for a defined facility set. Full city-wide refreshes on a daily cadence complete within 4 to 8 hours, depending on geographic size.

Do you extract the required Gympass tier for each gym?

Yes. Every facility record includes the specific Gympass subscription tier required for entry, allowing you to map the value of different plan levels.

What is the minimum viable engagement?

Our smallest packages start at a defined city or region list with weekly delivery. For global coverage or custom schema requirements, we price based on volume and delivery frequency. Contact us with your use case for a scoped quote.

Can I request a sample dataset before committing?

Absolutely. We provide a sample run of up to 500 facilities or 5 target cities as part of the pre-engagement scoping process so you can validate schema fit, field completeness, and data quality before signing any contract.

$ dataflirt scope --new-project --source=gympass.com ready

Tell us what
to extract.
We do the rest.

20-minute scoping call. Pilot dataset within the week. Production within two. Whether you need a one-off gym catalogue dump or a continuous class schedule feed across 50 cities, we scope, build, and operate the pipeline. Tell us what you need.

hello@dataflirt.com · Bengaluru · IST · typical reply < 4h