IP Intelligence

IP & Patent Data Monitored Continuously

Continuously extract and monitor patent applications, grants, trademark filings, citation networks, and assignee portfolios from USPTO, EPO, WIPO PCT, and 50+ national IP offices, all structured, normalised, and delivered with real-time filing alerts.

100M+
Patents indexed
50+
IP offices covered
24–48 hr
New filing capture
180+
Jurisdictions
◆ Enterprise Ready ◆ SOC 2 Aware ◆ GDPR Compliant ◆ 99.9% Uptime ◆ Global Coverage ◆ 24/7 Monitoring ◆ API-First ◆ Managed Service ◆ Real-Time Data ◆ Custom Schemas ◆ Bengaluru HQ
What & Why

What Is IP & Patent Data Scraping?

Intellectual property data scraping is the automated collection of structured information from national and international patent databases, trademark registries, and copyright records. This includes patent applications, granted patents, claim text, inventor and assignee details, filing and grant dates, forward and backward citation networks, patent family linkages, legal status, and trademark registration history.

IP databases are notoriously fragmented. The USPTO, EPO, WIPO, JPO, and CNIPA each have their own search interfaces, data formats, and update schedules. A single global company's patent portfolio may be spread across a dozen national registries with different record formats and latencies. DataFlirt unifies these sources into a consistent, cross-jurisdiction schema so your team can monitor competitor IP activity, map technology landscapes, and track your own portfolio without switching between a dozen databases.

For corporate R&D teams, IP law firms, investment analysts, and technology strategy consultants, structured patent and trademark data is the foundation of freedom-to-operate analysis, prior art searches, competitive intelligence, and portfolio valuation. DataFlirt automates the collection layer so your analysts spend time on insights, not data gathering.

Why IP Data Is Strategic Intelligence
R&D Investment Signals
Patent filing activity reveals where competitors are directing R&D spend, often 18 months before products launch.
Freedom to Operate
Comprehensive patent landscape data is the raw material for FTO analysis in any technology domain.
Whitespace Identification
Citation network and classification analysis reveals technology areas with limited patent coverage.
Litigation Risk Monitoring
Track patent assertion entity (PAE) activity, licensing demands, and ITC filing patterns in your technology space.
Portfolio Valuation
Citation frequency, forward citation counts, and claim breadth data feed IP asset valuation models.
Capabilities

Everything You Need

Comprehensive extraction built for reliability, accuracy, and scale.

Patent Scraping

Extract full patent documents: title, abstract, claims, description, drawings metadata, classification codes, and legal status.

Trademark Monitoring

Track trademark applications, registrations, renewals, oppositions, and cancellations across national and international registries.

Copyright Databases

Scrape copyright registration records, licensing information, and rights holder data from US Copyright Office and international databases.

Global IP Office Coverage

USPTO, EPO, WIPO PCT, JPO, KIPO, CNIPA, IPO India, and 45+ national IP offices normalised into a unified schema.

Citation Network Mapping

Forward and backward citation relationships extracted and linked to build technology landscape and influence maps.

Infringement Signal Detection

Monitor e-commerce platforms and domain registrations for potential trademark and patent infringement signals.

Data Fields

What We Extract

Every field you need, structured and ready to use downstream.

Patent Number, Title, Abstract, Claims, Description, Inventor, Assignee, Filing Date, Publication Date, Grant Date, Priority Date, Legal Status, IPC Class, CPC Class, Forward Citations, Backward Citations, Patent Family, Jurisdiction, Prosecution History, Trademark Class, Nice Class, Registration Number, Opposition, Goods & Services Description, Copyright Registration, Rights Holder
Process

From IP Registry to Structured Portfolio Intelligence

A proven process that reliably turns any source into clean, structured data.

01
Define Technology Scope
Specify assignees, technology classifications (IPC/CPC), keywords, or jurisdictions to monitor.
02
Multi-Office Collection
We collect from all relevant national and international IP offices simultaneously, handling each office's unique access patterns.
03
Family & Citation Mapping
Patent families linked across jurisdictions; forward and backward citation graphs built automatically.
04
Alerts & Continuous Monitoring
Real-time alerts for new filings, grants, status changes, and citation events matching your defined criteria.
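
The family-linking idea in step 03 can be illustrated with a minimal sketch. It assumes that two publications belong to the same family when they are connected, directly or indirectly, through a shared priority number, and it groups them with a union-find structure. The function name `group_families`, the input shape, and the sample identifiers are hypothetical and do not describe DataFlirt's actual pipeline.

```python
from collections import defaultdict


def group_families(records):
    """Group publications linked through shared priority numbers.

    `records` maps a publication number to the set of priority numbers
    it claims. Returns a sorted list of families (each a sorted list).
    """
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    # Publications and priorities are tagged so their IDs cannot collide.
    for publication, priorities in records.items():
        find(("pub", publication))
        for priority in priorities:
            union(("pub", publication), ("prio", priority))

    families = defaultdict(set)
    for publication in records:
        families[find(("pub", publication))].add(publication)
    return sorted(sorted(members) for members in families.values())


# Invented identifiers, for illustration only.
records = {
    "US-0001": {"PRIO-A"},
    "EP-0001": {"PRIO-A"},
    "JP-0001": {"PRIO-A", "PRIO-B"},
    "CN-0001": {"PRIO-B"},
    "US-0002": {"PRIO-C"},
}
families = group_families(records)
```

Here `CN-0001` joins the first family only through `JP-0001`, which is why an indirect-linking structure is used instead of a plain group-by on a single priority number.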
Sample Output
response.json
{
  "patent_number": "US11924214B2",
  "office": "USPTO",
  "title": "Neural network-based query optimisation system",
  "assignee": "Google LLC",
  "inventors": ["Zhang, Wei", "Patel, Sanjay"],
  "filing_date": "2022-04-12",
  "grant_date": "2025-03-04",
  "cpc_classes": ["G06F16/2453", "G06N3/08"],
  "forward_citations": 14,
  "backward_citations": 38,
  "family_size": 7,
  "legal_status": "Active"
}
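
A record shaped like the sample above could be consumed downstream roughly as follows. This is a sketch only: the `PatentRecord` class, the `parse_record` helper, and the `pendency_days` property are illustrative names, and the code assumes every field in the sample is present and that dates arrive as ISO 8601 strings.

```python
import json
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class PatentRecord:
    patent_number: str
    office: str
    title: str
    assignee: str
    inventors: tuple
    filing_date: date
    grant_date: date
    cpc_classes: tuple
    forward_citations: int
    backward_citations: int
    family_size: int
    legal_status: str

    @property
    def pendency_days(self) -> int:
        """Days between filing and grant."""
        return (self.grant_date - self.filing_date).days


def parse_record(raw: str) -> PatentRecord:
    data = json.loads(raw)
    # Convert ISO date strings and JSON lists into typed, immutable values.
    data["filing_date"] = date.fromisoformat(data["filing_date"])
    data["grant_date"] = date.fromisoformat(data["grant_date"])
    data["inventors"] = tuple(data["inventors"])
    data["cpc_classes"] = tuple(data["cpc_classes"])
    return PatentRecord(**data)


SAMPLE = """
{
  "patent_number": "US11924214B2",
  "office": "USPTO",
  "title": "Neural network-based query optimisation system",
  "assignee": "Google LLC",
  "inventors": ["Zhang, Wei", "Patel, Sanjay"],
  "filing_date": "2022-04-12",
  "grant_date": "2025-03-04",
  "cpc_classes": ["G06F16/2453", "G06N3/08"],
  "forward_citations": 14,
  "backward_citations": 38,
  "family_size": 7,
  "legal_status": "Active"
}
"""

record = parse_record(SAMPLE)
print(record.patent_number, record.pendency_days)
```

Pending applications would have no grant date, so a production parser would need to treat `grant_date` as optional.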
Technical Stack

Enterprise-Grade Infrastructure

Built on proven open-source tools and cloud infrastructure, with no vendor lock-in.

Multi-Office Normalisation

Each IP office has a different schema, identifier format, and classification system. We normalise all into a single cross-jurisdiction model.
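
One small piece of this normalisation, splitting a publication number into country code, serial number, and kind code, might look like the sketch below. The `PublicationId` type, the function name, and the accepted input formats are assumptions made for illustration. Design patents, reissues, and other numbers containing letters in the serial part are deliberately out of scope.

```python
import re
from typing import NamedTuple, Optional


class PublicationId(NamedTuple):
    country: str          # two-letter office code, e.g. "US"
    number: str           # serial digits, kept as a string
    kind: Optional[str]   # kind code such as "A1" or "B2", if present


_PUBLICATION = re.compile(r"^([A-Z]{2})(\d+)([A-Z]\d?)?$")


def normalise_publication_number(raw: str) -> PublicationId:
    """Strip separators and split a publication number into its parts."""
    compact = re.sub(r"[\s,./-]", "", raw.upper())
    match = _PUBLICATION.match(compact)
    if match is None:
        raise ValueError(f"Unrecognised publication number: {raw!r}")
    country, number, kind = match.groups()
    return PublicationId(country, number, kind)


variants = ["US11924214B2", "US 11,924,214 B2", "us-11924214-b2"]
normalised = {normalise_publication_number(v) for v in variants}
```

All three spellings collapse to a single identifier, which is the property a cross-jurisdiction schema needs before records from different offices can be joined.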

Graph-Based Citation Analysis

Patent citation networks stored in Neo4j for traversal queries: identify influential patents, technology clusters, and citation loops.
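
To show what a traversal query over a citation network computes, here is a small in-memory sketch. It uses plain Python dictionaries as a stand-in for the graph database, so it does not show Neo4j or Cypher usage. The function names and the sample patent IDs are hypothetical.

```python
from collections import defaultdict, deque


def invert_citations(backward):
    """Turn backward citations (citing -> cited) into forward citations."""
    forward = defaultdict(set)
    for citing, cited_docs in backward.items():
        for cited in cited_docs:
            forward[cited].add(citing)
    return dict(forward)


def downstream(forward, start):
    """All patents that cite `start` directly or through a citation chain."""
    seen = set()
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for citing in forward.get(current, ()):
            # The `seen` check also keeps the walk finite if the data
            # contains a citation loop.
            if citing not in seen and citing != start:
                seen.add(citing)
                queue.append(citing)
    return seen


# Invented IDs: each key cites the patents in its list.
backward = {
    "P2": ["P1"],
    "P3": ["P1", "P2"],
    "P4": ["P3"],
    "P5": ["P9"],
}
forward = invert_citations(backward)
direct_count = len(forward["P1"])
influence = downstream(forward, "P1")
```

The direct forward-citation count and the size of the downstream set are two simple influence measures. A graph database becomes worthwhile once these walks run over millions of nodes.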

Real-Time Filing Alerts

New applications captured within 24–48 hours of publication. Webhook alerts fire for assignee, technology class, or keyword matches.
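
The matching step behind such an alert can be sketched as follows, assuming a rule fires when any one of its assignee, CPC-prefix, or keyword criteria matches a filing. The `AlertRule` class, the `build_alert` helper, and the payload shape are hypothetical. The sketch stops at building the payload and does not send an HTTP request.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlertRule:
    name: str
    assignees: tuple = ()
    cpc_prefixes: tuple = ()
    keywords: tuple = ()

    def matched_on(self, filing: dict) -> list:
        """Return the criteria ('assignee', ...) that this filing satisfies."""
        reasons = []
        assignee = filing.get("assignee", "").casefold()
        if assignee in {a.casefold() for a in self.assignees}:
            reasons.append("assignee")
        # str.startswith accepts a tuple; an empty tuple never matches.
        codes = filing.get("cpc_classes", [])
        if any(code.startswith(self.cpc_prefixes) for code in codes):
            reasons.append("technology_class")
        title = filing.get("title", "").casefold()
        if any(keyword.casefold() in title for keyword in self.keywords):
            reasons.append("keyword")
        return reasons


def build_alert(rule: AlertRule, filing: dict):
    """Build a webhook payload, or return None when the rule does not fire."""
    reasons = rule.matched_on(filing)
    if not reasons:
        return None
    return {
        "rule": rule.name,
        "patent_number": filing["patent_number"],
        "matched_on": reasons,
    }


filing = {
    "patent_number": "US11924214B2",
    "title": "Neural network-based query optimisation system",
    "assignee": "Google LLC",
    "cpc_classes": ["G06F16/2453", "G06N3/08"],
}
ml_watch = AlertRule("ml-watch", cpc_prefixes=("G06N3",), keywords=("neural network",))
alert = build_alert(ml_watch, filing)
```

Recording which criteria matched lets the receiving system route or rank alerts without re-running the rule.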

Full-Text Claim Extraction

Independent and dependent claims extracted and structured individually, enabling claim-level search and breadth analysis.
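
A rough sketch of how claims might be structured individually is given below. It relies on a simple heuristic, assumed here for illustration: a claim is treated as dependent when its text refers to an earlier claim number. The function name, the output fields, and the sample claim text are invented. Multiple-dependency wording such as "claims 1 to 3" would only register the first number.

```python
import re

_CLAIM_REFERENCE = re.compile(r"\bclaims?\s+(\d+)", re.IGNORECASE)


def structure_claims(claims):
    """Turn an ordered list of claim texts into one record per claim."""
    structured = []
    for number, text in enumerate(claims, start=1):
        # Only references to earlier claims count as dependencies.
        referenced = {int(n) for n in _CLAIM_REFERENCE.findall(text)}
        depends_on = sorted(n for n in referenced if n < number)
        structured.append({
            "number": number,
            "text": text,
            "depends_on": depends_on,
            "independent": not depends_on,
        })
    return structured


# Invented claim text, for illustration only.
claims = [
    "A system comprising a processor configured to optimise a query.",
    "The system of claim 1, wherein the processor uses a neural network.",
    "The system of claim 2, wherein the neural network is trained on query logs.",
    "A method comprising receiving a query and generating an execution plan.",
]
structured = structure_claims(claims)
independent_numbers = [c["number"] for c in structured if c["independent"]]
```

With each claim stored as its own record, counting independent claims or measuring their length becomes a simple query, which is the basis of claim-level breadth analysis.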

Asian IP Office Coverage

JPO (Japan), KIPO (Korea), CNIPA (China), and IPO India covered with translated abstracts and normalised classification codes.

Legal Status Tracking

Patent prosecution history and legal status changes (abandoned, lapsed, opposed, licensed) tracked and updated on every crawl.

Tools & Technologies
Python, Scrapy, aiohttp, lxml, BeautifulSoup4, PostgreSQL, Neo4j, Redis, AWS Lambda, Docker, USPTO Open Data API, EPO OPS API
Use Cases

Built for Every Team

From solo analysts to enterprise data teams, here's how organizations use this data.

01
Patent Landscape Analysis
Map the technology patent landscape before committing R&D investment: identify whitespace, crowded areas, and key players.
02
Freedom to Operate Research
Build the patent dataset foundation for FTO analysis in any jurisdiction or technology domain.
03
Competitor IP Monitoring
Track competitor patent and trademark filing activity in real time, so you know what they're protecting before they announce it.
04
IP Portfolio Valuation
Citation frequency, claim count, family size, and legal status data to support IP asset valuation for transactions and licensing.
05
Brand Protection
Monitor trademark application filings and domain registrations for potential brand infringement across jurisdictions.
06
Technology Due Diligence
Assess an acquisition target's IP position: portfolio depth, citation influence, prosecution quality, and competitive exposure.

IP Intelligence Is Innovation Intelligence

Patent databases are the most detailed public record of where technology is heading and where companies are placing their bets. A competitor's filing activity today is their product roadmap 24 months from now. DataFlirt delivers structured, continuously monitored IP intelligence across every jurisdiction, every office, and every technology class, so your R&D and strategy teams are never surprised by what's coming.

Pricing

Simple, Scalable Pricing

Start free and scale as your data needs grow.

Starter
$99/mo

For small teams and projects getting started with data.

  • 50,000 records/month
  • 5 data sources
  • Daily refresh
  • JSON & CSV export
  • Email support
Enterprise
Custom

For large organizations with custom requirements.

  • Unlimited records
  • Dedicated infrastructure
  • Real-time delivery
  • SLA guarantees
  • Account manager
  • Custom integrations
FAQ

Common Questions

Everything you need to know before getting started.

Which patent offices do you cover?
USPTO, EPO, WIPO PCT, JPO (Japan), KIPO (Korea), CNIPA (China), IPO India, UKIPO, DPMA (Germany), INPI (France/Brazil), and 40+ additional national offices. For trademark data, we cover EUIPO, WIPO Madrid System, USPTO TESS, and major national registries.
How quickly do you capture new patent applications?
New US patent applications are captured within 24 hours of USPTO publication (typically Thursday morning). EPO and WIPO publications are captured within 24–48 hours. National offices vary; most are captured within 72 hours of publication.
Can you extract full claim text?
Yes. We extract each independent and dependent claim as a separate structured record, enabling claim-level searching and analysis rather than only searching abstract or title text.
Can you build patent landscape maps for a technology area?
Yes. We can construct patent landscape datasets for any technology domain using IPC/CPC classification codes, keyword searches, and assignee lists, with citation network data for influence mapping.
Do you monitor for trademark infringement online?
Yes. We monitor major e-commerce platforms (Amazon, Flipkart, eBay), domain registrars, and App Store/Google Play for potential trademark violations, such as new filings that conflict with your marks or product listings that use protected brand terms.
Is Indian IP data (IPO India) covered?
Yes. The Indian Patent Office (IPO) publication database is fully covered, with patent application numbers, publication dates, applicant details, and IPC classifications extracted and normalised. We also cover trademark filings under IP India's Trade Marks Registry.
Get Started

Ready to Start Collecting IP & Patent Data?

Join data teams worldwide using DataFlirt to power products, research, and operations with reliable, structured web data.