All-State RERA Data Aggregation — How a PropTech Platform Unified 28 State Portals
Industry: PropTech / Real Estate Data
Region: India (28 states + UTs)
Scale: 14M+ project data points
Engagement: Multi-State RERA Pipeline
Executive Summary
A PropTech platform serving home buyers, brokers, and lenders needed unified visibility into India's 32 state and union territory RERA portals. Before engaging Actowiz, the customer covered 4 states through manual updates. Within 7 months, Actowiz delivered automated coverage of 28 states and 4 union territories, with over 14 million project data points in the customer's database. The platform now powers builder risk scoring used by 3 NBFCs and 200+ brokers.
The Customer
The customer is a 5-year-old PropTech platform combining real estate listings, builder profiles, and compliance data. Its flagship product is a builder risk score used by lenders during home loan underwriting. The company was founded by ex-bankers and ex-real estate executives, has about 50 employees, and is growing 80% YoY.
The Challenge
Problem 1: 32 Portals, Each Different
India's RERA framework requires every state to maintain its own portal. Each portal has its own URL structure, search interface, data fields, CAPTCHA requirements, and update cadence. No two are alike. Manually keeping any 5 of them up to date was already a full-time job.
Problem 2: Patchy Coverage
The customer covered Maharashtra, Karnataka, Gujarat, and Telangana before engaging Actowiz. Together these accounted for 60% of project volume — but the gaps mattered. Their NBFC clients needed pan-India coverage, not just metros. Tier-2 city projects (Lucknow, Patna, Bhubaneswar, Indore) needed their own RERA reconciliation.
Problem 3: PDF QPRs
Quarterly Progress Reports (QPRs), the most valuable RERA data, are uploaded as PDFs on most state portals. Extracting structured data from those PDFs at scale was beyond the customer's internal capability.
Problem 4: Builder Identity Reconciliation
A builder operating in Mumbai, Bengaluru, and Hyderabad appears under separate registrations on 3 different state RERA portals. Without a canonical builder identity, the customer's risk scoring couldn't aggregate across projects. The customer's data team had spent 8 months trying to solve this and got it ~70% right.
Client Feedback
"Our NBFC partners were patient with us when we covered 4 states. They got impatient when one of their largest deals fell apart because we didn't have data for an Uttar Pradesh project. We had 3 weeks to deliver pan-India coverage or lose the contract. That's when we called Actowiz."
— Co-Founder & Chief Data Officer
The Solution in Four Steps
Step 1: All-State Crawler Inventory
Actowiz had pre-built crawlers for 24 of India's 32 state RERA portals — accumulated knowledge from earlier projects. The customer engagement focused on:
Adapting existing crawlers to the customer's specific schema
Building 8 new state crawlers (smaller states with newer portals)
CAPTCHA-solving infrastructure for portals requiring it (5 states)
Daily refresh cadence on changing data, weekly on stable data
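A per-portal registry is one way to picture how the refresh cadence and CAPTCHA routing above could be configured. The sketch below is illustrative only: the names PortalConfig, Cadence, is_due, and due_portals are hypothetical, the portal entries are placeholders, and nothing here reflects Actowiz's actual scheduler.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from enum import Enum
from typing import Dict, List


class Cadence(Enum):
    DAILY = 1   # portals whose data changes often
    WEEKLY = 7  # portals whose data is mostly stable


@dataclass(frozen=True)
class PortalConfig:
    name: str            # placeholder portal identifier
    cadence: Cadence     # how often the crawler revisits the portal
    needs_captcha: bool  # route requests through CAPTCHA-solving infrastructure


def is_due(portal: PortalConfig, last_crawled: date, today: date) -> bool:
    """Return True once the portal's refresh interval has elapsed."""
    return today - last_crawled >= timedelta(days=portal.cadence.value)


def due_portals(registry: List[PortalConfig],
                last_crawled: Dict[str, date],
                today: date) -> List[str]:
    """List the portals that should be crawled today, in registry order."""
    return [p.name for p in registry if is_due(p, last_crawled[p.name], today)]


# Placeholder entries; cadence and CAPTCHA flags are assumptions, not real portal facts.
REGISTRY = [
    PortalConfig("STATE_A", Cadence.DAILY, needs_captcha=True),
    PortalConfig("STATE_B", Cadence.WEEKLY, needs_captcha=False),
]
```

Calling due_portals(REGISTRY, last_crawled, date.today()) with a dictionary of last crawl dates returns the portal names whose interval has elapsed, which a scheduler could then dispatch to the matching crawler.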
Step 2: PDF QPR Extraction
Custom OCR + parsing pipeline extracts structured data from quarterly progress reports. Output schema:
Construction stage (foundation / superstructure / finishing / completed)
% completion claimed by builder
Number of units sold vs total
Booking advance collected
Cost incurred to date vs project cost
Material and labour utilization
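The output schema listed above could be represented as a typed record with a few sanity checks run after extraction. This is a minimal sketch under assumptions: the field names, the rupee units, and the validation rules are hypothetical and were chosen for illustration, not taken from the actual pipeline.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Stage(Enum):
    FOUNDATION = "foundation"
    SUPERSTRUCTURE = "superstructure"
    FINISHING = "finishing"
    COMPLETED = "completed"


@dataclass
class QprRecord:
    stage: Stage
    pct_complete_claimed: float       # 0-100, as claimed by the builder
    units_sold: int
    units_total: int
    booking_advance_collected: float  # assumed to be in rupees
    cost_incurred: float              # assumed to be in rupees, to date
    project_cost: float               # assumed to be in rupees, total
    material_labour_utilization: Optional[float] = None  # unit not given in the source

    def validate(self) -> List[str]:
        """Return a list of problems found; an empty list means the record looks sane."""
        problems = []
        if not 0 <= self.pct_complete_claimed <= 100:
            problems.append("pct_complete_claimed outside 0-100")
        if self.units_sold > self.units_total:
            problems.append("units_sold exceeds units_total")
        if self.stage is Stage.COMPLETED and self.pct_complete_claimed < 100:
            problems.append("stage is completed but claimed completion is below 100")
        return problems
```

Records that fail validation are good candidates for manual review, since OCR errors in scanned PDFs often surface as impossible values such as more units sold than exist.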
Step 3: Builder Identity Reconciliation
Multi-state builder identity reconciliation used:
PAN number (where disclosed)
Director list overlap (DIN matching)
Address fingerprinting
Project portfolio similarity
Result: 92% accuracy on multi-state builder reconciliation, validated against the customer's NBFC partners' independent records.
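To make the matching signals concrete, here is a simplified sketch of how three of them (PAN, director overlap, address fingerprint) might be combined into a pairwise match decision. The Registration type, the 0.5 overlap threshold, and the order in which signals are checked are all assumptions made for illustration; portfolio similarity is left out for brevity, and the production logic behind the 92% figure is not described in this case study.

```python
import hashlib
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Registration:
    state: str
    name: str
    address: str
    pan: Optional[str] = None     # PAN, where the portal discloses it
    dins: frozenset = frozenset() # director identification numbers


def address_fingerprint(address: str) -> str:
    """Hash the sorted set of lowercase alphanumeric tokens, ignoring order and punctuation."""
    tokens = sorted(set(re.findall(r"[a-z0-9]+", address.lower())))
    return hashlib.sha1(" ".join(tokens).encode("utf-8")).hexdigest()


def jaccard(a: frozenset, b: frozenset) -> float:
    """Share of elements common to both sets; 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def same_builder(x: Registration, y: Registration, din_threshold: float = 0.5) -> bool:
    """Treat two registrations as one builder if any single signal matches."""
    if x.pan and x.pan == y.pan:
        return True  # identical PAN is taken as decisive
    if jaccard(x.dins, y.dins) >= din_threshold:
        return True  # enough directors in common (threshold is an assumption)
    return address_fingerprint(x.address) == address_fingerprint(y.address)
```

Pairwise matches like these would typically be merged into clusters (for example with a union-find structure) so that every registration in a cluster maps to one canonical builder record.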
Step 4: Builder Risk Score Pipeline
Beyond raw data delivery, Actowiz built scoring inputs:
Historical project completion ratio (% completed on time, % delayed by up to 6 months, % delayed by more than 6 months)
Complaints filed against builder (count, type, resolution status)
Litigation against builder (count, severity)
Geographic concentration (single-city vs multi-city operations)
Vintage (years operating)
Project portfolio size and diversity
The customer combined these inputs with their proprietary risk model to produce a unified Builder Risk Score that NBFCs trusted.
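As an illustration of one scoring input, the historical completion ratio can be computed by bucketing each finished project by how late it was delivered. The sketch below assumes each project is a pair of promised and actual completion dates and counts any partial month of delay as a full month; the function names and that rounding rule are hypothetical choices, not the customer's model.

```python
from collections import Counter
from datetime import date
from typing import Dict, Iterable, Tuple

BUCKETS = ("on_time", "within_6_months", "beyond_6_months")


def months_late(promised: date, actual: date) -> int:
    """Calendar months of delay, rounding any partial month up; 0 if on time."""
    if actual <= promised:
        return 0
    months = (actual.year - promised.year) * 12 + (actual.month - promised.month)
    if actual.day > promised.day:
        months += 1  # a partial month counts as a full month late
    return max(months, 1)


def bucket(promised: date, actual: date) -> str:
    late = months_late(promised, actual)
    if late == 0:
        return "on_time"
    return "within_6_months" if late <= 6 else "beyond_6_months"


def completion_ratios(projects: Iterable[Tuple[date, date]]) -> Dict[str, float]:
    """Fraction of projects in each delay bucket; all zeros for an empty portfolio."""
    counts = Counter(bucket(p, a) for p, a in projects)
    total = sum(counts.values())
    return {b: (counts[b] / total if total else 0.0) for b in BUCKETS}
```

The three fractions sum to 1 for any non-empty portfolio, so they can feed a downstream risk model directly as features.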
Results — Year 1
14M+ project data points
28 states: pan-India coverage
92% builder reconciliation accuracy
3 NBFCs as production users
Pan-India Coverage Delivered
Within 7 months, the customer had unified data from 28 states + 4 UTs. The NBFC partner that had previously walked away returned and signed a 3-year data licensing agreement. Two additional NBFCs signed in months 8-9.
Builder Risk Scoring Adopted
The customer's Builder Risk Score is now used in 200+ broker workflows and 3 NBFC underwriting pipelines. An estimated 25,000+ home loans per year reference the score in some form.
Data Volume
The database holds 14M+ project-level data points and 800K+ builder records. Changing data is updated daily, with a full refresh every quarter. The customer's database is now considered one of the most comprehensive RERA datasets in India outside the regulatory bodies themselves.
Revenue Impact
Data licensing revenue grew from ₹2.4 Cr (pre-engagement) to ₹14 Cr (annualized) by month 12. The customer's PropTech platform now earns more from data licensing than from advertising — a structural shift enabled by data depth.
Client Feedback
"Before Actowiz, we were a real estate listing site that sold ads. After Actowiz, we are a real estate intelligence company that licenses data to banks. The difference is everything."
— CEO
Engagement Economics
State Portals Covered: 28 states + 4 union territories
Refresh Cadence: daily (high-change portals), weekly (stable portals)
QPR PDFs Processed: approximately 120,000 PDFs per quarter
Builder Records: 800,000+ canonical builder identities
Project Records: 14 million+ data points
CAPTCHA-Solving Infrastructure: required for 5 portals
Why It Worked
Pre-built crawler inventory (24/32 states already done) — massive time savings
Multi-state builder reconciliation cracked the most valuable analytics use case
PDF extraction unlocked data competitors couldn't access
Customer kept the risk scoring IP — Actowiz delivered the data plumbing
