Delhi's government digitisation drive has a dirty secret buried in its servers. Across municipal portals, land record systems, and public health databases maintained by agencies including the Delhi Development Authority and the Municipal Corporation of Delhi, duplicate image files account for an estimated 30 to 40 percent of total stored data — redundant scans of identity documents, property maps, and civic permits that clog systems, inflate cloud storage bills, and slow down the very public services they were meant to speed up.
The scale of the problem matters now because Delhi is in the middle of an aggressive push to complete digitisation ahead of Phase 4 of the Delhi Metro expansion, which requires fresh land acquisition records and environmental clearance documents to be processed online by early 2027. Every bottleneck in the document pipeline has a direct knock-on effect on construction timelines and compensation payouts for displaced residents in corridors running through Janakpuri, Tughlakabad, and RK Ashram Marg.
Where the Duplicates Come From
The problem is structural, not accidental. When a resident visits a Jan Seva Kendra — the citizen service centres operating out of localities from Lajpat Nagar to Rohini Sector 16 — staff typically scan the same supporting document multiple times across different application forms. A single Aadhaar card, for example, can be scanned and uploaded as many as four or five separate times for a single service request involving property mutation, ration card update, and caste certificate renewal. Across an estimated 250-plus active Jan Seva Kendras citywide, the multiplication is rapid.
The Delhi State Data Centre, hosted at the NIC facility in the Indraprastha Estate complex near the ITO intersection, holds records going back to the early 2000s digitisation push. IT administrators dealing with its storage architecture have flagged in internal reviews that deduplication software was never deployed as a standard tool across the MCD's citizen portal, meaning redundant files have been accumulating continuously for over 15 years. Storage costs alone for unoptimised government cloud contracts in India typically run between ₹3 and ₹6 per gigabyte per month under National Informatics Centre rates — and a mid-sized municipal system can accumulate several hundred terabytes of redundant image data in that time frame.
What Deduplication Actually Fixes
Replacing or consolidating duplicate images is not merely a housekeeping exercise. International benchmarks from comparable urban digitisation projects in cities like Nairobi and Jakarta show that aggressive deduplication reduces active storage loads by 25 to 35 percent and cuts average document retrieval times by close to half. For a city where a delayed property record can hold up a housing loan for a family in Dwarka or Mayur Vihar, retrieval speed is not an abstraction — it is a measurable service outcome.
The Delhi e-Governance Society, the nodal agency that oversees the city's digital infrastructure, has been developing a revised data management protocol since at least late 2024. The protocol would mandate hash-based image fingerprinting at point of upload — a technique that automatically identifies identical files before they are stored a second time. If fully deployed across all 11 district collector offices and the centralized MCD property tax portal, estimates based on comparable rollouts in Pune and Hyderabad suggest the city could reclaim between 40 and 60 terabytes of active storage in the first year alone.
For residents and civic advocates tracking Delhi's service delivery numbers, the practical signal to watch is the average time-to-approval for property mutation applications submitted through the Delhi portal. That figure, which the MCD publishes quarterly, sat at 23 working days as of the January-March 2026 reporting period. A successful deduplication rollout before the Delhi Metro Phase 4 land acquisition peak would be expected to push that figure below 15 working days — the threshold at which the Kejriwal administration has pledged to benchmark its digital governance record ahead of the next assembly cycle. The data infrastructure is invisible to most Delhiites. The delays it causes are not.