Delhi's public-facing digital infrastructure is carrying an estimated 40 to 60 percent rate of duplicate image files across major civic portals, according to an internal audit framework being piloted by the Delhi e-Governance Society this year. The redundancy is not trivial. Bloated databases slow grievance-response systems, push up cloud-storage costs, and in at least one case delayed the rollout of a key module of the Delhi One mobile app by several weeks in early 2026.
The timing matters. The Arvind Kejriwal administration has staked significant political capital on digital governance as a counterweight to pressure from the central BJP government over control of administrative services. A sprawling, inefficient data architecture undermines that argument. Cleaning it up is now an operational priority, not just a housekeeping exercise.
Where the Clutter Lives — and What It Costs
The problem concentrates in a handful of high-traffic systems. The Delhi Jal Board's citizen portal, which handles billing and complaint logging for roughly 3.5 million registered households, had accumulated more than 1.2 million image files as of a March 2026 internal review — a figure that officials familiar with the audit say includes a duplication rate above 50 percent. The Municipal Corporation of Delhi's property-tax database, accessible through its office on Lal Bahadur Shastri Marg in Civil Lines, has a similar structural problem: property photographs uploaded at the time of assessment are routinely re-uploaded during appeals, generating parallel records that are never reconciled.
Storage is not free. Delhi's government currently pays for cloud capacity under a contract administered through the National Informatics Centre. Industry benchmarks for comparable municipal deployments suggest that unmanaged duplication of this scale can inflate annual storage expenditure by 25 to 35 percent above what a deduplicated system would require. Applied to a mid-size civic cloud budget, that easily runs into crores of rupees per financial year.
The Delhi Metro Rail Corporation's Phase 4 expansion, which is adding corridors through Janakpuri West and Tughlakabad among other zones, has its own documentation pipeline. Construction-progress photographs, taken at multiple sites daily, feed into a project-management system that the DMRC has been building out since 2024. Engineers working on the project have noted that identical images captured by different site supervisors and uploaded separately represent a structural data-quality issue — one that makes audit trails harder to verify and inflates reported file counts.
Measuring the Gap, Planning the Fix
The Delhi e-Governance Society is piloting a deduplication protocol at two nodes: the Pragati Maidan integrated transit hub data centre and the Citizen Service Bureau cluster on Rajpur Road in Civil Lines. The pilot, launched in April 2026, uses perceptual hashing — a technique that identifies visually identical or near-identical images without needing to store the original file twice. Early results from the Pragati Maidan node reportedly show a 38 percent reduction in active image storage after the first pass.
The approach is not new globally. London's Transport for Urban Development systems and Seoul's Smart City operations centre both implemented similar deduplication pipelines before 2023. Delhi's version, however, must work across legacy systems that predate consistent metadata standards, which is the harder engineering challenge.
For residents, the practical downstream effects are real. Grievance applications filed through the Delhi government's Doorstep Delivery scheme — which processed more than 1.1 million service requests in the 2024-25 financial year — require photograph uploads as proof of address or property status. A cleaner database means faster verification, fewer rejections on technical grounds, and shorter wait times at the 1,800-plus Common Service Centres operating across the city.
The deduplication audit is scheduled to produce a full citywide report by September 2026, ahead of the next budget cycle. If the pilot numbers hold, the case for a system-wide rollout — covering the Delhi Jal Board, MCD, revenue department, and the education directorate's school-mapping database — becomes financially self-sustaining within two fiscal years. The political will is there. The engineering work is underway. What comes next depends on whether the September numbers match the April optimism.