Delhi's municipal and transport agencies are sitting on a sprawling archive problem. Tens of thousands of duplicate photographs — many mislabelled, some years out of date — clutter the digital databases of bodies including the Delhi Development Authority and the Delhi Metro Rail Corporation, slowing down everything from planning approvals to public-facing apps that commuters use to navigate the Phase 4 corridor expansion. The scale of the redundancy, which urban data specialists describe as a structural legacy of agencies digitising paper records in rapid, uncoordinated bursts across the 2010s, has prompted a quiet but consequential push to clean house.
The timing matters. Delhi Metro Phase 4 — stretching new lines through Janakpuri West, Tughlakabad and points in between — is generating fresh georeferenced image data at a rate that legacy systems were never designed to handle. When duplicate or conflicting images attach themselves to station-site records, project clearances slow. The problem is not hypothetical: civic tech observers note that image-deduplication failures in urban infrastructure databases have delayed land-acquisition paperwork in other Indian metros, compounding costs in projects already under political scrutiny.
What Delhi Is Actually Doing
The most visible effort is inside the DMRC's own GIS division, which since early 2025 has been running a hash-based deduplication protocol across its site-survey image library — a standard technique that flags photographs with identical or near-identical pixel fingerprints and queues them for manual review before archival. The DDA, for its part, launched a broader data-hygiene initiative under its Digital Delhi Master Plan 2041 framework, targeting its land-records image repository in areas including Rohini and Dwarka, where rapid construction in the 1990s and 2000s generated overlapping photographic surveys that were never reconciled.
Old Delhi presents a harder case. The heritage-preservation database maintained jointly by the Archaeological Survey of India and the Delhi Urban Heritage Foundation contains photographic records stretching back to the 1970s for monuments in and around Chandni Chowk and the Walled City. Duplicate images here are not merely an administrative nuisance — they can distort restoration budgets and misdirect conservation funding when a degraded photograph from 1987 is mistaken for current-condition evidence. As of June 2026, the DUHF has cleared roughly 40 percent of its pre-2000 image backlog through a combination of automated detection software and volunteer archivists, according to the foundation's published progress update.
How Delhi Compares Globally
Mumbai's Municipal Corporation launched a centralised image-management platform in late 2023 under its Digital Mumbai initiative, integrating deduplication at the point of upload rather than retrospectively — a structurally superior approach that Delhi's agencies have been slower to adopt. Seoul's Smart City Division went further, embedding AI-assisted image verification directly into its urban planning portal by 2022, meaning duplicate or low-resolution photographs are rejected before they enter any official record. São Paulo's city government, managing a similarly vast and historically fragmented archive, chose a different path: a phased outsourcing arrangement with a local civic-tech consortium that processed more than 2 million municipal photographs between 2023 and 2025, reducing its duplicate-image rate by a claimed 67 percent.
Delhi's agencies, by contrast, have largely worked in silos. The DMRC's approach does not yet integrate with the DDA's system, and neither connects to the ASI's separate heritage database. Urban data researchers who study South Asian municipal infrastructure have noted this fragmentation as a common pattern in cities where digitisation was driven by departmental budgets rather than city-wide architecture — a description that fits Delhi's administrative structure, which distributes significant powers between the AAP-led city government and bodies that report directly to the central government in New Delhi.
The practical stakes for ordinary Delhiites are modest but real. Commuters using the DMRC's official app have occasionally encountered station-entry photographs that show construction hoardings for sites now fully operational — a minor irritant that is nonetheless a symptom of the underlying image-duplication problem. For residents in Rohini or Dwarka dealing with DDA land queries, the delays are less trivial.
The DMRC has indicated that its deduplication project will extend to cover all Phase 4 corridor sites before the new lines open for commercial service, with completion targeted before the end of the 2026–27 financial year. Whether the DDA and ASI will align their systems with DMRC's before then is the more pressing open question for anyone who believes Delhi's digital infrastructure should function as a single coherent city, not three overlapping ones.