Delhi's official photographic archive — assembled across more than a decade of digitisation drives by the Delhi Urban Art Commission and the Archaeological Survey of India's Delhi Circle — contains an estimated tens of thousands of duplicate image files that are clogging storage systems, distorting public records and, archivists warn, crowding out authentic documentation of the city's most contested heritage zones. The problem has now reached a decision point. With Phase 4 of the Delhi Metro expansion pushing new construction through neighbourhoods from Janakpuri West to Lajpat Nagar, the demand for accurate, deduplicated visual documentation of affected streetscapes has never been more urgent.
The stakes are not abstract. When a road-widening project along Chandni Chowk's main spine was audited in 2023, civic technicians reportedly found that multiple versions of the same survey photograph had been filed under different reference numbers, making it impossible to establish a clean before-and-after evidentiary record. That kind of administrative confusion has direct consequences — for heritage protection claims, for compensation disputes and for the public record that residents of Shahjahanabad's walled-city lanes rely on when challenging demolition orders.
Why Deduplication Decisions Cannot Wait
The timing is compounded by two converging pressures. First, the Delhi government's own Smart Cities Digital Heritage Programme, which operates under the Urban Development Ministry, has a stated mandate to publish a unified open-access image repository by the end of financial year 2026-27. Second, the Yamuna Riverfront Redevelopment project — a politically charged initiative that spans the ghats from Nigambodh to the Signature Bridge corridor — is generating fresh photographic surveys every week. Without a deduplication protocol locked in before those images enter the main archive, the same problem will replicate itself at scale.
Deduplication is not technically complex. Perceptual hashing tools — software that generates a fingerprint for each image and flags near-identical files — are freely available and widely used by institutions from the Library of Congress to the British Library. The Municipal Corporation of Delhi's IT wing piloted one such tool, pHash, on a subset of Mehrauli Archaeological Park survey files in late 2024. But the pilot covered fewer than 4,000 images and no formal policy followed. The archive as a whole remains unprocessed.
The real decisions are institutional, not technical. Who holds the master repository — the Delhi government's Directorate of Archaeology, the DUAC, or the ASI's Delhi Circle? Which duplicate gets designated the canonical file when two agencies have independently scanned the same object? And crucially, who pays for the staff time to review flagged duplicates that automated tools cannot resolve, particularly for images of Qutb Minar complex carvings or the Red Fort's internal courtyards, where subtle differences in angle or lighting carry genuine documentary value?
What the Next Six Months Look Like
Three decisions will define whether the archive stabilises or deteriorates further before the 2026-27 deadline. The first is a governance call: the Delhi Cabinet's Heritage Committee needs to formally designate a single lead agency with write-access authority over the master database. That meeting, according to the committee's published schedule, is due before the end of July 2026.
The second decision is procurement. Running a full deduplication sweep across a large municipal archive, with human review of ambiguous cases, typically costs between ₹15 lakh and ₹40 lakh depending on volume and contractor rates in the NCR market — a modest line item against the Metro Phase 4 civil works budget, but one that requires a specific tender to be floated. The MCD IT wing has not yet published that tender as of this week.
The third decision is public access. Once duplicates are removed or tagged, the surviving canonical images need to be accessible — not locked in a departmental server on ITO's Indraprastha Estate. Advocates working on Old Delhi conservation have long argued that open access to survey photographs is the most effective check on undocumented demolitions in areas like Ballimaran and Matia Mahal. Whether the unified repository will be genuinely searchable by residents, journalists and lawyers, or will remain an internal bureaucratic tool, is still unresolved. That choice, more than any algorithm, will determine what the archive is actually for.