The Daily Delhi

Delhi news, every day

News

Delhi's Heritage Documentation Drive Races to Fix Duplicate Image Crisis in City Archives

A data-cleaning push across municipal digitisation projects is exposing years of duplicated photographs and mislabelled records — and the race to fix it is intensifying this week.

By Delhi News Desk · Published 5 July 2026, 12:28 am

3 min read

Delhi's Heritage Documentation Drive Races to Fix Duplicate Image Crisis in City Archives
Photo: Photo by Shantanu Kumar on Pexels

Archivists and digital records officers working across Delhi's Municipal Corporation of Delhi (MCD) digitisation cells this week flagged a growing problem: thousands of duplicate images clogging the city's official heritage documentation databases, slowing down access to records tied to everything from building permits in Chandni Chowk to environmental compliance files along the Yamuna riverfront. The issue, long known internally but never publicly disclosed, is now being treated as an operational emergency after a mid-year audit review was circulated among department heads in late June 2026.

The timing matters because Delhi is at a critical juncture in three parallel documentation pushes. Delhi Metro Rail Corporation's Phase 4 corridor work — which cuts through culturally sensitive zones including Civil Lines and Janakpuri West — requires up-to-date photographic baselines for archaeological clearance. Separately, the Delhi Pollution Control Committee's air-quality monitoring program depends on geo-tagged site photography that is now compromised when duplicate or mismatched images are uploaded without proper deduplication checks. And the Archaeological Survey of India's ongoing survey of protected monuments in Mehrauli and Nizamuddin has been stalled at certain verification stages because image metadata errors have created redundant entries in shared federal-state databases.

How the Duplicate Problem Grew

The root cause, according to internal documentation reviewed this week, is the absence of a unified image registry across Delhi's civic agencies. When MCD, the Delhi Development Authority and state heritage bodies digitised their physical archives in separate waves — MCD began a major push in 2021, the DDA accelerated its own in 2023 — each agency used different naming conventions and different scanning vendors. Images of the same site were scanned multiple times, uploaded under different file names, and cross-referenced incorrectly. At Purana Qila and at sections of the Old Delhi walled city near Lal Darwaza, the same archival photograph appears, in some cases, under four distinct catalogue entries with conflicting date stamps.

Estimates circulating among records officers — drawn from a preliminary deduplication scan run in May 2026 — suggest that between 15 and 20 percent of entries in the shared state heritage image repository are duplicates or near-duplicates. That figure, if confirmed, would represent tens of thousands of records requiring manual review or automated scrubbing before the databases can be considered reliable for legal or planning purposes.

What's Being Done and What Comes Next

The MCD's Information Technology department, based out of offices near Indraprastha Estate, confirmed this week that a vendor has been shortlisted to deploy a perceptual hashing tool — a standard image-matching algorithm widely used in large-scale archiving — across the primary heritage image database. The contract, expected to be signed before the end of July 2026, would cover an initial batch of roughly 80,000 images. The Delhi Urban Art Commission, which maintains its own records of approved structural changes in sensitive zones, is understood to be watching the MCD process before deciding whether to integrate its own repository.

For residents and businesses dealing with heritage-zone clearances, the practical consequence is straightforward: processing times for documentation requests at offices serving areas like Shahjahanabad and the Walled City of Delhi may remain slower than usual through the month of July while the deduplication work proceeds. Those with pending applications tied to heritage-related NOCs have been advised — through a notice posted to the MCD's citizen services portal this week — to retain physical copies of all submitted photographs with original timestamps, since digital records may be temporarily inaccessible during database maintenance windows.

The longer fix requires what officials have been reluctant to commit to: a single, standardised image registry shared across MCD, DDA and ASI, with a common naming protocol enforced at the point of upload. Without that structural change, the same duplication problem will resurface each time a new digitisation drive is launched. The mid-year audit review, circulated internally in June, reportedly recommended exactly this kind of unified approach. Whether the three agencies — each operating under different administrative lines, two state-level and one central — can agree on a shared system before the next major documentation deadline in early 2027 is the question officials and archivists are now pressing hardest.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.