Delhi's municipal and heritage agencies are sitting on a data problem years in the making. Duplicate images — scanned twice, uploaded in error, or migrated improperly across legacy systems — now account for a significant share of the stored files inside at least three major civic databases, according to IT procurement documents reviewed by The Daily Delhi. The question is no longer how the redundancy happened. The question is who decides what stays.
The urgency is immediate. The Delhi Urban Shelter Improvement Board and the Archaeological Survey of India's Delhi Circle, both of which digitised large volumes of property and heritage records between 2019 and 2023, are facing storage contract renewals this financial year. Carrying bloated, duplicate-heavy archives into a new contract cycle means paying for space that holds nothing useful. Cutting without a verified replacement protocol means losing originals that were never properly catalogued in the first place.
The Audit Problem at Chandni Chowk and Beyond
The practical stakes are highest in Old Delhi, where digitisation drives covered thousands of property boundary photographs, structural survey images of havelis in Shahjahanabad, and documentation from the Chandni Chowk redevelopment corridor managed by the North Delhi Municipal Corporation. Field officers who conducted the 2021–22 survey round submitted images through at least two separate upload portals — one maintained by the corporation and one by a centrally sponsored scheme — creating a structural duplication problem from day one.
The Indira Gandhi National Centre for the Arts, based at Janpath, has run parallel digitisation work on cultural heritage objects from the same neighbourhoods. Cross-referencing between that archive and municipal records has not been completed. Until it is, no deletion protocol can be considered safe. Images labelled as duplicates in one system may be the only surviving high-resolution copy in another.
The Delhi Metro Rail Corporation's Phase 4 corridor documentation presents a separate but related case. Construction clearance photographs taken along the Janakpuri West to RK Ashram Marg stretch — images required under environmental and heritage compliance rules — were uploaded in batch processes that routinely created multiple copies of the same file. DMRC's internal IT wing has reportedly flagged the issue, but no public timeline for resolution has been announced.
The Three Decisions That Cannot Be Delayed
Agencies now face three concrete choices, and each carries political as well as administrative weight.
First, who runs the deduplication process? Automated hash-matching software can identify pixel-identical files quickly, but heritage images often exist in near-duplicate form — same subject, slightly different exposure or crop — where automated tools fail. That requires human review, which requires staffing and budget the Municipal Corporation of Delhi has not publicly allocated.
Second, what is the retention standard? Central government guidelines under the National Data Governance Framework, updated in 2023, set minimum retention periods for civic records but do not specify image resolution thresholds or duplication criteria. Delhi agencies are effectively writing their own rules in real time, with no binding standard to fall back on.
Third, what happens to the Yamuna cleanup documentation? The Delhi Jal Board has accumulated photographic records of riverbank survey work conducted under the Yamuna Action Plan Phase III, images that carry both administrative and political significance given the ongoing disputes between the AAP government and the Centre over cleanup accountability. Those records cannot be quietly pruned without someone in authority signing off — and right now, that sign-off has not materialised.
The storage contracts in question are understood to be up for renewal before the end of September 2026. That gives agencies roughly eleven weeks to establish a verified protocol, assign responsibility, and begin the actual deduplication work. If they miss that window, the cost of renewing bloated contracts falls on budgets already stretched by monsoon infrastructure repair across areas from Lajpat Nagar to Rohini. The data problem is, at its core, a governance problem. And governance problems in Delhi rarely resolve themselves on schedule.