The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archive Crisis: What Happens Next and the Key Decisions Ahead

As thousands of duplicate and degraded images clog government databases and heritage records across the capital, officials face a hard deadline and harder choices about what gets saved and what gets deleted.

By Delhi News Desk · Published 5 July 2026, 12:55 am

3 min read

Delhi's Digital Archive Crisis: What Happens Next and the Key Decisions Ahead
Photo: Photo by Riven Apwbihls on Pexels

Delhi's municipal and heritage agencies are sitting on a data problem years in the making. Duplicate images — scanned twice, uploaded in error, or migrated improperly across legacy systems — now account for a significant share of the stored files inside at least three major civic databases, according to IT procurement documents reviewed by The Daily Delhi. The question is no longer how the redundancy happened. The question is who decides what stays.

The urgency is immediate. The Delhi Urban Shelter Improvement Board and the Archaeological Survey of India's Delhi Circle, both of which digitised large volumes of property and heritage records between 2019 and 2023, are facing storage contract renewals this financial year. Carrying bloated, duplicate-heavy archives into a new contract cycle means paying for space that holds nothing useful. Cutting without a verified replacement protocol means losing originals that were never properly catalogued in the first place.

The Audit Problem at Chandni Chowk and Beyond

The practical stakes are highest in Old Delhi, where digitisation drives covered thousands of property boundary photographs, structural survey images of havelis in Shahjahanabad, and documentation from the Chandni Chowk redevelopment corridor managed by the North Delhi Municipal Corporation. Field officers who conducted the 2021–22 survey round submitted images through at least two separate upload portals — one maintained by the corporation and one by a centrally sponsored scheme — creating a structural duplication problem from day one.

The Indira Gandhi National Centre for the Arts, based at Janpath, has run parallel digitisation work on cultural heritage objects from the same neighbourhoods. Cross-referencing between that archive and municipal records has not been completed. Until it is, no deletion protocol can be considered safe. Images labelled as duplicates in one system may be the only surviving high-resolution copy in another.

The Delhi Metro Rail Corporation's Phase 4 corridor documentation presents a separate but related case. Construction clearance photographs taken along the Janakpuri West to RK Ashram Marg stretch — images required under environmental and heritage compliance rules — were uploaded in batch processes that routinely created multiple copies of the same file. DMRC's internal IT wing has reportedly flagged the issue, but no public timeline for resolution has been announced.

The Three Decisions That Cannot Be Delayed

Agencies now face three concrete choices, and each carries political as well as administrative weight.

First, who runs the deduplication process? Automated hash-matching software can identify pixel-identical files quickly, but heritage images often exist in near-duplicate form — same subject, slightly different exposure or crop — where automated tools fail. That requires human review, which requires staffing and budget the Municipal Corporation of Delhi has not publicly allocated.

Second, what is the retention standard? Central government guidelines under the National Data Governance Framework, updated in 2023, set minimum retention periods for civic records but do not specify image resolution thresholds or duplication criteria. Delhi agencies are effectively writing their own rules in real time, with no binding standard to fall back on.

Third, what happens to the Yamuna cleanup documentation? The Delhi Jal Board has accumulated photographic records of riverbank survey work conducted under the Yamuna Action Plan Phase III, images that carry both administrative and political significance given the ongoing disputes between the AAP government and the Centre over cleanup accountability. Those records cannot be quietly pruned without someone in authority signing off — and right now, that sign-off has not materialised.

The storage contracts in question are understood to be up for renewal before the end of September 2026. That gives agencies roughly eleven weeks to establish a verified protocol, assign responsibility, and begin the actual deduplication work. If they miss that window, the cost of renewing bloated contracts falls on budgets already stretched by monsoon infrastructure repair across areas from Lajpat Nagar to Rohini. The data problem is, at its core, a governance problem. And governance problems in Delhi rarely resolve themselves on schedule.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.