The Daily Delhi

Delhi news, every day

News

Delhi's Duplicate Image Crisis: The Key Decisions That Will Determine What Comes Next

Thousands of heritage photographs and civic records held across Delhi's government archives face deletion or misclassification as authorities grapple with a backlog of duplicate digital images — and the choices made in the next few weeks will shape how the city preserves its past.

By Delhi News Desk · Published 5 July 2026, 12:46 am

3 min read

Delhi's Duplicate Image Crisis: The Key Decisions That Will Determine What Comes Next
Photo: Photo by Ranjeet Chauhan on Pexels

A significant data management problem has surfaced inside Delhi's civic digitisation drive. Duplicate images — some running into tens of thousands of scanned files — have accumulated across at least three major archival systems operated by the Delhi government, creating a bottleneck that threatens to delay the broader Phase 2 rollout of the city's Digital Delhi Records Initiative, which was scheduled to reach full operational status by September 2026.

The problem matters now because the stakes have risen sharply. The Delhi Metro Rail Corporation has been feeding construction survey photographs and land acquisition documents into a shared civic repository since early 2025 as part of Phase 4 expansion work running through Janakpuri West, Tughlakabad, and corridors toward the Aerocity interchange. When duplicate images pile up in that system, engineers and planners risk pulling the wrong version of a survey file — and in a project where land parcel boundaries are disputed, that is not a trivial administrative inconvenience.

Where the Duplicates Are Piling Up

Three institutions sit at the centre of the problem. The Delhi State Archives, housed in a building on Teen Murti Marg, holds the oldest photographic collection — glass-plate negatives and early twentieth-century prints that were scanned in batches between 2021 and 2024. Staff there say the scanning workflow generated automatic backup copies at three different resolution levels, and nobody built a deduplication step into the intake process. The result is that a single photograph of Chandni Chowk from 1932 may now exist in the system under a dozen separate file entries.

The Municipal Corporation of Delhi's property tax digitisation cell, operating out of offices near the Civic Centre on Jawaharlal Nehru Marg, faces a different version of the same issue. When ward-level offices uploaded property photographs to comply with a 2024 MCD directive on digital property cards, duplicate submissions arrived from overlapping ward boundaries — particularly around Karol Bagh and Paharganj, where jurisdiction lines shift mid-street.

The Delhi Heritage Society, a registered trust that has collaborated with the Archaeological Survey of India on documentation projects in Mehrauli and the Qutb Minar complex, flagged the duplication problem in a formal submission to the Delhi government's IT department in March 2026, noting that automated tagging systems were assigning conflicting metadata to identical images, effectively splitting the historical record in two.

The Decisions That Cannot Wait

Three choices are now pressing. First, authorities must decide whether to run a bulk automated deduplication — using hash-matching software that compares files byte-by-byte — or to conduct a manual review. Automated deduplication is faster but carries a known risk: images that look identical at the file level may carry different handwritten annotations on their reverse sides, information that was not captured during scanning. Losing those annotations permanently is a real possibility if the process is not supervised.

Second, the Delhi government must settle on a master repository architecture before September. Right now the Delhi State Archives, the MCD property cell, and the DMRC survey database are operating on separate servers with no shared access protocol. Officials from the Department of Information Technology have reportedly circulated at least two competing technical proposals, but no decision had been announced as of 4 July 2026.

Third, and most immediately, the Yamuna riverfront redevelopment project — which has generated thousands of before-and-after survey photographs along the 22-kilometre stretch from Wazirabad to Okhla — needs a clear instruction on which image set counts as the authoritative record for environmental baseline assessments. Court proceedings related to the Yamuna cleanup have already required documentary evidence submissions, and conflicting file versions in a government archive are the kind of detail that opposing lawyers notice.

The September deadline is firm. If deduplication protocols, a unified repository decision, and metadata standards are not locked down before Phase 2 goes live, Delhi risks entering a new digitisation cycle on top of an unresolved old one — compounding a problem that is already large enough to matter in courtrooms, construction sites, and the archive reading rooms where historians are still waiting for access they were promised two years ago.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.