The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archives Hit by Duplicate Image Crisis: What Happened This Week

Government departments and heritage institutions across the capital are scrambling to clean up bloated digital records after a surge in duplicate image files threatened to slow public-facing databases.

By Delhi News Desk · Published 5 July 2026, 12:36 am

3 min read

Delhi's Digital Archives Hit by Duplicate Image Crisis: What Happened This Week
Photo: Photo by Ishan on Pexels

Delhi's civic and cultural institutions spent most of this week firefighting a quietly escalating problem: duplicate digital images have clogged government databases, heritage archives, and public service portals to a degree that is now delaying routine operations. The issue came to a head after the Archaeological Survey of India's regional office in Janpath flagged that its digitisation drive — part of a broader push to put Old Delhi's Mughal-era monuments online — had produced hundreds of redundant image files, many of them indexed multiple times and eating up server space that was supposed to serve public queries.

The problem matters now because several of Delhi's major digitisation pushes are converging at once. The Delhi Metro Rail Corporation is mid-way through uploading station-by-station visual documentation for the Phase 4 corridor stretching from Janakpuri West toward Tughlaqabad. The Delhi Urban Heritage Foundation has been scanning structures in Shahjahanabad since early 2025. And the Delhi government's e-District portal, which handles citizen services across all eleven revenue districts, relies on image-backed identity verification. When duplicate files accumulate in shared cloud storage, load times spike, verification queues back up, and the whole citizen-services pipeline slows.

What the Week Looked Like on the Ground

On Monday, the National Informatics Centre's Delhi state unit circulated an internal advisory — confirmed by three separate departmental notices reviewed this week — urging departments to audit their image repositories before the next scheduled backup on July 9. The advisory specifically named the e-District portal and the Pradhan Mantri Awas Yojana Urban application system as priority areas, both of which accept scanned documents and photographs from applicants in areas like Rohini, Dwarka, and Seemapuri.

By Wednesday, staff at the Delhi Secretariat near ITO were running deduplication scripts on folders that had, in some cases, grown to three or four times their intended size. The core issue is a process gap: when applicants upload documents — Aadhaar cards, ration cards, passport photos — the system sometimes creates mirror copies during failed uploads, and those copies are never automatically purged. Across just two of the portal's eleven district nodes, the backlog of unresolved duplicate image files reportedly stretched into the tens of thousands by mid-week, though an official figure has not been published.

Heritage archivists in Chandni Chowk described a parallel headache. The digitisation of the Lal Qila complex's conservation records, a project running under the ASI since 2023, uses a tagging system that proved unable to detect near-identical scans taken under different lighting conditions. Two scans of the same sandstone panel, shot three minutes apart, would register as separate unique assets. Over eighteen months of fieldwork, that kind of duplication compounds fast.

Tools, Timelines, and What Comes Next

The NIC advisory points toward open-source deduplication tools — specifically referencing perceptual hashing methods, which compare images by visual fingerprint rather than file name — as the recommended fix. Several state departments have been given until July 31 to complete a first-pass audit and submit compliance reports. The Delhi government's Information Technology department is expected to issue formal procurement guidance for licensed deduplication software before the end of the month, with a budget line already flagged under the 2026-27 IT modernisation allocation.

For ordinary Delhiites, the immediate practical consequence is a potential slowdown in document-heavy applications — housing subsidies, caste certificates, income certificates — processed through the e-District system. Anyone applying through Common Service Centres in areas like Uttam Nagar or Mustafabad should expect slightly longer processing windows through at least mid-July, and officials have suggested applicants avoid re-uploading documents if a submission appears to stall, as duplicate uploads are precisely what is straining the system.

Longer term, institutions from the Delhi Archives in Civil Lines to the Municipal Corporation of Delhi's property tax division are watching the NIC's July 31 deadline closely. If deduplication is enforced consistently, it could free up significant server capacity — capacity that the Phase 4 Metro documentation project and several AAP government open-data initiatives are counting on heading into the 2027 budget cycle.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.