The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archives Get a Long-Overdue Fix: How the City Is Tackling Its Duplicate Image Problem This Week

Government databases, heritage records, and metro expansion files have been clogging civic systems with redundant imagery — and a coordinated push launched this week aims to clean up the mess.

By Delhi News Desk · Published 5 July 2026, 12:41 am

3 min read

Delhi's Digital Archives Get a Long-Overdue Fix: How the City Is Tackling Its Duplicate Image Problem This Week
Photo: Photo by Himanshu Singh on Pexels

Delhi's public records infrastructure has a clutter problem. Thousands of duplicate digital images — spanning heritage documentation for Old Delhi's Shahjahanabad quarter, Yamuna riverfront survey photographs, and Delhi Metro Rail Corporation planning files — have been identified in civic databases this week, prompting an inter-agency effort to flag, consolidate, and delete the redundant files before they further slow down already strained government servers.

The timing matters. Delhi is midway through a monsoon-season push to digitise land and planning records under the Delhi Development Authority's broader e-governance initiative, and bloated image libraries are slowing upload speeds and creating version-control confusion between departments. With Phase 4 of the Delhi Metro expansion now in active construction across corridors including Janakpuri West to RK Ashram and Aerocity to Tughlakabad, the volume of survey and geo-tagged site photography flowing into shared government servers has spiked sharply in 2026.

What Happened This Week

The Delhi Urban Shelter Improvement Board and the Delhi State Spatial Data Infrastructure cell — both based out of the Vikas Minar complex on ITO — began a joint audit on Monday, July 1, targeting image repositories linked to the Yamuna Floodplain Rejuvenation Project. Early findings flagged significant duplication: multiple departments had independently uploaded identical drone survey images of the Okhla Barrage stretch and the Wazirabad reservoir zone, sometimes in three or four separate folders with different file names. The duplication is not unique to water-body surveys. Heritage documentation teams working under the Archaeological Survey of India's Delhi Circle — which oversees monuments from the Red Fort to Mehrauli's Qutb complex — have reported similar stacking of imagery in shared drives, with identical photographs of carved facades and inscriptions filed under variant project codes across multiple financial years.

The practical consequence is measurable. Server storage costs for municipal data centres, including the primary facility at Pragati Maidan's technology annex, have climbed as image libraries expanded without systematic deduplication protocols in place. A government tender document circulated in June 2026 cited a target of reducing redundant civic image data by 40 percent before October, ahead of a planned migration to a unified cloud platform. That October deadline is now being treated as a hard stop by the DDA's IT wing.

Why This Is Harder Than It Sounds

Deduplication sounds mechanical — run a hash-check, delete the copies — but Delhi's civic image problem is complicated by inconsistent metadata. Photographs taken at the same location on the same date arrive in different systems with different timestamps, file formats, and geotag precision levels, meaning automated tools flag them as distinct files. The Delhi Metro Rail Corporation's GIS team, which photographs every construction site along the Phase 4 Pink Line extension from Majlis Park toward Maujpur, has had to manually review hundreds of near-identical images of tunnel boring machine positions because automated tools couldn't reliably confirm they were duplicates.

The Indraprastha Institute of Information Technology Delhi, which has an ongoing research collaboration with city agencies on civic data management, has been brought in to test a perceptual hashing tool that compares images visually rather than byte-by-byte — an approach better suited to photographs taken seconds apart from slightly different angles. Pilot testing on a batch of 12,000 Yamuna survey images began on Thursday at IIIT-Delhi's Okhla campus.

For residents and civic researchers who rely on Delhi's open data portals — particularly the Delhi Open Data portal hosted at data.delhi.gov.in — the cleanup should eventually mean faster load times and more reliable search results when pulling ward-level or neighbourhood-level imagery. The DDA's IT wing has said the deduplication audit will run through July and August, with a public-facing update to the portal expected before the end of the third quarter. Departments are being asked to appoint a single image custodian for each active project, a structural fix that, if it holds, would prevent the same problem from resurging when Phase 5 planning begins.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.