The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archives Have a Duplicate Image Problem — And Officials Are Finally Talking About It

From the Municipal Corporation to the Delhi Metro Rail Corporation, administrators and archivists are debating how to fix a silent but costly data management crisis buried inside the capital's public databases.

By Delhi News Desk · Published 5 July 2026, 6:05 am

3 min read

Delhi's public record systems are quietly drowning in redundant imagery. Thousands of duplicate photographs — submitted through citizen grievance portals, stored in heritage documentation drives, and logged inside civic infrastructure databases — are clogging storage servers, inflating IT maintenance budgets, and, in some cases, pushing out authentic archival material that cannot easily be recovered. The problem has no dramatic headline, but administrators who manage digital systems for the Municipal Corporation of Delhi and the Delhi Metro Rail Corporation say it has become impossible to ignore.

The timing matters. The Delhi government has spent the past two years accelerating its push toward digital-first governance under Chief Minister Arvind Kejriwal's administration, channelling resources into portals such as the Doorstep Delivery Scheme platform and the e-District services hub. Every new intake pipeline that accepts image uploads — from property tax filings to Yamuna cleanup monitoring photographs — adds to a backlog that no single agency currently has a standardised protocol to manage. As Phase 4 of the Delhi Metro expansion progresses along corridors including the Janakpuri West to Krishna Park Extension stretch, construction documentation teams are generating thousands of site photographs each month, many of which get submitted multiple times by different contractors.

What Officials and Experts Are Pointing To

The conversation has sharpened in recent weeks. Archivists working with the Delhi State Archives on Shamnath Marg have raised concerns internally about the quality of the digital catalogue for Old Delhi heritage structures — buildings in Shahjahanabad, including the lanes around Chandni Chowk and Lal Qila, that were photographed extensively during a 2023-24 documentation drive. When the same image appears under three different file names and two different record entries, it muddies provenance, wastes retrieval time, and risks the wrong photograph being used in an official heritage report. Experts in digital asset management say the root cause is the absence of a hash-based deduplication standard — a technical protocol that flags identical or near-identical files at the point of upload rather than after the fact.

The MCD's IT directorate has reportedly been evaluating software solutions since early 2025, though no public tender has been issued as of this writing. The National Informatics Centre, which provides backend infrastructure for several Delhi government portals, maintains its own data quality guidelines, but those guidelines do not currently mandate image deduplication for state-level uploads. That gap is at the centre of the current debate.

Professionals in Delhi's growing GIS and urban data sector — several of whom work with institutions like the School of Planning and Architecture on Indraprastha Estate — argue that the fix is straightforward on paper but politically complicated. Standardising deduplication requires one agency to own the problem, and in Delhi's layered governance structure, where the MCD, the Delhi government, and central bodies often share overlapping mandates, ownership is rarely uncontested.

The Cost of Inaction

Storage is not cheap. Government cloud and on-premise server costs in India's public sector have risen sharply since 2022, with some procurement documents showing per-terabyte annual costs that make redundant data an actual fiscal line item. A database carrying 30 percent duplicate images — a figure that independent data audits in comparable municipal systems in Mumbai and Bengaluru have recorded in recent years — is not an abstract inefficiency. It is a budget drain and a legal liability when archived photographs are used in court proceedings or compensation assessments for projects like the Yamuna floodplain restoration work near Mayur Vihar.

The practical path forward, according to those familiar with the discussions, runs through two decisions. First, the Delhi government needs to issue a unified data quality standard that applies to all image-accepting portals, not just the most prominent ones. Second, the MCD and DMRC need to build deduplication checks into their contractor submission workflows before Phase 4 construction documentation scales further. Neither step requires new legislation. Both require an agency to move first. Right now, everyone agrees the problem is real. The question is who picks up the file.)

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.