The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archives Are Flooded With Duplicate Images — Officials and Experts Are Finally Talking About It

From the Delhi Metro Rail Corporation to the Municipal Corporation of Delhi, government bodies are grappling with a costly and chaotic problem hiding inside their own servers.

By Delhi News Desk · Published 5 July 2026, 12:30 am

3 min read

Delhi's Digital Archives Are Flooded With Duplicate Images — Officials and Experts Are Finally Talking About It
Photo: Photo by The Vanity Photography Co. on Pexels

Tens of thousands of duplicate images are clogging the digital records systems of Delhi's major civic bodies, and the people responsible for those systems are, at last, being asked to answer for it. The problem — long treated as a low-priority housekeeping matter — has surfaced in recent weeks as agencies racing to digitise heritage documents, urban planning maps, and public infrastructure records discover that redundant image files are consuming storage, inflating costs, and in some cases causing version-confusion that delays project approvals.

The issue matters now because Delhi is mid-sprint through several digitisation drives with real deadlines attached. The Archaeological Survey of India's Delhi Circle has been cataloguing photographic records of monuments across Mehrauli, Nizamuddin, and the Qutub Minar complex. Separately, the Municipal Corporation of Delhi launched a building-plan portal in 2024 that now holds uploaded architectural drawings from across all twelve zones — many submitted multiple times by the same applicants in different file sizes, generating duplicates that the system's current software cannot automatically flag or remove.

What the Institutions Are Saying

Officials at the Delhi Metro Rail Corporation have acknowledged in internal communications — described in presentations at a February 2026 smart-infrastructure seminar held at the India Habitat Centre in Lodhi Road — that their asset-management image library had accumulated redundancy rates requiring a dedicated data-hygiene audit. No official figure for the scale of duplication has been released publicly by DMRC. The corporation declined to provide a comment for this article.

At the level of state government, the Delhi government's IT department has been working since late 2025 on an updated procurement framework for cloud storage under its Delhi Cloud Policy. Officials familiar with the process — speaking in their institutional capacity at a public forum — have noted that duplicate-image proliferation is one of the documented inefficiencies the policy is meant to address. The Delhi Skill and Entrepreneurship University in Dwarka, which manages substantial volumes of student-record photography, has been cited in policy working papers as one of the bodies that would benefit from automated deduplication tools built into any new storage contract.

Experts in digital asset management point to a gap that is neither technical nor financial — it is procedural. India's Bureau of Indian Standards published IS 16433, a records management standard, in 2015, but adoption across municipal bodies has been inconsistent. Practitioners who presented at the National Institute of Urban Affairs conference in Delhi in March 2026 argued that without mandatory metadata tagging at the point of image upload, deduplication software alone cannot reliably distinguish between a duplicate and a legitimate updated version of the same document photograph.

The Cost of Inaction

Storage is not free. Commercial cloud pricing in India for government-tier contracts typically runs between ₹2 and ₹5 per gigabyte per month depending on the vendor and redundancy tier — figures drawn from publicly available government e-marketplace rate cards. For an agency holding several hundred terabytes of image data, even a 20 percent duplication rate translates into a meaningful and recurring expenditure. The MCD's building-plan portal, which went live in phases beginning in January 2024, was allocated a digitisation budget that officials described publicly at the time as being under pressure by its second year of operation.

The National Informatics Centre, which provides back-end infrastructure for many Delhi government portals including the Delhi District Courts' case-record systems based in Patiala House, has established internal deduplication guidelines. But those guidelines apply to text-based records more rigorously than to image files, a distinction that practitioners say needs to close.

Government bodies and institutions managing image-heavy digital archives should, according to guidance circulated by NIC this year, prioritise three steps: mandating SHA-256 hash-based duplicate checks at the point of upload; training records officers — not just IT staff — in image metadata standards; and scheduling quarterly audits rather than treating deduplication as a one-time migration task. For Delhi's agencies, the window to get ahead of the problem is narrowing as Phase 4 Metro stations bring new infrastructure photography requirements and as the ASI pushes toward full digital cataloguing of the capital's protected monuments by 2027.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.