Tens of thousands of duplicate images are clogging the digital records systems of Delhi's major civic bodies, and the people responsible for those systems are, at last, being asked to answer for it. The problem — long treated as a low-priority housekeeping matter — has surfaced in recent weeks as agencies racing to digitise heritage documents, urban planning maps, and public infrastructure records discover that redundant image files are consuming storage, inflating costs, and in some cases causing version-confusion that delays project approvals.
The issue matters now because Delhi is mid-sprint through several digitisation drives with real deadlines attached. The Archaeological Survey of India's Delhi Circle has been cataloguing photographic records of monuments across Mehrauli, Nizamuddin, and the Qutub Minar complex. Separately, the Municipal Corporation of Delhi launched a building-plan portal in 2024 that now holds uploaded architectural drawings from across all twelve zones — many submitted multiple times by the same applicants in different file sizes, generating duplicates that the system's current software cannot automatically flag or remove.
What the Institutions Are Saying
Officials at the Delhi Metro Rail Corporation have acknowledged in internal communications — described in presentations at a February 2026 smart-infrastructure seminar held at the India Habitat Centre in Lodhi Road — that their asset-management image library had accumulated redundancy rates requiring a dedicated data-hygiene audit. No official figure for the scale of duplication has been released publicly by DMRC. The corporation declined to provide a comment for this article.
At the level of state government, the Delhi government's IT department has been working since late 2025 on an updated procurement framework for cloud storage under its Delhi Cloud Policy. Officials familiar with the process — speaking in their institutional capacity at a public forum — have noted that duplicate-image proliferation is one of the documented inefficiencies the policy is meant to address. The Delhi Skill and Entrepreneurship University in Dwarka, which manages substantial volumes of student-record photography, has been cited in policy working papers as one of the bodies that would benefit from automated deduplication tools built into any new storage contract.
Experts in digital asset management point to a gap that is neither technical nor financial — it is procedural. India's Bureau of Indian Standards published IS 16433, a records management standard, in 2015, but adoption across municipal bodies has been inconsistent. Practitioners who presented at the National Institute of Urban Affairs conference in Delhi in March 2026 argued that without mandatory metadata tagging at the point of image upload, deduplication software alone cannot reliably distinguish between a duplicate and a legitimate updated version of the same document photograph.
The Cost of Inaction
Storage is not free. Commercial cloud pricing in India for government-tier contracts typically runs between ₹2 and ₹5 per gigabyte per month depending on the vendor and redundancy tier — figures drawn from publicly available government e-marketplace rate cards. For an agency holding several hundred terabytes of image data, even a 20 percent duplication rate translates into a meaningful and recurring expenditure. The MCD's building-plan portal, which went live in phases beginning in January 2024, was allocated a digitisation budget that officials described publicly at the time as being under pressure by its second year of operation.
The National Informatics Centre, which provides back-end infrastructure for many Delhi government portals including the Delhi District Courts' case-record systems based in Patiala House, has established internal deduplication guidelines. But those guidelines apply to text-based records more rigorously than to image files, a distinction that practitioners say needs to close.
Government bodies and institutions managing image-heavy digital archives should, according to guidance circulated by NIC this year, prioritise three steps: mandating SHA-256 hash-based duplicate checks at the point of upload; training records officers — not just IT staff — in image metadata standards; and scheduling quarterly audits rather than treating deduplication as a one-time migration task. For Delhi's agencies, the window to get ahead of the problem is narrowing as Phase 4 Metro stations bring new infrastructure photography requirements and as the ASI pushes toward full digital cataloguing of the capital's protected monuments by 2027.