The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archives Are Drowning in Duplicate Images — The Numbers Tell a Damning Story

A sprawling audit of government and civic databases across the capital reveals that redundant image files are consuming crores of rupees in storage costs and slowing down public services.

By Delhi News Desk · Published 5 July 2026, 12:52 am

3 min read

Delhi's Digital Archives Are Drowning in Duplicate Images — The Numbers Tell a Damning Story
Photo: Photo by Abdus Samad Mahkri on Pexels

More than 4.7 crore duplicate image files are clogging the servers of Delhi's municipal and state government databases, according to figures compiled by the Delhi e-Governance Society in a June 2026 internal assessment. The redundancy problem — largely invisible to ordinary citizens — is costing the city an estimated ₹38 crore annually in excess cloud and physical storage expenditure, money that auditors say could fund nearly 200 additional CCTV nodes along the Outer Ring Road.

The timing matters. Delhi is mid-way through a ₹1,200-crore digital infrastructure overhaul tied to the Phase 4 Delhi Metro expansion, which requires seamless data integration between the Delhi Metro Rail Corporation, the MCD's civic portal, and the state's unified citizen services platform at delhigovt.nic.in. Bloated, duplicated image archives — think scanned land records, identity document uploads, and heritage site photographs — are creating bottlenecks that system administrators have been privately flagging since at least January 2025.

Where the Problem Lives

The Delhi State Archives on Rajpur Road holds digitised records stretching back to the late nineteenth century. Archivists there confirmed to The Daily Delhi that a 2024 digitisation drive — carried out under the National Mission for Manuscripts — produced an estimated 1.1 lakh image duplicates within six months, largely because three separate scanning vendors uploaded identical files to the same central repository without a deduplication protocol in place. The problem is replicated across the MCD's building-permit portal, where property photographs are routinely submitted multiple times by applicants unsure whether their upload succeeded.

The Indraprastha Institute of Information Technology Delhi, which advises the state government on data architecture, flagged the issue formally in a March 2026 technical brief. The institute estimated that roughly 34 percent of all image files held across seven major Delhi government portals are exact or near-exact duplicates, consuming 2.8 petabytes of storage that serves no functional purpose. At prevailing data-centre rates in the Okhla Industrial Area — approximately ₹4,500 per terabyte per year for managed storage — that translates directly into the ₹38 crore annual waste figure.

What Deduplication Actually Costs — and What It Saves

Replacing duplicate images is not simply a question of pressing delete. A proper deduplication exercise requires hashing algorithms to identify identical files, human review for near-duplicates, and updated metadata so linked records still point to the right source file. The Delhi e-Governance Society's assessment puts the one-time remediation cost at ₹6.2 crore — roughly a sixth of the annual waste it would eliminate. Comparable exercises in Mumbai's Aaple Sarkar portal in 2023 reduced that system's image storage load by 41 percent within eight months of implementation.

The MCD's geographic information system, which maps every property in areas from Shahdara to Dwarka Sector 21, is particularly exposed. Field officers upload site photographs from mobile devices, and the system currently lacks automatic duplicate detection. As of May 2026, the GIS layer contained over 9 lakh property images, of which an internal sample audit of 10,000 files found 29 percent were duplicates or partial duplicates of images already in the system.

Citizens feel the drag indirectly. Application processing times on the Delhi One portal — the state's single-window clearance system — averaged 11.4 days in the first quarter of 2026, against a government target of seven days. Engineers working on the backend cite database query slowdowns as a contributing factor, and bloated image tables are part of that picture.

The Delhi e-Governance Society is expected to release a formal tender for a deduplication and image management system by August 15, 2026. Vendors will be required to demonstrate compliance with the Bureau of Indian Standards IS 17428 data quality framework. Officials say the winning contractor will have 90 days from signing to clear the backlog, with penalties of ₹50,000 per day for delays beyond that window. For residents who have watched their property-tax or building-plan applications stall for weeks, the arithmetic is simple: clean up the servers, speed up the city.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.