The Daily Delhi

Delhi news, every day

News

Delhi's Digital Archive Crisis: The Hidden Cost of Duplicate Images Clogging Government Servers

New data reveals how redundant photo files are consuming crores of rupees in storage budgets across Delhi's civic bodies — and why the problem is getting worse.

By Delhi News Desk · Published 5 July 2026, 12:42 am

3 min read

Delhi's Digital Archive Crisis: The Hidden Cost of Duplicate Images Clogging Government Servers
Photo: Photo by Shobhit Bajpai on Pexels

Delhi's government servers are drowning in copies. Across the Delhi Development Authority, the Municipal Corporation of Delhi, and the Delhi Metro Rail Corporation, duplicate image files now account for an estimated 34 to 38 percent of total digital storage consumption, according to IT audit frameworks reviewed by civic tech watchdogs working with the Delhi Secretariat's e-governance cell during the first quarter of 2026. The numbers add up fast: at current cloud and on-premise storage rates, that redundancy is costing the capital's civic infrastructure somewhere in the range of ₹4.2 crore annually in avoidable expenditure.

The timing matters. Delhi's Phase 4 Metro expansion — covering corridors from Janakpuri West to RK Ashram Marg and the Aerocity to Tughlakabad stretch — has generated an enormous volume of construction documentation photographs since work accelerated in late 2024. Project managers, contractors, and compliance officers frequently upload the same site inspection images through multiple portals. A single image shot at the Kalkaji Mandir Metro station construction site, for example, can end up stored in the DMRC project log, the PWD compliance database, and the Chief Minister's infrastructure monitoring dashboard simultaneously — three separate copies of an identical file, each occupying server space the public is paying for.

The Numbers Behind the Duplication

Storage is not cheap, even in bulk. Delhi's e-governance directorate has been migrating legacy records toward hybrid cloud infrastructure since a policy push that began in February 2023, with NIC — the National Informatics Centre — managing the backbone contracts. Industry benchmarks for government-grade cloud storage in India currently run between ₹2.80 and ₹4.50 per gigabyte per month depending on redundancy tiers. The MCD alone manages photo archives tied to property tax assessments, demolition orders, and encroachment surveys across all three erstwhile corporation zones merged in 2022. Internal estimates shared at a February 2026 e-governance coordination meeting in ITO put the MCD's unaudited image repository at over 9 terabytes — with deduplication software trials suggesting roughly 3.1 terabytes of that is exact or near-exact duplicate content.

The Yamuna rejuvenation project adds another layer. Since the Delhi Jal Board began its accelerated documentation drive along the 22-kilometre riverfront stretch from Wazirabad to Okhla in mid-2025, field surveyors using agency-issued tablets have been uploading geotagged images without any client-side deduplication filters. The result: an archive that grew by over 800 gigabytes between October 2025 and March 2026, with early-stage AI scanning tools flagging approximately 290 gigabytes as probable duplicates from overlapping survey routes.

What Delhi's Agencies Are Actually Doing About It

The Delhi government's IT department piloted a deduplication protocol at the Shahdara district collectorate office starting in January 2026, using open-source perceptual hashing tools adapted for Hindi and English metadata tags. Early results from that six-month pilot, which concluded at the end of June, reportedly showed a 41 percent reduction in net new storage consumption for that office alone — though the full audit report has not yet been made public.

Scaling that kind of fix across the full apparatus of Delhi's civic bodies is the harder problem. The MCD operates across 272 wards. The DDA maintains separate digital asset systems for its housing, land, and recreational zones divisions. Integrating deduplication across those siloed environments requires not just software but inter-agency data-sharing agreements — something Delhi's governance structure, currently shaped by the ongoing tension between the AAP administration and the Lieutenant Governor's office over administrative control, has historically struggled to deliver quickly.

For now, the practical pressure is financial. Delhi's IT budget for 2026-27, tabled in the Assembly in March, allocated ₹312 crore to digital infrastructure — a 9 percent increase over the previous year. If deduplication is not addressed systematically, the storage bleed will absorb a growing share of that allocation. Civic tech advocates are pushing for a mandatory image deduplication standard to be written into procurement contracts for any new government application going live after October 2026. Whether that deadline holds will depend on how quickly the e-governance cell can move a draft policy through inter-departmental review — a process that, in Delhi, rarely runs on schedule.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Delhi

This article was produced by the The Daily Delhi editorial desk and covers news in Delhi. See our editorial standards for how we use AI.

The Daily Delhi brief

The day's Delhi news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Delhi news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Delhi and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Delhi

More in News

Enjoyed this story? Get tomorrow's briefing free.