Delhi's sprawling network of government databases is carrying a quiet, expensive problem: thousands of duplicate images clogging official records, from civic agency portals to transit authority archives, slowing systems and inflating storage costs. Administrators across multiple departments say the issue has reached a point where it can no longer be ignored.
The pressure to act is coming partly from the scale of digitisation that has happened fast. The Municipal Corporation of Delhi launched its Unified MCD portal after the three-way merger was formalised in 2022, inheriting image records from three separate bodies simultaneously. The Delhi Metro Rail Corporation, meanwhile, has been cataloguing construction documentation for Phase 4 expansion across corridors stretching from Janakpuri West to RK Ashram Marg. In both cases, technologists working on the projects say redundant image files have accumulated in the tens of thousands, consuming server capacity and complicating search and retrieval.
The timing matters because Delhi is mid-cycle on several high-visibility digitisation drives. The Delhi government's e-District portal — which handles citizen service requests across all eleven revenue districts — processes document image uploads daily. Duplicate entries in that system do not merely waste storage; they can attach incorrect or outdated photographs to citizen records, creating verification problems that delay services from ration card renewals to property mutation approvals.
What Officials and Technical Experts Are Saying
Officials at the Delhi Secretariat have acknowledged in public-facing administrative notices that image deduplication has been identified as a priority under the city's broader IT consolidation plan, though no firm implementation deadline has been publicly announced. Experts in civic technology who have advised Delhi government bodies describe the problem in blunt terms: legacy procurement meant different departments bought different content management systems with no shared image hashing standards, so the same photograph of, say, a building in Shahjahanabad's heritage zone could exist in thirty different formats across three separate portals.
The heritage angle carries particular weight. The Delhi Urban Art Commission and the Archaeological Survey of India both maintain photographic inventories of monuments concentrated in areas like Mehrauli and the Walled City of Old Delhi. Conservationists working in that corridor say inconsistent image management across these bodies means field officers sometimes cannot determine whether a damage photograph is recent or years old — a gap with real consequences for restoration decisions.
On the transit side, DMRC's Phase 4 documentation requirements are substantial. The Janakpuri West–RK Ashram Marg corridor alone spans roughly 28.92 kilometres and involves construction at dozens of sites, each generating inspection photographs. Engineers familiar with large infrastructure projects say duplicate image accumulation in project management systems is a near-universal problem when multiple contractors upload documentation through a single portal without automated deduplication checks at the point of ingestion.
The Cost Question and What Comes Next
Storage is not cheap at government scale. Cloud and hybrid data centre pricing in India's public sector, under agreements routed through the National Informatics Centre, runs on per-gigabyte models where redundant data translates directly into avoidable expenditure. Civic technology researchers who have studied MCD's digitisation programme estimate that image duplication rates in merged-entity databases commonly run between 15 and 30 percent of total image volume — figures that, applied to the MCD's post-merger archive, represent a substantial and recoverable cost.
The practical path forward, according to technologists who have worked on similar consolidations for other Indian state governments, involves deploying perceptual hashing algorithms that can identify near-identical images — not just exact byte-for-byte duplicates — and flagging them for human review before automated removal. Several open-source tools capable of this have been adapted for Hindi-language metadata environments.
For Delhi, the window to act is the current financial year. The MCD's IT budget cycle closes in March 2027, and departments seeking to embed deduplication infrastructure into procurement plans need sign-off from the Standing Committee before October. Officials familiar with that timeline say proposals are being drafted, though nothing has yet been tabled. Citizens who interact with MCD or e-District services and notice mismatched document images attached to their records are advised to raise a formal grievance through the CM Helpline — number 1076 — to create an audit trail that administrators say is useful in building the case for faster remediation.