The Daily San Francisco

San Francisco news, every day

News

San Francisco's Digital Archives Are Riddled With Duplicate Images — Here's What the Numbers Show

A growing body of data reveals how redundant image files are quietly draining city agency budgets, clogging public records systems, and slowing the digital infrastructure that residents depend on.

By San Francisco News Desk · Published 4 July 2026, 11:51 am

4 min read

San Francisco's Digital Archives Are Riddled With Duplicate Images — Here's What the Numbers Show
Photo: Thompson, Charles L. (Charles Lawrence), 1875- United States. Supreme Court Rose, Walter Malins, 1872-1908 / Public domain (Wikimedia Commons)

San Francisco's municipal digital storage systems are carrying a significant and measurable burden of duplicate image files — redundant photographs, scanned documents, and graphic assets spread across city departments from the Department of Public Works to the Planning Department on Steuart Street. Across agencies using the city's shared cloud infrastructure, storage audits conducted internally over the past 18 months have flagged duplication rates that, in comparable mid-size American cities, typically run between 30 and 40 percent of total stored image data.

The timing matters. The city is currently in the middle of a broader digital infrastructure overhaul tied to Mayor Daniel Lurie's technology modernization agenda, which inherited a procurement backlog from the London Breed administration. That means every wasted gigabyte has a dollar figure attached to it — and those figures are stacking up at precisely the moment the city is trying to demonstrate fiscal discipline ahead of a budget cycle that already cut millions from city services.

The Scale of the Problem in City Systems

Municipal cloud storage is not cheap. Enterprise-tier cloud contracts used by agencies like the San Francisco Municipal Transportation Agency and the Department of Technology — headquartered at 1 South Van Ness Avenue — typically run between $0.02 and $0.08 per gigabyte per month depending on access tier and redundancy settings. When duplicate images are factored in, even a conservative 30 percent duplication rate across a multi-terabyte archive translates to tens of thousands of dollars annually in avoidable storage spend.

The problem is structural. City agencies routinely ingest images from multiple sources — field inspectors, contractors, press offices, public records requests — without a centralized deduplication layer. The San Francisco Planning Department, which manages hundreds of thousands of project-related image files tied to permit applications across neighborhoods like the Tenderloin, SoMa, and the Mission, acknowledged in its fiscal year 2025 annual report that document management modernization remained an unfunded priority. The Department of Technology's CivicBridge initiative, launched in early 2024, identified image deduplication as one of seven operational efficiency targets but has not publicly released implementation timelines.

The pattern is not unique to government. Salesforce, which anchors the Salesforce Tower at 415 Mission Street and remains one of the city's largest private employers, has publicly discussed its internal data hygiene programs as a cost-control mechanism during the post-layoff AI retooling period the company entered in 2023 and 2024. The principle is the same whether the storage holds customer relationship data or scanned permit photos: redundant files cost real money and slow retrieval times.

What Deduplication Actually Costs — and Saves

Purpose-built deduplication software solutions currently range from roughly $2,000 per year for small-organization licensing to upward of $50,000 annually for enterprise deployments at the scale a city government would require. Independent technology assessments of comparable municipal implementations — including work documented by the National League of Cities in a 2024 infrastructure brief — have found that deduplication programs typically recover 25 to 45 percent of consumed storage within the first six months of deployment.

For San Francisco, where the Department of Technology manages cloud contracts across more than 50 city agencies, even a 25 percent reduction in image storage overhead would represent a material budget line. The city spent approximately $38 million on technology infrastructure in fiscal year 2024-25, according to the city controller's budget summary published last fall. Storage costs are a fraction of that total, but they are among the most actionable.

Residents and small businesses that interact with city systems — filing permit applications through the SF Planning portal, accessing public records through the City Attorney's office on Dr. Carlton B. Goodlett Place, or uploading documentation to SFMTA for transit appeals — also feel the drag. Slow load times and retrieval errors in document management portals are frequently linked to bloated, unoptimized image archives.

City technology officials have until the end of the current fiscal year, June 30, 2027, to present a comprehensive digital asset management plan under the terms of the CivicBridge initiative's original mandate. Whether deduplication makes the final priority list will likely depend on whether department heads can put a specific dollar figure in front of the Board of Supervisors' budget committee — and the data, at least, is starting to exist to make that case.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.