The Daily San Francisco

San Francisco news, every day

News

San Francisco's Duplicate Image Problem: Why a Digital Housekeeping Crisis Is Costing Residents Real Money and Real Services

Across city agencies, hospitals, and nonprofits, redundant digital images are quietly draining storage budgets and slowing the systems locals depend on every day.

By San Francisco News Desk · Published 4 July 2026, 12:25 pm

3 min read

San Francisco's Duplicate Image Problem: Why a Digital Housekeeping Crisis Is Costing Residents Real Money and Real Services
Photo: Waterways Experiment Station (U.S.) / Public domain (Wikimedia Commons)

San Francisco's public-facing digital systems are quietly choking on themselves. From the Department of Public Health's patient portals to the Planning Department's permit databases on Stevenson Street, city agencies and major nonprofits are sitting on enormous backlogs of duplicate image files — redundant photographs, scanned documents, and digital records stored two, three, sometimes dozens of times across aging infrastructure. The result is slower systems, ballooning IT costs, and delays that ripple directly into resident services.

The problem has accelerated alongside the city's post-pandemic push to digitize everything. Between 2021 and 2025, San Francisco agencies moved millions of paper records online, a process that generated massive volumes of duplicate scans. Without standardized deduplication protocols, those files accumulated. Now, as city departments face budget shortfalls heading into fiscal year 2027, the storage overhead is no longer a background nuisance — it's a line item that competes with staffing, outreach, and infrastructure.

What Duplicate Images Actually Cost San Francisco

Cloud storage is not cheap at government scale. Enterprise-tier storage contracts — the kind the City and County of San Francisco uses for sensitive data — can run between $0.02 and $0.05 per gigabyte per month, depending on redundancy and compliance requirements. When agencies store the same 10-megabyte scan 15 times, those fractions compound fast across departments managing tens of millions of records. Nonprofit partners working with the city face the same bind: organizations like the Tenderloin Housing Clinic and Glide Memorial on Taylor Street, both of which handle high-volume case files for unhoused residents, rely on shared digital platforms where image bloat slows intake processing at exactly the wrong moment.

The San Francisco Department of Technology has flagged data hygiene as a priority in its most recent strategic framework, though the city has not yet published a deduplication-specific budget or timeline. The broader context matters: San Francisco's IT budget for fiscal year 2025-26 was set at roughly $145 million across all departments, according to the Controller's Office budget documents. A meaningful share of that goes toward cloud storage contracts that could shrink significantly with systematic deduplication.

UCSF Medical Center at Mission Bay and Zuckerberg San Francisco General Hospital on Potrero Avenue both operate under electronic health record systems that routinely generate duplicate diagnostic images — X-rays, MRIs, and scans that get uploaded multiple times during patient transfers or system migrations. Health IT specialists have long identified duplicate medical imaging as one of the largest drivers of unnecessary storage cost in urban hospital systems. At UCSF, which manages one of the largest research imaging archives in Northern California, that overhead is substantial.

What Residents Can Do — and What's Coming

For ordinary San Franciscans, the most direct impact shows up in wait times. When a Planning Department staffer in Civic Center pulls up a permit file and the system lags because the underlying database is cluttered with redundant attachments, that's seconds turning into minutes turning into a longer queue at the counter or on the phone. The SF311 system, which handles hundreds of thousands of service requests annually, is similarly affected when its backend image storage grows unmanaged.

Residents who interact with city systems — applying for permits in the Excelsior, submitting housing applications through the Mayor's Office of Housing and Community Development, or uploading documentation for SFMTA parking appeals — can help by compressing images before uploading and avoiding duplicate submissions. Most city portals now accept JPEG or PDF formats under 5 megabytes; submissions within those limits process faster and cost less to store.

On the institutional side, the Department of Technology is expected to release updated data governance guidelines later in 2026 that will include mandatory deduplication checkpoints for major agencies. Vendors pitching AI-assisted deduplication tools have been in conversations with city procurement offices since early this year, according to public contracting records. The technology exists. The urgency is growing. The question now is whether city hall moves fast enough to act before next year's budget cycle forces the choice between storage costs and the services residents actually see.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.