The Daily San Francisco

San Francisco news, every day

News

SF's Duplicate Image Crisis: What Happens Next and the Key Decisions Ahead

City agencies and nonprofits are racing to fix a growing problem with duplicated digital records—and the choices made in the next 90 days will shape how San Francisco manages public data for years.

By San Francisco News Desk · Published 4 July 2026, 11:51 am

3 min read

SF's Duplicate Image Crisis: What Happens Next and the Key Decisions Ahead
Photo: Committee on Foreign Affairs / Public domain (Wikimedia Commons)

San Francisco's network of city departments, social services agencies, and transit providers is sitting on a sprawling, tangled archive of duplicate digital images—photographs, scanned documents, and case-file attachments that have multiplied across government servers for more than a decade. The problem has reached a point where IT administrators at multiple departments are being asked to make hard calls: purge, consolidate, or migrate. The decisions made before the end of Q3 2026 will determine how much of that data is recoverable, how much taxpayer money gets spent cleaning it up, and whether frontline workers in places like the Tenderloin and SoMa can actually pull up an accurate client record without wading through redundant files.

The timing matters. San Francisco is in the middle of a technology infrastructure overhaul driven partly by the AI productivity push sweeping the local tech sector and partly by budget pressure at City Hall. Mayor Daniel Lurie's administration inherited a Department of Technology budget with deferred maintenance baked into nearly every line item. Storage sprawl—duplicate images chief among its symptoms—has become a concrete, measurable cost problem, not just an abstract IT complaint.

Where the Bottlenecks Are Forming

The crunch is most visible at two pressure points. The Department of Public Health, which operates Zuckerberg San Francisco General Hospital on Potrero Avenue as well as dozens of behavioral health clinics across the city, maintains patient-facing case management systems where duplicate intake photographs and scanned ID documents have accumulated over years of software migrations. Each redundant file is a liability—both a storage cost and a potential HIPAA compliance exposure. Meanwhile, the Human Services Agency, headquartered on Otis Street near Civic Center, runs benefits programs whose caseworkers routinely encounter duplicate images attached to client profiles when pulling records for CalFresh or General Assistance applications. Staff working those cases have flagged the issue internally as a source of workflow delays.

BART and Muni are facing a parallel version of the problem on the infrastructure side. The Muni Metro system's maintenance division stores inspection photographs for rolling stock and station assets. According to city IT governance protocols adopted in 2023, departments were supposed to implement deduplication tooling by January 2026. Several missed that deadline.

The cost of inaction is not abstract. Enterprise cloud storage rates for government contracts in California have risen sharply since 2023, and unmanaged duplicate image libraries can balloon storage consumption by 30 to 60 percent above baseline, according to published benchmarks from the Cloud Security Alliance—a figure that translates directly into contract overage fees when departments exceed their provisioned storage tiers.

The Decision Points Coming This Fall

Three choices will define what happens next. First, departments need to decide whether to run automated deduplication against live databases or take systems offline for a controlled audit. The automated route is faster but carries a risk of false-positive matches—two different clients whose intake photos are flagged as identical by hash-matching algorithms. The Department of Technology is expected to issue guidance on this question before September 1.

Second, the city needs to determine which agency leads the consolidated effort. The Mayor's Office of Civic Innovation, based at 1 Dr. Carlton B. Goodlett Place, has positioned itself as the coordinating body for cross-departmental data projects, but the Department of Technology holds the actual infrastructure contracts. That jurisdictional overlap has stalled similar initiatives before.

Third, there is the question of vendor selection. Several proposals are reportedly circulating from Bay Area firms offering AI-assisted deduplication services—a pitch that has obvious appeal in the current moment but that city procurement rules require to go through a formal RFP process, which typically runs four to six months minimum.

For San Franciscans who interact with city services—whether filing a permit at the Planning Department on Stevenson Street, accessing shelter services through the Department of Homelessness and Supportive Housing, or riding the 38-Geary—the downstream effect of getting this right is faster, more accurate service. The upstream effect of getting it wrong is continued cost overruns, compliance risk, and the kind of administrative dysfunction that has plagued city IT projects for a generation. The next 90 days will tell a lot.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.