The Daily San Francisco

San Francisco news, every day

News

SF's Digital Records Push Hits a Snag: Officials and Experts Weigh In on the Duplicate Image Problem

As the city accelerates a sweeping digitization effort across multiple departments, a growing chorus of archivists, technologists, and planning officials are sounding alarms about redundant and misidentified files clogging the system.

By San Francisco News Desk · Published 4 July 2026, 12:11 pm

3 min read

SF's Digital Records Push Hits a Snag: Officials and Experts Weigh In on the Duplicate Image Problem
Photo: Photo by Vision plug on Pexels

San Francisco's ambitious push to digitize decades of paper records has run into a stubborn technical headache: thousands of duplicate images embedded in municipal databases are slowing workflows, inflating storage costs, and in some cases causing city staff to pull the wrong documents during planning and permitting reviews. The problem has moved from a background IT complaint to a front-burner policy conversation at City Hall.

The timing matters. The city's Department of Technology has been under pressure since early 2025 to modernize its document infrastructure after a backlog in the Planning Department's permit queue drew sustained criticism from housing advocates and developers. Mayor Daniel Lurie, who took office in January 2025 following London Breed's defeat, made housing production a signature priority, and permitting delays tied to document errors have become a political liability. Against that backdrop, the duplicate-image issue is no longer a purely administrative annoyance — it sits at the intersection of housing policy, government efficiency, and the city's broader AI ambitions.

What the City Is Actually Dealing With

The problem is structural. Over several years, the San Francisco Planning Department, the Department of Building Inspection, and the Office of the Assessor-Recorder each built or acquired separate document management systems that do not share a common deduplication protocol. When staff scan property records, site photographs, or permit drawings — particularly for properties in dense corridors like the Mission District or the Tenderloin — the same image can end up stored multiple times under slightly different file names across different platforms. Archivists at the San Francisco History Center, located inside the Main Library on Larkin Street, have flagged the same issue on the historical side, where digitized photograph collections from the 1906 earthquake era contain significant overlap that makes catalog searches unreliable.

Technology consultants working with the city — several of whom have worked on similar projects with the Los Angeles Bureau of Engineering — describe the core challenge as an identity problem: without consistent metadata standards, no automated tool can confidently determine whether two image files showing the same building facade are true duplicates or distinct records from different inspection dates. That distinction matters enormously for a planning reviewer trying to establish a property's condition at a specific point in time.

The San Francisco Civil Grand Jury, in its June 2025 report on city IT governance, noted that storage redundancy across municipal systems was costing the city measurably in both licensing fees and staff hours, though it did not publish a line-item figure specific to images. Separately, the Department of Technology's fiscal year 2025–2026 budget, approved by the Board of Supervisors, allocated approximately $4.2 million toward cloud infrastructure consolidation — a figure that department leadership has pointed to as the funding vehicle for a broader deduplication effort.

Experts Push for a Unified Standard Before the Next Phase

Digital preservation specialists and open-government advocates in the Bay Area have been urging the city to establish a metadata governance framework before it deploys any AI-assisted tool to auto-flag duplicates. The concern is practical: an algorithm trained on imperfect data could delete or quarantine records that only appear redundant. For properties in historically complex neighborhoods like Chinatown or the Haight-Ashbury, where ownership chains and permit histories are layered across generations, a wrongly discarded image could impede a legal proceeding or a historic preservation review.

The nonprofit Code for America, which maintains offices in San Francisco and has worked with multiple California counties on civic tech challenges, has publicly advocated for open metadata standards in government digitization projects — a position relevant to exactly this kind of inter-departmental image problem.

For residents and businesses waiting on permits, the practical advice from planning professionals is straightforward: when submitting documentation to the Department of Building Inspection at 49 South Van Ness Avenue, include unique file naming conventions with dates and addresses embedded in each image file name. It does not solve the city's backend problem, but it reduces the chance that a staff reviewer pulls the wrong version of a document during the permitting review. The city has indicated it expects to release updated submission guidelines by September 2026.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.