The Daily San Francisco

San Francisco news, every day

News

SF City Agencies Race to Purge Duplicate Images From Public Records Portals This Week

A coordinated push across several San Francisco departments to eliminate redundant and mis-tagged photographs from online databases is exposing deeper problems in how the city manages its digital archives.

By San Francisco News Desk · Published 4 July 2026, 11:35 am

3 min read

SF City Agencies Race to Purge Duplicate Images From Public Records Portals This Week
Photo: O'Malley, Kevin D / Public domain (Wikimedia Commons)

San Francisco's Department of Technology and at least two other city agencies spent the first week of July running systematic audits of their public-facing digital asset libraries, pulling hundreds of duplicate and incorrectly labeled images that had accumulated across the city's open-data portals and permitting systems. The effort, which city IT staff began in earnest on Monday, June 30, targets a backlog that has grown steadily since the pandemic-era push to digitize municipal records accelerated in 2020 and 2021.

The timing is not accidental. The city's Planning Department, headquartered at 49 South Van Ness Avenue, is preparing to relaunch its public-facing project lookup tool later this summer. Officials there want the interface clean before residents and developers begin using the upgraded portal to search building permit applications. Duplicate images — the same property photograph uploaded under multiple permit numbers, for instance — have been generating false results and slowing load times for users in neighborhoods from the Sunset District to SoMa.

Why the Cleanup Matters Beyond Aesthetics

Redundant image files create real problems downstream. When the city's homelessness response teams at the Department of Homelessness and Supportive Housing update site assessments with field photographs, duplicate entries have occasionally caused case managers to log the same location twice, wasting staff time. The department, which manages more than 10,000 units of supportive housing across the city according to its fiscal year 2025 annual report, relies on accurate geotagged imagery to track encampment clearances and shelter placements in real time.

The San Francisco Public Library's digital collections unit, based at the main branch on Larkin Street in the Civic Center, ran into a parallel issue earlier this year when its digitized historical photograph archive ballooned to roughly 340,000 files — a figure the library's digital services team disclosed in a January 2026 internal presentation obtained by this reporter. Automated deduplication software flagged more than 12,000 potentially redundant entries, though librarians must manually review each one before deletion to avoid losing genuinely distinct archival images.

City Hall's broader IT consolidation effort, known internally as DataSF's asset management modernization initiative, has been pushing individual departments to adopt a single image-tagging standard since late 2024. Progress has been uneven. The Recreation and Parks Department, which maintains facilities at more than 220 parks citywide, completed its own duplicate-image audit in March 2026 and reduced its asset library size by roughly 18 percent. Other departments have lagged.

What Comes Next for Residents and Contractors

For anyone who files permits through SF Planning's portal or checks city project maps — contractors working in the Mission District, homeowners pulling ADU permits in the Excelsior, developers tracking entitlements in Chinatown — the practical effect should be faster searches and more reliable property photographs attached to the correct addresses. The Planning Department has said the upgraded portal is targeted for a beta release in August 2026, though that date could shift depending on how the image audit proceeds.

The Department of Technology has not publicly disclosed a budget figure for the cleanup effort, and a request for that information submitted this week had not been answered by publication time. The work is being handled largely by existing staff rather than outside contractors, according to a department spokesperson who declined to speak on the record about ongoing internal projects.

Independent technologists who work with San Francisco's open-data ecosystem say the city's image duplication problem is not unique — most large municipal governments that accelerated digital uploads during COVID-era office closures are dealing with similar bloat. What matters now is whether departments build automatic deduplication checks into their upload workflows going forward, rather than scheduling periodic manual cleanups that immediately begin falling behind again. City IT staff have indicated they are evaluating at least two commercial software solutions that would flag duplicate images at the point of upload, but no contract has been awarded yet.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.