The Daily San Francisco

San Francisco news, every day

News

SF City Agencies Push to Purge Duplicate Images From Public Records Systems This Week

A coordinated cleanup effort targeting redundant digital files is saving storage costs and untangling years of bureaucratic data sprawl across municipal departments.

By San Francisco News Desk · Published 4 July 2026, 12:06 pm

3 min read

SF City Agencies Push to Purge Duplicate Images From Public Records Systems This Week
Photo: Photo by Jeffry Surianto on Pexels

San Francisco's Department of Technology quietly hit a milestone this week, completing the first phase of a citywide duplicate-image-replacement initiative that has been working through the municipal records infrastructure since January. The effort targets redundant scanned documents, permit photographs, and archived case files that have accumulated across at least a dozen city agencies — clogging servers, inflating cloud storage bills, and slowing the retrieval systems that city workers and the public rely on daily.

The timing matters. San Francisco is in the middle of a broader push to modernize its data infrastructure, partly in response to pressure from the Board of Supervisors to cut operational overhead at a moment when the city's general fund is under sustained strain. Duplicate digital assets — particularly images attached to building permits, homeless services case files, and Planning Department records — have been flagged by internal auditors as a low-profile but significant cost driver. The problem compounds every year as agencies digitize older paper archives without first checking for existing scans.

What Happened This Week

On Wednesday, the Department of Technology confirmed to staff that automated deduplication tools deployed across the Planning Department's permit portal — accessible via the city's online services hub at sfgov.org — had identified and flagged more than 400,000 redundant image files for review. A subset of those, roughly 60,000 files linked to closed building permits in the Mission District and SoMa, were cleared for replacement or deletion after manual verification. The Planning Department, headquartered at 49 South Van Ness Avenue, has one of the largest image repositories in city government, owing to decades of photographic documentation attached to conditional-use permits and environmental impact reviews.

The Department of Homelessness and Supportive Housing, which operates out of offices on Seventh Street and manages intake documentation for shelter placements across the city, is next in line for the deduplication sweep, scheduled to begin the week of July 14. That agency's case management system, the Coordinated Entry system used across HSH's provider network, has been flagged by the Controller's Office as carrying a significant volume of duplicate intake photographs uploaded during system migrations in 2022 and 2023.

City officials have not publicly disclosed a dollar figure for projected savings, and The Daily San Francisco has not independently confirmed internal cost estimates. However, comparable municipal deduplication projects in cities of similar scale — including Denver's 2024 records modernization effort — have reported storage cost reductions of 20 to 35 percent within the first year of systematic cleanup, according to published case studies from the Government Technology research group.

Why It Matters Beyond File Housekeeping

The practical stakes reach further than server space. San Francisco's permit-retrieval system has drawn complaints from contractors working on housing projects in the Tenderloin and the Castro, who say slow document-loading times delay job starts. When a permit file carries dozens of duplicate images — often high-resolution photographs uploaded multiple times during system glitches or staff handoffs — retrieval times spike. The city's housing production emergency declaration, issued in 2024, put pressure on Planning and the Department of Building Inspection to streamline permitting. Clogged back-end systems undercut that goal.

The deduplication project also intersects with the city's AI procurement push. Several departments are piloting machine-learning tools for permit review and code enforcement — tools that perform better when trained on clean, non-redundant datasets. A bloated image library full of near-identical files degrades model accuracy and inflates the cost of AI inference, according to documentation from the city's Office of Emerging Technology published in March 2026.

For residents and small contractors, the most immediate change should be faster load times on the SF Planning portal and the Department of Building Inspection's online permit tracker at sfdbi.org. The Department of Technology has indicated it will publish a public progress dashboard by the end of July, allowing users to track which agency databases have been cleared. Anyone who relies on archived permit photographs — title companies, architects working on Accessory Dwelling Unit projects in the Sunset or Richmond districts, or attorneys handling property disputes — should expect intermittent access pauses to specific file sets over the next three weeks as the cleanup proceeds agency by agency.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily San Francisco

This article was produced by the The Daily San Francisco editorial desk and covers news in San Francisco. See our editorial standards for how we use AI.

The Daily San Francisco brief

The day's San Francisco news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to San Francisco news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily San Francisco and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily San Francisco

More in News

Enjoyed this story? Get tomorrow's briefing free.