San Francisco's Department of Building Inspection has begun a systematic sweep of its digital permit archive, targeting duplicate images that have accumulated across tens of thousands of property records dating back to the city's 2009 digitization push. The problem is bigger than it sounds: redundant image files are slowing permit review times, inflating server costs, and — in at least a handful of documented cases — causing inspectors to pull the wrong version of a structural photo during hearings.
The timing matters. San Francisco is in the middle of a state-mandated housing production push, with the city required under its sixth-cycle Housing Element to zone for roughly 82,000 new units by 2031. Any friction inside the permit pipeline, including bloated and disorganized image databases, draws scrutiny from housing advocates and city controllers alike. When a single permit application for a Tenderloin SRO conversion can contain upward of 400 attached images — many of them identical JPEGs submitted multiple times by different contractors — the cumulative drag on the system becomes a real operational problem, not an abstract one.
What San Francisco Is Actually Doing
The Department of Building Inspection began its deduplication pilot in March 2026, partnering with the city's Department of Technology on a tool that uses perceptual hashing — a technique that identifies visually identical or near-identical images without requiring exact file matches. The pilot is currently running across permit records filed for properties in the Mission District and SoMa, two neighborhoods where construction activity surged between 2015 and 2020 and where duplicate submissions became especially common during the pandemic-era remote inspection period.
The San Francisco Planning Department, which maintains a parallel image archive tied to environmental review records, is not yet part of the pilot but has been in conversations with DBI about eventually syncing the two systems. Planning's archive, housed at 49 South Van Ness Avenue, holds visual documentation for every major discretionary review case going back decades — and staff there have flagged the duplicate problem internally for at least three years, according to department budget presentations reviewed by The Daily San Francisco.
The city's Office of Digital Services, established under the broader SmartSF initiative, is overseeing data governance for the project. The deduplication tool is expected to process roughly 1.2 million image files in the first phase, completing the Mission and SoMa sweep by September 2026 before expanding citywide.
How Other Cities Have Handled This
San Francisco is not the first city to wrestle with this. London's Planning Portal, which consolidated borough-level applications into a single national system in stages between 2018 and 2022, ran into the same duplicate-image problem at scale. The UK's Planning Inspectorate publicly acknowledged in 2023 that redundant file uploads had contributed to processing delays on major infrastructure applications, and the portal introduced mandatory deduplication checks on upload as part of a 2024 technical update.
Singapore's Urban Redevelopment Authority took a different approach. Rather than cleaning up existing archives retroactively, the URA in 2022 required all new building submissions through its GoBusiness licensing portal to pass an automated image check before a submission could be marked complete. That front-end gate reduced duplicate files in new applications by a figure the URA described in a 2023 annual report as substantial, though the legacy archive problem remained unaddressed.
New York City's Department of Buildings has a comparable backlog. Its DOB NOW portal, launched in 2016, accumulated years of duplicate submissions before a 2025 internal audit — reported by City Limits — found that storage costs for the Buildings Department's digital archive had grown faster than any other agency in the city's technology portfolio. A remediation contract was awarded in early 2026, but the project is still in procurement.
San Francisco's approach sits between London's reactive cleanup and Singapore's proactive gate. City officials say the hybrid model was chosen partly because retrofitting an upload validator onto the existing Accela permit platform would require a contract modification estimated at over $800,000 — money the department does not have in the current fiscal year budget, which runs through June 2027. The retroactive sweep, using existing Department of Technology staff time, was the faster and cheaper path to start.
For property owners and contractors filing permits this summer, the practical advice from DBI is straightforward: before uploading site photos to a new application, check whether those images were already attached to a prior permit on the same address. The department's permit search tool at sfdbi.org can pull all prior filings by parcel number. Redundant uploads don't void an application, but they do add to review time — and right now, with the housing pipeline under pressure, every day counts.