San Francisco's Department of Building Inspection is sitting on a backlog of duplicate scanned images embedded in its permit archive — redundant files that have quietly inflated the city's document database for years and now threaten to slow the rollout of a new digital permitting system expected to launch in early 2027. The problem has moved from a low-priority IT housekeeping issue to an urgent operational one, with the agency facing a hard deadline and a series of decisions that will shape how contractors, architects, and homeowners interact with the city's permitting infrastructure for the next decade.
The stakes are higher than they might appear. The Mission District alone has thousands of historic property files that were scanned in multiple batches during digitisation drives in 2009 and again between 2017 and 2019, according to city public records. Duplicate image files — sometimes three or four copies of the same elevation drawing or site plan — make automated document retrieval unreliable and drive up cloud storage costs. For anyone who has waited at the Civic Center permit counter at 49 South Van Ness Avenue, the downstream effect is familiar: a staff member scrolling through redundant pages trying to locate the authoritative version of a decades-old drawing.
Why the Timing Is Pressing
The city's push to accelerate housing production has turned permit velocity into a political priority. Mayor Daniel Lurie, who took office in January 2025 after defeating London Breed, campaigned in part on cutting the permit wait times that have strangled housing construction in neighborhoods like the Sunset, the Excelsior, and SoMa. A bloated, duplicate-laden archive directly undermines that goal by slowing the automated review tools the new platform is supposed to deploy.
San Francisco's Planning Department and the Department of Building Inspection jointly submitted a technology upgrade proposal to the city's Committee on Information Technology in late 2025. That proposal identified document deduplication as a prerequisite for migrating to the new system. The city has roughly 1.4 million permit records in its legacy archive, a figure cited in publicly available Capital Planning documents. Even a conservative estimate of five percent duplication means more than 70,000 redundant files that need to be identified, reviewed, and either merged or deleted before migration can proceed cleanly.
The financial exposure is real. Cloud storage at the scale the city currently maintains runs to hundreds of thousands of dollars annually in vendor contracts, and migrating a bloated dataset into a new system multiplies those costs while also increasing the risk of data errors during transfer. The San Francisco Controller's Office flagged technology project overruns as a budget risk in its fiscal year 2025-26 report — a reminder that poorly managed migrations have burned the city before.
The Decisions That Need to Be Made
Three choices are now in front of city technology and building officials. First, they must decide whether to run an automated deduplication pass — faster but prone to accidentally deleting a document that appears duplicated but is actually a revised version — or a manual audit, which is slower and more expensive in staff hours. Second, they need to establish a chain-of-custody protocol so that property owners, title companies, and firms like Gensler or smaller Mission Street architecture practices can flag if a document they rely on gets incorrectly removed. Third, the agencies have to agree on a completion date that is realistic enough to protect the 2027 launch but firm enough to prevent the cleanup from drifting indefinitely.
Community organizations along the 24th Street corridor in the Mission and housing advocates in the Tenderloin who use permit records to track displacement and code enforcement have a direct interest in how those decisions land. If the deduplication process is rushed, a badly executed purge could create gaps in historical records that inform eviction defense and landmark preservation cases.
The next formal checkpoint is a joint technology briefing scheduled before the Board of Supervisors' Government Audit and Oversight Committee, expected in September 2026. That session will likely be the first public test of whether the city has a credible plan — or whether the duplicate image problem quietly becomes another reason why San Francisco's permit system misses yet another promised reform deadline.