Breakpad
Three Weeks with the New Socorro File System
Three weeks ago today, we deployed the new Socorro file system into production. It was the first in a series of engineered improvements to the Socorro codebase. By “engineered”, I mean that it was the first major improvement to the code that wasn’t done during an emergency with a gun to... »
Socorro’s File System Storage
As the scope and depth of the Socorro/Breakpad project have evolved in the last nine months, the most nonvolatile requirement of the project has been a file system as the initial server-side storage for submitted crash dumps. The file system gets used as an ad hoc hierarchical database, but it isn’t optimized... »
Socorro Delays
Over the last week, we’ve encountered some problems in our monitor and processor caused by a large number of pending jobs: the main monitor thread takes > 1 hr to complete a full scan of pending jobs on disk; priority job processing depends on this thread. In order to fix these delays, we have made priority job monitoring... »
Socorro Database Updates
The Socorro database, which is the main database for Firefox 3 crashes, will be going through some maintenance upgrades this week. Starting tonight, this means: Report data prior to June 23rd will be temporarily unavailable. New reports will work as usual on our new database. Old data will be imported into the new database using an... »
Socorro Processor Updates
Last Friday we pushed some important updates to Socorro: Bug 426940 – Reduce or eliminate delay in collector to monitor hand-off; Bug 426940 – Fix processor handling of error conditions; Bug 428300 – status page too slow. This means: When you submit a crash report you won’t have to wait longer than 30-60 seconds to view your report. The processor... »
Crash Analysis: now in Open Source flavor
History can tell you that companies don’t disclose crashes in their software. They keep a pretty close eye on what crashes and bugs are disclosed. Mozilla doesn’t. Rather than being the exception, openness is the rule, and that is one of the coolest things about being a part of this. My job, my everyday... »
Socorro Updates
We’ve pushed some important updates in the last couple of days: a refactor of the processor code, which is 1/3 of the breakpad server architecture; an update of the reporter to allow for instant queuing of requested reports. This means: If you submit a crash, going to that crash page will: Show you a “haven’t queued it yet” page instead of a 404 page... »
Socorro Updates
Socorro has had a few improvements over the last week. bsmedberg, ispiked and luser worked hard to bring you: graphs to show crash population over time; crash reports by operating system; crash reports by build and operating system. There are plenty of fun graphs that show trends over time. For example, see this graph that shows... »
Partitioning Fun in PostgreSQL
Last week I learned a few things (the hard way) about PostgreSQL (pgsql) partitioning: You really have to read the “caveats” part of the manual (scroll down, very bottom); server config matters (SET constraint_exclusion = on;) if you want to avoid unnecessarily checking out-of-bounds partitions; child tables don’t inherit permissions, so owning the parent table won’t automagically... »
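As a rough illustration of the partitioning points in the excerpt above, here is a minimal sketch that generates the SQL for one inheritance-based child partition. It only builds strings and does not connect to a database. The table name `reports`, the column `date_processed`, the role `report_reader`, the one-week range, and the helper `partition_sql` are all assumptions made for the example; they are not taken from the Socorro schema.

```python
from datetime import date, timedelta

# The excerpt mentions this server setting; without it the planner may
# still check partitions whose CHECK constraint rules them out.
SESSION_SETUP = "SET constraint_exclusion = on;"


def partition_sql(parent, column, start, days, reader_role):
    """Return the SQL statements for one child partition (hypothetical helper).

    The child gets a CHECK constraint covering [start, start + days) and an
    explicit GRANT, because permissions on the parent table are not
    inherited by child tables.
    """
    end = start + timedelta(days=days)
    child = f"{parent}_{start:%Y%m%d}"
    create = (
        f"CREATE TABLE {child} ("
        f"CHECK ({column} >= DATE '{start.isoformat()}' "
        f"AND {column} < DATE '{end.isoformat()}')"
        f") INHERITS ({parent});"
    )
    grant = f"GRANT SELECT ON {child} TO {reader_role};"
    return [create, grant]


# Hypothetical names and dates, for illustration only.
statements = partition_sql(
    "reports", "date_processed", date(2008, 1, 7), 7, "report_reader"
)

if __name__ == "__main__":
    print(SESSION_SETUP)
    for statement in statements:
        print(statement)
```

Generating the GRANT next to the CREATE TABLE is one way to avoid forgetting the per-child permissions; whether that fits a given deployment depends on how partitions are created there.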
