All about SUMO: Third Post
May 15th, 2008 by seth bindernagel
Mean responses and the missing values
In the last two posts about survey 2, we learned the purpose of the survey and who took it.
For this post, we will start looking into the responses to our questions. Remember that we asked the following:
- Please rate the overall quality of the support for each product;
- How much does the user agree or disagree with a list of statements describing the support experience;
- Please rate the usefulness of particular content types (tutorials, screencasts, chats, etc.).
We can start by summarizing the means to each answer, typically the easiest statistic to interpret and visualize. But, as a forewarning, the chart below is pretty long!
Darker bars are the mean responses and the thin vertical grey lines represent the levels of the scale for each question:
- ‘Lowest quality’ to ‘Highest quality’ for the first question
- ‘Strongly Disagree’, ‘Disagree’, ‘Neutral’, ‘Agree’ and ‘Strongly Agree’ for the remaining two questions
Perhaps it’s not surprising, but Firefox’s support was rated highest (our guess: because of its depth and completeness), whereas products like SeaMonkey, Calendar and Camino had a very high ratio of missing values (the ‘I don’t know’ responses). As the support rating lowers for each product, the percentage of the missing values increases, which also constitutes an important piece of information. 60-70% of the survey takers didn’t know how to rate the quality of support for SeaMonkey or Calendar—probably because they hadn’t used support or the application itself. For Firefox, we can say that its support was rated between 4 and 5, where 5 was “highest quality”. NB, only about 6% of the survey takers didn’t know how to rate Firefox support.  Please keep in mind that every respondent was thinking about support in their language, so in our next post we will calculate the mean responses to this question for every locale.
For the next two questions, we should interpret the bars differently. Each of the thin vertical grey lines corresponds to a level starting with ‘Strongly Disagree’ on the left, through ‘Disagree’, ‘Neutral’, ‘Agree’ and, for the rightmost line, ‘Strongly Agree’. The length of the darker bar for the first statement means that, again on average, the respondents tended to agree that “the support sites loaded quickly.” As for the brighter bar, we interpret it in the same way as above: about 10% of the survey takers didn’t know how to react to this statement.
When you look at the average responses to the second question, you’ll see some interesting findings:
- “Support in English is sufficient” scores lowest (between ‘disagree’ and ‘neutral’), and at the same time, it has very few missing values. So, people certainly had an opinion on this one since there were not many “Don’t Know” responses.
- Two to three other statements have neutral response means (the corresponding dark bars end close to the middle of the chart)
- Six statements show that the survey takers well above the midpoint and reflecting agreement.
Is this helpful data if we see mostly “Agrees”, “Neutrals” and one “Disagree”?Â
Keep in mind that every person who responded to these two questions reflected on their unique support experience and then submitted their response. Also, remember that support of the Mozilla products
- can be accessed in different languages (and the quality of the support depends on quality of the localization),
- is often decentralized (each local community has its own discussion board),
- and aims to satisfy each users distinct needs.
If the cumulative responses were “Agree”, the means may still differ across certain groups of people. We can see that the “Most of the relevant support can be found in my language” statement scores high (respondents tended to agree with it), which is a good first step toward understanding how the local communities are doing in terms of the user-to-user support. One interesting thing to note is the “The support sites make use of video” statement. It scores low, but also has an outstanding ratio of missing values. Could you argue that an “I don’t know” answer is close to “Disagree”? If someone hadn’t ever seen a video on a support site, but was careful in their judgments and supposed that there might be other support sites where videocasts were available, they were probably more likely to answer “I don’t know” instead of “Disagree”. In this survey, we assumed that an “I don’t know” answer equaled “I haven’t ever seen any of them”. Therefore, we decided that the survey takers who described themselves as end-users were more likely to give an “I don’t know” answer to this statement.
Mean responses by user profile
Thanks to the user profile variable, we were ablt to split the cases and plot new charts for 3 different user profiles. These were community members (49% of respondents), active community members (localizing, coding, etc.; 32%) and end-users (19% of the survey takers).
We observe the biggest discrepancies between two extreme profiles: end-users and active contributors. The community member profile seems to be in the middle for most of the questions.  For example, end-users rate the quality of the support for each product poorer than the contributors (green bars are longer than the yellow ones). More visible differences can also be observed in the “Most of the relevant support can be found in my language” statement. It seems that active contributors are more enthusiastic here.  Why is this the case? Perhaps, it’s due to prior knowledge about the products (and computers in general), greater familiarity with the technology, and a better command of English. What do you think?
Two other statements bring forward even bigger differences in the way the end-users and the active contributors responded.
- “The support sites make use of video”
Why do end-users agree more than active contributors that there are support sites using video? This is probably a result of what we have discussed above: respondents might have preferred the “I don’t know” answer to this question over “Strongly disagree” or “Disagree”, even if these ones would have been more accurate. We can see that there were nearly 50% of end-users answering “I don’t know” to this statement, and only half as many of the active contributors. Some end-users did agree on this topic, but some didn’t and we can assume that some of them selected the “I don’t know” answers instead of “Disagree”. Using this theory, we can understand why end-users scored this higher: there were not enough “disagree” responses to counterbalance the “agree” ones.
- “Users rarely ask the same question more than once”
It seems understandable that active contributors are more likely to disagree. After all, they’d probably seen lots of duplicated questions in the forums and Bugzilla.
![]()
In the third question we see that end-users tend to favor direct communication (IM, VoIP) over text-based communication, compared to other user profiles. We also notice that forums turned out to be one of the most favorite channels of communication for active contributors—and much less so for the other profiles.
Finally, here is the graph of the missing values across different user profiles.
* * *
In the next posts we will reduce the number of variables by means of the factor analysis. This will allow us to better synthesize the analysis without loosing much of the information enclosed in the original set of variables.
Thanks for reading this to the end!!



















