<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Blog of Metrics &#187; results</title>
	<atom:link href="http://blog.mozilla.com/metrics/category/results/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.mozilla.com/metrics</link>
	<description></description>
	<lastBuildDate>Wed, 08 Feb 2012 03:44:22 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Comparing the Bias in Telemetry Data vs The Typical Firefox User</title>
		<link>http://blog.mozilla.com/metrics/2011/12/13/comparing-the-bias-in-telemetry-data-vs-the-typical-firefox-user/</link>
		<comments>http://blog.mozilla.com/metrics/2011/12/13/comparing-the-bias-in-telemetry-data-vs-the-typical-firefox-user/#comments</comments>
		<pubDate>Tue, 13 Dec 2011 19:25:49 +0000</pubDate>
		<dc:creator>Saptarshi Guha</dc:creator>
				<category><![CDATA[results]]></category>
		<category><![CDATA[anova]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[log linear models]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=5286</guid>
		<description><![CDATA[Telemetry  is a feature in Firefox that captures performance metrics such as start up time, DNS latency among others. The number of metrics captured is in the order of a couple hundred. The data is sent back to the Mozilla Bagheera servers  which is then analyzed by the engineers. The Telemetry feature asks the Nightly/Aurora (pre-release) users  if they [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: justify;"><a title="Telemetry" href="https://addons.mozilla.org/en-US/firefox/addon/abouttelemetry/">Telemetry</a> is a feature in Firefox that captures performance metrics such as start up time and DNS latency, among others. The number of metrics captured is on the order of a couple hundred. The data is sent back to the Mozilla <a title="Bagheera" href="https://github.com/mozilla-metrics/bagheera">Bagheera</a> servers, where it is then analyzed by the engineers.</p>
<p>The Telemetry feature asks the Nightly/Aurora (pre-release) users if they would like to submit their anonymized performance data. This resulted in a response rate (number of people who opted in divided by the number of people who were asked) of less than 3%. This led to two concerns: a small number of responses (which changed when Telemetry became part of the Firefox release) and, more importantly, representativeness: <em>are the performance measurements collected from the 3% representative of those of people who chose not to opt in?</em></p>
<p>Measuring the bias is not easy unless we have measurements about the users who did not opt in. Firefox sends the following pieces of information to the Mozilla servers: operating system, Firefox version, extension identifiers and the time for the session to be restored. This is sent by all Firefox installations unless the distribution or user has the feature turned off (this is called the services AMO ping). The Telemetry data contains the same pieces of information.</p>
<p>What this implies is that we have start up times for i) <em>the users who opted in to Telemetry</em> and ii) <em>everyone</em>. We can now answer the question &#8220;<em>Are the startup times for the people who opted into Telemetry representative of the typical Firefox user?</em>&#8221;</p>
<p>Note: &#8216;everyone&#8217; is <em>almost everyone</em>. Very few have this feature turned off.</p>
<p><strong>Data Collection</strong></p>
<p>We collected start up times for Firefox 7, 8 and 9 for November 2011 from the log files of services.addons.mozilla.org (SAMO). We also took the same information for the same period from the Telemetry data contained in HBase (some code examples can be found at the end of the article).</p>
<p><strong>Objective</strong></p>
<p>Are start up times different by Firefox version and/or source, where source can be SAMO or Telemetry?</p>
<p><strong>Displays</strong></p>
<p>Figure 1 is a boxplot of the log of start up time for Telemetry (tele) vs. SAMO (samo) by Firefox version. At first glance it appears the start up times from Telemetry are less than those of SAMO. But the length of the bars makes it difficult to stand by this conclusion.</p>
<div id="attachment_5287" class="wp-caption aligncenter" style="width: 624px"><a href="http://blog.mozilla.com/metrics/files/2011/12/box.png"><img class="size-large wp-image-5287 " title="Figure 1: Boxplot of Log SessionRestored for Telemetry/SAMO by FF Version" src="http://blog.mozilla.com/metrics/files/2011/12/box-1024x1024.png" alt="Figure 1: Boxplot of Log SessionRestored for Telemetry/SAMO by FF Version" width="614" height="614" /></a><p class="wp-caption-text">Figure 1: Boxplot of Log SessionRestored for Telemetry/SAMO by FF Version</p></div>
<p style="text-align: justify;">Figure 2 is the difference in the deciles of the log of start up time. In other words, approximately speaking, it shows the deciles of the ratio of Telemetry start up time to SAMO start up time. The medians hover in the 0.8 region, though the bars are very wide and do not support the quick conclusion that Telemetry start up time is smaller.</p>
<div id="attachment_5297" class="wp-caption aligncenter" style="width: 624px"><a href="http://blog.mozilla.com/metrics/files/2011/12/box4.png"><img class="size-large wp-image-5297 " title="Figure 2: Difference of Deciles of Logs" src="http://blog.mozilla.com/metrics/files/2011/12/box4-1024x1020.png" alt="Figure 2: Difference of Deciles of Logs" width="614" height="612" /></a><p class="wp-caption-text">Figure 2: Difference of Deciles of Logs</p></div>
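<p>The link between a difference of log deciles and a ratio can be checked on toy data. The sketch below is in Python rather than the R used for the analysis, runs on synthetic log-normal start up times with made-up parameters, and uses a hypothetical nearest-rank <em>deciles</em> helper. It only illustrates that subtracting the deciles of the logs and exponentiating gives the ratio of the raw deciles.</p>

```python
import math
import random

def deciles(values):
    """Nearest-rank deciles (10%, 20%, ..., 90%) of a list of numbers."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[math.ceil(k * n / 10) - 1] for k in range(1, 10)]

random.seed(1)
# Synthetic start up times in milliseconds (log-normal, made-up parameters).
tele = [random.lognormvariate(8.55, 0.9) for _ in range(5000)]
samo = [random.lognormvariate(8.60, 0.9) for _ in range(5000)]

# Difference of the deciles of the logs ...
log_diff = [a - b for a, b in zip(deciles([math.log(t) for t in tele]),
                                  deciles([math.log(s) for s in samo]))]
# ... is the log of the ratio of the deciles of the raw times.
ratio = [a / b for a, b in zip(deciles(tele), deciles(samo))]

for d, r in zip(log_diff, ratio):
    print(round(d, 4), round(math.exp(d), 4), round(r, 4))
```

<p>The nearest-rank rule is used here because it picks an actual data point, so the log transform and the decile commute exactly. Interpolating quantile rules would agree only approximately.</p>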
<p style="text-align: justify;">In Figure 3, we have the mean of the medians of 1000 samples: red circles are for Telemetry and black for SAMO. The ends of the line segments correspond to the sample 95% confidence interval (based on the sample of sample medians). The CI for the SAMO data lies entirely within that of the Telemetry data. This makes one believe that the two groups are not different.</p>
<div id="attachment_5298" class="wp-caption aligncenter" style="width: 624px"><a href="http://blog.mozilla.com/metrics/files/2011/12/box5.png"><img class="size-large wp-image-5298 " title="Figure 3: Mean of the medians (circles) with their 95% confidence intervals. Red is Telemetry, Black is SAMO" src="http://blog.mozilla.com/metrics/files/2011/12/box5-1024x1024.png" alt="Figure 3: Mean of the medians (circles) with their 95% confidence intervals. Red is Telemetry, Black is SAMO" width="614" height="614" /></a><p class="wp-caption-text">Figure 3: Mean of the medians (circles) with their 95% confidence intervals. Red is Telemetry, Black is SAMO</p></div>
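<p>A rough sketch of how such a summary can be computed follows. It is in Python, not the R/RHIPE code behind the figure, and the data, sample size and the <em>median_summary</em> helper are all invented for illustration. It draws repeated samples, takes the median of each, and reports the mean of those medians together with their empirical 2.5% and 97.5% points.</p>

```python
import random
import statistics

def median_summary(population, n_samples=1000, sample_size=200, seed=0):
    """Mean of the sample medians plus an empirical 95% interval."""
    rng = random.Random(seed)
    medians = sorted(
        statistics.median(rng.sample(population, sample_size))
        for _ in range(n_samples)
    )
    lo = medians[n_samples * 25 // 1000]         # 2.5% point
    hi = medians[n_samples * 975 // 1000 - 1]    # 97.5% point
    return statistics.mean(medians), lo, hi

rng = random.Random(42)
# Synthetic log start up times; the means and spreads are invented.
samo = [rng.gauss(8.60, 0.9) for _ in range(20000)]
tele = [rng.gauss(8.58, 0.9) for _ in range(20000)]

for name, data in (("samo", samo), ("tele", tele)):
    centre, lo, hi = median_summary(data)
    print(name, round(centre, 3), round(lo, 3), round(hi, 3))
```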
<p><strong>Analysis of Variance</strong></p>
<p>For a more numerical approach, we can estimate the analysis of variance components. The model is</p>
<p style="text-align: center;"><em>log(startup time) ~ version + src</em></p>
<p>(we ignore interaction). Since the data is on the order of billions of rows, I instead take 1000 samples of approximately 20,000 rows each (a sampling rate of 0.001%), compute the ANOVA results for each sample and then average the summary tables of the <em>lm</em> function in R. In other words, we base our conclusions on the average over the 1000 samples of ~20,000 rows each. (I should point out that the residuals, as per a quick visual check, were roughly Gaussian and the other diagnostics came out clean.)</p>
<p>The average ANOVA does not support a version effect or a source effect (at the 1% level). In other words, the log of start up time is affected neither by the version nor by the source (Telemetry/SAMO).</p>
<pre>               Estimate Std. Error     t value   Pr(&gt;|t|)
(Intercept)  8.62635472 0.01171420 736.4390937 0.00000000
vers8       -0.05995627 0.01928947  -3.1089666 0.02922402
vers9       -0.03382135 0.10466330  -0.3247165 0.48286903
vers10      -0.03862282 0.29308642  -0.1418623 0.48228122
srctele     -0.02290538 0.03946150  -0.5811779 0.45300964</pre>
<p>This is good news! <em><strong>Insofar as start up time is concerned, Telemetry is representative of SAMO.</strong></em></p>
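<p>The sample-then-average procedure used in this section can be sketched on a small scale. This is a Python illustration under simplifying assumptions, not the R/RHIPE job that was run. The population is synthetic, and the per-sample estimate is the difference in mean log start up time between sources, which is the <em>srctele</em> coefficient only in a one-factor model <em>log(t) ~ src</em> (the model above also includes version). The names <em>bernoulli_sample</em> and <em>src_effect</em> are hypothetical.</p>

```python
import math
import random
import statistics

def bernoulli_sample(rows, p, rng):
    """Keep each row independently with probability p."""
    return [row for row in rows if p > rng.random()]

def src_effect(rows):
    """Mean log start up time for 'tele' minus that for 'samo'.

    In a one-factor model log(t) ~ src this difference is the fitted
    srctele coefficient; the post's model also includes version.
    """
    tele = [math.log(t) for src, t in rows if src == "tele"]
    samo = [math.log(t) for src, t in rows if src == "samo"]
    return statistics.mean(tele) - statistics.mean(samo)

rng = random.Random(7)
# Synthetic population of (source, start up time in ms) rows.
population = [("samo", rng.lognormvariate(8.6, 0.9)) for _ in range(40000)]
population += [("tele", rng.lognormvariate(8.6, 0.9)) for _ in range(10000)]

p = 0.02            # sampling rate; the post uses 20000/1923725302
estimates = []
for _ in range(100):                      # the post draws 1000 samples
    sample = bernoulli_sample(population, p, rng)
    estimates.append(src_effect(sample))

print("average srctele estimate:", round(statistics.mean(estimates), 4))
```

<p>Because both synthetic sources share the same distribution, the averaged estimate should sit close to zero, mirroring the "no source effect" reading of the averaged table.</p>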
<p><strong>A Different Approach and Some Checks</strong></p>
<p>By now, the reader should note that we have answered our question (see last line of previous section). Two questions remain:</p>
<p>1. Are the samples representative? We are sampling on 3 dimensions: startup time, src and version. Consider the 1000 quantiles of startup time, the 2 levels of src and the 4 levels of version. All in all, we have 1000x2x4 or 8000 cells. Sampling from the population might leave several cells empty, so much so that the joint distribution of the sample might be very different from that of the population. To confirm that the cell distribution of each sample reflects the cell distribution of the population, we computed Chi Square tests comparing the sample cell counts with those of the parent. All 1000 samples passed!</p>
<p>2. Why use samples? We can fit a log linear regression to the 8000 cell counts (i.e. all the 1.9 BN data points). This of course loses a lot of power: we are binning the data, so all monotonic transformations are equivalent. The model equivalent (using R&#8217;s formula language) of the ANOVA described above is</p>
<p style="text-align: center;"><em>log(cell count) ~ src+ver+binned_startup:(src+ver)</em></p>
<p>If the effects of <em>binned_startup:src</em> and <em>binned_startup:ver</em> are not significant, this corresponds to our conclusion in the previous section. And nicely enough, it does! The output of <em>summary(aov(glm(&#8230;)))</em> is</p>
<pre>summary(aov(glmout &lt;- glm(n~ver+src+sesscut:(ver+src)
                          , family=poisson
                          , data=cells3.parent))</pre>
<pre>              Df     Sum Sq    Mean Sq   F value Pr(&gt;F)
ver            3 4.6465e+14 1.5488e+14 1131.8666 &lt;2e-16 ***
src            1 3.2705e+14 3.2705e+14 2390.0704 &lt;2e-16 ***
ver:sesscut 3952 5.4969e+13 1.3909e+10    0.1016      1
src:sesscut  988 2.0009e+13 2.0252e+10    0.1480      1
Residuals   2967 4.0600e+14 1.3684e+11</pre>
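<p>The first check in this section, comparing a sample's cell counts with those of the parent, can be sketched as a Pearson goodness-of-fit statistic. This is a Python illustration with invented counts. The exact form of the Chi Square test used is not stated here, so the goodness-of-fit form and the <em>chi_square_statistic</em> helper are assumptions. Turning the statistic into a p-value needs a chi-square distribution function, which the Python standard library does not provide.</p>

```python
from collections import Counter

def chi_square_statistic(sample_counts, parent_counts):
    """Pearson statistic comparing sample cell counts with the parent's.

    Expected counts are the parent's cell proportions scaled to the
    sample size. Cells with no parent rows are skipped.
    """
    n_sample = sum(sample_counts.values())
    n_parent = sum(parent_counts.values())
    stat = 0.0
    cells = 0
    for cell, parent_n in parent_counts.items():
        if parent_n == 0:
            continue
        expected = n_sample * parent_n / n_parent
        observed = sample_counts.get(cell, 0)
        stat += (observed - expected) ** 2 / expected
        cells += 1
    return stat, cells - 1          # statistic, degrees of freedom

# Toy cells keyed by (version, src, startup bin); the counts are invented.
parent = Counter({("7", "samo", 1): 6000, ("7", "tele", 1): 2000,
                  ("8", "samo", 1): 9000, ("8", "tele", 1): 3000})
sample = Counter({("7", "samo", 1): 58, ("7", "tele", 1): 23,
                  ("8", "samo", 1): 91, ("8", "tele", 1): 28})

stat, df = chi_square_statistic(sample, parent)
print("chi-square =", round(stat, 3), "on", df, "degrees of freedom")
```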
<p><strong>Some R Code and Data Sizes:</strong></p>
<p>1. The data for SAMO was obtained from Hive, sent to a text file and then imported to blocked R data frames using <a title="RHIPE" href="http://code.google.com/p/rhipe/">RHIPE</a>. All subsequent analysis was done using <a href="http://code.google.com/p/rhipe/">RHIPE</a>.</p>
<p>2. The data for Telemetry was obtained from HBase using Pig (<a href="http://code.google.com/p/rhipe/">RHIPE</a> can read HBase, but I couldn&#8217;t install it on this particular cluster). The text data was then imported as blocked R data frames and placed in the same directory as the imported SAMO data.</p>
<p>3. Data sizes were in the few hundreds of gigabytes. All computations were done using <a href="http://code.google.com/p/rhipe/">RHIPE</a> (R not on the nodes) on a 350TB/33 node Hadoop cluster.</p>
<p>4. I include some sample code to give a flavor of <a href="http://code.google.com/p/rhipe/">RHIPE</a>.</p>
<p style="text-align: justify;"><strong>Importing text data as Data Frames</strong></p>
<pre>map         &lt;- expression({
  ln        &lt;- strsplit(unlist(map.values),"\001")
  a         &lt;- do.call("rbind",ln)
  addonping &lt;- data.frame(ds=a[,1]
                         ,vers=a[,3]
                         ,sesssionrestored=as.numeric(a[,6])
                         ,src=rep("samo",length(a[,6]))
                         ,stringsAsFactors=FALSE)
  rhcollect(runif(1),addonping)
})
z &lt;- rhmr(map=map
          ,ifolder="/user/sguha/somequants"
          ,ofolder="/user/sguha/teledf/samo"
          ,zips="/user/sguha/Rfolder.tar.gz"
          ,inout=c("text","seq")
          ,mapred=list(mapred.reduce.tasks=120
             ,rhipe_map_buff_size=5000))
rhstatus(rhex(z,async=TRUE),mon.sec=4)</pre>
<p style="text-align: justify;"><strong>Creating Random Samples</strong></p>
<pre>map         &lt;- expression({
  y         &lt;- do.call('rbind', map.values)
  p         &lt;- 20000/1923725302
  for(i in 1:1000){
    zz      &lt;- runif(nrow(y)) &lt; p
    mu      &lt;- y[zz,,drop=FALSE]
    if(nrow(mu)&gt;0)
      rhcollect(i,mu)
  }
})
reduce      &lt;- expression(
    pre={ x &lt;- NULL}
    ,reduce = {
      x     &lt;- rbind(x,do.call('rbind',reduce.values))
    }
    ,post={ rhcollect(reduce.key,x) }
    )
z &lt;- rhmr(map=map,reduce=reduce
          ,ifolder="/user/sguha/teledfsubs/p*"
          ,ofolder="/user/sguha/televers/dfsample"
          ,inout=c('seq','seq')
          ,orderby='integer'
          ,partition=list(lims=1,type='integer')
          ,zips="/user/sguha/Rfolder.tar.gz"
          ,mapred=list(mapred.reduce.tasks=72
             ,rhipe_map_buff_size=20))
rhstatus(rhex(z,async=TRUE),mon.sec=5)</pre>
<p><strong>Run Models Across Samples</strong></p>
<pre>map        &lt;- expression({
  cuts     &lt;- unserialize(charToRaw(Sys.getenv("mcuts")))
  lapply(map.values, function(y){
    y$tval &lt;- sapply(y$sesssionrestored
                     ,function(r) {
                       if(is.na(r)) return( r)
                       max(min(r,cuts[2]),cuts[1])
                     })
    mdl    &lt;- lm(log(tval)~vers+src,data=y)
    rhcollect(NULL, summary(mdl))
  })})
z &lt;- rhmr(map=map
          ,ifolder="/user/sguha/televers/dfsample/p*"
          ,ofolder="/user/sguha/televers2"
          ,zips="/user/sguha/Rfolder.tar.gz"
          ,inout=c("seq","seq")
          ,mapred=list(mapred.reduce.tasks=0))
rhstatus(rhex(z,async=TRUE),mon.sec=4)</pre>
<p><strong>Computing Cell Counts For A Log Linear Model</strong></p>
<pre>cuts2                &lt;- wtd.quantile(tms$x,tms$n,
                                     p=seq(0,1,length=1000))
cuts2[1]             &lt;- cuts[1]
cuts2[length(cuts2)] &lt;- cuts[2]
map.count &lt;- expression({
  cuts       &lt;- unserialize(charToRaw(Sys.getenv("mcuts")))
  z          &lt;- do.call(rbind,map.values)
  z$tval     &lt;- sapply(z$sesssionrestored,function(r)
                  max(min(r,cuts[length(cuts)]),cuts[1]))
  z$sessCuts &lt;-
    factor(findInterval(z$tval,
                        cuts),ordered=TRUE)
  f          &lt;- split(z,list(z$vers,z$sessCuts,z$src),drop=FALSE)
  for(i in seq_along(f)){
    y &lt;-strsplit(names(f)[[i]],"\\.")[[1]]
    rhcollect(y,nrow(f[[i]])) }
})
z &lt;-
  rhmr(map=map.count,reduce=rhoptions()$templates$scalarsummer
       ,combiner=TRUE,
       ifolder="/user/sguha/teledfsubs/p*"
       ,ofolder="/user/sguha/telecells"
       ,zips="/user/sguha/Rfolder.tar.gz"
       ,inout=c("seq","seq") ,mapred=
       list(mapred.task.timeout=0
            ,rhipe_map_buff_size=40
            ,mcuts=rawToChar(serialize(cuts2, NULL,
                                ascii=TRUE))))</pre>
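<p>For readers who do not use R, the clipping and binning step of the cell count job can be sketched in plain Python. This is only an illustration with invented rows and bin edges, and <em>cell_counts</em> is a hypothetical helper, not part of RHIPE. The standard library's <em>bisect_right</em> plays the role of R's <em>findInterval</em> with default settings: both return the number of sorted edges that do not exceed the value.</p>

```python
from bisect import bisect_right
from collections import Counter

def cell_counts(rows, cuts):
    """Count rows per (version, startup bin, source) cell.

    `cuts` is a sorted list of bin edges. Values are first clipped to
    [cuts[0], cuts[-1]], then bisect_right gives the number of edges
    not exceeding the value, matching R's findInterval.
    """
    counts = Counter()
    for vers, src, t in rows:
        clipped = max(min(t, cuts[-1]), cuts[0])
        bin_id = bisect_right(cuts, clipped)
        counts[(vers, bin_id, src)] += 1
    return counts

# Invented rows and bin edges, for illustration only.
rows = [("8", "samo", 120.0), ("8", "samo", 4100.0), ("8", "tele", 3900.0),
        ("9", "tele", 9.9e6), ("9", "samo", 5200.0), ("8", "samo", 4500.0)]
cuts = [500.0, 4000.0, 5000.0, 60000.0]

for cell, n in sorted(cell_counts(rows, cuts).items()):
    print(cell, n)
```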
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2011/12/13/comparing-the-bias-in-telemetry-data-vs-the-typical-firefox-user/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Do 90% of People Not Use CTRL+F?</title>
		<link>http://blog.mozilla.com/metrics/2011/08/25/do-90-of-people-not-use-ctrlf/</link>
		<comments>http://blog.mozilla.com/metrics/2011/08/25/do-90-of-people-not-use-ctrlf/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 18:31:06 +0000</pubDate>
		<dc:creator>Diyang Tang</dc:creator>
				<category><![CDATA[results]]></category>
		<category><![CDATA[Test Pilot]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=5138</guid>
		<description><![CDATA[According to an article in The Atlantic floating around the internet, 90% of users don&#8217;t know how to use CTRL+F or Command+F to search a webpage. We were surprised at that percentage. Fortunately, Mozilla has TestPilot studies with open data, and we can see if Firefox users behave similarly. One relevant 7-day TestPilot study of [...]]]></description>
			<content:encoded><![CDATA[<p>According to an <a href="http://www.theatlantic.com/technology/archive/2011/08/crazy-90-percent-of-people-dont-know-how-to-use-ctrl-f/243840/" target="_blank">article</a> in <em>The Atlantic</em> floating around the internet, 90% of users don&#8217;t know how to use CTRL+F or Command+F to search a webpage. We were surprised at that percentage. Fortunately, Mozilla has <a href="https://testpilot.mozillalabs.com/" target="_blank"> TestPilot studies</a> with <a href="https://testpilot.mozillalabs.com/testcases/beta/aggregated-data.html" target="_blank">open data</a>, and we can see if Firefox users behave similarly. One relevant 7-day <a href="https://testpilot.mozillalabs.com/testcases/betaui2" target="_blank">TestPilot study</a> of about 69,000 Windows users focused on Firefox&#8217;s user interface. Along with seeing how users interacted with the navigation bar, their bookmarks, etc., the study looked at how often people used keyboard shortcuts.</p>
<p>What we found is that about <strong>81%</strong> of TestPilot users didn&#8217;t use CTRL+F during the course of the study. While 81% is lower than the 90% in the article, TestPilot users are usually more technologically experienced than the general population, since they are largely Firefox Beta users. When we look at TestPilot users who consider themselves beginners, the percentage goes up to 85%. Therefore, our 81% figure does not contradict the <em>Atlantic</em> piece.</p>
<p>In addition, those who use CTRL+F on average use keyboard shortcuts <strong>twice as much</strong> as those who don&#8217;t, even when we ignore those people who don&#8217;t use any keyboard shortcuts at all. This implies that people who use CTRL+F are more comfortable with keyboard shortcuts in general. The only keyboard shortcut the users who use CTRL+F lag behind in is Full Screen, or F11.</p>
<p>Feel free to take a look at <a href="https://testpilot.mozillalabs.com/testcases/beta/aggregated-data.html" target="_blank">the data</a> yourself and <a href="https://testpilot.mozillalabs.com/testcases/submit" target="_blank">let us know</a> about any interesting trends you discover!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2011/08/25/do-90-of-people-not-use-ctrlf/feed/</wfw:commentRss>
		<slash:comments>20</slash:comments>
		</item>
		<item>
		<title>Text mining users&#8217; definitions of browsing privacy</title>
		<link>http://blog.mozilla.com/metrics/2011/08/15/text-mining-users-definitions-of-browsing-privacy/</link>
		<comments>http://blog.mozilla.com/metrics/2011/08/15/text-mining-users-definitions-of-browsing-privacy/#comments</comments>
		<pubDate>Mon, 15 Aug 2011 18:32:11 +0000</pubDate>
		<dc:creator>Rebecca Weiss</dc:creator>
				<category><![CDATA[results]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=4960</guid>
		<description><![CDATA[One issue that’s been on everyone’s mind lately is privacy.  Privacy is extremely important to us at Mozilla, but it isn&#8217;t exactly clear how Firefox users define privacy.  For example, what do Firefox users consider to be essential privacy issues?  What features of a browsing experience lead users to consider a browser to be more [...]]]></description>
			<content:encoded><![CDATA[<p><span style="color: #000000;">One issue that’s been on everyone’s mind lately is privacy.  Privacy is <span style="color: #0000ff;"><a href="http://firstpersoncookie.wordpress.com/2011/01/12/mozillas-draft-privacy-data-operating-principles/"><span style="color: #0000ff;">extremely</span></a> <a href="https://wiki.mozilla.org/Privacy/Roadmap_2011#Operating_Principles:"><span style="color: #0000ff;">important</span></a></span> to us at Mozilla, but it isn&#8217;t exactly clear how Firefox users define privacy.  For example, what do Firefox users consider to be essential privacy issues?  What features of a browsing experience lead users to consider a browser to be more or less private?</span></p>
<div>
<p><span style="color: #000000;">To answer these questions, we asked users to give us their definitions of privacy, specifically <strong><em>privacy</em><em> while browsing</em></strong>.  The assumption was that users will have different definitions, but that there will be enough similarities between groups of responses that we could identify “themes” amongst the responses. By text mining user responses to an open-ended survey question asking for definitions of browsing privacy, we were able to identify themes directly from the users&#8217; mouths:</span></p>
<ol>
<li><span style="color: #000000;">Regarding  privacy issues, people know that tracking and browser history are  different issues, validating the need for browser features that address  these issues independently (&#8220;private browsing&#8221; and &#8220;do not track&#8221;)</span></li>
<li><span style="color: #000000;">People&#8217;s definition of personal information vary, but we can group people  according to the different ways they refer to personal information (this leads to a natural follow-up question; what makes some information more personal than others?)</span></li>
<li><span style="color: #000000;">Previous focus group research, contracted by Mozilla, showed that users are aware that spam indicates a  security risk, but what didn&#8217;t come out of the focus group research was that users also also consider spam to be an invasion of their privacy (a follow-up question, what do users define as “spam?”  Do they consider targeted ads to be spam?)<br />
</span></li>
<li><span style="color: #000000;">There are users who don&#8217;t distinguish privacy and security from each other</span></li>
</ol>
<h2 id="magicdomid12"><span style="color: #000000;">Some previous research on browsing and privacy</span></h2>
<p id="magicdomid14"><span style="color: #000000;">We  knew from our own focus group research that users are concerned about viruses, theft of their personal information and passwords, that a  website might misuse their information, that someone may track their  online “footprint”, or that their browser history is visible to others.   Users view things like targeted ads, spam, browser crashes, popups, and  windows imploring them to install updates as security risks.</span></p>
<p><span style="color: #000000;">But it&#8217;s difficult to broadly generalize findings from focus groups.  One group may or may not have the same concerns as the general population.  The quality of the discussion moderator, or some unique combination of participants,  the moderator, and/or the setting can also influence the findings you get from focus groups.</span></p>
<p><span style="color: #000000;">One way of validating the representativeness of focus group research is to use surveys.  But while surveys may increase the representativeness of your findings, they are not as flexible as focus groups.  You have to give survey respondents their answer options up front.  Therefore, by providing the options that a respondent can endorse, you are limiting their voice.</span></p>
<p><span style="color: #000000;">A typical  way to approach this problem in surveys is to use open-ended survey questions.  In the pre-data mining days, we would have to manually code  each of these survey responses: a first pass of all responses to get an idea of respondent “themes” or “topics” and a second pass to code each  response according to those themes.  This approach is costly in terms of time and effort, plus it also suffers from the problem of reproducibility; unless themes are extremely obvious, different coders might not classify a response as part of the same theme.  But with modern text mining methods, we can simulate this coding process much more quickly and reproducibly.</span></p>
<h2 id="magicdomid19"><span style="color: #000000;"><strong>Text mining open-ended survey questions</strong></span></h2>
<p><span style="color: #000000;">Because text mining is growing in popularity, primarily due to its computational feasibility, it’s important to review the methods in some detail.  Text mining, as with any machine learning-based approach, isn’t magic.  There are a number of caveats to make about the text mining approach used. First, the clustering algorithm I chose to use requires an arbitrary and a priori decision regarding the number of clusters.  I looked at 4 to 8 clusters and decided that 6 provided the best trade-off between themes expressed and redundancy.  Second, there is a random component to clustering, meaning that one clustering of the same set of data may not produce the exact same results as another clustering. Theoretically, there shouldn&#8217;t be tremendous differences between the themes expressed in one clustering over another, but it&#8217;s important to keep these details in mind.</span></p>
<p id="magicdomid20"><span style="color: #000000;">The general idea of text mining is to assume that you can represent documents as &#8220;bags of words&#8221;, that bags of words can be represented or coded quantitatively, and that the quantitative representation of text can be projected into a multi-dimensional space. For example, I can represent survey respondents in two dimensions, where each point is a respondent&#8217;s answer.  Points that are tightly clustered together mean that these responses are theoretically very similar with respect to lexical content (e.g., commonality of words).</span></p>
<div><span style="color: #000000;"><a href="http://blog.mozilla.com/metrics/files/2011/06/p_kmeans_cosdist_clusters.png"><span style="color: #000000;"><img class="size-full wp-image-4986 aligncenter" title="p_kmeans_cosdist_clusters" src="http://blog.mozilla.com/metrics/files/2011/06/p_kmeans_cosdist_clusters.png" alt="" width="480" height="480" /></span></a></span></div>
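<p>The bag-of-words idea can be made concrete with a few lines of Python. This is a minimal sketch, not the pipeline used for the survey: the answers are invented, and cosine similarity is assumed as the closeness measure (the analysis may have used a different one). The helpers <em>bag_of_words</em> and <em>cosine_similarity</em> are hypothetical names.</p>

```python
import math
import re
from collections import Counter

def bag_of_words(text):
    """Lower-case the text and count each word."""
    return Counter(re.findall(r"[a-z']+", text.lower()))

def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Invented survey answers, not taken from the actual responses.
answers = [
    "nobody can track the sites I visit",
    "sites should not track what I visit",
    "my personal information stays with me",
]
bags = [bag_of_words(a) for a in answers]
print(round(cosine_similarity(bags[0], bags[1]), 3))
print(round(cosine_similarity(bags[0], bags[2]), 3))
```

<p>The first two answers share four of their seven words and so score well above the third, which shares none with the first. A clustering algorithm working on such vectors would tend to group the first two together.</p>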
<p><span style="color: #000000;">I  also calculated a score that identifies the relative frequency of each word in a cluster, which is reflected in the size of the word on each  cluster’s graph.  In essence, the larger the word, the more it “defines”  the cluster (i.e. its location and shape in the space).</span></p>
<p><span style="color: #000000;"><a href="http://blog.mozilla.com/metrics/files/2011/06/p_kmeans_freq.png"><span style="color: #000000;"><img class="size-full wp-image-4988 aligncenter" title="p_kmeans_freq" src="http://blog.mozilla.com/metrics/files/2011/06/p_kmeans_freq.png" alt="" width="480" height="480" /></span></a></span></p>
<p><span style="color: #000000;">Higher resolution .pdf files of these graphs can be found <span style="color: #0000ff;"><a href="http://blog.mozilla.com/metrics/?attachment_id=4991"><span style="color: #0000ff;">here</span></a></span> and <span style="color: #0000ff;"><a href="http://blog.mozilla.com/metrics/?attachment_id=4990"><span style="color: #0000ff;">here</span></a></span>.</span></p>
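<p>The exact formula behind the word score is not given here. One simple reading, sketched below in Python with an invented cluster, is the share of all word occurrences in a cluster that each word accounts for. The <em>relative_frequencies</em> helper is a hypothetical illustration, not the score actually plotted.</p>

```python
from collections import Counter

def relative_frequencies(responses):
    """Share of all word occurrences in a cluster taken by each word."""
    counts = Counter(word for r in responses for word in r.lower().split())
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}

# A made-up cluster of three responses.
cluster = ["track my history", "do not track me", "history of sites"]
scores = relative_frequencies(cluster)
for word, score in sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:3]:
    print(word, round(score, 2))
```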
<h2><span style="color: #000000;"><strong>Cluster summaries</strong></span></h2>
<div id="magicdomid26">
<ul>
<li><span style="color: #000000;"><strong>&#8220;Privacy and Personal information&#8221;: </strong>Clusters  1, 4, and 5 are dominated by, unsurprisingly, concerns about  <strong>information</strong>.  What’s interesting are the lower-level associations  between the clusters and the words.  The largest, densest cluster  (cluster 4)  deals mostly with access to <strong>personal </strong>information whereas  cluster 1 addresses personal information as it relates to <strong>identity</strong>  issues (such as when banking).  Cluster 5 is subtly different from both 1  and 4.  The extra emphasis on &#8220;share&#8221; could imply that users have  different expectations of privacy with personal information that they <strong>explicitly</strong> choose to leak onto the web as opposed to personal information that they  aren&#8217;t aware they are expressing.  One area of further investigation would be to seek out user definitions on personal information; what makes some information more &#8220;personal&#8221; than others?</span></li>
</ul>
</div>
<ul>
<li><span style="color: #000000;"><strong>&#8220;Privacy and Tracking&#8221;: </strong>Cluster  6 clearly shows that people associate being tracked as a <strong>privacy</strong> issue.   The lower-scored words indicate what kind of tracked information  concerns them (e.g., keystrokes, cookies, site visits), but in general  the notion of “tracking” is paramount to respondents in this cluster.   Compare this with cluster 2, which is more strongly defined by the words  “look” and “history.”  This is obviously a reference to the role that <strong> browsing</strong> history has in defining privacy.  It&#8217;s interesting that these clusters are so distinct from each other, because it implies that users  are aware there is a difference between their browser history and other  behaviors they exhibit that could be tracked.  It&#8217;s also interesting  that users who consider browser history a privacy issue also consider  <strong>advertising and ads</strong> (presumably a reference to targeted ads) as privacy  issues as well.  We can use this information to extend the focus group  research on targeted ads; in addition to a security risk, some users  also view targeted ads as an invasion of privacy.  One interesting question naturally arises: do users differentiate between spam and  targeted advertisements?</span></li>
</ul>
<div id="magicdomid28">
<ul>
<li><span style="color: #000000;"><strong>&#8220;Privacy and Security&#8221;: </strong>The  weakest defined group is cluster 3, which can be interpreted in many ways.  The least controversial inference could be that these users simply don&#8217;t have a strong definition of privacy aside from a notion  that privacy is related to identity and security.  This validates a notion from our focus group research that some users really <strong>don&#8217;t differentiate</strong> between privacy and  security.</span></li>
</ul>
</div>
<h2 id="magicdomid29"><span style="color: #000000;"><strong>Final thoughts</strong></span></h2>
<p id="magicdomid30"><span style="color: #000000;">User  privacy and browser security are very important to us at Mozilla, and  developing a product that improves on both requires a deep and evolving  understanding of what those words mean to people of all communities - our entire user population.    In this post, we’ve shown how text mining can enhance our understanding  of pre-existing focus group research and generate novel directions for  further research. Moreover, we’ve also shown how it can provide insight into  users&#8217; perception by looking at the differences in the language they use  to define a concept.  In the next post, I&#8217;ll be using the same text  mining approach to evaluate user definitions of security while browsing  the web.</span></p>
</div>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2011/08/15/text-mining-users-definitions-of-browsing-privacy/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Why People Don’t Upgrade Their Browser – Part IV</title>
		<link>http://blog.mozilla.com/metrics/2010/04/21/why-people-don%e2%80%99t-upgrade-their-browser-%e2%80%93-part-iv/</link>
		<comments>http://blog.mozilla.com/metrics/2010/04/21/why-people-don%e2%80%99t-upgrade-their-browser-%e2%80%93-part-iv/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 19:14:35 +0000</pubDate>
		<dc:creator>Christopher Jung</dc:creator>
				<category><![CDATA[plans]]></category>
		<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=3436</guid>
		<description><![CDATA[Web technology moves quickly, and we at Mozilla do our best to balance our values of keeping users safe and secure with also ensuring that we are giving users the ability to make choices about the software on their computers. We can’t keep up with the cost of providing security updates to older versions forever, [...]]]></description>
			<content:encoded><![CDATA[<p>Web technology moves quickly, and we at Mozilla do our best to balance our values of keeping users safe and secure with also ensuring that we are giving users the ability to make choices about the software on their computers. We can’t keep up with the cost of providing security updates to older versions forever, though, so we’ll often try to encourage users to migrate to the latest version. A few weeks ago, we made another such offer to our Firefox 3 users, explaining that we were not planning on supporting that release anymore and asking them to upgrade to Firefox 3.6:</p>
<p><a href="http://www.mozilla.com/en-US/firefox/3.0.19/whatsnew/"><img src="http://blog.mozilla.com/metrics/files/2010/04/30019_Whatsnew_Page_v2.png" alt="" title="30019_Whatsnew_Page_v2" width="550" class="alignnone size-full wp-image-3592" /></a></p>
<p><a href="http://blog.mozilla.com/metrics/2010/01/28/why-people-dont-upgrade-their-browser-part-iii/">In the past</a>, soliciting feedback from users has helped us understand why users were opting to not upgrade, so this time around, people who clicked on the &#8220;No thanks&#8221; option above were directed to <a href="http://blog.mozilla.com/metrics/files/2010/04/survey_screenshot.png">a survey</a>.  Over 40,000 people were kind enough to share their thoughts with us.</p>
<p>So what do the results look like this time?</p>
<p>Let’s start by taking a look at responses to the first question:</p>
<p><a href="http://blog.mozilla.com/metrics/files/2010/04/Question-12.png"><img class="alignnone size-full wp-image-3498" title="Question 1" src="http://blog.mozilla.com/metrics/files/2010/04/Question-12.png" alt="" width="500" /></a></p>
<p>Over half of the respondents stated that they were simply content with Firefox 3. And compared to what we saw last survey, far fewer users selected “Other reason.” Still, a quarter of users took the time to give an alternative explanation, so exploring these answers can definitely help us gain additional insight. A list of the most common phrases entered into the “Other reason” text field is presented below:</p>
<p><img src="http://blog.mozilla.com/metrics/files/2010/04/question1_other_responses.png" alt="" title="question1_other_responses" width="237" height="401" class="alignnone size-full wp-image-3527" /></p>
<p>Two things immediately jump out:</p>
<ul>
<li>First, confusion over cost has virtually disappeared &#8212; this had been a problem <a href="http://blog.mozilla.com/metrics/2010/01/28/why-people-dont-upgrade-their-browser-part-iii/">previously</a>.</li>
<li>Second, the vast majority of comments are now about a lack of time.</li>
</ul>
<p>The lack of comments about costs is a pleasant surprise. Acting on feedback from the last survey, we revised the update prompt, highlighting that the upgrade is free in two places. It seems that this simple addition was enough to eliminate most of the confusion.</p>
<p>The update prompt itself may also partly account for the rise in “no time” comments. With prior upgrade initiatives, users saw an advertisement and button similar to what you see below:</p>
<p><img src="http://blog.mozilla.com/metrics/files/2010/04/MU_dialog_box.png" alt="" title="MU_dialog_box" width="456" height="403" class="aligncenter size-full wp-image-3535" /></p>
<p>Users could then click “Get the new version” to initiate a 30-second update procedure. This time, however, instead of seeing a typical software dialog box, people encountered the update prompt via the Firefox “What’s New” page and were presented with the customary green Firefox download button.  Hitting the button also initiated the update, but via the more involved process of downloading and installing a fresh version of the browser.</p>
<p>Pushing out the update in this way was a one-time situation, and we’ll revert to the usual process and software dialog box in the future. To alleviate this issue further, we should also perhaps add to the prompt some indication that updating Firefox is relatively quick and painless.</p>
<p>Next, let’s turn to Question 2. This was a free-form text box, so we manually read through a random 10% sample of the responses and sorted them into categories.</p>
<p><a href="http://blog.mozilla.com/metrics/files/2010/04/Question-21.png"><img class="aligncenter size-full wp-image-3565" title="Question 2" src="http://blog.mozilla.com/metrics/files/2010/04/Question-21.png" alt="" width="500" /></a></p>
<p>Firefox 3.6’s compatibility, both with add-ons and with specific websites/applications, remains a key issue (although user perception might be playing a small role as well). On the other hand, general stability and performance issues are cited considerably less often: crash and speed comments together account for 11% of responses, down from the 25% we saw last survey.</p>
<p>Our two main insights from above are also evident here. Cost comments have essentially disappeared from Question 2 as well, although they previously made up 7% of responses. And again, time concerns have become a real sticking point for users.</p>
<p>The rise of these “no time” responses, coupled with the still considerable “too many updates” category, has triggered suggestions that extend beyond prompt and wording revisions. For example, some propose that updates occur automatically in the background without the user noticing (à la Chrome), perhaps with an easy option to downgrade (unlike Chrome).</p>
<p>Clearly, we are a long way from making any major changes to the update process, but we’d love to hear your thoughts on any of this!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/04/21/why-people-don%e2%80%99t-upgrade-their-browser-%e2%80%93-part-iv/feed/</wfw:commentRss>
		<slash:comments>51</slash:comments>
		</item>
		<item>
		<title>Dramatic Stability Improvements in Firefox</title>
		<link>http://blog.mozilla.com/metrics/2010/04/08/dramatic-stability-improvements-in-firefox/</link>
		<comments>http://blog.mozilla.com/metrics/2010/04/08/dramatic-stability-improvements-in-firefox/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 16:32:43 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=3329</guid>
		<description><![CDATA[Over the past five months, Firefox has seen a 40% improvement in stability.  That’s incredible!  While the Firefox development community continues to focus its efforts on stability and performance (i.e., we are still striving for 100% improvement), we’re proud to highlight the most recent numbers. How did we calculate that 40% improvement?  We took a [...]]]></description>
			<content:encoded><![CDATA[<p>Over the past five months, Firefox has seen a <strong>40% improvement in stability</strong>.  That’s incredible!  While the Firefox development community continues to focus its efforts on stability and performance (i.e., we are still striving for 100% improvement), we’re proud to highlight the most recent numbers.</p>
<p>How did we calculate that 40% improvement?  We took a look at the <a href="http://crash-stats.mozilla.com" target="_blank">Crash Reports data</a>, along with <a href="http://crash-stats.mozilla.com/daily?p=Firefox&amp;v[]=" target="_blank">an estimated ratio</a> of daily crash instances to active daily users of Firefox.  With that data in hand, we first wanted to know how the number of daily crash instances has been trending by major version of Firefox, e.g., is Fx3.6 noticeably more stable than previous releases, and what does its trend look like?</p>
<p><img class="alignnone size-full wp-image-3331" title="chart_by_version4" src="http://blog.mozilla.com/metrics/files/2010/04/chart_by_version4.png" alt="" width="554" height="510" /></p>
<p>The chart above looks amazing… and it brought us to the next question… rolling up major versions, what does the total universe of Firefox stability look like?  Using the same numbers as seen above, we simply added them together to come up with the high-level metric – estimated ratio of daily crash instances to active daily users of Firefox.  The trend of the blue line below represents a 40% reduction since early November.  Spectacular!</p>
<p><img class="alignnone size-full wp-image-3353" title="aggregate_crash_trend" src="http://blog.mozilla.com/metrics/files/2010/04/aggregate_crash_trend.png" alt="" width="555" height="509" /></p>
<p>So, how exactly did this happen?</p>
<p>As mentioned <a href="http://blog.mozilla.com/metrics/2010/01/05/firefoxs-improving-stability/" target="_blank">previously</a>, improving Firefox stability was a top-level objective of the Mozilla community during the last few months of 2009.  That effort was led by a variety of folks across a wide breadth of the community… people who saw the many complaints about crashes, collected the data, figured out what was causing the most pain, addressed those issues (sometimes in code, sometimes by working with partners) and saw product improvements.</p>
<p>And in addition to hitting a home run with Fx3.6 and reducing the crashiness of Fx3.5, a majority of Firefox users quickly migrated to Fx3.6… adding all that up drove what you see in the charts above.</p>
<p>What’s next?</p>
<p>There are still more improvements to come.  For example, with a significant percentage of crashes in Flash, a lot of work around stability improvement for Firefox users is currently going into <a href="http://labs.adobe.com/downloads/flashplayer10.html" target="_blank">Flash Player 10.1</a>.  If you&#8217;re personally experiencing any crashiness issues when using Firefox, please make sure you’re on the <a href="http://www.firefox.com" target="_blank">latest version</a> of Firefox and feel free to use tools like <a href="https://bugzilla.mozilla.org/" target="_blank">bugzilla</a> and <a href="http://hendrix.mozilla.org/" target="_blank">hendrix</a> to notify the community.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/04/08/dramatic-stability-improvements-in-firefox/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>Firefox 3.6 Upgrade Offer &#8211; An Early Success</title>
		<link>http://blog.mozilla.com/metrics/2010/03/17/firefox-3-6-upgrade-offer-an-early-success/</link>
		<comments>http://blog.mozilla.com/metrics/2010/03/17/firefox-3-6-upgrade-offer-an-early-success/#comments</comments>
		<pubDate>Wed, 17 Mar 2010 23:04:18 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=2633</guid>
		<description><![CDATA[Late last week, Mozilla pushed a Firefox 3.6 upgrade offer to people on older versions of Firefox.  Here was the actual offer: What has been the impact in just a few short days? Looking at the chart below, you’ll notice that the percentage of all Firefox users on Firefox 3.6 increased dramatically in recent days.  [...]]]></description>
			<content:encoded><![CDATA[<p>Late last week, Mozilla pushed a Firefox 3.6 <a href="https://developer.mozilla.org/devnews/index.php/2010/03/11/upgrade-offer-to-be-issued-to-firefox-3-and-firefox-3-5-users/" target="_blank">upgrade offer to people on older versions</a> of Firefox.  Here was the actual offer:</p>
<p><img class="alignnone size-full wp-image-2634" title="3.5-to-3.6-major-update" src="http://blog.mozilla.com/metrics/files/2010/03/3.5-to-3.6-major-update.png" alt="" width="467" height="412" /></p>
<p>What has been the impact in just a few short days?</p>
<p>Looking at the chart below, you’ll notice that the percentage of all Firefox users on Firefox 3.6 increased dramatically in recent days.  Late last week, the percentage of users on 3.6 stood at roughly 23%, and as of yesterday that number had climbed to 43% (btw, that 20-percentage-point pickup translates to roughly 75 million people who made the switch &#8212; that&#8217;s huge!).  Past major update offers have generally converted in the ballpark of 10% to 20% of users, so this most recent push has definitely had a substantial impact.</p>
<p><img class="alignnone size-full wp-image-2635" title="Fx_36_Billboard_Effect2" src="http://blog.mozilla.com/metrics/files/2010/03/Fx_36_Billboard_Effect2.png" alt="" width="552" height="475" /></p>
<p>And below is one further way to interpret these recent numbers.  Looking at the launches of Firefox 3, Firefox 3.5, and Firefox 3.6, we considered how quickly each latest &amp; greatest grabbed usage share from its predecessor.  You’ll notice that the shapes of the 3.0 and 3.6 curves (blue and green) look almost identical… the only difference being the timing of the first offer/advertisement being pushed (note: there was an issue with our early 3.5 update offers, hence the red curve seems to be missing the same upward spike).</p>
<p><img class="alignnone size-full wp-image-2636" title="Fx_36_Adoption_Comparison" src="http://blog.mozilla.com/metrics/files/2010/03/Fx_36_Adoption_Comparison.png" alt="" width="540" height="505" /></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/03/17/firefox-3-6-upgrade-offer-an-early-success/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Why Do Firefox Downloads Spike on Release Days?</title>
		<link>http://blog.mozilla.com/metrics/2010/02/19/why-do-firefox-downloads-spike-on-release-days/</link>
		<comments>http://blog.mozilla.com/metrics/2010/02/19/why-do-firefox-downloads-spike-on-release-days/#comments</comments>
		<pubDate>Fri, 19 Feb 2010 23:22:52 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[process]]></category>
		<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=2392</guid>
		<description><![CDATA[As Daniel pointed out, there has always been a dramatic increase in fresh downloads/installs of Firefox at the time of each minor version release – separate from people simply being updated.  We’ve never entirely understood this user behavior until Daniel started some digging yesterday.  Here’s what we know… Yesterday (just after 3.0.18 and 3.5.8 were [...]]]></description>
			<content:encoded><![CDATA[<p>As Daniel <a href="http://blog.mozilla.com/data/2010/02/18/firefoxdownloadsonreleaseday/" target="_blank">pointed out</a>, there has always been a dramatic increase in fresh downloads/installs of Firefox at the time of each minor version release – separate from people simply being updated.  We’ve never entirely understood this user behavior until Daniel started some digging yesterday.  Here’s what we know…</p>
<ul>
<li>Yesterday (just after 3.0.18 and 3.5.8 were released) we saw a spike in fresh downloads/installs of Firefox.  The typical daily number is in the ballpark of 2 Million and yesterday it shot up to over 4 Million.</li>
<li>As Daniel <a href="http://blog.mozilla.com/data/2010/02/18/firefoxdownloadsonreleaseday/" target="_blank">highlighted</a>, nearly all of the download activity was for Firefox 3.6.</li>
<li>Digging a little deeper, we also discovered that the entire spike in 3.6 downloads was coming from people on Firefox 3.5.8.  This means people successfully got the update yesterday (3.5.8), and then went out of their way to manually do one further update (i.e., get 3.6).</li>
</ul>
<p>Why or how is this happening?</p>
<p>It turns out that the answer was right under our nose.   When people get an update, they see an <a href="http://en-us.www.mozilla.com/en-US/firefox/3.6/whatsnew/" target="_blank">update page</a>.  And if they’re not on the current major version (e.g., 3.6), the <a href="http://en-us.www.mozilla.com/en-US/firefox/3.5.8/whatsnew/" target="_blank">page suggests</a> that they go and download the latest and greatest.   So, what happened within this user interaction yesterday?</p>
<p><img class="alignnone size-full wp-image-2393" title="blog_post_358_36_downloads" src="http://blog.mozilla.com/metrics/files/2010/02/blog_post_358_36_downloads.png" alt="blog_post_358_36_downloads" width="515" height="725" /></p>
<p>So, this explains a common experience for millions of users each time a Firefox update is shipped.  And it’s good to see the messaging on that 3.5.8 update page (and all older update pages) is paying off.  Perhaps we should consider changing the concept of those pages to be even more aggressive in getting people to update to the latest and greatest.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/02/19/why-do-firefox-downloads-spike-on-release-days/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>An Improved Experience for New Users of Firefox</title>
		<link>http://blog.mozilla.com/metrics/2010/02/09/an-improved-experience-for-new-users-of-firefox/</link>
		<comments>http://blog.mozilla.com/metrics/2010/02/09/an-improved-experience-for-new-users-of-firefox/#comments</comments>
		<pubDate>Tue, 09 Feb 2010 22:50:15 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[plans]]></category>
		<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=2341</guid>
		<description><![CDATA[Over the past year, we set out to identify and solve any possible pain points that might arise during a person’s experience downloading and installing Firefox (previous posts are here, here, here, here, and here).  Thanks to feedback from users, and some resulting product changes, we can now safely say that there are no issues [...]]]></description>
			<content:encoded><![CDATA[<p>Over the past year, we set out to identify and solve any possible pain points that might arise during a person’s experience downloading and installing Firefox (previous posts are <a href="http://blog.mozilla.com/metrics/2009/03/31/why-people-dont-install-firefox-part-i/" target="_blank">here</a>, <a href="http://blog.mozilla.com/metrics/2009/04/02/why-people-dont-install-firefox-part-ii/" target="_blank">here</a>, <a href="http://blog.mozilla.com/metrics/2009/04/06/why-people-dont-install-firefox-part-iii/" target="_blank">here</a>, <a href="http://blog.mozilla.com/metrics/2009/07/30/an-improved-experience-for-2000000-non-firefox-users/" target="_blank">here</a>, and <a href="http://blog.mozilla.com/metrics/2009/08/03/more-changes-coming-to-the-firefox-installer/" target="_blank">here</a>).  Thanks to feedback from users, and some resulting product changes, we can now safely say that there are no issues confronting new users when installing Firefox for the very first time.</p>
<p>How do we know this?</p>
<p>Last week, we re-ran our installer feedback mechanism for a short period of time.  If a user clicked “cancel” while walking through the Firefox installer, they were asked if they wanted to provide feedback.</p>
<p><img class="alignnone size-full wp-image-2342" title="cancel_step1_blog" src="http://blog.mozilla.com/metrics/files/2010/02/cancel_step1_blog.png" alt="cancel_step1_blog" width="508" height="391" /></p>
<p>After making an initial round of product improvements based on our <a href="http://blog.mozilla.com/metrics/2009/04/06/why-people-dont-install-firefox-part-iii/" target="_blank">first time feedback</a> (March &#8217;09), here are the transformed feedback results from our more recent efforts (both <a href="http://blog.mozilla.com/metrics/2009/07/30/an-improved-experience-for-2000000-non-firefox-users/" target="_blank">July</a> <a href="http://blog.mozilla.com/metrics/2009/08/03/more-changes-coming-to-the-firefox-installer/" target="_blank">2009</a> and last week):</p>
<p><img class="alignnone size-full wp-image-2360" title="pie_comparison2" src="http://blog.mozilla.com/metrics/files/2010/02/pie_comparison2.png" alt="pie_comparison2" width="554" height="474" /></p>
<p>While we still have plans to tackle the remaining big slice of the pie (see concluding paragraph), we were able to successfully solve the red and green pie slices from last time.  In our latest feedback results (pie on the right), the big pie slice now represents nearly 100% of the total feedback (the previously seen categories virtually evaporated).  One way to interpret this is that we’ve now successfully identified and resolved 3 of the top 4 issues originally encountered by users.</p>
<p>Here were the specific actions we took addressing those red and green slices (details are in bug <a href="https://bugzilla.mozilla.org/show_bug.cgi?id=508684" target="_blank">508684</a>):</p>
<p><strong>Don’t Want Firefox as Default</strong></p>
<p>People indicating this issue were missing the selection option earlier in the installation process, arrived at the end, and mistakenly believed that we were making Firefox their default without being given a choice.  So, we added the choice to the final step in the installer:</p>
<p><img class="alignnone size-full wp-image-2345" title="installer_default_choice_blog" src="http://blog.mozilla.com/metrics/files/2010/02/installer_default_choice_blog.png" alt="installer_default_choice_blog" width="508" height="394" /></p>
<p><strong>Confusion About Updating-Upgrading-Installing</strong></p>
<p>We did a few different things to help address this area of confusion.  First, we <a href="http://www.mozilla.com/en-US/firefox/update/" target="_blank">added content</a> to mozilla.com and prominently displayed it on the main Firefox product pages seen by existing users:</p>
<p><a href="http://www.mozilla.com/en-US/firefox/upgrade.html" target="_blank"><img class="alignnone size-full wp-image-2346" style="border: 1px solid black;" title="upgradehtml_blog" src="http://blog.mozilla.com/metrics/files/2010/02/upgradehtml_blog.png" alt="upgradehtml_blog" width="547" height="374" /></a></p>
<p><a href="http://www.mozilla.com/en-US/firefox/personal.html" target="_blank"><img class="alignnone size-full wp-image-2347" style="border: 1px solid black;" title="personalhtml_blog" src="http://blog.mozilla.com/metrics/files/2010/02/personalhtml_blog.png" alt="personalhtml_blog" width="547" height="433" /></a></p>
<p>Second, within the Firefox installer, we changed the Install button to say “Upgrade” instead of &#8220;Install&#8221;:</p>
<p><img class="alignnone size-full wp-image-2348" title="Installer_Upgrade_button_blog" src="http://blog.mozilla.com/metrics/files/2010/02/Installer_Upgrade_button_blog.png" alt="Installer_Upgrade_button_blog" width="512" height="396" /></p>
<p>Thanks to Rob Strong, the Firefox team, the Funnelcake team, John Slater, and Laura Mesa, among others, for implementing the changes highlighted above.</p>
<p>Lastly, there remains one outstanding problem for installers of Firefox – “it tells me to close Fx, but it’s not open” (the big pie slice in the charts above).  This issue affects people who already have Firefox and are attempting to reinstall it, and as we’ve noted previously, this cohort becomes fairly frustrated during the experience.    Some fixes are starting to be contemplated (e.g., bugs <a href="https://bugzilla.mozilla.org/show_bug.cgi?id=496207" target="_blank">496207</a>, <a href="https://bugzilla.mozilla.org/show_bug.cgi?id=544356" target="_blank">544356</a>)… and I’ll make sure to talk more here once some progress is made.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/02/09/an-improved-experience-for-new-users-of-firefox/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Why People Don&#8217;t Upgrade Their Browser &#8211; Part III</title>
		<link>http://blog.mozilla.com/metrics/2010/01/28/why-people-dont-upgrade-their-browser-part-iii/</link>
		<comments>http://blog.mozilla.com/metrics/2010/01/28/why-people-dont-upgrade-their-browser-part-iii/#comments</comments>
		<pubDate>Thu, 28 Jan 2010 18:36:36 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[plans]]></category>
		<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=2290</guid>
		<description><![CDATA[Mozilla recently advertised a Firefox 3.5 upgrade to users of Firefox 3 (this is also referred to as a “major update prompt”) in an effort to migrate people to the latest version of Firefox.  As of earlier this month, about 32% of all Firefox users were still on a version of Fx3, and as [...]]]></description>
			<content:encoded><![CDATA[<p>Mozilla recently advertised a Firefox 3.5 upgrade to users of Firefox 3 (this is also referred to as a “major update prompt”) in an effort to migrate people to the latest version of Firefox.  As of earlier this month, about 32% of all Firefox users were still on a version of Fx3, and as a result of the mid-January push, that number is now down to about 22%.</p>
<p>As a side benefit to this initiative, we also took the opportunity to see what feedback people had, specifically asking users to tell us what was on their mind if they were choosing not to upgrade.  We did this <a href="http://blog.mozilla.com/metrics/2009/08/21/why-people-dont-upgrade-their-browser-part-i/" target="_blank">once before</a> when upgrading users from Fx2 to Fx3, and <a href="http://blog.mozilla.com/metrics/2009/08/21/why-people-dont-upgrade-their-browser-part-i/" target="_blank">the results</a> were extremely impactful, so we wanted to continue this once again as part of our broader user outreach efforts.</p>
<p>For people interested in leaving feedback, here is the survey they saw:</p>
<p><img class="alignnone size-full wp-image-2291" style="border: 1px solid black;" title="survey_screenshot2" src="http://blog.mozilla.com/metrics/files/2010/01/survey_screenshot2.png" alt="survey_screenshot2" width="568" height="578" /></p>
<p>A little more than 5,000 people were kind enough to share their thoughts.  In turning to the results, let’s start with question #1:</p>
<p><img class="alignnone size-full wp-image-2292" title="MU_survey_results_summary2" src="http://blog.mozilla.com/metrics/files/2010/01/MU_survey_results_summary2.png" alt="MU_survey_results_summary2" width="512" height="499" /></p>
<p>The most surprising insight above is that 53% of respondents selected the “Other reason” check box.  That suggests that the proposed answers we listed were found somewhat unsatisfactory and that people had other ideas on their mind.  Clearly, understanding “Other reason” and seeing what users said within question #2 should provide us with much more insight than the chart above.</p>
<p>Below is a list of the most common phrases people typed into the “Other reason” box.  What’s most surprising here?  The vast majority of comments are about cost, i.e., “is this upgrade free?”.  Taking this insight and turning it into action, we’re planning to make clear that “Firefox is free” within future upgrade prompts/advertisements.</p>
<p><img class="alignnone size-full wp-image-2293" title="Other_response_field2" src="http://blog.mozilla.com/metrics/files/2010/01/Other_response_field2.png" alt="Other_response_field2" width="293" height="363" /></p>
<p>Next, let’s look at how people responded to question #2.  It’s a free-form text box, so we manually read through comments, sorting them into different categories:</p>
<p><img class="alignnone size-full wp-image-2294" title="open_ended_comments" src="http://blog.mozilla.com/metrics/files/2010/01/open_ended_comments.png" alt="open_ended_comments" width="540" height="643" /></p>
<p>One easy way to interpret this pie chart is to compare it with what we saw <a href="http://blog.mozilla.com/metrics/2009/08/21/why-people-dont-upgrade-their-browser-part-i/" target="_blank">last time</a> (when users were upgrading from Fx2 to Fx3).  UI-related comments have almost vanished.  On the other hand, add-on and extension compatibility comments (Norton was far and away the #1 cited) and crash comments have both risen dramatically in their share of the pie.  For “not compatible with specific website”, Facebook and specific Google pages (e.g., calendar) seemed to be the most frequently mentioned.</p>
<p>Moving forward, it will be critical that we acknowledge and address the concerns faced by these users.  We’ve been working hard in recent months to reduce the crashiness of Firefox, and some <a href="http://blog.mozilla.com/metrics/2010/01/05/firefoxs-improving-stability/" target="_blank">positive results</a> are already evident.  For addressing the add-on and toolbar compatibility issues, the Firefox Support team has been raising the visibility of the <a href="https://wiki.mozilla.org/Firefox/DeliveryMeetings/2010-01-27#Support" target="_blank">top extension issues cited by users</a> (Norton, Roboform, etc.).</p>
<p>And how do we properly communicate all of this the next time we advertise a major upgrade?  We currently highlight these three bullet points:</p>
<ul>
<li> Twice as fast as Firefox 3.</li>
<li> Private browsing, tear-off tabs and more.</li>
<li> The most advanced Firefox yet.</li>
</ul>
<p>The last two should probably be changed to “This upgrade is free” and “Improved stability, fewer crashes.”  Your thoughts?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/01/28/why-people-dont-upgrade-their-browser-part-iii/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>People in France and Australia Are Also Switching Browsers</title>
		<link>http://blog.mozilla.com/metrics/2010/01/21/people-in-france-and-australia-are-also-switching-browsers/</link>
		<comments>http://blog.mozilla.com/metrics/2010/01/21/people-in-france-and-australia-are-also-switching-browsers/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 04:02:11 +0000</pubDate>
		<dc:creator>Ken Kovash</dc:creator>
				<category><![CDATA[results]]></category>

		<guid isPermaLink="false">http://blog.mozilla.com/metrics/?p=2242</guid>
		<description><![CDATA[After last week&#8217;s warning from the German government against the use of Internet Explorer, the governments of France and Australia followed suit earlier this week. Similar to our analysis of the impact in Germany, we wanted to see what happened this week with IE users downloading Firefox in both France and Australia.  There are a [...]]]></description>
			<content:encoded><![CDATA[<p>After last week&#8217;s <a href="http://blog.mozilla.com/metrics/2010/01/19/people-in-germany-are-switching-browsers/" target="_blank">warning from the German government</a> against the use of Internet Explorer, the governments of <a href="http://mashable.com/2010/01/18/france-against-internet-explorer/" target="_blank">France</a> and <a href="http://www.abc.net.au/news/stories/2010/01/19/2795684.htm" target="_blank">Australia</a> followed suit earlier this week.</p>
<p>Similar to our <a href="http://blog.mozilla.com/metrics/2010/01/19/people-in-germany-are-switching-browsers/" target="_blank">analysis</a> of the impact in Germany, we wanted to see what happened this week with IE users downloading Firefox in both France and Australia.  There are a couple of patterns you&#8217;ll notice in the pictures below:</p>
<ul>
<li>The rate at which IE users downloaded Firefox roughly doubled in each country.  That’s a huge increase!</li>
<li>As we mentioned last time, the shaded orange areas are meant to represent the incremental number of downloads each day that are above what we would have expected on those days (i.e., they are downloads that can be described as directly attributable to the government warnings).</li>
<li>The cumulative orange area for France equates to about 60,000 downloads and the cumulative area for Australia translates to about 35,000 downloads.</li>
</ul>
<p><strong>France:</strong></p>
<p><img class="alignnone size-full wp-image-2243" title="Impact_from_France" src="http://blog.mozilla.com/metrics/files/2010/01/Impact_from_France.png" alt="Impact_from_France" width="471" height="462" /></p>
<p><strong>Australia:</strong></p>
<p><strong><img class="alignnone size-full wp-image-2247" title="Impact_from_Australia" src="http://blog.mozilla.com/metrics/files/2010/01/Impact_from_Australia1.png" alt="Impact_from_Australia" width="463" height="450" /><br />
</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.mozilla.com/metrics/2010/01/21/people-in-france-and-australia-are-also-switching-browsers/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

