<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for Blog of Data</title>
	<atom:link href="http://blog.mozilla.com/data/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.mozilla.com/data</link>
	<description>Mozilla metrics team&#039;s technical articles</description>
	<lastBuildDate>Fri, 28 Aug 2009 18:54:26 -0700</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by peter</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-113</link>
		<dc:creator>peter</dc:creator>
		<pubDate>Fri, 28 Aug 2009 18:54:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-113</guid>
		<description>A prime example for the usage of hadoop. Great!
But I don&#039;t think, that the conclusions you gained are reliable. Maybe FF should introduce an anonymous ID for each program, so the issues with multiple users behind an IP or changing ones are solved.
But even then, we can&#039;t be sure, about the spread of particular addons, since update pings maybe skipped a day (People should take a walk on weekends and not browse the internet... ;o) So there wouldn&#039;t be an update check right?)</description>
		<content:encoded><![CDATA[<p>A prime example for the usage of hadoop. Great!<br />
But I don&#8217;t think, that the conclusions you gained are reliable. Maybe FF should introduce an anonymous ID for each program, so the issues with multiple users behind an IP or changing ones are solved.<br />
But even then, we can&#8217;t be sure, about the spread of particular addons, since update pings maybe skipped a day (People should take a walk on weekends and not browse the internet&#8230; ;o) So there wouldn&#8217;t be an update check right?)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by deinspanjer</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-106</link>
		<dc:creator>deinspanjer</dc:creator>
		<pubDate>Wed, 19 Aug 2009 16:03:49 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-106</guid>
		<description>These &quot;versioncheck&quot; pings normally happen on a 24 hour timer.  There is a slight complication though in that when a user is updating Firefox itself, there are a couple of additional add-on versioncheck pings to determine compatibility with the Firefox version being installed, and if add-ons are updated, then the new versions of them ping again.</description>
		<content:encoded><![CDATA[<p>These &#8220;versioncheck&#8221; pings normally happen on a 24 hour timer.  There is a slight complication though in that when a user is updating Firefox itself, there are a couple of additional add-on versioncheck pings to determine compatibility with the Firefox version being installed, and if add-ons are updated, then the new versions of them ping again.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by SteveL</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-105</link>
		<dc:creator>SteveL</dc:creator>
		<pubDate>Wed, 19 Aug 2009 13:37:21 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-105</guid>
		<description>We in the Hadoop project are always pleased to see other OSS projects doing interesting stuff with our code, thank you for the writeup. And no, that hadoop on ubuntu wiki page is still lagging -we should really do our own .deb files and so have much simpler setup instructions: &quot;select Apache Hadoop on Synaptic&quot;.

A big issue for the stats is &quot;how often are updates checked for&quot;; without data on that, you can&#039;t be sure what the numbers really mean -interesting as they are.</description>
		<content:encoded><![CDATA[<p>We in the Hadoop project are always pleased to see other OSS projects doing interesting stuff with our code, thank you for the writeup. And no, that hadoop on ubuntu wiki page is still lagging -we should really do our own .deb files and so have much simpler setup instructions: &#8220;select Apache Hadoop on Synaptic&#8221;.</p>
<p>A big issue for the stats is &#8220;how often are updates checked for&#8221;; without data on that, you can&#8217;t be sure what the numbers really mean -interesting as they are.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by city_zen</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-101</link>
		<dc:creator>city_zen</dc:creator>
		<pubDate>Mon, 17 Aug 2009 05:09:45 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-101</guid>
		<description>Thanks for publishing this.
I&#039;m surprised that the number of users with add-ons installed is not HIGHER. I mean, about 50% of Firefox users don&#039;t have EVEN ONE add-on installed? Hard to believe. I think I would use another browser if Firefox didn&#039;t have add-ons to customize it. Don&#039;t take me wrong, Firefox is a great browser, but for me it&#039;s the add-ons that push it over any other browser out there.
Would it be possible to publish some more information about the usage patterns that you found? Starting with 28 GB of data, there must be more than 3 or 4 bullet points worth of output info, right? ;)
A few examples that maybe are available to you after all that processing (great job, btw):
- Top ten most installed add-ons
- Percentage of users with more than 10 add-ons installed
- Maximum number of add-ons installed (don&#039;t look at me, I ONLY have a few dozens installed :D )
- Distribution by country
- OS used
etc.

Thank you</description>
		<content:encoded><![CDATA[<p>Thanks for publishing this.<br />
I&#8217;m surprised that the number of users with add-ons installed is not HIGHER. I mean, about 50% of Firefox users don&#8217;t have EVEN ONE add-on installed? Hard to believe. I think I would use another browser if Firefox didn&#8217;t have add-ons to customize it. Don&#8217;t take me wrong, Firefox is a great browser, but for me it&#8217;s the add-ons that push it over any other browser out there.<br />
Would it be possible to publish some more information about the usage patterns that you found? Starting with 28 GB of data, there must be more than 3 or 4 bullet points worth of output info, right? ;)<br />
A few examples that maybe are available to you after all that processing (great job, btw):<br />
- Top ten most installed add-ons<br />
- Percentage of users with more than 10 add-ons installed<br />
- Maximum number of add-ons installed (don&#8217;t look at me, I ONLY have a few dozens installed :D )<br />
- Distribution by country<br />
- OS used<br />
etc.</p>
<p>Thank you</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Update: Bugzilla SQR by deinspanjer</title>
		<link>http://blog.mozilla.com/data/2009/08/14/update-bugzilla-sqr/comment-page-1/#comment-99</link>
		<dc:creator>deinspanjer</dc:creator>
		<pubDate>Sat, 15 Aug 2009 16:32:14 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=140#comment-99</guid>
		<description>http://lmgtfy.com/?q=ETL
SQR is a tough one.  It stands for Software Quality Reports.  It was the original name for the project that allows us to generate useful reports on Bugzilla data.
Here you go. ;)</description>
		<content:encoded><![CDATA[<p><a href="http://lmgtfy.com/?q=ETL" rel="nofollow">http://lmgtfy.com/?q=ETL</a><br />
SQR is a tough one.  It stands for Software Quality Reports.  It was the original name for the project that allows us to generate useful reports on Bugzilla data.<br />
Here you go. ;)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Update: Bugzilla SQR by Mossop</title>
		<link>http://blog.mozilla.com/data/2009/08/14/update-bugzilla-sqr/comment-page-1/#comment-98</link>
		<dc:creator>Mossop</dc:creator>
		<pubDate>Sat, 15 Aug 2009 09:27:07 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=140#comment-98</guid>
		<description>SQL, ETL, all sounds very exciting, if only there was some explanation of what it means.</description>
		<content:encoded><![CDATA[<p>SQL, ETL, all sounds very exciting, if only there was some explanation of what it means.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by skrueger</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-96</link>
		<dc:creator>skrueger</dc:creator>
		<pubDate>Wed, 12 Aug 2009 18:21:53 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-96</guid>
		<description>Ingo:  I see how frequently changing IPs is an issue, and I did not think of this problem while developing.  But I don&#039;t think this is a large problem because I only think that this could make our distinct IP smaller than it actually is since the FIrefox Add-on Manager will ping once in a 24hour time window.  The problem I am thinking of could happen when user A from IP x.x.x.x pings for updates and then loses/changes his IP.  Then user B picks up IP x.x.x.x and pings for updates.  In this case 2 distinct users (not behind the same router) pinged for updates but only one IP was counted.  However, I am not certain on how often this case happens.</description>
		<content:encoded><![CDATA[<p>Ingo:  I see how frequently changing IPs is an issue, and I did not think of this problem while developing.  But I don&#8217;t think this is a large problem because I only think that this could make our distinct IP smaller than it actually is since the FIrefox Add-on Manager will ping once in a 24hour time window.  The problem I am thinking of could happen when user A from IP x.x.x.x pings for updates and then loses/changes his IP.  Then user B picks up IP x.x.x.x and pings for updates.  In this case 2 distinct users (not behind the same router) pinged for updates but only one IP was counted.  However, I am not certain on how often this case happens.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by Ingo Lütkebohle</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-94</link>
		<dc:creator>Ingo Lütkebohle</dc:creator>
		<pubDate>Wed, 12 Aug 2009 09:57:41 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-94</guid>
		<description>Thanks for writing up your modeling approach, that is very good example material for using hadoop!

I have a question about your ip-to-user estimate: At least in Germany, many users do not keep an IP address for more than 24 hours. This is because almost all consumer broadband/DSL providers disconnect them about once a day. Routers immediately reconnect, but get assigned a different IP then. Do you think this is of relevance for the number of users per IP and if yes, have you accounted for it?</description>
		<content:encoded><![CDATA[<p>Thanks for writing up your modeling approach, that is very good example material for using hadoop!</p>
<p>I have a question about your ip-to-user estimate: At least in Germany, many users do not keep an IP address for more than 24 hours. This is because almost all consumer broadband/DSL providers disconnect them about once a day. Routers immediately reconnect, but get assigned a different IP then. Do you think this is of relevance for the number of users per IP and if yes, have you accounted for it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Tracking down the number of Firefox Addon users with hadoop by Wladimir Palant</title>
		<link>http://blog.mozilla.com/data/2009/08/10/tracking-down-the-number-of-firefox-addon-users-with-hadoop/comment-page-1/#comment-87</link>
		<dc:creator>Wladimir Palant</dc:creator>
		<pubDate>Mon, 10 Aug 2009 23:02:36 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=108#comment-87</guid>
		<description>Interesting results. This seems to suggest that roughly 50% of all Firefox users have add-ons installed. If we then take 36 million users pinging AMO on a day (at least that&#039;s how I interpret your numbers) and 8730182 pings for Adblock Plus on that day - we get that 12% of all Firefox users are using Adblock Plus. Finally a way to put ADU numbers in relation, the result is significantly higher than my previous estimates however. One possible explanation is that users who have add-ons installed typically use their browser more frequently and send out more pings than those who don&#039;t. This would make it difficult to compare ADU numbers between Firefox and add-ons, add-on users are probably significantly fewer than 50%.</description>
		<content:encoded><![CDATA[<p>Interesting results. This seems to suggest that roughly 50% of all Firefox users have add-ons installed. If we then take 36 million users pinging AMO on a day (at least that&#8217;s how I interpret your numbers) and 8730182 pings for Adblock Plus on that day &#8211; we get that 12% of all Firefox users are using Adblock Plus. Finally a way to put ADU numbers in relation, the result is significantly higher than my previous estimates however. One possible explanation is that users who have add-ons installed typically use their browser more frequently and send out more pings than those who don&#8217;t. This would make it difficult to compare ADU numbers between Firefox and add-ons, add-on users are probably significantly fewer than 50%.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Shell script analytics by Arthur</title>
		<link>http://blog.mozilla.com/data/2009/07/29/shell-script-analytics/comment-page-1/#comment-76</link>
		<dc:creator>Arthur</dc:creator>
		<pubDate>Thu, 30 Jul 2009 07:49:12 +0000</pubDate>
		<guid isPermaLink="false">http://blog.mozilla.com/data/?p=99#comment-76</guid>
		<description>@Jesse: ~75% Intel vs. ~25% PPC</description>
		<content:encoded><![CDATA[<p>@Jesse: ~75% Intel vs. ~25% PPC</p>
]]></content:encoded>
	</item>
</channel>
</rss>
