<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: On the dangers of spidering badly</title>
	<atom:link href="http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly/feed" rel="self" type="application/rss+xml" />
	<link>http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly</link>
	<description>Pontification without all the gritty gravitas</description>
	<pubDate>Sun, 20 Jul 2008 00:11:43 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Nemo</title>
		<link>http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-14043</link>
		<dc:creator>Nemo</dc:creator>
		<pubDate>Fri, 29 Dec 2006 08:38:38 +0000</pubDate>
		<guid isPermaLink="false">http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-14043</guid>
		<description>No kidding!  They're still in action, hitting a server I run for ~900 pages in less than three minutes; I wondered why the load was over 2...

&lt;blockquote&gt;
server seems busy, (you may need to increase StartServers, or Min/MaxSpareServers), spawning 8 children, there are 4 idle, and 103 total children
&lt;/blockquote&gt;

It's like a one-person ./ing...

&lt;i&gt;# apf -d 208.101.36.2&lt;/i&gt;</description>
		<content:encoded><![CDATA[<p>No kidding!  They&#8217;re still in action, hitting a server I run for ~900 pages in less than three minutes; I wondered why the load was over 2&#8230;</p>
<blockquote><p>
server seems busy, (you may need to increase StartServers, or Min/MaxSpareServers), spawning 8 children, there are 4 idle, and 103 total children
</p></blockquote>
<p>It&#8217;s like a one-person ./ing&#8230;</p>
<p><i># apf -d 208.101.36.2</i></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tozier</title>
		<link>http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-3975</link>
		<dc:creator>Tozier</dc:creator>
		<pubDate>Tue, 17 Oct 2006 02:39:35 +0000</pubDate>
		<guid isPermaLink="false">http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-3975</guid>
		<description>I'm not sure what an appropriate interval is, but in this case the pair.com admin told me there were 315 concurrent blog processes running, since every time a link is clicked one launches, and so many were launched in such a short time that one didn't finish before the next started.</description>
		<content:encoded><![CDATA[<p>I&#8217;m not sure what an appropriate interval is, but in this case the pair.com admin told me there were 315 concurrent blog processes running, since every time a link is clicked one launches, and so many were launched in such a short time that one didn&#8217;t finish before the next started.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex</title>
		<link>http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-3974</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Tue, 17 Oct 2006 02:06:13 +0000</pubDate>
		<guid isPermaLink="false">http://williamtozier.com/slurry/2006/10/16/on-the-dangers-of-spidering-badly#comment-3974</guid>
		<description>So ... what -is- a good backoff interval [assuming the links you want to follow actually make sense to follow] ? Or, for that matter, why does fast spidering piss off the admin -- because it looks like a DoS attack ?</description>
		<content:encoded><![CDATA[<p>So &#8230; what -is- a good backoff interval [assuming the links you want to follow actually make sense to follow] ? Or, for that matter, why does fast spidering piss off the admin &#8212; because it looks like a DoS attack ?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
