<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Sindice @ 20+ Millions and Openings</title>
	<atom:link href="http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Thu, 12 Jan 2012 17:51:59 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Traduction : How to Publish Linked Data on the Web? (10/10) &#171; Blogabriel</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-967</link>
		<dc:creator>Traduction : How to Publish Linked Data on the Web? (10/10) &#171; Blogabriel</dc:creator>
		<pubDate>Wed, 10 Aug 2011 09:00:56 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-967</guid>
		<description>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</description>
		<content:encoded><![CDATA[<p>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Blogabriel &#187; Traduction française : How to Publish Linked Data on the Web?</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-627</link>
		<dc:creator>Blogabriel &#187; Traduction française : How to Publish Linked Data on the Web?</dc:creator>
		<pubDate>Tue, 22 Jun 2010 23:49:31 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-627</guid>
		<description>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</description>
		<content:encoded><![CDATA[<p>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Blogabriel &#187; Traduction : How to Publish Linked Data on the Web? (10/10)</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-584</link>
		<dc:creator>Blogabriel &#187; Traduction : How to Publish Linked Data on the Web? (10/10)</dc:creator>
		<pubDate>Thu, 18 Mar 2010 11:33:46 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-584</guid>
		<description>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</description>
		<content:encoded><![CDATA[<p>[...] developed by DERI Ireland, currently indexes over 20 million  RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeremy Flowers</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-12</link>
		<dc:creator>Jeremy Flowers</dc:creator>
		<pubDate>Wed, 15 Oct 2008 18:04:40 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-12</guid>
		<description>Saw a thread on LinkedIn with item posted by David Peterson that mentioned your site under Semantic Web group. I&#039;&#039;ve been developing a growing interest in this type of stuff. Web Crawlers Bots, Data extraction etc.
Would be keen to know what reading material you&#039;ve found the best?
I&#039;ve been thinking of writing a search engine to search for IT jobs with real employers (not the fake ones agencies post on job boards) and bringing back results in geographic radius.
I&#039;ve recently been reading Lucene in Action, HTTP Programming Bots in Java (Jeff Heaton).  Web Content Mining with Java (Tony Loton). He had good way of creating a string representing DOM structure then using wildcards to extract rows of data, so you ended up with an SQL like tool to get data out of tables..
I&#039;ve also got the Collective Intelligence in Action on order.. Due out any day..
Saw your not about candidates above. This kind of stuff does appeal to me. But I think I&#039;d have some catching up to do to get to the level you folks are at.

PS:Was looking at WebMonkey today too. Thinking I need to understand Microformats/RDF better too. A lot to learn! But I&#039;m up for it!</description>
		<content:encoded><![CDATA[<p>Saw a thread on LinkedIn with item posted by David Peterson that mentioned your site under Semantic Web group. I&#8221;ve been developing a growing interest in this type of stuff. Web Crawlers Bots, Data extraction etc.<br />
Would be keen to know what reading material you&#8217;ve found the best?<br />
I&#8217;ve been thinking of writing a search engine to search for IT jobs with real employers (not the fake ones agencies post on job boards) and bringing back results in geographic radius.<br />
I&#8217;ve recently been reading Lucene in Action, HTTP Programming Bots in Java (Jeff Heaton).  Web Content Mining with Java (Tony Loton). He had good way of creating a string representing DOM structure then using wildcards to extract rows of data, so you ended up with an SQL like tool to get data out of tables..<br />
I&#8217;ve also got the Collective Intelligence in Action on order.. Due out any day..<br />
Saw your not about candidates above. This kind of stuff does appeal to me. But I think I&#8217;d have some catching up to do to get to the level you folks are at.</p>
<p>PS:Was looking at WebMonkey today too. Thinking I need to understand Microformats/RDF better too. A lot to learn! But I&#8217;m up for it!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Vinay Kumar</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-14</link>
		<dc:creator>Vinay Kumar</dc:creator>
		<pubDate>Wed, 11 Jun 2008 10:39:11 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-14</guid>
		<description>Im a student of india, i love to work here as an intern. How to apply?</description>
		<content:encoded><![CDATA[<p>Im a student of india, i love to work here as an intern. How to apply?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Shantanu</title>
		<link>http://blog.sindice.com/2007/11/22/sindice-20-millions-and-openings/comment-page-1/#comment-10</link>
		<dc:creator>Shantanu</dc:creator>
		<pubDate>Tue, 04 Mar 2008 10:45:57 +0000</pubDate>
		<guid isPermaLink="false">http://blog.sindice.com/?p=5#comment-10</guid>
		<description>The work is really interesting. One would have worked on this project even without the salary</description>
		<content:encoded><![CDATA[<p>The work is really interesting. One would have worked on this project even without the salary</p>
]]></content:encoded>
	</item>
</channel>
</rss>

