<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
    <author>
      <name>User Administrator</name>
      <uri>http://www.freebase.com/view/user/user_administrator</uri>
    </author>
    <generator uri="http://www.freebase.com/">Freebase Atom Feed Generator</generator>
    <id>http://www.freebase.com/view/user/zenkat</id>
    <link rel="self" href="http://www.freebase.com/feed/discuss/active/user/zenkat"/>
    <title>zenkat</title>
    <updated>2008-10-06T13:22:44Z</updated>
  <entry>
    <author>
      <name>zenkat</name>
      <uri>http://www.freebase.com/view/user/zenkat</uri>
    </author>
    <content type="html">&lt;p&gt;If you're interested in doing deep parsing of wikipedia, I'd suggest looking at WEX.&amp;nbsp; It's a parsed and XML-formatted version of wikipedia we use for internal processing here at Metaweb.&amp;nbsp; It's freely available here:&lt;/p&gt;&lt;p&gt;http://download.freebase.com/wex/ &lt;/p&gt;&lt;p&gt;It works best with postgres, but it can also be processed by Hadoop or by local clients.&amp;nbsp; Let us know if you have any questions! &lt;/p&gt;&lt;p&gt;Brian &lt;/p&gt;</content>
    <id>http://www.freebase.com/view/guid/9202a8c04000641f800000000903c4bb</id>
    <link rel="alternate" type="text/html" href="http://www.freebase.com/view/guid/9202a8c04000641f800000000903c4bb" title="zenkat: wikipedia mining"/>
    <summary type="html">If you're interested in doing deep parsing of Wikipedia, I'd suggest looking at WEX. It's a...</summary>
    <title>zenkat: wikipedia mining</title>
    <updated>2008-09-02T17:25:21.0012Z</updated>
  </entry>
  <entry>
    <author>
      <name>spencermountain</name>
      <uri>http://www.freebase.com/view/user/spencermountain</uri>
    </author>
    <content type="html">&lt;p&gt;hi brian, i loved your talk on mining wikipedia. i'd like to mine the soft data, is there a smart way of parsing the unstructured stuff? like small,&amp;nbsp;adhoc stuff -ie&amp;nbsp;for pages in [[category: racecar drivers]] find&amp;nbsp;&amp;quot;sponsored by _____&amp;quot;&amp;nbsp;?&lt;/p&gt;&lt;p&gt;you're cool. &lt;/p&gt;&lt;p&gt;i applied for an internship but didn't hear back. so i suppose i, and the idea&amp;nbsp;on a limb.cheers-&lt;/p&gt;</content>
    <id>http://www.freebase.com/view/guid/9202a8c04000641f8000000009032e43</id>
    <link rel="alternate" type="text/html" href="http://www.freebase.com/view/guid/9202a8c04000641f8000000009032e43" title="zenkat: wikipedia mining"/>
    <summary type="html">hi brian, i loved your talk on mining wikipedia. i'd like to mine the soft data, is there a smart...</summary>
    <title>zenkat: wikipedia mining</title>
    <updated>2008-08-30T02:34:41.0017Z</updated>
  </entry>
</feed>