hi brian, i loved your talk on mining wikipedia. i'd like to mine the soft data, is there a smart way of parsing the unstructured stuff? like small, adhoc stuff -ie for pages in [[category: racecar drivers]] find "sponsored by _____" ?
you're cool.
i applied for an internship but didn't hear back. so i suppose i, and the idea on a limb.cheers-

