Performing a Bulk Upload » Discuss

Discussions on Performing a Bulk Upload

  1.  

    Citation Management

    1. I would like to upload my EndNote library (see http://www.freebase.com/view/reference_management_software) so that I can support my entries in Freebase via citations. In a perfect world, I would be able to synch (or in a less perfect world, import/export) a Refworks/EndNote/Procite library into Freebase. Is this an existing or planned capability? Most journal clearinghouses provide the option of exporting article metadata to a reference management software. It would be really, really nice if Freebase could accept these export files and reference management libraries.

      1. I agree that this would be an excellent feature. I don't think the publishing domain schemata are quite up to handling this type of data yet, but we are working on improving them now, particularly with better support for scientific and academic journals, papers, etc. Once the schemata are in place, it should be possible for someone to create an application to import bibliographic data from a standard format or formats. I can't promise that Freebase itself will supply this kind of application any time soon (although we are working on some user-facing import tools), but it will certainly be possible through our API.

      2. The EndNote schema can be downloaded here: http://www.endnote.com/support/helpdocs/RSXML.zip

  2.  

    Broken Link

    1. The documentation states:

      1 If you're a programmer, you can attempt to write in bulk using MQL

      However the link provided results in a page "Not Found". I'm sure this is undesirable.

      1. Thanks for reporting. This is now fixed.

      2. http://www.freebase.com/
        -> Data
        -> Data imports
        -> "Perform a bulk upload" or "Doing a large write?"

        still a broken link.

      3. Thanks for pointing this out. We'll look into it.

      4. This is fixed now.

  3.  

    Object Relational Schemas

    1. I'm perfectly happy to simply dump the data as a Perl hash file, with an obvious schema consisting of name, date, URL and state=>value map, and let you import it as you please, but it seems there is a need for three enhancements to the freebase schema system:

      1) A time dimension (yes I know about dated integers) based on something akin to atomic action semantics.

      2) Inheritance

      3) Computed properties (views)

      The first two, in particular, are important as exemplified by Google's Gapminder:

      http://tools.google.com/gapminder/

      The third is valuable since, for example, national data can be computed from State data, and per capita data can be computed from some property divided by the population property of that entity (nation or State or, for that matter, county or city).

  4.  

    Hundreds of US State Properties

    1. I've got hundreds of variables, by US State, at LaboratoryOfTheStates.com that I can dump for you, including the URLs from which the data were derived and the years at which they were sampled.

      What do I do?

      1. This sounds like a lot of interesting and useful data - I'd like to find out more about what you have available. How do you see it being integrated into the current schemas?