Other projects
There exist a few other projects in the space of web data extraction and reconciliation, it can be helpful to understand the differences. This page will compare uberblic.org to some of these other projects.
DBpedia is an academic community project to extract structured information from Wikipedia and interlink it with other Linked Data projects on the web. DBpedia was initially started in early 2007 by researcher of Freie Universität Berlin and Universität Leipzig with support of Openlink Software as a proof-of-concept project to demonstrate Wikipedia infobox extraction. It is widely considered the nucleus of the Linking Open Data movement. Georgi Kobilarov, founder of Uberblic Labs, was also co-founder and lead developer of DBpedia at FU Berlin.
Freebase is a collaboratively maintained knowledge base for structured data. Like DBpedia, Freebase extracts structured data from Wikipedia infoboxes. But Freebase imports data also from a variety of other sources. It enables inline editing of structured data, and used web data sources to initially bootstrap the data repository.
Comparison
Wikipedia extraction: All three projects extract data from Wikipedia infoboxes, DBpedia uses Wikipedia as its sole data source, Freebase and Uberblic use Wikipedia as one of their data sources. DBpedia data is generated every 4-6 months from Wikipedia exports. Freebase imports data from Wikipedia every 2-4 weeks. Uberblic imports Wikipedia changes every 5-10 seconds.
Other data sources: Uberblic and Freebase import several public data sources from the web. DBpedia is a RDF representation of only Wikipedia. All three projects are part of the Linking Open Data project, and interlinked with other RDF data set in the Linked Data cloud. DBpedia can as pure Linked Data project only be easily used in combination with other RDF data sets, while Uberblic and Freebase import and use data that exists in diverse data formats.
Software platform: DBpedia uses an open data extraction platform which is maintained by researchers of Freie Universität Berlin and Universität Leipzig. Freebase runs on a closed source platform developed by Metaweb Technologies. Uberblic.org runs on a closed source platform developed by Uberblic Labs.
Data dumps: All three projects provide data dumps of their repositories. DBpedia and Uberblic provide RDF dumps, Freebase provide dumps in tab-separated value (tsv) format.
URIs: Freebase and Uberblic use persistent identifiers for entities in their repository by assigning opaque IDs to entities. DBpedia reuses Wikipedia article names as IDs for entities which makes URI more readable but non-persistent due to Wikipedia article movement.
Data editing: Freebase is user-editable and contributions can be made through a public interface. DBpedia and Uberblic are not user-editable, but instead push the editing to the original data sources.
Data provenance: DBpedia is meant to be a RDF representation of Wikipedia. For each data set release, DBpedia links to the snapshot of Wikipedia from which facts where extracted. Freebase is a user-edited repository for structured data which uses other data sources to populate facts into its own repository. Individual facts (or statements) in Freebase are not linked to their sources, each statement might have been edited by a user. Uberblic is a consolidated view over multiple data sources. Each statement is linked to its original source(s). Users decide individually which data sources to trust and can to exclude untrusted sources accordingly from the consolidated view.
