<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Georgi Kobilarov</title>
	<atom:link href="http://blog.georgikobilarov.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.georgikobilarov.com</link>
	<description>Linked Data Integration</description>
	<lastBuildDate>Fri, 20 Aug 2010 13:55:39 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Data Journalism Meetup Berlin, September 1st 2010</title>
		<link>http://blog.georgikobilarov.com/2010/08/data-journalism-meetup-berlin-september-1st-2010/</link>
		<comments>http://blog.georgikobilarov.com/2010/08/data-journalism-meetup-berlin-september-1st-2010/#comments</comments>
		<pubDate>Fri, 20 Aug 2010 13:48:26 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=507</guid>
		<description><![CDATA[After two successful Web of Data meetups in London with 200 guests each, it was time to bring the Web of Data meetup to Berlin. Data Journalism and the new and exciting possibilities that the Web of Data opens up for creators and consumers of news and media online will be the topic of this [...]]]></description>
			<content:encoded><![CDATA[<p>After two successful <a href="http://www.meetup.com/web-of-data/">Web of Data meetups in London</a> with 200 guests each, it was time to bring the <a href="http://www.meetup.com/web-of-data-berlin/">Web of Data meetup to Berlin</a>.</p>
<p>Data Journalism and the new and exciting possibilities that the Web of Data opens up for creators and consumers of news and media online will be the topic of <a href="http://www.meetup.com/Web-of-Data-Berlin/calendar/14460678/">this first meetup in Berlin on September 1st 2010</a>.</p>
<p>We have a brilliant lineup of speakers from media organisations like the BBC, The Guardian, the Deutsche Presse Agentur, the Bertelsmann Foundation, and ZEIT Online coming to Berlin and talking about data journalism and the latest development and projects in this field. Join the Berlin meetup group and <a href="http://www.meetup.com/Web-of-Data-Berlin/calendar/14460678/">sign up for the event now</a>. Thanks to my friends at <a href="http://www.fjordnet.com">Fjord </a>and at the <a href="http://www.okfn.org">Open Knowledge Foundation</a> for their help and support!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/08/data-journalism-meetup-berlin-september-1st-2010/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>When live data matters</title>
		<link>http://blog.georgikobilarov.com/2010/07/when-live-data-matters/</link>
		<comments>http://blog.georgikobilarov.com/2010/07/when-live-data-matters/#comments</comments>
		<pubDate>Tue, 06 Jul 2010 14:40:02 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=475</guid>
		<description><![CDATA[Who is Michael Ballack? Let&#8217;s see what some Linked Data services which use Wikipedia as a data source say about him. According to Wikipedia (see page here): Michael Ballack (born 26 September 1976) is a German footballer, playing for Bundesliga club Bayer Leverkusen and is the first choice captain of the German national football team, [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Who is Michael Ballack?</strong> Let&#8217;s see what some Linked Data services which use Wikipedia as a data source say about him.</p>
<p><span id="more-475"></span></p>
<p>According to <a href="http://en.wikipedia.org">Wikipedia </a>(see page <a href="http://en.wikipedia.org/wiki/Michael_Ballack">here</a>):<br />
Michael Ballack (born 26 September 1976) is a German footballer, playing for Bundesliga club Bayer Leverkusen and is the first choice captain of the German national football team, although he has been temporarily replaced by Philipp Lahm for the 2010 FIFA World Cup after Ballack missed out due to injury.</p>
<p>According to <a href="http://uberblic.org">Uberblic </a>(see his page <a href="http://uberblic.org/resource/9e102da2-935d-4f6f-98c5-d4d4dd506d58#thing">here</a>):<br />
Michael Ballack (born 26 September 1976) is a German footballer, playing for Bundesliga club Bayer Leverkusen and is the first choice captain of the German national football team, although he has been temporarily replaced by Philipp Lahm for the 2010 FIFA World Cup after Ballack missed out due to injury.</p>
<p>According to <a href="http://dbpedia.org">DBpedia </a>(see his page <a href="http://dbpedia.org/resource/Michael_Ballack">here</a>):<br />
Michael Ballack (born 26 September 1976) is a German footballer, who currently plays for Chelsea and is the captain of the German national team.</p>
<p>According to <a href="http://www.freebase.com">Freebase </a>(see his page <a href="http://www.freebase.com/view/en/michael_ballack">here</a>):<br />
Michael Ballack (born 26 September 1976) is a German footballer, who is currently a free agent since his release from Chelsea and is the former captain of the German national football team.</p>
<p>Live data matters, doesn&#8217;t it?</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/07/when-live-data-matters/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Why does the Semantic Web feel so over-engineered?</title>
		<link>http://blog.georgikobilarov.com/2010/05/why-does-the-semantic-web-feel-so-over-engineered/</link>
		<comments>http://blog.georgikobilarov.com/2010/05/why-does-the-semantic-web-feel-so-over-engineered/#comments</comments>
		<pubDate>Tue, 25 May 2010 23:52:47 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=442</guid>
		<description><![CDATA[Following the discussions in the Semantic Web community, to build an ontology for any use case whatsoever appears to be a heavy task . But isn&#8217;t the promise of the Semantic Web that everything is so agile, that a data schema can be changed easily, and that it can evolve over time? Quite often, the term over-engineered comes [...]]]></description>
			<content:encoded><![CDATA[<p>Following the discussions in the Semantic Web community, to build an ontology for any use case whatsoever appears to be a heavy task . But isn&#8217;t the promise of the Semantic Web that everything is so agile, that a data schema can be changed easily, and that it can evolve over time?</p>
<p><span id="more-442"></span>Quite often, the term <em>over-engineered</em> comes to my mind when I look at ontologies. I wonder if that&#8217;s a characteristic of the Semantic Web or of the mainly academic community behind it? For a comparison, look at Facebook&#8217;s Open Graph Protocol: People, Places, Products, and a few properties for each. Good enough, isn&#8217;t it?</p>
<p>Yes, I will loose every single argument over every single ontology I call over-engineered, because there will always be a specific reason to make the ontology as specific as it is. Sure.</p>
<p>And things must have different URIs than the documents about the things. Yes. All right.</p>
<p>Still, over-engineered. How about if we stop worrying about the perfect ontology and instead start curating open linked data properly? Less can be more.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/05/why-does-the-semantic-web-feel-so-over-engineered/feed/</wfw:commentRss>
		<slash:comments>30</slash:comments>
		</item>
		<item>
		<title>The web as a CMS for data</title>
		<link>http://blog.georgikobilarov.com/2010/05/the-web-as-a-cms/</link>
		<comments>http://blog.georgikobilarov.com/2010/05/the-web-as-a-cms/#comments</comments>
		<pubDate>Thu, 20 May 2010 09:58:07 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=418</guid>
		<description><![CDATA[Tom Scott wrote one year ago a great post about using the web as a CMS at the BBC, in which he describes how BBC editors contribute to websites like Musicbrainz and Wikipedia, and pull the content for their own web offerings like BBC Music from these community websites instead of forking off and curating [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://derivadow.com/">Tom Scott</a> wrote one year ago a great post about using <a href="http://derivadow.com/2009/01/13/the-web-as-a-cms/">the web as a CMS</a> at the BBC, in which he describes how BBC editors contribute to websites like Musicbrainz and Wikipedia, and pull the content for their own web offerings like BBC Music from these community websites instead of forking off and curating their own internal music database. I&#8217;m a big fan of this concept, and yet I just realized that I overlooked an important aspect. I was thinking only about <em>the Web</em> instead of <em>the web</em>. Marginal difference? Not at all&#8230;</p>
<p><span id="more-418"></span>The difference I&#8217;d like to use to make my point is the following: <em>the Web</em> is the whole thing, and in the context of the CMS discussion, it is the opposite of the enterprise intranet. The outside world as a whole, from a point of view within the enterprise firewall. On the other hand, <em>the web</em> emphasizes the distributed aspect, and not so much whether it&#8217;s internal or external content.</p>
<p>For many organisations the shift towards using external content (from the Web) in their business processes is a huge shift in mindset, and one that feels scary. Because &#8211; in theory - you can&#8217;t control the outside world as you can control the inside world.  However, the realization is that when it comes to curating data, the editorial power of the community can be so much bigger than your own, so that specialized community websites like Musicbrainz do provide higher quality and more up-to-date data than your own team could. The BBC is probably ahead of the industry, but I believe that this mindset will eventually become common.</p>
<p>What I&#8217;d like to argue is that this editorial power is a result of the specialization of websites, their communities, and tools. The Web as a whole is quite a messy place. But when we talk about the distributed aspect of a content web on the other hand, that is a pool of separate, specialized, connectable and interoperable services and sources. Musicbrainz is not necessarily a good data source because it is on the Web, but it is a good data source because it&#8217;s maintained and curated by a specialized community of music fans.</p>
<p>The distributed published nature of open linked data is vital to the health of the datasets. It means that each dataset is curated by a community that cares <em>specifically</em> about that data. However, this argument of distributed publishing and curation is in my opinion not only valid in the public space, but also for data and content sources within enterprises, which with their many departments and teams form webs of their own. The scientist running an experiment for example is who knows best about the resulting data, so let her curate it. The sales department has the best understanding of their sales data, the product development team of their knowledge base.</p>
<p>In the end this is all similar to the discussion about open data (as in <em>data on the Web</em>) and linked data (as in a <em>technology for data webs</em>). They are complementary, but each of them has its own and independent value proposition.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/05/the-web-as-a-cms/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>With a Web of Data, what would you do?</title>
		<link>http://blog.georgikobilarov.com/2010/04/with-a-web-of-data-what-would-you-do/</link>
		<comments>http://blog.georgikobilarov.com/2010/04/with-a-web-of-data-what-would-you-do/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 11:31:51 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=339</guid>
		<description><![CDATA[There is so much excitement about Linked Data, Open Data, and in general the emerging Web of Data. Consensus appears to be to just get data out on the Web, and hope for amazing things will happen. Well, but what could these amazing things look like? If we had a Web of Data, what would [...]]]></description>
			<content:encoded><![CDATA[<p>There is so much excitement about Linked Data, Open Data, and in general the emerging <a href="http://www.readwriteweb.com/archives/the_state_of_linked_data_in_2010.php">Web of Data</a>. Consensus appears to be to just get data out on the Web, and hope for amazing things will happen. Well, but what could these amazing things look like?</p>
<p><strong>If we had a Web of Data, what would you build?</strong></p>
<p>I&#8217;d like to start a challenge to get people to talk about some more concrete ideas for applications. <span id="more-339"></span>The rules are easy:</p>
<p>1. Assume all the world&#8217;s open data is available to you, and you don&#8217;t have to worry about data formats, schemas, integration, etc.</p>
<p>2. Your idea is concrete enough that you can imagine yourself or someone else actually implementing it. No magic, no artificial intelligence, no algorithms that don&#8217;t exist today.</p>
<p>3. Write about your idea within the next 30 days (before May 8th) and ping this post.</p>
<p>4. If some of the data sources for your app already exist on the Web, name them.</p>
<p>Here&#8217;s my idea: If we had a Web of Data, I would built an application for painless travel planning. It would integrate flight plans, train timetables, bus routes, car rental offers, etc. And the user would be able to just say: I want to go from A to B: Find me the best/cheapest/fastest routes.</p>
<p>Because today travel planning is a pain: often your destination city has no airport, so you have to choose one of multiple airports of nearby cities. And the optimal choice for that depends on train connections from those airports to your destination city. And the choice of a train station depends on bus connections.  And so on. With a Web of Data, an application could do all that combining for me, the same way flight booking sites do that today for just flights.</p>
<p><strong>What is your idea?</strong> Spread the word, and post the idea either in the comments section or write it down on your own blog and ping this post <strong>within the next 30 days</strong>. I&#8217;ll set up a poll to select the best ideas from all submissions.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/04/with-a-web-of-data-what-would-you-do/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Linked Data and Enterprise Data Integration</title>
		<link>http://blog.georgikobilarov.com/2010/03/linked-data-and-enterprise-data-integration/</link>
		<comments>http://blog.georgikobilarov.com/2010/03/linked-data-and-enterprise-data-integration/#comments</comments>
		<pubDate>Wed, 24 Mar 2010 13:21:07 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Integration]]></category>
		<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[enterprise]]></category>
		<category><![CDATA[linkeddata]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=288</guid>
		<description><![CDATA[What can linked data do for the enterprise, can it solve the CIO&#8217;s headaches around data integration problems? That topic comes up more and more often in the linked data community. Where does Linked Data fit into the enterprise? Let&#8217;s explore that by looking at conventional enterprise data integration first. I stumbled upon a blog [...]]]></description>
			<content:encoded><![CDATA[<p>What can <a href="http://en.wikipedia.org/wiki/Linked_Data">linked data</a> do for the enterprise, can it solve the CIO&#8217;s headaches around data integration problems? That topic comes up more and more often in the linked data community. Where does Linked Data fit into the enterprise? Let&#8217;s explore that by looking at conventional enterprise data integration first.</p>
<p><span id="more-288"></span>I stumbled upon a <a href="http://www.ebizq.net/blogs/dataservice/2010/03/data-virtualization-provides-a-single-view-into-the-enterprise.php">blog post</a> describing the challenges of providing a single view of enterprise data sources. That post and the <a href="http://www.ebizq.net/blogs/dataservice/2010/02/good-things-are-usually-expensive-mdm-case-in-point.php">previous</a> of its series describe typical corporate IT: SOAs, SAPs, IBMs, multi-million dollar integration projects, legacy systems all over the place, tons of code for special purpose application integration, etc. The usual ecosystem that has grown over decades, which from the outside might appear like a mess, but can not be replaced or consolidated. Because consolidating all those systems would be unaffordable, and might sometimes be politically difficult to pursue, because for each individual department and applications, things mostly work well enough the way they are. It&#8217;s the cross-application integration (and hence often cross-department) integration that causes headaches.</p>
<p>My friends at the BBC and I <a href="http://www.georgikobilarov.com/publications/2009/eswc2009-bbc-dbpedia.pdf">wrote a paper</a> describing the challenges at the BBC and how Linked Data can help (read chapter one). Enterprise-wide IT system consolidation is not an option. The desired solution is something on top of existing systems that provides integrated views of all data repositories and workflows without the risk of breaking existing applications.</p>
<p>The blog post above describes a case of &#8220;small pieces of information about customers [being] littered throughout the  data center&#8221;, in the billing system, marketing system, CRM system, support system, etc. The suggested solution is data virtualization (or federation): A software that allowes users to define aggregated views over the different pieces of data, and analysts and application developers query those views instead of querying the multiple different data sources directly. The integration layer acts as a middleware and retrieves the information from these disparate systems real time when  requested. Whether or not to virtualize or materialize the integration layer (i.e. whether data from sources is retrieved at query time, or gets replicated into a central repository) depends on the concrete case. Complex data joins are difficult to virtualize, and query execution time is often much worse than in materialization systems. Also, in virtualized views there is the problem of availability: If one data source is unavailable, your queries will not execute, so you&#8217;d want proxies anyway.</p>
<p>The power of these integration layers is that they separate the business logic from the data sources. IT staff can focus on the data they need and don&#8217;t have to deal with the different systems that data is stored. They can start glueing data together instead of glueing systems together in a point-to-point manner.</p>
<p>Now, where does Linked Data fit in? Linked Data enables you to push the integration down to the level of the actual data. Think of it is a network (or web) of all the little pieces of information: One customer in your CRM system links to her doppelgänger in the billing systems.  So the data object about Anna Smith in the customer database links to the according Anna Smith in the billing system. And that Anna Smith links to her doppelgänger in the support system, and so on. Applications can follow these links through the different systems  and that way get all the data about Anna Smith. The beauty of that? The links do not have to stop at your firewall. Your data objects can link to <a href="http://linkeddata.org">data sources on the Web</a> or to your suppliers as well.You can link to whatever other Linked Data source you want, the technical barriers disappear.</p>
<p>Disambiguity is another important aspect. There are probably many Anna Smiths in any large enterprise customer database. With linked data, the objects become unambiguous, like a database IDs. Anna Smith lives in Cambridge? There are unambiguous IDs for the Cambridge in Massachusetts and the Cambridge in England, so Anna can link to the correct one unambiguously. And if you use a link to a linked data source on the web, like <a href="http://www.geonames.org">Geonames</a>, your applications can fetch information about Cambridge from there. Once the disambiguation is done, and links between objects are established, they are available for all your applications to use.</p>
<p>All that makes data integration much more lightweight and agile, and at the same time much more powerful. And your integration layer software can do much more clever things in a more agile way. Is there still a need for that integration layer? Yes, there is. The integration layer becomes the place where the links between data objects get managed, where <a href="http://blockslabpillar.com/2010/03/06/the-importance-of-curation-in-a-metadata-data-driven-information-architecture/">data collections get curated</a>, where it gets defined which data sources and pieces of information to trust for which use  case, where <a href="http://derivadow.com/2010/03/11/some-thoughts-on-moving-beyond-the-resource/">data collections are built</a>, and where the data from all your enterprise and web data sources gets consolidated. Providing the <a href="http://uberblic.org/2010/01/uberblic-release/">single point of access into the web of data</a> that exists in your enterprise and on the Web.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/03/linked-data-and-enterprise-data-integration/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Search is about more than just the answer</title>
		<link>http://blog.georgikobilarov.com/2010/03/search-is-about-more-than-just-the-answer/</link>
		<comments>http://blog.georgikobilarov.com/2010/03/search-is-about-more-than-just-the-answer/#comments</comments>
		<pubDate>Mon, 22 Mar 2010 13:11:44 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Semantic Web]]></category>
		<category><![CDATA[faceted browser]]></category>
		<category><![CDATA[semantic search]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=260</guid>
		<description><![CDATA[In the space of Semantic Search we more often see attempts to build so called &#8220;answer machines&#8221;, like Wolfram Alpha. You specify your query as detailed as possible, and the service gives you the answer directly. I sometimes hear people say that this is the future of search. No, it is not. There are very [...]]]></description>
			<content:encoded><![CDATA[<p>In the space of Semantic Search we more often see attempts to build so called &#8220;answer machines&#8221;, like Wolfram Alpha. You specify your query as detailed as possible, and the service gives you <em>the answer </em>directly. I sometimes hear people say that this is the future of search. No, it is not.</p>
<p><span id="more-260"></span></p>
<p>There are very few problems that can be mapped to a simple query-answer model. For most search tasks, reviewing intermediate results is an important part of the process. We can&#8217;t simply delegate that to the machine.</p>
<p>I call those <em>exploratory search</em> tasks: Your interests (and hence the result) changes while you are reviewing the information presented to you. Ian Dickinson and I <a href="http://www.georgikobilarov.com/publications/2008/Kobilarov-Dickinson-LDOW2008-Humboldt.pdf" target="_blank">wrote a paper</a> when we worked at HP Labs about how faceted browsing of structured data can facilitate this kind of exploratory search through an operation we called <em>pivot</em>ing. David Huynh, author of Exhibit and now with Metaweb, calls that set-based-browsing in his Parallax <a href="http://davidhuynh.net/media/papers/2009/www2009-parallax.pdf" target="_self">paper</a>. Both Humboldt (the browser demonstrating our <em>pivoting operation</em>) as well as Parallax (David&#8217;s set based browser) were more or less just research prototypes, but I think already demonstrated what could be done in this space.</p>
<p>I&#8217;m convinced that the <em>answer machines</em> are not the future, but exploratory interfaces are. Or can you imagine yourself someday saying: &#8220;Computer, where do I want to eat today?&#8221;</p>
<p>One example of a exploratory search: Your inital query is to find an offer for a particular music album to buy. While searching, you discover that the band played a concert in a venue near to you that you didn&#8217;t know yet. You explore the venue and the upcoming events there, and you discover a concert of a different band. You listen to samples of their music, and then decide to buy one of their albums. So in the end, you bought a music album. But a different one than you planned to&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/03/search-is-about-more-than-just-the-answer/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>uberblic.org launched</title>
		<link>http://blog.georgikobilarov.com/2010/01/uberblic-org-release/</link>
		<comments>http://blog.georgikobilarov.com/2010/01/uberblic-org-release/#comments</comments>
		<pubDate>Thu, 28 Jan 2010 12:51:38 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[Uberblic]]></category>
		<category><![CDATA[linkeddata]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=177</guid>
		<description><![CDATA[In early 2009 I had the idea for a linked data integration platform, capable of importing and mapping all of the publicly available data sets in the Linked Data cloud into a single repository, represented in a coherent ontology, reconciled and accessible through central APIs. What started out as an idea a year ago now [...]]]></description>
			<content:encoded><![CDATA[<p>In early 2009 I had the idea for a linked data integration platform, capable of importing and mapping all of the publicly available data sets in the Linked Data cloud into a single repository, represented in a coherent ontology, reconciled and accessible through central APIs.</p>
<p>What started out as an idea a year ago now became reality. Today, we have released <a href="http://uberblic.org">uberblic.org</a>, a service for integrating the web of data.</p>
<p><span id="more-177"></span></p>
<p>The <a href="http://uberblic.org/2010/01/uberblic-release/">project announcement</a> has a summary and screencast of what Uberblic is and how it works.</p>
<p>I&#8217;m very excited about today&#8217;s release! Many thanks to my friends at <a href="http://www.talis.com">Talis</a> for their help and support!</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="425" height="344" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://www.youtube.com/v/JctenmbFevk&amp;hl=de_DE&amp;fs=1" /><param name="allowfullscreen" value="true" /><embed type="application/x-shockwave-flash" width="425" height="344" src="http://www.youtube.com/v/JctenmbFevk&amp;hl=de_DE&amp;fs=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/01/uberblic-org-release/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The harm of &#8220;non-commercial&#8221;</title>
		<link>http://blog.georgikobilarov.com/2010/01/the-harm-of-non-commercial/</link>
		<comments>http://blog.georgikobilarov.com/2010/01/the-harm-of-non-commercial/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 14:32:50 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=163</guid>
		<description><![CDATA[Many data sources on the web are licensed under non-commercial licenses. There are understandable reasons for data publishers to choose non-commercial licensing: Their data might have been funded with tax money, or they license the content from other content creators under contracts that only allow non-commercial republishing. The intention is that the data should be [...]]]></description>
			<content:encoded><![CDATA[<p>Many data sources on the web are licensed under non-commercial licenses. There are understandable reasons for data publishers to choose non-commercial licensing: Their data might have been funded with tax money, or they license the content from other content creators under contracts that only allow non-commercial republishing.<br />
<span id="more-163"></span><br />
The intention is that the data should be available to anyone who doesn&#8217;t earn any income by using the data. </p>
<p>But there&#8217;s a huge problem: Not earning any income isn&#8217;t equivalent to non-commercial usage. A consultation done by Creative Commons has shown that a large share of content publishers consider all services and web sites run by a for-profit organisation commercial usage. Even if those offerings do not generate any income at all. </p>
<p>That creates a large uncertainty for companies and startups: You don&#8217;t know whether a data license actually permits the usage of content for your free service, but instead you need to sign custom agreements with each data publisher, approving that your definition of non-commercial meets their definition. </p>
<p>I doubt that was the intention behind data licenses such as Creative Common Noncommercial&#8230;</p>
<p>Let me cite from CC-BY-NC 3.0: &#8220;You may not exercise any of the rights granted to You in Section 3 above in any manner that is primarily intended for or directed toward commercial advantage or private monetary compensation.&#8221;</p>
<p>Both <em>primarily </em>and <em>commercial advantage</em> are quite vague, and that&#8217;s the main harm:<br />
License terms should not be vague!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2010/01/the-harm-of-non-commercial/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>DBpedia Ontology &#8211; designed to break?</title>
		<link>http://blog.georgikobilarov.com/2009/11/dbpedia-ontology-designed-to-break/</link>
		<comments>http://blog.georgikobilarov.com/2009/11/dbpedia-ontology-designed-to-break/#comments</comments>
		<pubDate>Fri, 06 Nov 2009 17:08:19 +0000</pubDate>
		<dc:creator>Georgi</dc:creator>
				<category><![CDATA[Linked Data]]></category>
		<category><![CDATA[DBpedia]]></category>

		<guid isPermaLink="false">http://blog.georgikobilarov.com/?p=133</guid>
		<description><![CDATA[One of the bigger changes of the upcoming DBpedia 3.4 release is the ontology&#8217;s new URI schema: Property URIs are now partitioned by the property&#8217;s domain. While before it was http://dbpedia.org/ontology/artitect, now it is http://dbpedia.org/ontology/Building/artitect. In the past, there&#8217;s been the statement that http://dbpedia.org/ontology/architect has the rdfs:domain http://dbpedia.org/ontology/Building, now this fact is in addition also [...]]]></description>
			<content:encoded><![CDATA[<p>One of the bigger changes of the upcoming DBpedia 3.4 release is the ontology&#8217;s new URI schema: Property URIs are now partitioned by the property&#8217;s domain. While before it was <code>http://dbpedia.org/ontology/artitect</code>, now it is <code>http://dbpedia.org/ontology/Building/artitect</code>. In the past, there&#8217;s been the statement that <code>http://dbpedia.org/ontology/architect</code> has the <code>rdfs:domain</code> <code>http://dbpedia.org/ontology/Building</code>, now this fact is in addition also coded into the URI.</p>
<p>Looks ok? Maybe on the first sight. But in my opinion, it&#8217;s a big mistake.<br />
<span id="more-133"></span><br />
Let&#8217;s first look at the reasons for that change in the URI schema. It aims to provide a solution for semantically ambiguous properties. For example, the word &#8220;length&#8221; can be used to describe the long dimension of an object, like the length of a bridge. But it can be also used as a synonym for the runtime of a song or movie (like 90 minutes). Now, with only the one URI <code>http://dbpedia.org/ontology/length</code>, it&#8217;s unclear whether the range of that property is measured in metres or minutes (let alone inch, feet, and miles ;) ) So in order to properly represent the two different semantics, we need two different URIs. Consensus so far&#8230;</p>
<p>Now there are to possible solutions: Either you use two different property IDs (such as <code>length </code>and <code>movie_length</code>), or you use two different namespaces. The DBpedia team chose the latter. The problem is that they did the partitioning for every single property, even those unambiguous ones. And since the DBpedia ontology wasn&#8217;t entirely carefully designed upfront, but is instead due to community refinement, that leaves us with URIs that will most probably break in the future.</p>
<p>See for example <code>http://dbpedia.org/ontology/ceo</code>. Its domain is <code>http://dbpedia.org/ontology/SoccerClub</code>, which seems kind of strange, but is due to the way how the ontology was created: The Infobox Soccer Club was probably the only one with a ceo property, so the domain of <code>http://dbpedia.org/ontology/ceo</code> became <code>SoccerClub</code>. Clearly, once the community starts refining the ontology, the domain will be changed to <code>Company </code>or something similar more reasonable. That wouldn&#8217;t be much of deal since we&#8217;d only see a changed statement about the ceo property. But with the new URI schema, the SoccerClub is part of the property&#8217;s URI (<code>http://dbpedia.org/ontology/SoccerClub/ceo</code>). We have a problem&#8230;</p>
<p>If we&#8217;d change the property URI to <code>http://dbpedia.org/ontology/Company/ceo</code> now, all existing queries will break. If we leave the URI as it is, the encoded SoccerClub becomes misleading. It would have been even better to use <code>http://dbpedia.org/ontology/abc123/ceo</code> instead, since that string doesn&#8217;t suppose to have any meaning.</p>
<p>The simple, straightforward solution would have been: use different IDs to disambiguate properties which actually need to be disambiguated. Or to do it like Freebase and partition very carefully by topic&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.georgikobilarov.com/2009/11/dbpedia-ontology-designed-to-break/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
	</channel>
</rss>
