<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ontogenesis</title>
	<atom:link href="http://ontogenesis.knowledgeblog.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://ontogenesis.knowledgeblog.org</link>
	<description>An Ontology Tutorial</description>
	<lastBuildDate>Wed, 24 Feb 2010 10:26:42 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Review of What is an ontology?</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/02/24/review-of-what-is-an-ontology-2/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/02/24/review-of-what-is-an-ontology-2/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 10:16:55 +0000</pubDate>
		<dc:creator>dosumis</dc:creator>
				<category><![CDATA[Peer Review]]></category>
		<category><![CDATA[Barry Smith]]></category>
		<category><![CDATA[concept]]></category>
		<category><![CDATA[realism]]></category>
		<category><![CDATA[universal]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=685</guid>
		<description><![CDATA[This is a review of What is an ontology? by Robert Stevens, Alan Rector and Duncan Hull
This article could be split quite neatly in two articles.  One is an excellent article that begins about a third of the way through the full piece. It covers the technical aspects of ontology building: subsumption hierarchies; necessary [...]]]></description>
			<content:encoded><![CDATA[<p>This is a review of <a href="http://ontogenesis.knowledgeblog.org/2010/01/22/what/">What is an ontology?</a> by Robert Stevens, Alan Rector and Duncan Hull</p>
<p>This article could be split quite neatly in two articles.  One is an excellent article that begins about a third of the way through the full piece. It covers the technical aspects of ontology building: subsumption hierarchies; necessary vs necessary and sufficient conditions for class membership; disjointness; relations; upper ontologies and their usefulness in restricting the choice of appropriate relations.  It draws heavily on upper ontologies developed by philosophers (at least some of them realists) and shows why they are useful.  It concludes with a clear and strong case for why good ontologies are needed in the biosciences.  I have no argument with this article.</p>
<p>The other is, to me at least, a rather confusing attempt to argue that ontologies consist of concepts, as opposed to statements about reality.  I find these arguments difficult to square with some of the statements made in the rest of the article.  However, I&#8217;m also not convinced that there is much difference between the author&#8217;s position and a realist stance. There argument hinges on the subtle issue of the reality of classes and they don&#8217;t make other arguments commonly made against a realist stance &#8211; for example, the &#8216;argument from intellectual modesty&#8217; (Smith et al., 2006), or the belief that ontology terms should simply follow the use of terms in language.  In fact, they clearly argue for ontology as a means to overcome the latter:</p>
<blockquote>
<p>&#8220;Ontology should be distinguished from thesauri&#8230;&#8221;</p>
<p>&#8220;Human beings can give multiple labels to &#8230; categories. This habit of giving multiple labels to the same category and the same label to different categories (polysemy) leads to grave problems&#8230;&#8221;</p>
</blockquote>
<p>Their argument begins with what strikes me as a cheap rhetorical trick designed to close down debate:</p>
<blockquote>
<p>&#8220;The definition here will not suit a lot of people and upset many (especially use of the word &#8220;concept&#8221;); We make no apology for this situation, only noting that the argument can take up resources better used in helping biologists describe and use their data more effectively.&#8221;</p>
</blockquote>
<p>If the authors think this discussion is a waste of resources, then why bother spending a few paragraphs making their case?  I suspect that they do actually care about the argument because they worry about the implications of taking a realist stance.   If so, it would have been interesting to hear some of those concerns (on the realist status of maths for example) made more explicit.</p>
<p>There is also a notable lack of reference to any sources of opposing argument. For those who wish to pursue this argument further, waste of time though it might be, some references to counter arguments would be good.  Either of these references (or both) would do nicely:</p>
<p><a href="http://ontology.buffalo.edu/bfo/Terminology_for_Ontologies.pdf">Smith, et al., 2006. Towards a reference terminology for ontology research and development in the biomedical domain. Proceedings of KR-MED 2006</a></p>
<p><a href="http://ontology.buffalo.edu/bfo/BeyondConcepts.pdf">Smith, 2004. Beyond Concepts:  Ontology as Reality Representation. Proceedings of FOIS 2004</a></p>
<p>Of the arguments against a realist stance, the weakest uses a straw man: </p>
<blockquote>
<p>&#8220;&#8230; with a computer science ontology &#8230; there is less concern with a true account of reality as it is information that is being processed, not reality.&#8221;</p>
</blockquote>
<p>Who could argue?  Surely the question is whether the information being processed is making assertions about reality or not?  The authors case would be stronger if this line were deleted.</p>
<p>The heart of their argument is stated here:</p>
<blockquote>
<p>&#8220;As human beings, we put these objects into categories or classes. These categories are a description of that which is described in a body of data. The categories themselves are a human conception. We live in a world of objects, but the categories into which humans put them are merely a way of describing the world; they do not themselves exist.&#8221;</p>
</blockquote>
<p>A perhaps pedantic point: are classes &#8220;described in a body of data&#8221;? I would have thought it more likely they are assertions about reality that are a reasonable scientific interpretation of a body of data.  This confusion of data and its interpretation as assertions occurs consistently throughout the article.</p>
<p>More importantly, what might it mean to state that a class is real?  Even the authors seem to agree that there is regularity in the universe, whether we observe it or not. For example, later in the article, they state that:</p>
<blockquote>
<p>&#8220;Each instance of a &#8216;Helium&#8217; object was not discovered in 1903; most helium atoms existed prior to that date, but humans discovered and labelled that category at that date.&#8221;</p>
</blockquote>
<p>In 1903, humans discovered something that already <em>existed</em>: a <em>class</em> of atoms that share specific properties.  Surely this means that a definition of the class &#8216;helium atom&#8217; is making assertions about reality. Is this not different from some arbitrary class defined as including say: all helium atoms, horses, unicorns and two bedroom flats in North London?</p>
<p>This is not to say that there is only one true way to categorise any one object, or that there is a clean dividing line between classes we might be happy to define as Universals (Smith ), such as Helium atoms, and more contingent classes.</p>
<p>In the abstract, the authors argue that the debate over a realist vs a conceptual stance  is a distraction that &#8220;&#8230; can take up resources better used in helping biologists describe and use their data more effectively.&#8221;  Why might a realist stance be useful in helping scientists?</p>
<p>I believe that a realist stance is useful for ontologies made up of scientific assertions (for example, about chemistry, anatomy, or physiology), because it gives us a way to judge the quality of an ontology. If the ontology makes assertions that run counter to what we have good reason to believe is true, then it is misleading as a knowledge-base about science and its use in inference and grouping of annotations will produce results that we have good reason to believe are incorrect.  Surely such an ontology would be bad &#8211; even when judged purely in practical terms.</p>
<p>Having said all this, I&#8217;m happy for this article to pass review for the Ontogenesis Knowledge Blog as long as the authors add references to opposing arguments.  The authors may wish to consider taking into account my points with regard to the abstract and the apparent use of a straw-man argument. The article already provides an excellent introduction to the basic technical aspects of ontology building. With the addition of references to opposing arguments, the article and this review should provide a good starting point for those interested in exploring the realism vs conceptual(ism?) debate further.</p>
<a name="wptoc_0_0_0"></a><h3>Minor corrections:</h3>
<p>foundary -&gt; foundry</p>
<p>polysemy &#8211; missing initial bracket</p>
<a name="wptoc_0_0_1"></a><h3>License</h3>
<p>This paper is an open access work distributed under the terms of the Creative Commons Attribution License 3.0, which permits unrestricted use, distribution, and reproduction in any medium, provided that the original author and source are attributed.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/02/24/review-of-what-is-an-ontology-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Back to Books: Researchers should be recognized for writing books to convey and develop science</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/02/04/back-to-books/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/02/04/back-to-books/#comments</comments>
		<pubDate>Thu, 04 Feb 2010 10:13:32 +0000</pubDate>
		<dc:creator>Duncan Hull</dc:creator>
				<category><![CDATA[Meta]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[editorial]]></category>
		<category><![CDATA[nature]]></category>
		<category><![CDATA[publishing]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=680</guid>
		<description><![CDATA[There is an interesting editorial on books [1] today in Nature, related to Ontogenesis and books.
&#8220;Back to books: Researchers should be recognized for writing books to convey and develop science.&#8221;
References

Nature, Vol. 463, No. 7281. (03 February 2010), pp. 588-588. DOI:10.1038/463588a

]]></description>
			<content:encoded><![CDATA[<p>There is an interesting editorial on books [1] today in <em>Nature</em>, related to Ontogenesis and books.</p>
<p>&#8220;Back to books: Researchers should be recognized for writing books to convey and develop science.&#8221;</p>
<a name="wptoc_0_0_0"></a><h3>References</h3>
<ol>
<li>Nature, Vol. 463, No. 7281. (03 February 2010), pp. 588-588. <a href="http://dx.doi.org/10.1038/463588a">DOI:10.1038/463588a</a></li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/02/04/back-to-books/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Registration Required</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/02/01/registration-required/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/02/01/registration-required/#comments</comments>
		<pubDate>Mon, 01 Feb 2010 12:05:31 +0000</pubDate>
		<dc:creator>phillord</dc:creator>
				<category><![CDATA[Meta]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=666</guid>
		<description><![CDATA[Registration is now required for commenting due to the inevitable spam which has followed the first meeting. Unfortunately, this is not a &#8220;personal&#8221; blog, so I can&#8217;t use akismet without a license key. At the moment, the cost of this is prohibitive.
Unfortunate, but not surprising. Pingbacks still work, so commenting without registration can still happen [...]]]></description>
			<content:encoded><![CDATA[<p>Registration is now required for commenting due to the inevitable spam which has followed the first meeting. Unfortunately, this is not a &#8220;personal&#8221; blog, so I can&#8217;t use akismet without a license key. At the moment, the cost of this is prohibitive.</p>
<p>Unfortunate, but not surprising. Pingbacks still work, so commenting without registration can still happen this way.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/02/01/registration-required/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Stats now available</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/27/stats-now-available/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/27/stats-now-available/#comments</comments>
		<pubDate>Wed, 27 Jan 2010 15:48:54 +0000</pubDate>
		<dc:creator>phillord</dc:creator>
				<category><![CDATA[Meta]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=664</guid>
		<description><![CDATA[I&#8217;ve installed the stats plugin now, so that we can trace the posts and pages that are being widely viewed. The graph was initiallly broken after I made the mistake of following the installation instructions; it should be working now.
The hit count is still at around 3000 hits per day, which works out at around [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve installed the stats plugin now, so that we can trace the posts and pages that are being widely viewed. The graph was initiallly broken after I made the mistake of following the installation instructions; it should be working now.</p>
<p>The hit count is still at around 3000 hits per day, which works out at around 150 article views a day.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/27/stats-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ontogenesis hits the Blogosphere</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/25/ontogenesis-hits-the-blogosphere/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/25/ontogenesis-hits-the-blogosphere/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 19:13:15 +0000</pubDate>
		<dc:creator>phillord</dc:creator>
				<category><![CDATA[Meta]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=659</guid>
		<description><![CDATA[I&#8217;m please to note that Ontogenesis and its articles have hit the blogosphere already. One post the day after the meeting or one day after the first, peer-reviewed post went life. The second was from Doug Kell of BBSRC. A nice demonstration of the speed of this form of scientific publication.
Apologies to those who noticed [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m please to note that Ontogenesis and its articles have hit the blogosphere already. One <a href="http://ontogoo.blogspot.com/2010/01/what-is-ontology-ontogenesis.html">post</a> the day after the meeting or one day after the first, peer-reviewed post went life. The second was from <a href="http://blogs.bbsrc.ac.uk/index.php/2010/01/metabolomics-food-security-blogging-book/">Doug Kell</a> of <a href="http://www.bbsrc.ac.uk">BBSRC</a>. A nice demonstration of the speed of this form of scientific publication.</p>
<p>Apologies to those who noticed the outage from the server this afternoon. Naturally, I chose the period of initial internet exposure to fiddle with the server and, so, took apache down for the duration. Possibly not the greatest decision ever; sometimes I amaze even myself. There will be a second outage later in the week for further work.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/25/ontogenesis-hits-the-blogosphere/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Reflections on Blogging a Book</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/25/reflections-on-blogging-a-book/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/25/reflections-on-blogging-a-book/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 10:49:10 +0000</pubDate>
		<dc:creator>Sean Bechhofer</dc:creator>
				<category><![CDATA[Meta]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=647</guid>
		<description><![CDATA[


We&#8217;ve just had an interesting couple of days at the Ontogenesis Blogging a Book Meeting. I found myself adopting the position of naysayer on a few occasions (that&#8217;s usually Phil&#8217;s job, but he was running the meeting), raising questions about the process and technologies being applied. This post is an attempt to reflect on the [...]]]></description>
			<content:encoded><![CDATA[<div id="preamble">
<div class="sectionbody">
<div class="paragraph">
<p>We&#8217;ve just had an interesting couple of days at the Ontogenesis <em>Blogging a Book Meeting</em>. I found myself adopting the position of <em>naysayer</em> on a few occasions (that&#8217;s usually Phil&#8217;s job, but he was running the meeting), raising questions about the process and technologies being applied. This post is an attempt to reflect on the meeting and try and identify why I was uncomfortable with the exercise, and what one might do to address the situation.</p>
</div>
<div class="paragraph">
<p>I should first make it clear that I agree with the overall aims of the exercise (see below). In addition, although this commentary is perhaps negative in a number of ways, it is intended to be <em>constructive</em> criticism. This was a very interesting and thought-provoking couple of days. Many thanks to Robert Stevens, George Moulton and Phil Lord for organising the meeting and providing the initial stimulus. Now, out with the knives!</p>
</div>
<div class="paragraph">
<p>Note that the opinions expressed here are mine and may not represent the views of others involved in either the ontogenesis network or this particular meeting. This may also be a slightly half-baked rendering of my thoughts and may be subject to review!</p>
</div>
</div>
</div>
<a name="wptoc_0_0_0"></a><h2 id="_are_we_nearly_there_yet">Are we nearly there yet?</h2>
<div class="sectionbody">
<div class="paragraph">
<p>What were we trying to achieve? I think the meeting had two purposes.</p>
</div>
<div class="olist upperalpha">
<ol class="upperalpha">
<li>
<p>
Writing a number of short &#8220;encyclopedia style&#8221; articles relating to ontologies (and their use in bioinformatics).
</p>
</li>
<li>
<p>
Investigating new models for the publication process, in particular the use of a blog in order to manage the review process.
</p>
</li>
</ol>
</div>
<div class="paragraph">
<p>The former was to be realised through the latter. The rationale for A is clear, for B, the intention is to try and reduce some of the overhead and time delay that can be present when using traditional publishing routes. However, in my final analysis I think the difference between the kinds of short article for an encyclopedia and longer scientific papers means that the process hampered us somewhat in the production of our initial articles (&#8220;why didn&#8217;t we just use a wiki&#8221;). This is not to say that the blogging approach is not appropriate as a mechanism to support the publication process (in fact I think it might work fine if tweaked), but the jury&#8217;s still out.</p>
</div>
</div>
<a name="wptoc_0_0_1"></a><h2 id="_process">Process</h2>
<div class="sectionbody">
<div class="paragraph">
<p>The process for the meeting was roughly as follows. A number of topics for entries were identified. People selected topics that they wanted to write an entry for (in some cases this may involve multiple authors). Entries were then written as a blog entry. Once the author considered the entry ready for review, it was tagged appropriately. Reviews were also written as blog post. Through the use of the Wordpress&#8217;s trackback (or was it pingback?) mechanism, by including a link to the original post in the review, the review appears as a comment in the original post.</p>
</div>
<div class="paragraph">
<p>Categories were used to indicate the status of articles (under review, reviewed, peer review), while tagging indicated content.</p>
</div>
<div class="paragraph">
<p>The idea is that through the use of the commenting mechanism, we can preserve a trail of comments and reviews, which not only provide an insight into the evolution of the article, but also provide some attribution and credit for the work of the reviewers. Note that this assumes that the process is <em>open</em>, with reviewer&#8217;s comments and identity visible. This is a great idea, but the meeting showed that there are some issues with the actual delivery of this using the technology.</p>
</div>
</div>
<a name="wptoc_0_0_2"></a><h2 id="_short_or_long">Short or Long?</h2>
<div class="sectionbody">
<div class="paragraph">
<p>The meeting was focused on writing short, collaborative articles with a quick turnaround (we hoped to produce a number of initial articles by the end of the meeting). However, such writing is very different from the lengthy scholarly articles that one might expect to find in a journal. Encyclopedia articles tend to be (or at least should be, imo) objective and factual rather than opinion or subjective interpretation. The kinds of review required for the activities are different. For a short article with rapid turnaround, I would like to get quick feedback about whether there are significant pieces missing, or whether content is on- or off-topic. The fact that we were all co-located also meant that I wanted/expected quick feedback&#8201;&#8212;&#8201;&#8221;shouts across the room&#8221;. A review of a full paper would be lengthier and in-depth, and could also required more detailed referencing of specific sections of the original article. I&#8217;d also be happy to wait longer.</p>
</div>
</div>
<a name="wptoc_0_0_3"></a><h2 id="_wiki_or_blog">Wiki or Blog?</h2>
<div class="sectionbody">
<div class="paragraph">
<p>A number of times during the meeting, the question &#8220;why aren&#8217;t we doing this through a <del datetime="2010-01-25T12:05:39+00:00">blog</del>wiki?&#8221; was asked. Valid question, and one which I&#8217;m not sure there really was a good answer for. The reality is that what we were trying to do falls somewhere between what&#8217;s offered by a blog and a wiki. A wiki may well have been a better environment for supporting the collaborative writing and commenting process for the encyclopedia.</p>
</div>
</div>
<a name="wptoc_0_0_4"></a><h2 id="_communication">Communication</h2>
<div class="sectionbody">
<div class="paragraph">
<p>Authors and reviewers needed to communicate. For example, the process as defined required authors to request reviews for articles. This was done &#8220;out of band&#8221; (e.g. without using the mechanisms of the blog). In practice, this was done by email, or simply by direct communication (as all the participants were actually co-located for the two days).</p>
</div>
<div class="paragraph">
<p>There were occasionally suggestions that reviewers/authors could communicate directly, e.g. in order to exchange information on minor typographical errors. In this case, distinguishing what kinds of communication should occur in band and out of band is important, particularly if records of the communication are intended to be part of the process&#8201;&#8212;&#8201;at what point do grammatical changes become substantial changes to the intention of an article?</p>
</div>
</div>
<a name="wptoc_0_0_5"></a><h2 id="_technologies">Technologies</h2>
<div class="sectionbody">
<div class="paragraph">
<p>As an aside, I found the Wordpress UI a nightmare to work with. I don&#8217;t really like browser based editors, so prefer to author text using a text editor, and then cut&#8217;n&#8217;paste into the web tool. Once I&#8217;d pasted text into the text box, however, I found it messed with my underling HTML markup (adding lots of <tt>&lt;br/&gt;</tt> elements and stripping all my <tt>&lt;p/&gt;</tt>&#8217;s out). Tables seemed problematic too, although that may be something to do with the underlying style.</p>
</div>
</div>
<a name="wptoc_0_0_6"></a><h2 id="_conclusions">Conclusions</h2>
<div class="sectionbody">
<div class="paragraph">
<p>In order to prevent this from just being a pointless rant, I would like to conclude with some suggestions/observations. This was a useful activity, but I would probably approach it in a different way if I was to repeat it. Below are a number of questions/points that I think need further investigation. Some of these are technology related (e.g. Wordpress doesn&#8217;t do what&#8217;s needed), some are more about identifying the process and requirements.</p>
</div>
<div class="olist arabic">
<ol class="arabic">
<li>
<p>
A clearer identification of the process. What are the different steps/phases that an article will go through? This is particularly important, as I think the processes involved in writing the encyclopedia are different to those that would be in place for &#8220;journal style&#8221; reviewing.
</p>
</li>
<li>
<p>
Versioning. What is the versioning strategy that is used? Are articles edited &#8220;in place&#8221;, or should edits result in a new article? In which case, should all edits result in new articles, or can we fix typos? Who then decides what edits are &#8220;acceptable&#8221;?
</p>
</li>
<li>
<p>
What&#8217;s the role of the <em>editor</em> (if any)? Do we need a central controller?
</p>
</li>
<li>
<p>
Communication between authors/reviewers/editors. Mechanisms are needed that allow communication between the various actors. What is in-band and out-of-band? How much did/does the physical co-location of the participants impact on the process?
</p>
</li>
<li>
<p>
The ability to deal with different kinds of information in a review. There can be comments about the presentational aspects of the work (e.g. typos, grammatical errors and so on), as well as more substantive comments relating to the content of the work.
</p>
</li>
<li>
<p>
A clearer identification of the activities/tasks that actually required interaction and communication, and how those are managed. A number of things (soliciting reviews for example) were done &#8220;in person&#8221;. Clearly this would not be possible if participants hadn&#8217;t been co-located (although email would also work).
</p>
</li>
</ol>
</div>
<div class="paragraph">
<p>Some of the points above (e.g. 1 and 2) need to be considered independently of the technology being used to deliver them (although it is important to bear in mind what is possible/feasible). There was an occasional tendency to bend what we were trying to do to fit with the Wordpress functionality (e.g. single categories).</p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/25/reflections-on-blogging-a-book/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Review of Automatic Maintenance of Multiple Inheritance Ontologies</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/23/review-of-automatic-maintenance-of-multiple-inheritance-ontologies-2/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/23/review-of-automatic-maintenance-of-multiple-inheritance-ontologies-2/#comments</comments>
		<pubDate>Sat, 23 Jan 2010 14:13:50 +0000</pubDate>
		<dc:creator>phillord</dc:creator>
				<category><![CDATA[Peer Review]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=630</guid>
		<description><![CDATA[This is a review article

Comments
This is a clear, precise and well-written article describing the topic. It covers the motivation, process and technology of this approach for the development of a polyhierarchy while being clear that the axiomatisation necessary can have upfront costs. The article also makes clear use of figures demonstrating the use, as well [...]]]></description>
			<content:encoded><![CDATA[<p>This is a <a href="http://ontogenesis.knowledgeblog.org/2010/01/21/automatic-maintenance-of-multiple-inheritance-ontologies/">review article</a></p>
<hr />
<a name="wptoc_0_0_0"></a><h2><a name="_comments"></a>Comments</h2>
<p>This is a clear, precise and well-written article describing the topic. It covers the motivation, process and technology of this approach for the development of a polyhierarchy while being clear that the axiomatisation necessary can have upfront costs. The article also makes clear use of figures demonstrating the use, as well as examples clarifying the more technical statements.</p>
<p>The latter half of the article addresses the issue of normalisation which is a development methodology that enables automatic maintenance of multiple inheritance ontologies. Well this is appropriate material for the article, it is not strictly necessary to completely normalize an ontology to use reasoning. The conclusions, in particular, could be improved in structure by saying that automatic maintaince can be supported by reasoning, and that normalisation exploits this fully.</p>
<p>Otherwise, an excellent article.</p>
<hr />
<a name="wptoc_0_0_1"></a><h2><a name="_minor_corrections"></a>Minor Corrections</h2>
<a name="wptoc_1_1_0"></a><h3><a name="_general"></a>General</h3>
<p>The section titles should use HTML headers, rather than the typographic markup used. Authors email should be hyperlinked.</p>
<a name="wptoc_1_1_1"></a><h3><a name="_multiple_inheritance_ontologies"></a>Multiple Inheritance Ontologies</h3>
<p>The effort is considerable, but worthy, as the automated reasoner is able to maintain the whole structure, avoiding human errors. Also, using such expressive axiomisation enables richer queries and other advantages.</p>
<p>Change to</p>
<p>is considerable requiring a richer axiomatisiation but worthwhile as the&#8230;</p>
<p>The more expressive axiomatisation also enables richer&#8230;</p>
<p>para 2 The difficult on maintaining -&#8594; of maintaining</p>
<a name="wptoc_1_1_2"></a><h3><a name="_what_is_normalisation"></a>What is normalisation</h3>
<p>Therefore, adequete and precise -&#8594; However, adequete</p>
<p>OWL provides -&#8594; Languages such as OWL provide</p>
<p>(e.g. part_of some (nucleus and (has_function only photosynthesis))</p>
<p>Would be nice to describe what this means in English.</p>
<p>&#8220;Normalised CL there&#8221; -&#8594; what does CL mean in this context?</p>
<a name="wptoc_1_1_3"></a><h3><a name="_advantages_of_8230"></a>Advantages of &#8230;</h3>
<p>Such modelling dynamic also results in a modular ontology -&#8594; This modelling process</p>
<p>as such relation is the result of both having a common condition -&#8594; as this relation is the result&#8230;.</p>
<a name="wptoc_1_1_4"></a><h3><a name="_conclusions"></a>Conclusions</h3>
<p>long term, a manual maintenance  -&#8594; long term, manual maintenance.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/23/review-of-automatic-maintenance-of-multiple-inheritance-ontologies-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>First Post</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/23/first-post/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/23/first-post/#comments</comments>
		<pubDate>Sat, 23 Jan 2010 14:13:47 +0000</pubDate>
		<dc:creator>phillord</dc:creator>
				<category><![CDATA[Meta]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=629</guid>
		<description><![CDATA[The Ontogenesis knowledgeblog has achieved first post from authoring and through peer review. Congratulations to Allyson Lister for being the first.
]]></description>
			<content:encoded><![CDATA[<p>The Ontogenesis knowledgeblog has achieved <a href="http://ontogenesis.knowledgeblog.org/2010/01/21/semantic-integration-in-the-life-sciences/">first post</a> from authoring and through peer review. Congratulations to Allyson Lister for being the first.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/23/first-post/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Review of Components of an Ontology</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/22/review-of-components-of-an-ontology/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/22/review-of-components-of-an-ontology/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 15:15:22 +0000</pubDate>
		<dc:creator>mikeleganaaranguren</dc:creator>
				<category><![CDATA[Peer Review]]></category>
		<category><![CDATA[classes]]></category>
		<category><![CDATA[instances]]></category>
		<category><![CDATA[Mikel Egana-Aranguren]]></category>
		<category><![CDATA[Phillip Lord]]></category>
		<category><![CDATA[relations]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=550</guid>
		<description><![CDATA[This is a review of the paper entitled Components of an Ontology, by Phil Lord.
This paper describes the three main components that can be found in an ontology (classes, instances, relations). Therefore it should be of interest to any newcomer to ontology development, as the confusion of which entities to use (especially instance vs. class) [...]]]></description>
			<content:encoded><![CDATA[<p>This is a review of the paper entitled <a href="http://ontogenesis.knowledgeblog.org/2010/01/22/components-of-an-ontology/">Components of an Ontology</a>, by Phil Lord.</p>
<p>This paper describes the three main components that can be found in an ontology (classes, instances, relations). Therefore it should be of interest to any newcomer to ontology development, as the confusion of which entities to use (especially instance vs. class) is a major problem when learning how to develop an ontology.</p>
<p>There is a difficult sentence at the end of introduction: &#8220;These components can be separated into two kinds; those that describe the <strong>Entities</strong> of the domain being described, and those which either enable the use of the ontology or describe the ontology itself.&#8221; There is no more references to such distinction in the rest of the paper. Also, it is not clear what are the components that describe the entities of the domain (axioms, individuals, classes?), the components that enable the use of the ontology (editors, APIs?) and the components that describe the ontology (metadata, metamodelling?).</p>
<p>The whole paper, although it describes general ontological components, has an OWL flavour, and the author should be explicit about it.</p>
<p>I would change the example for illustrating existential/universal, to an example where the class and the filler are different entities (not person/person), to make it more understandable.</p>
<p>I recommend accepting this paper.</p>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/22/review-of-components-of-an-ontology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ontology Learning</title>
		<link>http://ontogenesis.knowledgeblog.org/2010/01/22/ontology-learning/</link>
		<comments>http://ontogenesis.knowledgeblog.org/2010/01/22/ontology-learning/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 14:29:57 +0000</pubDate>
		<dc:creator>Christopher Brewster</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Christopher Brewster]]></category>
		<category><![CDATA[Natural Language Processing]]></category>
		<category><![CDATA[Ontology Learning]]></category>
		<category><![CDATA[Simon Jupp]]></category>

		<guid isPermaLink="false">http://ontogenesis.knowledgeblog.org/?p=331</guid>
		<description><![CDATA[		
Introduction
The Need for ontology learning
Ontologies are the fundamental form of knowledge representation in contemporary Artificial Intelligence systems, especially systems of used in the Life Sciences (link to What is an Ontology article). The vast majority of currently used ontologies have been built entirely by hand, including the major of Life Science ontologies such GO and [...]]]></description>
			<content:encoded><![CDATA[<p>		<!-- Processed by MultiMarkdown --></p>
<a name="wptoc_0_0_0"></a><h2 id="introduction">Introduction</h2>
<a name="wptoc_0_1_0"></a><h3 id="theneedforontologylearning">The Need for ontology learning</h3>
<p>Ontologies are the fundamental form of knowledge representation in contemporary Artificial Intelligence systems, especially systems of used in the Life Sciences (link to What is an Ontology article). The vast majority of currently used ontologies have been built entirely by hand, including the major of Life Science ontologies such <a href="http://ontogenesis.knowledgeblog.org/glossary#go">GO</a> and those in OBO Foundry. This manual development process represents a major knowledge acquisition bottleneck as sometimes hundreds of hours of effort have been involved and there are ongoing teams of people in place to keep the ontologies up to date <span class="markdowncitation"> (<a href="#Brewster_BMC09" title="see citation">1</a>)</span>. One consequence of this has been a series of ongoing efforts largely led by members of the Natural Language Processing (NLP) and Text Mining communities to automate, or semi-automate the ontology construction process. Ontology Learning is motivated by a number of factors:</p>
<ul>
<li>The high manual cost of ontology construction</li>
<li>The continuous change in science and knowledge in general</li>
<li>The very large amount of existing text with numbers growing exponentially (PubMed has over 2000 papers added to it per day)</li>
<li>The extensive need for a variety of ontology type structures ranging from the relatively informal such as structured vocabularies, through somewhat more formal taxonomies, to fully rigorous ontologies expressed in <a href="http://ontogenesis.knowledgeblog.org/glossary#owl">OWL</a>.</li>
</ul>
<p>This article will provide a brief introduction to some of the key concepts, approaches and challenges.</p>
<a name="wptoc_0_0_1"></a><h2 id="underlyingassumptions">Underlying Assumptions</h2>
<p>The work undertaken on ontology learning from texts is dependent on a number of core assumptions. All knowledge representation models including ontologies are linguistic artefacts, for two reasons <span class="markdowncitation"> (<a href="#Davis_AIMag1993" title="see citation">2</a>)</span>. Firstly, ontologies function as a means for human beings to interact with machines, even if in a highly structured manner. Secondly, nearly all ontologies use linguistic terms (or quasi-linguistic terms) as labels for concepts. The fundamental assumption made by the NLP community is that there is an identity between terms in an ontology and in a text which look the same. This perception is further re-enforced by the extensive use of ontologies (especially in the Life Sciences) to annotate data including textual data.</p>
<p>On the NLP technology side, two other assumptions need to be identified. The distributional hypothesis is a very important working assumption which states that words which appear in similar contexts must be semantically identical or closely related <span class="markdowncitation"> (<a href="#Harris_Word54" title="see citation">3</a>)</span>, <span class="markdowncitation"> (<a href="#Firth_Papers57" title="see citation">4</a>)</span>. All efforts to organise and cluster terms in texts, and to identify semantic similarity (synonymy) depend on the distributional hypothesis, and the identification of semantic difference or identity. </p>
<p>A further assumption is that texts actually contain the necessary information to derive ontologically relevant definitions. A common ontology learning model involves taking a set of domain specific texts and deriving (as far as possible) a corresponding ontology. The inadequacies which this approach has <span class="markdowncitation"> (<a href="#Brewster_SIGIR03" title="see citation">5</a>)</span> have led to attempts to use multiple knowledge sources for the ontology learning process <span class="markdowncitation"> (<a href="#Cimiano_Learning05" title="see citation">6</a>)</span>.</p>
<a name="wptoc_0_0_2"></a><h2 id="currentapproaches">Current Approaches</h2>
<p>Ontology Learning remains an active area of research with considerable potential for facilitating the work of ontology engineers. Further progress in Natural Language Processing and the integration of multiple sources of knowledge have the promise to substantially reduce the burden of formalisation by the ontology engineer. Concurrently, the outputs of ontology learning systems are of great utility in less formal context wheren taxonomies or formal vocabularies are required.</p>
<a name="wptoc_2_1_0"></a><h3 id="thebasicsofontologylearning">The Basics of Ontology Learning</h3>
<p>At its most simplistic an ontology learning system (or workflow) allows the input of one or more texts and the output of some form of taxonomy. At a more detailed level, we can analyse the ontology learning from text process as follows (adapted from <span class="markdowncitation"> (<a href="#Brewster_Knowledge08" title="see citation">7</a>)</span>):</p>
<ol>
<li><strong>Text Collection</strong>. A corpus of texts has to be identified, collected and preprocessed.</li>
<li><strong>Term recognition or keyword extraction</strong>. Stopwords, named entities and other noise need to be removed and a subset of the vocabulary in the text identified.</li>
<li><strong>Relation Extraction: Synonymy and Clustering</strong>. Terms which <em>do</em> have a relation need to be identified. This step also identifies terms or expressions such as abbreviations which are considered to be synonymous.</li>
<li><strong>Relation Labelling</strong>. Relations which have been identified as existing need to be labelled. (Steps 3 and 4, often occur together).</li>
<li><strong>Hierarchy Construction</strong>. Most ontologies contain a subsumption hierarchies if classes, however, it may also be desirable to extract different types of axioms from the text, including disjointness and equivalence.</li>
</ol>
<p>Each step can be undertaken in a number of different ways, some of which we will briefly consider below.</p>
<p><strong>Step 1 Text Collection:</strong> Text collections or corpora are usually &#8220;convenience corpora&#8221; i.e. based on a convenient collection of texts readily accessible to the researcher. Some authors have selected texts from the web, or extracted abstracts or complete texts from sources such as PubMed using specific key words.</p>
<p><strong>Step 2 Term Recognition and keyword extraction:</strong> There are a number of standard approaches in the literature including <span class="markdowncitation"> (<a href="#Ahmad_Corpus01" title="see citation">8</a>)</span>, <span class="markdowncitation"> (<a href="#Velardi_ACL01" title="see citation">9</a>)</span>, <span class="markdowncitation"> (<a href="#Park_COLING02" title="see citation">10</a>)</span>, <span class="markdowncitation"> (<a href="#Ananiadou_Automatic06" title="see citation">11</a>)</span>, <span class="markdowncitation"> (<a href="#LeMoigno_AMIA02" title="see citation">12</a>)</span>. A comparative evaluation and integration of different methodologies relevant to ontology learning is presented in <span class="markdowncitation"> (<a href="#Zhang_LREC08" title="see citation">13</a>)</span>.</p>
<p><strong>Step 3 Synonymy and Clustering:</strong> There s a large literature on different approaches cf. <span class="markdowncitation"> (<a href="#ManningSchutze99" title="see citation">14</a>)</span> Chap. 14. Classic systems for the creation of synonym sets include Sextant <span class="markdowncitation"> (<a href="#grefenstette_Explorations94" title="see citation">15</a>)</span>, and the work of the speech technology community in language models <span class="markdowncitation"> (<a href="#Brown_CL92" title="see citation">16</a>)</span>.</p>
<p><strong>Step 4 Relation Labelling:</strong> Automated approaches to ontology learning have tended to restrict themselves mostly to learning ISA hierarchies (hyponomy and hyperonomy relations). There exist some limited efforts to learn other types of relations<span class="markdowncitation"> (<a href="#Villaverde_ESA09" title="see citation">17</a>)</span>. There are two standard approaches to relation labelling:</p>
<ul>
<li>String Inclusion: This approach is used with multi-word terms with a structure ABC, the string inclusion approach assumes ABC ISA BC ISA C. There is a strong argument to interpret this relation not as a hyponym relation but (as defined in <a href="http://ontogenesis.knowledgeblog.org/2010/01/21/skos/">SKOS</a>) as a &#8220;broader than/narrower than&#8221; relation. This is a high recall but low precision approach <span class="markdowncitation"> (<a href="#Brewster_BMC09" title="see citation">1</a>)</span>.</li>
<li>Lexico-syntactic Patterns: Ever since Hearst&#8217;s work <span class="markdowncitation"> (<a href="#Hearst92" title="see citation">18</a>)</span>, there has been a strong tradition of using lexico-syntactic patterns to identify a variety of ontological/semantic relations between terms. These patterns are usually of the form &#8220;NP is a type of NP&#8221; or &#8220;NP and other NPs&#8221;. This approach has a tendency to have nigh precision but low recall.</li>
</ul>
<p><strong>Step 5 Hierarchy Construction:</strong> Hierarchy construction usually falls out of the relation labelling step and may be combined. There are other approaches which focus first of hierarchy construction and ignore the relation labelling stage <span class="markdowncitation"> (<a href="#Sanderson_SIGIR99" title="see citation">19</a>)</span>.</p>
<a name="wptoc_2_1_1"></a><h3 id="otherknowledgesources">Other Knowledge Sources</h3>
<p>It is common practice in ontology learning approaches to use multiple sources of knowledge apart from textual ones. Thus a variety of existing knowledge structures have been used to build ontologies semi-automatically. These include WordNet <span class="markdowncitation"> (<a href="#fellbaum_book98" title="see citation">20</a>)</span>, existing thesauri such as Rogets, and in the Life Science domain such structures as UMLS. There is an inherent contradiction in using existing structures as part of the point is build up to date representations of knowledge as reflected in corpus of texts. However, ever since the work of Agirre et al. <span class="markdowncitation"> (<a href="#Agirre_ECAI00" title="see citation">21</a>)</span>, there have been many attempts to use a starting ontology and augment or update the ontology using corpora or the web.</p>
<a name="wptoc_0_0_3"></a><h2 id="specificsystemsandtools">Specific Systems and Tools</h2>
<ul>
<li>Text2Onto:  Combines machine learning approaches with  basic linguistic approaches such as tokenisation and part-of-speech (POS) tagging. Text2Onto is built upon the General Architecture for Text Engineering (GATE) that allows for flexible integration of natural language processing components. It also allows Java Annotation Pattern Engine (JAPE) rules to be written that can be used to facilitate recognition of ontological primitives. </li>
<li>Abraxas: This is a system based on an iterative open ended approach to the ontology learning process. A seed corpus is used, terms are identified, ontological knowledge in the form of triples are extracted using lexico-syntactic patterns, gaps in ontologically explicit knowledge are identified and suitable texts to &#8220;cover&#8221; the gaps are identified from an external repository such as the web <span class="markdowncitation"> (<a href="#Wilks_FTWS09" title="see citation">22</a>)</span>.</li>
<li>KnowItAll: While strictly not an ontology learning system, the KnowItAll system is a good example of large scale knowledge acquisition from the web. It uses most of the component techniques widely used in ontology learning except it outputs knowledge fragments, rather than complete ontologies <span class="markdowncitation"> (<a href="#Etzioni_WWW04" title="see citation">23</a>)</span>.</li>
<li>OntoLearn: This is a system <span class="markdowncitation"> (<a href="#Navigli_CL04" title="see citation">24</a>)</span> which uses extensively WordNet as an external resource. A corpus has term extraction applied (the OntoLearn team have a high quality term recognition component <span class="markdowncitation"> (<a href="#Sclano_IESA07" title="see citation">25</a>)</span>), and then knowledge about terms from WordNet is integrated. Ontolearn have undertaken the most extensive end user evaluations of their system <span class="markdowncitation"> (<a href="#Velardi_Evaluation05" title="see citation">26</a>)</span>.</li>
</ul>
<a name="wptoc_0_0_4"></a><h2 id="challengesandfuturedirections">Challenges and Future Directions</h2>
<p>There are two major challenges for automated ontology learning. Both have practical and philosophical implications.</p>
<a name="wptoc_4_1_0"></a><h3 id="thenatureoftext">The Nature of Text</h3>
<p>There is a significant gap between the vagueness, fluidity and ambiguity of natural language and the logical rigour of fully formal ontologies. One of the key challenges in the ontology learning field remains the extent to which techniques can be found to bridge this gap. The current approaches tend to either ignore this gap or make explicit the need for human post editing of the output of the ontology learning system. Domain specific text collections do not contain sufficient instances of use of terms, and sufficient variety of contexts to enable the accurate determination of ontological relations and features <span class="markdowncitation"> (<a href="#Brewster_SIGIR03" title="see citation">5</a>)</span>. In effect a large proportion of the knowledge that needs to be formalised is not present in most corpora, especially domain specific collections. It is in this light that the wider use of the web as a corpus for knowledge extraction/collection is highly attractive (cf. especially the KnowItAll system <span class="markdowncitation"> (<a href="#Etzioni_WWW04" title="see citation">23</a>)</span>,<span class="markdowncitation"> (<a href="#Etzioni_AI05" title="see citation">27</a>)</span>) but this will tend to increase the problems of ambiguity and domain specific senses.</p>
<a name="wptoc_4_1_1"></a><h3 id="evaluation">Evaluation</h3>
<p>The evaluation of the output of ontology learning systems remains a major challenge. The usual approaches to ontology evalauation have largely been based on quality control of the ontology building process and ensuring the ontology abides by certain principles <span class="markdowncitation"> (<a href="#Gomez-Perez_HO04" title="see citation">28</a>)</span>, <span class="markdowncitation"> (<a href="#Oltramari_OL02" title="see citation">29</a>)</span>. From an ontology learning from text perspective, these approaches are inapplicable. The usual approach is to use a Gold Standard with which to compare the automatically generated output <span class="markdowncitation"> (<a href="#Cimiano_Learning05" title="see citation">6</a>)</span>, sometimes using WordNet as the Gold Standard. The main problem with a Gold Standard is that if the ontology learning system is supposed to discover new knowledge then a GS based evaluation may penalise a system for discovering knowledge absent from the Standard. Also as systems scale up, it becomes harder and harder t get an overall assessment of the quality of the resulting output in a given case.</p>
<a name="wptoc_0_0_5"></a><h2 id="conclusion">Conclusion</h2>
<p>Ontology learning is an active field of research with considerable potential to make the task of ontology engineering substantially easier. Another important role for OL is to allow ontologies to be kept up to date more effectively. Finally, one should mention that in many contexts where less formal ontologies are needed (taxonomies or structured vocabularies), they output of OL systems is very close to that required.</p>
<a name="wptoc_5_1_0"></a><h3 id="acknowledgements">Acknowledgements</h3>
<p>This paper is an open access work distributed under the terms of the Creative Commons Attribution License 3.0 (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided that the original author and source are attributed. </p>
<p>The paper and its publication environment form part of the work of the Ontogenesis Network, supported by EPSRC grant EP/E021352/1.</p>
<div class="bibliography">
<hr />
<p>Bibliography</p>
<div id="Brewster_BMC09">
<p>[1] <span class="item">Brewster, C.; Jupp, S.; Luciano, J.; Shotton, D.; Stevens, R. &amp; Zhang, Z. <a href="http://www.biomedcentral.com/1471-2105/10/S5/S1">Issues in learning an ontology from text</a> BMC Bioinformatics, 2009, 10, S1</span></p>
</div>
<div id="Davis_AIMag1993">
<p>[2] <span class="item">Davis, R.; Shrobe, H. &amp; Szolovits, P. <a href="http://www.aaai.org/ojs/index.php/aimagazine/article/viewArticle/1029" title="What Is a Knowledge Representation? | Davis | AI Magazine">What is a Knowledge Representation AI Magazine</a>, 1993, 14, 17-33</span></p>
</div>
<div id="Harris_Word54">
<p>[3] <span class="item">Harris, Z. Distributional structure Word, 1954, 23, 146-162</span></p>
</div>
<div id="Firth_Papers57">
<p>[4] <span class="item">Firth, J. R. Papers in Linguistics, 1934-1951 Oxford University Press, 1957</span></p>
</div>
<div id="Brewster_SIGIR03">
<p>[5] <span class="item">Brewster, C.; Ciravegna, F. &amp; Wilks, Y. Background and Foreground Knowledge in Dynamic Ontology Construction Proceedings of the Semantic Web Workshop, Toronto, August 2003, 2003</span></p>
</div>
<div id="Cimiano_Learning05">
<p>[6] <span class="item">Cimiano, P.; Pivk, A.; Schmidt-Thieme, L. &amp; Staab, S. <a href="http://www.aifb.uni-karlsruhe.de/WBS/pci/OLPb14a7b8059d9c055954c92674ce60032Bookb14a7b8059d9c055954c92674ce60032Cimiano.pdf">Learning Taxonomic Relations from Heterogeneous Sources of Evidence</a> Ontology Learning from Text: Methods, Evaluation and Applications, IOS Press, 2005</span></p>
</div>
<div id="Brewster_Knowledge08">
<p>[7] <span class="item">Brewster, C. [Mind the Gap: Bridging from Text to Ontological Knowledge] Department of Computer Science, University of Sheffield, 2008</span></p>
</div>
<div id="Ahmad_Corpus01">
<p>[8] <span class="item">Ahmad, K. &amp; Rogers, M. A. Wright, S.-E. &amp; Budin, G. (ed.) <a href="">Corpus Linguistics and Terminology Extraction</a> Handbook of Terminology Management (Volume 2), John Benjamins Publishing Company, 2001, 725-760</span></p>
</div>
<div id="Velardi_ACL01">
<p>[9] <span class="item">Velardi, P.; Missikoff, M. &amp; Basili, R. <a href="www.aclweb.org/anthology/W/W01/W01-1005.pdf">Identification of Relevant Terms to Support the Construction of Domain Ontologies</a> Proceedings of the ACL 2001 Workshop on Human Language Technology and Knowledge Management, 2001</span></p>
</div>
<div id="Park_COLING02">
<p>[10] <span class="item">Park, Y.; Byrd, R. J. &amp; Boguraev, B. <a href="www.aclweb.org/anthology/C/C02/C02-1142.pdf">Automatic Glossary Extraction: Beyond Terminology Identification</a> 19th International Conference on Computational Linguistics &#8211; COLING 02, Howard International House and Academia Sinica, 2002</span></p>
</div>
<div id="Ananiadou_Automatic06">
<p>[11] <span class="item">Ananiadou, S. &amp; Nenadic, G. Ananiadou, S. &amp; McNaught, J. (ed.) Automatic Terminology Management in Biomedicine, Text Mining for Biology and Biomedicine, Artech House Books, 2006, 67-98</span></p>
</div>
<div id="LeMoigno_AMIA02">
<p>[12] <span class="item">Moigno, S. L.; Charlet, J.; Bourigault, D. &amp; Jaulent, M.-C. <a href="http://www-sop.inria.fr/acacia/WORKSHOPS/ECAI2002-OLT/Proceedings/LeMoigno.pdf">Terminology extraction from text to build an ontology in surgical intensive care</a> Proceedings of the AMIA 2002 annual symposium, 2002</span></p>
</div>
<div id="Zhang_LREC08">
<p>[13] <span class="item">Zhang, Z.; Iria, J.; Brewster, C. &amp; Ciravegna, F. <a href="http://www.dcs.shef.ac.uk/~kiffer/papers/Zhangb14a7b8059d9c055954c92674ce60032LREC08.pdf">A Comparative Evaluation of Term Recognition Algorithms</a> Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC08), 2008</span></p>
</div>
<div id="ManningSchutze99">
<p>[14] <span class="item">Manning, C. D. &amp; Schütze, H. [Foundations of Statistical Natural Language Processing] MIT Press, 1999</span></p>
</div>
<div id="grefenstette_Explorations94">
<p>[15] <span class="item">Grefenstette, G. [Explorations in Automatic Thesaurus Discovery] Kluwer, 1994</span></p>
</div>
<div id="Brown_CL92">
<p>[16] <span class="item">Brown, P.; Pietra, V. D.; deSouza, P.; Lai, J. &amp; Mercer, R. <a href="http://acl.ldc.upenn.edu/J/j92/J92-4003.pdf">Class Based n-gram Models of Natural Language</a> Computational Linguistics, 1992, 18, 467-479</span></p>
</div>
<div id="Villaverde_ESA09">
<p>[17] <span class="item">Villaverde, J.; Persson, A.; Godoy, D. &amp; Amandi, A. <a href="http://www.sciencedirect.com/science/article/B6V03-4VH4DC9-3/2/7e38c0319113a0be80fe51c08e1f2c98">Supporting the discovery and labeling of non-taxonomic relationships in ontology learning</a> Expert Systems with Applications, 2009, 36, 10288 &#8211; 10294</span></p>
</div>
<div id="Hearst92">
<p>[18] <span class="item">Hearst, M. <a href="http://acl.ldc.upenn.edu/C/C92/C92-2082.pdf.">Automatic Acquisition of Hyponyms from Large Text Corpora</a> Proceedings of the Fourteenth International Conference on Computational Linguistics (COLING 92), Nantes, France, July 1992, 1992</span></p>
</div>
<div id="Sanderson_SIGIR99">
<p>[19] <span class="item">Sanderson, M. &amp; Croft, B. <a href="http://dis.shef.ac.uk/mark/publications/myb14a7b8059d9c055954c92674ce60032papers/SIGIR99.pdf">Deriving concept hierarchies from text</a> Proceedings of the 22nd ACM SIGIR Conference, 1999, 206-213</span></p>
</div>
<div id="fellbaum_book98">
<p>[20] <span class="item">Fellbaum, C. D. [WordNet: An Electronic Lexical Database] MIT Press, 1998</span></p>
</div>
<div id="Agirre_ECAI00">
<p>[21] <span class="item">Agirre, E.; Ansa, O.; Hovy, E. H. &amp; Martnez, D. Staab, S.; Maedche, A.; Nedellec, C. &amp; Wiemer-Hastings, P. (ed.) <a href="http://SunSITE.Informatik.RWTH-Aachen.DE/Publications/CEUR336d5ebc5436534e61d16e63ddfca327WS/Vol-31/EAgirreb14a7b8059d9c055954c92674ce6003214.pdf">Enriching very large ontologies using the WWW.</a> ECAI Workshop on Ontology Learning, CEUR-WS.org, 2000, 31</span></p>
</div>
<div id="Wilks_FTWS09">
<p>[22] <span class="item">Wilks, Y. &amp; Brewster, C. <a href="http:/dx.doi.org/10.1561/1800000002">Natural Language Processing as a Foundation of the Semantic Web</a> Foundations and Trends in Web Science, 2009, 1, 199-327</span></p>
</div>
<div id="Etzioni_WWW04">
<p>[23] <span class="item">Etzioni, O.; Cafarella, M.; Downey, D.; Kok, S.; Popescu, A.-M.; Shaked, T.; Soderland, S.; Weld, D. S. &amp; Yates, A. Feldman, S. I.; Uretsky, M.; Najork, M. &amp; Wills, C. E. (ed.) <a href="http://www.cs.washington.edu/research/knowitall/papers/www-paper.pdf">Web-scale information extraction in knowitall (preliminary results).</a> Proceedings of the 13th international conference on World Wide Web, WWW 2004, New York, NY, USA, May 17-20, 2004, ACM, 2004, 100-110</span></p>
</div>
<div id="Navigli_CL04">
<p>[24] <span class="item">Navigli, R. &amp; Velardi, P. [Learning Domain Ontologies from Document Warehouses and Dedicated Websites] Computational Linguistics, 2004, 30, 151-179</span></p>
</div>
<div id="Sclano_IESA07">
<p>[25] <span class="item">Sclano, F. &amp; Velardi, P. <a href="http://lcl2.uniroma1.it/termextractor/help/I-ESAb14a7b8059d9c055954c92674ce600322007b14a7b8059d9c055954c92674ce60032Sclanob14a7b8059d9c055954c92674ce60032Velardi.pdf">TermExtractor: a Web Application to Learn the Shared Terminology of Emergent Web Communities</a> Proceedings of the 3rd International Conference on Interoperability for Enterprise Software and Applications (I-ESA 2007), 2007</span></p>
</div>
<div id="Velardi_Evaluation05">
<p>[26] <span class="item">Velardi, P.; Navigli, R.; Cucchiarelli, A. &amp; Neri, F. Buitelaar, P.; Cimiano, P. &amp; Magnini, B. (ed.) [Evaluation of OntoLearn, a methodology for automatic learning of domain ontologies] Ontology Learning from Text: Methods, Evaluation and Applications, IOS Press, 2005</span></p>
</div>
<div id="Etzioni_AI05">
<p>[27] <span class="item">Etzioni, O.; Cafarella, M. J.; Downey, D.; Popescu, A.-M.; Shaked, T.; Soderland, S.; Weld, D. S. &amp; Yates, A. <a href="http://dx.doi.org/10.1016/j.artint.2005.03.001">Unsupervised named-entity extraction from the Web An experimental study.</a> Artif. Intell., 2005, 165, 91-134</span></p>
</div>
<div id="Gomez-Perez_HO04">
<p>[28] <span class="item">Gómez-Pérez, A. Staab, S. &amp; Studer, R. (ed.) [Ontology Evaluation.] Handbook on Ontologies, Springer, 2004, 251-274</span></p>
</div>
<div id="Oltramari_OL02">
<p>[29] <span class="item">Oltramari, A.; Gangemi, A.; Guarino, N. &amp; Masolo, C. <a href="http://www.loa-cnr.it/Papers/Oltramari2.pdf">Restructuring WordNet&#8217;s Top-Level: The OntoClean approach</a> Proceedings of the Workshop OntoLex&#8217;2, Ontologies and Lexical Knowledge Bases, 2002</span></p>
</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://ontogenesis.knowledgeblog.org/2010/01/22/ontology-learning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
