<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.evoio.org/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Hilmar</id>
	<title>Evolutionary Interoperability and Outreach - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://www.evoio.org/w/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Hilmar"/>
	<link rel="alternate" type="text/html" href="https://www.evoio.org/wiki/Special:Contributions/Hilmar"/>
	<updated>2026-05-15T08:02:47Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.41.1</generator>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MIAPA&amp;diff=4648</id>
		<title>MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MIAPA&amp;diff=4648"/>
		<updated>2018-06-01T22:14:47Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This serves as a web home and central hub for resources related to developing and deploying a Minimal Information for a Phylogenetic Analysis (MIAPA) standard. &lt;br /&gt;
&lt;br /&gt;
== Current versions ==&lt;br /&gt;
&lt;br /&gt;
The most current versions of MIAPA artifacts such as checklist, ontologies, and tools are maintained on Github: https://github.com/evoinfo/miapa&lt;br /&gt;
&lt;br /&gt;
* [https://github.com/evoinfo/miapa/blob/master/checklist/MIAPA-checklist.md MIAPA checklist]&lt;br /&gt;
* [https://github.com/evoinfo/miapa/blob/master/ontology/miapa.owl MIAPA ontology] ([https://raw.githubusercontent.com/evoinfo/miapa/master/ontology/miapa.owl OWL format])&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
'''Active projects''' &lt;br /&gt;
* Karen Cranston and Elliot Hauser surveyed the community about the the MIAPA checklist terms&lt;br /&gt;
** [https://duke.qualtrics.com/SE/?SID=SV_b2upIMII4fxJkAA link to survey]&lt;br /&gt;
** resulting manuscript is in preparation on [https://github.com/eah13/otol GitHub]&lt;br /&gt;
* Elliott Hauser's 2012 Google Summer of Code project &amp;quot;[http://informatics.nescent.org/wiki/PhyloSoC:_NeXML_to_MIAPA_mapping NeXML to MIAPA mapping]&amp;quot;&lt;br /&gt;
* Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPAWorkshop2011 Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard] workshop at TDWG 2011&lt;br /&gt;
** This workshop produced a [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey checklist of metadata elements] for a proposed community survey.&lt;br /&gt;
* [[MIAPA Draft RFC]]&lt;br /&gt;
* Plan for a [[PhyloRelevantLiteratureSurvey]]&lt;br /&gt;
* [[Conference Plan 2011]]&lt;br /&gt;
* [[Tree Best Practices]], article in progress for BMC, based largely on TDWG report&lt;br /&gt;
* [[MIAPA Survey]] on barriers to re-use of phylogenies and associated data and metadata&lt;br /&gt;
** [[Reuse Cases]]&lt;br /&gt;
** [[Barriers to reuse]] -- from user stories collected by the survey team&lt;br /&gt;
** [[Reuse Barriers]] -- Hilmar's list regarding TreeBASE submissions&lt;br /&gt;
* [[MIAPA/Demonstration Project| MIAPA Demonstration Project]] in the Leebens-Mack lab, spring 2011&lt;br /&gt;
* The [[MIAPA/PhyloWays| PhyloWays]] list of reference workflow annotations (papers with extracted workflow information)&lt;br /&gt;
&lt;br /&gt;
'''Resources'''&lt;br /&gt;
* There is an [http://groups.google.com/group/miapa-discuss email list] for discussion of MIAPA.  Please consider joining if you would like to play an active role in developing and supporting a minimal-information checklist for phylogenetic studies. &lt;br /&gt;
* [http://mibbi.sourceforge.net/projects/MIAPA.shtml MIAPA at MIBBI]&lt;br /&gt;
* [[CommentsOnOpenAnnotationDataModel]]&lt;br /&gt;
* [https://evoinfo.nescent.org/MIAPA_WhitePaper MIAPA white paper] at NESCent&lt;br /&gt;
* Arlin's [https://evoinfo.nescent.org/Supporting_MIAPA  proof-of-concept and an idea for a knowledge-capture experiment]&lt;br /&gt;
* Leebens-Mack et al (2006) Taking the first steps towards a standard for reporting on phylogenies: Minimum Information About a Phylogenetic Analysis (MIAPA). OMICS: a Journal of Integrative Biology 10 (2): 231-7. [http://dx.doi.org/10.1089/omi.2006.10.231 doi:10.1089/omi.2006.10.231].&lt;br /&gt;
&lt;br /&gt;
[[Category:MIAPA]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MIAPA&amp;diff=4647</id>
		<title>MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MIAPA&amp;diff=4647"/>
		<updated>2018-06-01T22:12:32Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Current version of the MIAPA checklist */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;This serves as a web home and central hub for resources related to developing and deploying a Minimal Information for a Phylogenetic Analysis (MIAPA) standard. &lt;br /&gt;
&lt;br /&gt;
== Current versions ==&lt;br /&gt;
&lt;br /&gt;
The most current versions of MIAPA artifacts such as checklist, ontologies, and tools are maintained on Github: https://github.com/evoinfo/miapa&lt;br /&gt;
&lt;br /&gt;
* [https://github.com/evoinfo/miapa/blob/master/checklist/MIAPA-checklist.md MIAPA checklist]&lt;br /&gt;
* [https://github.com/evoinfo/miapa/blob/master/ontology/miapa.owl MIAPA ontology] ([https://raw.githubusercontent.com/evoinfo/miapa/master/ontology/miapa.owl OWL format])&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
'''Active projects''' &lt;br /&gt;
* Karen Cranston and Elliot Hauser surveyed the community about the the MIAPA checklist terms&lt;br /&gt;
** [https://duke.qualtrics.com/SE/?SID=SV_b2upIMII4fxJkAA link to survey]&lt;br /&gt;
** resulting manuscript is in preparation on [https://github.com/eah13/otol GitHub]&lt;br /&gt;
* Elliott Hauser's 2012 Google Summer of Code project &amp;quot;[http://informatics.nescent.org/wiki/PhyloSoC:_NeXML_to_MIAPA_mapping NeXML to MIAPA mapping]&amp;quot;&lt;br /&gt;
* Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPAWorkshop2011 Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard] workshop at TDWG 2011&lt;br /&gt;
** This workshop produced a [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey checklist of metadata elements] for a proposed community survey.&lt;br /&gt;
* [[MIAPA Draft RFC]]&lt;br /&gt;
* Plan for a [[PhyloRelevantLiteratureSurvey]]&lt;br /&gt;
* [[Conference Plan 2011]]&lt;br /&gt;
* [[Tree Best Practices]], article in progress for BMC, based largely on TDWG report&lt;br /&gt;
* [[MIAPA Survey]] on barriers to re-use of phylogenies and associated data and metadata&lt;br /&gt;
** [[Reuse Cases]]&lt;br /&gt;
** [[Barriers to reuse]] -- from user stories collected by the survey team&lt;br /&gt;
** [[Reuse Barriers]] -- Hilmar's list regarding TreeBASE submissions&lt;br /&gt;
* [[MIAPA/Demonstration Project| MIAPA Demonstration Project]] in the Leebens-Mack lab, spring 2011&lt;br /&gt;
* The [[MIAPA/PhyloWays| PhyloWays]] list of reference workflow annotations (papers with extracted workflow information)&lt;br /&gt;
&lt;br /&gt;
'''Resources'''&lt;br /&gt;
* There is an [http://groups.google.com/group/miapa-discuss email list] for discussion of MIAPA.  Please consider joining if you would like to play an active role in developing and supporting a minimal-information checklist for phylogenetic studies. &lt;br /&gt;
* [http://mibbi.sourceforge.net/projects/MIAPA.shtml MIAPA at MIBBI]&lt;br /&gt;
* [[CommentsOnOpenAnnotationDataModel]]&lt;br /&gt;
* [https://www.nescent.org/wg_evoinfo/MIAPA_WhitePaper MIAPA white paper] at NESCent&lt;br /&gt;
* Arlin's [https://www.nescent.org/wg_evoinfo/Supporting_MIAPA  proof-of-concept and an idea for a knowledge-capture experiment]&lt;br /&gt;
* Leebens-Mack et al (2006) Taking the first steps towards a standard for reporting on phylogenies: Minimum Information About a Phylogenetic Analysis (MIAPA). OMICS: a Journal of Integrative Biology 10 (2): 231-7. [http://dx.doi.org/10.1089/omi.2006.10.231 doi:10.1089/omi.2006.10.231].&lt;br /&gt;
&lt;br /&gt;
[[Category:MIAPA]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-email-ph&amp;diff=4646</id>
		<title>MediaWiki:Createacct-email-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-email-ph&amp;diff=4646"/>
		<updated>2017-11-19T01:59:35Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Created page with &amp;quot;Leave empty - provided by Google&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Leave empty - provided by Google&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4645</id>
		<title>MediaWiki:Createacct-yourpassword-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4645"/>
		<updated>2017-11-19T01:58:20Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Leave empty - not needed&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4644</id>
		<title>MediaWiki:Createacct-yourpasswordagain-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4644"/>
		<updated>2017-11-19T01:57:54Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Leave empty - not needed&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Sidebar&amp;diff=4643</id>
		<title>MediaWiki:Sidebar</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Sidebar&amp;diff=4643"/>
		<updated>2017-11-19T01:52:47Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* navigation&lt;br /&gt;
** Main_Page|EvoIO Home&lt;br /&gt;
** Main_Page#What.27s_happening|EvoIO News&lt;br /&gt;
** ReUseCases|Re-Use-cases&lt;br /&gt;
** http://www.nescent.org |NESCent&lt;br /&gt;
&lt;br /&gt;
* Projects&lt;br /&gt;
** PhyloStars|PhyloStars&lt;br /&gt;
** Phylotastic|Phylotastic!&lt;br /&gt;
** MIAPA|MIAPA&lt;br /&gt;
** BarriersToReUse|Barriers To Reuse&lt;br /&gt;
** HIP|HIP&lt;br /&gt;
** VoCamp1| VoCamp&lt;br /&gt;
&lt;br /&gt;
* Related projects&lt;br /&gt;
** http://evoinfo.nescent.org/Concept_Glossary| Concept Glossary&lt;br /&gt;
** http://www.nexml.org| NeXML&lt;br /&gt;
** http://www.evolutionaryontology.org/|CDAO&lt;br /&gt;
** http://evoinfo.nescent.org/PhyloWS|phyloWS&lt;br /&gt;
&lt;br /&gt;
* for editors&lt;br /&gt;
** recentchanges-url|recentchanges&lt;br /&gt;
** Help:Wiki|Wiki Help&lt;br /&gt;
&lt;br /&gt;
* browse site&lt;br /&gt;
** Special:Allpages|All Pages&lt;br /&gt;
** Special:Categories|By Category&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4642</id>
		<title>MediaWiki:Createacct-yourpasswordagain-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4642"/>
		<updated>2017-11-19T01:51:02Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Enter local password again (optional)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4641</id>
		<title>MediaWiki:Createacct-yourpassword-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4641"/>
		<updated>2017-11-19T01:50:52Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Enter a local password (optional)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4640</id>
		<title>MediaWiki:Createacct-submit</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4640"/>
		<updated>2017-11-19T01:48:53Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;DO NOT USE - use Google instead&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4639</id>
		<title>MediaWiki:Createacct-submit</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4639"/>
		<updated>2017-11-19T01:48:22Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;DO NOT USE - create with Google instead&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4638</id>
		<title>MediaWiki:Createacct-submit</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-submit&amp;diff=4638"/>
		<updated>2017-11-19T01:47:02Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Created page with &amp;quot;Local account creation disabled - DO NOT USE&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Local account creation disabled - DO NOT USE&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4637</id>
		<title>MediaWiki:Createacct-yourpasswordagain-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpasswordagain-ph&amp;diff=4637"/>
		<updated>2017-11-19T01:45:23Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Created page with &amp;quot;Enter password again (optional)&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Enter password again (optional)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4636</id>
		<title>MediaWiki:Createacct-yourpassword-ph</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-yourpassword-ph&amp;diff=4636"/>
		<updated>2017-11-19T01:44:55Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Created page with &amp;quot;Enter a password (optional)&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Enter a password (optional)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4635</id>
		<title>MediaWiki:Createacct-helpusername</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4635"/>
		<updated>2017-11-19T01:43:37Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;''username you desire, '''''not''''' your Google ID''&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4634</id>
		<title>MediaWiki:Createacct-helpusername</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4634"/>
		<updated>2017-11-19T01:42:56Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;''desired username for this wiki, '''''not''''' your Google ID''&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4633</id>
		<title>MediaWiki:Createacct-helpusername</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4633"/>
		<updated>2017-11-19T01:41:16Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;(desired username, ''not'' your Google ID)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4630</id>
		<title>MediaWiki:Createacct-helpusername</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4630"/>
		<updated>2017-11-19T01:22:30Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;(Required also for Google)&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4629</id>
		<title>MediaWiki:Createacct-helpusername</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MediaWiki:Createacct-helpusername&amp;diff=4629"/>
		<updated>2017-11-19T01:15:48Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Created page with &amp;quot;Hurray!&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Hurray!&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=4628</id>
		<title>User:Hilmar</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=4628"/>
		<updated>2016-04-17T19:15:54Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;My most up-to-date profile and what I've recently been up to is usually at my ORCID record: http://orcid.org/0000-0001-9107-0714&lt;br /&gt;
&lt;br /&gt;
You can also find me on [http://twitter.com/hlapp Twitter], [http://www.linkedin.com/in/hlapp LinkedIn], [https://plus.google.com/117016856028818567812 Google+], [http://github.com/hlapp Github], and a few other social networks (usually with handle hlapp).&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=EvoIO:About&amp;diff=4627</id>
		<title>EvoIO:About</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=EvoIO:About&amp;diff=4627"/>
		<updated>2016-04-16T00:25:09Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;EvoIO aims to be a nucleating center for developing, applying and disseminating interoperability technologies and initiatives that connect and coordinate between users, developers, and other stakeholders in evolutionary informatics.&lt;br /&gt;
&lt;br /&gt;
== About this site ==&lt;br /&gt;
&lt;br /&gt;
This community wiki was hosted and administered by [http://nescent.org NESCent] until 2015, when NESCent's NSF funding ended. This wiki, and several other evolutionary informatics community wikis initiated as a result of NESCent's informatics activities, were publicly archived as part of NESCent's wind down ([http://dx.doi.org/10.5281/zenodo.19018 doi:10.5281/zenodo.19018], [http://dx.doi.org/10.5281/zenodo.19000 doi:10.5281/zenodo.19000], [http://dx.doi.org/10.5281/zenodo.18998 doi:10.5281/zenodo.18998], [http://dx.doi.org/10.5281/zenodo.19004 doi:10.5281/zenodo.19004]). This wiki, as well as the [https://informatics.nescent.org Phyloinformatics Wiki] and the [https://evoinfo.nescent.org Evolutionary Informatics Working Group wiki] have since been restored from those archives, and are now independently hosted and maintained.&lt;br /&gt;
&lt;br /&gt;
== Contact ==&lt;br /&gt;
&lt;br /&gt;
Please email admin at evoio dot org to reach the site's administrators.&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=171</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=171"/>
		<updated>2015-05-03T17:49:30Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* What's happening */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Who we are ==&lt;br /&gt;
&lt;br /&gt;
The EvoIO collaboration emerged in 2009 from several NESCent-sponsored activities that focused on software and data interoperability for evolutionary analysis, including the Evolutionary Informatics working group (2006-2009), and the Evolutionary Database Interoperability hackathon (2009).  EvoIO aims to be a nucleating center for developing, applying and  disseminating interoperability technology that connects and coordinates between stakeholders, developers, and standards bodies. &lt;br /&gt;
&lt;br /&gt;
Members of the EvoIO group, which include biologists and computer scientists, have over the past 3  years harnessed a variety of collaborative events to successfully build an initial stack of interoperability  technologies that is owned by the community and open to participation: &lt;br /&gt;
* [http://nexml.org NeXML], a NEXUS-inspired XML format that is validatable yet extensible&lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO], an ontology of comparative data analysis formalizing the semantics of evolutionary data and metadata; and &lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS], a web-services interface standard for querying, retrieving, and referencing phylogenetic data on the web. &lt;br /&gt;
&lt;br /&gt;
For more information about how to get involved, read on.  &lt;br /&gt;
&lt;br /&gt;
== What's happening ==&lt;br /&gt;
* A grant proposal to NSF's Advances in Biological Informatics (ABI) program titled &amp;quot;An Ontology-Based System for Querying Life in a Post-Taxonomic Age&amp;quot; (PIs: Cellinese and Lapp) has now been funded. The proposal and public abstract are available on Figshare ([http://dx.doi.org/10.6084/m9.figshare.1401984 doi:10.6084/m9.figshare.1401984]). The project picks up some ideas from the [[Phyloreferencing subgroup]] of the [[VoCamp1|Phyloinformatics VoCamp]]. &lt;br /&gt;
&lt;br /&gt;
* Many of us are PIs or Leadership Team members of a NESCent working group called [http://www.evoio.org/wiki/HIP Hackathons, Interoperability, Phylogenies (HIP)].   HIP is the current locus of most of our activity .  So far we have had two '''[[Phylotastic]]''' hackathons focused on a web-services implementation of the pruning, grafting, name-reconciliation and other functionalities necessary for researchers to take advantage of emerging megatrees.  HIP is planning a third hackathon to develop a socially networked phyloinformatics community of practice.  &lt;br /&gt;
&lt;br /&gt;
* We organized a workshop on &amp;quot;[http://docs.google.com/document/pub?id=1DiuMEK7g_DJSLzT9ElAwEtJiKRXuQAvoc0JtfoX0sqE Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard]&amp;quot; at the 2011 TDWG Conference in New Orleans, LS. The workshop took place on Monday, Oct 17, and Tuesday, Oct 18. The goal of the workshop was to determine how a future reporting standard for phylogenetic analyses can best serve biodiversity science and related research applications. &lt;br /&gt;
&lt;br /&gt;
* We submitted an [[ABI 2011 proposal|NSF ABI proposal]] on development and (finally!) linking TreeBASE and ToLWeb, entitled &amp;quot;Towards a comprehensive, community- owned and sustainable repository of reusable phylogenetic knowledge&amp;quot; (Lapp, Bietz, Cranston, Piel, Donoghue, Westneat). The proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
* At [http://ievobio.org/ iEvoBio] 2011 in Norman, OK, Arlin Stoltzfus gave a talk on &amp;quot;Publishing re-usable phylogenetic trees, in theory and practice&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* In December 2010, EvoIO participants submitted a proposal to NESCent for a working group entitled [[HIP| HIP: Hackathons, Interoperability, Phylogenies]] (Vos, Stoltzfus).  The proposal is for a series of hackathons partly supported by outside partners.  NESCent recently announced that this proposal will be funded.  Woo hoo!&lt;br /&gt;
&lt;br /&gt;
* A proposal for a new NESCent course titled &amp;quot;Phyloinformatics: linking with online data and services&amp;quot; was submitted by EvoIO participants (Stoltzfus, Vos, Cranston, Lapp) in July 2010 and subsequently selected for funding, but had since to be canceled due to conflicting obligations of the PIs. Instead, Vos and Piel co-taught a [http://academy.nescent.org/wiki/Computational_phyloinformatics Computational Phyloinformatics course] modeled after previous instances of this course, but supplemented with programming exercises using PhyloWS wrappers for ToLWeb and uBio. The course was held in Kyoto in August 2011, and supported by the Bioinformatics Center of Kyoto University and DBCLS/JST, with co-sponsorship by NESCent.&lt;br /&gt;
&lt;br /&gt;
* EvoIO participants in various ways participated in the [http://ievobio.org iEvoBio satellite conference] at Evolution 2010 in Portland, OR:&lt;br /&gt;
** Rutger Vos gave a talk on &amp;quot;TreeBase2: Rise of the machines&amp;quot; (Rutger Vos, Hilmar Lapp, Bill Piel, Val Tannen)&lt;br /&gt;
** Brandon Chisham presented &amp;quot;CDAO-Store: A New Vision for Data Integration&amp;quot; (Brandon Chisham, Trung Le, Enrico Pontelli, Tran Son, Ben Wright)&lt;br /&gt;
** Arlin Stoltzfus presented &amp;quot;EvoIO: Community-driven standards for sustainable interoperability&amp;quot; (Arlin Stoltzfus, Nico Cellinese, Karen Cranston, Hilmar Lapp, Sheldon McKay, Enrico Pontelli, Rutger Vos)&lt;br /&gt;
&lt;br /&gt;
* The EvoIO group staged a successful [[VoCamp1|Phyloinformatics VoCamp]] November 7-11, 2009 in Montpellier, France, co-located with the annual meeting of the &amp;lt;span class=&amp;quot;plainlinks&amp;quot;&amp;gt;[http://www.tdwg.org/ International Biodiversity Information Standards Organization (TDWG)]&amp;lt;/span&amp;gt;.   A VoCamp is a hands-on meeting for investigators to create and develop ontologies and lightweight vocabularies in support of data integration and re-use-- in this case, the integration and re-use of phylogenetic trees and associated data and metadata.  More information at [[VoCamp1]].&lt;br /&gt;
&lt;br /&gt;
* We submitted a proposal to the NSF INTEROP panel in 2009 for &amp;quot;A network for enabling community-driven standards to link evolution to the global web of data (EvoIO)&amp;quot;. More information at [[NSF INTEROP 2009|NSF INTEROP 2009]]. This proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
== How to get involved ==&lt;br /&gt;
&lt;br /&gt;
How to get involved?  Each stack component has an open community of developers and a mailing list.  Click on the links below to find out more.  You can sign up for the mailing list and start to contribute.  &lt;br /&gt;
* [http://nexml.org NeXML]'s mailing list is nexml-discuss@lists.sourceforge.net (contact Rutger Vos, &amp;lt;firstname&amp;gt;aldo@gmail.com)&lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS]'s group is phylows@googlegroups.com (contact Hilmar Lapp, &amp;lt;firstinitial&amp;gt;lapp@nescent.org) &lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO]'s mailing list is cdao-discuss@lists.sourceforge.net (contact Arlin Stoltzfus, &amp;lt;firstname&amp;gt;@umd.edu)&lt;br /&gt;
* [http://pipes.yahoo.com/pipes/pipe.run?_id=b269b02171b6411c4a27ae0714dc76da&amp;amp;_render=rss RSS feed of relevant projects]&lt;br /&gt;
Also EvoIO folks are looking for interoperability “targets”, which can be either single projects that want to make resources more interoperable, or pairs of resources that want to integrate.  Contact any of us if you have an idea.&lt;br /&gt;
&lt;br /&gt;
== MediaWiki Help if you need it ==&lt;br /&gt;
&lt;br /&gt;
* [http://meta.wikipedia.org/wiki/MediaWiki_User%27s_Guide User's Guide]&lt;br /&gt;
* [http://www.mediawiki.org/wiki/Help:FAQ MediaWiki FAQ]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=170</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=170"/>
		<updated>2015-05-03T17:49:04Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* What's happening */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Who we are ==&lt;br /&gt;
&lt;br /&gt;
The EvoIO collaboration emerged in 2009 from several NESCent-sponsored activities that focused on software and data interoperability for evolutionary analysis, including the Evolutionary Informatics working group (2006-2009), and the Evolutionary Database Interoperability hackathon (2009).  EvoIO aims to be a nucleating center for developing, applying and  disseminating interoperability technology that connects and coordinates between stakeholders, developers, and standards bodies. &lt;br /&gt;
&lt;br /&gt;
Members of the EvoIO group, which include biologists and computer scientists, have over the past 3  years harnessed a variety of collaborative events to successfully build an initial stack of interoperability  technologies that is owned by the community and open to participation: &lt;br /&gt;
* [http://nexml.org NeXML], a NEXUS-inspired XML format that is validatable yet extensible&lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO], an ontology of comparative data analysis formalizing the semantics of evolutionary data and metadata; and &lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS], a web-services interface standard for querying, retrieving, and referencing phylogenetic data on the web. &lt;br /&gt;
&lt;br /&gt;
For more information about how to get involved, read on.  &lt;br /&gt;
&lt;br /&gt;
== What's happening ==&lt;br /&gt;
* A grant proposal to NSF's Advances in Biological Informatics (ABI) program titled &amp;quot;An Ontology-Based System for Querying Life in a Post-Taxonomic Age&amp;quot; (PIs: Cellinese and Lapp) has now been funded. The proposal and public abstract] are available on Figshare ([http://dx.doi.org/10.6084/m9.figshare.1401984 doi:10.6084/m9.figshare.1401984]). The project picks up some ideas from the [[Phyloreferencing subgroup]] of the [[VoCamp1|Phyloinformatics VoCamp]]. &lt;br /&gt;
&lt;br /&gt;
* Many of us are PIs or Leadership Team members of a NESCent working group called [http://www.evoio.org/wiki/HIP Hackathons, Interoperability, Phylogenies (HIP)].   HIP is the current locus of most of our activity .  So far we have had two '''[[Phylotastic]]''' hackathons focused on a web-services implementation of the pruning, grafting, name-reconciliation and other functionalities necessary for researchers to take advantage of emerging megatrees.  HIP is planning a third hackathon to develop a socially networked phyloinformatics community of practice.  &lt;br /&gt;
&lt;br /&gt;
* We organized a workshop on &amp;quot;[http://docs.google.com/document/pub?id=1DiuMEK7g_DJSLzT9ElAwEtJiKRXuQAvoc0JtfoX0sqE Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard]&amp;quot; at the 2011 TDWG Conference in New Orleans, LS. The workshop took place on Monday, Oct 17, and Tuesday, Oct 18. The goal of the workshop was to determine how a future reporting standard for phylogenetic analyses can best serve biodiversity science and related research applications. &lt;br /&gt;
&lt;br /&gt;
* We submitted an [[ABI 2011 proposal|NSF ABI proposal]] on development and (finally!) linking TreeBASE and ToLWeb, entitled &amp;quot;Towards a comprehensive, community- owned and sustainable repository of reusable phylogenetic knowledge&amp;quot; (Lapp, Bietz, Cranston, Piel, Donoghue, Westneat). The proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
* At [http://ievobio.org/ iEvoBio] 2011 in Norman, OK, Arlin Stoltzfus gave a talk on &amp;quot;Publishing re-usable phylogenetic trees, in theory and practice&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* In December 2010, EvoIO participants submitted a proposal to NESCent for a working group entitled [[HIP| HIP: Hackathons, Interoperability, Phylogenies]] (Vos, Stoltzfus).  The proposal is for a series of hackathons partly supported by outside partners.  NESCent recently announced that this proposal will be funded.  Woo hoo!&lt;br /&gt;
&lt;br /&gt;
* A proposal for a new NESCent course titled &amp;quot;Phyloinformatics: linking with online data and services&amp;quot; was submitted by EvoIO participants (Stoltzfus, Vos, Cranston, Lapp) in July 2010 and subsequently selected for funding, but had since to be canceled due to conflicting obligations of the PIs. Instead, Vos and Piel co-taught a [http://academy.nescent.org/wiki/Computational_phyloinformatics Computational Phyloinformatics course] modeled after previous instances of this course, but supplemented with programming exercises using PhyloWS wrappers for ToLWeb and uBio. The course was held in Kyoto in August 2011, and supported by the Bioinformatics Center of Kyoto University and DBCLS/JST, with co-sponsorship by NESCent.&lt;br /&gt;
&lt;br /&gt;
* EvoIO participants in various ways participated in the [http://ievobio.org iEvoBio satellite conference] at Evolution 2010 in Portland, OR:&lt;br /&gt;
** Rutger Vos gave a talk on &amp;quot;TreeBase2: Rise of the machines&amp;quot; (Rutger Vos, Hilmar Lapp, Bill Piel, Val Tannen)&lt;br /&gt;
** Brandon Chisham presented &amp;quot;CDAO-Store: A New Vision for Data Integration&amp;quot; (Brandon Chisham, Trung Le, Enrico Pontelli, Tran Son, Ben Wright)&lt;br /&gt;
** Arlin Stoltzfus presented &amp;quot;EvoIO: Community-driven standards for sustainable interoperability&amp;quot; (Arlin Stoltzfus, Nico Cellinese, Karen Cranston, Hilmar Lapp, Sheldon McKay, Enrico Pontelli, Rutger Vos)&lt;br /&gt;
&lt;br /&gt;
* The EvoIO group staged a successful [[VoCamp1|Phyloinformatics VoCamp]] November 7-11, 2009 in Montpellier, France, co-located with the annual meeting of the &amp;lt;span class=&amp;quot;plainlinks&amp;quot;&amp;gt;[http://www.tdwg.org/ International Biodiversity Information Standards Organization (TDWG)]&amp;lt;/span&amp;gt;.   A VoCamp is a hands-on meeting for investigators to create and develop ontologies and lightweight vocabularies in support of data integration and re-use-- in this case, the integration and re-use of phylogenetic trees and associated data and metadata.  More information at [[VoCamp1]].&lt;br /&gt;
&lt;br /&gt;
* We submitted a proposal to the NSF INTEROP panel in 2009 for &amp;quot;A network for enabling community-driven standards to link evolution to the global web of data (EvoIO)&amp;quot;. More information at [[NSF INTEROP 2009|NSF INTEROP 2009]]. This proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
== How to get involved ==&lt;br /&gt;
&lt;br /&gt;
How to get involved?  Each stack component has an open community of developers and a mailing list.  Click on the links below to find out more.  You can sign up for the mailing list and start to contribute.  &lt;br /&gt;
* [http://nexml.org NeXML]'s mailing list is nexml-discuss@lists.sourceforge.net (contact Rutger Vos, &amp;lt;firstname&amp;gt;aldo@gmail.com)&lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS]'s group is phylows@googlegroups.com (contact Hilmar Lapp, &amp;lt;firstinitial&amp;gt;lapp@nescent.org) &lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO]'s mailing list is cdao-discuss@lists.sourceforge.net (contact Arlin Stoltzfus, &amp;lt;firstname&amp;gt;@umd.edu)&lt;br /&gt;
* [http://pipes.yahoo.com/pipes/pipe.run?_id=b269b02171b6411c4a27ae0714dc76da&amp;amp;_render=rss RSS feed of relevant projects]&lt;br /&gt;
Also EvoIO folks are looking for interoperability “targets”, which can be either single projects that want to make resources more interoperable, or pairs of resources that want to integrate.  Contact any of us if you have an idea.&lt;br /&gt;
&lt;br /&gt;
== MediaWiki Help if you need it ==&lt;br /&gt;
&lt;br /&gt;
* [http://meta.wikipedia.org/wiki/MediaWiki_User%27s_Guide User's Guide]&lt;br /&gt;
* [http://www.mediawiki.org/wiki/Help:FAQ MediaWiki FAQ]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=169</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=169"/>
		<updated>2015-05-03T17:47:01Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* What's happening */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Who we are ==&lt;br /&gt;
&lt;br /&gt;
The EvoIO collaboration emerged in 2009 from several NESCent-sponsored activities that focused on software and data interoperability for evolutionary analysis, including the Evolutionary Informatics working group (2006-2009), and the Evolutionary Database Interoperability hackathon (2009).  EvoIO aims to be a nucleating center for developing, applying and  disseminating interoperability technology that connects and coordinates between stakeholders, developers, and standards bodies. &lt;br /&gt;
&lt;br /&gt;
Members of the EvoIO group, which include biologists and computer scientists, have over the past 3  years harnessed a variety of collaborative events to successfully build an initial stack of interoperability  technologies that is owned by the community and open to participation: &lt;br /&gt;
* [http://nexml.org NeXML], a NEXUS-inspired XML format that is validatable yet extensible&lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO], an ontology of comparative data analysis formalizing the semantics of evolutionary data and metadata; and &lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS], a web-services interface standard for querying, retrieving, and referencing phylogenetic data on the web. &lt;br /&gt;
&lt;br /&gt;
For more information about how to get involved, read on.  &lt;br /&gt;
&lt;br /&gt;
== What's happening ==&lt;br /&gt;
* A grant proposal to NSF's Advances in Biological Informatics (ABI) program titled &amp;quot;An Ontology-Based System for Querying Life in a Post-Taxonomic Age&amp;quot; (PIs: Cellinese and Lapp) has now been funded. The proposal and public abstract] are available on Figshare ([http://dx.doi.org/10.6084/m9.figshare.1401984 doi:10.6084/m9.figshare.1401984].&lt;br /&gt;
&lt;br /&gt;
* Many of us are PIs or Leadership Team members of a NESCent working group called [http://www.evoio.org/wiki/HIP Hackathons, Interoperability, Phylogenies (HIP)].   HIP is the current locus of most of our activity .  So far we have had two '''[[Phylotastic]]''' hackathons focused on a web-services implementation of the pruning, grafting, name-reconciliation and other functionalities necessary for researchers to take advantage of emerging megatrees.  HIP is planning a third hackathon to develop a socially networked phyloinformatics community of practice.  &lt;br /&gt;
&lt;br /&gt;
* We organized a workshop on &amp;quot;[http://docs.google.com/document/pub?id=1DiuMEK7g_DJSLzT9ElAwEtJiKRXuQAvoc0JtfoX0sqE Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard]&amp;quot; at the 2011 TDWG Conference in New Orleans, LS. The workshop took place on Monday, Oct 17, and Tuesday, Oct 18. The goal of the workshop was to determine how a future reporting standard for phylogenetic analyses can best serve biodiversity science and related research applications. &lt;br /&gt;
&lt;br /&gt;
* We submitted an [[ABI 2011 proposal|NSF ABI proposal]] on development and (finally!) linking TreeBASE and ToLWeb, entitled &amp;quot;Towards a comprehensive, community- owned and sustainable repository of reusable phylogenetic knowledge&amp;quot; (Lapp, Bietz, Cranston, Piel, Donoghue, Westneat). The proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
* At [http://ievobio.org/ iEvoBio] 2011 in Norman, OK, Arlin Stoltzfus gave a talk on &amp;quot;Publishing re-usable phylogenetic trees, in theory and practice&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* In December 2010, EvoIO participants submitted a proposal to NESCent for a working group entitled [[HIP| HIP: Hackathons, Interoperability, Phylogenies]] (Vos, Stoltzfus).  The proposal is for a series of hackathons partly supported by outside partners.  NESCent recently announced that this proposal will be funded.  Woo hoo!&lt;br /&gt;
&lt;br /&gt;
* A proposal for a new NESCent course titled &amp;quot;Phyloinformatics: linking with online data and services&amp;quot; was submitted by EvoIO participants (Stoltzfus, Vos, Cranston, Lapp) in July 2010 and subsequently selected for funding, but had since to be canceled due to conflicting obligations of the PIs. Instead, Vos and Piel co-taught a [http://academy.nescent.org/wiki/Computational_phyloinformatics Computational Phyloinformatics course] modeled after previous instances of this course, but supplemented with programming exercises using PhyloWS wrappers for ToLWeb and uBio. The course was held in Kyoto in August 2011, and supported by the Bioinformatics Center of Kyoto University and DBCLS/JST, with co-sponsorship by NESCent.&lt;br /&gt;
&lt;br /&gt;
* EvoIO participants in various ways participated in the [http://ievobio.org iEvoBio satellite conference] at Evolution 2010 in Portland, OR:&lt;br /&gt;
** Rutger Vos gave a talk on &amp;quot;TreeBase2: Rise of the machines&amp;quot; (Rutger Vos, Hilmar Lapp, Bill Piel, Val Tannen)&lt;br /&gt;
** Brandon Chisham presented &amp;quot;CDAO-Store: A New Vision for Data Integration&amp;quot; (Brandon Chisham, Trung Le, Enrico Pontelli, Tran Son, Ben Wright)&lt;br /&gt;
** Arlin Stoltzfus presented &amp;quot;EvoIO: Community-driven standards for sustainable interoperability&amp;quot; (Arlin Stoltzfus, Nico Cellinese, Karen Cranston, Hilmar Lapp, Sheldon McKay, Enrico Pontelli, Rutger Vos)&lt;br /&gt;
&lt;br /&gt;
* The EvoIO group staged a successful [[VoCamp1|Phyloinformatics VoCamp]] November 7-11, 2009 in Montpellier, France, co-located with the annual meeting of the &amp;lt;span class=&amp;quot;plainlinks&amp;quot;&amp;gt;[http://www.tdwg.org/ International Biodiversity Information Standards Organization (TDWG)]&amp;lt;/span&amp;gt;.   A VoCamp is a hands-on meeting for investigators to create and develop ontologies and lightweight vocabularies in support of data integration and re-use-- in this case, the integration and re-use of phylogenetic trees and associated data and metadata.  More information at [[VoCamp1]].&lt;br /&gt;
&lt;br /&gt;
* We submitted a proposal to the NSF INTEROP panel in 2009 for &amp;quot;A network for enabling community-driven standards to link evolution to the global web of data (EvoIO)&amp;quot;. More information at [[NSF INTEROP 2009|NSF INTEROP 2009]]. This proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
== How to get involved ==&lt;br /&gt;
&lt;br /&gt;
How to get involved?  Each stack component has an open community of developers and a mailing list.  Click on the links below to find out more.  You can sign up for the mailing list and start to contribute.  &lt;br /&gt;
* [http://nexml.org NeXML]'s mailing list is nexml-discuss@lists.sourceforge.net (contact Rutger Vos, &amp;lt;firstname&amp;gt;aldo@gmail.com)&lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS]'s group is phylows@googlegroups.com (contact Hilmar Lapp, &amp;lt;firstinitial&amp;gt;lapp@nescent.org) &lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO]'s mailing list is cdao-discuss@lists.sourceforge.net (contact Arlin Stoltzfus, &amp;lt;firstname&amp;gt;@umd.edu)&lt;br /&gt;
* [http://pipes.yahoo.com/pipes/pipe.run?_id=b269b02171b6411c4a27ae0714dc76da&amp;amp;_render=rss RSS feed of relevant projects]&lt;br /&gt;
Also EvoIO folks are looking for interoperability “targets”, which can be either single projects that want to make resources more interoperable, or pairs of resources that want to integrate.  Contact any of us if you have an idea.&lt;br /&gt;
&lt;br /&gt;
== MediaWiki Help if you need it ==&lt;br /&gt;
&lt;br /&gt;
* [http://meta.wikipedia.org/wiki/MediaWiki_User%27s_Guide User's Guide]&lt;br /&gt;
* [http://www.mediawiki.org/wiki/Help:FAQ MediaWiki FAQ]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=2101</id>
		<title>User:Hilmar</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=2101"/>
		<updated>2015-04-20T17:02:27Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hilmar moved page User:Hlapp to User:Hilmar: Automatically moved page while merging the user &amp;quot;Hlapp&amp;quot; to &amp;quot;Hilmar&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;embedurl&amp;gt;http://www.mendeley.com/profiles/hilmar-lapp/widget/3263/2610980486/3f90794c14759c7403ac053a5954c88c5e2e55b9/&amp;lt;/embedurl&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=2100</id>
		<title>User:Hilmar</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=User:Hilmar&amp;diff=2100"/>
		<updated>2014-04-21T13:07:32Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hlapp moved page User:Hilmar to User:Hlapp&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&amp;lt;embedurl&amp;gt;http://www.mendeley.com/profiles/hilmar-lapp/widget/3263/2610980486/3f90794c14759c7403ac053a5954c88c5e2e55b9/&amp;lt;/embedurl&amp;gt;&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=168</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Main_Page&amp;diff=168"/>
		<updated>2013-10-09T13:19:04Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* What's happening */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Who we are ==&lt;br /&gt;
&lt;br /&gt;
The EvoIO collaboration emerged in 2009 from several NESCent-sponsored activities that focused on software and data interoperability for evolutionary analysis, including the Evolutionary Informatics working group (2006-2009), and the Evolutionary Database Interoperability hackathon (2009).  EvoIO aims to be a nucleating center for developing, applying and  disseminating interoperability technology that connects and coordinates between stakeholders, developers, and standards bodies. &lt;br /&gt;
&lt;br /&gt;
Members of the EvoIO group, which include biologists and computer scientists, have over the past 3  years harnessed a variety of collaborative events to successfully build an initial stack of interoperability  technologies that is owned by the community and open to participation: &lt;br /&gt;
* [http://nexml.org NeXML], a NEXUS-inspired XML format that is validatable yet extensible&lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO], an ontology of comparative data analysis formalizing the semantics of evolutionary data and metadata; and &lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS], a web-services interface standard for querying, retrieving, and referencing phylogenetic data on the web. &lt;br /&gt;
&lt;br /&gt;
For more information about how to get involved, read on.  &lt;br /&gt;
&lt;br /&gt;
== What's happening ==&lt;br /&gt;
* Many of us are PIs or Leadership Team members of a NESCent working group called [http://www.evoio.org/wiki/HIP Hackathons, Interoperability, Phylogenies (HIP)].   HIP is the current locus of most of our activity .  So far we have had two '''[[Phylotastic]]''' hackathons focused on a web-services implementation of the pruning, grafting, name-reconciliation and other functionalities necessary for researchers to take advantage of emerging megatrees.  HIP is planning a third hackathon to develop a socially networked phyloinformatics community of practice.  &lt;br /&gt;
&lt;br /&gt;
* We organized a workshop on &amp;quot;[http://docs.google.com/document/pub?id=1DiuMEK7g_DJSLzT9ElAwEtJiKRXuQAvoc0JtfoX0sqE Steps towards a Minimum Information About a Phylogenetic Analysis (MIAPA) Standard]&amp;quot; at the 2011 TDWG Conference in New Orleans, LS. The workshop took place on Monday, Oct 17, and Tuesday, Oct 18. The goal of the workshop was to determine how a future reporting standard for phylogenetic analyses can best serve biodiversity science and related research applications. &lt;br /&gt;
&lt;br /&gt;
* We submitted an [[ABI 2011 proposal|NSF ABI proposal]] on development and (finally!) linking TreeBASE and ToLWeb, entitled &amp;quot;Towards a comprehensive, community- owned and sustainable repository of reusable phylogenetic knowledge&amp;quot; (Lapp, Bietz, Cranston, Piel, Donoghue, Westneat). The proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
* At [http://ievobio.org/ iEvoBio] 2011 in Norman, OK, Arlin Stoltzfus gave a talk on &amp;quot;Publishing re-usable phylogenetic trees, in theory and practice&amp;quot;&lt;br /&gt;
&lt;br /&gt;
* In December 2010, EvoIO participants submitted a proposal to NESCent for a working group entitled [[HIP| HIP: Hackathons, Interoperability, Phylogenies]] (Vos, Stoltzfus).  The proposal is for a series of hackathons partly supported by outside partners.  NESCent recently announced that this proposal will be funded.  Woo hoo!&lt;br /&gt;
&lt;br /&gt;
* A proposal for a new NESCent course titled &amp;quot;Phyloinformatics: linking with online data and services&amp;quot; was submitted by EvoIO participants (Stoltzfus, Vos, Cranston, Lapp) in July 2010 and subsequently selected for funding, but had since to be canceled due to conflicting obligations of the PIs. Instead, Vos and Piel co-taught a [http://academy.nescent.org/wiki/Computational_phyloinformatics Computational Phyloinformatics course] modeled after previous instances of this course, but supplemented with programming exercises using PhyloWS wrappers for ToLWeb and uBio. The course was held in Kyoto in August 2011, and supported by the Bioinformatics Center of Kyoto University and DBCLS/JST, with co-sponsorship by NESCent.&lt;br /&gt;
&lt;br /&gt;
* EvoIO participants in various ways participated in the [http://ievobio.org iEvoBio satellite conference] at Evolution 2010 in Portland, OR:&lt;br /&gt;
** Rutger Vos gave a talk on &amp;quot;TreeBase2: Rise of the machines&amp;quot; (Rutger Vos, Hilmar Lapp, Bill Piel, Val Tannen)&lt;br /&gt;
** Brandon Chisham presented &amp;quot;CDAO-Store: A New Vision for Data Integration&amp;quot; (Brandon Chisham, Trung Le, Enrico Pontelli, Tran Son, Ben Wright)&lt;br /&gt;
** Arlin Stoltzfus presented &amp;quot;EvoIO: Community-driven standards for sustainable interoperability&amp;quot; (Arlin Stoltzfus, Nico Cellinese, Karen Cranston, Hilmar Lapp, Sheldon McKay, Enrico Pontelli, Rutger Vos)&lt;br /&gt;
&lt;br /&gt;
* The EvoIO group staged a successful [[VoCamp1|Phyloinformatics VoCamp]] November 7-11, 2009 in Montpellier, France, co-located with the annual meeting of the &amp;lt;span class=&amp;quot;plainlinks&amp;quot;&amp;gt;[http://www.tdwg.org/ International Biodiversity Information Standards Organization (TDWG)]&amp;lt;/span&amp;gt;.   A VoCamp is a hands-on meeting for investigators to create and develop ontologies and lightweight vocabularies in support of data integration and re-use-- in this case, the integration and re-use of phylogenetic trees and associated data and metadata.  More information at [[VoCamp1]].&lt;br /&gt;
&lt;br /&gt;
* We submitted a proposal to the NSF INTEROP panel in 2009 for &amp;quot;A network for enabling community-driven standards to link evolution to the global web of data (EvoIO)&amp;quot;. More information at [[NSF INTEROP 2009|NSF INTEROP 2009]]. This proposal was not funded.&lt;br /&gt;
&lt;br /&gt;
== How to get involved ==&lt;br /&gt;
&lt;br /&gt;
How to get involved?  Each stack component has an open community of developers and a mailing list.  Click on the links below to find out more.  You can sign up for the mailing list and start to contribute.  &lt;br /&gt;
* [http://nexml.org NeXML]'s mailing list is nexml-discuss@lists.sourceforge.net (contact Rutger Vos, &amp;lt;firstname&amp;gt;aldo@gmail.com)&lt;br /&gt;
* [http://evoinfo.nescent.org/PhyloWS PhyloWS]'s group is phylows@googlegroups.com (contact Hilmar Lapp, &amp;lt;firstinitial&amp;gt;lapp@nescent.org) &lt;br /&gt;
* [http://www.evolutionaryontology.org CDAO]'s mailing list is cdao-discuss@lists.sourceforge.net (contact Arlin Stoltzfus, &amp;lt;firstname&amp;gt;@umd.edu)&lt;br /&gt;
* [http://pipes.yahoo.com/pipes/pipe.run?_id=b269b02171b6411c4a27ae0714dc76da&amp;amp;_render=rss RSS feed of relevant projects]&lt;br /&gt;
Also EvoIO folks are looking for interoperability “targets”, which can be either single projects that want to make resources more interoperable, or pairs of resources that want to integrate.  Contact any of us if you have an idea.&lt;br /&gt;
&lt;br /&gt;
== MediaWiki Help if you need it ==&lt;br /&gt;
&lt;br /&gt;
* [http://meta.wikipedia.org/wiki/MediaWiki_User%27s_Guide User's Guide]&lt;br /&gt;
* [http://www.mediawiki.org/wiki/Help:FAQ MediaWiki FAQ]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4395</id>
		<title>PhyloStars/BeforeAndAfter</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4395"/>
		<updated>2013-10-01T16:19:13Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Assessment Goals ==&lt;br /&gt;
&lt;br /&gt;
The things we would like to know, and how they ''might'' be measured.   We would need to have a baseline for each one, based on current BioStars activity in all domains. &lt;br /&gt;
&lt;br /&gt;
# Level of activity  &lt;br /&gt;
#* number of new evolution-related questions (responses, answers) in the past month&lt;br /&gt;
#* up-votes and responses to existing evolution-related questions in the past month&lt;br /&gt;
# Responsiveness and effectiveness&lt;br /&gt;
#* how many questions get an answer within a week?   a response?  &lt;br /&gt;
#* what is the distribution of waiting times for answers?  responses?&lt;br /&gt;
# Community &lt;br /&gt;
#* how many users respond to (view) evolution-related questions? &lt;br /&gt;
#* what is the average expertise of users in this community? &lt;br /&gt;
#* what is the apparent male:female ratio?  &lt;br /&gt;
#* how many ''different'' people view evolution-related content in a month?&lt;br /&gt;
&lt;br /&gt;
Caveat: the seasonality of academia-related efforts.  Prior knowledge is that there are some really strange patterns, e.g., spike of activity just before winter break.  If we do &amp;quot;before&amp;quot; in August, we really should wait until next August to do the &amp;quot;after&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
== Methods and APIs ==&lt;br /&gt;
&lt;br /&gt;
* You can get a picture of activity by clicking on the tag cloud.  Activity can be sorted by votes. For example: http://www.biostars.org/show/tag/phylogenentics/?sort=votes&amp;amp;since=all%20time&lt;br /&gt;
* Everything can be measured retroactively, and there are API's to query site history. For example: a snapshot of the site 100 days ago can be obtained with http://www.biostars.org/api/stats/100/&lt;br /&gt;
&lt;br /&gt;
==Before ==&lt;br /&gt;
&lt;br /&gt;
=== relative topics ===&lt;br /&gt;
reported number of results based on August 20, 2013 search &lt;br /&gt;
&lt;br /&gt;
phylostars &lt;br /&gt;
* phylostars 0 &lt;br /&gt;
&lt;br /&gt;
evolution &lt;br /&gt;
* evolution 760&lt;br /&gt;
&lt;br /&gt;
molecular evolution &lt;br /&gt;
* ortholog* 1790 &lt;br /&gt;
* paralog* 186&lt;br /&gt;
&lt;br /&gt;
phylogenetics - interestingly, most references to &amp;quot;distance matrix&amp;quot; and &amp;quot;mega&amp;quot; are the phylogenetics meanings&lt;br /&gt;
* phylogen* 2920&lt;br /&gt;
* Phylip 241&lt;br /&gt;
* distance matrix 221&lt;br /&gt;
* MEGA 137 &lt;br /&gt;
* MrBayes 124&lt;br /&gt;
* PAUP 70&lt;br /&gt;
* BEAST 68&lt;br /&gt;
* phylogram 36&lt;br /&gt;
* neighbor joining 28&lt;br /&gt;
* cladogram 4&lt;br /&gt;
&lt;br /&gt;
population genetics &lt;br /&gt;
* population genetics 260&lt;br /&gt;
* &amp;quot;dN/dS&amp;quot; 175&lt;br /&gt;
* tajima's D 78&lt;br /&gt;
* &amp;quot;nucleotide diversity&amp;quot; 22&lt;br /&gt;
* coalescent 19&lt;br /&gt;
* McDonald-Kreitman 1&lt;br /&gt;
* &amp;quot;site frequency spectrum&amp;quot; 1&lt;br /&gt;
&lt;br /&gt;
[[Category:PhyloStars]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/Planning_And_Meetings&amp;diff=4401</id>
		<title>PhyloStars/Planning And Meetings</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/Planning_And_Meetings&amp;diff=4401"/>
		<updated>2013-10-01T16:18:50Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== October 9, 2013 ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== September 25, 2013 ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== September 11, 2013 ==&lt;br /&gt;
&lt;br /&gt;
* importance of building moment in order to reach critical mass at (or soon after) hackathon&lt;br /&gt;
* progress: before-and-after comparison&lt;br /&gt;
* need to show folks that phylostars is the fastest way to get a quality, custom answer&lt;br /&gt;
* example of r-sig-phylo as success story&lt;br /&gt;
** mailing list created from NESCent hackathon &lt;br /&gt;
** 960 people &lt;br /&gt;
** 100-200 posts per month&lt;br /&gt;
** includes people like JF who does not use R but likes to get involved in the theory issues&lt;br /&gt;
&lt;br /&gt;
== first meeting ==&lt;br /&gt;
&lt;br /&gt;
* importance of having some high-quality tutorials&lt;br /&gt;
** 2 pages&lt;br /&gt;
** good tutorial can take a day to produce if you are familiar with the topic&lt;br /&gt;
* possibility of modifications of BioStars (Istvan)&lt;br /&gt;
* availability of history for purposes of before-after comparison&lt;br /&gt;
&lt;br /&gt;
[[Category:PhyloStars]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4379</id>
		<title>PhyloStars</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4379"/>
		<updated>2013-10-01T16:18:15Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Home page for PhyloStars project, which aims to develop a socially networked online community of practice for evolutionary applications of bioinformatics.&lt;br /&gt;
&lt;br /&gt;
== About ==&lt;br /&gt;
&lt;br /&gt;
We propose to engage the evolution community with a participant-driven meeting aimed at (1) developing and disseminating topical bioinformatics solutions that serve recognized needs of researchers, and (2) developing the capacity to meet this ongoing need in a self-sustaining way, via an online, global, socially networked community-of-practice.  We have pledges of support for this project from [http://www.smbe.org SMBE] and [http://synthesis.eol.org BioSynC]. &lt;br /&gt;
&lt;br /&gt;
The main platform for this PhyloStars community (hashtag #phylostars) will be BioStars.org, an existing bioinformatics Q &amp;amp; A site.  The developer of BioStars, Istvan Albert, has joined our organizing team.   &lt;br /&gt;
&lt;br /&gt;
The main features of the plan (as currently developed) are as follows &lt;br /&gt;
* advertise to stakeholders our plan to leverage BioStars to create an online community that shares phyloinformatics solutions. &lt;br /&gt;
* encourage stakeholders to join the community and build up status points&lt;br /&gt;
* sponsor a burst of activity at a 4-day face-to-face meeting &lt;br /&gt;
* track usage statistics (users, questions, answers) throughout the process&lt;br /&gt;
* follow up with a workshop at an international meeting&lt;br /&gt;
&lt;br /&gt;
== Current news and announcements ==&lt;br /&gt;
&lt;br /&gt;
* We have not yet scheduled the f2f meeting.  Probably February. &lt;br /&gt;
* The organizing team now includes Istvan Albert, Julie Allen, Hilmar Lapp (PI), Michael Rosenberg, Brian Sidlauskas, Arlin Stoltzfus (PI), Rutger Vos, and Mark Westneat (PI). &lt;br /&gt;
&lt;br /&gt;
== Links to Resources ==&lt;br /&gt;
&lt;br /&gt;
* relevant Q&amp;amp;A sites&lt;br /&gt;
** [http://www.biostars.org BioStars]&lt;br /&gt;
** [http://www.seqanswers.com SeqAnswers] (next-gen sequencing) has quite a few phylogeny-related questions&lt;br /&gt;
** [http://www.stackoverflow.com StackOverflow] has programming-related phylogeny questions&lt;br /&gt;
** Phylo-relevant questions (e.g., [https://www.researchgate.net/post/How_to_create_consensus_phylogenetic_tree_for_sequence_clusters?cp=re95_x_p1_521dc21b0976d&amp;amp;ch=reg&amp;amp;loginT=F_7joj5lv_3WdgU0cV0WzUg9BvuG00FN&amp;amp;pli=1 consensus tree]) sometimes appear on ResearchGate&lt;br /&gt;
* examples of content that could be worked into a mini-tutorial &lt;br /&gt;
** Peter Forey on [http://www.geol.umd.edu/~tholtz/G331/lectures/cladistics5.pdf consensus trees and support values]&lt;br /&gt;
** wikipedia [http://en.wikipedia.org/wiki/List_of_phylogenetic_tree_visualization_software list of tree viz software] &lt;br /&gt;
* [[PhyloStars/BeforeAndAfter| Before vs. After view]]&lt;br /&gt;
* [[File:PhyloStarsProposal.pdf | PhyloStars proposal to SMBE (PDF) ]]&lt;br /&gt;
* [[PhyloStars/Planning And Meetings| Planning and Meetings]]&lt;br /&gt;
&lt;br /&gt;
[[Category:PhyloStars]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4378</id>
		<title>PhyloStars</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4378"/>
		<updated>2013-10-01T16:17:51Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Links to Resources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Home page for PhyloStars project, which aims to develop a socially networked online community of practice for evolutionary applications of bioinformatics.&lt;br /&gt;
&lt;br /&gt;
== About ==&lt;br /&gt;
&lt;br /&gt;
We propose to engage the evolution community with a participant-driven meeting aimed at (1) developing and disseminating topical bioinformatics solutions that serve recognized needs of researchers, and (2) developing the capacity to meet this ongoing need in a self-sustaining way, via an online, global, socially networked community-of-practice.  We have pledges of support for this project from [http://www.smbe.org SMBE] and [http://synthesis.eol.org BioSynC]. &lt;br /&gt;
&lt;br /&gt;
The main platform for this PhyloStars community (hashtag #phylostars) will be BioStars.org, an existing bioinformatics Q &amp;amp; A site.  The developer of BioStars, Istvan Albert, has joined our organizing team.   &lt;br /&gt;
&lt;br /&gt;
The main features of the plan (as currently developed) are as follows &lt;br /&gt;
* advertise to stakeholders our plan to leverage BioStars to create an online community that shares phyloinformatics solutions. &lt;br /&gt;
* encourage stakeholders to join the community and build up status points&lt;br /&gt;
* sponsor a burst of activity at a 4-day face-to-face meeting &lt;br /&gt;
* track usage statistics (users, questions, answers) throughout the process&lt;br /&gt;
* follow up with a workshop at an international meeting&lt;br /&gt;
&lt;br /&gt;
== Current news and announcements ==&lt;br /&gt;
&lt;br /&gt;
* We have not yet scheduled the f2f meeting.  Probably February. &lt;br /&gt;
* The organizing team now includes Istvan Albert, Julie Allen, Hilmar Lapp (PI), Michael Rosenberg, Brian Sidlauskas, Arlin Stoltzfus (PI), Rutger Vos, and Mark Westneat (PI). &lt;br /&gt;
&lt;br /&gt;
== Links to Resources ==&lt;br /&gt;
&lt;br /&gt;
* relevant Q&amp;amp;A sites&lt;br /&gt;
** [http://www.biostars.org BioStars]&lt;br /&gt;
** [http://www.seqanswers.com SeqAnswers] (next-gen sequencing) has quite a few phylogeny-related questions&lt;br /&gt;
** [http://www.stackoverflow.com StackOverflow] has programming-related phylogeny questions&lt;br /&gt;
** Phylo-relevant questions (e.g., [https://www.researchgate.net/post/How_to_create_consensus_phylogenetic_tree_for_sequence_clusters?cp=re95_x_p1_521dc21b0976d&amp;amp;ch=reg&amp;amp;loginT=F_7joj5lv_3WdgU0cV0WzUg9BvuG00FN&amp;amp;pli=1 consensus tree]) sometimes appear on ResearchGate&lt;br /&gt;
* examples of content that could be worked into a mini-tutorial &lt;br /&gt;
** Peter Forey on [http://www.geol.umd.edu/~tholtz/G331/lectures/cladistics5.pdf consensus trees and support values]&lt;br /&gt;
** wikipedia [http://en.wikipedia.org/wiki/List_of_phylogenetic_tree_visualization_software list of tree viz software] &lt;br /&gt;
* [[PhyloStars/BeforeAndAfter| Before vs. After view]]&lt;br /&gt;
* [[File:PhyloStarsProposal.pdf | PhyloStars proposal to SMBE (PDF) ]]&lt;br /&gt;
* [[PhyloStars/Planning And Meetings| Planning and Meetings]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4377</id>
		<title>PhyloStars</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars&amp;diff=4377"/>
		<updated>2013-10-01T16:17:26Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Links to Resources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Home page for PhyloStars project, which aims to develop a socially networked online community of practice for evolutionary applications of bioinformatics.&lt;br /&gt;
&lt;br /&gt;
== About ==&lt;br /&gt;
&lt;br /&gt;
We propose to engage the evolution community with a participant-driven meeting aimed at (1) developing and disseminating topical bioinformatics solutions that serve recognized needs of researchers, and (2) developing the capacity to meet this ongoing need in a self-sustaining way, via an online, global, socially networked community-of-practice.  We have pledges of support for this project from [http://www.smbe.org SMBE] and [http://synthesis.eol.org BioSynC]. &lt;br /&gt;
&lt;br /&gt;
The main platform for this PhyloStars community (hashtag #phylostars) will be BioStars.org, an existing bioinformatics Q &amp;amp; A site.  The developer of BioStars, Istvan Albert, has joined our organizing team.   &lt;br /&gt;
&lt;br /&gt;
The main features of the plan (as currently developed) are as follows &lt;br /&gt;
* advertise to stakeholders our plan to leverage BioStars to create an online community that shares phyloinformatics solutions. &lt;br /&gt;
* encourage stakeholders to join the community and build up status points&lt;br /&gt;
* sponsor a burst of activity at a 4-day face-to-face meeting &lt;br /&gt;
* track usage statistics (users, questions, answers) throughout the process&lt;br /&gt;
* follow up with a workshop at an international meeting&lt;br /&gt;
&lt;br /&gt;
== Current news and announcements ==&lt;br /&gt;
&lt;br /&gt;
* We have not yet scheduled the f2f meeting.  Probably February. &lt;br /&gt;
* The organizing team now includes Istvan Albert, Julie Allen, Hilmar Lapp (PI), Michael Rosenberg, Brian Sidlauskas, Arlin Stoltzfus (PI), Rutger Vos, and Mark Westneat (PI). &lt;br /&gt;
&lt;br /&gt;
== Links to Resources ==&lt;br /&gt;
&lt;br /&gt;
* relevant Q&amp;amp;A sites&lt;br /&gt;
** [http://www.biostars.org BioStars]&lt;br /&gt;
** [http://www.seqanswers.com SeqAnswers] (next-gen sequencing) has quite a few phylogeny-related questions&lt;br /&gt;
** [http://www.stackoverflow.com StackOverflow] has programming-related phylogeny questions&lt;br /&gt;
** Phylo-relevant questions (e.g., [https://www.researchgate.net/post/How_to_create_consensus_phylogenetic_tree_for_sequence_clusters?cp=re95_x_p1_521dc21b0976d&amp;amp;ch=reg&amp;amp;loginT=F_7joj5lv_3WdgU0cV0WzUg9BvuG00FN&amp;amp;pli=1 consensus tree]) sometimes appear on ResearchGate&lt;br /&gt;
* examples of content that could be worked into a mini-tutorial &lt;br /&gt;
** Peter Forey on [http://www.geol.umd.edu/~tholtz/G331/lectures/cladistics5.pdf consensus trees and support values]&lt;br /&gt;
** wikipedia [http://en.wikipedia.org/wiki/List_of_phylogenetic_tree_visualization_software list of tree viz software] &lt;br /&gt;
* [[PhyloStars/BeforeAndAfter| Before vs. After view]]&lt;br /&gt;
* [[File:PhyloStarsProposal.pdf | PhyloStars proposal to SMBE (PDF) ]]&lt;br /&gt;
* [[PhyloStars/PlanningAndMeetings]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PlanningAndMeetings&amp;diff=4414</id>
		<title>PlanningAndMeetings</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PlanningAndMeetings&amp;diff=4414"/>
		<updated>2013-10-01T16:16:35Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hilmar moved page PlanningAndMeetings to PhyloStars/Planning And Meetings&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#REDIRECT [[PhyloStars/Planning And Meetings]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/Planning_And_Meetings&amp;diff=4400</id>
		<title>PhyloStars/Planning And Meetings</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/Planning_And_Meetings&amp;diff=4400"/>
		<updated>2013-10-01T16:16:35Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hilmar moved page PlanningAndMeetings to PhyloStars/Planning And Meetings&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== October 9, 2013 ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== September 25, 2013 ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== September 11, 2013 ==&lt;br /&gt;
&lt;br /&gt;
* importance of building moment in order to reach critical mass at (or soon after) hackathon&lt;br /&gt;
* progress: before-and-after comparison&lt;br /&gt;
* need to show folks that phylostars is the fastest way to get a quality, custom answer&lt;br /&gt;
* example of r-sig-phylo as success story&lt;br /&gt;
** mailing list created from NESCent hackathon &lt;br /&gt;
** 960 people &lt;br /&gt;
** 100-200 posts per month&lt;br /&gt;
** includes people like JF who does not use R but likes to get involved in the theory issues&lt;br /&gt;
&lt;br /&gt;
== first meeting ==&lt;br /&gt;
&lt;br /&gt;
* importance of having some high-quality tutorials&lt;br /&gt;
** 2 pages&lt;br /&gt;
** good tutorial can take a day to produce if you are familiar with the topic&lt;br /&gt;
* possibility of modifications of BioStars (Istvan)&lt;br /&gt;
* availability of history for purposes of before-after comparison&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=HIP&amp;diff=1737</id>
		<title>HIP</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=HIP&amp;diff=1737"/>
		<updated>2013-08-22T23:09:56Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Leadership Team */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Hackathons, Interoperability, Phylogenies (HIP)''', a [http://www.nescent.org NESCent] working group, envision the future as a virtual phyloinformatics bazaar in which comparative data and phylogenies are saved, shared, annotated, liked, re-used, aggregated, mashed up, and linked in. In pursuit of this vision, the working group will stage a series of hackathons (intensive participant-driven code-development meetings) that empower early-career scientists to build the links of an emerging network of interoperable evolutionary resources [1].  &lt;br /&gt;
&lt;br /&gt;
* '''Hackathons''' - A hackathon is an event in which programmers (&amp;quot;hackers&amp;quot;) meet to do collaborative programming.  Hackathons naturally attract early-career researchers, and they provide a unique experience that combines learning, achievement and teamwork.   Members of the HIP leadership team have extensive experience organizing and participating in hackathons (including 6 years of NESCent hackathons, as well as other bioinformatics hackathons).   &lt;br /&gt;
* '''Interoperability''' - ISO defines &amp;quot;interoperability&amp;quot; as &amp;quot;The capability to communicate, execute programs, or transfer data among various functional units in a manner that requires the user to have little or no knowledge of the unique characteristics of those units&amp;quot; [http://jtc1sc36.org/doc/36N0646.pdf]. An operational definition is that interoperability exists when objectives can be achieved automatically, using human labor or &amp;quot;mind work&amp;quot; only where absolutely essential (e.g., for tasks that cannot or should not be automated, like fighting fires or answering the phone; see http://www.ariadne.ac.uk/issue24/interoperability/intro.html)&lt;br /&gt;
* '''Phylogenies''' - The focus of HIP is on supporting a variety of use-cases in evolutionary analysis that depend critically on making and using phylogenies (thus it is much broader than the problem of statistical inference of phylogenetic trees).   This [http://informatics.nescent.org/wiki/UseCases list of use-cases] includes such problems as analyzing character evolution, building super-trees, making functional inferences about genes and proteins, and calibrating a molecular phylogeny with fossils.  &lt;br /&gt;
&lt;br /&gt;
== Hackathons ==&lt;br /&gt;
&lt;br /&gt;
* '''[[PhyloStars]]: Developing a Socially Networked Bioinformatics Community-of-practice for the Evolution Research Community''' : To be announced soon&lt;br /&gt;
* '''[[Phylotastic | Phylotastic 2]]''' (Jan 28 to Feb 1, 2013) took place at iPlant headquarters in Tucson, Arizona, building on the success of the first phylotastic hackathon described below&lt;br /&gt;
* '''[[Phylotastic | Phylotastic: the Tree of Life, as you need it]]''' (June 4 to 8, NESCent) was HIP's first hackathon event.  Woo hoo!  At its January meeting, the HIP leadership team decided to throw away the playbook and opt for a new (for us) style of hackathon in which all participants are focused on a single over-arching goal.  In this case, the goal is to '''build web-services infrastructures that provide the pruning, grafting, name-reconciliation, and translation operations needed for users to take advantage of the growing supply of megatrees.'''  For more information:&lt;br /&gt;
** [[Phylotastic| project wiki ]] &lt;br /&gt;
** [[PhylotasticParticipants| participants page]]&lt;br /&gt;
** slide presentation in [http://bit.ly/xGvwRr ppt] or [http://bit.ly/wcQzMI PDF] format&lt;br /&gt;
&lt;br /&gt;
== Partners ==&lt;br /&gt;
&lt;br /&gt;
* [http://nescent.org NESCent]&lt;br /&gt;
* [http://synthesis.eol.org BioSynC]&lt;br /&gt;
* [http://www.iplantcollaborative.org iPlant]&lt;br /&gt;
&lt;br /&gt;
== How to Get Involved == &lt;br /&gt;
&lt;br /&gt;
* HIP events will be advertised on the NESCent web site, the evoldir email list, and by other means&lt;br /&gt;
* if you have an idea for a hackathon target (something that a small team can do in a few days)&lt;br /&gt;
** discuss it with someone on the LT&lt;br /&gt;
** add it to the [[HIPTargetsAndIdeasList| targets &amp;amp; ideas list]]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* HIP [[HIPTargetsAndIdeasList| Targets &amp;amp; Ideas List]]&lt;br /&gt;
* Some thoughts on evolving a [[PhylotasticDataPolicy]]&lt;br /&gt;
* All pages on this wiki in [[:Category:HIP]]&lt;br /&gt;
* [[EvoIOWorkingGroupProposal | HIP working group proposal]]&lt;br /&gt;
* NESCent's [http://informatics.nescent.org/wiki/Hackathon_Whitepaper_Guidelines guidelines] for proposing a hackathon&lt;br /&gt;
* [http://pipes.yahoo.com/pipes/pipe.run?_id=b269b02171b6411c4a27ae0714dc76da&amp;amp;_render=rss RSS feed of relevant projects]&lt;br /&gt;
&lt;br /&gt;
== Leadership Team ==&lt;br /&gt;
&lt;br /&gt;
HIP is managed by a leadership team (HIP-LT) consisting of the 3 PIs, 6 other members representing different dimensions of evolutionary informatics, and ad hoc members helping with planning of specific events.  HIP-LT manages all aspects of the project-- advertising events, managing information resources, staging hackathons, and so on.  The group meets electronically to conduct its business, and keeps a [[HIP_LT_Meetings|record of its meetings]].  &lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;5&amp;quot; align=&amp;quot;left&amp;quot;&lt;br /&gt;
! Name&lt;br /&gt;
! width=&amp;quot;25%&amp;quot; | Affiliation&lt;br /&gt;
! Image&lt;br /&gt;
!&lt;br /&gt;
! Name&lt;br /&gt;
! width=&amp;quot;25%&amp;quot; |  Affiliation&lt;br /&gt;
! Image&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.ibbr.umd.edu/profiles/arlin-stoltzfus Arlin Stoltzfus] &lt;br /&gt;
| Fellow, IBBR (U Md) and Research Biologist, NIST &lt;br /&gt;
| [[Image:Arlin_head.jpg|Arlin Stoltzfus]]&lt;br /&gt;
|&lt;br /&gt;
| [[User:RutgerVos|Rutger Vos]]&lt;br /&gt;
| Marie Curie Research Fellow, University of Reading, UK&lt;br /&gt;
| [[Image:Rutger_Vos.jpg|Rutger Vos]]&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.cs.nmsu.edu/~epontell Enrico Pontelli]&lt;br /&gt;
| Professor and Head, Department of Computer Science, New Mexico State University&lt;br /&gt;
| [[File:Enrico.jpg|100px|Enrico Pontelli]]&lt;br /&gt;
|&lt;br /&gt;
| [http://www.mendeley.com/profiles/karen-cranston/ Karen Cranston]&lt;br /&gt;
| National Evolutionary Synthesis Center (NESCent) &lt;br /&gt;
| [[File:Karen.jpg|100px|Karen Cranston]]&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.matasci.info Naim Matasci]&lt;br /&gt;
| Scientific Lead,  The iPlant Collaborative and Adjunct Assistant Professor, Department of Ecology and Evolutionary Biology, University of Arizona, Tucson&lt;br /&gt;
| [[Image:Naim_matasci_face.jpg|100px|Naim Matasci]]&lt;br /&gt;
|&lt;br /&gt;
| [http://www.brianomeara.info Brian O'Meara]&lt;br /&gt;
| Assistant Professor, Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville&lt;br /&gt;
| [[Image:Brian_o%27meara_face.jpg|100px|Brian O'Meara]]&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.rosenberglab.net/ Michael S Rosenberg] &lt;br /&gt;
| Associate Professor, School of Life Sciences &amp;amp;amp; Center for Evolutionary Medicine&amp;lt;br&amp;gt;and Informatics, The Biodesign Institute, Arizona State University&lt;br /&gt;
| [[File:MSRhead.jpg|100px|Michael S Rosenberg]] &lt;br /&gt;
|&lt;br /&gt;
| [http://people.oregonstate.edu/~sidlausb/ Brian Sidlauskas] &lt;br /&gt;
| Assistant Professor and Curator of Fishes, Oregon State University, Department of Fisheries and Wildlife&lt;br /&gt;
| [[Image:BrianHeadshot100.jpg]]&lt;br /&gt;
|-&lt;br /&gt;
| [http://www Mark Westneat] &lt;br /&gt;
| Curator and Chair of Zoology, Director, Biodiversity Synthesis Center, Field Museum of Natural History, Chicago&lt;br /&gt;
| [[Image:MarkStaffPhoto3.jpg|100px|Mark Westneat]]&lt;br /&gt;
|&lt;br /&gt;
| [[User:Hilmar|Hilmar Lapp]] &lt;br /&gt;
| National Evolutionary Synthesis Center (NESCent) &lt;br /&gt;
| [[File:Hilmar.jpg|100px|Hilmar Lapp]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Past members of the Leadership Team&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;5&amp;quot; align=&amp;quot;left&amp;quot;&lt;br /&gt;
! Name&lt;br /&gt;
! width=&amp;quot;25%&amp;quot; | Affiliation&lt;br /&gt;
! Image&lt;br /&gt;
!&lt;br /&gt;
! Name&lt;br /&gt;
! width=&amp;quot;25%&amp;quot; | Affiliation&lt;br /&gt;
! Image&lt;br /&gt;
|-&lt;br /&gt;
| [http://www.datam0nk3y.org/hyphy/doku.php?id=sergei_l_kosakovsky_pond Sergei Kosakovsky Pond] &lt;br /&gt;
| Assistant Adjunct Professor, Department of Medicine, University of California, San Diego&lt;br /&gt;
| [[Image:Sergei.jpg|Sergei Kosakovsky Pond]]&lt;br /&gt;
|&lt;br /&gt;
| [http://www.medgen.ubc.ca/research/research_faculty/mwilkinson.htm Mark D Wilkinson] &lt;br /&gt;
| Associate Professor, Institute for Heart + Lung Health, University of British Columbia, Canada&lt;br /&gt;
| [[Image:Mark_Wilkinson.jpg|Mark Wilkinson]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:HIP]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4392</id>
		<title>PhyloStars/BeforeAndAfter</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4392"/>
		<updated>2013-08-20T23:49:49Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Assessment Goals */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Assessment Goals ==&lt;br /&gt;
&lt;br /&gt;
The things we would like to know, and how they ''might'' be measured.   We would need to have a baseline for each one, based on current BioStars activity in all domains. &lt;br /&gt;
&lt;br /&gt;
# Level of activity  &lt;br /&gt;
#* number of new evolution-related questions (responses, answers) in the past month&lt;br /&gt;
#* up-votes and responses to existing evolution-related questions in the past month&lt;br /&gt;
# Responsiveness and effectiveness&lt;br /&gt;
#* how many questions get an answer within a week?   a response?  &lt;br /&gt;
#* what is the distribution of waiting times for answers?  responses?&lt;br /&gt;
# Community &lt;br /&gt;
#* how many users respond to (view) evolution-related questions? &lt;br /&gt;
#* what is the average expertise of users in this community? &lt;br /&gt;
#* what is the apparent male:female ratio?  &lt;br /&gt;
#* how many ''different'' people view evolution-related content in a month?&lt;br /&gt;
&lt;br /&gt;
Caveat: the seasonality of academia-related efforts.  Prior knowledge is that there are some really strange patterns, e.g., spike of activity just before winter break.  If we do &amp;quot;before&amp;quot; in August, we really should wait until next August to do the &amp;quot;after&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
== Methods and APIs ==&lt;br /&gt;
&lt;br /&gt;
* You can get a picture of activity by clicking on the tag cloud.  Activity can be sorted by votes. For example: http://www.biostars.org/show/tag/phylogenentics/?sort=votes&amp;amp;since=all%20time&lt;br /&gt;
* Everything can be measured retroactively, and there are API's to query site history. For example: a snapshot of the site 100 days ago can be obtained with http://www.biostars.org/api/stats/100/&lt;br /&gt;
&lt;br /&gt;
==Before ==&lt;br /&gt;
&lt;br /&gt;
=== relative topics ===&lt;br /&gt;
reported number of results based on August 20, 2013 search &lt;br /&gt;
&lt;br /&gt;
phylostars &lt;br /&gt;
* phylostars 0 &lt;br /&gt;
&lt;br /&gt;
evolution &lt;br /&gt;
* evolution 760&lt;br /&gt;
&lt;br /&gt;
molecular evolution &lt;br /&gt;
* ortholog* 1790 &lt;br /&gt;
* paralog* 186&lt;br /&gt;
&lt;br /&gt;
phylogenetics - interestingly, most references to &amp;quot;distance matrix&amp;quot; and &amp;quot;mega&amp;quot; are the phylogenetics meanings&lt;br /&gt;
* phylogen* 2920&lt;br /&gt;
* Phylip 241&lt;br /&gt;
* distance matrix 221&lt;br /&gt;
* MEGA 137 &lt;br /&gt;
* MrBayes 124&lt;br /&gt;
* PAUP 70&lt;br /&gt;
* phylogram 36&lt;br /&gt;
* cladogram 4&lt;br /&gt;
&lt;br /&gt;
population genetics &lt;br /&gt;
* population genetics 260&lt;br /&gt;
* coalescent 19&lt;br /&gt;
* McDonald-Kreitman 1&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4391</id>
		<title>PhyloStars/BeforeAndAfter</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4391"/>
		<updated>2013-08-20T23:46:00Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Assessment Goals */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Assessment Goals ==&lt;br /&gt;
&lt;br /&gt;
The things we would like to know, and how they ''might'' be measured.   We would need to have a baseline for each one, based on current BioStars activity in all domains. &lt;br /&gt;
&lt;br /&gt;
# Level of activity  &lt;br /&gt;
#* number of new evolution-related questions (responses, answers) in the past month&lt;br /&gt;
#* up-votes and responses to existing evolution-related questions in the past month&lt;br /&gt;
# Responsiveness and effectiveness&lt;br /&gt;
#* how many questions get an answer within a week?   a response?  &lt;br /&gt;
#* what is the distribution of waiting times for answers?  responses?&lt;br /&gt;
# Community &lt;br /&gt;
#* how many users respond to (view) evolution-related questions? &lt;br /&gt;
#* what is the average expertise of users in this community? &lt;br /&gt;
#* what is the apparent male:female ratio?  &lt;br /&gt;
#* how many ''different'' people view evolution-related content in a month?&lt;br /&gt;
&lt;br /&gt;
Caveat: the seasonality of academia-related efforts.  Prior knowledge is that there are some really strange patterns, e.g., spike of activity just before winter break.  If we do &amp;quot;before&amp;quot; in August, we really should wait until next August to do the &amp;quot;after&amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Before ==&lt;br /&gt;
&lt;br /&gt;
=== relative topics ===&lt;br /&gt;
reported number of results based on August 20, 2013 search &lt;br /&gt;
&lt;br /&gt;
phylostars &lt;br /&gt;
* phylostars 0 &lt;br /&gt;
&lt;br /&gt;
evolution &lt;br /&gt;
* evolution 760&lt;br /&gt;
&lt;br /&gt;
molecular evolution &lt;br /&gt;
* ortholog* 1790 &lt;br /&gt;
* paralog* 186&lt;br /&gt;
&lt;br /&gt;
phylogenetics - interestingly, most references to &amp;quot;distance matrix&amp;quot; and &amp;quot;mega&amp;quot; are the phylogenetics meanings&lt;br /&gt;
* phylogen* 2920&lt;br /&gt;
* Phylip 241&lt;br /&gt;
* distance matrix 221&lt;br /&gt;
* MEGA 137 &lt;br /&gt;
* MrBayes 124&lt;br /&gt;
* PAUP 70&lt;br /&gt;
* phylogram 36&lt;br /&gt;
* cladogram 4&lt;br /&gt;
&lt;br /&gt;
population genetics &lt;br /&gt;
* population genetics 260&lt;br /&gt;
* coalescent 19&lt;br /&gt;
* McDonald-Kreitman 1&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4390</id>
		<title>PhyloStars/BeforeAndAfter</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=PhyloStars/BeforeAndAfter&amp;diff=4390"/>
		<updated>2013-08-20T23:45:01Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Assessment Goals ==&lt;br /&gt;
&lt;br /&gt;
The things we would like to know, and how they ''might'' be measured.   We would need to have a baseline for each one, based on current BioStars activity in all domains. &lt;br /&gt;
&lt;br /&gt;
# Level of activity  &lt;br /&gt;
#* number of new evolution-related questions (responses, answers) in the past month&lt;br /&gt;
#* up-votes and responses to existing evolution-related questions in the past month&lt;br /&gt;
# Responsiveness and effectiveness&lt;br /&gt;
#* how many questions get an answer within a week?   a response?  &lt;br /&gt;
#* what is the distribution of waiting times for answers?  responses?&lt;br /&gt;
# Community &lt;br /&gt;
#* how many users respond to (view) evolution-related questions? &lt;br /&gt;
#* what is the average expertise of users in this community? &lt;br /&gt;
#* what is the apparent male:female ratio?  &lt;br /&gt;
#* how many ''different'' people view evolution-related content in a month?   &lt;br /&gt;
&lt;br /&gt;
==Before ==&lt;br /&gt;
&lt;br /&gt;
=== relative topics ===&lt;br /&gt;
reported number of results based on August 20, 2013 search &lt;br /&gt;
&lt;br /&gt;
phylostars &lt;br /&gt;
* phylostars 0 &lt;br /&gt;
&lt;br /&gt;
evolution &lt;br /&gt;
* evolution 760&lt;br /&gt;
&lt;br /&gt;
molecular evolution &lt;br /&gt;
* ortholog* 1790 &lt;br /&gt;
* paralog* 186&lt;br /&gt;
&lt;br /&gt;
phylogenetics - interestingly, most references to &amp;quot;distance matrix&amp;quot; and &amp;quot;mega&amp;quot; are the phylogenetics meanings&lt;br /&gt;
* phylogen* 2920&lt;br /&gt;
* Phylip 241&lt;br /&gt;
* distance matrix 221&lt;br /&gt;
* MEGA 137 &lt;br /&gt;
* MrBayes 124&lt;br /&gt;
* PAUP 70&lt;br /&gt;
* phylogram 36&lt;br /&gt;
* cladogram 4&lt;br /&gt;
&lt;br /&gt;
population genetics &lt;br /&gt;
* population genetics 260&lt;br /&gt;
* coalescent 19&lt;br /&gt;
* McDonald-Kreitman 1&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=AdvancingMIAPA&amp;diff=4277</id>
		<title>AdvancingMIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=AdvancingMIAPA&amp;diff=4277"/>
		<updated>2013-02-03T00:32:46Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hilmar moved page AdvancingMIAPA to Advancing MIAPA&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;#REDIRECT [[Advancing MIAPA]]&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4246</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4246"/>
		<updated>2013-02-03T00:32:46Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: Hilmar moved page AdvancingMIAPA to Advancing MIAPA&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Factors shaping our conception of source-tree annotations ==&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
== Suggestions for MIAPA ontology ==&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
== Lessons learned from tree-finding and annotation ==&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
** [http://www.w3.org/TR/prov-o/ W3C Provenance Ontology]&lt;br /&gt;
** [http://code.google.com/p/information-artifact-ontology/ Information Artefact Ontology]&lt;br /&gt;
** [http://theswo.sourceforge.net/ Software Ontology] (includes the [http://edamontology.org/ EDAM ontologies])&lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]. (We did not use this.)&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4245</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4245"/>
		<updated>2013-02-03T00:31:16Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Lessons learned from tree-finding and annotation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Factors shaping our conception of source-tree annotations ==&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
== Suggestions for MIAPA ontology ==&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
== Lessons learned from tree-finding and annotation ==&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
** [http://www.w3.org/TR/prov-o/ W3C Provenance Ontology]&lt;br /&gt;
** [http://code.google.com/p/information-artifact-ontology/ Information Artefact Ontology]&lt;br /&gt;
** [http://theswo.sourceforge.net/ Software Ontology] (includes the [http://edamontology.org/ EDAM ontologies])&lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]. (We did not use this.)&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3922</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3922"/>
		<updated>2013-02-03T00:29:52Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* additional suggestions for MIAPA ontology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
* Overview: [[Media:FinalPres.pptx| Enrico's final presentation]]&lt;br /&gt;
** Also see [[:Image:PetersGraphic.jpg|example annotation graph for the Peters et al. tree and alignment]]. &lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the workflow and querying &lt;br /&gt;
* [[AdvancingMIAPA]] - report on MIAPA-relevant findings and developments&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist Reconciled_draft_checklist checklist from TDWG 2011]&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
We (1) identified a set of 10 large trees useful as phylotastic source trees (2) created free-text annotations (metadata) for citations, sources and methods, (3) encoded the data and metadata as RDF using CDAO and a new ontology, (4) loaded the encoded information into a triplestore, and (5) demonstrated logical querying based on data and metadata attributes.  During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, working out technical bugs in the workflow, and addressing emerging challenges.  The tangible outcomes of this exercise include &lt;br /&gt;
* a set of 10 source trees (720 to 250,000 species) with low-res metadata (see [[AnnotatedPhylotasticSourceTrees]])&lt;br /&gt;
* a demonstration of semantic annotation, data-basing, and querying (see [[TreestoreMetadataQueryDemonstration]])&lt;br /&gt;
** a workflow plan for encoding tree data and metadata and loading them into a treestore&lt;br /&gt;
** a treestore instance populated with some of these data and metadata&lt;br /&gt;
* Advances in Minimum Information About a Phylogenetic Analysis (see [[AdvancingMIAPA]])&lt;br /&gt;
** a new &amp;quot;MIAPA&amp;quot; ontology  that leverages several existing ontologies&lt;br /&gt;
** recommendations on the draft checklist, and input form&lt;br /&gt;
* optionally&lt;br /&gt;
** a screencast&lt;br /&gt;
** a draft GSOC proposal&lt;br /&gt;
&lt;br /&gt;
== Background, Motivation, and Aims ==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, for two reasons.  First, while we do not have a robust and detailed understand of how users will make use of phylotastic systems, we assume that they will wish to identify trees based on sources and methods.  For instance, a user may restrict a phylotastic query so as to include only trees inferred by Maximum Likelihood, or to exclude grafted trees, or to implicate only the tree associated with the publication by Bininda-Emonds, 2007.  &lt;br /&gt;
&lt;br /&gt;
Second, one of the design criteria of phylotastic systems is to provide credible results, which in the scientific world means providing a description of provenance suitable for a scientific publication.  To be credible, a tree generated by a phylotastic system must include a description of how it was derived, which includes information on source trees as well as a description of any subsequent manipulations.  Yet, metadata play little or no role in current phylotastic component implementations.  &lt;br /&gt;
&lt;br /&gt;
Some guidance may be obtained from prior art relating to databases and to metadata.  Two databases for trees exist already, TreeBASE and Dryad  Dryad does not provide any explicit support for tree-specific annotations.  The TreeBASE input interface allows citation data, creates links to species ident, and links a matrix to a tree with a &amp;quot;analysis&amp;quot; link that may implicate a particular software program.  Though useful, the TreeBASE model falls far short of the recommendations for a &amp;quot;minimum information&amp;quot; standard for phylogeny metadata known as MIAPA, or &amp;quot;Minimum Information About a Phylogenetic Analysis&amp;quot;.  For instance, the draft MIAPA checklist from TDWG2011 calls for an explicit indication of whether a tree is a gene or species tree, whether it is rooted, what software (and version number) was used to derive it, and so on. &lt;br /&gt;
&lt;br /&gt;
The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) decided to conduct an annotation exercise that would cover the flow of information from initial annotation of trees, to querying of treestores (not including the annotation of subsequent phylotastic manipulations such as pruning or scaling).   &lt;br /&gt;
&lt;br /&gt;
The motivation for this exercise relates partly to the aspirational nature of MIAPA, which was proposed many years ago but has never evolved into a clear standard supported with convenient technology.   Those of us who have been involved in MIAPA-related efforts sensed a need for practical experiences in real-world uses of annotations.  While the scope of Phylotastic is narrower than that of MIAPA, in the sense of covering only species trees, and mainly large ones, the challenge of supporting useful metadata queries in a treestore represents a critical test of the relevance of the MIAPA checklist and the technology for encoding and managing semantic annotations.&lt;br /&gt;
&lt;br /&gt;
We also hoped to enrich current phylotastic implementations by providing metadata for a specific set of useful trees.  Hackathon participants have been using a handful of trees (APGIII, Bininda-Emonds, etc) without any metadata on citations or methods.  &lt;br /&gt;
&lt;br /&gt;
Thus, our approach has 3 inter-connected aims: &lt;br /&gt;
* to create a set of 10 usefully annotated source trees&lt;br /&gt;
* to demonstrate the feasibility of metadata-based querying in a treestore&lt;br /&gt;
* to leverage a practical annotation exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
== Approach ==&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation.  On Day 3, we decided to begin by focusing  on citations, which are not in the MIAPA checklist, with the plan to carry citation data through steps 3 to 5.  &lt;br /&gt;
&lt;br /&gt;
'''workflow, in more detail'''&lt;br /&gt;
# Identify 10 trees for use as phylotastic source trees.  Most of the trees were identified pre-hackathon by Arlin.  One of the trees was replaced on day 2 due to lack of metadata (the unpublished fish tree of Westneat &amp;amp; Lundberg).  &lt;br /&gt;
# Annotate them in free-text form.  This was done on day #2 as a team effort by Ramona, Enrico, Arlin and Andrea.  &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# Transform annotations into a formal language statements in RDF.  This was done on Days 3 and 4 by Ramona, Arlin, Enrico and Hilmar.  &lt;br /&gt;
#* Literature Citations&lt;br /&gt;
#** after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
#** we spent 6 to 8 person hours trying to do this interactively in Protege before finding an automated pathway of discovery and conversion via PubMed--&amp;gt; EndNote --(bibtex export)--&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
#** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
#* Hilmar developed an annotation ontology that incorporates CDAO, OBI, PROV and other ontologies&lt;br /&gt;
# Load trees into TreeStore.  On days 2 to 3, Joachim worked on the technology for getting our encodings into a triplestore.  Part of the challenge was deciding on an URL scheme.  &lt;br /&gt;
# Execute queries to demonstrate success.  On Day 3 we had success in querying for citation metadata.  On Day 5 ? ?  ?&lt;br /&gt;
&lt;br /&gt;
== Model for semantic encoding ==&lt;br /&gt;
&lt;br /&gt;
== more annotations ==&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4244</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4244"/>
		<updated>2013-02-03T00:28:52Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* MIAPA Resources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Factors shaping our conception of source-tree annotations ==&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
== Lessons learned from tree-finding and annotation ==&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
** [http://www.w3.org/TR/prov-o/ W3C Provenance Ontology]&lt;br /&gt;
** [http://code.google.com/p/information-artifact-ontology/ Information Artefact Ontology]&lt;br /&gt;
** [http://theswo.sourceforge.net/ Software Ontology] (includes the [http://edamontology.org/ EDAM ontologies])&lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]. (We did not use this.)&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4243</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4243"/>
		<updated>2013-02-03T00:19:22Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Lessons learned from tree-finding and annotation */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Factors shaping our conception of source-tree annotations ==&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
== Lessons learned from tree-finding and annotation ==&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]&lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
** Template development after the [https://github.com/phylotastic/treestore/tree/master/terms TNRS ontology] developed at (and after) Phylotastic I.&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4242</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4242"/>
		<updated>2013-02-03T00:19:04Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Factors shaping our conception of source-tree annotations ==&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
=== Lessons learned from tree-finding and annotation ===&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]&lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
** Template development after the [https://github.com/phylotastic/treestore/tree/master/terms TNRS ontology] developed at (and after) Phylotastic I.&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3921</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3921"/>
		<updated>2013-02-03T00:17:41Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Quick links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
* Overview: [[Media:FinalPres.pptx| Enrico's final presentation]]&lt;br /&gt;
** Also see [[:Image:PetersGraphic.jpg|example annotation graph for the Peters et al. tree and alignment]]. &lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the workflow and querying &lt;br /&gt;
* [[AdvancingMIAPA]] - report on MIAPA-relevant findings and developments&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist Reconciled_draft_checklist checklist from TDWG 2011]&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
We (1) identified a set of 10 large trees useful as phylotastic source trees (2) created free-text annotations (metadata) for citations, sources and methods, (3) encoded the data and metadata as RDF using CDAO and a new ontology, (4) loaded the encoded information into a triplestore, and (5) demonstrated logical querying based on data and metadata attributes.  During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, working out technical bugs in the workflow, and addressing emerging challenges.  The tangible outcomes of this exercise include &lt;br /&gt;
* a set of 10 source trees (720 to 250,000 species) with low-res metadata (see [[AnnotatedPhylotasticSourceTrees]])&lt;br /&gt;
* a demonstration of semantic annotation, data-basing, and querying (see [[TreestoreMetadataQueryDemonstration]])&lt;br /&gt;
** a workflow plan for encoding tree data and metadata and loading them into a treestore&lt;br /&gt;
** a treestore instance populated with some of these data and metadata&lt;br /&gt;
* Advances in Minimum Information About a Phylogenetic Analysis (see [[AdvancingMIAPA]])&lt;br /&gt;
** a new &amp;quot;MIAPA&amp;quot; ontology  that leverages several existing ontologies&lt;br /&gt;
** recommendations on the draft checklist, and input form&lt;br /&gt;
* optionally&lt;br /&gt;
** a screencast&lt;br /&gt;
** a draft GSOC proposal&lt;br /&gt;
&lt;br /&gt;
== Background, Motivation, and Aims ==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, for two reasons.  First, while we do not have a robust and detailed understand of how users will make use of phylotastic systems, we assume that they will wish to identify trees based on sources and methods.  For instance, a user may restrict a phylotastic query so as to include only trees inferred by Maximum Likelihood, or to exclude grafted trees, or to implicate only the tree associated with the publication by Bininda-Emonds, 2007.  &lt;br /&gt;
&lt;br /&gt;
Second, one of the design criteria of phylotastic systems is to provide credible results, which in the scientific world means providing a description of provenance suitable for a scientific publication.  To be credible, a tree generated by a phylotastic system must include a description of how it was derived, which includes information on source trees as well as a description of any subsequent manipulations.  Yet, metadata play little or no role in current phylotastic component implementations.  &lt;br /&gt;
&lt;br /&gt;
Some guidance may be obtained from prior art relating to databases and to metadata.  Two databases for trees exist already, TreeBASE and Dryad  Dryad does not provide any explicit support for tree-specific annotations.  The TreeBASE input interface allows citation data, creates links to species ident, and links a matrix to a tree with a &amp;quot;analysis&amp;quot; link that may implicate a particular software program.  Though useful, the TreeBASE model falls far short of the recommendations for a &amp;quot;minimum information&amp;quot; standard for phylogeny metadata known as MIAPA, or &amp;quot;Minimum Information About a Phylogenetic Analysis&amp;quot;.  For instance, the draft MIAPA checklist from TDWG2011 calls for an explicit indication of whether a tree is a gene or species tree, whether it is rooted, what software (and version number) was used to derive it, and so on. &lt;br /&gt;
&lt;br /&gt;
The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) decided to conduct an annotation exercise that would cover the flow of information from initial annotation of trees, to querying of treestores (not including the annotation of subsequent phylotastic manipulations such as pruning or scaling).   &lt;br /&gt;
&lt;br /&gt;
The motivation for this exercise relates partly to the aspirational nature of MIAPA, which was proposed many years ago but has never evolved into a clear standard supported with convenient technology.   Those of us who have been involved in MIAPA-related efforts sensed a need for practical experiences in real-world uses of annotations.  While the scope of Phylotastic is narrower than that of MIAPA, in the sense of covering only species trees, and mainly large ones, the challenge of supporting useful metadata queries in a treestore represents a critical test of the relevance of the MIAPA checklist and the technology for encoding and managing semantic annotations.&lt;br /&gt;
&lt;br /&gt;
We also hoped to enrich current phylotastic implementations by providing metadata for a specific set of useful trees.  Hackathon participants have been using a handful of trees (APGIII, Bininda-Emonds, etc) without any metadata on citations or methods.  &lt;br /&gt;
&lt;br /&gt;
Thus, our approach has 3 inter-connected aims: &lt;br /&gt;
* to create a set of 10 usefully annotated source trees&lt;br /&gt;
* to demonstrate the feasibility of metadata-based querying in a treestore&lt;br /&gt;
* to leverage a practical annotation exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
== Approach ==&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation.  On Day 3, we decided to begin by focusing  on citations, which are not in the MIAPA checklist, with the plan to carry citation data through steps 3 to 5.  &lt;br /&gt;
&lt;br /&gt;
'''workflow, in more detail'''&lt;br /&gt;
# Identify 10 trees for use as phylotastic source trees.  Most of the trees were identified pre-hackathon by Arlin.  One of the trees was replaced on day 2 due to lack of metadata (the unpublished fish tree of Westneat &amp;amp; Lundberg).  &lt;br /&gt;
# Annotate them in free-text form.  This was done on day #2 as a team effort by Ramona, Enrico, Arlin and Andrea.  &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# Transform annotations into a formal language statements in RDF.  This was done on Days 3 and 4 by Ramona, Arlin, Enrico and Hilmar.  &lt;br /&gt;
#* Literature Citations&lt;br /&gt;
#** after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
#** we spent 6 to 8 person hours trying to do this interactively in Protege before finding an automated pathway of discovery and conversion via PubMed--&amp;gt; EndNote --(bibtex export)--&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
#** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
#* Hilmar developed an annotation ontology that incorporates CDAO, OBI, PROV and other ontologies&lt;br /&gt;
# Load trees into TreeStore.  On days 2 to 3, Joachim worked on the technology for getting our encodings into a triplestore.  Part of the challenge was deciding on an URL scheme.  &lt;br /&gt;
# Execute queries to demonstrate success.  On Day 3 we had success in querying for citation metadata.  On Day 5 ? ?  ?&lt;br /&gt;
&lt;br /&gt;
== Model for semantic encoding ==&lt;br /&gt;
&lt;br /&gt;
==additional suggestions for MIAPA ontology==&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
== more annotations ==&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3920</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3920"/>
		<updated>2013-02-03T00:16:57Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Quick links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
* [[Media:FinalPres.pptx| Enrico's final presentation]]&lt;br /&gt;
** Also see [[:Image:PetersGraphic.jpg|example annotation graph for the Peters et al. tree and alignment]]. &lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the workflow and querying &lt;br /&gt;
* [[AdvancingMIAPA]] - report on MIAPA-relevant findings and developments&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist Reconciled_draft_checklist checklist from TDWG 2011]&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
We (1) identified a set of 10 large trees useful as phylotastic source trees (2) created free-text annotations (metadata) for citations, sources and methods, (3) encoded the data and metadata as RDF using CDAO and a new ontology, (4) loaded the encoded information into a triplestore, and (5) demonstrated logical querying based on data and metadata attributes.  During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, working out technical bugs in the workflow, and addressing emerging challenges.  The tangible outcomes of this exercise include &lt;br /&gt;
* a set of 10 source trees (720 to 250,000 species) with low-res metadata (see [[AnnotatedPhylotasticSourceTrees]])&lt;br /&gt;
* a demonstration of semantic annotation, data-basing, and querying (see [[TreestoreMetadataQueryDemonstration]])&lt;br /&gt;
** a workflow plan for encoding tree data and metadata and loading them into a treestore&lt;br /&gt;
** a treestore instance populated with some of these data and metadata&lt;br /&gt;
* Advances in Minimum Information About a Phylogenetic Analysis (see [[AdvancingMIAPA]])&lt;br /&gt;
** a new &amp;quot;MIAPA&amp;quot; ontology  that leverages several existing ontologies&lt;br /&gt;
** recommendations on the draft checklist, and input form&lt;br /&gt;
* optionally&lt;br /&gt;
** a screencast&lt;br /&gt;
** a draft GSOC proposal&lt;br /&gt;
&lt;br /&gt;
== Background, Motivation, and Aims ==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, for two reasons.  First, while we do not have a robust and detailed understand of how users will make use of phylotastic systems, we assume that they will wish to identify trees based on sources and methods.  For instance, a user may restrict a phylotastic query so as to include only trees inferred by Maximum Likelihood, or to exclude grafted trees, or to implicate only the tree associated with the publication by Bininda-Emonds, 2007.  &lt;br /&gt;
&lt;br /&gt;
Second, one of the design criteria of phylotastic systems is to provide credible results, which in the scientific world means providing a description of provenance suitable for a scientific publication.  To be credible, a tree generated by a phylotastic system must include a description of how it was derived, which includes information on source trees as well as a description of any subsequent manipulations.  Yet, metadata play little or no role in current phylotastic component implementations.  &lt;br /&gt;
&lt;br /&gt;
Some guidance may be obtained from prior art relating to databases and to metadata.  Two databases for trees exist already, TreeBASE and Dryad  Dryad does not provide any explicit support for tree-specific annotations.  The TreeBASE input interface allows citation data, creates links to species ident, and links a matrix to a tree with a &amp;quot;analysis&amp;quot; link that may implicate a particular software program.  Though useful, the TreeBASE model falls far short of the recommendations for a &amp;quot;minimum information&amp;quot; standard for phylogeny metadata known as MIAPA, or &amp;quot;Minimum Information About a Phylogenetic Analysis&amp;quot;.  For instance, the draft MIAPA checklist from TDWG2011 calls for an explicit indication of whether a tree is a gene or species tree, whether it is rooted, what software (and version number) was used to derive it, and so on. &lt;br /&gt;
&lt;br /&gt;
The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) decided to conduct an annotation exercise that would cover the flow of information from initial annotation of trees, to querying of treestores (not including the annotation of subsequent phylotastic manipulations such as pruning or scaling).   &lt;br /&gt;
&lt;br /&gt;
The motivation for this exercise relates partly to the aspirational nature of MIAPA, which was proposed many years ago but has never evolved into a clear standard supported with convenient technology.   Those of us who have been involved in MIAPA-related efforts sensed a need for practical experiences in real-world uses of annotations.  While the scope of Phylotastic is narrower than that of MIAPA, in the sense of covering only species trees, and mainly large ones, the challenge of supporting useful metadata queries in a treestore represents a critical test of the relevance of the MIAPA checklist and the technology for encoding and managing semantic annotations.&lt;br /&gt;
&lt;br /&gt;
We also hoped to enrich current phylotastic implementations by providing metadata for a specific set of useful trees.  Hackathon participants have been using a handful of trees (APGIII, Bininda-Emonds, etc) without any metadata on citations or methods.  &lt;br /&gt;
&lt;br /&gt;
Thus, our approach has 3 inter-connected aims: &lt;br /&gt;
* to create a set of 10 usefully annotated source trees&lt;br /&gt;
* to demonstrate the feasibility of metadata-based querying in a treestore&lt;br /&gt;
* to leverage a practical annotation exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
== Approach ==&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation.  On Day 3, we decided to begin by focusing  on citations, which are not in the MIAPA checklist, with the plan to carry citation data through steps 3 to 5.  &lt;br /&gt;
&lt;br /&gt;
'''workflow, in more detail'''&lt;br /&gt;
# Identify 10 trees for use as phylotastic source trees.  Most of the trees were identified pre-hackathon by Arlin.  One of the trees was replaced on day 2 due to lack of metadata (the unpublished fish tree of Westneat &amp;amp; Lundberg).  &lt;br /&gt;
# Annotate them in free-text form.  This was done on day #2 as a team effort by Ramona, Enrico, Arlin and Andrea.  &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# Transform annotations into a formal language statements in RDF.  This was done on Days 3 and 4 by Ramona, Arlin, Enrico and Hilmar.  &lt;br /&gt;
#* Literature Citations&lt;br /&gt;
#** after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
#** we spent 6 to 8 person hours trying to do this interactively in Protege before finding an automated pathway of discovery and conversion via PubMed--&amp;gt; EndNote --(bibtex export)--&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
#** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
#* Hilmar developed an annotation ontology that incorporates CDAO, OBI, PROV and other ontologies&lt;br /&gt;
# Load trees into TreeStore.  On days 2 to 3, Joachim worked on the technology for getting our encodings into a triplestore.  Part of the challenge was deciding on an URL scheme.  &lt;br /&gt;
# Execute queries to demonstrate success.  On Day 3 we had success in querying for citation metadata.  On Day 5 ? ?  ?&lt;br /&gt;
&lt;br /&gt;
== Model for semantic encoding ==&lt;br /&gt;
&lt;br /&gt;
==additional suggestions for MIAPA ontology==&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
== more annotations ==&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3919</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3919"/>
		<updated>2013-02-03T00:14:45Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* Quick links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
* [[Media:FinalPres.pptx| Enrico's final presentation]] &lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the workflow and querying &lt;br /&gt;
* [[AdvancingMIAPA]] - report on MIAPA-relevant findings and developments&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist Reconciled_draft_checklist checklist from TDWG 2011]&lt;br /&gt;
&lt;br /&gt;
== Synopsis ==&lt;br /&gt;
&lt;br /&gt;
We (1) identified a set of 10 large trees useful as phylotastic source trees (2) created free-text annotations (metadata) for citations, sources and methods, (3) encoded the data and metadata as RDF using CDAO and a new ontology, (4) loaded the encoded information into a triplestore, and (5) demonstrated logical querying based on data and metadata attributes.  During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, working out technical bugs in the workflow, and addressing emerging challenges.  The tangible outcomes of this exercise include &lt;br /&gt;
* a set of 10 source trees (720 to 250,000 species) with low-res metadata (see [[AnnotatedPhylotasticSourceTrees]])&lt;br /&gt;
* a demonstration of semantic annotation, data-basing, and querying (see [[TreestoreMetadataQueryDemonstration]])&lt;br /&gt;
** a workflow plan for encoding tree data and metadata and loading them into a treestore&lt;br /&gt;
** a treestore instance populated with some of these data and metadata&lt;br /&gt;
* Advances in Minimum Information About a Phylogenetic Analysis (see [[AdvancingMIAPA]])&lt;br /&gt;
** a new &amp;quot;MIAPA&amp;quot; ontology  that leverages several existing ontologies&lt;br /&gt;
** recommendations on the draft checklist, and input form&lt;br /&gt;
* optionally&lt;br /&gt;
** a screencast&lt;br /&gt;
** a draft GSOC proposal&lt;br /&gt;
&lt;br /&gt;
== Background, Motivation, and Aims ==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, for two reasons.  First, while we do not have a robust and detailed understand of how users will make use of phylotastic systems, we assume that they will wish to identify trees based on sources and methods.  For instance, a user may restrict a phylotastic query so as to include only trees inferred by Maximum Likelihood, or to exclude grafted trees, or to implicate only the tree associated with the publication by Bininda-Emonds, 2007.  &lt;br /&gt;
&lt;br /&gt;
Second, one of the design criteria of phylotastic systems is to provide credible results, which in the scientific world means providing a description of provenance suitable for a scientific publication.  To be credible, a tree generated by a phylotastic system must include a description of how it was derived, which includes information on source trees as well as a description of any subsequent manipulations.  Yet, metadata play little or no role in current phylotastic component implementations.  &lt;br /&gt;
&lt;br /&gt;
Some guidance may be obtained from prior art relating to databases and to metadata.  Two databases for trees exist already, TreeBASE and Dryad  Dryad does not provide any explicit support for tree-specific annotations.  The TreeBASE input interface allows citation data, creates links to species ident, and links a matrix to a tree with a &amp;quot;analysis&amp;quot; link that may implicate a particular software program.  Though useful, the TreeBASE model falls far short of the recommendations for a &amp;quot;minimum information&amp;quot; standard for phylogeny metadata known as MIAPA, or &amp;quot;Minimum Information About a Phylogenetic Analysis&amp;quot;.  For instance, the draft MIAPA checklist from TDWG2011 calls for an explicit indication of whether a tree is a gene or species tree, whether it is rooted, what software (and version number) was used to derive it, and so on. &lt;br /&gt;
&lt;br /&gt;
The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) decided to conduct an annotation exercise that would cover the flow of information from initial annotation of trees, to querying of treestores (not including the annotation of subsequent phylotastic manipulations such as pruning or scaling).   &lt;br /&gt;
&lt;br /&gt;
The motivation for this exercise relates partly to the aspirational nature of MIAPA, which was proposed many years ago but has never evolved into a clear standard supported with convenient technology.   Those of us who have been involved in MIAPA-related efforts sensed a need for practical experiences in real-world uses of annotations.  While the scope of Phylotastic is narrower than that of MIAPA, in the sense of covering only species trees, and mainly large ones, the challenge of supporting useful metadata queries in a treestore represents a critical test of the relevance of the MIAPA checklist and the technology for encoding and managing semantic annotations.&lt;br /&gt;
&lt;br /&gt;
We also hoped to enrich current phylotastic implementations by providing metadata for a specific set of useful trees.  Hackathon participants have been using a handful of trees (APGIII, Bininda-Emonds, etc) without any metadata on citations or methods.  &lt;br /&gt;
&lt;br /&gt;
Thus, our approach has 3 inter-connected aims: &lt;br /&gt;
* to create a set of 10 usefully annotated source trees&lt;br /&gt;
* to demonstrate the feasibility of metadata-based querying in a treestore&lt;br /&gt;
* to leverage a practical annotation exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
== Approach ==&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation.  On Day 3, we decided to begin by focusing  on citations, which are not in the MIAPA checklist, with the plan to carry citation data through steps 3 to 5.  &lt;br /&gt;
&lt;br /&gt;
'''workflow, in more detail'''&lt;br /&gt;
# Identify 10 trees for use as phylotastic source trees.  Most of the trees were identified pre-hackathon by Arlin.  One of the trees was replaced on day 2 due to lack of metadata (the unpublished fish tree of Westneat &amp;amp; Lundberg).  &lt;br /&gt;
# Annotate them in free-text form.  This was done on day #2 as a team effort by Ramona, Enrico, Arlin and Andrea.  &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# Transform annotations into a formal language statements in RDF.  This was done on Days 3 and 4 by Ramona, Arlin, Enrico and Hilmar.  &lt;br /&gt;
#* Literature Citations&lt;br /&gt;
#** after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
#** we spent 6 to 8 person hours trying to do this interactively in Protege before finding an automated pathway of discovery and conversion via PubMed--&amp;gt; EndNote --(bibtex export)--&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
#** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
#* Hilmar developed an annotation ontology that incorporates CDAO, OBI, PROV and other ontologies&lt;br /&gt;
# Load trees into TreeStore.  On days 2 to 3, Joachim worked on the technology for getting our encodings into a triplestore.  Part of the challenge was deciding on an URL scheme.  &lt;br /&gt;
# Execute queries to demonstrate success.  On Day 3 we had success in querying for citation metadata.  On Day 5 ? ?  ?&lt;br /&gt;
&lt;br /&gt;
== Model for semantic encoding ==&lt;br /&gt;
&lt;br /&gt;
==additional suggestions for MIAPA ontology==&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
== more annotations ==&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3907</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3907"/>
		<updated>2013-02-01T16:51:51Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* additional suggestions for MIAPA ontology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
'''Reports'''&lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees, focusing on the types of metadata available, and how they might be used in phylotastic systems&lt;br /&gt;
** [https://docs.google.com/spreadsheet/ccc?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c#gid=0 spreadsheet]&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the model of semantic encoding, the technology for translation, the treestore technology, and the implications of this for supporting phylotastic querying&lt;br /&gt;
* [[AdvancingMIAPA]] report page - report on the adequacy of the MIAPA checklist, recommendations for revisions, ontology development, challenges of semantic encoding, and also (redundant to above report) the model of semantic encoding.  &lt;br /&gt;
&lt;br /&gt;
'''Other tangible outcomes'''&lt;br /&gt;
* new MIAPA ontology&lt;br /&gt;
* GSOC project proposal&lt;br /&gt;
&lt;br /&gt;
'''Key resources'''&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist] from TDWG 2011&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, enabling users to find trees based on sources and methods, and to generate a credible report of provenance for phylotastically generated trees.  Yet, metadata play no role in current phylotastic component implementations.  The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) set out to address this deficiency.  We developed an approach with 3 inter-connected goals: &lt;br /&gt;
&lt;br /&gt;
* create a set of 10 usefully annotated source trees&lt;br /&gt;
* demonstrate metadata-based querying in a treestore&lt;br /&gt;
* leverage this exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation. &lt;br /&gt;
&lt;br /&gt;
During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, implementing tools, and addressing emerging challenges.  &lt;br /&gt;
&lt;br /&gt;
The tangible outcomes of the group relate to phylotastic source trees (a set of trees with metadata); software tools for processing, storage and querying; an ontology to support MIAPA annotations, along with a revised MIAPA checklist and form; and written reports on these 3 types of outputs, available on this wiki.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Detailed approach ==&lt;br /&gt;
&lt;br /&gt;
* develop plan (day 1)&lt;br /&gt;
** revise as needed&lt;br /&gt;
** some work is done in parallel&lt;br /&gt;
&lt;br /&gt;
* main workflow&lt;br /&gt;
# identify 10 trees for use as phylotastic source trees&lt;br /&gt;
# annotate them in free-text form &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# transform annotations into a formal language statements in RDF&lt;br /&gt;
#* encoding process is iterative with ontology editing&lt;br /&gt;
#* Hilmar is working on language support&lt;br /&gt;
#* Joachim is working on the technology for getting this into a triplestore &lt;br /&gt;
#* Get URI for tree from TreeStore, add annotations to that URI in Protege&lt;br /&gt;
# Load trees into TreeStore&lt;br /&gt;
#* Will need to have trees in the correct format&lt;br /&gt;
# execute queries to demonstrate success&lt;br /&gt;
&lt;br /&gt;
== Log and accomplishments ==&lt;br /&gt;
* initial plan (day 1)&lt;br /&gt;
* initial MIAPA checklist-based input form (day 1)&lt;br /&gt;
* revised input form &lt;br /&gt;
* plan for (temporarily) storing trees and matrices (data) separate from metadata &lt;br /&gt;
* annotations of 10 trees&lt;br /&gt;
* translation technology&lt;br /&gt;
** NEXUS issues, dendropy, &lt;br /&gt;
** protege deals poorly with unnamed individuals &lt;br /&gt;
* ontology for annotation &lt;br /&gt;
&lt;br /&gt;
From day 4, [[Media:followup_goals.jpg]] from white board.&lt;br /&gt;
&lt;br /&gt;
=== citation exercise ===&lt;br /&gt;
&lt;br /&gt;
'''goal:''' annotate trees with citation data, encode, import into treestore, demonstrate querying based on citation metadata &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''notes on encoding'''&lt;br /&gt;
* after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
* we failed to find any pre-existing method to auto-convert EndNote (or BibTex or Zotero) into BIBO &lt;br /&gt;
* so we started hand-encoding them using Protege instances&lt;br /&gt;
** authors&lt;br /&gt;
** articles &lt;br /&gt;
*** used Data property &amp;quot;short title&amp;quot; instead of object property title &lt;br /&gt;
*** used date of issue for publication year&lt;br /&gt;
** author-lists (RDF:list?)&lt;br /&gt;
* ultimately we ended up getting the encoded citations via PubMed--&amp;gt; EndNote --&amp;gt; bibtex export --&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
&lt;br /&gt;
===additional suggestions for MIAPA ontology===&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
* Add a class for parsimony under algorithm. [http://github.com/miapa/miapa/issues/7 Filed as Issue #7] &lt;br /&gt;
&lt;br /&gt;
* It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation. [http://github.com/miapa/miapa/issues/8 Filed as Issue #8]&lt;br /&gt;
&lt;br /&gt;
* Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
** ''Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.''&lt;br /&gt;
** ''Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.''&lt;br /&gt;
&lt;br /&gt;
* Remove class for SILVA. [http://github.com/miapa/miapa/issues/9 Filed as Issue #9]&lt;br /&gt;
&lt;br /&gt;
* Add new class for set of trees. [http://github.com/miapa/miapa/issues/10 Filed as Issue #10]&lt;br /&gt;
&lt;br /&gt;
=== more annotations ===&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;br /&gt;
&lt;br /&gt;
=== semantic links for tree, citation, methods, etc ===&lt;br /&gt;
&lt;br /&gt;
* tree has unique URI produced during loading: http://phylotastic.org/hack2/...number.../...treename...#tree1&lt;br /&gt;
&lt;br /&gt;
* how rooted tree connects together &lt;br /&gt;
 :tree1 has_root node0 ;&lt;br /&gt;
&lt;br /&gt;
* how unrooted tree connects together, using the belongs_to_tree relation&lt;br /&gt;
  :node9&amp;gt; obo:CDAO_0000200 :tree1 ; &lt;br /&gt;
* and the same for all the other nodes and edges.  &lt;br /&gt;
&lt;br /&gt;
* how tree connects with citation (assume that pub1 is the root of the &amp;lt;bibo:AcademicArticle&amp;gt; individual )&lt;br /&gt;
  :tree1 dcterms:isReferencedBy :pub1 ;&lt;br /&gt;
&lt;br /&gt;
* some other ideas&lt;br /&gt;
** :pub1 IAO:is_about :tree1&lt;br /&gt;
** :pub1 documents :tree1&lt;br /&gt;
** cito:provides_methods_for :tree1&lt;br /&gt;
** :pub1 cito:provides_data_for :tree1&lt;br /&gt;
&lt;br /&gt;
* how tree connects with methods annotation &lt;br /&gt;
 :tree1 prov:wasGeneratedBy :tree_activity1 ;&lt;br /&gt;
&lt;br /&gt;
* how char matrix connects with methods annotation &lt;br /&gt;
 :align1 prov:wasGeneratedBy :align_activity1 ;&lt;br /&gt;
&lt;br /&gt;
* how tree connects with char matrix &lt;br /&gt;
 :tree1 prov:wasDerivedFrom :align1 ;&lt;br /&gt;
&lt;br /&gt;
==Annotation Workflow==&lt;br /&gt;
&lt;br /&gt;
'''Example file:''' Tree_2_Peters_et_al.newick&lt;br /&gt;
&lt;br /&gt;
1. `python treestore.py add Tree_2_Peters_et_al.newick newick Peters2011hymenoptera`&lt;br /&gt;
* reads Newick file `Tree_2_Peters_et_al.newick`&lt;br /&gt;
* stores the tree in the named graph `http://prefix/Peters2011hymenoptera`&lt;br /&gt;
* the URI prefix is automatically generated; it is a hash that (more or less) uniquely identifies the data loaded&lt;br /&gt;
2. `python treestore.py uri`&lt;br /&gt;
* lists tree URIs in the triple store&lt;br /&gt;
* will show something along the line: &amp;quot;Peters2011hymenoptera   http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera#tree0000001&amp;quot;&lt;br /&gt;
3. `rdfcat -out N-TRIPLE annotations.rdf &amp;gt; annotations.ntriples`&lt;br /&gt;
* takes annotations (saved with Protege as RDF/XML, Turtle, or other format)&lt;br /&gt;
* outputs N-Triples&lt;br /&gt;
4. `python treestore.py add annotations.ntriples ntriples http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera`&lt;br /&gt;
* adds the annotations to the named graph `http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera`&lt;br /&gt;
* the URI for the named graph is the URI returned by `python treestore.py uri` up to the `#` character&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3906</id>
		<title>Tree Annotation</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Tree_Annotation&amp;diff=3906"/>
		<updated>2013-02-01T16:43:03Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* additional suggestions for MIAPA ontology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;'''Synopsis''' Annotate a small set of large trees used as sources of phylogenetic knowledge in an automated delivery system for tree-o-life knowledge called &amp;quot;Phylotastic&amp;quot;.   &lt;br /&gt;
&lt;br /&gt;
== Quick links ==&lt;br /&gt;
&lt;br /&gt;
'''Reports'''&lt;br /&gt;
* [[AnnotatedPhylotasticSourceTrees]] - report on the set of source trees, focusing on the types of metadata available, and how they might be used in phylotastic systems&lt;br /&gt;
** [https://docs.google.com/spreadsheet/ccc?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c#gid=0 spreadsheet]&lt;br /&gt;
* [[TreestoreMetadataQueryDemonstration]] - report on the model of semantic encoding, the technology for translation, the treestore technology, and the implications of this for supporting phylotastic querying&lt;br /&gt;
* [[AdvancingMIAPA]] report page - report on the adequacy of the MIAPA checklist, recommendations for revisions, ontology development, challenges of semantic encoding, and also (redundant to above report) the model of semantic encoding.  &lt;br /&gt;
&lt;br /&gt;
'''Other tangible outcomes'''&lt;br /&gt;
* new MIAPA ontology&lt;br /&gt;
* GSOC project proposal&lt;br /&gt;
&lt;br /&gt;
'''Key resources'''&lt;br /&gt;
* [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist checklist] from TDWG 2011&lt;br /&gt;
*&lt;br /&gt;
&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
Metadata annotations represent an essential part of the design of phylotastic systems, enabling users to find trees based on sources and methods, and to generate a credible report of provenance for phylotastically generated trees.  Yet, metadata play no role in current phylotastic component implementations.  The TreeAnnotation team of hackathon 2 (Enrico, Hilmar, Joachim, Arlin, Ramona and 0.5 of Andrea) set out to address this deficiency.  We developed an approach with 3 inter-connected goals: &lt;br /&gt;
&lt;br /&gt;
* create a set of 10 usefully annotated source trees&lt;br /&gt;
* demonstrate metadata-based querying in a treestore&lt;br /&gt;
* leverage this exercise to advance the MIAPA project&lt;br /&gt;
&lt;br /&gt;
Our approach consisted of the following steps&lt;br /&gt;
# identify 10 useful source trees with available publications&lt;br /&gt;
# generate free-text annotations &lt;br /&gt;
# encode citations and annotations in computable form&lt;br /&gt;
# load the citation, annotations, and trees into a treestore&lt;br /&gt;
# demonstrate querying based on metadata&lt;br /&gt;
&lt;br /&gt;
In particular, we chose to gather metadata corresponding to the MIAPA draft checklist, to enode it as RDF using a new ontology that imports several other ontologies, and to load the results into Ben Morris's Virtuoso-based treestore implementation. &lt;br /&gt;
&lt;br /&gt;
During the hackathon, group members spent their time developing and revising a strategy,  interpreting source materials, developing language support, encoding annotations, implementing tools, and addressing emerging challenges.  &lt;br /&gt;
&lt;br /&gt;
The tangible outcomes of the group relate to phylotastic source trees (a set of trees with metadata); software tools for processing, storage and querying; an ontology to support MIAPA annotations, along with a revised MIAPA checklist and form; and written reports on these 3 types of outputs, available on this wiki.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Detailed approach ==&lt;br /&gt;
&lt;br /&gt;
* develop plan (day 1)&lt;br /&gt;
** revise as needed&lt;br /&gt;
** some work is done in parallel&lt;br /&gt;
&lt;br /&gt;
* main workflow&lt;br /&gt;
# identify 10 trees for use as phylotastic source trees&lt;br /&gt;
# annotate them in free-text form &lt;br /&gt;
#* create [https://docs.google.com/spreadsheet/gform?key=0An7iC3iCSumjdDRob1NjUUdxVHFRSG9ZQWltZ0ExZ3c&amp;amp;hl=en#edit web form] in Google docs for input of annotations, based on [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop&lt;br /&gt;
#* Spread sheet has pull down menus, plus options for free text entries under &amp;quot;other&amp;quot;&lt;br /&gt;
# transform annotations into a formal language statements in RDF&lt;br /&gt;
#* encoding process is iterative with ontology editing&lt;br /&gt;
#* Hilmar is working on language support&lt;br /&gt;
#* Joachim is working on the technology for getting this into a triplestore &lt;br /&gt;
#* Get URI for tree from TreeStore, add annotations to that URI in Protege&lt;br /&gt;
# Load trees into TreeStore&lt;br /&gt;
#* Will need to have trees in the correct format&lt;br /&gt;
# execute queries to demonstrate success&lt;br /&gt;
&lt;br /&gt;
== Log and accomplishments ==&lt;br /&gt;
* initial plan (day 1)&lt;br /&gt;
* initial MIAPA checklist-based input form (day 1)&lt;br /&gt;
* revised input form &lt;br /&gt;
* plan for (temporarily) storing trees and matrices (data) separate from metadata &lt;br /&gt;
* annotations of 10 trees&lt;br /&gt;
* translation technology&lt;br /&gt;
** NEXUS issues, dendropy, &lt;br /&gt;
** protege deals poorly with unnamed individuals &lt;br /&gt;
* ontology for annotation &lt;br /&gt;
&lt;br /&gt;
From day 4, [[Media:followup_goals.jpg]] from white board.&lt;br /&gt;
&lt;br /&gt;
=== citation exercise ===&lt;br /&gt;
&lt;br /&gt;
'''goal:''' annotate trees with citation data, encode, import into treestore, demonstrate querying based on citation metadata &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
'''notes on encoding'''&lt;br /&gt;
* after some discussion,  we decided to use BIBO (not dc or prism alone)&lt;br /&gt;
* we failed to find any pre-existing method to auto-convert EndNote (or BibTex or Zotero) into BIBO &lt;br /&gt;
* so we started hand-encoding them using Protege instances&lt;br /&gt;
** authors&lt;br /&gt;
** articles &lt;br /&gt;
*** used Data property &amp;quot;short title&amp;quot; instead of object property title &lt;br /&gt;
*** used date of issue for publication year&lt;br /&gt;
** author-lists (RDF:list?)&lt;br /&gt;
* ultimately we ended up getting the encoded citations via PubMed--&amp;gt; EndNote --&amp;gt; bibtex export --&amp;gt; Zotero --&amp;gt; bibo export (bibliontology RDF).  &lt;br /&gt;
** here is the [[file:10trees_bibliontology.rdf]]&lt;br /&gt;
&lt;br /&gt;
===additional suggestions for MIAPA ontology===&lt;br /&gt;
From annotation session on afternoon of 1/31.&lt;br /&gt;
&lt;br /&gt;
Add a class for parsimony under algorithm.&lt;br /&gt;
&lt;br /&gt;
It would be good to generate an instance of useMaximumLikelihood (&amp;quot;Maximum Likelihood algorithm&amp;quot;) in MIAPA, so we don't have to create one for each annotation.&lt;br /&gt;
&lt;br /&gt;
Alternatively, maybe make classes of software (like PhyML or RAxML) implement ML algorithm, rather than having to assert it for each instance we create. Some software can use multiple algorithms, so we can't do this for each case.&lt;br /&gt;
* Note that in OWL classes cannot be asserted to have property values, only instances can. We can put property restrictions with existential quantification on classes, and a OWL reasoner could then infer that an instance must have at least one such property association (and thus a DL query should in principle return the instance), but this wouldn't work in an RDF triple store so that we could then actually query for these things in SPARQL.&lt;br /&gt;
* Note also that there can be multiple swo:implements assertions for a software instance, so multiple algorithms can be easily asserted. However, this wouldn't the also say which of those implemented algorithms was the one utilized for the generation of the tree of alignment. The idea is that this would be evident from the miapa:'Parameter specification'.&lt;br /&gt;
&lt;br /&gt;
Remove class for SILVA.&lt;br /&gt;
&lt;br /&gt;
Add new class for set of trees.&lt;br /&gt;
&lt;br /&gt;
=== more annotations ===&lt;br /&gt;
&lt;br /&gt;
miapa ontology&lt;br /&gt;
* topology &lt;br /&gt;
** gene tree vs species tree: Network:Tree:'Gene tree' or SpeciesTree&lt;br /&gt;
** rooted: Network:Tree:RootedTree or UnrootedTree&lt;br /&gt;
** 'Consensus tree'&lt;br /&gt;
* otus&lt;br /&gt;
** toTaxon, object property, points to taxon concept, can be URI from NCBI or other authority &lt;br /&gt;
** derived_from specimen &lt;br /&gt;
** location imported from geo&lt;br /&gt;
* branch properties&lt;br /&gt;
** branch lengths:&lt;br /&gt;
*** data property edge length&lt;br /&gt;
*** object property has_Annotation edge_length&lt;br /&gt;
** branch support: data property has support value either bootstrap or posterior prob&lt;br /&gt;
* character matrix&lt;br /&gt;
* alignment method &lt;br /&gt;
** name of software, version &lt;br /&gt;
** parameters &lt;br /&gt;
** manual correction &lt;br /&gt;
* tree inference method&lt;br /&gt;
** name of software, version: tree wasGeneratedBy (activity=) software procedure; software procedure wasAssociatedWith instance of software agent named &amp;quot;RaXML&amp;quot;&lt;br /&gt;
** parameters: (activity) used instance of a parameter specification (which is a kind of plan)  &lt;br /&gt;
** character weights&lt;br /&gt;
&lt;br /&gt;
=== semantic links for tree, citation, methods, etc ===&lt;br /&gt;
&lt;br /&gt;
* tree has unique URI produced during loading: http://phylotastic.org/hack2/...number.../...treename...#tree1&lt;br /&gt;
&lt;br /&gt;
* how rooted tree connects together &lt;br /&gt;
 :tree1 has_root node0 ;&lt;br /&gt;
&lt;br /&gt;
* how unrooted tree connects together, using the belongs_to_tree relation&lt;br /&gt;
  :node9&amp;gt; obo:CDAO_0000200 :tree1 ; &lt;br /&gt;
* and the same for all the other nodes and edges.  &lt;br /&gt;
&lt;br /&gt;
* how tree connects with citation (assume that pub1 is the root of the &amp;lt;bibo:AcademicArticle&amp;gt; individual )&lt;br /&gt;
  :tree1 dcterms:isReferencedBy :pub1 ;&lt;br /&gt;
&lt;br /&gt;
* some other ideas&lt;br /&gt;
** :pub1 IAO:is_about :tree1&lt;br /&gt;
** :pub1 documents :tree1&lt;br /&gt;
** cito:provides_methods_for :tree1&lt;br /&gt;
** :pub1 cito:provides_data_for :tree1&lt;br /&gt;
&lt;br /&gt;
* how tree connects with methods annotation &lt;br /&gt;
 :tree1 prov:wasGeneratedBy :tree_activity1 ;&lt;br /&gt;
&lt;br /&gt;
* how char matrix connects with methods annotation &lt;br /&gt;
 :align1 prov:wasGeneratedBy :align_activity1 ;&lt;br /&gt;
&lt;br /&gt;
* how tree connects with char matrix &lt;br /&gt;
 :tree1 prov:wasDerivedFrom :align1 ;&lt;br /&gt;
&lt;br /&gt;
==Annotation Workflow==&lt;br /&gt;
&lt;br /&gt;
'''Example file:''' Tree_2_Peters_et_al.newick&lt;br /&gt;
&lt;br /&gt;
1. `python treestore.py add Tree_2_Peters_et_al.newick newick Peters2011hymenoptera`&lt;br /&gt;
* reads Newick file `Tree_2_Peters_et_al.newick`&lt;br /&gt;
* stores the tree in the named graph `http://prefix/Peters2011hymenoptera`&lt;br /&gt;
* the URI prefix is automatically generated; it is a hash that (more or less) uniquely identifies the data loaded&lt;br /&gt;
2. `python treestore.py uri`&lt;br /&gt;
* lists tree URIs in the triple store&lt;br /&gt;
* will show something along the line: &amp;quot;Peters2011hymenoptera   http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera#tree0000001&amp;quot;&lt;br /&gt;
3. `rdfcat -out N-TRIPLE annotations.rdf &amp;gt; annotations.ntriples`&lt;br /&gt;
* takes annotations (saved with Protege as RDF/XML, Turtle, or other format)&lt;br /&gt;
* outputs N-Triples&lt;br /&gt;
4. `python treestore.py add annotations.ntriples ntriples http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera`&lt;br /&gt;
* adds the annotations to the named graph `http://phylotastic.org/hack2/bd414f8f72a8fabb9454b4ea72cf0e8a760171ba/Peters2011hymenoptera`&lt;br /&gt;
* the URI for the named graph is the URI returned by `python treestore.py uri` up to the `#` character&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4241</id>
		<title>Advancing MIAPA</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=Advancing_MIAPA&amp;diff=4241"/>
		<updated>2013-01-31T23:03:34Z</updated>

		<summary type="html">&lt;p&gt;Hilmar: /* MIAPA */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== Factors shaping our conception of source-tree annotations ===&lt;br /&gt;
&lt;br /&gt;
# what folks in the evoinfo community believe is the Minimal Information About a Phylogenetic Analysis (MIAPA)&lt;br /&gt;
#* The current synopsis of this is the MIAPA checklist from teh 2011 TDWG meeting. &lt;br /&gt;
# need to support assignment of credit (blame) for tree-producers and phylotastic service-providers &lt;br /&gt;
# need to support licensing that protects creators, resource-providers and end-users&lt;br /&gt;
# need to contribute to a credible provenance report for phylotastic-generated trees&lt;br /&gt;
#* e.g., a tree might be returned with information as follows (free text form): &amp;quot;This tree was obtained on Jan 29, 2013.  An input list of 58 names was submitted to Taxosaurus, resulting in 45 valid species binomials ( list ).  This list was sent to a pruner with instructions to prune out the indicated species from the phylogeny of Bininda-Emonds, et al. 2007.  The resulting sub-tree with 40 species was scaled using teh DateLife service. &amp;quot;  &lt;br /&gt;
#  support the most common user criteria for phylotastic searching &lt;br /&gt;
#* limits on source trees &lt;br /&gt;
#** '''return tree with maximal coverage of list ::= { species }'''&lt;br /&gt;
#*** specify namespace of &amp;lt;list&amp;gt; &lt;br /&gt;
#** restrict by publication status (published or not)&lt;br /&gt;
#** restrict by publication year (e.g., no trees older than 5 years)&lt;br /&gt;
#** restrict by author (e.g., author = bininda-emonds) &lt;br /&gt;
#** restrict by other citation information (e.g., jrnl = index fungorum)&lt;br /&gt;
#** restrict by method&lt;br /&gt;
#*** '''use (exclude) trees made with method class = { parsimony, likelihood, Bayesian, distance, supertree, supermatrix, hand-crafted. . . } '''&lt;br /&gt;
#*** use (exclude)  trees made with evolutionary model = {  }&lt;br /&gt;
#*** '''use (exclude) trees made with software = { RaXML, BEAST, PAUP*, . . . }'''&lt;br /&gt;
#*** other restrictions (e.g.,  supertree, . . . )&lt;br /&gt;
#** restrict by availability of source data such as character matrix&lt;br /&gt;
#*** restrict by minimum number of characters in matrix &lt;br /&gt;
#** '''restrict by type of source data = { molecular, morphological, mixed } '''&lt;br /&gt;
#** restrict by annotation level = { platinum, gold, silver, bronze, polystyrene } &lt;br /&gt;
#** require feature&lt;br /&gt;
#*** '''require support values'''&lt;br /&gt;
#*** require rooting &lt;br /&gt;
#*** '''require branch lengths'''&lt;br /&gt;
#*** require fully resolved&lt;br /&gt;
#*** require other features &lt;br /&gt;
#* limits on phylotastic manipulations &lt;br /&gt;
#** disallow species substitution &lt;br /&gt;
#** disallow grafting of source trees&lt;br /&gt;
#** scaling: provide median age estimates &lt;br /&gt;
#** scaling: provide only lower age estimates&lt;br /&gt;
#** TNRS: prohibit fuzzy matching&lt;br /&gt;
#** TNRS: use matches from source = { NCBI, ITIS, et }&lt;br /&gt;
# support adequate annotation of the provenance of the resulting phylotastic tree&lt;br /&gt;
#* imagine that you have to publish an analysis using a phylogenetic tree-- how do you describe it?  &amp;quot;The phylogeny of 40 mammal species was obtained via phylotastic (phylotastic.org &lt;br /&gt;
#* citation information&lt;br /&gt;
&lt;br /&gt;
=== Lessons learned from tree-finding and annotation ===&lt;br /&gt;
&lt;br /&gt;
* data frequently is not readily accessible online &lt;br /&gt;
** e.g. Jetz one must go to birdtree.org&lt;br /&gt;
** e.g., we obtained hymenoptera tree by pers. communication from Peters&lt;br /&gt;
** only one of the studies has trees in TreeBASE (GEBA)&lt;br /&gt;
** another study has a tree in Dryad (Smith)&lt;br /&gt;
* authors often don't provide minimal information explicitly&lt;br /&gt;
** whether or not a tree is rooted is usually not explicit&lt;br /&gt;
*** e.g., taxonomic hierarchies (APG, ToLWeb, NCBI) imply rootedness&lt;br /&gt;
*** e.g., Peters, et al 2011 never invokes the term &amp;quot;root&amp;quot; but we infer rooting from the description of outgroups not present in the final tree&lt;br /&gt;
** the methodology is largely unexplained for curated or hand-crafted taxonomic frameworks like APGIII or NCBI&lt;br /&gt;
* a study frequently has several trees by slightly different methods, but the checklist seems to imply one method&lt;br /&gt;
** e.g., Goloboff molecules only vs. molecules with morphology&lt;br /&gt;
** e.g., Bininda-Emonds best dates vs min dates vs max dates &lt;br /&gt;
* sometimes a study has many trees that all represent outputs of the same method&lt;br /&gt;
** e.g., Jetz provide a large sample from the posterior distribution &lt;br /&gt;
* process of constructing tree does not follow sequences--&amp;gt; alignment--&amp;gt; tree&lt;br /&gt;
** e.g., supertree method in Bininda-Emonds&lt;br /&gt;
** e.g., hand-crafted APG, Smith, trees&lt;br /&gt;
* process of constructing tree cannot be condensed easily &lt;br /&gt;
** e.g., Goloboff, iterative procedure with divide-and-conquer search to find parsimony tree (&amp;quot;tree fuse&amp;quot; &amp;amp; &amp;quot;sector&amp;quot; search repeated)&lt;br /&gt;
** GEBA tree, { missing explanation }&lt;br /&gt;
** partitioned alignment (e.g., Smith, et al), but miapa implies &amp;quot;model of evolution&amp;quot; as though there were only one, whereas in Peters, et al there are 2 models for 2 partitions&lt;br /&gt;
* clustering to define orthologs not included in checklist, but seems important &lt;br /&gt;
** Smith phlawd, alignment: no pre-orthology.  &lt;br /&gt;
* mixed data is common&lt;br /&gt;
** Goloboff has morpho and molecular &lt;br /&gt;
** multiple studies have DNA (e.g., SSU rDNA) and protein sequences&lt;br /&gt;
* concatenated alignments are common, e.g., multiple proteins &lt;br /&gt;
** this means accession:OTU mapping is not 1:1 but many to one&lt;br /&gt;
* not encountered in our inputs but sometimes the OTU is &amp;lt;genus_sp&amp;gt; and the data are fused from multiple species (this is common in MorphoBank)&lt;br /&gt;
* many important trees  do not have branch lengths&lt;br /&gt;
** e.g., APGIII is a taxonomic framework&lt;br /&gt;
** e.g., some supertrees don't have branch lengths &lt;br /&gt;
* do binomials count as meaningful external identifiers for OTUs?  &lt;br /&gt;
** in some cases, the methods make clear that these come from a specific source &lt;br /&gt;
*** e.g., Goloboff names clearly come directly from NCBI via their bioinfo pipeline&lt;br /&gt;
*** e.g., Bininda-Emonds publication declares that naming authority is Wilson &amp;amp; Roeder (Mammal Species of the World)&lt;br /&gt;
*** e.g., NCBI taxonomy comes as a database dump wth taxids and synonyms, so it represents its own authority&lt;br /&gt;
** usually the naming authority is not clear&lt;br /&gt;
* was any study straightforward? &lt;br /&gt;
* OTUs checklist question may be redundant: why have external identifier and then ask for collections information?  &lt;br /&gt;
* OTU external refs in checklist: not applicable for supertree methods, consensus methods, hand-crafted&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== MIAPA Resources ==&lt;br /&gt;
* The main [[MIAPA]] page.  Primarily:  &lt;br /&gt;
** [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Reconciled_draft_checklist MIAPA draft checklist] from TDWG 2011 workshop.&lt;br /&gt;
** Emily McTavish's [https://docs.google.com/a/utexas.edu/spreadsheet/ccc?key=0Av6YLPeop2n0dGZiMjNGS2NleWNvQlVSeThVQXdSdmc#gid=0 Tree Annotation Vocabulary] as resulting from the [[Phylotastic I]] hackathon. Includes a mapping to MIAPA draft checklist attributes. &lt;br /&gt;
** [http://www.slideshare.net/ElliottHauser/phylogenetics-data-provenance-survey-results Slideshow] with some results from the [http://wiki.tdwg.org/twiki/bin/view/Phylogenetics/MIAPADraft#Community_survey MIAPA community survey] orchestrated by the Open Tree of Life project in fall 2012.&lt;br /&gt;
* Ontologies &lt;br /&gt;
** [http://bioportal.bioontology.org/ontologies/49276?p=terms phylont] from [http://arnetminer.org/publication/phylont-a-domain-specific-ontology-for-phylogeny-analysis-3601839.html;jsessionid=E93A88FA38E826E1113A957BBA5BB352.tt Maryam Panahiazar, et al]&lt;br /&gt;
** [http://www.evolutionaryontology.org/cdao CDAO]&lt;br /&gt;
* Development:&lt;br /&gt;
** [http://github.com/miapa/miapa MIAPA repo on Github]&lt;br /&gt;
** Template development after the [https://github.com/phylotastic/treestore/tree/master/terms TNRS ontology] developed at (and after) Phylotastic I.&lt;br /&gt;
&lt;br /&gt;
== Annotation ==&lt;br /&gt;
* representation of metadata: [http://www.nexml.org NeXML]&lt;br /&gt;
* some stuff that Rutger did mapping the metadata from ToLWeb XML format onto semantic annotations in NeXML.&lt;br /&gt;
** ToLWeb XML described here: http://tolweb.org/tree/home.pages/downloadtree.html&lt;br /&gt;
** a simple script that does the conversion:  https://github.com/ncbnaturalis/bio-phylo/blob/master/experimental/tolconvert.pl&lt;br /&gt;
** Here's an example input file: https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol.xml&lt;br /&gt;
** Here's the resulting output file (indented): https://raw.github.com/ncbnaturalis/bio-phylo/master/experimental/tol-nexml-pp.xml&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
</feed>