<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://www.evoio.org/w/index.php?action=history&amp;feed=atom&amp;title=MIAPA%2FDemonstration_Project</id>
	<title>MIAPA/Demonstration Project - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://www.evoio.org/w/index.php?action=history&amp;feed=atom&amp;title=MIAPA%2FDemonstration_Project"/>
	<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MIAPA/Demonstration_Project&amp;action=history"/>
	<updated>2026-05-16T07:14:04Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.41.1</generator>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MIAPA/Demonstration_Project&amp;diff=1600&amp;oldid=prev</id>
		<title>Hilmar at 14:07, 1 August 2011</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MIAPA/Demonstration_Project&amp;diff=1600&amp;oldid=prev"/>
		<updated>2011-08-01T14:07:28Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 14:07, 1 August 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l189&quot;&gt;Line 189:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 189:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;That's where we ran out of time.  Next meeting: Friday, Feb 25, 10:00 am EST.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;That's where we ran out of time.  Next meeting: Friday, Feb 25, 10:00 am EST.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-deleted&quot;&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[Category:MIAPA]]&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
	<entry>
		<id>https://www.evoio.org/w/index.php?title=MIAPA/Demonstration_Project&amp;diff=1599&amp;oldid=prev</id>
		<title>Hilmar: moved DemonstrationProject to MIAPA/Demonstration Project</title>
		<link rel="alternate" type="text/html" href="https://www.evoio.org/w/index.php?title=MIAPA/Demonstration_Project&amp;diff=1599&amp;oldid=prev"/>
		<updated>2011-08-01T13:56:58Z</updated>

		<summary type="html">&lt;p&gt;moved &lt;a href=&quot;/wiki/DemonstrationProject&quot; class=&quot;mw-redirect&quot; title=&quot;DemonstrationProject&quot;&gt;DemonstrationProject&lt;/a&gt; to &lt;a href=&quot;/wiki/MIAPA/Demonstration_Project&quot; title=&quot;MIAPA/Demonstration Project&quot;&gt;MIAPA/Demonstration Project&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= Demonstration Project, Spring 2011 = &lt;br /&gt;
&lt;br /&gt;
== Quick overview ==&lt;br /&gt;
&lt;br /&gt;
Build a tool that will allow users to create a NeXML file with minimal information to document a phylogenetic analysis.  &lt;br /&gt;
# start by populating CDAO with a rich set of terms from various sources &lt;br /&gt;
# work out NeXML representation of methods concepts using CDAO terms (also OBI?)&lt;br /&gt;
# develop a web form that allows users to create annotations, output NeXML&lt;br /&gt;
# use natural-language workflow descriptions from papers to guide development and testing&lt;br /&gt;
&lt;br /&gt;
== A big-picture strategy that includes this project ==&lt;br /&gt;
&lt;br /&gt;
'''under construction'''&lt;br /&gt;
&lt;br /&gt;
Here is my vision (AS) for the long-term project to develop MIAPA while building support for data re-use.  There are two stages focused on support for archiving.  The work on re-use depends on the archiving work, to some extent.  &lt;br /&gt;
&lt;br /&gt;
# Archiving, first stage - Demonstration.  We build submission and query tools to show what is possible.  The resulting tools may not be very useful to users, but they provide a platform for further work.  &lt;br /&gt;
#* proof-of-concept based on phenote (Arlin, 2008)&lt;br /&gt;
#* a web-based tool demonstration tool to create an annotated record (Maryam, spring of 2011)&lt;br /&gt;
#** loads vocabulary terms for sources and methods&lt;br /&gt;
#** provides term-completion based on the loaded vocabularies &lt;br /&gt;
#** provides slots for specific types of MIAPA annotations&lt;br /&gt;
#** provides support for term requests&lt;br /&gt;
#** outputs in NeXML?  (ok for Dryad, but not currently supported for input by TreeBASE)&lt;br /&gt;
#* a demonstration architecture for annotation and submission (GSOC projects, summer of 2011?)&lt;br /&gt;
#** web services protocol for submitting a phylogenetic record to an archive&lt;br /&gt;
#** annotation tool with client capacity to submit record &lt;br /&gt;
#** modify TreeBASE or Dryad to supply server capacity to receive record&lt;br /&gt;
# Archiving, second stage - Build up user base.  By responding to user needs and adding intelligence, we create a submission tool that is useful both to archives and to users.  Meanwhile, we are using this same tool to harvest information on user needs.  &lt;br /&gt;
#* respond to user needs &lt;br /&gt;
#** use MIAPA survey to identify key use-cases and annotation needs (MIAPA survey team, spring 2011)&lt;br /&gt;
#** work with users to build annotation support for key use cases &lt;br /&gt;
#* lay the foundation for applying intelligent methods (Enrico and Arlin, CREST proposal, 2011)&lt;br /&gt;
#** build out a formal ontology for methods annotation &lt;br /&gt;
#*** include a high-level concept of workflow&lt;br /&gt;
#** harvest annotations from submitted records &lt;br /&gt;
#** apply NLP methods to harvest methods annotations from publications&lt;br /&gt;
#* incorporate intelligence into submission tool &lt;br /&gt;
#** extract candidate annotations from Methods text &lt;br /&gt;
#** use planning concepts to detect errors and gaps, suggest corrections&lt;br /&gt;
# Technology to support re-use (Enrico and Arlin, CREST proposal, 2011).  The aim of this stage is to develop a system that can compile vague workflow descriptions into executable plans, allowing the user to apply the plan to a custom set of data.&lt;br /&gt;
&lt;br /&gt;
== Resources ==&lt;br /&gt;
&lt;br /&gt;
* phylogenetic analysis software &amp;amp; methods &lt;br /&gt;
** [http://evolution.genetics.washington.edu/phylip/software.html Joe's extensive online list])&lt;br /&gt;
** entries from Brian O'Meara's [http://www.treetapper.org treetapper] project&lt;br /&gt;
** the [http://www.sinauer.com/detail.php?id=1775#tents table of contents] from Felsenstein's _Inferring Phylogenies_&lt;br /&gt;
* file formats (see extensive list in [http://www.bioperl.org/wiki/Category:Formats BioPerl docs])&lt;br /&gt;
* alignment software (list in [http://www.bioperl.org/wiki/Category:Run_Modules BioPerl Run modules], amazing [http://en.wikipedia.org/wiki/Sequence_alignment_software wikipedia list of alignment software] )&lt;br /&gt;
* services from the mygrid project services ontology (here is the [http://www.mygrid.org.uk/ontology/ owl file in RDF-XML] and here is an image of part of the class hierarchy: [[image:Mygrid_services.jpg]])&lt;br /&gt;
&lt;br /&gt;
== Notes from meetings ==&lt;br /&gt;
&lt;br /&gt;
===  May 20, 2011 ===&lt;br /&gt;
&lt;br /&gt;
'''present:'''Jim, Maryam, Enrico, Rutger, Arlin&lt;br /&gt;
&lt;br /&gt;
discussed case, and how to handle other cases.  &lt;br /&gt;
&lt;br /&gt;
had to use skype, experienced major problems with this.&lt;br /&gt;
&lt;br /&gt;
===  April 15, 2011 ===&lt;br /&gt;
'''present:''' Jim, Maryam, Enrico, Brandon, Arlin&lt;br /&gt;
&lt;br /&gt;
=== March 18, 2011 ===&lt;br /&gt;
&lt;br /&gt;
'''present:''' Arlin, Maryam, Jim, Eric, Rutger&lt;br /&gt;
&lt;br /&gt;
1. review of demo project &lt;br /&gt;
* inference methods &lt;br /&gt;
** term list from TreeBASE&lt;br /&gt;
** Joe Felsenstein's &amp;quot;Inferring Phylogenies&amp;quot; &lt;br /&gt;
*** get e copy from Joe?  Arlin will do this&lt;br /&gt;
** search for papers.  Arlin will do this &lt;br /&gt;
* review project plan -- see 4 March notes &lt;br /&gt;
&lt;br /&gt;
2. iEvoBio presentations &lt;br /&gt;
* possible lightning talk on Maryam's demo project &lt;br /&gt;
* full talk deadline is next week &lt;br /&gt;
** one talk on MIAPA &amp;amp; related projects (Jim leads)  &lt;br /&gt;
** another talk on publishing trees practices (Arlin leads)&lt;br /&gt;
&lt;br /&gt;
=== March 4, 2011 ===&lt;br /&gt;
&lt;br /&gt;
'''present:''' Arlin, Jim, Maryam, Enrico, Vivek&lt;br /&gt;
&lt;br /&gt;
Agenda: sort out project ideas for spring (Maryam) and summer (GSOC)&lt;br /&gt;
# Maryam's project &lt;br /&gt;
#* start by populating CDAO with terms &lt;br /&gt;
#* work out NeXML representation of methods concepts using CDAO terms (also OBI?)&lt;br /&gt;
#* work on submission form to make NeXML file &lt;br /&gt;
#* use papers from prior literature to harvest natural-language workflow descriptions &lt;br /&gt;
# possible GSOC projects&lt;br /&gt;
#* graphical UI for constructing workflow descriptions&lt;br /&gt;
#** see http://exon.niaid.nih.gov/mobyleWorkflow/ &lt;br /&gt;
#** Vivek is willing to mentor &lt;br /&gt;
#** successful applicant knows Java, ideally GWT (google web toolkit) and Jena&lt;br /&gt;
#** phylogeny experience not necessary &lt;br /&gt;
#** use library of papers from previous project &lt;br /&gt;
#** feedback via informal user testing, comment box &lt;br /&gt;
#** open issue: integrate with existing codebase (Mesquite? TreeBASE?)&lt;br /&gt;
#* implement NeXML submission in TreeBASE&lt;br /&gt;
#** Rutger agreed to be co-mentor &lt;br /&gt;
#* develop web services protocol for phylo record submission &lt;br /&gt;
#** maybe preconditions will not be met by this summer&lt;br /&gt;
#* NLP analysis of methods sections of papers&lt;br /&gt;
#** ratio of analysis to programming is too high for a GSOC&lt;br /&gt;
&lt;br /&gt;
Action items: &lt;br /&gt;
* Vivek, Enrico &amp;amp; Arlin write GSOC  description by Mar 11&lt;br /&gt;
* Arlin try to find co-mentor NeXML submission project by Mar 11&lt;br /&gt;
* put library of papers issue on agenda for next meeting &lt;br /&gt;
* Jim to let Eric know what's happening&lt;br /&gt;
&lt;br /&gt;
=== February 25, 2011 ===&lt;br /&gt;
&lt;br /&gt;
'''present:''' Arlin, Jim, Maryam (10:15)&lt;br /&gt;
&lt;br /&gt;
* deliverables  &lt;br /&gt;
** search interface for TreeBASE&lt;br /&gt;
** submission interface (annotation) for TreeBASE&lt;br /&gt;
&lt;br /&gt;
sub-searches based methods annotation &lt;br /&gt;
* hierarchy, term-completion &lt;br /&gt;
* distribution of trees by method&lt;br /&gt;
* download linked pubs and collect matching terms to test completion? &lt;br /&gt;
* developer access to treebase code &lt;br /&gt;
&lt;br /&gt;
standalone search tool (GSOC)? &lt;br /&gt;
* web services API &lt;br /&gt;
&lt;br /&gt;
Submission tool &lt;br /&gt;
* making it easy for user &lt;br /&gt;
** recognize source data&lt;br /&gt;
** paste methods section, match terms, supply to user &lt;br /&gt;
** start with templates from existing treebase entries &lt;br /&gt;
** following methods from a previous publication &lt;br /&gt;
&lt;br /&gt;
Standalone tool (GSOC? )&lt;br /&gt;
* create nexml file &lt;br /&gt;
* TB nexml upload (basic) &lt;br /&gt;
* TB nexml process methods annotations into text statement&lt;br /&gt;
&lt;br /&gt;
=== February 18, 2011 ===&lt;br /&gt;
&lt;br /&gt;
present: Arlin, Jim L-M, Maryam (10:20?)&lt;br /&gt;
&lt;br /&gt;
Context: &lt;br /&gt;
* Maryam available until mid-May&lt;br /&gt;
* project outcome could support ABI proposal in July &lt;br /&gt;
* could coordinate with possible GSOC proposal&lt;br /&gt;
&lt;br /&gt;
discussion about ontology development.  2 mistaken presumptions&lt;br /&gt;
* encoding domain knowledge of experts is enough (wrong: experts literally don't know what they are talking about when it comes to key philosophical distinctions)&lt;br /&gt;
* proper ontology has only context-independent universals (wrong in practice; just creates an elaborate system of pseudo-universals)&lt;br /&gt;
&lt;br /&gt;
driving biological problem or use-case&lt;br /&gt;
* pre-condition: all those trees out there&lt;br /&gt;
* 1. estimate species tree by combining gene trees (systematics use case)&lt;br /&gt;
* 2. identify orthologs or duplication histories using gene tree (mol biology use case)&lt;br /&gt;
&lt;br /&gt;
so, let's imagine a user scenario &lt;br /&gt;
* pre-requisite: list of 8 species, user wants species tree with these, possibly some others&lt;br /&gt;
* user searches resources with list, gets hits&lt;br /&gt;
** subcase1: finds a species tree with all 8 species&lt;br /&gt;
*** user may wish to prune if there are too many other species &lt;br /&gt;
*** user is done &lt;br /&gt;
** subcase2: finds gene trees with all 8 species &lt;br /&gt;
*** user may wish to select &amp;quot;best&amp;quot; tree&lt;br /&gt;
*** user needs to run reconciliation software &lt;br /&gt;
** subcase3: finds a set of trees with overlapping sub-sets of species (e.g., ABCDE, CDEFG, EFGHI)&lt;br /&gt;
*** this case calls for supertree construction &lt;br /&gt;
&lt;br /&gt;
but (Jim says), we don't want to get bogged down in reconciliation&lt;br /&gt;
&lt;br /&gt;
but (Arlin says), it may be sufficient (for demonstration purposes) to offer the user &lt;br /&gt;
* the right input trees for reconciliation &lt;br /&gt;
* a canned workflow description for reconciliation &lt;br /&gt;
* a third-party service that will execute the workflow description on the input trees&lt;br /&gt;
&lt;br /&gt;
ok, we decide to pursue a simpler scenario &lt;br /&gt;
* pre-requisite: list of 1 gene, E. coli CAP&lt;br /&gt;
* user searches for tree with target gene, gets hits&lt;br /&gt;
** user chooses by criteria (method, bootstraps, etc. . . )&lt;br /&gt;
** user is done &lt;br /&gt;
&lt;br /&gt;
the above could be done on a resource that aggregates from other resources (TreeBASE, Pandit, TreeFams, etc).  However, an ''even simpler use-case'' would be just to provide an interface to whatever useful information is in TreeBASE.  &lt;br /&gt;
&lt;br /&gt;
That's where we ran out of time.  Next meeting: Friday, Feb 25, 10:00 am EST.&lt;/div&gt;</summary>
		<author><name>Hilmar</name></author>
	</entry>
</feed>