MediaWiki API result

This is the HTML representation of the JSON format. HTML is good for debugging, but is unsuitable for application use.

Specify the format parameter to change the output format. To see the non-HTML representation of the JSON format, set format=json.

See the complete documentation, or the API help for more information.

{
    "batchcomplete": "",
    "continue": {
        "gapcontinue": "SADI",
        "continue": "gapcontinue||"
    },
    "warnings": {
        "main": {
            "*": "Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/postorius/lists/mediawiki-api-announce.lists.wikimedia.org/> for notice of API deprecations and breaking changes."
        },
        "revisions": {
            "*": "Because \"rvslots\" was not specified, a legacy format has been used for the output. This format is deprecated, and in the future the new format will always be used."
        }
    },
    "query": {
        "pages": {
            "131": {
                "pageid": 131,
                "ns": 0,
                "title": "Reuse Barriers",
                "revisions": [
                    {
                        "contentformat": "text/x-wiki",
                        "contentmodel": "wikitext",
                        "*": "This page is a developing list of practical barriers to reuse of evolutionary data, in particular phylogenetic data and trees. Since digital access to trees (and data) is normally the starting point for reuse, barriers to the digital availability, findability, and free access to trees, such as barriers to deposition in a digital archive, are deliberately included in the scope here.\n\n== Incorrect or lack of support of exchange standards ==\n\nThe most widely used exchange standard for phylogenetic data is NEXUS. NEXUS files are difficult to validate for compliance with the standard, and some programs either don't support the standard, or their support is incorrect. Digital archives cannot support all possible exchange formats and the variations of their implementations. For example, TreeBASE has had to resort to supporting only NEXUS files produced by the Mesquite tool. This makes digitally archiving and sharing of trees for reuse burdensome and error-prone for users of all other programs.\n\nExamples:\n* WinClada sometimes produces faulty NEXUS files. For example, WinClada's method of indexing character labels and states is wrong.\n** Symptom: \"I keep encountering an error when I try to upload the file to TreeBASE. The file has been created by WinClada.\"\n** Recommended workaround: Get Mesquite to open the file -- check that everything looks right -- and then have Mesquite save the file. Submit to TreeBASE.\n\n* MEGA does not support NEXUS\n** Recommended work-around:  MEGA allows exporting to Newick, however unfortunately this excludes branch lengths. A program like PAUP can recalculate the branch lengths. Follow these steps:\n**# Export the tree to Newick. e.g., let's say that you get the following: \"((Apple,Pear),Banana)\"\n**# Edit the Newick using a text editor so that you create a TREE BLOCK:<div style=\"float:right; margin-left: 5px;\"><code>BEGIN TREES;<br/>   TREE Fig_3 = ((Apple,Pear),Banana);<br/>END;</code><br/>(note the use of semi-colons).</div>\n**# Copy/paste this TREE BLOCK at the end of your NEXUS file. \n**# For re-calculating branch lengths, follow the steps below. Otherwise, open the NEXUS file in Mesquite and check that there are no errors. Re-root your tree as needed. Edit taxon labels as needed (scientific names must be written out in full). Save the file. This can be uploaded to TreeBASE.\n**# To re-calculate branch lengths:\n**## Copy/paste this PAUP BLOCK to the end of the NEXUS file:<div style=\"float:right; margin-left: 5px;\"><code>BEGIN PAUP;<br/>   set criterion=likelihood;<br/>   lset nst=2 basefreq=empirical variant=hky rates=gamma ncat=4 shape=estimate pinvar=estimate;<br/>   lscores 1 / sitelikes;<br/>   savetrees file=foo.tre brlens;<br/> END;</code></div>\n**## Edit the lset command to match your substitution model. \n**## Execute the NEXUS file in PAUP. This should generate a new tree file called foo.tre, containing branch lengths.\n**## Remove the old tree block and the paup block from your NEXUS.\n**## Open the NEXUS in Mesquite, and do \"Include File...\" from the File menu to import your foo.tre file. \n**## Save the file."
                    }
                ]
            },
            "121": {
                "pageid": 121,
                "ns": 0,
                "title": "Reuse Cases",
                "revisions": [
                    {
                        "contentformat": "text/x-wiki",
                        "contentmodel": "wikitext",
                        "*": "This page is for developing a list of use-cases for EvoIO- and MIAPA-relevant project planning.  It is *largely incomplete*.  We welcome anyone who wants to fill in some more cases, e.g., applications of supertrees in ecosystem analysis.  \n\n= Scope = \n\nThis page is for developing a list of use-cases for EvoIO- and MIAPA-relevant project planning.  The use-cases should focus on  '''re-use''', which might mean '''replication''', '''aggregation''', '''re-purposing''', '''meta-analysis''', '''integration''' (see below for a view of what these terms mean).\n\nWe want to enable and facilitate data re-use of phylogenetic data and metadata, which isn't happening often enough.  Because it isn't happening enough, it might be useful for us to consider ''hypothetical'' cases of re-use.  However, even for hypothetical cases, its very important to make every effort to document user needs, e.g., as expressed in published papers.  For instance, before TimeTree existed, a resource to aggregate phylogenetic dates was a hypothetical re-use case, but the user need for placing dates on nodes of trees was not hypothetical and could be documented easily. \n\nFor comparison, some other use-case lists are available:  \n* A list of [http://informatics.nescent.org/wiki/UseCases phylo-relevant use-cases] was developed for the 2006 NESCent hackathon \n* A different [http://evoinfo.nescent.org/Database_Interop_Hackathon/Use_Cases list oriented toward interoperability] was developed for the 2009 hackathon\n\nSome of the above use-cases (or variants of them) might be relevant here.\n\n= What constitutes re-use of data? = \n\nThe '''primary consumer''' of a scientific product typically is the '''primary producer''', e.g., an ecologist collects field observations and then uses these new observations to evaluate hypotheses or clarify patterns.   \"Re-use\" refers to the case when there is a '''secondary consumer'''.  \n\nBy \"data\", we mean information.  That is, for the present purposes, data are coded information, as distinct from the material products of research such as specimens and samples.  We do '''not''' mean ''data'' in the more restricted sense of ''fact, observation'', although this type of '''empirical data''' may be the most likely to be re-used.   Sharing data (unlike sharing materials) is an informatics problem.  \n\nThe general category of '''data re-use''' may cover a large number of diverse cases described with terms such as '''replication''', '''aggregation''', '''re-purposing''', '''meta-analysis''', and '''integration'''.  These do not seem to be distinct non-overlapping categories, but dimensions or qualities that may interpenetrate.  For instance, [http://www.genetics.org/cgi/content/short/170/4/1459 Yampolsky & Stoltzfus (2005)] combined data from 15 studies, comprising nearly 10,000 engineered amino acid exchanges, to generate the \"EX\" matrix of values representing the mean exchangeability from one amino acid to another. The authors clearly were '''secondary''' consumers: each underlying study was performed by the primary producer-consumers in order to map out regions of a protein most susceptible to amino acid changes. The study was described as a '''meta-analysis''' (in the sense of combining separate studies to address an issue beyond the scope or power of any individual study), and it clearly involves '''re-purposing''' (using results for a different goal), and '''aggregation''' (in the sense of combining results from multiple studies).  \n\nTo reiterate, these terms (aggregation, re-purposing, etc) do not seem to represent non-overlapping categories, but qualities or aspects that may apply in combinations.  Here is one person's (AS) interpretation of the terms (for a different view, see Fig. 1 of [http://onlinelibrary.wiley.com/doi/10.1111/j.1558-5646.2009.00892.x/abstract Sidlauskas, et al., 2010]): \n* '''study replication''' means verifying results or conclusions of a published study by repeating it.  Although the potential for study replication is integral to the self-policing nature of science, it happens only on the rare occasions when the published results of a study are perceived to be fraudulent or artefactual (e.g,. in recent memory, the \"memory water\" and \"directed mutations\"  cases). \n* '''aggregation''' means gathering large numbers of results of a precisely defined type.  Often the aggregator adds value in the process.  The Sepkoski marine fossil data set is an example.  TimeTree is an example that has more of a focus on making it easy for the user. \n* '''meta-analysis''' means combining several separate analyses to address issues beyond the scope or power of a single analysis.  This sometimes means a meta-statistical analysis (statistical meta-analysis), in which conclusions are based on combining, not the raw data from each study, but summary statistics from each study (e.g., means and variances) in a way that is sensitive to study design.  Supertree methods (for assembling composite trees from separate overlapping trees) are analogous, in that they combine trees rather than the underlying character data.  Sidlauskas, et al. use \"meta-analysis\" to refer to two studies that \"synthesized the results of hundreds of previous studies\" to show conclusively that climate change causes shifts in species distributions (something that individual studies could not establish conclusively).  \n* '''re-purposing''' means using the results of a study for a purpose other than that of the primary consumer.    \n* '''integration''' seems similar to \"synthesis\" but may have more of an implication of bringing together things that obviously belong together but have been kept separate for arbitrary reasons, e.g., combining data from different domains or different types of studies.  This kind of integration depends on ''integrating variables'' or keys by which data from separate studies are combined.  The integrating variable might be an accession number, a species name, a geographic location, etc.  \n* '''synthesis''' seems similar to \"integration\" but may have more of an implication of conceptual novelty and creativity, i.e., combining results in ways that were not imagined.  \n\nNote that aggregation and meta-analysis combine data from multiple studies ''of the same type''.  Synthesis and integration necessarily combine data from studies ''of different types''.  Study replication, by definition, deals with a single study.\n\n= About use-cases = \n\nA \"use case\" is a description, '''from the perspective of the user''' (not the developer), of a set of transactions intended to satisfy a particular category of user needs. Here is a formula for a use-case: \n* Name and description - brief overview\n* Motivation - why do researchers want to do this? \n* Ideal procedure \n** Preconditions - what does the user need to start with?  \n** Steps -what are the steps in a typical case? \n** Outcomes - what outcomes satisfy user needs? \n* Key challenges - what makes it difficult to do this today? \n* References - who does this, or wants to do it?\n\n= List of use-cases = \n\n== Supertree research == \n\n=== Name and description ===\nA supertree is defined as an estimate of phylogeny assembled from smaller phylogenies. These partial phylogenies (or source trees) must have some taxa in common, but not necessarily all. Modern supertrees can contain hundreds or thousands of taxa and are constructed from hundreds of source phylogenies requiring the collection of large amounts of data. These phylogenies are of great use in, for example, comparative biology, and macroevolutionary studies (quoting in verbatim Davis & Hill, 2010 [http://dx.doi.org/10.1186/1756-0500-3-95]).\n=== Motivation ===\nTo test hypotheses that require phylogenies that are of such great scale and breadth, that creation of a similarly-sized (taxa-wise) phylogeny by conventional methods would be far too difficult for a variety of reasons.\n=== Typical procedure ===\n* Preconditions \n** 'Source trees' : previously published hypotheses of evolutionary relationships for a group (taxa-wise) of choice. Generally topology-only data required but can vary depending on which exact method used. \n* Steps (after Davis & Hill, 2010)\n** 1. Data collection and entry \n** 2. Standardisation of terminal taxa\n** 3. Ensure source tree independence: Remove redundancy within the [meta]dataset that would otherwise unfairly up-weight data\n** 4. Check adequate taxonomic overlap of source trees\n** 5. Matrix creation: Create a matrix suitable for analysis\n* Outcomes \n** ???? A supertree estimate of phylogeny.\n=== Key challenges === \n* The lack of digitally-available tree topology-data in a recognised/standardised format (e.g. Newick) for most phylogenetic studies that have ever been published.\n** Most tree-topologies are generally published (in their original papers) graphically which isn't too helpful for re-use and re-purposing.\n** This problem is so widespread that ingenious methods (e.g. TreeSnatcher [http://dx.doi.org/10.1093/bioinformatics/btm438]) have been developed specifically to re-extract topology-data from published papers.\n* No standardisation of taxon names between studies, hence Step 2 (above).\n* Taxon sampling. Step 3 (above) is necessary because some taxa/groups are extremely 'popular' in phylogenetic studies, whilst others are only vary rarely included.  \n=== References === \nBeck, R., Emonds, O. B., Cardillo, M., Liu, F. G., and Purvis, A. 2006. A higher-level MRP supertree of placental mammals. BMC Evolutionary Biology 6:93+. [http://dx.doi.org/10.1186/1471-2148-6-93]\n\nBininda-Emonds, O. R. P., Gittleman, J. L., and Purvis, A. 1999. Building large trees by combining phylogenetic information: a complete phylogeny of the extant Carnivora (Mammalia). Biological Reviews 74:143\u2013175. [http://dx.doi.org/10.1111/j.1469-185X.1999.tb00184.x]\n\nCotton, J. and Wilkinson, M. 2009a. Supertrees join the mainstream of phylogenetics. Trends in Ecology & Evolution 24:1\u20133. [http://dx.doi.org/10.1016/j.tree.2008.08.006]\n\nDavies, T. J., Barraclough, T. G., Chase, M. W., Soltis, P. S., Soltis, D. E., and Savolainen, V. 2004. Darwin's abominable mystery: Insights from a supertree of the angiosperms. Proceedings of the National Academy of Sciences of the United States of America 101:1904-1909. [http://dx.doi.org/10.1073/pnas.0308127100]\n\nDavis, R. B., Baldauf, S. L., and Mayhew, P. J. 2010. Many hexapod groups originated earlier and withstood extinction events better than previously realized: inferences from supertrees. Proceedings of the Royal Society B: Biological Sciences 277:1597\u20131606. [http://dx.doi.org/10.1098/rspb.2009.2299]\n\nLloyd, G. T., Davis, K. E., Pisani, D., Tarver, J. E., Ruta, M., Sakamoto, M., Hone, D. W. E., Jennings, R., and Benton, M. J. 2008. Dinosaurs and the Cretaceous Terrestrial Revolution. Proceedings of the Royal Society B: Biological Sciences 275:2483-2490. [http://dx.doi.org/10.1098/rspb.2008.0715]\n\nRuta, M., Pisani, D., Lloyd, G. T., and Benton, M. J. 2007. A supertree of Temnospondyli: cladogenetic patterns in the most species-rich group of early tetrapods. Proceedings of the Royal Society B: Biological Sciences 274:3087-3095. [http://dx.doi.org/10.1098/rspb.2007.1250]\n\nSanderson, M. J., Purvis, A., and Henze, C. 1998. Phylogenetic supertrees: assembling the trees of life. Trends in Ecology and Evolution 13:105-109. [http://dx.doi.org/10.1016/S0169-5347(97)01242-1]\n\nThomas, G., Wills, M., and Szekely, T. 2004. A supertree approach to shorebird phylogeny. BMC Evolutionary Biology 4:28+. [http://dx.doi.org/10.1186/1471-2148-4-28]\n\n== Other meta-analyses (multiple phylodata re-use cases) == \n\n=== Name and description ===\nA 'catch-all' basket group for other non-supertree meta-analyses utilizing many phylogenetic datasets.\n=== Motivation ===\nTo test hypotheses over many independent datasets \n=== Typical procedure ===\n* Preconditions\n** Published cladistic data matrices in useable electronic formats (e.g. nexus) \n* Steps (varies depending on exact case) \n** first-order taxon jackknifing [http://dx.doi.org/10.1080/10635150701627296]\n** partitioning datasets and performing subsequent analyses on partitions to compare signal [http://dx.doi.org/10.1111/j.1096-0031.2009.00273.x] [http://www.citeulike.org/user/rossmounce/article/7853861]\n* Outcomes - what outcomes satisfy user needs? \n** ???? Interesting results\n=== Key challenges === \nThe poverty of morphological cladistic datasets digitally available in standardised formats (e.g. nexus). Particularly for non-botanical, non-mycological taxonomic groups.\nRelevent key challenges, barriers and the scale of the problem have been outlined in a talk recently (Mounce, 2010 @ The 12th Young Systematists' Forum [http://prezi.com/1s0lkatmc30t/the-continued-growth-of-phylogenetic-information/]).\nIt is even hard just to ''find'' relevant datasets - speaking from first-hand experience, if you attempt a literature search for cladistic data you'll get lots of false positives (e.g. Titles and Abstracts that refer to \"Systematics of...\" yet contain no primary phylogenetic analysis), AND false negatives (if you 'just' search for \"morphological systematics\" and \"cladist*\" you won't find all that's out there)!\n\n=== References === \n\nAndrea Cobbett, Mark Wilkinson, and Matthew A Wills (2007) Fossils Impact as Hard as Living Taxa in Parsimony Analyses of Morphology\nSyst Biol (2007) 56(5): 753-766 [http://dx.doi.org/10.1080/10635150701627296 doi:10.1080/10635150701627296]\n\nPrevosti, Francisco J., Chemisquy, Mar\u00eda A. (2010) The impact of missing data on real morphological phylogenies: influence of the number and distribution of missing entries. Cladistics 26(3):326-339. [http://dx.doi.org/10.1111/j.1096-0031.2009.00289.x doi:10.1111/j.1096-0031.2009.00289.x]\n\nSong, Hojun, Bucheli, Sibyl R. (2010) Comparison of phylogenetic signal between male genitalia and non-genital characters in insect systematics. Cladistics 26(1):23-35 [http://dx.doi.org/10.1111/j.1096-0031.2009.00273.x doi:10.1111/j.1096-0031.2009.00273.x]\n\nMounce, R. C. P. and M. A. Wills (2010). Congruence\u2029 between\u2029 cranial\u2029 and postcranial\u2029 characters\u2029 in \u2029vertebrate\u2029 systematics. Proceedings of the 29th Annual Meeting of the Willi Hennig Society. http://www.citeulike.org/user/rossmounce/article/7853861\n\nLiow, Lee Hsiang (2007) LINEAGES WITH LONG DURATIONS ARE OLD AND MORPHOLOGICALLY AVERAGE: AN ANALYSIS USING MULTIPLE DATASETS. Evolution 61(4):885-901  [http://dx.doi.org/10.1111/j.1558-5646.2007.00077.x doi:10.1111/j.1558-5646.2007.00077.x] ''interestingly, this last study uses phylogenetic data (cladistic data matrices of morphological characters) to create a 'morphospace' representation. \nNo actual phylogenetic analysis per se, but ample re-use of phylogenetic datasets...''\n\n== Another use-case == \n\n=== Name and description ===\nbrief overview\n=== Motivation ===\nwhy do researchers want to do this? \n=== Typical procedure ===\n* Preconditions -  what does the user need to start with?  \n** \n** \n* Steps - what are the steps in a typical case? \n** \n** \n* Outcomes - what outcomes satisfy user needs? \n** \n** \n=== Key challenges === \nwhat makes it difficult to do this today? \n=== References === \nwho does this, or wants to do it?\n\n[[Category:MIAPA]]"
                    }
                ]
            }
        }
    }
}