<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:prism="http://prismstandard.org/namespaces/1.2/basic/" version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/">
  <channel>
    <title>Nature Precedings - Tag feed for statistics</title>
    <link>http://precedings.nature.com/tags/statistics</link>
    <description>Recently posted documents tagged with 'statistics'</description>
    <dc:publisher>Nature Publishing Group</dc:publisher>
    <dc:language>en</dc:language>
    <prism:publicationName>Nature Precedings</prism:publicationName>
    <image>
      <title>Nature Precedings</title>
      <url>http://precedings.nature.com/images/header_logo.gif</url>
      <link>http://precedings.nature.com</link>
    </image>
    <atom:link type="application/rss+xml" rel="self" href="http://precedings.nature.com/tags/statistics/feed"/>
    <item>
      <title>Parallel routes of human carcinoma development: Implications of the age-specific incidence data</title>
      <link>http://precedings.nature.com/documents/3080/version/1</link>
      <description>The multi-stage hypothesis suggests that cancers develop through a single defined series of genetic alterations. This hypothesis was first suggested over 50 years ago based upon age-specific incidence data. However, recent molecular studies of tumors indicate that multiple routes exist to the formation of cancer, not a single route. This parallel route hypothesis has not been tested with age-specific incidence data.To test the parallel route hypothesis, I formulated it in terms of a mathematical equation, confirmed this equation with computer simulations, then tested whether this equation was consistent with age-specific incidence data compiled by the Surveillance Epidemiology and End Results (SEER) cancer registries since 1973. I used the chi-squared goodness of fit test to measure consistency.I found that the age-specific incidence data from most human carcinomas, including those of the colon, lung, prostate, and breast were consistent with the parallel route hypothesis. However, this hypothesis is only consistent if an immune sub-population exists, one that will never develop carcinoma. Furthermore, breast carcinoma has two distinct forms of the disease, and one of these occurs at significantly different rates in different racial groups.</description>
      <guid>http://precedings.nature.com/documents/3080/version/1</guid>
      <pubDate>Fri, 17 Apr 2009 21:06:50 UTC</pubDate>
      <dc:title>Parallel routes of human carcinoma development: Implications of the age-specific incidence data</dc:title>
      <dc:identifier>hdl:10101/npre.2009.3080.1</dc:identifier>
      <dc:date>2009-04-17</dc:date>
      <dc:creator>James Brody</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2009-04-17T21:06:50Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Cancer</prism:section>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/3080/version/1/files/npre20093080-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>The Gloomy Prospect Wins: Statistical Significance and Population Stratification in Genome Wide Association Studies</title>
      <link>http://dx.doi.org/10.1038/npre.2008.2642.1</link>
      <description>This one-hour lecture by Dr. Eric Turkheimer of  the University of Virginia&amp;#8217;s Department of Psychology explored the following:The contemporary era has seen a convergence of genomic technology and traditional social scientific concerns with complex human individual differences. Rather than finally turning social science into a replicable hard-scientific enterprise, genomics has gotten bogged down in the long-standing frustrations of social science. A recent report of an extensive genome wide association study of human height demonstrates the profound difficulties of explaining uncontrolled human variation at a genomic level. The statistical technologies that have been brought to bear on the problem of genomic association are simply modifications of similar methods that have been used by social scientists for decades, with little success. The motivation for the statistical methods in genomics is the same as it is in traditional social science: An attempt to discern linear causation in complex systems when experimental control is not possible.For an audio recording of Dr. Turkheimer&amp;#8217;s lecture, please visit http://cirge.stanford.edu/activities/events.html.</description>
      <guid>http://dx.doi.org/10.1038/npre.2008.2642.1</guid>
      <pubDate>Mon, 15 Dec 2008 17:02:39 UTC</pubDate>
      <dc:title>The Gloomy Prospect Wins: Statistical Significance and Population Stratification in Genome Wide Association Studies</dc:title>
      <dc:identifier>doi:10.1038/npre.2008.2642.1</dc:identifier>
      <dc:date>2008-12-15</dc:date>
      <dc:creator>Eric Turkheimer</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-12-15T17:02:39Z</prism:publicationDate>
      <prism:category>Presentation</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2642/version/1/files/npre20082642-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>Signi&#64257;cance tests for comparing digital gene  expression pro&#64257;les</title>
      <link>http://precedings.nature.com/documents/2002/version/3</link>
      <description>Most of the statistical tests currently used to detect differentially expressed genes are based on asymptotic results, and perform poorly for low expression tags. Another problem is the common use of a single canonical cutoff for the significance level (p-value) of all the tags, without taking into consideration the type II error and the highly variable character of the sample size of the tags.This work reports the development of two significance tests for the comparison of digital expression profiles, based on frequentist and Bayesian points of view, respectively. Both tests are exact, and do not use any asymptotic considerations, thus producing more correct results for low frequency tags than the chi-square test. The frequentist test uses a tag-customized critical level which minimizes a linear combination of type I and type II errors.  A comparison of the Bayesian and the frequentist tests revealed that they are linked by a Beta distribution function. These tests can be used alone or in conjunction, and represent an improvement over the currently available methods for comparing digital profiles.</description>
      <guid>http://precedings.nature.com/documents/2002/version/3</guid>
      <pubDate>Fri, 29 Aug 2008 21:08:06 UTC</pubDate>
      <dc:title>Signi&#64257;cance tests for comparing digital gene  expression pro&#64257;les</dc:title>
      <dc:identifier>hdl:10101/npre.2008.2002.3</dc:identifier>
      <dc:date>2009-09-03</dc:date>
      <dc:creator>Leonardo Varuzza</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2008-08-29T21:08:06Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/2002/version/3/files/npre20082002-3.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/3.0/</creativeCommons:license>
    </item>
    <item>
      <title>The Normal Fetal Heart Rate Study: Analysis Plan</title>
      <link>http://dx.doi.org/10.1038/npre.2007.980.2</link>
      <description>Recording of fetal heart rate via CTG monitoring has been routinely performed as an important part of antenatal and subpartum care for several decades. The current guidelines of the FIGO (ref1) recommend a normal range of the fetal heart rate from 110 to 150 bpm. However, there is no agreement in the medical community whether this is the correct range (ref2). We aim to address this question by computerized analysis (ref 3) of a high quality database (HQDb, ref 4) of about one billion electronically registered fetal heart rate measurements from about 10,000 pregnancies in three medical centres over seven years. In the present paper, we lay out a detailed analysis plan for this evidence-based project in the vein of the validation policy of the Sylvia Lawry Centre for Multiple Sclerosis Research (ref 5) with a split of the database into an exploratory part and a part reserved for validation. We will perform the analysis and the validation after publication of this plan in order to reduce the probability of publishing false positive research findings (ref 6-7).</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.980.2</guid>
      <pubDate>Mon, 12 Nov 2007 16:48:34 UTC</pubDate>
      <dc:title>The Normal Fetal Heart Rate Study: Analysis Plan</dc:title>
      <dc:identifier>doi:10.1038/npre.2007.980.2</dc:identifier>
      <dc:date>2007-11-12</dc:date>
      <dc:creator>Martin Daumer</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-11-12T16:48:34Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/980/version/2/files/npre2007980-2.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>The Normal Fetal Heart Rate Study: Analysis Plan</title>
      <link>http://dx.doi.org/10.1038/npre.2007.980.1</link>
      <description>Recording of fetal heart rate via CTG monitoring has been routinely performed as an important part of antenatal and subpartum care for several decades. The current guidelines of the FIGO (ref1) recommend a normal range of the fetal heart rate from 110 to 150 bpm. However, there is no agreement in the medical community whether this is the correct range (ref2). We aim to address this question by computerized analysis (ref 3) of a high quality database (HQDb, ref 4) of about one billion electronically registered fetal heart rate measurements from about 10,000 pregnancies in three medical centres over seven years. In the present paper, we lay out a detailed analysis plan for this evidence-based project in the vein of the validation policy of the Sylvia Lawry Centre for Multiple Sclerosis Research (ref 5) with a split of the database into an exploratory part and a part reserved for validation. We will perform the analysis and the validation after publication of this plan in order to reduce the probability of publishing false positive research findings (ref 6-7).</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.980.1</guid>
      <pubDate>Mon, 17 Sep 2007 12:48:48 UTC</pubDate>
      <dc:title>The Normal Fetal Heart Rate Study: Analysis Plan</dc:title>
      <dc:identifier>doi:10.1038/npre.2007.980.1</dc:identifier>
      <dc:date>2007-09-17</dc:date>
      <dc:creator>Martin Daumer</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-09-17T12:48:48Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/980/version/1/files/npre2007980-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>Effective Sample Size: Quick Estimation of the Effect of Related Samples in Genetic Case-Control Association Analyses</title>
      <link>http://precedings.nature.com/documents/400/version/1</link>
      <description>Correlated samples have been frequently avoided in case-controlgenetic association studies in part  because the methods for handling them are either noteasily implemented or not widely known. Weadvocate one method for case-control association analysis of correlatedsamples&amp;#8212;the effective sample size method&amp;#8212;as a simple andaccessible approach that does not require specialized computer programs.The effective sample size method captures the variance inflationof allele frequency estimation exactly, and can be used to modify thechi-square test statistic, p-value, and 95% confidence interval ofodds-ratio simply by replacing the apparent number of allele counts with theeffective ones. For genotype frequency estimation, although a singleeffective sample size is unable to completely characterize the variance inflation,an averaged one can satisfactorily approximate the simulated result.The effective sample size method is applied to the rheumatoid arthritissiblings data collected from the North American Rheumatoid Arthritis Consortium (NARAC)to establish a significant association with the interferon-inducedhelicasel gene (IFIH1) previously being identified as a type 1 diabetessusceptibility locus.  Connections between the effective sample sizemethod and other methods, such as generalized estimation equation,variance of eigenvalues for correlation matrices, and genomic controls,are also discussed.</description>
      <guid>http://precedings.nature.com/documents/400/version/1</guid>
      <pubDate>Tue, 10 Jul 2007 05:01:19 UTC</pubDate>
      <dc:title>Effective Sample Size: Quick Estimation of the Effect of Related Samples in Genetic Case-Control Association Analyses</dc:title>
      <dc:identifier>hdl:10101/npre.2007.400.1</dc:identifier>
      <dc:date>2007-07-10</dc:date>
      <dc:creator>Wentian F. Li</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-07-10T05:01:19Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Genetics &amp; Genomics</prism:section>
      <prism:section>Immunology</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/400/version/1/files/npre2007400-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>The Reproducibility of Lists of Differentially Expressed Genes in Microarray Studies</title>
      <link>http://precedings.nature.com/documents/306/version/2</link>
      <description>Reproducibility is a fundamental requirement in scientific experiments and clinical contexts.  Recent publications raise concerns about the reliability of microarray technology because of the apparent lack of agreement between lists of differentially expressed genes (DEGs).  In this study we demonstrate that (1) such discordance may stem from ranking and selecting DEGs solely by statistical significance (P) derived from widely used simple t-tests; (2) when fold change (FC) is used as the ranking criterion, the lists become much more reproducible, especially when fewer genes are selected; and (3) the instability of short DEG lists based on P cutoffs is an expected mathematical consequence of the high variability of the t-values.  We recommend the use of FC ranking plus a non-stringent P cutoff as a baseline practice in order to generate more reproducible DEG lists.  The FC criterion enhances reproducibility while the P criterion balances sensitivity and specificity.</description>
      <guid>http://precedings.nature.com/documents/306/version/2</guid>
      <pubDate>Tue, 03 Jul 2007 12:18:21 UTC</pubDate>
      <dc:title>The Reproducibility of Lists of Differentially Expressed Genes in Microarray Studies</dc:title>
      <dc:identifier>hdl:10101/npre.2007.306.2</dc:identifier>
      <dc:date>2007-07-03</dc:date>
      <dc:creator>Leming D. Shi</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-07-03T12:18:21Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/306/version/2/files/npre2007306-2.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
    <item>
      <title>The Reproducibility of Lists of Differentially Expressed Genes in Microarray Studies</title>
      <link>http://dx.doi.org/10.1038/npre.2007.306.1</link>
      <description>Reproducibility is a fundamental requirement in scientific experiments and clinical contexts.  Recent publications raise concerns about the reliability of microarray technology because of the apparent lack of agreement between lists of differentially expressed genes (DEGs).  In this study we demonstrate that (1) such discordance may stem from ranking and selecting DEGs solely by statistical significance (P) derived from widely used simple t-tests; (2) when fold change (FC) is used as the ranking criterion, the lists become much more reproducible, especially when fewer genes are selected; and (3) the instability of short DEG lists based on P cutoffs is an expected mathematical consequence of the high variability of the t-values.  We recommend the use of FC ranking plus a non-stringent P cutoff as a baseline practice in order to generate more reproducible DEG lists.  The FC criterion enhances reproducibility while the P criterion balances sensitivity and specificity.</description>
      <guid>http://dx.doi.org/10.1038/npre.2007.306.1</guid>
      <pubDate>Mon, 02 Jul 2007 09:20:53 UTC</pubDate>
      <dc:title>The Reproducibility of Lists of Differentially Expressed Genes in Microarray Studies</dc:title>
      <dc:identifier>doi:10.1038/npre.2007.306.1</dc:identifier>
      <dc:date>2007-07-02</dc:date>
      <dc:creator>Leming Shi</dc:creator>
      <prism:publicationName>Nature Precedings</prism:publicationName>
      <prism:publicationDate>2007-07-02T09:20:53Z</prism:publicationDate>
      <prism:category>Manuscript</prism:category>
      <prism:section>Biotechnology</prism:section>
      <prism:section>Bioinformatics</prism:section>
      <media:thumbnail url="http://precedings.nature.com/documents/306/version/1/files/npre2007306-1.pdf.thumb.png"/>
      <creativeCommons:license>http://creativecommons.org/licenses/by/2.5/</creativeCommons:license>
    </item>
  </channel>
</rss>
