At the beginning of the post-sequencing era, biology must work with the enormous amounts of quantitative data being amassed and must render complex problems in mathematical terms, with all of the computational effort that entails. This phenomenon is perhaps best exemplified by the interdisciplinary scientific activity caused by the advent of high-throughput cDNA microarray technology, which facilitates large-scale surveys of gene expression. Biologists must now work together with engineers, statisticians, computer scientists, and other specialists, in order to attain a holistic understanding of the complex relationship between genes within the genome and uncover genetic function and regulation. This text aims to help researchers deal with contemporary genomic challenges. Topics covered include: overviews of the role of supercomputers in genomics research, the existing challenges and directions in image processing for microarray technology, and web-based tools for microarray data analysis; approaches to the global modeling and analysis of gene regulatory networks and transcriptional control, using methods, theories, and tools from signal processing, machine learning, information theory, and control theory; state-of-the-art tools in Boolean function theory, time-frequency analysis, pattern recognition, and unsupervised learning, applied to cancer classification, identification of biologically active sites, and visualization of gene expression data; crucial issues associated with statistical analysis of microarray data, statistics and stochastic analysis of gene expression levels in a single cell, statistically sound design of microarray studies and experiments; and biological and medical implications of genomics research.
Computational and Statistical Approaches to Genomics, 2nd Edition, aims to help researchers deal with current genomic challenges. During the three years after the publication of the first edition of this book, the computational and statistical research in genomics have become increasingly more important and indispensable for understanding cellular behavior under a variety of environmental conditions and for tackling challenging clinical problems. In the first edition, the organizational structure was: data à analysis à synthesis à application. In the second edition, the same structure remains, but the chapters that primarily focused on applications have been deleted.
This decision was motivated by several factors. Firstly, the main focus of this book is computational and statistical approaches in genomics research. Thus, the main emphasis is on methods rather than on applications. Secondly, many of the chapters already include numerous examples of applications of the discussed methods to current problems in biology.
The range of topics have been broadened to include newly contributed chapters on topics such as alternative splicing, tissue microarray image and data analysis, single nucleotide polymorphisms, serial analysis of gene expression, and gene shaving. Additionally, a number of chapters have been updated or revised.
This book is for any researcher, in academia and industry, in biology, computer science, statistics, or engineering involved in genomic problems. It can also be used as an advanced level textbook in a course focusing on genomic signals, information processing, or genome biology.