Scientific Program

Conference Series Ltd invites all the participants across the globe to attend 5th International Conference on Biometrics & Biostatistics Houston,Texas,USA.

Day 1 :

Keynote Forum

Bin Li

Takeda Pharmaceuticals International Co., USA

Keynote: NGS data process and building/testing drug-sensitivity predictive models for both single agents and drug combinations

Time : 10:00-11:00

Conference Series Biostatistics 2016 International Conference Keynote Speaker Bin Li photo
Biography:

Bin Li leads a computational biology team at Takeda pharmaceuticals, to support translational research on various Takeda compounds. His team evaluates NGS methods, builds internal pipelines, and selects vendors for clinical data process, for multiple NGS platforms (including WGS, WES, custom panel, and RNA-seq). His team also develops methods and builds predictive models for patient stratifications and disease indication selections. Before joining Millennium, he was a Senior Scientist at Merrimack Pharmaceuticals in Cambridge, MA. Prior to Merrimack, he was a Senior Scientist at Institute for Systems Biology in Seattle, WA.


 

Abstract:

Precision medicine approaches to optimize therapeutic efficacy in selected patient populations requires the acquisition and storage of in depth genomic and phenotypic patient information combined with the use of innovative computational platforms for knowledge generation. A critical objective is to discover and clinically apply biomarkers to select patients most likely to respond or least likely to experience adverse events. Here, we discuss various computational efforts to support translational biomarker research for biomarker discovery and the development of predictive signatures for clinical testing. These efforts include: Method evaluation and pipeline building for QC/process clinical NGS data; single agent’s predictive modeling for identifying/validating translational biomarkers for patient stratification; building translational research storage platforms to integrate clinical and omics data and; method development and evaluation for drug combination prediction. Combining predictive modeling and translational platform building efforts, computational bioinformatics will be able to support the identification of biomarkers for patient stratification and disease indication selection.

Keynote Forum

Mohammad H Rahbar,

The University of Texas Health Science Center at Houston, USA

Keynote: Characteristics of biostatistics, epidemiology, and research design programs in institutions with clinical and translational science awards

Time : 11:20-12:05

Conference Series Biostatistics 2016 International Conference Keynote Speaker Mohammad H Rahbar, photo
Biography:

Mohammad H Rahbar obtained his PhD in Statistics from Michigan State University in 1988. He is the Director of Division of Clinical and Translational Sciences in the Department of Internal Medicine at the University of Texas McGovern Medical School. He also serves as the Director of Biostatistics, Epidemiology, and Research Design component of Center for Clinical and Translational Sciences at the University of Texas Health Science Center at Houston. He has published more than 200 papers in reputed journals and has been the receipient of sevral grant awards from the National Institutes of Health in the United States

Abstract:

Limited information is available regarding the structure and scholarly output of Biostatistics, Epidemiology, and Research Design (BERD) units in the US academic health centers (AHCs), presenting a barrier to understanding common practices. Using four years of survey data from AHCs that were members of the Clinical and Translational Science Award (CTSA) Consortium during 2010-2013, we describe the size, composition, and scholarly output of CTSA-related BERD units. Overall, the size of BERD units ranged from 3-86 individuals. The median Full Time Equivalent (FTE) in BERD units remained similar and ranged from 3.0 to 3.5 FTEs over the years. BERD units reported more availability of doctoral-level biostatisticians than doctoral-level epidemiologists. In 2011, 2012, and 2013, more than a third of BERD units provided consulting support on 101 to 200 projects. A majority of BERD units reported that between 25% and 75% (in 2011) and 31% - 70% (in 2012) of their consulting was provided to junior investigators. More than two-thirds of BERD units reported their contributions to the submission of 20 or more non-BERD grant or contract applications annually. Nearly half of BERD units reported 1 to 10 manuscripts submitted annually with a BERD practitioner as the first or corresponding author. We describe the size, composition, and scholarly output of CTSA-related BERD units for the period 2010-2013. This characterization provides a benchmark against which to compare BERD resources and may be particularly useful for those institutions planning to develop new units in support of programs such as the CTSA

Keynote Forum

Mohammad H Rahbar

The University of Texas Health Science Center at Houston, USA

Keynote: Characteristics of biostatistics, epidemiology, and research design programs in institutions with clinical and translational science awards

Time : 11:20-12:00

Conference Series Biostatistics 2016 International Conference Keynote Speaker Mohammad H Rahbar photo
Biography:

Mohammad H Rahbar obtained his PhD in Statistics from Michigan State University in 1988. He is the Director of Division of Clinical and Translational Sciences in the Department of Internal Medicine at the University of Texas McGovern Medical School. He also serves as the Director of Biostatistics, Epidemiology, and Research Design component of Center for Clinical and Translational Sciences at the University of Texas Health Science Center at Houston. He has published more than 200 papers in reputed journals and has been the receipient of sevral grant awards from the National Institutes of Health in the United States

Abstract:

Limited information is available regarding the structure and scholarly output of Biostatistics, Epidemiology, and Research Design (BERD) units in the US academic health centers (AHCs), presenting a barrier to understanding common practices. Using four years of survey data from AHCs that were members of the Clinical and Translational Science Award (CTSA) Consortium during 2010-2013, we describe the size, composition, and scholarly output of CTSA-related BERD units. Overall, the size of BERD units ranged from 3-86 individuals. The median Full Time Equivalent (FTE) in BERD units remained similar and ranged from 3.0 to 3.5 FTEs over the years. BERD units reported more availability of doctoral-level biostatisticians than doctoral-level epidemiologists. In 2011, 2012, and 2013, more than a third of BERD units provided consulting support on 101 to 200 projects. A majority of BERD units reported that between 25% and 75% (in 2011) and 31% - 70% (in 2012) of their consulting was provided to junior investigators. More than two-thirds of BERD units reported their contributions to the submission of 20 or more non-BERD grant or contract applications annually. Nearly half of BERD units reported 1 to 10 manuscripts submitted annually with a BERD practitioner as the first or corresponding author. We describe the size, composition, and scholarly output of CTSA-related BERD units for the period 2010-2013. This characterization provides a benchmark against which to compare BERD resources and may be particularly useful for those institutions planning to develop new units in support of programs such as the CTSA

Keynote Forum

Ajit Kumar Roy

Central Agricultural University, Agartala, India

Keynote: Cutting edge computational solutions for large scale high-dimensional data sets arising out of new biology

Time : 12:05-12:50

Conference Series Biostatistics 2016 International Conference Keynote Speaker Ajit Kumar Roy photo
Biography:

Ajit Kumar Roy is an acclaimed researcher, consultant, and award-winning Keynote Speaker. He served at CIFA, ICAR, as Principal Scientist and was involved in R&D activities in ICT, Statistics, Bioinformatics and Economics. At International level, he served as a Computer Specialist at SAARC Agricultural Information Centre (SAIC), Dhaka, Bangladesh for over 3 years. He published over 100 articles in refereed journals, conference proceedings. His recent best-sellers are 'Applied Big Data Analytics'; 'Impact of Big Data Analytics on Business, Economy, Health Care and Society'; ‘Web Resources for Bioinformatics, Biotechnology and Life Sciences Research’; 'Self Learning of Bioinformatics Online'; Data Science – A Career Option for 21st Century; 'Applied Bioinformatics, Statistics and Economics in Fisheries Research' and 'Applied Computational Biology and Statistics in Biotechnology and Bioinformatics'.

Abstract:

New biology is a branch of biology that that deals with the nature of biological phenomenon at the molecular level. In post genomic era a new language has been created for new biology viz.,Genomics,Functional Genomics, Proteomics,cDNA,microarrays, Global Gene Expression Patterns .New Computational Tools are used  for Sequencing, Analyzing experimental data, Searching, Pattern matching, Data mining, Gene discovery, Function discovery aiming to Classify, Identify patterns, predictions, Create models & Prediction, Assessment and Comparison,Optimization,Better utilize existing knowledge. The new wave of high-throughput technologies in genomics and proteomics are constantly improving and generating an unprecedented amount of data that can be termed as Big Data means large data sets in terms of volume, variety, velocity, variability, veracity, & complexity. Bioinformatics researchers are currently confronted with a huge challenge of handling, processing and moving these large-scale biological data, a problem that will only increase in coming years. Therefore, cloud computing bears great promise for effectively addressing issues of large-scale data generated by high-throughput technologies in the fields of genomics, proteomics and other biological research areas. Cloud-based bioinformatics resources have changed the approaches toward huge datasets, providing much faster data acquisition, analysis rates and storage. New cloud-based bioinformatics computing tools, algorithms, and workflows are consistently being developed and successfully deployed. Basically, the cloud refers to software and services that run on the internet instead of one's computer. The increased use of next generation sequencing has led to challenges in data analysis, large-scale data storage and management, multi-site data integration, validation for quality and scale-up of informatics. It is integral to overcome these analysis and informatics challenges to successfully translate NGS research and data from the lab to clinical stage. Success in the life sciences will depend on our ability to properly interpret the large-scale, high-dimensional data sets that are generated by these technologies, which in turn requires us to adopt advances in informatics. New technological advances and the availability of ‘big data’ enable us to probe deep into the origin of life and to further understand evolutionary processes. This paper reviews the current development of cloud based computational technologies that can be applied and pinpoints their potential beneficial applications as well as implications for Life Sciences. Big data Analytics platforms that offer implementations of the Map Reduce computational pattern e.g., Hadoop make it easy for developers to perform data-intensive computations at scale is also highlighted. The New Biology approach has the potential to meet critical societal goals in food, the environment, energy, and health.

  • Bioinformatics|Biometrics Applications|Cyber security|Statistical Methods|Clinical Biostatistics|Computational Systems Biology
Location: Houston, USA
Speaker

Chair

Bin Li

Takeda Pharmaceuticals International Co., USA

Speaker

Co-Chair

Ajit Kumar Roy,

Central Agricultural University, India

Speaker
Biography:

Upendra Kumar Devisetty earned his PhD from University of Nottingham, UK and completed Post-doctoral studies at the University of California Davis and Oregon State University. He is currently working as Science Informatician at CyVerse, a life sciences cyberinfrastructure funded by the National Science Foundation (NSF). He has published more than 10 papers in peer-reviewed journals and has been invited to speak at several international conferences

Abstract:

CyVerse (formerly iPlant Collaborative) is a life sciences cyberinfrastructure funded by the National Science Foundation (NSF). The infrastructure’s purpose is to scale science, domain expertise, and knowledge by providing a variety of computational tools, services, and platforms for storing, sharing, and analyzing large and diverse biological datasets. The Discovery Environment (DE) in CyVerse provides a modern web interface for running powerful computing, data, and analysis applications. By providing a consistent user interface for accessing tools and computing resources needed for specialized scientific analyses, the DE facilitates data exploration and scientific discovery. DE merges the “science gateway” functionality and the bioinformatics “work bench” with high-performance data management to allow seamless access to reusable computational workflows that can run at very large scales. It is common in bioinformatics to build new analysis methods utilizing multiple programs, libraries, and modules. However, each analysis that uses these tools requires specific versions of the operating system and underlying software. Docker is a container virtualization technology that wraps software of interest (e.g., a bioinformatics tool) together with all its software dependencies so it can run in a reproducible manner regardless of the environment. CyVerse has adopted Docker for integrating software apps that run in the DE’s Compute Cluster. The user creates a Dockerfile, which is sent to CyVerse and used to build the Docker image containing the tool. After the image has been deployed on the DE’s compute cluster, the user can build an web app in the DE to enable other researches easily use the tool

Speaker
Biography:

Azam Yazdani has completed her PhD in Statistical Causal Inference from Cambridge University, England, and Friedrich Schiller Jena University, Germany. She introduced the granularity directed acyclic graph (GADG) algorithm which is recognized at the 2015 atlantic causal inference conference and won “The Thomas R. Ten Have Award”. The GDAG algorithm integrates different biological levels of granularity in large scales to identify causal networks. She has carried out first-rate studies on robust statistical structures. Her research work aims to tackle the problems of finding principals which govern the mechanism of disease to provide a better understanding of the disease mechanisms applying her expertise on data integration in large scales and generating causal relationships.

Abstract:

Background: Analyses of individual metabolites does not provide information about likely targets of intervention because of co-linearity among the metabolites. One approach to directly incorporate relationships among the metabolome is to consider networks.

 

Method: Powerful and advanced analytic strategies are required to identify pathways from genome and environmental exposures to disease endpoints using comprehensive data from different biological granularities. To address the emerging challenge of relating information among different biological granularities, a granularity directed acyclic graph (GDAG) algorithm has been introduced, validated and applied to create strong instrumental variables from the genome granularity (Mendelian principle) to identify robust causal/Bayesian networks over traits. Analyzing biological causal networks provides important insights into the hierarchical regulation of physiology and metabolism.

 

Results: For the application, information from 1,034,945 genetic variants distributed across the genome was extracted and then employed by the GDAG algorithm to identify a metabolomics causal network among 122 serum metabolites. Five modules, largely corresponding to functional categories (e.g. amino acids), were identified over the network and module boundaries were determined using directionality and causal effect sizes. Based on causal network parameters, individual metabolites were identified as hypothesized targets for intervention and prediction.

 

Conclusion: Given the metabolomics causal network, metabolites with direct effect on risk factors (e.g. triglyceride) were identified which improved the understanding of the role of metabolites in quantitative risk factor phenotypes. Finally, pathways from the genome to risk factors via metabolites were determined to reveal underlying biological networks. Future steps will connect these results to cardiovascular disease end points.

Zixing Wang

The University of Texas Graduate School of Biomedical Sciences, USA

Title: Integrating genome and transcriptome data to predict functional driver mutation in breast cancer

Time : 14:50-15:20

Speaker
Biography:

Dr. Wang got his Ph.D degree in genome science and technology at the University of Tennessee in 2011, with the thesis on the TGF-beta function in neuron development.  After graduate, he moved on and switched to bioinformatics and computational biology. Currently he is working as a postdoc at MD Anderson Cancer Center, University of Texas. His main research interest focuses on data mining, machine learning, especially with their integration and application in cancer genomic and precision medicine.  Dr. Wang has published 16 scientific papers in international well-recognized journals. He has been ACM  SIG and ISMB members and also served as journal reviewer for many top-tier journals in the field of bioinformatics and systems biology

Abstract:

Accurate prediction of the functional effects of genetic variation in cancer is critical for realizing the promise of precision medicine. Due to a lack of statistically rigorous approaches and training data, differentiating driver mutations from passenger mutations remains a major challenge in cancer research. We develope a novel Bayesian method, xDriver that combines mutations and their sequence-derived functional features (such as GERP scores) with gene expression in a population of tumor samples to identify mutations that significantly alter gene expression landscapes. We demonstrate using 752 breast cancer samples in the cancer genome atlas that our integrative approach is able to significantly improve the accuracy of driver mutation identification over existing approaches that do not perform such integration. In particular, our approach is able to enhance the functional prioritization of so-called “tail” (rare) mutations and more accurately delineate cancer subtype specific mutations (such as PIK3CA mutants associated with lymph node negative patients). Importantly, scores generated by our model achieve the best agreement with in vitro functional cell viability data obtained from transfected Ba/F3 and MCF10A cell-lines, compared to predictions from other commonly used algorithms. Our results exemplify the importance of integrating gene expression in predicting candidate driver mutations. This integrative study has the potential to impact functional genomic experiments and is expected to link cancer genomic event to precision medicine

Wei Zhong

ICON Clinical Research, USA

Title: Using pattern mixture models in sensitivity analysis

Time : 15:20-15:50

Speaker
Biography:

Wei Zhong, done Ph.D University of Cincinnati College of Medicine, Sr Biostatistician, Project Director at ICON Clinical Research, San Francisco Bay AreaPharmaceuticals

Abstract:

Missing data is a concern in most clinical trials, especially in those with a longitudinal structure. Sensitivity analyses are often used to assess the robustness of statistical inference made under conditions of missing data. There has been an increasing trend to use pattern mixture models (PMMs) to impute missing data in sensitivity analyses. The key to proper implementation of PMMs is to make valid assumptions about the link between observed values and missing values. By applying different assumptions to the data from a phase II trial, we have demonstrated that conclusions drawn from such sensitivity analysis fluctuate depending on what assumptions underlie the missing data imputation. Therefore, PMMs should be used with caution and the plausibility of model assumptions should be evaluated first by the clinical study team, taking into consideration past experience with the study drug and the patient population

Murat YAZICI

JForce Information Technologies Inc.,Turkey

Title: Spatial point process and its application in geographical epidemiology

Time : 16:10-16:40

Speaker
Biography:

Murat Yazici has completed his BSc in Statistics and MSc in Quantitative Methods. During his Statistics and Quantitative Methods educations, he studied Bayesian Statistical Estimation as a thesis-project and Fuzzy Robust Regression Analysis as a master thesis. He has several scientific papers and a book chapter study about big data analytics. He has attended to several conferences to present his studies. He works as Senior Data Scientist & Researcher at JForce Information Technologies Inc. which is one of the solutions partners of IBM in Istanbul. His research areas include “analytics studies in many areas such as health, telecommunication, security, banking, insurance, transportation and logistics sectors”.

Abstract:

In statistics, a point process is a set of independent points in time, or geographical or general spaces. It is also a type of random process. The streak of lightning can be an example as a point process in both time and geographical space. The statistical disease mapping and location detection of radio stations in a telecommunication network is given as another example. Also, point processes are powerful instruments in statistics for modeling spatial data which is of interest in various disciplines such as plant ecology, epidemiology, geography, seismology, materials science, telecommunications, economics, astronomy and others. Spatial point processes are a type of stochastic process, each of whose realizations consists of a finite or countably infinite set of points in the plane. They are useful in the analysis of observed patterns of points; where the points represent the locations of some object of study e.g. disease cases, or petty crimes. Furthermore, spatial point processes occur in various areas in biology and ecology, e.g. in connection with the spreading of insect larvae, the distribution of trees in woodlands, the distribution of bird's nests, or experiments with revitalization of eroded areas in the Wadden sea. The geographical epidemiology is interested in the examination and definition of disease. It is also concerned with geographic variations and usually demographic, behavioral, environmental, socioeconomic, and genetic information etc. Some of types of studies of the geographical epidemiology are disease mapping, geographic correlation studies, and disease clustering. The spatial point process in geographical epidemiology is concerned with spatial analysis of geographical epidemiology by using point process.

Speaker
Biography:

Tapti Sengupta is an Assistant Professor and Coordinator in the Department of Microbiology at West Bengal State University. She started her career as a Fisheries professional focusing on aquatic animal diseases and environmental modelling aspects but currently she is pursuing a research on Mycobacteriosis both in human and animals. She is involved in a number of projects with central and state government funding especially from Department of Biotechnology and ICAR. She is guiding a bunch of potential research scholars and inspires fellow and supervising numerous summer projects along with her departmental and administrative work

Abstract:

Nontuberculous mycobacteria (NTM) are ubiquitous in the environment and are responsible for several ulcerative diseases in human and/or animals, also known as mycobacteriosis. These granulomatous ulcerations caused by NTM infections have been increasing in number over the past decades, especially in immunocompromised and HIV/AIDS patients and in domestic and free living animals. The aim of the study was to ascertain the spectrum of nontuberculous mycobacteria within mycobacterial isolates not belonging to M. tuberculosis and M. avium complexes but phylogenetically related to them. Fishermen and fish handlers were examined for ulcerative lesions typical for mycobacterial infections for consecutive years, 2015 and 2016, from various districts of West Bengal viz. A- Jalpaiguri, B- Murshidabad, C- North 24 Parganas and D- Kolkata, India. Isolated cultures were analyzed within the framework of ecological studies, carried out in fish models. A total of four mycobacterial species were identified based on bio typing and molecular analysis. 306 human samples (swab and pus from the sight of the ulceration, sputum) and 423 environmental samples viz. water and soil relating to the fish handlers, were pooled and screened for two years. 214 human samples were positive for M. fortuitum and M. kansasii whereas 274 environmental samples were positive for M. kansasii and M. smegmatis. M. chelonae was ubiquitous in nature. M. fortuitum showed significant positive correlation with the lower temperature suggesting the role of environmental factors in the rate and degree of infections by the pathogens. M. kansasii and M. smegmatis were found to be opportunistic and presented themselves in primary as well as secondary infections, and in significant levels (p≤0.01) in human samples having direct contact with the contaminated water. Degree of infection showed positive correlation with districts situated in the north of West Bengal. Temperature ranging from 15o-25oC. Significant chi square (χ2) value (78.20) was observed at p<0.001 signifying that the degree of infection varied with temperature, with lower temperatures showing greater NTM occurrence visibly in the cooler districts. The present study signifies the prevalence of NTMs in West Bengal which was not reported earlier. The rise in the number and variety of NTMs especially of groups related to M. tuberculosis complex raises questions about drug regimens and undiagnosed symptoms. 70.02% of pathogenic mycobacteria reported in this study, from various districts of West Bengal, emphasize the need for research of these neglected and opportunistic pathogens particularly using multivariate analysis to identify causative factors