Last data update: Dec 02, 2024. (Total: 48272 publications since 2009)
Records 1-6 (of 6 Records) |
Query Trace: Rowell J[original query] |
---|
Benchmark datasets for SARS-CoV-2 surveillance bioinformatics.
Xiaoli L , Hagey JV , Park DJ , Gulvik CA , Young EL , Alikhan NF , Lawsin A , Hassell N , Knipe K , Oakeson KF , Retchless AC , Shakya M , Lo CC , Chain P , Page AJ , Metcalf BJ , Su M , Rowell J , Vidyaprakash E , Paden CR , Huang AD , Roellig D , Patel K , Winglee K , Weigand MR , Katz LS . PeerJ 2022 10 e13821 BACKGROUND: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the cause of coronavirus disease 2019 (COVID-19), has spread globally and is being surveilled with an international genome sequencing effort. Surveillance consists of sample acquisition, library preparation, and whole genome sequencing. This has necessitated a classification scheme detailing Variants of Concern (VOC) and Variants of Interest (VOI), and the rapid expansion of bioinformatics tools for sequence analysis. These bioinformatic tools are means for major actionable results: maintaining quality assurance and checks, defining population structure, performing genomic epidemiology, and inferring lineage to allow reliable and actionable identification and classification. Additionally, the pandemic has required public health laboratories to reach high throughput proficiency in sequencing library preparation and downstream data analysis rapidly. However, both processes can be limited by a lack of a standardized sequence dataset. METHODS: We identified six SARS-CoV-2 sequence datasets from recent publications, public databases and internal resources. In addition, we created a method to mine public databases to identify representative genomes for these datasets. Using this novel method, we identified several genomes as either VOI/VOC representatives or non-VOI/VOC representatives. To describe each dataset, we utilized a previously published datasets format, which describes accession information and whole dataset information. Additionally, a script from the same publication has been enhanced to download and verify all data from this study. RESULTS: The benchmark datasets focus on the two most widely used sequencing platforms: long read sequencing data from the Oxford Nanopore Technologies platform and short read sequencing data from the Illumina platform. There are six datasets: three were derived from recent publications; two were derived from data mining public databases to answer common questions not covered by published datasets; one unique dataset representing common sequence failures was obtained by rigorously scrutinizing data that did not pass quality checks. The dataset summary table, data mining script and quality control (QC) values for all sequence data are publicly available on GitHub: https://github.com/CDCgov/datasets-sars-cov-2. DISCUSSION: The datasets presented here were generated to help public health laboratories build sequencing and bioinformatics capacity, benchmark different workflows and pipelines, and calibrate QC thresholds to ensure sequencing quality. Together, improvements in these areas support accurate and timely outbreak investigation and surveillance, providing actionable data for pandemic management. Furthermore, these publicly available and standardized benchmark data will facilitate the development and adjudication of new pipelines. |
Neighborhood socioeconomic deprivation in early childhood mediates racial disparities in blood pressure in a college student sample
Nichols OI , Fuller-Rowell TE , Robinson AT , Eugene D , Homandberg LK . J Youth Adolesc 2022 51 (11) 2146-2160 The influence of childhood contexts on adult blood pressure is an important yet understudied topic. Using a developmental perspective, this study examines the association between neighborhood socioeconomic disadvantage in early childhood (0-5 yrs), middle childhood (6-12 yrs) and adolescence (13-18 yrs) on subsequent blood pressure in young adulthood. Data were from 263 college students (52% Black; M(age) = 19.21 years) and neighborhood socioeconomic disadvantage was measured using a tract-level Area Deprivation Index. Neighborhood disadvantage in early childhood was significantly associated with diastolic blood pressure and explained 22% of the race difference between Black and White adults. The findings are consistent with the notion that early childhood may be a sensitive period for the effects of neighborhood disadvantage on blood pressure. |
ROCker Models for Reliable Detection and Typing of Short-Read Sequences Carrying β-Lactamase Genes.
Zhang SY , Suttner B , Rodriguez RLm , Orellana LH , Conrad RE , Liu F , Rowell JL , Webb HE , Williams-Newkirk AJ , Huang A , Konstantinidis KT . mSystems 2022 7 (3) e0128121 Identification of genes encoding β-lactamases (BLs) from short-read sequences remains challenging due to the high frequency of shared amino acid functional domains and motifs in proteins encoded by BL genes and related non-BL gene sequences. Divergent BL homologs can be frequently missed during similarity searches, which has important practical consequences for monitoring antibiotic resistance. To address this limitation, we built ROCker models that targeted broad classes (e.g., class A, B, C, and D) and individual families (e.g., TEM) of BLs and challenged them with mock 150-bp- and 250-bp-read data sets of known composition. ROCker identifies most-discriminant bit score thresholds in sliding windows along the sequence of the target protein sequence and hence can account for nondiscriminative domains shared by unrelated proteins. BL ROCker models showed a 0% false-positive rate (FPR), a 0% to 4% false-negative rate (FNR), and an up-to-50-fold-higher F1 score [2 × precision × recall/(precision + recall)] compared to alternative methods, such as similarity searches using BLASTx with various e-value thresholds and BL hidden Markov models, or tools like DeepARG, ShortBRED, and AMRFinder. The ROCker models and the underlying protein sequence reference data sets and phylogenetic trees for read placement are freely available through http://enve-omics.ce.gatech.edu/data/rocker-bla. Application of these BL ROCker models to metagenomics, metatranscriptomics, and high-throughput PCR gene amplicon data should facilitate the reliable detection and quantification of BL variants encoded by environmental or clinical isolates and microbiomes and more accurate assessment of the associated public health risk, compared to the current practice. IMPORTANCE Resistance genes encoding β-lactamases (BLs) confer resistance to the widely prescribed antibiotic class β-lactams. Therefore, it is important to assess the prevalence of BL genes in clinical or environmental samples for monitoring the spreading of these genes into pathogens and estimating public health risk. However, detecting BLs in short-read sequence data is technically challenging. Our ROCker model-based bioinformatics approach showcases the reliable detection and typing of BLs in complex data sets and thus contributes toward solving an important problem in antibiotic resistance surveillance. The ROCker models developed substantially expand the toolbox for monitoring antibiotic resistance in clinical or environmental settings. |
Lack of measles transmission to susceptible contacts from a health care worker with probable secondary vaccine failure - Maricopa County, Arizona, 2015
Jones J , Klein R , Popescu S , Rose K , Kretschmer M , Carrigan A , Trembath F , Koski L , Zabel K , Ostdiek S , Rowell-Kinnard P , Munoz E , Sunenshine R , Sylvester T . MMWR Morb Mortal Wkly Rep 2015 64 (30) 832-3 On January 23, 2015, the Maricopa County Department of Public Health (MCDPH) was notified of a suspected measles case in a nurse, a woman aged 48 years. On January 11, the nurse had contact with a patient with laboratory-confirmed measles associated with the Disneyland theme park-related outbreak in California. On January 21, she developed a fever (103 degrees F [39.4 degrees C]), on January 23 she experienced cough and coryza, and on January 24, she developed a rash. The patient was instructed to isolate herself at home. On January 26, serum, a nasopharyngeal swab, and a urine specimen were collected. The following day, measles infection was diagnosed by real time reverse transcription polymerase chain reaction testing of the nasopharyngeal swab and urine specimen and by detection of measles-specific immunoglobulin (Ig)M and IgG in serum by enzyme-linked immunosorbent assay. Because of her symptoms and laboratory results, the patient was considered to be infectious. |
A systematic review of cancer GWAS and candidate gene meta-analyses reveals limited overlap but similar effect sizes.
Chang CQ , Yesupriya A , Rowell JL , Pimentel CB , Clyne M , Gwinn M , Khoury MJ , Wulf A , Schully SD . Eur J Hum Genet 2014 22 (3) 402-8 Candidate gene and genome-wide association studies (GWAS) represent two complementary approaches to uncovering genetic contributions to common diseases. We systematically reviewed the contributions of these approaches to our knowledge of genetic associations with cancer risk by analyzing the data in the Cancer Genome-wide Association and Meta Analyses database (Cancer GAMAdb). The database catalogs studies published since January 1, 2000, by study and cancer type. In all, we found that meta-analyses and pooled analyses of candidate genes reported 349 statistically significant associations and GWAS reported 269, for a total of 577 unique associations. Only 41 (7.1%) associations were reported in both candidate gene meta-analyses and GWAS, usually with similar effect sizes. When considering only noteworthy associations (defined as those with false-positive report probabilities≤0.2) and accounting for indirect overlap, we found 202 associations, with 27 of those appearing in both meta-analyses and GWAS. Our findings suggest that meta-analyses of well-conducted candidate gene studies may continue to add to our understanding of the genetic associations in the post-GWAS era. |
Trends in population-based studies of human genetics in infectious diseases.
Rowell JL , Dowling NF , Yu W , Yesupriya A , Zhang L , Gwinn M . PLoS One 2012 7 (2) e25431 Pathogen genetics is already a mainstay of public health investigation and control efforts; now advances in technology make it possible to investigate the role of human genetic variation in the epidemiology of infectious diseases. To describe trends in this field, we analyzed articles that were published from 2001 through 2010 and indexed by the HuGE Navigator, a curated online database of PubMed abstracts in human genome epidemiology. We extracted the principal findings from all meta-analyses and genome-wide association studies (GWAS) with an infectious disease-related outcome. Finally, we compared the representation of diseases in HuGE Navigator with their contributions to morbidity worldwide. We identified 3,730 articles on infectious diseases, including 27 meta-analyses and 23 GWAS. The number published each year increased from 148 in 2001 to 543 in 2010 but remained a small fraction (about 7%) of all studies in human genome epidemiology. Most articles were by authors from developed countries, but the percentage by authors from resource-limited countries increased from 9% to 25% during the period studied. The most commonly studied diseases were HIV/AIDS, tuberculosis, hepatitis B infection, hepatitis C infection, sepsis, and malaria. As genomic research methods become more affordable and accessible, population-based research on infectious diseases will be able to examine the role of variation in human as well as pathogen genomes. This approach offers new opportunities for understanding infectious disease susceptibility, severity, treatment, control, and prevention. |
- Page last reviewed:Feb 1, 2024
- Page last updated:Dec 02, 2024
- Content source:
- Powered by CDC PHGKB Infrastructure