Last data update: Mar 21, 2025. (Total: 48935 publications since 2009)
Records 1-10 (of 10 Records) |
Query Trace: den Bakker HC[original query] |
---|
Kalamari: a representative set of genomes of public health concern
Katz LS , Griswold T , Lindsey RL , Lauer AC , Im MS , Williams G , Halpin JL , Gómez GA , Kucerova Z , Morrison S , Page A , Den Bakker HC , Carleton HA . Microbiol Resour Announc 2025 e0096324 ![]() ![]() Kalamari is a resource that supports genomic epidemiology and pathogen surveillance. It consists of representative genomes and common contaminants. Kalamari also contains a custom taxonomy and software for downloading and formatting the data. |
Fasten: a toolkit for streaming operations on FASTQ files
Katz LS , Phan J , den Bakker HC . J Open Source Softw 2024 9 (94) 6030 ![]() ![]() |
SneakerNet: A modular quality assurance and quality check workflow for primary genomic and metagenomic read data.
Griswold T , Kapsak C , Chen JC , den Bakker HC , Williams G , Kelley A , Vidyaprakash E , Katz LS . J Open Source Softw 2021 6 (60) ![]() Laboratories that run Whole Genome Sequencing (WGS) produce a tremendous amount of data, up to 10 gigabytes for some common instruments. There is a need to standardize the quality assurance and quality control process (QA/QC). Therefore we have created SneakerNet to automate the QA/QC for WGS. |
Mashtree: a rapid comparison of whole genome sequence files
Katz LS , Griswold T , Morrison SS , Caravas JA , Zhang S , den Bakker HC , Deng X , Carleton HA . J Open Source Softw 2019 4 (44) In the past decade, the number of publicly available bacterial genomes has increased dramatically. These genomes have been generated for impactful initiatives, especially in the field of genomic epidemiology (Brown, Dessai, McGarry, & Gerner-Smidt, 2019; Timme et al., 2017). Genomes are sequenced, shared publicly, and subsequently analyzed for phylogenetic relatedness. If two genomes of epidemiological interest are found to be related, further investigation might be prompted. However, comparing the multitudes of genomes for phylogenetic relatedness is computationally expensive and, with large numbers, laborious. Consequently, there are many strategies to reduce the complexity of the data for downstream analysis, especially using nucleotide stretches of length k (kmers). |
Complete Genome Sequence of a Serotype 7 Listeria monocytogenes Strain, FSL R9-0915.
Peters TL , Hudson LK , Bryan DW , Song Y , den Bakker HC , Kucerova Z , Denes TG . Microbiol Resour Announc 2021 10 (1) ![]() ![]() Listeria monocytogenes serotype 7 lacks glycosidic constituents in wall teichoic acids. Here, we present the complete genome sequence of L. monocytogenes serotype 7 strain FSL R9-0915 and an analysis of genes known to affect L. monocytogenes antigenicity. This strain is used as a control strain in Listeria phage host range analyses. |
SeqSero2: rapid and improved Salmonella serotype determination using whole genome sequencing data.
Zhang S , Den-Bakker HC , Li S , Chen J , Dinsmore BA , Lane C , Lauer AC , Fields PI , Deng X . Appl Environ Microbiol 2019 85 (23) ![]() ![]() SeqSero, launched in 2015, is a software tool for Salmonella serotype determination from whole genome sequencing (WGS) data. Despite its routine use in public health and food safety laboratories in the United States and other countries, the original SeqSero pipeline is relatively slow (minutes per genome using sequencing reads), is not optimized for draft genome assemblies, and may assign multiple serotypes for a strain. Here we present SeqSero2 (github.com/denglab/SeqSero2; denglab.info/SeqSero2), an algorithmic transformation and functional update of the original SeqSero. Major improvements include: 1) additional sequence markers for identification of Salmonella species and subspecies and certain serotypes; 2) a k-mer based algorithm for rapid serotype prediction from raw reads (seconds per genome) and improved serotype prediction from assemblies; and 3) a targeted assembly approach for specific retrieval of serotype determinants from WGS for serotype prediction, new allele discovery, and prediction troubleshooting. Evaluated using 5,794 genomes representing 364 common US serotypes, including 2,280 human isolates of 117 serotypes from the National Antimicrobial Resistance Monitoring System, SeqSero2 is up to 50 times faster than the original SeqSero while maintaining equivalent accuracy for raw reads and substantially improving accuracy for assemblies. SeqSero2 further suggested that 3% of the tested genomes contained reads from multiple serotypes, indicating a use for contamination detection. In addition to short reads, SeqSero2 demonstrated potential for accurate and rapid serotype prediction directly from long nanopore reads despite base call errors. Testing of 40 nanopore-sequenced genomes of 17 serotypes yielded a single H antigen misidentification.IMPORTANCE: Serotyping is the basis of public health surveillance of Salmonella It remains a first-line subtyping method even as surveillance continues to be transformed by whole genome sequencing. SeqSero allows the integration of Salmonella serotyping into a whole genome sequencing-based laboratory workflow while maintaining continuity with the classic serotyping scheme. SeqSero2, informed by extensive testing and application of SeqSero in the United States and other countries, incorporates important improvements and updates that further strengthen its application in routine and large scale surveillance of Salmonella by whole genome sequencing. |
Complete Genome Sequences of Four Salmonella enterica Strains (Including Those of Serotypes Montevideo, Mbandaka, and Lubbock) Isolated from Peripheral Lymph Nodes of Healthy Cattle.
Bugarel M , Cook PW , den Bakker HC , Harhay D , Nightingale KK , Loneragan GH . Microbiol Resour Announc 2019 8 (2) ![]() ![]() Salmonella enterica serotype Lubbock emerged most likely from a Salmonella enterica serotype Mbandaka ancestor that acquired by recombination the fliC operon from Salmonella enterica serotype Montevideo. Here, we report the complete genome sequence of two S. Lubbock, one S. Montevideo, and one S. Mbandaka strain isolated from bovine lymph nodes. |
Evolutionary Relationships of Outbreak-associated Listeria monocytogenes Strains of Serotypes 1/2a and 1/2b Determined by Whole Genome Sequencing.
Bergholz TM , den Bakker HC , Katz LS , Silk BJ , Jackson KA , Kucerova Z , Joseph LA , Turnsek M , Gladney LM , Halpin JL , Xavier K , Gossack J , Ward TJ , Frace M , Tarr CL . Appl Environ Microbiol 2015 82 (3) 928-38 ![]() ![]() We used whole genome sequencing to determine evolutionary relationships among 20 outbreak-associated clinical isolates of Listeria monocytogenes serotypes 1/2a and 1/2b. Isolates from six of eleven outbreaks fell outside of the clonal groups or 'epidemic clones' that have been previously associated with outbreaks, suggesting that epidemic potential may be widespread in L. monocytogenes and is not limited to the recognized epidemic clones. Pairwise comparisons between epidemiologically-related isolates within clonal complexes showed that genome-level variation differed by two orders of magnitude between different comparisons, and the distribution of point mutations (core versus accessory genome) also varied. In addition, genetic divergence between one closely related pair of isolates from a single outbreak was driven primarily by changes in phage regions. The evolutionary analysis showed the changes could be attributed to horizontal gene transfer; members of the diverse bacterial community found in the production facility could have served as the source of novel genetic material at some point in the production chain. The results raise the question of how to best utilize information contained within the accessory genome in outbreak investigations. The full magnitude and complexity of genetic changes revealed by genome sequencing could not be discerned from traditional subtyping methods and the results demonstrate the challenges of interpreting genetic variation among isolates recovered from a single outbreak. Epidemiological information remains critical for proper interpretation of nucleotide and structural diversity among isolates recovered during outbreaks, and will remain so until we understand more about how various population histories influence genetic variation. |
Genomic epidemiology of Salmonella enterica serotype Enteritidis based on population structure of prevalent lineages.
Deng X , Desai PT , den Bakker HC , Mikoleit M , Tolar B , Trees E , Hendriksen RS , Frye JG , Porwollik S , Weimer BC , Wiedmann M , Weinstock GM , Fields PI , McClelland M . Emerg Infect Dis 2014 20 (9) 1481-9 ![]() Salmonella enterica serotype Enteritidis is one of the most commonly reported causes of human salmonellosis. Its low genetic diversity, measured by fingerprinting methods, has made subtyping a challenge. We used whole-genome sequencing to characterize 125 S. enterica Enteritidis and 3 S. enterica serotype Nitra strains. Single-nucleotide polymorphisms were filtered to identify 4,887 reliable loci that distinguished all isolates from each other. Our whole-genome single-nucleotide polymorphism typing approach was robust for S. enterica Enteritidis subtyping with combined data for different strains from 2 different sequencing platforms. Five major genetic lineages were recognized, which revealed possible patterns of geographic and epidemiologic distribution. Analyses on the population dynamics and evolutionary history estimated that major lineages emerged during the 17th-18th centuries and diversified during the 1920s and 1950s. |
Listeria marthii sp. nov., isolated from the natural environment, Finger Lakes National Forest
Graves LM , Helsel LO , Steigerwalt AG , Morey RE , Daneshvar MI , Roof SE , Orsi RH , Fortes ED , Millilo SR , den Bakker HC , Wiedmann M , Swaminathan B , Sauders BD . Int J Syst Evol Microbiol 2009 69 (6) 1280-1288 Four isolates (FSL S4-120T, FSL S4-696, FSL S4-710, and FSL S4-965) of Gram-positive, motile, facultatively anaerobic, non-sporeforming bacilli that were phenotypically similar to Listeria spp. were isolated from soil, standing water, and flowing water samples obtained from the natural environment in the Finger Lakes National Forest, New York, USA. The four isolates were closely related to one another and were determined to be the same species by whole genome DNA-DNA hybridization studies (>82% relatedness at 55 degrees C and >76% relatedness at 70 degrees C with 0.0-0.5% divergence). 16S ribosomal RNA sequence analysis confirmed their close phylogenetic relatedness to L. monocytogenes and L. innocua and more distant relatedness to L. welshimeri, L. seeligeri, L. ivanovii, and L. grayi. Phylogenetic analysis of partial sequences for sigB, gap, and prs showed that these isolates form a well-supported sistergroup to L. monocytogenes. The four isolates were sufficiently different from L. monocytogenes and L. innocua by DNA-DNA hybridization to warrant their designation as a new Listeria species. The four isolates yielded positive reactions in the AccuProbe(R) test that is purported to be specific for L. monocytogenes, did not ferment L-rhamnose, were non-hemolytic on blood agar media, and did not contain a homologue of the L. monocytogenes virulence gene island. On the basis of their phenotypic characteristics and their genotypic distinctiveness from L. monocytogenes and L. innocua, the four isolates should be classified as a new species within the genus Listeria, for which the name Listeria marthii sp. nov. is proposed. The type strain of L. marthii is FSL S4-120T (=ATCC BAA 1595T =BEIR NR 9579T =CCUG 56148T). L. marthii has not been associated with human or animal disease at this time. |
- Page last reviewed:Feb 1, 2024
- Page last updated:Mar 21, 2025
- Content source:
- Powered by CDC PHGKB Infrastructure