DNA-DNA Hybridization Values and Their Relationship to Whole-Genome Sequence Similarities
DNA-DNA hybridization (DDH) is a molecular technique used to determine the degree of similarity between two genomes. It measures the extent to which single-stranded DNA from one organism will anneal (bind) to single-stranded DNA from another organism under controlled conditions. The resulting hybridization value, often expressed as a percentage of DNA-DNA binding, provides a quantitative measure of genomic relatedness. This technique has historically been crucial in bacterial taxonomy and is still relevant, although whole-genome sequence (WGS) analysis is becoming increasingly prevalent.
The Principle of DNA-DNA Hybridization
The underlying principle is simple: the more similar two genomes are, the more their DNA sequences will complement each other and therefore the more extensive the hybridization will be. Conversely, if two genomes are vastly different, hybridization will be minimal. The process involves denaturing (separating) double-stranded DNA into single strands, mixing DNA from two different organisms, and allowing them to reanneal under controlled stringency (temperature and salt concentration). The percentage of DNA that successfully hybridizes is then determined, typically using spectrophotometry.
Interpretation of DDH Values
Traditionally, a 70% DDH value has been used as a threshold to define species boundaries in bacteria. This means that two organisms with a DDH value of 70% or higher are considered to belong to the same species. Values below this threshold suggest distinct species. However, this cutoff is not universally applicable and its validity has been debated, particularly in the context of rapidly evolving species.
DDH and Whole-Genome Sequence Similarities
The advent of WGS has provided a powerful alternative to DDH. WGS allows for a comprehensive comparison of entire genomes, revealing details far beyond the resolution of DDH. While DDH provides an overall measure of similarity, WGS analysis can identify specific regions of similarity and divergence, including:
-
Average nucleotide identity (ANI): ANI is a widely used metric derived from WGS data that reflects the overall similarity of nucleotide sequences between two genomes. ANI correlates well with DDH values, providing a robust alternative for species delineation. High ANI values generally correspond to high DDH values and vice versa.
-
Genome-to-genome distance calculator (GGDC): GGDC is a software tool that utilizes different algorithms to calculate the genomic distance between two genomes based on WGS data. Similar to ANI, GGDC estimates can effectively replace DDH analysis, and often provide a more nuanced understanding of genomic relationships.
-
Phylogenetic analysis: WGS data can be used to construct phylogenetic trees, visually representing evolutionary relationships between organisms based on their genome sequences. This approach provides a broader context than a single DDH value.
Limitations of DDH
While DDH has historically played a significant role in bacterial taxonomy, it has certain limitations:
- Time-consuming and labor-intensive: The technique is relatively slow and requires specialized equipment.
- Limited resolution: It does not provide detailed information about specific regions of similarity or divergence.
- Potential for bias: The stringency of the hybridization conditions can influence the results.
Conclusion
DDH provides a valuable historical perspective on microbial classification, however WGS-based approaches such as ANI and GGDC are now preferred methods for assessing whole genome similarity and delineating species boundaries. These newer methods offer increased accuracy, precision, and detail compared to the older DDH technique. While DDH remains a useful technique in certain contexts, the rise of WGS has largely supplanted it as the gold standard for genomic comparison.