What Roles Does DDA Play in Building Spectral Libraries?
-
The instrument first performs an MS1 full scan to measure the mass-to-charge ratios (m/z) and intensities of precursor ions.
-
Based on predefined selection rules (e.g., Top N and/or an intensity threshold), the most intense precursor ions are selected for MS2 fragmentation to generate corresponding fragment-ion spectra.
-
This survey-then-select strategy enables DDA to capture the highest-intensity and most representative molecular features present in the sample.
-
Precursor information for peptides or metabolites (m/z and retention time)
-
Corresponding MS/MS fragment ions and their relative intensities
-
Sequence or chemical structure
-
Diagnostic fragment ions
-
Retention time (RT) and intensity information
-
Disease-specific samples (e.g., cancer tissues and cerebrospinal fluid)
-
Specific species or non-model organisms
-
Novel translated products (e.g., newly discovered protein splice variants)
-
Increase input material and optimize sample preparation and enrichment strategies.
-
Perform multiple runs and apply multiple chromatographic gradients to improve depth of coverage.
-
Combine DDA data across multiple samples to build a more comprehensive spectral library.
-
Use fixed acquisition parameters.
-
Calibrate and refine spectral information in conjunction with DIA data.
-
Integrate multiple DDA datasets using software-based algorithms (e.g., Spectronaut and EncyclopeDIA).
In proteomics and metabolomics research, data-dependent acquisition (DDA) is one of the key approaches for generating high-quality spectral libraries. Such libraries provide a precise reference foundation for subsequent data-independent acquisition (DIA) analyses. Below, we systematically discuss the role of DDA in spectral library construction from the perspectives of underlying principles, practical value, advantages, and limitations.
What Is DDA (Data-Dependent Acquisition)?
DDA is a mass spectrometry acquisition strategy in which fragmentation targets are selected based on precursor signal intensity. The basic workflow is as follows:
The Core Roles of DDA in Spectral Library Construction
1. Generating High-Quality MS/MS Fragment Spectra
A spectral library is essentially a database of fragmentation features, typically including:
By preferentially selecting targets with high signal-to-noise ratios for fragmentation, DDA can produce clear, high-resolution, and readily interpretable MS/MS spectra. These properties make DDA an ideal data source for constructing high-confidence spectral libraries.
2. Enabling Accurate Identification and Annotation of Peptides/Metabolites
DDA datasets can be matched against reference databases using search engines (e.g., Mascot, Sequest, and MaxQuant) to identify peptides or to support metabolite identification/annotation. Each identified target can serve as a spectral library entry, with information including:
Accordingly, the spectral library functions as a set of known references and plays a critical role in DIA, PRM, and other untargeted/targeted analytical workflows.
3. Building Sample-Specific Spectral Libraries
DDA can be performed directly on specific biological samples to generate project-specific spectral libraries. Compared with public libraries, such libraries often provide higher coverage and better matching performance for the dataset at hand. This is particularly important in the following scenarios:
Advantages of Using DDA to Build Spectral Libraries
1. High Confidence in Identification
DDA data support sequence-level annotation through database searching. When combined with false discovery rate (FDR) control strategies, DDA-based identifications can achieve strong accuracy and credibility.
2. Information-Rich Fragmentation Spectra
With high-resolution MS/MS acquisition, DDA can provide comprehensive fragment-ion patterns. This facilitates the extraction of multiple diagnostic ions for subsequent DIA library matching and quantitative analyses.
3. Support for Post-Translational Modification (PTM) Identification
DDA facilitates the confident identification of modified peptides, such as phosphorylated and acetylated species, thereby providing data support for interrogating protein function and signaling-pathway regulation.
4. Feasibility of Constructing Sample-Specific Spectral Libraries
Relative to general-purpose public libraries, DDA is better suited for extracting signals directly from experimental samples to build project-level libraries with high coverage and strong biological relevance. This is especially useful for studies involving disease tissues and non-model organisms.
Limitations of DDA and Potential Solutions
1. Bias Toward High-Abundance Species
Because DDA commonly relies on a Top N selection strategy, low-abundance species may be undersampled or missed, leading to incomplete coverage of key molecular features in the resulting spectral library.
Potential solutions:
2. Relatively Limited Reproducibility
DDA involves stochastic precursor selection; thus, different precursors may be selected across technical replicates, which can compromise spectral library completeness.
Potential solutions:
How to order?
