Proteomic datasets of HeLa and SiHa cell lines acquired by DDA-PASEF and diaPASEF

Huang, Zelu, et al. “Proteomic datasets of HeLa and SiHa cell lines acquired by DDA-PASEF and diaPASEF.” Data in Brief 41 (2022): 107919. https://doi.org/10.1016/j.dib.2022.107919

Abstract

We present four datasets on proteomics profiling of HeLa and SiHa cell lines associated with the research described in the paper “PROTREC: A probability-based approach for recovering missing proteins based on biological networks”. Proteins in each cell line were acquired by two different data acquisition methods. The first was Data Dependent Acquisition-Parallel Accumulation Serial Fragmentation (DDA-PASEF) and the second was Parallel Accumulation-Serial Fragmentation combined with data-independent acquisition (diaPASEF). Protein assembly was performed following search against the Swiss-Prot Human database using Peaks Studio for DDA datasets and Spectronaut for DIA datasets. The assembled result contains identified PSMs, peptides and proteins that are above threshold for each HeLa and SiHa sample. Coverage-wise, for DDA-PASEF, approximately 6,090 and 7,298 proteins were quantified for HeLa and SiHA sample, while13,339 and 8,773 proteins were quantified by diaPASEF for HeLa for SiHa sample, respectively. Consistency-wise, diaPASEF has fewer missing values (∼ 2%) compared to its DDA counterparts (∼5–7%). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the iProX partner repository with the dataset identifier PXD029773.