Filter MaxQuant DDA output
filter_features_mq_dda.RdThis function filters the output .txt files (peptide groups or PSMs) from Proteome Discoverer for DDA, based on various criteria:
Remove features without a master protein
Remove features without a unique master protein (i.e. Number.of.Protein.Groups == 1)
Remove features matching a contaminant protein
Remove features matching any protein associated with a contaminant protein (see below)
Remove features without quantification values
Usage
filter_features_mq_dda(
obj,
master_protein_col = "Leading.razor.protein",
protein_col = "Proteins",
unique_master = TRUE,
filter_contaminant = TRUE,
contaminant_proteins = NULL,
filter_associated_contaminant = TRUE,
remove_no_quant = TRUE
)Arguments
- obj
SummarisedExperimentcontaining output from MaxQuant. UsereadQFeaturesto read in .txt file- master_protein_col
string. Name of column containing master proteins.- protein_col
string. Name of column containing all protein matches.- unique_master
logical. Filter out features without a unique master protein.- filter_contaminant
logical. Filter out features which match a contaminant protein.- contaminant_proteins
character vector. The protein IDs form the contaminant proteins- filter_associated_contaminant
logical. Filter out features which match a contaminant associated protein.- remove_no_quant
logical. Remove features with no quantification
Details
Associated contaminant proteins are proteins which have at least one feature shared with a contaminant protein. It has been observed that the contaminant fasta files often do not contain all possible contaminant proteins e.g. some features can be assigned to a keratin which is not in the provided contaminant database.
In the example below, using filter_associated_contaminant = TRUE will filter out f2 and f3 in
addition to f1, regardless of the value in the Leading.razor.protein column.