Estimate effect size of over-representation
estimate_overrep.RdThis is a crude function to estimate the effect size of
over-representation i.e. we know a feature is over-represented, but we want to
estimate the effect size/how over-represented it is. This function is typically
run after get_enriched_go but will work from the output of goseq,
regardless whether the functional enrichment tested was for GO terms.
Arguments
- obj
data.framecontaininggoseqresults as generated byget_enriched_goorgoseq.- pwf
data.frameas used inget_enriched_goorgoseq.- gene2cat
data.frameas used inget_enriched_goorgoseq.
Value
Returns obj with an extra column added called adj_overrep. This
column is calculated for each GO term by:
numDEInCat / numInCat / (avgTermWeight / avgNonTermWeight) / (totalDEFeatures / totalFeatures)
where:
numDEInCatis the number of differentially expressed genes (aka. proteins) assigned to that GO term.numInCatis the total number of genes (aka. proteins) annotated to that GO term.avgTermWeightis the averagepwf$pwfvalue for all the differentially expressed genes that were assigned to that GO term.avgNonTermWeightis the averagepwf$pwffor all the other genes supplied inpwf.totalDEFeaturesis the total number of differentially expressed genes indicated inpwf.totalFeaturesis the total number of genes indicated inpwf, i.e. the number of rows.