Bases: object
Compare gene expression for two samples
Plots a scatter-plot of sample1 vs sample2, taken from df. Calculates differentially expressed genes with a Z-test from the closest (local_fraction * 100)% points. Stores result from statistical calculations in self.result_
Parameters: | sample1_name : str
sample2_name : str
df : pandas.DataFrame
p_value_cutoff : float, optional
local_fraction : float, optional
bonferonni : bool, optional
fdr : ???, optional
dtype : str, optional
|
---|
Benjamini-Hochberg correction for multiple hypothesis testing
From: http://udel.edu/~mcdonald/statmultcomp.html One good technique for controlling the false discovery rate was briefly mentioned by Simes (1986) and developed in detail by Benjamini and Hochberg (1995). Put the individual P-values in order, from smallest to largest. The smallest P-value has a rank of i=1, the next has i=2, etc. Then compare each individual P-value to (i/m)Q, where m is the total number of test and Q is the chosen false discovery rate. The largest P-value that has P<(i/m)Q is significant, and all P-values smaller than it are also significant.
Parameters: | p_values : list
fdr : float, optional
|
---|---|
Returns: | sigs : numpy.array
|