RNA-seq Differential Expression:
Suppose gene X’s expression in condition A is twice its expression in condition B. How reliable is this observation? What is the chance of observing it at random? It all comes down to estimating variation: how do we measure the variance between biological replicates? Once you have a variance estimate, you can assign a p-value to an expression change. Variation can be estimated directly if you have many biological replicates, but in practice we usually have only 2-3. What we need, then, is a proper statistical model.
Sequencing Read Distribution:
1. Poisson distribution: λ = E(X) = Var(X)
The simplest model for RNA-seq read counts is the Poisson distribution.
Assumption: Mean = Variance
But: sequencing data (not only RNA-seq) is over-dispersed (Mean < Variance)
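The Mean = Variance assumption can be checked with the variance-to-mean ratio, which is close to 1 for Poisson counts and above 1 for over-dispersed counts. Below is a minimal sketch on simulated counts, not real sequencing data. The mean count of 20, the gamma shape of 2, the sample size, and the helper names are all hypothetical choices for illustration. The over-dispersed counts are produced by letting the Poisson rate itself vary from sample to sample (a gamma-Poisson mixture), which is one common way to mimic variation between biological replicates.

```python
import math
import random
import statistics


def sample_poisson(lam, rng):
    """Draw one Poisson(lam) count with Knuth's multiplication method (suitable for small lam)."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def dispersion_index(counts):
    """Variance-to-mean ratio: about 1 for Poisson data, above 1 when over-dispersed."""
    return statistics.variance(counts) / statistics.mean(counts)


rng = random.Random(0)   # fixed seed so the run is repeatable
mean_count = 20.0        # hypothetical expected read count for one gene
shape = 2.0              # hypothetical gamma shape; smaller values give more over-dispersion
n_samples = 5000         # far more than a real experiment would have

# Pure Poisson: every sample shares the same rate.
poisson_counts = [sample_poisson(mean_count, rng) for _ in range(n_samples)]

# Gamma-Poisson mixture: the rate itself varies between samples (same mean of 20).
mixed_counts = [
    sample_poisson(rng.gammavariate(shape, mean_count / shape), rng)
    for _ in range(n_samples)
]

poisson_ratio = dispersion_index(poisson_counts)
mixed_ratio = dispersion_index(mixed_counts)
print("Poisson variance/mean:", round(poisson_ratio, 2))
print("Mixture variance/mean:", round(mixed_ratio, 2))
```

With these settings the first ratio should land near 1 and the second well above it, since the mixture has theoretical variance mean + mean²/shape. With only 2-3 replicates the same ratio would be far too noisy to trust on its own.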
2. Negative binomial: X ~ NB(r, p) (2 parameters: r and p)
Definition : number of successe