Mathematical Inconsistency: Running Statistics notation and variance calculation #1626
debemdeboas
started this conversation in
General
Replies: 1 comment 1 reply
I may be wrong about this too. Statistics is not my strong suit.
In the Running Statistics subsection of the documentation (From batch to online/stream), there appears to be a mathematical inconsistency in how variance is presented.
Current notation:
The text states that "$\sigma_t$ is the running variance at time $t$", but this is incorrect. The quantity $\frac{s_{t+1}}{n_{t+1}}$, where $s$ is the running sum of squares, represents neither the variance nor the standard deviation; it is simply the sum of squares divided by $n$.
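To make the distinction concrete, here is a minimal sketch in plain Python (not taken from the documentation or the library). It assumes "running sum of squares" means the raw sum $\sum x_i^2$; the function name and the returned keys are hypothetical. It also tracks a Welford-style accumulator of squared deviations from the running mean for comparison, since whether $s/n$ is a variance depends on which of the two $s$ actually denotes.

```python
def running_moments(xs):
    """Single pass over xs, tracking two different "sums of squares".

    Assumption (not from the docs): `sum_sq` is the raw sum of x**2,
    while `m2` is the Welford-style sum of squared deviations.
    """
    n = 0
    total = 0.0   # running sum of x
    sum_sq = 0.0  # running raw sum of x**2
    mean = 0.0    # running mean (Welford update)
    m2 = 0.0      # running sum of squared deviations from the mean

    for x in xs:
        n += 1
        total += x
        sum_sq += x * x
        delta = x - mean          # deviation from the old mean
        mean += delta / n         # updated mean
        m2 += delta * (x - mean)  # old deviation times new deviation

    return {
        # Raw sum of squares divided by n: the mean square, not a variance.
        "mean_square": sum_sq / n,
        # Population variance recovered from the raw sums.
        "variance_from_raw_sums": sum_sq / n - (total / n) ** 2,
        # Population variance from the deviation-based accumulator.
        "variance_welford": m2 / n,
    }


result = running_moments([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])
```

For this sample the mean is 5, the raw sum of squares is 232, and the sum of squared deviations is 32, so the mean square is 29 while the population variance is 4. Under the raw-sum reading, $s/n$ is therefore not a variance; if $s$ instead accumulates squared deviations from the running mean, $s/n$ does equal the population variance, though it would conventionally be written $\sigma^2$ rather than $\sigma$.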
Proposed correction: