offer randomized contrast #3

christofs · 2017-03-05T13:17:48Z

Offer the option to run a comparison not on a meaningful partitioning of the data, but on a random one, for a better understanding of the level of differences to be expected if the partitioning is not meaningful.

christofs · 2017-03-10T21:23:08Z

Partly implemented.

Based on the current implementation, it would be very interesting to do this multiple times internally and calculate list of zeta score distributions ranked by mean or median, based on such multiple random partitionings of the data, then use this to do significance tests on the zeta scores with meaningfully partitioned data to estimate which zeta scores can actually be considered statistically significant given a certain text collection and partitioning.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

offer randomized contrast #3

offer randomized contrast #3

christofs commented Mar 5, 2017

christofs commented Mar 10, 2017

offer randomized contrast #3

offer randomized contrast #3

Comments

christofs commented Mar 5, 2017

christofs commented Mar 10, 2017