Skip to content

Releases: cligs/pyzeta

Pre-Cologne Rewrite

17 Aug 09:24
Compare
Choose a tag to compare

This is essentially a rewrite of some core elements of pyzeta. Notably, the way the document-term-matrix is built has been rebuilt, providing a DTM with absolute feature counts first, then deriving DTMs with relative frequencies and binary feature counts. This is then used as the basis for calculating various versions of Zeta in one go. Visualizations remains essentially the same, with a few experimental visualizations being omitted for the time being.

Sample input and random partitions

10 Mar 21:16
Compare
Choose a tag to compare

Release 0.4.0 brings the following enhancements:

  • There is now a separate, dedicated how-to document with hints on usage.
  • There is now some sample input to make it easier to see what input data is needed and in what form
  • A new feature has been introduced: randomly splitting the text collection into two partitions, in order to get a sense of the baseline contrasts to be expected for a given collection.

Analyses with three groups.

17 Dec 12:52
Compare
Choose a tag to compare

Analyses with three groups, based on per-text proportions of distinctive features.

First complete version after rewrite.

13 Dec 09:57
Compare
Choose a tag to compare

First complete version after rewrite.

Complete rewrite.

12 Dec 21:56
Compare
Choose a tag to compare

Complete rewrite with separate preprocessing, increased use of pandas.

Initial release.

11 Dec 10:34
Compare
Choose a tag to compare
Initial release. Pre-release
Pre-release

Initial release.

Very early release.

11 Dec 16:05
Compare
Choose a tag to compare
Very early release. Pre-release
Pre-release

Very early release. Added PCA based on distinctive features for three groups of texts.