Skip to content

Releases: DocNow/twarc

v2.3.4

02 Jul 01:34
6223f85
Compare
Choose a tag to compare

More code to handle ChunkedEncodingError in stream as well as sample. Logging was updated to include tweet ids that have been collected.

v2.3.3

01 Jul 19:01
cea8138
Compare
Choose a tag to compare

This release adds functionality to twarc.Twarc2 to catch any requests exception when reading from recent or all stream. See #471 for background.

v2.3.2

01 Jul 18:19
2e6fa20
Compare
Choose a tag to compare

This release will hopefully allow catching of ChunkedEncodingError exceptions that have been observed intermittently in the wild. #471

v2.3.1

25 Jun 00:30
99d7add
Compare
Choose a tag to compare

A bugfix release to allow the counts command to write to a file without using stdout redirection:

twarc2 counts blm blm.jsonl

v2.3.0

24 Jun 19:55
8780d09
Compare
Choose a tag to compare

This release includes a new command for retrieving data from the new counts API.

Usage: twarc2 counts [OPTIONS] QUERY [OUTFILE]

  Return counts of tweets matching a query.

Options:
  --since-id INTEGER              Count tweets sent after tweet id
  --until-id INTEGER              Count tweets sent prior to tweet id
  --start-time [%Y-%m-%d|%Y-%m-%dT%H:%M:%S]
                                  Count tweets created after UTC time (ISO
                                  8601/RFC 3339), e.g.  2021-01-01T12:31:04
  --end-time [%Y-%m-%d|%Y-%m-%dT%H:%M:%S]
                                  Count tweets sent before UTC time (ISO
                                  8601/RFC 3339)
  --archive                       Count using the full archive (requires
                                  Academic Research track)
  --granularity [day|hour|minute]
                                  Aggregation level for counts. Can be one of:
                                  day, hour, minute. Default is hour.
  --limit INTEGER                 Maximum number of days of results to save
                                  (minimum is 30 days)
  --text                          Output the counts as human readable text
  --csv                           Output counts as CSV
  --help                          Show this message and exit.

v2.2.1

24 Jun 12:30
a1974df
Compare
Choose a tag to compare

This release includes improved reading of users from files for the timelines subcommand, and some internal changes to the way that flattening works.

v2.2.0

21 Jun 01:40
ca2c2cb
Compare
Choose a tag to compare

This release adds the --exclude-retweets and --exclude-replies commands to the timeline and timelines subcommands. Behind the scenes all the source code has been reformatted using black.

v2.1.8

17 Jun 16:37
5d3f934
Compare
Choose a tag to compare

This release fixes a bug in twarc timelines when using a file of user ids instead of collected tweets #484

twarc2 timelines user-ids.txt > tweets.jsonl

Thanks @JanaLasser for the bug report!

v2.1.7

14 Jun 10:25
0c73273
Compare
Choose a tag to compare

This release includes a bug fix for when the configuration file's directory does not exist. This functionality was accidentally removed in v2.1.5 and has been reinstated.

v2.1.6

14 Jun 10:23
327cfce
Compare
Choose a tag to compare

This release includes some new code to handle Operational Disconnect error messages from Twitter when streaming data. twarc will log the message, sleep for 5 seconds and then attempt to reconnect.