Normalize tags before saving #159

WyohKnott · 2018-12-09T16:01:11Z

I have many post tagged with real names in which sometimes i used upper case letters and sometimes I did not. Now the issue is that for tags index pages, these tags are differents: for example "Todd Hido" is different from "todd hido". Would it be possible to normalize every tag beforehand, by making them all lower case?

aspensmonster · 2018-12-11T05:39:37Z

Fuzzy matching might help in this case. You could have any tags with similar enough strings get grouped, and still expose the underlying tags themselves.

toddhido
- Todd Hido
- todd hido
- ToddHido
- ToddHida
someOtherTagWithNoFuzzyMatches

Though I can't think immediately of a decent way to test all combinations within the tag set. Tags can be quite diverse on tumblr.

thisismycontributionaccount · 2018-12-18T21:57:15Z

I was having a similar problem with "/" in tags as well as upper and lower case tags. So I modified the tumblr_backup.py to quote and lower case tags. Let me run a few more tests and I will try to add the code.

…iewing This is for issue bbolli#159 . I was having a similar issue with special characters as well as with tag upper/lower case. I have added three new options and the code to implement the options. --normalize-tags - sets the text to lower case and creates a unique set to remove duplicates --escape-tags - uses urllib.quote_plus to escape special characters in the tags --fix-for-disk - adds an extra urllib.quote_plus when the urls are being built to account for browsing from disk weirdness in windows

I updated the documentation with the three new options I added for issue bbolli#159

thisismycontributionaccount mentioned this issue Dec 19, 2018

added options to normalize text, escape text, and optimize for disk v… #196

Open

thisismycontributionaccount added a commit to thisismycontributionaccount/tumblr-utils that referenced this issue Dec 19, 2018

updated documentation with the three new options

0265ea6

I updated the documentation with the three new options I added for issue bbolli#159

thisismycontributionaccount mentioned this issue Dec 19, 2018

updated documentation with the three new options thisismycontributionaccount/tumblr-utils#1

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Normalize tags before saving #159

Normalize tags before saving #159

WyohKnott commented Dec 9, 2018

aspensmonster commented Dec 11, 2018

thisismycontributionaccount commented Dec 18, 2018

Normalize tags before saving #159

Normalize tags before saving #159

Comments

WyohKnott commented Dec 9, 2018

aspensmonster commented Dec 11, 2018

thisismycontributionaccount commented Dec 18, 2018