Optimise sanitize with string builder #2918

trkohler · 2024-02-09T13:15:35Z

not a big difference, but on large query strings it's noticeable.
can be minor and safe enhancement?

go test -benchmem -run=^$ -bench ^BenchmarkStripQuery$ -benchtime=10s github.com/zalando/skipper/filters/builtin

BenchmarkStripQuery/[old_sanitize]_url_1-8              85732707               118.6 ns/op            80 B/op          4 allocs/op
BenchmarkStripQuery/[old_sanitize]_url_2-8              14885370               808.2 ns/op           472 B/op         15 allocs/op
BenchmarkStripQuery/[old_sanitize]_url_3-8              73297474               162.9 ns/op           120 B/op          4 allocs/op
BenchmarkStripQuery/[old_sanitize]_url_4-8              100000000              114.1 ns/op            80 B/op          4 allocs/op


BenchmarkStripQuery/[new_sanitize]_url_1-8              100000000              103.5 ns/op            24 B/op          3 allocs/op
BenchmarkStripQuery/[new_sanitize]_url_2-8              15854868               778.5 ns/op           192 B/op         14 allocs/op
BenchmarkStripQuery/[new_sanitize]_url_3-8              71689101               166.3 ns/op            64 B/op          4 allocs/op
BenchmarkStripQuery/[new_sanitize]_url_4-8              100000000              102.0 ns/op            24 B/op          3 allocs/op

AlexanderYastrebov · 2024-02-09T14:16:49Z

filters/builtin/stripquery.go

@@ -51,7 +51,8 @@ func validHeaderFieldByte(b byte) bool {
 }

 // make sure we don't generate invalid headers
-func sanitize(input string) string {
+// temporary public function to benchmark it


There is no need to export it as benchmarks sits in the same package.
Usually we introduce benchmark as a first commit and include baseline benchmark results in the commit message.
Then change implementation, re-run benchmark and compare results to the baseline via https://pkg.go.dev/golang.org/x/perf/cmd/benchstat

To get a statistically meaningful benchmark results it should be run several times, usually we run 10.
See this PR as an example of performance-related change #2870

I've also created a small script to automate benchmarking two commits (its important that benchmark function is added in a separate commit to establish the baseline), see golang/go#63233 (comment)

AlexanderYastrebov · 2024-02-09T14:19:00Z

filters/builtin/stripquery.go

+	toAscii := strconv.QuoteToASCII(input)
+	var s strings.Builder
+	for _, i := range toAscii {
+		if validHeaderFieldByte(byte(i)) {
+			s.WriteRune(i)
+		}
+	}
+	return s.String()


To get ultimate performance we can use byte version https://pkg.go.dev/strconv#AppendQuoteToASCII here and then replace invalid bytes with - instead of allocating another slice.

AlexanderYastrebov · 2024-02-09T14:20:50Z

filters/builtin/stripquery_test.go

+	}
+
+	for i, v := range table {
+		url, _ := url.ParseRequestURI(v.url)


This is redundant, test cases should contain query string instead of url

AlexanderYastrebov · 2024-02-09T14:21:27Z

filters/builtin/stripquery_test.go

+	for i, v := range table {
+		url, _ := url.ParseRequestURI(v.url)
+		q := url.Query()
+		b.Run(fmt.Sprintf("[new sanitize] url %d", i + 1), func(b *testing.B) {
+			for i := 0; i < b.N; i++ {
+				for k := range q {
+					NewSanitize(k)
+				}
+			}
+		})
+	}


There is no need to have a second benchmark, see comment above.

AlexanderYastrebov · 2024-02-09T14:21:29Z

results.md

@@ -0,0 +1,14 @@
+`go test -benchmem -run=^$ -bench ^BenchmarkStripQuery$ -benchtime=10s  github.com/zalando/skipper/filters/builtin`


Please add results to the commit message as described in the comment above.

AlexanderYastrebov · 2024-02-09T14:22:35Z

filters/builtin/stripquery_test.go

+	for i, v := range table {
+		url, _ := url.ParseRequestURI(v.url)
+		q := url.Query()
+		b.Run(fmt.Sprintf("[old sanitize] url %d", i+1), func(b *testing.B) {


Makes sense to add b.ReportAllocs() instead of -benchmem flag

results

c3a4e7c

trkohler added the minor no risk changes, for example new filters label Feb 9, 2024

AlexanderYastrebov reviewed Feb 9, 2024

View reviewed changes

trkohler closed this Feb 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise sanitize with string builder #2918

Optimise sanitize with string builder #2918

trkohler commented Feb 9, 2024

AlexanderYastrebov Feb 9, 2024 •

edited

Loading

AlexanderYastrebov Feb 9, 2024

AlexanderYastrebov Feb 9, 2024

AlexanderYastrebov Feb 9, 2024

AlexanderYastrebov Feb 9, 2024

AlexanderYastrebov Feb 9, 2024

		@@ -0,0 +1,14 @@
		`go test -benchmem -run=^$ -bench ^BenchmarkStripQuery$ -benchtime=10s github.com/zalando/skipper/filters/builtin`

Optimise sanitize with string builder #2918

Optimise sanitize with string builder #2918

Conversation

trkohler commented Feb 9, 2024

AlexanderYastrebov Feb 9, 2024 • edited Loading

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024

Choose a reason for hiding this comment

AlexanderYastrebov Feb 9, 2024 •

edited

Loading