Abrandoned/decoding setters without copy #297

abrandoned · 2016-02-08T03:03:31Z

Start moving towards per-field decoding setters instead of using the standard setter during the decode routine. The standard setter copies the incoming value so that setting it never mutates the caller's object; during decode we have already made a copy by pulling the value off the stream, so that second copy is redundant. The decoding setters remove the extra copy and, where we can, skip the acceptable? checks, since a value that has been serialized and is tagged should already be acceptable.
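A minimal sketch of the difference described above, assuming a stand-in message class with a single string field. `SketchMessage` and `_decode_set_name` are hypothetical names for illustration and are not the gem's actual classes or methods.

```ruby
# Hypothetical stand-in for a message with one string field.
class SketchMessage
  def initialize
    @values = {}
  end

  # Standard setter: validate, then copy, so later mutation of the caller's
  # string cannot leak into the message.
  def name=(value)
    raise TypeError, "expected String" unless value.is_a?(String)
    @values[:name] = value.dup
  end

  # Decode-time setter (assumed shape): the decoder already owns a fresh
  # string read from the stream, so store it with no check and no copy.
  def _decode_set_name(value)
    @values[:name] = value
  end

  def name
    @values[:name]
  end
end

incoming = "abc"
msg = SketchMessage.new
msg.name = incoming
copied = !msg.name.equal?(incoming)      # standard setter stored a copy

decoded_bytes = "xyz"
msg._decode_set_name(decoded_bytes)
shared = msg.name.equal?(decoded_bytes)  # decode setter stored the same object
```

Here `copied` and `shared` are both true: the standard path allocates a second string, while the decode path reuses the one that came off the stream.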

Also started adding setters directly on the message object by tag, instead of doing a field lookup and then going through the name of the field. This reduces the amount of searching for field definitions and will eventually lead to a deserialization routine that uses only tags (instead of field lookup).
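The idea of tag-keyed setters can be sketched roughly as follows. `TaggedSketch`, its `FIELDS` table, and the generated method names are assumptions made for this example; the PR's real generated names may differ.

```ruby
# Hypothetical miniature of a message class with two fields keyed by tag.
class TaggedSketch
  FIELDS = { 1 => :id, 2 => :name }.freeze

  def initialize
    @values = {}
  end

  # Lookup path: resolve the tag to a field name on every call, then
  # dispatch to the name-based setter.
  def set_by_lookup(tag, value)
    field_name = FIELDS.fetch(tag)
    public_send("#{field_name}=", value)
  end

  FIELDS.each do |tag, field_name|
    define_method("#{field_name}=") { |value| @values[field_name] = value }
    define_method(field_name) { @values[field_name] }
    # Per-tag path: the field name is captured in the closure when the class
    # is defined, so decoding only needs the tag.
    define_method("_decode_setter_#{tag}") { |value| @values[field_name] = value }
  end
end

msg = TaggedSketch.new
msg.set_by_lookup(1, 42)          # tag -> field name -> setter
msg._decode_setter_2("decoded")   # tag -> setter directly
```

The per-tag method trades one extra generated method per field for skipping the hash lookup and string-built dispatch on every decoded value.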

On MRI and JRuby these changes accounted for a ~20% decrease in serialization/deserialization time when running this command:
bx rake benchmark:profile_protobuf_serialize[100000,prof_out.txt]

@film42


film42 · 2016-02-08T03:40:53Z

lib/protobuf/field/base_field.rb

@@ -234,6 +235,10 @@ def define_getter
        ::Protobuf.field_deprecator.deprecate_method(message_class, method_name) if field.deprecated?
      end

+      def define_decode_setter
+        # empty for now


Only a few variants of this method. Let's bring the duplicated common implementation here.

probably best, I wrote this while I was testing the string copy only and now need to refactor since a common pattern has emerged


abrandoned · 2016-02-09T01:18:38Z

@zachmargolis would love to get your thoughts on this path

zachmargolis · 2016-02-09T19:07:41Z

lib/protobuf/field/varint_field.rb

@@ -52,6 +52,7 @@ def self.encode(value, use_cache = true)
      #

      def acceptable?(val)
+        return true if val.is_a?(Integer) && val >= 0 && val < INT32_MAX


we still want to compare to self.class.max right?

I understand wanting to avoid coerce! for performance reasons but maybe there's a better way to preserve the existing logic?

Just an idea

    def acceptable?(val)
      int_val = if val.is_a?(Integer)
                  val
                else
                  coerce!(val)
                end
      int_val >= self.class.min && int_val <= self.class.max
    end

We do still want to compare against the max; that happens after this line whenever the value is larger than the smallest max. The smallest max is INT32_MAX, so if the value is an Integer below it (the most common case, especially in deserialization), we just accept it instead of taking the long route, which in our usage is very uncommon.

I guess as a reader of this patch, I'm having a hard time following what it's supposed to do. I didn't realize it would fall through to the old case. Maybe we could do something like this (comments included) for posterity?

    def acceptable?(val)
      int_val = if val.is_a?(Integer)
                  # return quickly for smallest integer size, hot code path
                  return true if val >= 0 && val < INT32_MAX
                  val
                else
                  coerce!(val)
                end
      int_val >= self.class.min && int_val <= self.class.max
    end
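To make the fall-through behavior concrete, here is a self-contained sketch built around the suggestion above. `SketchVarintField`, its bounds, and the simplified `coerce!` are assumptions for illustration; the gem's real field classes define their own `min`, `max`, and coercion.

```ruby
# Hypothetical field class; bounds and coerce! are simplified assumptions.
class SketchVarintField
  INT32_MAX = 2**31 - 1
  INT64_MAX = 2**63 - 1

  def self.min
    0
  end

  def self.max
    INT64_MAX
  end

  # Simplified coercion: raises on values that cannot become an Integer.
  def coerce!(val)
    Integer(val)
  end

  def acceptable?(val)
    int_val = if val.is_a?(Integer)
                # Hot path: small non-negative Integers fit every varint type.
                return true if val >= 0 && val < INT32_MAX
                val
              else
                coerce!(val)
              end
    # Slow path: compare against this field type's own bounds.
    int_val >= self.class.min && int_val <= self.class.max
  end
end

field = SketchVarintField.new
fast   = field.acceptable?(5)       # true, returned by the hot path
large  = field.acceptable?(2**40)   # true, falls through to the bounds check
huge   = field.acceptable?(2**70)   # false, above this sketch's max
minus  = field.acceptable?(-1)      # false, below this sketch's min
string = field.acceptable?("12")    # true, goes through coerce!
```

Only the first call exits early; every other input still reaches the min/max comparison, which is the point raised in the review.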

embark · 2016-02-23T22:54:10Z

lib/protobuf/field/base_field.rb

        method_name = field.getter

        message_class.class_eval do
          define_method(method_name) do
-            @values[field.name] ||= ::Protobuf::Field::FieldArray.new(field)
+            @values[field_name] ||= ::Protobuf::Field::FieldArray.new(field)


Heads up that a lot of this work will need to be moved to the []= and [] methods after PR #302 is merged. In fact, a lot of the setter/getter code in this PR will conflict; I comment on just a little of it below.

embark · 2016-02-23T23:23:14Z

👍 great optimization!

How much benefit do we get from creating the dynamic per-tag methods vs. having one method in the message that requires a field lookup first?

Just curious if the optimization is worth the complexity tradeoff of a bunch of new dynamically generated methods
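One way to put a number on that question is a small timing harness like the one below. It is only a sketch: `LookupVsTag`, `set_with_lookup`, `set_tag_1`, and `time_it` are hypothetical names, and the result on a toy class says nothing definitive about the gem itself.

```ruby
# Hypothetical harness comparing a lookup-based setter with a per-tag setter.
class LookupVsTag
  FIELD_NAMES = { 1 => :id }.freeze

  def initialize
    @values = {}
  end

  def id=(value)
    @values[:id] = value
  end

  def id
    @values[:id]
  end

  # One shared method: tag -> field name -> name-based setter.
  def set_with_lookup(tag, value)
    public_send("#{FIELD_NAMES.fetch(tag)}=", value)
  end

  # One method per tag: no lookup at call time.
  def set_tag_1(value)
    @values[:id] = value
  end
end

# Runs the block `iterations` times and returns elapsed seconds.
def time_it(iterations)
  started = Process.clock_gettime(Process::CLOCK_MONOTONIC)
  iterations.times { |i| yield i }
  Process.clock_gettime(Process::CLOCK_MONOTONIC) - started
end

msg = LookupVsTag.new
lookup_seconds  = time_it(100_000) { |i| msg.set_with_lookup(1, i) }
per_tag_seconds = time_it(100_000) { |i| msg.set_tag_1(i) }
puts format("lookup: %.4fs, per-tag: %.4fs", lookup_seconds, per_tag_seconds)
```

The timings vary by machine and Ruby implementation, so the comparison is best repeated on both MRI and JRuby before weighing it against the added complexity.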

abrandoned added 2 commits February 7, 2016 19:41

add a set of decoding setters for each type that removes the extra co…

22909fb

…py and starts adding setters by tag so we do not need the get_field lookup when we have a tag

account for varint types that nee to run through decode and check if …

7637d60

…field is Integer in enum before going through coerce! routine

film42 reviewed Feb 8, 2016

abrandoned added 4 commits February 7, 2016 20:58

do not need to go through field to get the field.name when running th…

e06f642

…e setter

take out nonzero and rewrite VarintPure and use GraphProfile for Jruby

d096a27

Merge branch 'master' into abrandoned/decoding_setters_without_copy

0a9bfbb

cache type_class when used and make sure we set a valid enum on sette…

481e730

…r decode

zachmargolis reviewed Feb 9, 2016

remove the name setter for decode as tag is only one needed

476844f

embark reviewed Feb 23, 2016

film42 mentioned this pull request Feb 28, 2016

Small optimizations #308

Closed

abrandoned mentioned this pull request Mar 1, 2017

pull in acceptable? optimization for varint and fix rubocop for previ… #356

Merged
