insert_all/upsert_implementation using MERGE #869

mgrunberg · 2021-04-07T17:15:25Z

This PR resolves issue #859. Because it adds support for insert_on_duplicate_skip, it also resolves #847.

The PR is at an early stage. I'm exploring adding supports_insert_on_duplicate_skip first (insert_all); upsert_all is still pending.

At this stage, the following test is failing with ActiveRecord::RecordNotUnique: TinyTds::Error: Cannot insert duplicate key row in object 'dbo.books' with unique index 'index_books_on_author_id_and_name'. The duplicate key value is (8, Refactoring).

def test_insert_all_with_skip_duplicates_and_autonumber_id_given
  skip unless supports_insert_on_duplicate_skip?

  assert_difference "Book.count", 1 do
    Book.insert_all [
      { id: 200, author_id: 8, name: "Refactoring" },
      { id: 201, author_id: 8, name: "Refactoring" }
    ]
  end
end

The test produces the following query:

SET IDENTITY_INSERT [books] ON;
MERGE INTO [books] WITH (UPDLOCK, HOLDLOCK) AS target 
USING (SELECT DISTINCT * FROM (VALUES (200, 8, N'Refactoring'), (201, 8, N'Refactoring')) AS t1 ([id],[author_id],[name])) AS source 
ON (target.[author_id] = source.[author_id] AND target.[name] = source.[name]) OR (target.[id] = source.[id]) 
WHEN NOT MATCHED BY TARGET THEN 
  INSERT ([id],[author_id],[name]) VALUES (source.[id], source.[author_id], source.[name]) 
OUTPUT INSERTED.[id];
SET IDENTITY_INSERT [books] OFF;

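SQL Server computes the join between source and target first, and then applies the conditions to decide whether each row of the joined table matches. Neither source row matches an existing target row at that point, so both are treated as NOT MATCHED and both are inserted, which violates the unique index on [author_id, name].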
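To make that behaviour concrete, here is a minimal pure-Ruby simulation. It is not the adapter's code: `simulate_merge_insert` and the in-memory arrays are hypothetical stand-ins, and the sketch assumes, as described above, that every source row is compared with the target as it existed before the statement ran.

```ruby
# Hypothetical simulation of MERGE ... WHEN NOT MATCHED BY TARGET THEN INSERT.
# Every source row is compared with a snapshot of the target taken before the
# statement runs, so rows inserted by the same statement are never seen.
def simulate_merge_insert(target, source)
  snapshot = target.dup

  not_matched = source.reject do |s|
    snapshot.any? do |t|
      (t[:author_id] == s[:author_id] && t[:name] == s[:name]) || t[:id] == s[:id]
    end
  end

  # Mimic the unique index on [author_id, name]: the insert step fails if the
  # rows to be inserted collide with each other or with existing rows.
  keys = (snapshot + not_matched).map { |r| [r[:author_id], r[:name]] }
  raise "Cannot insert duplicate key row" if keys.uniq.size != keys.size

  target.concat(not_matched)
end

books = []
rows = [
  { id: 200, author_id: 8, name: "Refactoring" },
  { id: 201, author_id: 8, name: "Refactoring" }
]

begin
  simulate_merge_insert(books, rows)
rescue RuntimeError => e
  puts e.message
end
```

In this simulation both rows pass the NOT MATCHED check, because the comparison never sees the first row as already inserted, and the unique-index check then rejects the pair.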

I'm starting to doubt whether insert_all can be implemented using MERGE.

upsert_all seems more challenging. Besides this problem, WHEN MATCHED can only update a given row once.

For reference, the table schema is:

create_table :books, id: :integer, force: true do |t|
  default_zero = { default: 0 }
  t.references :author
  t.string :format
  t.column :name, :string
  t.column :status, :integer, **default_zero
  t.column :read_status, :integer, **default_zero
  t.column :nullable_status, :integer
  t.column :language, :integer, **default_zero
  t.column :author_visibility, :integer, **default_zero
  t.column :illustrator_visibility, :integer, **default_zero
  t.column :font_size, :integer, **default_zero
  t.column :difficulty, :integer, **default_zero
  t.column :cover, :string, default: "hard"
  t.string :isbn, **case_sensitive_options
  t.datetime :published_on
  t.index [:author_id, :name], unique: true
  t.index :isbn, where: "published_on IS NOT NULL", unique: true
end

gisborne · 2021-04-12T19:29:13Z

lib/active_record/connection_adapters/sqlserver/database_statements.rb

+
+            sql = +""
+            sql << "SET IDENTITY_INSERT #{insert.model.quoted_table_name} ON;" if includes_primary_key
+            sql << "MERGE INTO #{insert.model.quoted_table_name} WITH (UPDLOCK, HOLDLOCK) AS target"


I believe HOLDLOCK is equivalent to SERIALIZABLE. I'd prefer the SQL standard term. Unless HOLDLOCK is usual in the SQL Server community?

mgrunberg · 2021-04-13

I will check that. Thanks!
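For comparison, a small sketch of how the hint could be spelled with the standard term. `merge_target_clause` is a hypothetical helper, not part of the adapter; it only assumes that SQL Server accepts SERIALIZABLE as a table hint with the same meaning as HOLDLOCK.

```ruby
# Hypothetical helper: builds the MERGE target clause with explicit table hints.
def merge_target_clause(quoted_table_name, hints: %w[UPDLOCK SERIALIZABLE])
  "MERGE INTO #{quoted_table_name} WITH (#{hints.join(', ')}) AS target"
end

puts merge_target_clause("[books]")
puts merge_target_clause("[books]", hints: %w[UPDLOCK HOLDLOCK])
```

Keeping the hints in one argument would make it easy to switch spellings once the preferred term is settled.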

gisborne · 2021-04-12T19:29:58Z

lib/active_record/connection_adapters/sqlserver/database_statements.rb

@@ -140,6 +140,37 @@ def default_insert_value(column)
        private :default_insert_value

        def build_insert_sql(insert) # :nodoc:
+          if insert.skip_duplicates?
+            # Do we have a unique_by index? Use index columns
+            conflict_columns = if (unique_by = insert.send(:insert_all).unique_by)


Can we not do insert.insert_all?

Suggest we cache insert_all in a local variable rather than have to look it up each time.

mgrunberg · 2021-04-13

insert_all is private, so it's not possible to call insert.insert_all.
Thanks for the suggestion, but I'm not paying attention to things like this yet. As I said, the PR is at an early stage. I plan to refactor the code later; right now I'm focused on producing SQL that passes all the tests.
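A sketch of the caching suggestion, using stand-in classes because ActiveRecord is not loaded here. `FakeInsertAll`, `FakeBuilder`, `conflict_columns_for` and the fallback to `keys` are all hypothetical; the only thing taken from the diff is that insert_all is reached through `send` because the reader is private.

```ruby
# Stand-in classes for illustration only; the real objects come from
# ActiveRecord and are not loaded here.
FakeInsertAll = Struct.new(:unique_by, :keys)

class FakeBuilder
  def initialize(insert_all)
    @insert_all = insert_all
  end

  private

  attr_reader :insert_all
end

# Hypothetical helper: go through the private reader once, then reuse the
# local variable for every later lookup.
def conflict_columns_for(insert)
  insert_all = insert.send(:insert_all)
  insert_all.unique_by || insert_all.keys # fallback is an assumption
end

builder = FakeBuilder.new(FakeInsertAll.new(%w[author_id name], %w[id]))
p conflict_columns_for(builder)
```

Calling `builder.insert_all` directly raises NoMethodError in this sketch, which mirrors the reason given above for using `send`.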

gisborne · 2021-04-12T19:41:21Z

lib/active_record/connection_adapters/sqlserver/database_statements.rb

+            sql = +""
+            sql << "SET IDENTITY_INSERT #{insert.model.quoted_table_name} ON;" if includes_primary_key
+            sql << "MERGE INTO #{insert.model.quoted_table_name} WITH (UPDLOCK, HOLDLOCK) AS target"
+            sql << " USING (SELECT DISTINCT * FROM (#{insert.values_list}) AS t1 (#{insert.send(:columns_list)})) AS source"


None of the standard adapters uniquify insert rows. I don't think we need to do it here either.

mgrunberg · 2021-04-13

Uniquifying the rows was my solution to this test/scenario:

Book.insert_all [ { author_id: 8, name: "Refactoring" }, { author_id: 8, name: "Refactoring" } ]

I don't like it, but it works, so I moved on until all the tests pass.

Then I hit the failing test (I'm still stuck on it):

Book.insert_all [ { id: 200, author_id: 8, name: "Refactoring" }, { id: 201, author_id: 8, name: "Refactoring" } ]

My impression is that the solution to this one will let me remove the uniquify step.
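The difference between the two scenarios can be sketched in plain Ruby. `dedupe_by` is a hypothetical helper, not something in the PR: it keeps a row only if none of its unique-key values has been seen in an earlier kept row, and it ignores rows that already exist in the table.

```ruby
require "set"

# Hypothetical helper: keep a row only when none of its unique-key values
# was already taken by an earlier kept row in the same batch.
def dedupe_by(rows, *unique_keys)
  seen = unique_keys.map { Set.new }

  rows.select do |row|
    values = unique_keys.map { |columns| row.values_at(*columns) }
    next false if values.each_with_index.any? { |value, i| seen[i].include?(value) }

    values.each_with_index { |value, i| seen[i] << value }
    true
  end
end

rows = [
  { id: 200, author_id: 8, name: "Refactoring" },
  { id: 201, author_id: 8, name: "Refactoring" }
]

# Whole-row distinctness, as with SELECT DISTINCT *, keeps both rows
# because the ids differ.
p rows.uniq.size

# Distinctness per unique key keeps only the first row.
p dedupe_by(rows, [:author_id, :name], [:id]).map { |row| row[:id] }
```

The first call prints 2 and the second prints [200], which is why whole-row DISTINCT helps the first scenario but not the one with explicit ids.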

justinko · 2023-12-28T06:36:57Z

I think MERGE is still appropriate for this ... it just doesn't support duplicates in the "source". More info on that here: https://www.ibm.com/docs/en/informix-servers/14.10?topic=statement-handling-duplicate-rows

Would a gem (e.g. activerecord-sqlserver-adapter-insert-all) make sense as a replacement for this pull request? There would, of course, be a stated caveat that the source/args for insertion cannot contain duplicates.
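One way such a caveat could be enforced rather than only documented is a guard that runs before any SQL is built. This is a sketch under assumptions: `assert_no_duplicate_rows!` is a hypothetical name, and the columns to check would have to come from the table's unique indexes.

```ruby
# Hypothetical guard: refuse input whose rows collide on the given columns.
def assert_no_duplicate_rows!(rows, columns)
  duplicates = rows.group_by { |row| row.values_at(*columns) }
                   .select { |_key, group| group.size > 1 }
                   .keys
  return rows if duplicates.empty?

  raise ArgumentError, "duplicate values for #{columns.inspect}: #{duplicates.inspect}"
end

rows = [
  { id: 200, author_id: 8, name: "Refactoring" },
  { id: 201, author_id: 8, name: "Refactoring" }
]

begin
  assert_no_duplicate_rows!(rows, [:author_id, :name])
rescue ArgumentError => e
  puts e.message
end
```

Raising early would turn the database-level RecordNotUnique error into a clearer message about the input, at the cost of one extra pass over the rows.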

mgrunberg · 2025-03-20T13:54:22Z

The feature was implemented in #1312

add skip_duplicates support. Pending refactor

06b04dc

mgrunberg mentioned this pull request Apr 7, 2021

Check if insert_all/upsert_all can be implemented using MERGE #859

Closed

gisborne reviewed Apr 12, 2021

wpolicarpo added feature rails-6.1 rails-6.0 labels Apr 13, 2021

justinko mentioned this pull request Dec 30, 2023

WIP: Add MSSQL support jonahgeorge/solid_queue#1

Closed

3 tasks

aidanharan added need-info rails-6.1 and removed rails-6.0 rails-6.1 labels Feb 10, 2025

andyundso mentioned this pull request Mar 16, 2025

Support insert_all and upsert_all using MERGE #1312

Merged

mgrunberg closed this Mar 20, 2025
