Class: PG::EM::Client::Helper::Transaction

Inherits:
Object
Includes:
EventMachine::Deferrable, PG::EM::Client::Helper
Defined in:
lib/em-pg-client-helper/transaction.rb

Overview

Represents a database transaction, and contains all of the methods which can be used to execute queries within the transaction.

Defined Under Namespace

Classes: ClosedError

Instance Attribute Summary

Instance Method Summary

Constructor Details

#initialize(conn, opts = {}, &blk) ⇒ Transaction

Create a new transaction. You shouldn't have to call this yourself; db_transaction should create one and pass it to your block.

Parameters:

  • conn (PG::EM::Connection)

    The connection to execute all commands against. If using a connection pool, this connection needs to have been taken out of the pool (using something like #hold_deferred) so that no other concurrently-operating code can accidentally send queries down the connection (that would be, in a word, bad).

  • opts (Hash) (defaults to: {})

    Zero or more optional parameters that adjust the initial state of the transaction. For full details of the available options, see PG::EM::Client::Helper#db_transaction.

Raises:

  • (ArgumentError)

    If an unknown isolation level is specified.
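The mapping from the `:isolation` option to the `BEGIN` statement that opens the transaction, including the `ArgumentError` case, can be sketched as a standalone method (illustrative only; the method name `begin_query_for` is made up, the real logic lives inside the constructor):

```ruby
# Illustrative sketch of how the :isolation and :deferrable options
# select the BEGIN statement that opens the transaction.  An unknown
# isolation level raises ArgumentError, as documented above.
def begin_query_for(opts)
  q = case opts[:isolation]
      when :serializable     then "BEGIN TRANSACTION ISOLATION LEVEL SERIALIZABLE"
      when :repeatable_read  then "BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ"
      when :read_committed   then "BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED"
      when :read_uncommitted then "BEGIN TRANSACTION ISOLATION LEVEL READ UNCOMMITTED"
      when nil               then "BEGIN"
      else
        raise ArgumentError,
              "Unknown value for :isolation option: #{opts[:isolation].inspect}"
      end
  opts[:deferrable] ? q + " DEFERRABLE" : q
end
```

For example, `begin_query_for(isolation: :serializable, deferrable: true)` yields `"BEGIN TRANSACTION ISOLATION LEVEL SERIALIZABLE DEFERRABLE"`.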



# File 'lib/em-pg-client-helper/transaction.rb', line 32

def initialize(conn, opts = {}, &blk)
	@conn                  = conn
	@opts                  = opts
	@finished              = nil
	@retryable             = opts[:retry]
	@autorollback_on_error = true
	@savepoint_stack       = []

	DeferrableGroup.new do |dg|
		@dg = dg

		begin_query = case opts[:isolation]
			when :serializable
				"BEGIN TRANSACTION ISOLATION LEVEL SERIALIZABLE"
			when :repeatable_read
				"BEGIN TRANSACTION ISOLATION LEVEL REPEATABLE READ"
			when :read_committed
				"BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED"
			when :read_uncommitted
				"BEGIN TRANSACTION ISOLATION LEVEL READ UNCOMMITTED"
			when nil
				"BEGIN"
			else
				raise ArgumentError,
				      "Unknown value for :isolation option: #{opts[:isolation].inspect}"
		end

		if opts[:deferrable]
			begin_query += " DEFERRABLE"
		end

		exec(begin_query) do
			begin
				blk.call(self)
			rescue StandardError => ex
				rollback(ex)
			end
		end
	end.callback do
		rollback(RuntimeError.new("txn.commit was not called")) unless @finished
		self.succeed
	end.errback do |ex|
		if @retryable and [PG::TRSerializationFailure].include?(ex.class)
			self.class.new(conn, opts, &blk).callback do
				self.succeed
			end.errback do |ex|
				self.fail(ex)
			end
		else
			self.fail(ex)
		end
	end
end

Instance Attribute Details

#autorollback_on_errorObject

Manage the "rollback on the failure of a single query" behaviour.

The default behaviour of a transaction, when a query fails, is for the transaction to automatically be rolled back and the rest of the statements to not be executed. In ALMOST every case, this is the correct behaviour. However, there are some corner cases in which you want to be able to avoid this behaviour, and will manually react to the transaction failure in some way. In that case, you can set this to false and the transaction will not automatically fail.

Given that pretty much the only thing you can do when a query fails, other than abort the transaction, is to rollback to a savepoint, you might want to look at #savepoint before you try using this.
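The difference between the two behaviours can be modelled with a toy query runner (a pure-Ruby stand-in, not the gem's API): with autorollback on, a failed query ends the run; with it off, later statements still execute.

```ruby
# Toy model of the autorollback_on_error behaviour described above.
# :bad stands in for a query that fails; everything else "succeeds".
def run_queries(queries, autorollback:)
  executed = []
  queries.each do |q|
    if q == :bad
      # With autorollback, a single failure aborts the whole run.
      return [executed, :rolled_back] if autorollback
    else
      executed << q
    end
  end
  [executed, :ok]
end
```

`run_queries([:a, :bad, :b], autorollback: true)` stops after `:a`, while with `autorollback: false` the `:b` statement still runs.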

Since:

  • 2.0.0



# File 'lib/em-pg-client-helper/transaction.rb', line 147

def autorollback_on_error
  @autorollback_on_error
end

Instance Method Details

#bulk_insert(tbl, columns, rows) {|Integer| ... } ⇒ Object

Efficiently perform a "bulk" insert of multiple rows.

When you have a large quantity of data to insert into a table, you don't want to do it one row at a time -- that's really inefficient. On the other hand, if you do one giant multi-row insert statement, the insert will fail if any of the rows causes a constraint failure. What to do?

Well, here's our answer: try to insert all the records at once. If that fails with a constraint violation, then split the set of records in half and try to bulk insert each of those halves. Recurse in this fashion until you only have one record to insert.
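The halving strategy described above can be sketched in pure Ruby. Here `constraint` stands in for the database's constraint check, and `bulk_insert_count` (a made-up name for illustration) returns how many rows would survive:

```ruby
# Pure-Ruby sketch of the recursive halving strategy: try the whole
# batch; on a constraint failure, split in half and recurse, until
# single bad rows are isolated and skipped.
def bulk_insert_count(rows, constraint)
  return 0 if rows.empty?

  if rows.all?(&constraint)
    rows.length                      # whole batch inserted in one statement
  elsif rows.length == 1
    0                                # a single offending row is skipped
  else
    mid = rows.length / 2            # failure: split the batch and recurse
    bulk_insert_count(rows[0...mid], constraint) +
      bulk_insert_count(rows[mid..-1], constraint)
  end
end
```

For instance, with rows `[1, 2, -3, 4]` and a "must be positive" constraint, three rows are inserted and only the offending `-3` is dropped.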

Parameters:

  • tbl (Symbol, String)

    the name of the table into which you wish to insert your data. If provided as a Symbol, the name will be escaped, otherwise it will be inserted into the query as-is (and may $DEITY have mercy on your soul).

    If the symbol name has a double underscore (__) in it, the part to the left of the double-underscore will be taken as a schema name, and the part to the right will be taken as the table name.

  • columns (Array<Symbol, String>)

    the columns into which each record of data will be inserted. Any element of the array which is a symbol will be run through Sequel::Database#literal to escape it into a "safe" form; elements which are Strings are inserted as-is, and you're responsible for any escaping which may be required.

  • rows (Array<Array<Object>>)

    the values to insert. Each entry in the outermost array is a row of data; the elements of each of these inner arrays corresponds to the column in the same position in the columns array.

    Due to the way the bulk insert query is constructed, some of PostgreSQL's default casting behaviours don't work so well, particularly around dates and times. If you find that you're getting errors of the form column "foo" is of type <something> but expression is of type text, you'll need to explicitly cast the field that is having problems, replacing value in the array with something like Sequel.cast(value, "timestamp without time zone").

    NOTE: We don't do any checking to make sure you're providing the correct number of elements for each row. Thus, if you give a row array that has too few, or too many, entries, the database will puke.
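The double-underscore schema convention for `tbl` amounts to a single split (the helper name `split_table_name` here is hypothetical, shown only to make the convention concrete):

```ruby
# Sketch of the schema/table convention described for `tbl` above:
# the part before a double underscore is the schema, the part after
# is the table; with no double underscore, there is no schema part.
def split_table_name(tbl)
  schema, table = tbl.to_s.split("__", 2)
  table ? [schema, table] : [nil, schema]
end
```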

Yields:

  • (Integer)

    Once the insert has completed, the number of rows that were successfully inserted (that may be less than rows.length if there were any constraint failures) will be yielded to the block.

Since:

  • 2.0.0



# File 'lib/em-pg-client-helper/transaction.rb', line 259

def bulk_insert(tbl, columns, rows, &blk)
	columns.map!(&:to_sym)

	if rows.empty?
		return EM::Completion.new.tap do |df|
			df.callback(&blk) if blk
			df.succeed(0)
		end
	end

	EM::Completion.new.tap do |df|
		@dg.add(df)
		df.callback(&blk) if blk

		unique_index_columns_for_table(tbl) do |indexes|
			q = if indexes.empty?
				sequel_sql do |db|
					db[tbl.to_sym].multi_insert_sql(columns, rows)
				end
			else
				# Guh hand-hacked SQL is fugly... but what I'm doing is so utterly
				# niche that Sequel doesn't support it.
				q_tbl = usdb.literal(tbl)
				q_cols = columns.map { |c| usdb.literal(c) }

				# If there are any unique indexes which the set of columns to
				# be inserted into *doesn't* completely cover, we need to error
				# out, because that will cause no rows (or, at most, one row) to
				# be successfully inserted.  In *theory*, a unique index with all-but-one
				# row inserted *could* work, but that would only work if every value
				# inserted was different, but quite frankly, I think that's a wicked
				# corner case I'm not going to go *anywhere* near.
				#
				unless indexes.all? { |i| (i - columns).empty? }
					ex = ArgumentError.new("Unique index columns not covered by data columns")
					if @autorollback_on_error
						df.fail(ex)
						rollback(ex)
					else
						df.fail(ex)
					end
				else
					subselect_where = indexes.map do |idx|
						"(" + idx.map do |c|
							"src.#{usdb.literal(c)}=dst.#{usdb.literal(c)}"
						end.join(" AND ") + ")"
					end.join(" OR ")

					subselect = "SELECT 1 FROM #{q_tbl} AS dst WHERE #{subselect_where}"

					vals = rows.map do |row|
						"(" + row.map { |v| usdb.literal(v) }.join(", ") + ")"
					end.join(", ")

					"INSERT INTO #{q_tbl} (#{q_cols.join(", ")}) " +
					  "(SELECT * FROM (VALUES #{vals}) " +
					  "AS src (#{q_cols.join(", ")}) WHERE NOT EXISTS (#{subselect}))"
				end
			end

			exec(q) do |res|
				df.succeed(res.cmd_tuples)
			end.errback do |ex|
				df.fail(ex)
			end
		end.errback do |ex|
			df.fail(ex)
		end
	end
end

#commitObject

Signal the database to commit this transaction. You must do this once you've completed your queries; it won't be called automatically for you. Once you've committed the transaction, you cannot use it again; if you execute a query against a committed transaction, an exception will be raised.
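The commit-then-closed lifecycle can be illustrated with a minimal stand-in class (no EventMachine, no real connection; only `ClosedError` and the method names mirror the documentation):

```ruby
# Minimal stand-in for the lifecycle rule above: once commit has been
# called, any further exec raises ClosedError.
class FakeTransaction
  class ClosedError < StandardError; end

  def initialize
    @finished = false
    @log = []
  end

  def exec(sql)
    if @finished
      raise ClosedError,
            "Cannot execute a query in a transaction that has been closed"
    end
    @log << sql
  end

  def commit
    @finished = true
  end

  attr_reader :log
end
```

After `txn.commit`, a further `txn.exec("...")` raises `FakeTransaction::ClosedError`, matching the behaviour described above.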



# File 'lib/em-pg-client-helper/transaction.rb', line 92

def commit
	unless @finished
		trace_query("COMMIT")
		@conn.exec_defer("COMMIT", []).tap do |df|
			@dg.add(df)
		end.callback do
			@finished = true
			@dg.close
			yield if block_given?
		end.errback do
			@finished = true
			@dg.close
		end
	end
end

#exec(sql, values = [], &blk) ⇒ EM::Deferrable Also known as: exec_defer

Execute an arbitrary SQL statement, given in sql, within the transaction. If you need to pass dynamic values to the query, those should be given in values, and referenced in sql as $1, $2, etc. The given block will be called if and when the query completes successfully.

Returns:

  • (EM::Deferrable)

    A deferrable that will be completed when this specific query finishes.
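The pairing between `$n` placeholders and the values array looks like this in practice; the `RecordingConn` stub below just records what would be sent, to show the shape of the call without a database:

```ruby
# Stub connection that records (sql, values) pairs, to illustrate how
# $1, $2, ... placeholders line up with the values array.
class RecordingConn
  attr_reader :calls

  def initialize
    @calls = []
  end

  def exec(sql, values = [])
    @calls << [sql, values]
  end
end

conn = RecordingConn.new
conn.exec("SELECT * FROM users WHERE name = $1 AND age > $2", ["Fred", 21])
```

Passing values this way (rather than interpolating them into the SQL string) is what lets the driver quote them safely.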



# File 'lib/em-pg-client-helper/transaction.rb', line 355

def exec(sql, values=[], &blk)
	trace_query(sql, values)

	if @finished
		raise ClosedError,
		      "Cannot execute a query in a transaction that has been closed"
	end

	@conn.exec_defer(sql, values).tap do |df|
		@dg.add(df)
		df.callback(&blk) if blk
	end.errback do |ex|
		rollback(ex) if @autorollback_on_error
	end
end

#insert(tbl, params, &blk) ⇒ Object

Insert a row of data into the database table tbl, using the keys from the params hash as the field names, and the values from the params hash as the field data. Once the query has completed, blk will be called to allow the transaction to continue.
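The SQL this produces has the shape sketched below. Note this `naive_insert_sql` is a simplified illustration, not the gem's insert_sql helper (which escapes identifiers via Sequel); it only shows how keys become columns and values become `$n` placeholders:

```ruby
# Simplified sketch of the insert statement shape: hash keys become
# column names, hash values become $n placeholders plus a values array.
# No identifier escaping is done here, unlike the real helper.
def naive_insert_sql(tbl, params)
  cols = params.keys.join(", ")
  placeholders = (1..params.size).map { |i| "$#{i}" }.join(", ")
  ["INSERT INTO #{tbl} (#{cols}) VALUES (#{placeholders})", params.values]
end
```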



# File 'lib/em-pg-client-helper/transaction.rb', line 205

def insert(tbl, params, &blk)
	exec(*insert_sql(tbl, params), &blk)
end

#rollback(ex) ⇒ Object

Signal the database to abort this transaction. You only need to call this method if you need to rollback for some business logic reason -- a rollback will be automatically performed for you in the event of a database error or other exception.



# File 'lib/em-pg-client-helper/transaction.rb', line 113

def rollback(ex)
	unless @finished
		if @savepoint_stack.empty?
			exec("ROLLBACK") do
				@finished = true
				@dg.fail(ex)
				@dg.close
			end
		else
			sp = @savepoint_stack.pop
			exec("ROLLBACK TO \"#{sp[:savepoint]}\"")
			sp[:deferrable].fail(ex)
			@dg.close
			@dg = sp[:parent_deferrable_group]
		end
	end
end

#savepoint(&blk) ⇒ EM::Deferrable

Set up a "savepoint" within the transaction.

A savepoint is, as the name suggests, kinda like a "saved game" in an SQL transaction. If a query fails within a transaction, normally all you can do is rollback and abort the entire transaction. Savepoints give you another option: roll back to the savepoint, and try again.

So, that's what this method does. Inside of the block passed to #savepoint, if any query fails, instead of rolling back the entire transaction, we instead only rollback to the savepoint, and execution continues by executing the errback callbacks defined on the savepoint deferrable.
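The "saved game" semantics can be modelled with a plain array and a stack of marks (a pure-Ruby model; the real implementation issues SAVEPOINT / ROLLBACK TO statements against PostgreSQL):

```ruby
# Toy model of savepoint semantics: work done after a savepoint can be
# discarded without losing work done before it.
class SavepointModel
  def initialize
    @state = []
    @marks = []
  end

  def write(row)
    @state << row
  end

  def savepoint
    @marks << @state.length          # remember how much work existed
  end

  def rollback_to_savepoint
    @state = @state[0...@marks.pop]  # discard everything after the mark
  end

  attr_reader :state
end
```

Writing `:a`, taking a savepoint, writing `:b`, then rolling back to the savepoint leaves only `:a` in place.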

Returns:

  • (EM::Deferrable)

Since:

  • 2.0.0



# File 'lib/em-pg-client-helper/transaction.rb', line 166

def savepoint(&blk)
	savepoint = SecureRandom.uuid
	parent_dg = @dg
	DeferrableGroup.new do |dg|
		@dg = dg

		dg.callback do
			@dg = parent_dg
			@dg.close
		end

		exec("SAVEPOINT \"#{savepoint}\"").tap do |df|
			@savepoint_stack << { :savepoint => savepoint,
			                      :deferrable => df,
			                      :parent_deferrable_group => parent_dg
			                    }

			df.callback(&blk) if blk
		end
	end
end

#sequel(&blk) ⇒ EM::Deferrable

Generate SQL statements via Sequel, and run the result against the database. Very chic.

Returns:

  • (EM::Deferrable)

See Also:

  • PG::EM::Client::Helper#sequel_sql


# File 'lib/em-pg-client-helper/transaction.rb', line 196

def sequel(&blk)
	exec(sequel_sql(&blk))
end

#upsert(*args) {|PG::Result| ... } ⇒ EM::Deferrable

Run an upsert inside a transaction.

Yields:

  • (PG::Result)

    the row of data that has been inserted/updated.

Returns:

  • (EM::Deferrable)

See Also:

  • PG::EM::Client::Helper#db_upsert for all the parameters.


# File 'lib/em-pg-client-helper/transaction.rb', line 338

def upsert(*args, &blk)
	db_upsert(@conn, *args).tap do |df|
		df.callback(&blk) if block_given?
	end.errback do |ex|
		rollback(ex)
	end
end