Aleksey Yeschenko

Warning: as of Cassandra 2.0.0, the&nbsp;<code>ITrigger</code>&nbsp;interface and the rest of the triggers implementation are&nbsp;not&nbsp;final - and will change in 2.1. Please be aware of this before using triggers in production until at least Cassandra 2.1.

<h2>Overview</h2>

New Cassandra 2.0 prototype&nbsp;<a href="https://issues.apache.org/jira/browse/CASSANDRA-1311">triggers</a>&nbsp;rely on&nbsp;<a href="https://issues.apache.org/jira/browse/CASSANDRA-4285">logged</a>&nbsp;<a href="https://www.datastax.com/dev/blog/atomic-batches-in-cassandra-1-2">batches</a>, originally added in Cassandra 1.2, to implement a flexible, atomic, eventually consistent mechanism for reacting to - and augmenting - write operations.

Cassandra triggers have&nbsp;instead of the event&nbsp;activation time and&nbsp;partition-level&nbsp;granularity. A coordinator node executes triggers before actually applying the mutations (locally or on the remote nodes), giving you the ability to alter the mutations-to-be, augment them with extra mutations, or execute any arbitrary code, really *. The coordinator takes the original mutations (potentially modified by the trigger), adds the extra mutations created by the trigger, and applies them together as one single logged batch, guarantying atomicity and eventual consistency.

It follows that triggers on counter tables are generally not supported (counter mutations are not allowed inside logged batches for obvious reasons - they aren't idempotent).

There are multiple potential use cases for Cassandra triggers:

<ul>
	<li>extra input validation - enforcing constraints beyond the data type validation performed by Cassandra</li>
	<li>replicating or migrating modifications from one table or keyspace to another</li>
	<li>incrementally updating a materialised view derived from one or more tables</li>
	<li>logging any mutations that meet particular conditions</li>
	<li>implementing alerts/notifications</li>
	<li>performing any other application-specific logic</li>
</ul>

Credit for the&nbsp;<a href="https://issues.apache.org/jira/browse/CASSANDRA-1311">implementation</a>&nbsp;goes to&nbsp;<a href="https://twitter.com/vijay2win">Vijay Parthasarathy</a>.

<h2>Implementing a Trigger</h2>

The current (as of C* 2.0.0)&nbsp;<code>ITrigger</code>&nbsp;<a href="https://github.com/apache/cassandra/blob/cassandra-2.0/src/java/org/apache/cassandra/triggers/ITrigger.java">interface</a>&nbsp;itself is extremely simple:

<pre>
<code>public interface ITrigger
{ 
	/**
	 * Called exactly once per CF update, returned mutations are atomically updated.
	 *
	 * @param key - Row Key for the update.
	 * @param update - Update received for the CF
	 * @return modifications to be applied, null if no action to be performed.
	 */
	public Collection&lt;RowMutation&gt; augment(ByteBuffer key, ColumnFamily update);
}
</code></pre>

It does (currently) expose some internal classes that should be explained:

<ul>
	<li><code>RowMutation</code>&nbsp;represents changes to one or more tables so that 1) all the tables belong to the same keyspace, and 2) all the changes have the same partition key. These changes are grouped into&nbsp;<code>ColumnFamily</code>&nbsp;objects (<a href="https://github.com/apache/cassandra/blob/cassandra-2.0/src/java/org/apache/cassandra/db/RowMutation.java">source</a>).</li>
	<li><code>ColumnFamily</code>&nbsp;here shall contain the cells to be inserted and/or removed from their respective tables - one&nbsp;<code>ColumnFamily</code>&nbsp;of changes per table (<a href="https://github.com/apache/cassandra/blob/cassandra-2.0/src/java/org/apache/cassandra/db/ColumnFamily.java">source</a>).</li>
</ul>

The&nbsp;<code>ColumnFamily</code>&nbsp;object passed to the&nbsp;<code>augment</code>&nbsp;method is mutable, thus it's technically possible to interfere and alter the original mutation. It's also possible to create additional mutations for any table in any keyspace that will be performed together with the original changes as a single logged batch.

See the simplistic inverted index&nbsp;<a href="https://github.com/apache/cassandra/blob/cassandra-2.0/examples/triggers/src/org/apache/cassandra/triggers/InvertedIndex.java">implementation</a>&nbsp;for the augmented mutations example.

<h2>Operations</h2>

To create a trigger, you must first build a jar with a class implementing the&nbsp;<code>ITrigger</code>&nbsp;interface and put it into the triggers directory on every node, then perform a CQL3&nbsp;<code>CREATE TRIGGER</code>&nbsp;request to tie your trigger to a Cassandra table (or several tables).

<code>conf/triggers</code>&nbsp;is the default location for the trigger jars, but it can be redefined by setting the&nbsp;<code>cassandra.triggers_dir</code>&nbsp;system property.

To add the trigger to a table, run

<pre>
<code>CREATE TRIGGER &lt;name&gt; ON [&lt;keyspace&gt;.]&lt;table&gt; USING '&lt;class&gt;'
</code></pre>

to remove one, use

<pre>
<code>DROP TRIGGER &lt;name&gt; ON [&lt;keyspace&gt;.]&lt;table&gt;
</code></pre>

<h2>Future Work</h2>

The current implementation is experimental, and there is some work to do before triggers in Cassandra can be declared final and production-ready.&nbsp;<code>CREATE TRIGGER</code>&nbsp;should support&nbsp;<a href="https://issues.apache.org/jira/browse/CASSANDRA-5962">parametrisation</a>, so that triggers could be reused between different tables and configured without a need for external configuration files. It would be nice to be able to define triggers in CQL3 in addition to pure Java. And an API that doesn't reveal the internals (<code>RowMutation</code>&nbsp;and&nbsp;<code>ColumnFamily</code>&nbsp;classes) would be preferable to the current one.

That said, please do experiment with the current implementation and share your feedback - it will affect the final trigger design.

* while we do use a separate class loader for trigger classes, we don't sandbox the execution of triggers in any way. Be extra careful with the code that goes in&nbsp;<code>augment</code>&nbsp;- it can negatively affect the whole node.

What’s New in Cassandra 2.0: Prototype Triggers Support

Aleksey Yeschenko

Share

Share

Overview

Implementing a Trigger

Operations

Future Work

More Technology

Knowledge Graphs for RAG without a GraphDB

How Winweb Built its AI Assistant with DataStax Astra DB and LangChain

Vercel + Astra DB: Get Data into Your GenAI Apps Fast

Simplifying Agent Development with Astra DB Connector for Vertex AI Search

One-stop Data API for Production GenAI