Tyler Hobbs

<p dir="ltr">We are excited to announce the public release of version 2.0 of the DataStax Python driver for Apache Cassandra and DataStax Enterprise. The driver has several new features in this version:</p>

<ul>
	<li dir="ltr">Full Apache Cassandra 2.0 and DataStax Enterprise 4.0 support
	<ul>
		<li>Automatic paging of large result sets</li>
		<li>Protocol-level statement batching</li>
		<li>Lightweight transactions</li>
		<li><a href="http://datastax.github.io/python-driver/security.html#authentication">SASL-based authentication</a></li>
	</ul>
	</li>
	<li>Enhanced stability and many minor improvements</li>
	<li>Python 3 support</li>
</ul>

<p dir="ltr">The&nbsp;<a href="http://datastax.github.io/python-driver/upgrading.html#upgrading-to-2-0-from-1-x">Upgrade Guide</a>&nbsp;contains details about new features and other changes. I'll give some examples of how to use the new features here.</p>

<h2>Automatic pagination of results</h2>

<p>If a query yields a very large number of results, only an initial page of rows is fetched (the page size is controlled by the fetch size). The remaining rows are fetched on demand as you iterate through the result set.</p>

<pre>
from cassandra.cluster import Cluster

cluster = Cluster()
session = cluster.connect("mykeyspace")

# set a page size of 1000 rows
session.default_fetch_size = 1000
user_rows = session.execute("SELECT * FROM users")
for user_row in user_rows:
    # when the initial page is exhausted, the next page will
    # be transparently fetched
    process_user(user_row.id, user_row.name, user_row.email)
</pre>

<p>For more details, see the documentation for&nbsp;<a href="http://datastax.github.io/python-driver/query_paging.html">query paging</a>.</p>
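<p>To make the on-demand behavior concrete, here is a minimal, driver-free sketch of lazy paging. It is not the driver's implementation: <code>FakeServer</code>, <code>fetch_page</code>, and <code>iter_paged</code> are hypothetical names used only for illustration, and a simple offset stands in for the server's paging state.</p>

```python
class FakeServer:
    """Hypothetical stand-in for a Cassandra node; counts round trips."""

    def __init__(self, rows):
        self.rows = list(rows)
        self.requests = 0

    def fetch_page(self, offset, fetch_size):
        # One simulated round trip: return a page of rows plus the offset
        # of the next page, or None when the result set is exhausted.
        self.requests += 1
        end = offset + fetch_size
        page = self.rows[offset:end]
        next_offset = end if len(self.rows) > end else None
        return page, next_offset


def iter_paged(server, fetch_size):
    """Yield rows one at a time, fetching a new page only when needed."""
    offset = 0
    while offset is not None:
        page, offset = server.fetch_page(offset, fetch_size)
        for row in page:
            yield row


server = FakeServer(range(2500))
rows = iter_paged(server, fetch_size=1000)

first = next(rows)                      # triggers only the first fetch
requests_after_first = server.requests
remaining = list(rows)                  # later pages are fetched on demand
```

<p>In this sketch, reading the first row costs a single round trip, and the remaining pages are requested only as iteration reaches them.</p>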

<h2>Lightweight Transactions</h2>

<p>Using Cassandra 2.0's&nbsp;<a href="https://www.datastax.com/dev/blog/lightweight-transactions-in-cassandra-2-0">lightweight transactions</a>&nbsp;is simple:</p>

<pre>
from cassandra import ConsistencyLevel

create_user_statement = session.prepare(
    "INSERT INTO users (username, email) VALUES (?, ?) IF NOT EXISTS")
create_user_statement.serial_consistency_level = ConsistencyLevel.SERIAL

session.execute(create_user_statement, [new_username, new_email])</pre>
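<p>The semantics of <code>IF NOT EXISTS</code> can be illustrated with a small in-memory model. This is only a sketch of the observable behavior (the write is applied only when no row exists for the key, and the caller learns whether it was applied); it says nothing about how Cassandra coordinates this across replicas. The function name <code>insert_if_not_exists</code> and the sample data are hypothetical.</p>

```python
def insert_if_not_exists(table, key, row):
    """Model a conditional insert on a dict-backed 'table'.

    Returns (applied, existing_row): applied is True when the row was
    written, otherwise False along with the row that was already there.
    """
    if key in table:
        return False, table[key]
    table[key] = row
    return True, None


users = {}
first_try = insert_if_not_exists(
    users, "hendrix", {"email": "jimi@example.com"})
second_try = insert_if_not_exists(
    users, "hendrix", {"email": "other@example.com"})
```

<p>Here the second attempt is rejected and the original row is left untouched, which is the guarantee that makes this kind of statement useful for claiming unique usernames.</p>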

<h2>Batching Statements</h2>

<p>Although it has always been possible to execute statements in a&nbsp;BATCH, there was no way to do this with multiple prepared statements. With version 2.0 of the driver, you can execute multiple prepared (or unprepared) statements atomically across multiple tables within a single batch.</p>

<pre>
from datetime import datetime

from cassandra.query import BatchStatement

# prepare the statements involved in a profile update
profile_statement = session.prepare(
    "UPDATE user_profiles SET email=? WHERE key=?")
user_track_statement = session.prepare(
    "INSERT INTO user_track (key, text, date) VALUES (?, ?, ?)")

# add the prepared statements to a batch
batch = BatchStatement()
batch.add(profile_statement, [emailAddress, "hendrix"])
batch.add(user_track_statement,
          ["hendrix", "email changed", datetime.utcnow()])

# execute the batch
session.execute(batch)</pre>

<p>Note that while version 2.0 supports the new Cassandra 2.0 features, this version of the driver works with Apache Cassandra 1.2 and 2.0, and with DataStax Enterprise 4.0, 3.2, and 3.1. When working with Cassandra 1.2 and DSE 3.x, you should explicitly set the protocol version to 1:</p>

<pre>
cluster = Cluster(["127.0.0.1"], protocol_version=1)
</pre>


<p>When using protocol version 1, lightweight transactions, automatic paging, and protocol-level batches are not available.</p>
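<p>If your code has to run against both old and new clusters, the rule above can be captured in a small helper. This is a sketch under stated assumptions: <code>protocol_version_for</code> and <code>available_features</code> are hypothetical helpers, not part of the driver, and the version string is assumed to look like <code>"1.2.15"</code> or <code>"2.0.6"</code>.</p>

```python
# Features that, per the text above, are unavailable with protocol version 1.
FEATURES_REQUIRING_V2 = (
    "lightweight transactions",
    "automatic paging",
    "protocol-level batches",
)


def protocol_version_for(cassandra_version):
    """Pick a native protocol version from a Cassandra version string."""
    major, minor = (int(part) for part in cassandra_version.split(".")[:2])
    return 2 if (major, minor) >= (2, 0) else 1


def available_features(protocol_version):
    """Return the Cassandra 2.0 features usable at this protocol version."""
    return FEATURES_REQUIRING_V2 if protocol_version >= 2 else ()


# Example use with the driver (not executed here):
#   cluster = Cluster(["127.0.0.1"],
#                     protocol_version=protocol_version_for("1.2.15"))
```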

<p>Version 2.0 of the driver is available on&nbsp;<a href="https://pypi.python.org/pypi/cassandra-driver">PyPI</a>, and of course, you can always find the source on&nbsp;<a href="https://github.com/datastax/python-driver">GitHub</a>. Give it a test run and let us know what you think!</p>


DataStax Python Driver 2.0 Released
