Andrew Tolbert


<p>Cassandra 3.0.4 and 3.4 introduces&nbsp;<code>sstabledump</code>, a new utility for exploring&nbsp;<a href="https://docs.datastax.com/en/cassandra/3.0/share/glossary/gloss_sstable.html" target="_blank">SSTables</a>.&nbsp;<code>sstabledump</code>&nbsp;is the spiritual successor to and a replacement for&nbsp;<a href="https://docs.datastax.com/en/cassandra/1.2/cassandra/tools/toolsSStable2json_t.html" target="_blank"><code>sstable2json</code></a>.&nbsp;<code>sstable2json</code>&nbsp;was removed from Cassandra in version 3.0, but examining SSTable data is still a useful diagnostic tool.&nbsp;<code>sstabledump</code>&nbsp;can export SSTable content to the human readable JSON format.</p>

<p>How SSTable data is stored on disk has changed in Cassandra 3.0, as previously covered in ‘<a href="https://www.datastax.com/blog/2015/12/putting-some-structure-storage-engine" target="_blank">Putting some structure in the storage engine</a>’. Previously, SSTables were composed of partition keys and their cells; now SSTables are composed of partitions and their rows.</p>

<p>This eliminates quite a bit of overhead present in prior versions of Cassandra. Metadata such as clustering key values, timestamps and TTLs are now defined at the row level, rather than repeated for each individual cell within a row. This new layout now matches how data is represented in CQL, and is more understandable.</p>

<p>A nice enhancement of&nbsp;<code>sstabledump</code>&nbsp;over&nbsp;<code>sstable2json</code>&nbsp;is that the utility can be run in ‘client mode’, so the system data does not have to be read to determine schema.&nbsp;<code>sstabledump</code>&nbsp;can be executed outside of the Cassandra environment, and cassandra.yaml is not required in the classpath for the tool to work.</p>

<p>Note that&nbsp;<code>sstabledump</code>&nbsp;only supports Cassandra 3.X SSTables.</p>

<h2>Visualizing the Storage Engine changes in 3.0</h2>

<p>To demonstrate&nbsp;<code>sstabledump</code>&nbsp;and the changes in SSTable layout in 3.0, we’ll use&nbsp;<code>sstable2json</code>&nbsp;and&nbsp;<code>sstabledump</code>&nbsp;to contrast the SSTables created by a Cassandra 2.2 node and those created by a Cassandra 3.0 node.</p>

<p>First, let’s generate a small SSTable for a table that represents stock ticker data. This should be done within a cqlsh session on each Cassandra cluster:</p>

<pre>
<code>
-- Create the schema


CREATE KEYSPACE IF NOT EXISTS ticker WITH REPLICATION = { 'class' : 'SimpleStrategy', 'replication_factor' : 1 };

USE ticker;


CREATE TABLE IF NOT EXISTS symbol_history (

&nbsp;&nbsp;symbol&nbsp;&nbsp;&nbsp; text,

&nbsp;&nbsp;year&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; int,

&nbsp;&nbsp;month&nbsp;&nbsp;&nbsp;&nbsp; int,

&nbsp;&nbsp;day&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; int,

&nbsp;&nbsp;volume&nbsp;&nbsp;&nbsp; bigint,

&nbsp;&nbsp;close&nbsp;&nbsp;&nbsp;&nbsp; double,

&nbsp;&nbsp;open&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; double,

&nbsp;&nbsp;low&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; double,

&nbsp;&nbsp;high&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; double,

&nbsp;&nbsp;idx&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; text static,

&nbsp;&nbsp;PRIMARY KEY ((symbol, year), month, day)

) with CLUSTERING ORDER BY (month desc, day desc);


-- Insert some records


INSERT INTO symbol_history (symbol, year, month, day, volume, close, open, low, high, idx)

VALUES ('CORP', 2015, 12, 31, 1054342, 9.33, 9.55, 9.21, 9.57, 'NYSE') USING TTL 604800;


INSERT INTO symbol_history (symbol, year, month, day, volume, close, open, low, high, idx)

VALUES ('CORP', 2016, 1, 1, 1055334, 8.2, 9.33, 8.02, 9.35, 'NASDAQ') USING TTL 604800;


INSERT INTO symbol_history (symbol, year, month, day, volume, close, open, low, high)

VALUES ('CORP', 2016, 1, 4, 1054342, 8.54, 8.2, 8.2, 8.65) USING TTL 604800;


INSERT INTO symbol_history (symbol, year, month, day, volume, close, open, low, high)

VALUES ('CORP', 2016, 1, 5, 1054772, 8.73, 8.54, 8.44, 8.75) USING TTL 604800;


-- Update a column value


UPDATE symbol_history USING TTL 604800 set close = 8.55 where symbol = 'CORP' and year = 2016 and month = 1 and day = 4;
</code>
</pre>

<p>Next, let’s flush memtables to disk as SSTables using&nbsp;<code>nodetool</code>:</p>

<pre>
<code>
$ bin/nodetool flush
</code>
</pre>

<p>Then in a cqlsh session we will set a column value to null and delete an entire row to generate some tombstones:</p>

<pre>
<code>
-- Set column value to null

USE ticker;

UPDATE symbol_history SET high = null WHERE symbol = 'CORP' and year = 2016 and month = 1 and day = 1;


-- Delete an entire row DELETE FROM symbol_history WHERE symbol = 'CORP' and year = 2016 and month = 1 and day = 5;
</code>
</pre>

<p>We proceed to flush again to generate a new SSTable, and then perform a major compaction yielding a single SSTable.</p>

<pre>
<code>
$ bin/nodetool flush; bin/nodetool compact ticker
</code>
</pre>

<p>Now that we have a single SSTable representing operations on our CQL table we can use the appropriate tool to examine its contents.</p>

<h2>C* 2.2 sstable2json Output</h2>

<pre>
<code>
$ tools/bin/sstable2json data/data/ticker/symbol_history-d7197900e5aa11e590210b5b92b49507/la-3-big-Data.db
</code>
</pre>

<pre>
<code>
[

{"key": "CORP:2016",

&nbsp;"cells": [["::idx","NASDAQ",1457495762169139,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:5:_","1:5:!",1457495781073797,"t",1457495781],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:","",1457495762172733,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:close","8.55",1457495767496569,"e",604800,1458100567],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:high","8.65",1457495762172733,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:low","8.2",1457495762172733,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:open","8.2",1457495762172733,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:4:volume","1054342",1457495762172733,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:","",1457495762169139,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:close","8.2",1457495762169139,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:high",1457495780,1457495780541716,"d"],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:low","8.02",1457495762169139,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:open","9.33",1457495762169139,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["1:1:volume","1055334",1457495762169139,"e",604800,1458100562]]},

{"key": "CORP:2015",

&nbsp;"cells": [["::idx","NYSE",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:","",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:close","9.33",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:high","9.57",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:low","9.21",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:open","9.55",1457495762164052,"e",604800,1458100562],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;["12:31:volume","1054342",1457495762164052,"e",604800,1458100562]]}

]
</code>
</pre>

<p>As previously stated, the sstable2json output demonstrates that the storage engine prior to Cassandra 2.2 represents partition keys and their cells.</p>

<p>A large portion of the presented data in cells is redundant. For example, when we executed INSERT queries, each cell representing a column value shares the same timestamp and TTL. Additionally, each cell contains not only the full name of the column, but also the values of the clustering keys that cell belongs to. This overhead contributes a large portion to the size of the SSTable.</p>

<h2>C* 3.0 sstabledump output</h2>

<pre>
<code>
$ tools/bin/sstabledump data/data/ticker/symbol_history-6d6bfc70e5ab11e5aeae7b4a82a62e48/ma-3-big-Data.db
</code>
</pre>

<pre>
<code>
[

&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;"partition" : {

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"key" : [ "CORP", "2016" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 0

&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;"rows" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "static_block",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 48,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"cells" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "idx", "value" : "NASDAQ", "tstamp" : 1457484225583260, "ttl" : 604800, "expires_at" : 1458089025, "expired" : false }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "row",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 48,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"clustering" : [ "1", "5" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"deletion_info" : { "deletion_time" : 1457484273784615, "tstamp" : 1457484273 }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "row",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 66,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"clustering" : [ "1", "4" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"liveness_info" : { "tstamp" : 1457484225586933, "ttl" : 604800, "expires_at" : 1458089025, "expired" : false },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"cells" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "close", "value" : "8.54" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "high", "value" : "8.65" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "low", "value" : "8.2" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "open", "value" : "8.2" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "volume", "value" : "1054342" }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "row",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 131,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"clustering" : [ "1", "1" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"liveness_info" : { "tstamp" : 1457484225583260, "ttl" : 604800, "expires_at" : 1458089025, "expired" : false },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"cells" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "close", "value" : "8.2" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "high", "deletion_time" : 1457484267, "tstamp" : 1457484267368678 },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "low", "value" : "8.02" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "open", "value" : "9.33" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "volume", "value" : "1055334" }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}

&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;},

&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;"partition" : {

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"key" : [ "CORP", "2015" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 194

&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;"rows" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "static_block",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 239,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"cells" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "idx", "value" : "NYSE", "tstamp" : 1457484225578370, "ttl" : 604800, "expires_at" : 1458089025, "expired" : false }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;},

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"type" : "row",

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"position" : 239,

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"clustering" : [ "12", "31" ],

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"liveness_info" : { "tstamp" : 1457484225578370, "ttl" : 604800, "expires_at" : 1458089025, "expired" : false },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;"cells" : [

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "close", "value" : "9.33" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "high", "value" : "9.57" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "low", "value" : "9.21" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "open", "value" : "9.55" },

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;{ "name" : "volume", "value" : "1054342" }

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;}

&nbsp;&nbsp;&nbsp;&nbsp;]

&nbsp;&nbsp;}

]
</code>
</pre>

<p>As a consequence of the new tool's verbose output, the output payload is less compact than&nbsp;<code>sstable2json</code>. However, the enriched structure of the 3.0 storage engine, is displayed. What is apparent is that there is less repeated data, which leads to a dramatically reduced SSTable storage footprint.</p>

<p>Looking at the output, note that clustering, timestamp and ttl information are now presented at the row level, instead of repeating in individual cells. This change is a large factor in optimizing disk space. While column names are present in each cell, the full column names are not stored for each cell as previously. You can read more about these optimizations and others in the&nbsp;<a href="https://www.datastax.com/blog/2015/12/putting-some-structure-storage-engine" target="_blank">aforementioned blog post</a>.</p>

<h2>Internal Representation Format</h2>

<p>As previously mentioned,&nbsp;<code>sstabledump</code>’s JSON representation is more verbose than&nbsp;<code>sstable2json</code>.&nbsp;<code>sstabledump</code>&nbsp;also provides an alternative ‘debug’ output format that is more concise than its json counterpart. While initially difficult to understand, it is a more compact and convenient format for advanced users to grok the contents of an SSTable. To view data in this format, simply pass the&nbsp;<code>-d</code>&nbsp;parameter to&nbsp;<code>sstabledump</code>:</p>

<p><code>$ tools/bin/sstabledump data/data/ticker/symbol_history-6d6bfc70e5ab11e5aeae7b4a82a62e48/ma-3-big-Data.db -d</code></p>

<pre>
<code>
[CORP:2016]@0 Row[info=[ts=-9223372036854775808] ]: STATIC | [idx=NASDAQ ts=1457496014384090 ttl=604800 ldt=1458100814]

[CORP:2016]@0 Row[info=[ts=-9223372036854775808] del=deletedAt=1457496035375251, localDeletion=1457496035 ]: 1, 5 |

[CORP:2016]@66 Row[info=[ts=1457496014387922 ttl=604800, let=1458100814] ]: 1, 4 | [close=8.55 ts=1457496020899876 ttl=604800 ldt=1458100820], [high=8.65 ts=1457496014387922 ttl=604800 ldt=1458100814], [low=8.2 ts=1457496014387922 ttl=604800 ldt=1458100814], [open=8.2 ts=1457496014387922 ttl=604800 ldt=1458100814], [volume=1054342 ts=1457496014387922 ttl=604800 ldt=1458100814]

[CORP:2016]@141 Row[info=[ts=1457496014384090 ttl=604800, let=1458100814] ]: 1, 1 | [close=8.2 ts=1457496014384090 ttl=604800 ldt=1458100814], [high=<tombstone> ts=1457496034857652 ldt=1457496034], [low=8.02 ts=1457496014384090 ttl=604800 ldt=1458100814], [open=9.33 ts=1457496014384090 ttl=604800 ldt=1458100814], [volume=1055334 ts=1457496014384090 ttl=604800 ldt=1458100814]

[CORP:2015]@204 Row[info=[ts=-9223372036854775808] ]: STATIC | [idx=NYSE ts=1457496014379236 ttl=604800 ldt=1458100814]

[CORP:2015]@204 Row[info=[ts=1457496014379236 ttl=604800, let=1458100814] ]: 12, 31 | [close=9.33 ts=1457496014379236 ttl=604800 ldt=1458100814], [high=9.57 ts=1457496014379236 ttl=604800 ldt=1458100814], [low=9.21 ts=1457496014379236 ttl=604800 ldt=1458100814], [open=9.55 ts=1457496014379236 ttl=604800 ldt=1458100814], [volume=1054342 ts=1457496014379236 ttl=604800 ldt=1458100814]
</tombstone></code>
</pre>

<p>Other than the inclusion of this internal representation format, the usage between&nbsp;<code>sstabledump</code>&nbsp;and&nbsp;<code>sstable2json</code>&nbsp;is exactly the same.</p>

<h2>Additional Links</h2>

<ul>
	<li><a href="https://www.datastax.com/blog/2015/12/putting-some-structure-storage-engine" target="_blank">Putting some structure in the storage engine</a>&nbsp;- Sylvain Lebresne, DataStax</li>
	<li><a href="http://thelastpickle.com/blog/2016/03/04/introductiont-to-the-apache-cassandra-3-storage-engine.html" target="_blank">Introduction to the Apache Cassandra 3.X Storage Engine</a>&nbsp;- Aaron Morton, The Last Pickle</li>
	<li><a href="https://github.com/apache/cassandra/blob/cassandra-3.0/guide_8099.md" target="_blank">Overview of CASSANDRA-8099 changes</a></li>
	<li><a href="https://issues.apache.org/jira/browse/CASSANDRA-8099" target="_blank">CASSANDRA-8099</a>&nbsp;- Refactor and modernize the storage engine</li>
	<li><a href="https://issues.apache.org/jira/browse/CASSANDRA-7464" target="_blank">CASSANDRA-7464</a>&nbsp;- Replace sstable2json</li>
</ul>


Debugging SSTables in 3.0 with sstabledump

Andrew Tolbert

Share

Share

Visualizing the Storage Engine changes in 3.0

C* 2.2 sstable2json Output

C* 3.0 sstabledump output

Internal Representation Format

Additional Links

More Technology

Knowledge Graphs for RAG without a GraphDB

How Winweb Built its AI Assistant with DataStax Astra DB and LangChain

Vercel + Astra DB: Get Data into Your GenAI Apps Fast

Simplifying Agent Development with Astra DB Connector for Vertex AI Search

One-stop Data API for Production GenAI