Welcome to the DataStax Developer Blog. Here’s where you can find the latest insights and technical articles from top experts and fellow peers on distributed systems, data management strategies, Star Wars/Star Trek debates, and best practices for building cloud applications that are always on, built with effortless scale, and deliver instant insight.

Be sure to check out our hours of free training and downloads on DataStax Academy. For industry topics and general DataStax news, head over to our company blog.

DSE Advanced Replication in DSE 5.1

By Imran Chaudhry - July 11, 2017

DSE Advanced Replication builds on the multi-datacenter support in Apache Cassandra® to facilitate scenarios that require selective or "hub and spoke" replication. DSE Advanced Replication is specifically designed to tolerate sporadic connectivity that can occur in constrained environments, such as retail, oil-and-gas remote sites, and cruise ships. DSE Advanced Replication in DSE 5.1 improves design and performance characteristics. In this blog post we discuss design decisions that led to these DSE 5.1 changes.

Studio 2.0 Goes Multi-Model with CQL Support

By Zach Kurey - June 27, 2017

DataStax Studio 2.0 introduces support for the Apache Cassandra™ Query Language (CQL), with an intelligent CQL editor that gives you a productivity boost when working with CQL and DataStax Enterprise (DSE) 5.0+.

Spark Application Dependency Management

By Jaroslaw Grabowski - June 20, 2017

Runtime and compile-time dependency management can be a difficult task. The Spark Build Examples project is designed to help Spark beginners quickly bootstrap and extend their first Spark application with custom dependencies. It does that by providing a number of template projects for different programming languages and build tools. It also demonstrates techniques such as assembling and shading, which help overcome common dependency-related problems.

Graph Storytelling with Studio 2.0.0

By Bob Briody - June 13, 2017

One sign of great data visualization is that you can quickly and accurately interpret the provided information without having to think much about the mechanics of the visualization itself. In Studio 2.0.0 we added a few features to the Graph View that enable this type of seamless storytelling.

DSE 5.1 Resource Manager Part 2 – Process Security

By Jacek Lewandowski - June 6, 2017

DSE Resource Manager comes with a customizable implementation of the mechanism used to control the driver and executor lifecycles. In particular, we provide an alternative to the default mechanism that allows processes to be run as separate system users. Follow this blog post to learn how this impacts the security of your DSE cluster, how it can be configured, and how you can verify what it actually does. We also include a step-by-step guide that demonstrates how it works.

DSE 5.1 Resource Manager, Part 1 – Network Connections Security

By Jacek Lewandowski - May 30, 2017

DSE Resource Manager is a custom version of the Spark Standalone cluster manager. It provides the functionality of a Spark Master when running Apache Spark™ applications with DSE. Since the introduction of Spark in DSE 4.5, DSE Analytics has enhanced the open source Spark Master implementation with: automatic management of the Spark Master and Spark Worker lifecycles; Apache Cassandra®-based high availability; distributed and fault-tolerant storage of Spark Master recovery data; and pain-free configuration for client applications. In 5.1, the introduction of the DSE Resource Manager adds even more to our custom integration, providing greater ease of use, security, and stability.

From CFS to DSEFS

By Piotr Kołaczkowski - May 23, 2017

Cassandra File System (CFS) is the default distributed file system in the DataStax Enterprise platform in versions 2.0 to 5.0. Its primary purpose is to support Hadoop and Spark workloads with temporary Hadoop-compatible storage. In DSE 5.1, CFS has been deprecated and replaced with the much-improved DataStax Enterprise File System (DSEFS). DSEFS is available as an option in DSE 5.0 and is the default distributed file system in DSE 5.1.

DataStax Drivers Fluent APIs for DSE Graph are out!

By Kevin Gallardo - May 16, 2017

Following the DataStax Enterprise 5.1 release, DataStax released the first non-beta versions of the Fluent APIs for DSE Graph. This new feature brings the DataStax Enterprise Drivers into full compatibility with the Apache TinkerPop GLVs, and we included additional functionality to make developing graph applications even faster and easier.

DSE 5.1: Automatic Optimization of Spark SQL Queries Using DSE Search

By Russell Spitzer - May 9, 2017

DSE Search (Apache Solr-based) and DSE Analytics (Apache Spark-based) may seem designed for orthogonal use cases: Search optimizes for quick, generic searches over your big data, while Analytics optimizes for reading your entire dataset for processing. But there is a sweet spot where Analytics can benefit greatly from the enhanced indexing capabilities of Search. Previously in DSE, this synergy could only be accessed from the RDD API, but with DSE 5.1 we bring DSE Search together with DSE Analytics in Spark SQL and DataFrames.

© 2017 DataStax, All Rights Reserved. DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.
Apache Cassandra, Apache, Tomcat, Lucene, Solr, Hadoop, Spark, TinkerPop, and Cassandra are trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.