We are excited to be a part of AWS re:Invent 2024. Visit us at booth #1844 in Las Vegas.More info
Blog

Aerospike Document API

Ken Tune
Ken Tune
Senior Solutions Architect
June 17, 2021|3 min read

Aerospike is a high performance, distributed, scalable, key value database. Aerospike leverages SSD technology to achieve levels of throughput and low latency exceeding even those obtained with in memory products. This allows hardware costs to be reduced 10x or more and data density to be increased 10x or more versus any other high performance solution.

Aerospike has a significant number of distinguishing characteristics versus competitor products. Here we focus on the Collection Data Type (CDT) API. The CDT API facilitates list and map oriented operations within objects thereby reducing network overhead and client side computation. It is worth noting that it is highly efficient, adding little overhead to read or write calls and composable, allowing construction of complex overall operations.

The CDT API contains within it all the primitives required to build a sophisticated document API along the lines suggested by Stefan Groessner in his well known JsonPath proposal which is in turn modelled on the XPath standard for XML. For those not familiar with it, XPath supports CRUD operations via filesystem navigation like syntax. JsonPath is essentially a port of this idea to JSON.

The Aerospike Document API provides such an API, using the CDT API. Below we demonstrate the way in which CRUD operations can be executed at arbitrary points within a JSON document using the API.

Document Creation

First we need a JSON document to work with. Consider the below, expressing some subjective, for space reasons, highlights of Tommy Lee Jones career.

{
    "forenames": [
    "Tommy",
    "Lee"
    ],
    "surname": "Jones",
    "date_of_birth": {
    "day": 15,
    "month": 9,
    "year": 1946
    },
    "selected_filmography":{
    "2012":["Lincoln","Men In Black 3"],
    "2007":["No Country For Old Men"],
    "2002":["Men in Black 2"],
    "1997":["Men in Black","Volcano"],
    "1994":["Natural Born Killers","Cobb"],
    "1991":["JFK"],
    "1980":["Coal Miner's Daughter","Barn Burning"]
    },
    "imdb_rank":{
    "source":"https://www.imdb.com/list/ls050274118/",
    "rank":51
    },
    "best_films_ranked": [
    {
        "source": "http://www.rottentomatoes.com",
        "films": ["The Fugitive","No Country For Old Men","Men In Black","Coal Miner's Daughter","Lincoln"]
    },
    {
        "source":"https://medium.com/the-greatest-films-according-to-me/10-greatest-films-of-tommy-lee-jones-97426103e3d6",
        "films":["The Three Burials of Melquiades Estrada","The Homesman","No Country for Old Men","In the Valley of Elah","Coal Miner's Daughter"]
    }
    ]
}

Using the document API we add to our database.

// Create a document client via an existing aerospikeClient
AerospikeDocumentClient documentClient = new AerospikeDocumentClient(client);
// Convert JSON string to a JsonNode
JsonNode jsonNode = JsonConverters.convertStringToJsonNode(tommyLeeJonesJson);
// Construct an appropriate key
Key tommyLeeJonesDBKey = new Key(AEROSPIKE_NAMESPACE, AEROSPIKE_SET, "src/test/resources/tommy-lee-jones.json");
// Add to database using a named bin
documentClient.put(tommyLeeJonesDBKey, documentBinName, jsonNode);

Reads

We can find out the name of Jones’ best film according to ‘Rotten Tomatoes’ using the path $.best_films_ranked[0].films[0]. Hopefully the use of keys and list indexes is immediately intuitive when considered in the context of the above document.

documentClient.get(tommyLeeJonesDBKey, documentBinName,"$.best_films_ranked[0].films[0]" );

will return

"The Fugitive"

We are not limited to retrieving primitives. An expression such as

"$.selected_filmography.1980"

will retrieve a list

["Coal Miner's Daughter","Barn Burning"]

Updates

We can add to the document. The snippet below will add a 2019 element to the filmography.

List<String> _2019Films = new Vector<String>();
    _2019Films.add("Ad Astra");
    documentClient.put(tommyLeeJonesDBKey, documentBinName, "$.selected_filmography.2019",_2019Films);

We can update document nodes directly — this syntax updates Jones’ IMDB ranking.

documentClient.put(tommyLeeJonesDBKey, documentBinName, "$.imdb_rank.rank",45);

Append operations may also be used. For example, we can append to ‘Rotten Tomatoes’ list of best films using the reference $.best_films_ranked[0].films

documentClient.append(tommyLeeJonesDBKey, documentBinName, "$.best_films_ranked[0].films","Rolling Thunder");
documentClient.append(tommyLeeJonesDBKey, documentBinName, "$.best_films_ranked[0].films","The Three Burials");

Deletion

We can similarly use JsonPath like expressions to support node deletion. The line below will remove the Medium rankings from the document

documentClient.delete(tommyLeeJonesDBKey, documentBinName, "$.best_films_ranked[1]");

Availability

The library has been published to Maven Central as aerospike-document-api. Add

<dependency>
    <groupId>com.aerospike</groupId>
    <artifactId>aerospike-document-api</artifactId>
    <version>1.1.2</version>
</dependency>

to your pom.xml to use.

For full details of the API see aerospike-document-lib