CDT Indexing in Aerospike
For an interactive Jupyter notebook experience:
This tutorial describes the secondary index and query functionality pertaining to CDTs available in Aerospike Database 6.1+.
This notebook requires the Aerospike Database running locally with Java kernel and Aerospike Java Client. To create a Docker container that satisfies the requirements and holds a copy of Aerospike notebooks, visit the Aerospike Notebooks Repo.
Introduction
In this notebook, we will illustrate the CDT indexing and query capabilities in Aerospike Database 6.1+. The detailed description is available in the accompanying blog post Query JSON Documents Faster (and More) with CDT Indexing.
The specific topics covered in this notebook include:
- New CDT indexing functionality
- Code examples
- Non-collection elements
- Collection elements
- JSON documents
Prerequisites
This tutorial assumes familiarity with the following topics:
Setup
Ensure database is running
This notebook requires that Aerospike database is running.
import io.github.spencerpark.ijava.IJava;import io.github.spencerpark.jupyter.kernel.magic.common.Shell;IJava.getKernelInstance().getMagics().registerMagics(Shell.class);%sh asd
Download and install additional components.
Install the Java client version 6.1.0 or above that supports the new CDT indexing capabilities.
%%loadFromPOM<dependencies> <dependency> <groupId>com.aerospike</groupId> <artifactId>aerospike-client</artifactId> <version>6.1.0</version> </dependency> <dependency> <groupId>com.aerospike</groupId> <artifactId>aerospike-document-api</artifactId> <version>1.1.3</version> </dependency> <dependency> <groupId>org.apache.commons</groupId> <artifactId>commons-io</artifactId> <version>1.3.2</version> </dependency></dependencies>
Initialize Client
Initialize the client.
import com.aerospike.client.AerospikeClient;
AerospikeClient client = new AerospikeClient("localhost", 3000);System.out.println("Initialized the client and connected to the cluster.");;
Output:
Initialized the client and connected to the cluster.
Import Modules
// import all needed modulesimport java.util.ArrayList;import java.util.Arrays;import java.util.HashMap;import java.util.List;import java.util.Map;
import com.aerospike.client.AerospikeException;import com.aerospike.client.Key;import com.aerospike.client.Bin;import com.aerospike.client.policy.WritePolicy;import com.aerospike.client.policy.ClientPolicy;import com.aerospike.client.Record;import com.aerospike.client.Operation;import com.aerospike.client.Value;import com.aerospike.client.Value.ListValue;import com.aerospike.client.Value.MapValue;import com.aerospike.client.cdt.ListOperation;import com.aerospike.client.cdt.ListPolicy;import com.aerospike.client.cdt.ListOrder;import com.aerospike.client.cdt.ListWriteFlags;import com.aerospike.client.cdt.MapOperation;import com.aerospike.client.cdt.MapPolicy;import com.aerospike.client.cdt.MapOrder;import com.aerospike.client.cdt.MapWriteFlags;import com.aerospike.client.query.Statement;import com.aerospike.client.query.Filter;import com.aerospike.client.query.RecordSet;import com.aerospike.client.Record;import com.aerospike.client.policy.QueryPolicy;import com.aerospike.client.exp.Exp;import com.aerospike.client.query.IndexType;import com.aerospike.client.query.IndexCollectionType;import com.aerospike.client.task.IndexTask;import com.aerospike.client.AerospikeException;import com.aerospike.client.ResultCode;import com.aerospike.client.cdt.CTX;
Define Constants and Helper Functions
Define constants for the namespaces test
, set cdt-indexing
, and helper functions createIndex
, dropIndex
, executeQueryAndPrintResults
, and truncateTestData
.
String Namespace = "test";String Set = "cdt-indexing";
// convenience function to create an index - essentially a pass-through to the client APIvoid createIndex(String idxName, String binName, IndexType idxType, IndexCollectionType colType, CTX... ctx) { try { IndexTask task = client.createIndex(null, Namespace, Set, idxName, binName, idxType, colType, ctx); task.waitTillComplete(1000, 0); } catch (AerospikeException ae) { if (ae.getResultCode() != ResultCode.INDEX_ALREADY_EXISTS) { throw ae; } } System.out.format("Created index %s on ns=%s set=%s bin=%s.\n", idxName, Namespace, Set, binName);}
// convenience function to drop an index - essentially a pass-through to the client APIvoid dropIndex(String idxName) { try { IndexTask task = client.dropIndex(null, Namespace, Set, idxName); } catch (AerospikeException ae) { if (ae.getResultCode() != ResultCode.INDEX_NOTFOUND) { throw ae; } } System.out.format("Dropped index %s.\n", idxName);}
// convenience function to execute a query using the input filter and print the resultsvoid executeQueryAndPrintResults(Filter filter, String binName) { Statement stmt = new Statement(); stmt.setNamespace(Namespace); stmt.setSetName(Set); stmt.setFilter(filter); stmt.setBinNames(binName); RecordSet rs = client.query(null, stmt); while (rs.next()) { Key key = rs.getKey(); Record record = rs.getRecord(); System.out.format("key=%s bins=%s\n", key.userKey, record.bins); } //System.out.println(); rs.close();}
// convenience function to truncate test datavoid truncateTestData() { try { client.truncate(null, Namespace, null, null); } catch (AerospikeException e) { // ignore }}
Open a Terminal Tab
You can execute shell commands including Aerospike tools like aql to examine data in the terminal tab. Open a terminal tab by selecting File->Open from the notebook menu, and then New->Terminal.
We will illustrate how to create indexes on CDTs and issue queries using two CDTs - a nested List and a nested Map that are populated in 10 records each.
-
A nested List in records with user_key (id) 1-10. The list is held in bin list_bin and has the following numeric, string, and List elements.
- Numeric elements: user_key to user_key+4.
- For example, for user_key=1: 1, 2, 3, 4, 5.
- String elements: “s”+user_key to “s”+(user_key+4).
- For example, for user_key=1: “s1”, “s2”, “s3”, “s4”, “s5”
- Nested List element: holds numeric elements 10*user_key+1 to 10*user_key+5.
- For example, for user_key=1: [11, 12, 13, 44, 15].
So, the value in list_bin for the record with user_key=1 looks like:
[ 1, "s1", 2, "s2", 3, "s3", 4, "s4", 5, "s5", [ 11, 12, 13, 14, 15 ] ]
. - Numeric elements: user_key to user_key+4.
-
A nested Map in records with user_key (id) 101-110. The map is held in bin map_bin and has the following numeric, string, and List elements. Below, we have used i with a shorthand for i=user_key-100.
- “oid”: i.
- For example, for i=1: “oid”: 1.
- “obj”: a nested map with the following key-value pairs: {“attr1”: 10*i+1, “attr2”: “s”+10*i+2, “subobj”: {“attr3”: 100*i+1, “attr4”: 100*i+2}}.
- For example, for i=1: “obj”: {“attr1”: 11, “attr2”: “s12”, “subobj”: {“attr3”: 101, “attr4”: 102}}.
- i: [“s”+i, “s”+i+1].
- For example, for i=1: 1: [“s1”, “s2”].
So, the value in map_bin for the record with user_key=101 (or, equivalently, i=1) looks like:
{ "oid": 1, "obj": {"attr1": 11, "attr2": "s12", "subobj": {"attr3": 101, "attr4": 102}}, 1: ["s1", "s2"] }
- “oid”: i.
Populate Test Data
Execute the cell below to populate the test data described above.
final String ListBin = "list_bin";final String MapBin = "map_bin";WritePolicy wpolicy = new WritePolicy();wpolicy.sendKey = true;
// create the list_bin with// [i..i+4, "s"+i.."s"+4, [10*i+1..10*i+5]]
for (int user_key = 1; user_key <= 10; user_key++) { Key key = new Key(Namespace, Set, user_key);
// append integer and string elements ArrayList<Value> list_val = new ArrayList<Value>(); for (int n = user_key; n < user_key+5; n++) { list_val.add(Value.get(n)); list_val.add(Value.get("s"+Integer.toString(n))); }
// append the nested list element ArrayList<Integer> list_int = new ArrayList<Integer>(); for (int n=user_key*10+1; n < user_key*10+6; n++) { list_int.add(n); } ListValue list_int_val = new ListValue(list_int);
// create the list bin Record record = client.operate(wpolicy, key, ListOperation.clear(ListBin), ListOperation.appendItems(ListBin, list_val), ListOperation.append(ListBin, list_int_val));}
// create map bin with// "oid": i,// "obj": {"attr1": 10*i+1, "attr2": "s"+10*+i+2, "subobj": {"attr3": 100*1+1, "attr4": 100*1+2}},// i: ["s"+i, "s"+i+1]
for (int user_key = 101; user_key <= 110; user_key++) { Integer i = user_key - 100;
Key key = new Key(Namespace, Set, user_key);
// construct obj map value HashMap <String, Value> obj = new HashMap <String, Value>(); obj.put("attr1", Value.get(10*i+1)); obj.put("attr2", Value.get("s"+Integer.toString(10*i+2))); HashMap <String, Integer> subobj = new HashMap <String, Integer>(); subobj.put("attr3", 100*i+1); subobj.put("attr4", 100*i+2); obj.put("subobj", new MapValue(subobj));
// construct arr list value ArrayList<String> arr = new ArrayList<String>(); arr.add("s"+Integer.toString(i*10+1)); arr.add("s"+Integer.toString(i*10+2));
// create the map in map_bin MapPolicy mPolicy = new MapPolicy(MapOrder.UNORDERED, MapWriteFlags.DEFAULT); Record record = client.operate(wpolicy, key, MapOperation.clear(MapBin), MapOperation.put(mPolicy, MapBin, Value.get("oid"), Value.get(i)), MapOperation.put(mPolicy, MapBin, Value.get("obj"), new MapValue(obj)), MapOperation.put(mPolicy, MapBin, Value.get(i), new ListValue(arr)) );}
Examine Test Data
View the test data by executing the following commands in a Jupyter terminal tab.
aql -c "select list_bin from test.cdt-indexing"
aql -c "select map_bin from test.cdt-indexing"
We will illustrate how indexes on non-collection elements are created and used. We will start with results in mind, and then create the appropriate index, issue a query using the index, and show the results.
Equality queries
Get records with a specific integer or string value at a specific index, rank, or key position of a List or a Map.
Records with a specific value at index X of a List or Map
The top level List in list_bin has 3 at position 2 (0-indexed)
// Create an index for element at position 2 of the top List in list_bin.// idx_list_bin_top_pos_2_numcreateIndex("idx_list_bin_top_pos_2_num", ListBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.listIndex(2));
// issue the query and print resultsSystem.out.println("Records with the value 3 at position 2 of the top List in list_bin:");Filter filter = Filter.equal(ListBin, 3, CTX.listIndex(2));executeQueryAndPrintResults(filter, ListBin);
Output:
Created index idx_list_bin_top_pos_2_num on ns=test set=cdt-indexing bin=list_bin.
Records with the value 3 at position 2 of the top List in list_bin:key=2 bins={list_bin=[2, s2, 3, s3, 4, s4, 5, s5, 6, s6, [21, 22, 23, 24, 25]]}
The top level Map in map_bin has 7 at position 2
// Create an index for element at position 2 of the top Map in map_bin. Note, in the key ordered sequence,// "oid" element is in position 2 after the numeric key and "obj".// idx_map_bin_top_pos_2_numcreateIndex("idx_map_bin_top_pos_2_num", MapBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.mapIndex(2));
// issue the query and print resultsSystem.out.println("Records with the value 7 at position 2 of the top Map in map_bin:");Filter filter = Filter.equal(MapBin, 7, CTX.mapIndex(2));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_top_pos_2_num on ns=test set=cdt-indexing bin=map_bin.
Records with the value 7 at position 2 of the top Map in map_bin:key=107 bins={map_bin={obj={attr2=s72, attr1=71, subobj={attr4=702, attr3=701}}, oid=7, 7=[s71, s72]}}
Records with a specific value at rank X of a List or Map
The nested List in list_bin has 35 in rank -1 (highest value)
// Create an index for element at rank -1 of the nested List in list_bin. Note the nested List is at the// last poistion (index -1) in the top List.// idx_list_bin_nested_rnk_-1_numcreateIndex("idx_list_bin_nested_rnk_-1_num", ListBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.listIndex(-1), CTX.listRank(-1));
// issue the query and print resultsSystem.out.println("Records with the value 35 in rank -1 (highest value) in the nested List in list_bin:");Filter filter = Filter.equal(ListBin, 35, CTX.listIndex(-1), CTX.listRank(-1));executeQueryAndPrintResults(filter, ListBin);
Output:
Created index idx_list_bin_nested_rnk_-1_num on ns=test set=cdt-indexing bin=list_bin.
Records with the value 35 in rank -1 (highest value) in the nested List in list_bin:key=3 bins={list_bin=[3, s3, 4, s4, 5, s5, 6, s6, 7, s7, [31, 32, 33, 34, 35]]}
The nested “subobj” Map in map_bin has 901 in rank 0 (lowest value)
// Create an index for element at rank 0 (lowest value) in the nested "subobj" Map in map_bin.// idx_map_bin_subobj_rnk_0_numcreateIndex("idx_map_bin_subobj_rnk_0_num", MapBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("subobj")), CTX.mapRank(0));
// issue the query and print resultsSystem.out.println("Records with the value 901 in rank 0 (lowest value) in the nested \"subobj\" Map in map_bin:");Filter filter = Filter.equal(MapBin, 901, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("subobj")), CTX.mapRank(0));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_subobj_rnk_0_num on ns=test set=cdt-indexing bin=map_bin.
Records with the value 901 in rank 0 (lowest value) in the nested "subobj" Map in map_bin:key=109 bins={map_bin={9=[s91, s92], obj={attr2=s92, attr1=91, subobj={attr4=902, attr3=901}}, oid=9}}
Records with a specific value at key X of a Map
The top level Map in map_bin has 8 at key “oid”
// Create an index for element with key "oid" in the top level Map in map_bin.// idx_map_bin_top_key_oid_numcreateIndex("idx_map_bin_top_key_oid_num", MapBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("oid")));
// issue the query and print resultsSystem.out.println("Records with the value 8 at key \"oid\" in the top level Map in map_bin:");Filter filter = Filter.equal(MapBin, 8, CTX.mapKey(Value.get("oid")));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_top_key_oid_num on ns=test set=cdt-indexing bin=map_bin.
Records with the value 8 at key "oid" in the top level Map in map_bin:key=108 bins={map_bin={8=[s81, s82], obj={attr2=s82, attr1=81, subobj={attr4=802, attr3=801}}, oid=8}}
The nested “obj” Map in map_bin has “s42” at key “attr2”
// Create an index for element with key "attr2" in the nested "obj" Map in map_bin.// idx_map_bin_obj_key_attr2_strcreateIndex("idx_map_bin_obj_key_attr2_str", MapBin, IndexType.STRING, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("attr2")));
// issue the query and print resultsSystem.out.println("Records with the value \"s42\" at key \"attr2\" in the nested \"obj\" Map in map_bin:");Filter filter = Filter.equal(MapBin, "s42", CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("attr2")));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_obj_key_attr2_str on ns=test set=cdt-indexing bin=map_bin.
Records with the value "s42" at key "attr2" in the nested "obj" Map in map_bin:key=104 bins={map_bin={4=[s41, s42], obj={attr2=s42, attr1=41, subobj={attr4=402, attr3=401}}, oid=4}}
Range Queries
Get records having an integer value within a range at a specific index, rank, or key position of a List or a Map. Range queries are supported on integer values only.
We will use the indexes defined above for the range queries too.
Records with value in a specific range at index X of a List or Map
The top level List in list_bin is in range 3-5 at position 2 (0-indexed)
// issue the query and print resultsSystem.out.println("Records with value in range 3-5 at position 2 in the top level List in list_bin:");Filter filter = Filter.range(ListBin, 3, 5, CTX.listIndex(2));executeQueryAndPrintResults(filter, ListBin);
Output:
Records with value in range 3-5 at position 2 in the top level List in list_bin:
key=3 bins={list_bin=[3, s3, 4, s4, 5, s5, 6, s6, 7, s7, [31, 32, 33, 34, 35]]}key=2 bins={list_bin=[2, s2, 3, s3, 4, s4, 5, s5, 6, s6, [21, 22, 23, 24, 25]]}key=4 bins={list_bin=[4, s4, 5, s5, 6, s6, 7, s7, 8, s8, [41, 42, 43, 44, 45]]}
The top level Map in map_bin is in range 5-7 at position 2
// issue the query and print resultsSystem.out.println("Records with value in range 5-7 at position 2 of the top Map in map_bin:");Filter filter = Filter.range(MapBin, 5, 7, CTX.mapIndex(2));executeQueryAndPrintResults(filter, MapBin);
Output:
Records with value in range 5-7 at position 2 of the top Map in map_bin:
key=107 bins={map_bin={obj={attr2=s72, attr1=71, subobj={attr4=702, attr3=701}}, oid=7, 7=[s71, s72]}}key=105 bins={map_bin={5=[s51, s52], obj={attr2=s52, attr1=51, subobj={attr4=502, attr3=501}}, oid=5}}key=106 bins={map_bin={obj={attr2=s62, attr1=61, subobj={attr4=602, attr3=601}}, 6=[s61, s62], oid=6}}
Records with value in a specific range at rank X of a List or Map
- The nested List in list_bin is in range 20-50 at rank -1 (highest value)
// issue the query and print resultsSystem.out.println("Records with value in range 20-50 in rank -1 (highest value) in the nested List in list_bin:");Filter filter = Filter.range(ListBin, 20, 50, CTX.listIndex(-1), CTX.listRank(-1));executeQueryAndPrintResults(filter, ListBin);
Output:
Records with value in range 20-50 in rank -1 (highest value) in the nested List in list_bin:
key=3 bins={list_bin=[3, s3, 4, s4, 5, s5, 6, s6, 7, s7, [31, 32, 33, 34, 35]]}key=2 bins={list_bin=[2, s2, 3, s3, 4, s4, 5, s5, 6, s6, [21, 22, 23, 24, 25]]}key=4 bins={list_bin=[4, s4, 5, s5, 6, s6, 7, s7, 8, s8, [41, 42, 43, 44, 45]]}
The nested “subobj” Map in map_bin is in range 500-800 at rank 0 (lowest value)
// issue the query and print resultsSystem.out.println("Records with value in range 500-800 in rank 0 (lowest value) in the nested \"subobj\" Map in map_bin:");Filter filter = Filter.range(MapBin, 500, 800, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("subobj")), CTX.mapRank(0));executeQueryAndPrintResults(filter, MapBin);
Output:
Records with value in range 500-800 in rank 0 (lowest value) in the nested "subobj" Map in map_bin:
key=107 bins={map_bin={obj={attr2=s72, attr1=71, subobj={attr4=702, attr3=701}}, oid=7, 7=[s71, s72]}}key=105 bins={map_bin={5=[s51, s52], obj={attr2=s52, attr1=51, subobj={attr4=502, attr3=501}}, oid=5}}key=106 bins={map_bin={obj={attr2=s62, attr1=61, subobj={attr4=602, attr3=601}}, 6=[s61, s62], oid=6}}
Records with value in a specific range at key X of a Map
The top level Map in map_bin has a value in range 4-6 at key “oid”
// issue the query and print resultsSystem.out.println("Records with value in range 4-6 at key \"oid\" in the top level Map in map_bin:");Filter filter = Filter.range(MapBin, 4, 6, CTX.mapKey(Value.get("oid")));executeQueryAndPrintResults(filter, MapBin);
Output:
Records with value in range 4-6 at key "oid" in the top level Map in map_bin:
key=105 bins={map_bin={5=[s51, s52], obj={attr2=s52, attr1=51, subobj={attr4=502, attr3=501}}, oid=5}}key=104 bins={map_bin={4=[s41, s42], obj={attr2=s42, attr1=41, subobj={attr4=402, attr3=401}}, oid=4}}key=106 bins={map_bin={obj={attr2=s62, attr1=61, subobj={attr4=602, attr3=601}}, 6=[s61, s62], oid=6}}
Collection indexes are defined on List and Map elements with the collection type LIST (list values in the List), MAPKEYS (key values in the Map), or MAPVALUES (values in the Map). The indexes can be used for equality queries on integr and string elements, as well as range queries on integer elements. We will illustrate these variations below.
Equality Queries
Get records with a specific integer or string value in a List or Map.
Records with a List or Map containing a specific value.
The nested List in map_bin contains “s91”.
// Create an index for the nested List in map_bin. Note, in key-ordered sequence, the numeric key is the first one.// idx_map_bin_i_list_strcreateIndex("idx_map_bin_i_list_str", MapBin, IndexType.STRING, IndexCollectionType.LIST, CTX.mapIndex(0));
// issue the query and print resultsSystem.out.println("Records with the value \"s91\" in the nested list \"arr\" in map_bin:");Filter filter = Filter.contains(MapBin, IndexCollectionType.LIST, "s91", CTX.mapIndex(0));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_i_list_str on ns=test set=cdt-indexing bin=map_bin.
Records with the value "s91" in the nested list "arr" in map_bin:key=109 bins={map_bin={9=[s91, s92], obj={attr2=s92, attr1=91, subobj={attr4=902, attr3=901}}, oid=9}}
The top level Map in map_bin contains a key 5.
// Create an index for keys in top level Map in map_bin.// idx_map_bin_top_mapkeys_numcreateIndex("idx_map_bin_top_mapkeys_num", MapBin, IndexType.NUMERIC, IndexCollectionType.MAPKEYS);
// issue the query and print resultsSystem.out.println("Records with key 5 in the top level Map in map_bin:");Filter filter = Filter.contains(MapBin, IndexCollectionType.MAPKEYS, 5);executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_top_mapkeys_num on ns=test set=cdt-indexing bin=map_bin.
Records with key 5 in the top level Map in map_bin:key=105 bins={map_bin={5=[s51, s52], obj={attr2=s52, attr1=51, subobj={attr4=502, attr3=501}}, oid=5}}
The nested “obj” Map in map_bin contains a value “s12”.
// Create an index for values in nested Map "obj" in map_bin.// idx_map_bin_obj_mapvals_strcreateIndex("idx_map_bin_obj_mapvals_str", MapBin, IndexType.STRING, IndexCollectionType.MAPVALUES, CTX.mapKey(Value.get("obj")));
// issue the query and print resultsSystem.out.println("Records with the value \"s12\" in the nested Map \"obj\" in map_bin:");Filter filter = Filter.contains(MapBin, IndexCollectionType.MAPVALUES, "s12", CTX.mapKey(Value.get("obj")));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_obj_mapvals_str on ns=test set=cdt-indexing bin=map_bin.
Records with the value "s12" in the nested Map "obj" in map_bin:key=101 bins={map_bin={1=[s11, s12], obj={attr2=s12, attr1=11, subobj={attr4=102, attr3=101}}, oid=1}}
Range Queries
Get records with a range of integer values in a List or Map.
Records with a List or Map containing an integer in a given range.
The nested list in list_bin has a value in range 85-93.
// Create an index for the nested List in map_bin. Note, the nested list is the last element.// idx_list_bin_nested_list_numcreateIndex("idx_list_bin_nested_list_num", ListBin, IndexType.NUMERIC, IndexCollectionType.LIST, CTX.listIndex(-1));
// issue the query and print resultsSystem.out.println("Records with value in range 85-93 in the nested list in list_bin:");Filter filter = Filter.range(ListBin, IndexCollectionType.LIST, 85, 93, CTX.listIndex(-1));executeQueryAndPrintResults(filter, ListBin);
Output:
Created index idx_list_bin_nested_list_num on ns=test set=cdt-indexing bin=list_bin.
Records with value in range 85-93 in the nested list in list_bin:key=8 bins={list_bin=[8, s8, 9, s9, 10, s10, 11, s11, 12, s12, [81, 82, 83, 84, 85]]}key=9 bins={list_bin=[9, s9, 10, s10, 11, s11, 12, s12, 13, s13, [91, 92, 93, 94, 95]]}
The top level Map in map_bin has a key in range 5-7.
// use the index created earlier// issue the query and print resultsSystem.out.println("Records with key in range 5-7 in the top level Map in map_bin:");Filter filter = Filter.range(MapBin, IndexCollectionType.MAPKEYS, 5, 7);executeQueryAndPrintResults(filter, MapBin);
Output:
Records with key in range 5-7 in the top level Map in map_bin:
key=107 bins={map_bin={obj={attr2=s72, attr1=71, subobj={attr4=702, attr3=701}}, oid=7, 7=[s71, s72]}}key=105 bins={map_bin={5=[s51, s52], obj={attr2=s52, attr1=51, subobj={attr4=502, attr3=501}}, oid=5}}key=106 bins={map_bin={obj={attr2=s62, attr1=61, subobj={attr4=602, attr3=601}}, 6=[s61, s62], oid=6}}
The nested Map “subobj” in map_bin has a value in range 300-500.
// Create an index for values in nested Map "subobj" in map_bin.// idx_map_bin_subobj_mapvals_numcreateIndex("idx_map_bin_subobj_mapvals_num", MapBin, IndexType.NUMERIC, IndexCollectionType.MAPVALUES, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("subobj")));
// issue the query and print resultsSystem.out.println("Records with value in range 300-500 in the nested Map \"subobj\" in map_bin:");Filter filter = Filter.range(MapBin, IndexCollectionType.MAPVALUES, 300, 500, CTX.mapKey(Value.get("obj")), CTX.mapKey(Value.get("subobj")));executeQueryAndPrintResults(filter, MapBin);
Output:
Created index idx_map_bin_subobj_mapvals_num on ns=test set=cdt-indexing bin=map_bin.
Records with value in range 300-500 in the nested Map "subobj" in map_bin:key=103 bins={map_bin={obj={attr2=s32, attr1=31, subobj={attr4=302, attr3=301}}, oid=3, 3=[s31, s32]}}key=104 bins={map_bin={4=[s41, s42], obj={attr2=s42, attr1=41, subobj={attr4=402, attr3=401}}, oid=4}}
We will illustrate how JSON documents can be indexed and accessed using CDT indexing.
Initialize Document Client
// Import modulesimport org.apache.commons.io.FileUtils;import com.fasterxml.jackson.databind.JsonNode;import com.aerospike.documentapi.*;
// Initialize Document client from Aerospike clienttry { client.close();}catch (Exception ae) { //ignore}ClientPolicy cPolicy = new ClientPolicy();cPolicy.writePolicyDefault.sendKey = true;AerospikeClient client = new AerospikeClient(cPolicy, "localhost", 3000);AerospikeDocumentClient documentClient = new AerospikeDocumentClient(client);System.out.println("Initialized document client from the Aerospike client.");;
Initialized document client from the Aerospike client.
String Namespace = "test";String Set = "cdt-indexing";
// convenience function to create an index - essentially a pass-through to the client APIvoid createIndex(String idxName, String binName, IndexType idxType, IndexCollectionType colType, CTX... ctx) { try { IndexTask task = client.createIndex(null, Namespace, Set, idxName, binName, idxType, colType, ctx); task.waitTillComplete(1000, 0); } catch (AerospikeException ae) { if (ae.getResultCode() != ResultCode.INDEX_ALREADY_EXISTS) { throw ae; } } System.out.format("Created index %s on ns=%s set=%s bin=%s.\n", idxName, Namespace, Set, binName);}
// convenience function to drop an index - essentially a pass-through to the client APIvoid dropIndex(String idxName) { try { IndexTask task = client.dropIndex(null, Namespace, Set, idxName); } catch (AerospikeException ae) { if (ae.getResultCode() != ResultCode.INDEX_NOTFOUND) { throw ae; } } System.out.format("Dropped index %s.\n", idxName);}
// convenience function to execute a query using the input filter and print the resultsvoid executeQueryAndPrintResults(Filter filter, String binName) { Statement stmt = new Statement(); stmt.setNamespace(Namespace); stmt.setSetName(Set); stmt.setFilter(filter); stmt.setBinNames(binName); RecordSet rs = client.query(null, stmt); while (rs.next()) { Key key = rs.getKey(); System.out.format("key=%s\n", key.userKey); Record record = rs.getRecord(); //System.out.format("key=%s bins=%s\n", key.userKey, record.bins); Map<String,?> jsonMap = (Map<String,?>) record.getValue(JsonBin); for (String field: jsonMap.keySet()) { System.out.printf("\t%s=%.200s\n", field, jsonMap.get(field)); } } //System.out.println(); rs.close();}
// convenience function to truncate test datavoid truncateTestData() { try { client.truncate(null, Namespace, null, null); } catch (AerospikeException e) { // ignore }}
Example 1: Nobel Prizes JSON Dataset
Read JSON File and Populate Database
We will load JSON documents from the file “nobel_prizes.json”, and store each prize entry in the database in a separate record with user_key starting at 1000 in bin “json_bin”. You can view the file’s contents by opening it from Jupyter’s file browser.
String JsonFilePath = "/home/jovyan/notebooks/java/nobel_prizes.json";String JsonBin = "json_bin";Integer UserKeyOffset = 1000;
// Read the json document into a string.String jsonString = FileUtils.readFileToString(new File(JsonFilePath));System.out.format("Read JSON doc from: %s\n", JsonFilePath);;
// Convert JSON string to a JsonNodeJsonNode rootNode = JsonConverters.convertStringToJsonNode(jsonString);
// Add the prizes in JSON in separate recordsJsonNode prizesNode = rootNode.path("prizes");
Integer prizeIndex = 0;for (JsonNode prizeNode : prizesNode) { if (prizeIndex == 0) { System.out.format("Example prize schema: \n%s \n\n", prizeNode.toPrettyString()); } Key jsonDocKey = new Key(Namespace, Set, UserKeyOffset+prizeIndex+1); documentClient.put(jsonDocKey, JsonBin, prizeNode); prizeIndex++;}System.out.format("Stored %d JSON documents in database \n", prizeIndex);;
Read JSON doc from: /home/jovyan/notebooks/java/nobel_prizes.jsonExample prize schema:{ "year" : "2021", "category" : "chemistry", "laureates" : [ { "id" : "1002", "firstname" : "Benjamin", "surname" : "List", "motivation" : "\"for the development of asymmetric organocatalysis\"", "share" : "2" }, { "id" : "1003", "firstname" : "David", "surname" : "MacMillan", "motivation" : "\"for the development of asymmetric organocatalysis\"", "share" : "2" } ]}
Stored 658 JSON documents in database
Execute Query with Index
Find nobel prizes in a given year.
// Create an index on the year field.// idx_json_bin_year_strcreateIndex("idx_json_bin_year_str", JsonBin, IndexType.STRING, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("year")));
// Find nobel prizes in year 1969.System.out.println("Nobel prizes in year 1969:");Filter filter = Filter.equal(JsonBin, "1969", CTX.mapKey(Value.get("year")));executeQueryAndPrintResults(filter, JsonBin);
Output:
Created index idx_json_bin_year_str on ns=test set=cdt-indexing bin=json_bin.
Nobel prizes in year 1969:key=1314 category=economics year=1969 laureates=[{firstname=Ragnar, share=2, id=677, surname=Frisch, motivation="for having developed and applied dynamic models for the analysis of economic processes"}, {firstname=Jan, share=2, id=678, surname=Tinbkey=1313 category=chemistry year=1969 laureates=[{firstname=Derek, share=2, id=237, surname=Barton, motivation="for their contributions to the development of the concept of conformation and its application in chemistry"}, {firstname=Odd, share=2, ikey=1315 category=literature year=1969 laureates=[{firstname=Samuel, share=1, id=643, surname=Beckett, motivation="for his writing, which - in new forms for the novel and drama - in the destitution of modern man acquires its elevation"}]key=1318 category=medicine year=1969 laureates=[{firstname=Max, share=3, id=391, surname=Delbrück, motivation="for their discoveries concerning the replication mechanism and the genetic structure of viruses"}, {firstname=Alfred D., share=3, id=392key=1317 category=physics year=1969 laureates=[{firstname=Murray, share=1, id=90, surname=Gell-Mann, motivation="for his contributions and discoveries concerning the classification of elementary particles and their interactions"}]key=1316 category=peace year=1969 laureates=[{share=1, firstname=International Labour Organization, id=527, motivation="for creating international legislation insuring certain norms for working conditions in every country"}]
Find nobel prizes in a given category.
// Create an index on the category field.// idx_json_bin_category_strcreateIndex("idx_json_bin_category_str", JsonBin, IndexType.STRING, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("category")));
// Find nobel prizes for peace.System.out.println("Nobel peace prizes:");Filter filter = Filter.equal(JsonBin, "peace", CTX.mapKey(Value.get("category")));executeQueryAndPrintResults(filter, JsonBin);
Output:
Created index idx_json_bin_category_str on ns=test set=cdt-indexing bin=json_bin.
Nobel peace prizes:key=1190 category=peace year=1990 laureates=[{firstname=Mikhail, share=1, id=552, surname=Gorbachev, motivation="for the leading role he played in the radical changes in East-West relations"}]key=1561 category=peace year=1920 laureates=[{firstname=Léon, share=1, id=484, surname=Bourgeois, motivation="for his longstanding contribution to the cause of peace and justice and his prominent role in the establishment of the League of Natiokey=1401 category=peace year=1952 laureates=[{firstname=Albert, share=1, id=513, surname=Schweitzer, motivation="for his altruism, reverence for life, and tireless humanitarian work which has helped making the idea of brotherhood between men ankey=1386 category=peace year=1955 overallMotivation="No Nobel Prize was awarded this year. The prize money was allocated to the Special Fund of this prize section."key=1591 category=peace year=1914 overallMotivation="No Nobel Prize was awarded this year. The prize money was allocated to the Special Fund of this prize section."key=1646 category=peace year=1903 laureates=[{firstname=Randal, share=1, id=466, surname=Cremer, motivation="for his longstanding and devoted effort in favour of the ideas of peace and arbitration"}]... (truncated)
Example 2: Space Companies JSON Dataset
Read JSON File and Populate Database
We will load JSON documents from the file “space_companies.json”, and store each prize entry in the database in a separate record with user_key starting at 2000 in bin “json_bin”. You can view the file contents by opening it from Jupyter’s file browser.
String JsonFilePath = "/home/jovyan/notebooks/java/space_companies.json";String JsonBin = "json_bin";Integer UserKeyOffset = 2000;
// Read the json document into a string.String jsonString = FileUtils.readFileToString(new File(JsonFilePath));System.out.format("Read JSON doc: %s\n", JsonFilePath);;
// Convert JSON string to a JsonNodeJsonNode rootNode = JsonConverters.convertStringToJsonNode(jsonString);
// Add the companies in JSON in separate recordsJsonNode companiesNode = rootNode.path("companies");
Integer companyIndex = 0;for (JsonNode companyNode : companiesNode) { if (companyIndex == 1) { System.out.format("Example company schema: \n%s \n\n", companyNode.toPrettyString()); } Key jsonDocKey = new Key(Namespace, Set, UserKeyOffset+companyIndex+1); documentClient.put(jsonDocKey, JsonBin, companyNode); companyIndex++;}System.out.format("Stored %d JSON documents in database \n", companyIndex);;
Output:
Read JSON doc: /home/jovyan/notebooks/java/space_companies.json
Example company schema:{ "_id" : "58d163cd6ed7bb3c6ee62e63", "company" : { "_id" : "58d164516ed7bb3c6ee62e64", "about" : "Telespazio VEGA UK is a dynamic and experienced consulting, technology and engineering services business based in Luton, UK. Following our merger into the Telespazio Group in 2011 we now have access to a vast array of world leading facilities and services, which we provide both within the UK and to our export markets. With a strong history in the European space arena starting in 1978, we have rich relationships across the space industry, with staff working closely with the European Space Agency (ESA) on programs and projects for over 35 years. ", "backgroundImage" : "https://www.spaceindividuals.com/var/www/images/8907e7e06a094fe31098d167567fa1dc.jpg", "companyLocation" : "Capability Green, Luton LU1 3LU, UK", "companyName" : "Telespazio VEGA UK", "createdAt" : "2017-06-05T00:00:00.000Z", "culture" : "As an organisation that aspires to excellence, Telespazio VEGA recognises the need to appreciate our people for their achievements. Without the skills, dedication and enthusiasm of our people, we would not be where we are today. Telespazio VEGA continually strives to create a rewarding environment, offer ample opportunity for personal and professional development and support our people in balancing work and life.\nTo ensure that we remain relevant and of value to our clients, Telespazio VEGA has created an environment that encourages and rewards innovation and entrepreneurial spirit. This, offered with the financial stability afforded us as part of the Finmeccanica group of companies, means that valid business ideas are given the appropriate backing and support to enable them to succeed.\nWe also recognise that each individual has their own aspirations for personal development. Therefore, we offer a range of career options that enable consultants to enhance individual niche capabilities, embrace the challenge of business development, and take on the responsibility of a management role. These opportunities are made available through a structured career development programme which enables our people to look closely at themselves, where they are today, and where they see themselves in the future. We recognise and support all these ambitions and look to retain a rich mix of skills and capabilities that will continue to support our clients’ evolving business requirements.\n\nContinuous Learning\nAs a business, Telespazio VEGA places great emphasis on allowing our people to learn and develop, both personally and professionally. We share ideas with colleagues and clients, challenge each other and learn through our involvement with a variety of projects.\nTelespazio VEGA provides ample opportunity for more formal training. We encourage our people to undertake professional qualifications, and support those who undertake study which relates to their work\n", "email" : "careers@telespazio.com", "galleryImages" : [ "https://spaceindividuals.com/var/www/images/4feb7c40b8b3d5a3ea83b600d6731b94.jpg", "https://spaceindividuals.com/var/www/images/ab664d6614614b4b771c3fe99e55383b.jpg", "https://spaceindividuals.com/var/www/images/d3ab90c334abfe301c93b14b1ff0f2d4.jpg" ], "isPublished" : true, "kindOfEmployeeLookFor" : null, "logoURL" : "https://spaceindividuals.com/var/www/images/ca6d9b5d356a2186fb9144b6bb34ab03.png", "mission" : null, "paymentType" : "enterprise", "phone" : "+44 (0)1582 399000", "sizeCompany" : "100-250 employees", "specialities" : "We now serve a wide range of public and private commercial markets globally by providing efficient, skilled support, the most up to date space data and services, and fundamentally, understanding and integrating the needs of our customers.\n\nWe have developed and harnessed our knowledge and expertise to deliver unique complex solutions for:\nGEO Information, Satellite Systems and Applications & Satellite Communications.\nWe offer robust and reliable ground segment systems for satellite missions and utilise space assets to develop downstream applications across a wide variety of sectors. \nWe provide world leading scientific services on earth observation and space science missions.\nOur valued and experienced knowledgeable experts deliver consulting and engineering support to space missions worldwide.", "testimonials" : null, "updatedAt" : "2021-01-20T09:45:41.731Z", "weOfferChallenges" : null, "weOfferKindOfChallenges" : "Telespazio VEGA’s culture is defined by our people, and drives the value we deliver through the relationships we have with our stakeholders.\n\nOur culture is distinct and discernible, mirroring the multi-national, multi-disciplinary environment in which we work. It is based on a practical engineering ethos that thrives on challenge, and engenders a rigorous, analytical, results-oriented approach to problem solving. We are open and honest, say what we believe, and challenge that which we do not.\n\nAt Telespazio VEGA, our culture is also based on a pursuit of excellence and thought leadership – not only in our understanding of the technologies with which we work and the market domains in which they are applied, but also in the quality of the ideas, methodologies, services and solutions we deliver.\n\nWe share with our clients an unparalleled enthusiasm for Space and its application in society. Therefore, we are able to empathise strongly, and invest this understanding and commitment into long-term relationships with our clients and stakeholders.\n", "weOfferKindOfTrainings" : null, "weOfferTrainings" : null, "webSite" : "http://telespazio-vega.com" }, "createdAt" : "2017-06-05T00:00:00.000Z", "email" : "careers@telespazio.com", "jobs" : null, "numberOfJobs" : 0, "updatedAt" : "2021-01-20T09:47:36.694Z"}
Stored 154 JSON documents in database
Execute Query with Index
Find companies by number of job postings.
// Create an index on the numberOfJobs field.// idx_json_bin_number_of_jobs_numcreateIndex("idx_json_bin_number_of_jobs_num", JsonBin, IndexType.NUMERIC, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("numberOfJobs")));
// Find companies with more than 10 job postings.System.out.println("Companies with numberOfJobs greater than 10:");Filter filter = Filter.range(JsonBin, 11, 100, CTX.mapKey(Value.get("numberOfJobs")));executeQueryAndPrintResults(filter, JsonBin);
Output:
Created index idx_json_bin_number_of_jobs_num on ns=test set=cdt-indexing bin=json_bin.
Companies with numberOfJobs greater than 10:key=2013 createdAt=2018-03-04T14:00:04.000Z jobs=null numberOfJobs=11 company={backgroundImage=https://spaceindividuals.com/var/www/images/971f13def35f0b21e379fd1555b3e10d.jpg, isPublished=true, companyName=AKKA Technologies, about=As a consulting and engineering Group and Euro _id=593fdfae9839ef0a13b6b92d email=kontakt@akka.eu updatedAt=2021-01-27T14:04:41.487Zkey=2070 createdAt=2018-03-29T14:00:04.000Z jobs=null numberOfJobs=62 company={backgroundImage=https://spaceindividuals.com/var/www/images/02873814215a7b781e0f5f10e726926b.jpg, companyName=CERN, about=Imagine taking part in the largest scientific experiment in the world. CERN n _id=59df28ff947f49c77101e477 email=recruitment.service@cern.ch updatedAt=2021-01-27T09:28:02.836Zkey=2084 createdAt=2018-04-01T14:00:04.000Z jobs=null numberOfJobs=30 company={backgroundImage=https://spaceindividuals.com/var/www/images/81e0f8e35dbab0242b8729f795986047.jpg, isPublished=true, companyName=RHEA Group, about=25 years of experience in the professional engineerin _id=58d965bfde9c102b036040f7 email=temp@rheagroup.com updatedAt=2021-01-27T10:16:40.854Zkey=2107 createdAt=2018-11-21T15:48:41.941Z jobs=null numberOfJobs=95 company={backgroundImage=/profile-pic/1574021440639.jpg, isPublished=true, companyName=European Space Agency - ESA, about=The European Space Agency (ESA) is Europe’s gateway to space. Its mission is to shape _id=5bf57e594739c05820a92eee email=ZINEB.ELOMRI@ESA.INT updatedAt=2021-01-25T09:34:07.372Zkey=2071 createdAt=2018-03-07T14:02:08.000Z jobs=null numberOfJobs=16 company={backgroundImage=https://spaceindividuals.com/var/www/images/093fb01f43d429801105c6fc0b224d40.jpg, companyName=Terma Group, about=At Terma our people make the difference. We employ talented, dedicated _id=590a258c7cd117274a0ae681 email=recruitment.de@terma.com updatedAt=2021-01-27T13:24:59.183Zkey=2083 createdAt=2018-03-04T15:01:08.000Z jobs=null numberOfJobs=52 company={backgroundImage=https://spaceindividuals.com/var/www/images/98181cdb3d430d1e90c695e44caeea90.jpg, isPublished=true, companyName=Serco Europe, about=The Serco Group is a FTSE 250 corporation with over _id=58f78ba25a0782e629c7b621 email=tatiana.thiessen@serco.com updatedAt=2020-11-30T09:49:19.914Zkey=2051 createdAt=2018-04-17T13:04:35.890Z jobs=null numberOfJobs=26 company={backgroundImage=https://spaceindividuals.com/var/www/images/604d16af11bf4f83e8db7aa57a10c3d0.jpg, isPublished=true, companyName=OHB-System AG, about=3,2,1, lift off...
Find companies by a nested field such as the payment type.
The paymentType field may be truncated in the result output, but can be verified with the AQL query in the terminal tab: aql -c “select json_bin from test.cdt-indexing where PK=key”.
// Create an index on the nested paymentType field.// idx_json_bin_payment_type_strcreateIndex("idx_json_bin_payment_type_str", JsonBin, IndexType.STRING, IndexCollectionType.DEFAULT, CTX.mapKey(Value.get("company")), CTX.mapKey(Value.get("paymentType")));
// Find companies with "enterprise" payment type.System.out.println("Companies with enterprise payment type:");Filter filter = Filter.equal(JsonBin, "enterprise", CTX.mapKey(Value.get("company")), CTX.mapKey(Value.get("paymentType")));executeQueryAndPrintResults(filter, JsonBin);
Output:
Created index idx_json_bin_payment_type_str on ns=test set=cdt-indexing bin=json_bin.
Companies with enterprise payment type:key=2013 createdAt=2018-03-04T14:00:04.000Z jobs=null numberOfJobs=11 company={backgroundImage=https://spaceindividuals.com/var/www/images/971f13def35f0b21e379fd1555b3e10d.jpg, isPublished=true, companyName=AKKA Technologies, about=As a consulting and engineering Group and Euro _id=593fdfae9839ef0a13b6b92d email=kontakt@akka.eu updatedAt=2021-01-27T14:04:41.487Zkey=2002 createdAt=2017-06-05T00:00:00.000Z jobs=null numberOfJobs=0 company={backgroundImage=https://www.spaceindividuals.com/var/www/images/8907e7e06a094fe31098d167567fa1dc.jpg, isPublished=true, companyName=Telespazio VEGA UK, about=Telespazio VEGA UK is a dynamic and exper _id=58d163cd6ed7bb3c6ee62e63 email=careers@telespazio.com updatedAt=2021-01-20T09:47:36.694Zkey=2045 createdAt=2018-03-24T14:00:04.000Z jobs=null numberOfJobs=8 company={backgroundImage=https://spaceindividuals.com/var/www/images/0786a572d77f4a0b8a48c5b054766c54.jpg, isPublished=true, companyName=Telespazio Germany GmbH, about=Telespazio Germany is the first choice a _id=58e13089de9c102b03604295 email=recruitment@telespazio.de updatedAt=2021-01-27T11:04:38.881Zkey=2031 createdAt=2018-04-25T12:27:23.286Z jobs=null numberOfJobs=0 company={paymentTypeOrdered=enterprise, backgroundImage=https://www.spaceindividuals.com/var/www/images/5207d3c4216068b9423ade06215f1529.PNG, isPublished=true, companyName=European GNSS Agency, about=Space is _id=5ae0742b4392412b350ca064 email=jobs@gsa.europa.eu updatedAt=2020-11-13T17:25:53.261Zkey=2107 createdAt=2018-11-21T15:48:41.941Z jobs=null numberOfJobs=95 company={backgroundImage=/profile-pic/1574021440639.jpg, isPublished=true, companyName=European Space Agency - ESA, about=The European Space Agency (ESA) is Europe’s gateway to space. Its mission is to shape _id=5bf57e594739c05820a92eee email=ZINEB.ELOMRI@ESA.INT updatedAt=2021-01-25T09:34:07.372Zkey=2084 createdAt=2018-04-01T14:00:04.000Z jobs=null numberOfJobs=30 company={backgroundImage=https://spaceindividuals.com/var/www/images/81e0f8e35dbab0242b8729f795986047.jpg, isPublished=true, companyName=RHEA Group, about=25 years of experience in the professional engineerin _id=58d965bfde9c102b036040f7 email=temp@rheagroup.com updatedAt=2021-01-27T10:16:40.854Z... (truncated)
When the elements are distributed in multiple places in the CDT, a List or Map is not readily available for indexing. A simple example of this is to find the nobel prize(s) by the winner’s name.
{ "year" : "2021", "category" : "chemistry", "laureates" : [ { "id" : "1002", "firstname" : "Benjamin", "surname" : "List", "motivation" : "\"for the development of asymmetric organocatalysis\"", "share" : "2" }, { "id" : "1003", "firstname" : "David", "surname" : "MacMillan", "motivation" : "\"for the development of asymmetric organocatalysis\"", "share" : "2" } ]}
Create List of Record Values to Query
To be able to find the nobel prize records by the winner’s name, we will create a separate List of the names of the laureates, and index that List. The List is stored in its own bin laureate_names.
laureate_names: ["benjamin list", "david macmillan", ..]
// iterate over records and update the laureate_names binimport com.aerospike.client.Bin;import com.aerospike.client.ScanCallback;import com.aerospike.client.policy.ScanPolicy;import com.aerospike.client.exp.Exp.Type;
public class ScanParallel implements ScanCallback { public void scanCallback(Key key, Record record) { //System.out.format("key=%s bins=%s\n", key.userKey, record.bins); Object binval = record.bins.get(JsonBin); List<String> laureateNamesList = new ArrayList<String>(); List<HashMap<String,String>> laureates = (List<HashMap<String,String>>) ((HashMap<String,Value>)binval).get("laureates"); if (laureates == null) { return; } for (HashMap<String,String> laureate: laureates) { //System.out.format("firstname: %s, surname: %s\n", laureate.get("firstname"), // laureate.get("surname")); laureateNamesList.add(laureate.getOrDefault("firstname","").toLowerCase() + " " + laureate.getOrDefault("surname","").toLowerCase()); } //System.out.println(laureateNamesList); Bin laureateNamesBin = new Bin("laureate_names", laureateNamesList); client.put(null, key, laureateNamesBin); }}// expression filter 1001 <= user-key < 2000 is specifed in the policyScanPolicy policy = new ScanPolicy();policy.filterExp = Exp.build( Exp.and( Exp.ge(Exp.key(Type.INT), Exp.val(1001)), Exp.lt(Exp.key(Type.INT), Exp.val(2000))));System.out.format("Scan with expression filter results:\n");client.scanAll(policy, Namespace, Set, new ScanParallel(), JsonBin);
Find nobel prizes by winner name.
// Create an index on the numberOfJobs field.// idx_laureate_names_list_strcreateIndex("idx_laureate_names_list_str", "laureate_names", IndexType.STRING, IndexCollectionType.LIST);
// Find nobel prizes by winner name "albert einstein".System.out.println("Prizes with laureate name \"marie curie\":");Filter filter = Filter.contains("laureate_names", IndexCollectionType.LIST, "marie curie");executeQueryAndPrintResults(filter, JsonBin);
Output:
Created index idx_laureate_names_list_str on ns=test set=cdt-indexing bin=laureate_names.
Prizes with laureate name "marie curie":key=1647 category=physics year=1903 laureates=[{firstname=Henri, share=2, id=4, surname=Becquerel, motivation="in recognition of the extraordinary services he has rendered by his discovery of spontaneous radioactivity"}, {firstname=Pierre, share=key=1604 category=chemistry year=1911 laureates=[{firstname=Marie, share=1, id=6, surname=Curie, motivation="in recognition of her services to the advancement of chemistry by the discovery of the elements radium and polonium, by the isolation of ra
Remove tutorial data and close connection.
client.truncate(null, Namespace, null, null);client.close();System.out.println("Removed tutorial data and closed server connection.");
Output:
Removed tutorial data and closed server connection.
The tutorial illustrated with many examples the secondary index functionality with CDTs that is available in Aerospike Database 6.1.0.0+. The CDT indexing enables faster queries on JSON documents and other complex objects stored as CDTs.
Next steps
Visit Aerospike notebooks repo to run additional Aerospike notebooks. To run a different notebook, download the notebook from the repo to your local machine, and then click on File->Open, and select Upload.