This notebook demonstrates the use of the Java Object Mapper, which provides
a convenient way of saving objects and their relationships in an Aerospike
database, and also retrieving them.
This notebook requires Aerospike database running on localhost. Visit
Aerospike notebooks repo for
additional details and the docker container.
Introduction
The Java Object Mapper code and documentation is available in this repo.
The goal of this tutorial is to describe how to serialize (save) Java
objects into Aerospike database and deserialize (load) Aerospike records
into Java objects using the object mapper. The tutorial focuses on the
core functionality of the object mapper, and points out more advanced
topics for the reader to explore.
The object mapper uses Java annotations to define the Aerospoke
semantics for the saving and loading behavior. Since the respective
annontations are next to the definitions of a class, methods, and
fields, the object mapper makes persistence using Aerospike:
easier to implement,
easier to understand, and
less error prone.
The main topics in this notebook include:
Basic operations
Mapping between Java and Aerospike types
Specifying fields to persist
List and Map object representation
Embedding an object vs storing a reference
Prerequisites
This tutorial assumes familiarity with the following topics:
Aerospike Java client 5.1.3 and the object mapper library 2.0.0.
%%loadFromPOM
<dependencies>
<dependency>
<groupId>com.aerospike</groupId>
<artifactId>aerospike-client</artifactId>
<version>5.1.3</version>
</dependency>
<dependency>
<groupId>com.aerospike</groupId>
<artifactId>java-object-mapper</artifactId>
<version>2.0.0</version>
</dependency>
</dependencies>
Initialize Client and Define Convenience Functions
Initialize the client, and define a convenience functions
truncateTestData to delete test data, and toJsonString to convert
and object to JSON notation for printing.
Initialized Aerospike client and connected to the cluster.
Access Shell Commands
You may execute shell commands including Aerospike tools like
aql and
asadm in the
terminal tab throughout this tutorial. Open a terminal tab by selecting
File->Open from the notebook menu, and then New->Terminal.
Basic Operations
The basic annotations to save and object are @AerospikeRecord and
@AerospikeKey.
@AerospikeRecord: Used with a class definition. It defines how the
objects of the class should be stored in Aerospike.
@AerospikeKey: Used with an attribute within the class to define the
object id or “primary key”.
Consider a simple class Person consisting of two fields: ssn and
name. The class definiition is annotated with @AerospikeRecord
annotation that takes two parameters for the namespace and set where
the instances of this class will be stored. The field ssn is the key
field and is annotated with @AerospikeKey.
NOTE:
All class attributes or fields are made public for convenience of
access in this tutorial. In practice, one would define getter and
setter methods for each.
Some class definitions need to change through this tutorial. Becuse
a class cannot be redefined in a notebook, multiple versions of a
class ClassName are defined with a numeric added to its name like
ClassName2, ClassName3, and so on.
If you make changes to a class definition in a code cell and rerun
it, you will get kernel errors. Restart the kernel after such code
changes.
System.out.format("Instantiated Person object: %s\n", toJsonString(person));;
Output:
Instantiated Person object: {
"ssn": "111-11-1111",
"name": "John Doe"
}
Processing Objects
Stored objects can be retrieved and processed with the find operation
that takes the class and a callback. The callback is defined as a
mapping function that takes an object and returns a boolean. The class
records will continue to be processed with the callback until the
function returns false.
importjava.util.function.Function;
// let's add another Person object
Personp2=newPerson();
p2.ssn="222-22-2222";
p2.name="Jane Doe";
mapper.save(p2);
// the function simply prints the name of a retrieved person record
Function<Person,Boolean> function= person -> {
System.out.println(person.name);
returntrue;
};
// scan records and process with the function
mapper.find(Person.class, function);
Output:
Jane Doe
John Doe
Deleting Object
A stored object may be deleted from the database with mapper’s delete
operation.
System.out.format("Instantiated Person object: %s\n", toJsonString(gone));;
Output:
Instantiated Person object: null
Mapping Java and Aerospike Types
The following table summarizes how Java types are mapped into Aerospike
types during save. During read, the Java class definition is used to map
Aerospike types into specific Java types.
Java Type
Aerospike Type
byte, char, short, int, long
Integer
boolean
Boolean
float, double
Double
java.util.date, java.util.instant
Integer
string
String
byte[]
Blob
enum
String
arrays, List<?>
List
Map
Map
Specifying Fields for Mapping
By default, all fields are mapped to the bins named after the respective
fields. The mapper allows you to select specific fields to save, and
also change bin names.
Selecting Specific Fields
In order to specify specific fields to persist, use@AerospikeBin and
AerospikeExclude annotations, and the mapAll parameter in
@AerospikeRecord.
If it is desired to save only specific bins, annotate those bins with
@AerospikeBin and usemapAll = false on @AerospikeRecord. In this
case, make sure to annotate the key field also with @AerospikeBin to
ensure that the key gets mapped to the database.
In the example below, only the fields explicitly annotated with
@AerospikeBin will be stored in the database.
An object can be embedded in another object. We use @AerospikeEmbed to
annotate the field representing the embedded object. An embedded object
is implicitly saved and loaded with the embedding object.
An object can be stored in Aerospike as a List or a Map.
For example, consider the Address object:
Address {
street = "100 Main St"
city = "Smartville"
state = "CA"
zipcode = "911001"
}
The object can be stored as a Map using the EmbedType parameter set
to MAP in @AerospikeEmbed annotation:
{"street":"100 Main St", "city":"Smartville", "state":"CA", "zipcode":"911001"}
A Map representation is the mapping of field names to their value.
While less space efficient than a List (described below), it doesn’t
need additional information to be stored for schema versioning.
Alternatively, the above object can be stored as a List of its field
values using the EmbedType parameter set to LIST in
@AerospikeEmbed annotation::
["100 Main St", "Smartville", "CA", "911001"]
A List representation of an object is space efficient and stores its
fields in alphabetical order. An explicit order of fields can be
specified using the @AerospikeOrdinal annotation to ensure a specific
sort order of a List of such objects.
In the following code, two Address objects are embedded in a Person
object. The object home_addr is stored as a Map whereas
office_addr is stored as a List.
Note that in the database, the default order of List representation is
the alphabetical order of field names. So in the above case, the List
order will be: city, state, street, zipcode.
office_addr: LIST('["Smartville", "CA", "100 Main St", "911001"]')
ssn: "555-55-5555"
Embedding Object Vs Storing Reference
As noted above, @AerospikeEmbed is used to embed a field representing
an object. An embedded object is implicitly saved and loaded with the
embedding object.
Note below, when we read the parent object that was saved above, the two
embedded objects are retrieved with it.
On the other hand, a reference to an object is annotated with
@AerospikeReference. A reference is stored within the referring object
as the id or key of the annotated object. A referenced object is loaded
automatically with the referring object. However, it must be saved
explicitly.
Below, we save the course that a person is enrolled for by reference.
Courses: Number: 100, Title: English Number: 200, Title: Math Number:
300, Title: Science Number: 400, Title: History
John’s courses: English, Math, History Jill’s courses: Math, Science
The following cell shows the code.
Some points to highlight:
While embedded objects need not have a key or id attribute, it is a
good practice to use an id for flexibility of switching between
embed and reference.
Use generics (as in List<Course> below) to describe the type as
fully as possible.
The object mapper handles circular references in the object graph.
Self-referncing classes are a special case of circular reference. A
class that has references to other instances of the same class is often
needed. For example, a Person object with a spouse field that is a
reference to another Person, and similarly with a children field.
In an arbitrarily deep nested graph, all dependent objects which are
@AerospikeRecord will be loaded. If it is desired for the objects not
to load dependent data, the reference can be marked with lazy = true.
We will extend the object model with the following relationships:
Person
AccountHolder is-a Person
AccountHolder has-a Account (1:N)
Student is-a Person
Student registersFor Course (M:N)
Course taughtBy Person (N:1)
Some relationshps are stored in both entities in this object model. For
example, a course stores a list of its students and each student stores
a list of their courses.
A few other things to note:
an object may be stored without other objects it references already
existing in the database
a subclass can be stored in its own namespace and set through its
@AerospikeRecord annotation; by default it is stored in its
closest ancestor’s namespace and set.
In the database, the actual class name is stored along with an
object reference if the field definition uses its parent class.
Let’s now try to read back some of the objects from the stored graph
topology. Note, we cannot use the JSON print function toJsonString as
earlier because it does not handle circular references. Also for each
list object, we need to iterate over elements and output an identiying
attribute.
The object mapper provides many sophisticated mechanisms, many of which
are listed below, to design and implementat real life use cases. Please
visit the object mapper
repo for their
description and examples.
Config via YAML string, and config file with precedence rules.
Policy specification and precedence rules
Versioning
Class hierarchies
Object deserialization using constructor factories
Custom data converters, custom getter/setter methods
Annotations Summary
The table below provides a summary of various annotations available in
the object mapper.
Persisting objects connected with complex relationships in Aerospike
requires mastery of Aerospike APIs, and doing so manually can be tricky
to maintain and error prone. The object mapper uses Java annotations to
define the Aerospoke semantics for saving and loading behavior. As
annontations appear next to the class, method, and field definitions,
the object mapper makes persistence using Aerospike:
easier to implement,
easier to understand, and
less error prone.
Cleaning Up
Remove tutorial data and close connection.
truncateTestData();
client.close();
System.out.println("Removed tutorial data and closed server connection.");
Output:
Removed tutorial data and closed server connection.
Visit Aerospike notebooks
repo to
run additional Aerospike notebooks. To run a different notebook,
download the notebook from the repo to your local machine, and then
click on File->Open in the notebook menu, and select Upload.