Frequently Asked Questions (FAQ)

General information

Is Aerospike based on an Open Source product?

No. Aerospike was developed from the ground up to be a highly scalable, low-latency, enterprise-class distributed database. Aerospike Database Community Edition (CE) is a free, open source version that was first released in 2014. Aerospike Database Enterprise Edition (EE), and Standard Edition (SE) are built on top of Aerospike CE, with added enterprise features. While CE shares the same developer APIs as EE and SE (with the exception of durable deletes), they differ in scalability, security, ease of operation, connectivity, and much more. Refer to product matrix for details.

Do I need a trial key for Aerospike Database?

Starting with Database 6.1, Enterprise Edition (EE) comes bundled with a single-node, all-features key file for evaluation, and starts up in the evaluation mode unless a different feature key file is configured. You can download Aerospike EE and immediately start using it without any further steps.

Enterprise customers receive separate feature key files for their production and development environments. For a free multi-node EE evaluation see Try Now.

You do not need a feature key file to use the Community Edition of Aerospike Database (CE).

Can I use Community Edition (CE) and Enterprise Edition (EE) at the same time?

No. Aerospike's license agreement with its enterprise customers prohibits running CE clusters alongside EE clusters.

What is the upgrade path from Aerospike CE to EE? Is downtime required during the upgrade?

No downtime is required when you upgrade from CE to EE or SE through a rolling upgrade (one node at a time). See Upgrade/Repair Server and Upgrading from Community to Enterprise version.

How is unique data counted?

Aerospike charges primarily by unique production data. This means that development, testing, staging, and failover clusters are not included.

A production cluster's unique data is roughly the uncompressed data stored in each namespace divided by its replication factor, excluding index metadata. See the Unique Data Agent for more details.

An enterprise customer's unique data is the sum of each production cluster's unique data. The method of writing data to a production cluster (local client, connector or XDR writes) does not change the method by which unique data is counted.

Hardware

What hardware does Aerospike support?

In production, Aerospike database runs on servers powered by 64-bit Intel or, as of Database 6.2, ARM processors running a Linux operating system. Developers can run Aerospike database on compatible processors using Docker in their macOS and Windows development environments, such as, Apple, Intel, and M1 MacBook Pro laptops.

Refer to the System Requirements at Planning Deployment.

Which versions of ARM does Aerospike Database support?

Aerospike runs natively on Linux operating systems, such as Red Hat Enterprise Linux, Ubuntu, Debian, and Amazon Linux 2023. Aerospike ARM port was tested on AWS Graviton2 EC2 instances.

In Database 6.2, Aerospike introduced support for ARM processors compatible with the ARMv8.2-A instruction set (Neoverse N1 microarchitecture).
Some RHEL 7 and RHEL 8 variants for 64-bit ARM have changed their default kernels to a 64K memory page size, and are therefore no longer supported. Aerospike users with deployments on these operating system versions must upgrade to one whose kernel has a 4K page size. Amazon Linux 2, Amazon Linux 2023 for 64-bit ARM are safe.

How do I load balance across the nodes in a cluster?

Aerospike automatically distributes both data and traffic to all the nodes in a cluster. There is no need to add additional load balancing. In fact, load balancers tend to cause interruption and reduce performance.

Can I have mixed hardware configurations for nodes?

Yes. Aerospike does not require that each node be the same hardware. However, the cluster randomly and evenly distributes data across the nodes, so the cluster is limited to the node performance with the least capacity.

After a power outage, should all the nodes be restarted at the same time?

If the application that uses the Aerospike database is not running (for example, if the whole cluster is down), you should be able to bring the cluster nodes up all at the same time. Otherwise, bring up one cluster node at a time, while disabling migrations. As soon as a node joins the cluster, move to bring up the next cluster node.

When a node goes down, how do you reconfigure the system to reroute traffic?

You don’t. With Aerospike, the cluster automatically detects when a node has left the cluster. It automatically responds by rebalancing data and changing the configuration so that the clients know how to communicate with the cluster.

Database

Map key restrictions

Starting in Database 7.1, map keys are restricted to simple types -- integer, string, and blob.

In what programming language is Aerospike written?

Aerospike is written in C for performance and predictability. Aerospike controls the use of memory and is not affected from garbage collection issues, an issue common to products written in other languages, such as Java.

What programming languages does Aerospike support?

You can learn about our client libraries, read developer blogs, and access the sandbox at the Developer Hub.

Can I run multiple instances of the Aerospike server on one machine?

On a server with a multi-socket CPU, you can run multiple instances of Aerospike, each pinned to a unique NUMA node. See Multiple asd instances for more information.

How do I decide how to separate data into namespaces and sets?

In general, you can think of a “set” in Aerospike as you would a “table” in a relational database. You can also think of an Aerospike “namespace” as a “tablespace” in a relational database.

The best way to start is to understand the sets you need. Typically, sets have users, URLs, servers as each record in them. Sets that have similar requirements often belong to the same namespace.

Since Aerospike does not have a set schema, even completely different sets can exist in the same namespace. For example, a namespace may contain sets for people, servers, and URLs. The bins from one namespace do not need to exist in the other namespaces. You may also find that similar items need to be in different namespaces because they have different needs for how the data is synchronized between different data centers. For example, you may want a set of users in one namespace that is synchronized across the US and a set of users in a different namespace that is synchronized across Asia.

How do I define a schema in Aerospike?

With Aerospike, there is no need to define a schema. Every row can have a different set of bins (similar columns in a relational database). In fact, even the same bin in one record, or row, does not have to have the same data type as the same bin in another record. This flexibility allows you to create applications without the limitations inherent in relational databases.

What query language do you support?

Aerospike has its own API to handle requests to the database. These requests are in the form of get/put/updates to the database, and also include atomic operations on data types, such as Integer, Double, Boolean, List, Map, HyperLogLog and Blob. Language specific clients implement the API for C, Java, C#, Python, Go, Node.js, and others.

How does Aerospike distribute data/traffic?

In Aerospike Database every record is randomly assigned to a logical partition and partitions are evenly distributed among the database cluster nodes. As a result, both the data volume and traffic are evenly distributed. Any time a cluster changes, data distribution and redistribution happens automatically.
With other solutions, you must manually redistribute data.

Do you support batch gets and puts?

Yes. Support for batch writes, deletes, and UDFs was added in Database 6.0. Previous versions of the server only supported batch reads.

How do I find an API call to get the key for a given row?

Fix the escaping of Policy.sendKey to a backtick before and after. Currently it is being closed by a single quote (') so it is not using code formatting.

What is Aerospike Available Percent?

Available Percent is the amount of storage defragmented and the percent available for writing to Aerospike, as a percentage of total space on disk. It is not free disk space. It is the available storage for streaming writes on that particular Aerospike namespace.

If I need to make a configuration change, do I have to restart the server?

No. Generally you can change the configuration of a node dynamically by issuing some commands from the command line. Refer to the Configuration Reference for details. You can make changes to most parameters, even memory settings, while the server is running. However, any permanent changes must be made in the configuration file (/etc/aerospike/aerospike.conf), and they are not read dynamically. Changes in the configuration file are only delivered to the server on restart.

If a node goes down during a read or write, what happens?

It is possible for a cluster node to go down due to hardware failure. The cluster identifies that the node is no longer sending heartbeats, triggering the formation of a new cluster with a new partition map.

The clients regularly check for state changes in the Aerospike server through a cluster-tending thread. On the next tend interval after the new cluster is formed (default to one second), the clients learn of the new partition map and the new peer list. Until then, the clients may try to perform some operations against a node that is no longer there.
If a client tries to read from such a node, the operation times out. When this happens, the client automatically tries reading from a node holding a replica of the record. From a coding standpoint, the developer does not need to be aware of this event, as the client handles the additional attempts at communicating with the database.
If the client tries to write to such a node, the operation times out. What happens next depends on write policies given to the client. The clients can be configured to retry the operation a specified number of times with a given interval between them. The application may choose to catch the timeout and either retry, defer, or ignore.
When a node is taken down for planned maintenance, it should first be quiesced. This prevents applications from experiencing read and write timeouts. Quiescence is an Enterprise Edition feature.

How do I back up the database? Will this impact performance?

The Aerospike backup tool, asbackup, gathers data from all the nodes and puts them into files. This can be done while the cluster is up and serving requests. The machine with the backup does not need to be a node within the cluster, but it must have network access to the cluster. The backup process is configurable and Aerospike has recommendations to ensure that the backup does not affect normal operations. Most customers can perform backups within a few hours with these settings. The backup system runs at a lower priority than the front end service, so a backup can take varying amounts of time.

Is there a way to delete all content from a namespace?

We recommend using asadm's manage truncate command to perform truncations rather than info commands when possible.

In the Enterprise Edition, truncation is durable and preserves record deletions through a cold-restart.
In the Community Edition, similar to record deletes, records in previously truncated sets are not durable and deletes can return through a cold-start.

Refer to truncate command at Info Command Reference - Truncate and also the truncate-namespace command at Info Command Reference - Truncate Namespace.

Storage

Aerospike has benchmarked many SSDs with the open-source Aerospike Certification Tool (ACT). ACT validates drives under realistic production conditions. Although some SSDs perform well for a short time, Aerospike has discovered that some may experience issues only after many hours of use. In order to pass Aerospike's strict ACT requirements, an SSD must show excellent performance over an extended period of time. For more information, refer to the flash/SSD certification guide.

Does Aerospike require the use of the TRIM command for flash/SSDs?

No. When an Aerospike namespace is configured to use the filesystem, the filesystem takes care of block management. When an Aerospike namespace is configured to use a raw flash device (SSD), Aerospike controls the device directly. Due to the difference in how they operate, Aerospike has optimized the use of SSDs as a NAND device. These optimizations include functionality similar to the TRIM command. The effect is that the performance is improved and garbage collection gets distributed, while also getting much improved longevity from the drives.

Can I store data in RAM?

Yes. Although Aerospike database makes optimal use of flash storage (SSDs), it can also use RAM and Intel Optane™ Persistent Memory as storage devices. Within the same cluster, you can configure one namespace to store its data in RAM and another namespace to store its data on SSD.

Can I store data on hard disk rather than SSD?

Storing data on a hard disk is not supported. Aerospike database uses many SSD optimizations to achieve predictable low latency. The physical limitations of rotational disks add an unpredictable and unacceptable amount of latency.

How do I calculate the amount of space needed in RAM and/or flash (SSD)?

If you want an exact algorithm for calculating the amount of space needed, refer to the Capacity Planning. We suggest Aerospike customers to contact support regarding capacity planning.

How does Aerospike handle reclamation of space?

Aerospike was designed from the start as an enterprise-class, distributed database. Its architecture focuses on using flash drives (SSDs) as primary data storage, without heavy dependence on RAM for performance. Instead of appending writes to large log files and deferring compaction to a CPU and disk I/O intensive operation, Aerospike uses SSD optimized raw-device block writes and lightweight defragmentation. This leads to greater predictability and reliability, without experiencing long latencies often seen in LSM-tree databases.
Storage is split into two areas: the primary index, which is stored either in RAM, Intel Optane™ persistent memory, or on flash, and data, typically configured to be stored on SSDs. When a new record is written to a node, a metadata entry is made in the primary index and its data is streamed in blocks to the SSD. The metadata entry points to the exact device and r-block offset from where the record data is stored contiguously. This facilitates low latency, concurrent reads.
When the record is updated, its primary index entry is updated to point to a different block on the SSD, where the new record data is persisted. If the record is deleted the metadata entry is removed. Since Aerospike does not use a filesystem to store records, it doesn’t need to compact data files. Rather, the defragmentation process continuously reclaims space in small increments, without causing latency spikes.
A separate process traverses the primary index and removes metadata that has aged beyond its configurable time-to-live (TTL). Subsequently defragmentation reclaims storage blocks with no index metadata pointing to them.

How do deletes work?

A standard delete operation (AKA expunge) only removes the record primary index metadata entry. The record data is reclaimed asynchronously from the namespace data storage (typically SSD) through a separate defragmentation process. Defragmented write blocks are later overwritten with new records. Since expunge only removes a 64 byte metadata entry from the index (usually stored in RAM) the approach is fast and optimal for flash devices, which have limited write I/O capacity compared to read I/O.
The alternative durable delete operation writes a new metadata entry to the primary index pointing to a minimal disk storage marker called a tombstone. Tombstones prevent situations where a cold restart of the node may recover previously deleted records based on their yet to be defragmented stale version on SSD.

Transactions

A transaction is an encapsulation of several commands isolated from commands outside the transaction and executed atomically. Transaction atomicity ensures that all the records changed in the transaction roll forward on a successful commit, otherwise they roll back together.

Requirements for using transactions in Aerospike

Aerospike Database 8.0 and later
The namespace needs to be configured in strong consistency mode
Requires the asdb-strong-consistency feature-key to be enabled in the feature-key file
Use a client version that is fully compatible with Aerospike Database 8.0

How do you configure transactions?

There is no required configuration, and only one optional configuration parameter mrt-duration. The other dynamic configuration parameter disable-mrt-writes is only used to temporarily or permanently stop new transactions from happening.

How do transactions affect read and write commands in Database 8.0?

Read and write commands are not affected when transactions aren't used.
Transactions lock records against writes, so a write command outside the transaction will get rejected with an error code 120 AS_ERR_MRT_BLOCKED.Transactions tend to be short-lived, so a retry of the command will likely avoid running into a lock on a subsequent attempt.
Reads outside the transaction see the original version of the record, until the transaction is committed.

Is the transaction record lock only done in the index or does it also require IO to data storage?

It requires IO. The lock is signified by the existence of a provisional version of the record. This coexists with the original version of the record. When the transaction is completed successfully the original record is discarded. If it is rolled back, the provisional record is discarded. For this reason locked records are persisted to device.

What tracks the reads and writes of a transaction?

The client tracks all the records that are read by the transaction, and upon commit verifies that their generation did not change. If a generation mismatch happens, the transaction fails with an error code 121 AS_ERR_MRT_VERSION_MISMATCH. The application can determine whether it should retry the transaction from the top.

The monitor keeps track of the records locked and written by a transaction. In the event that the client goes away without fully committing or aborting the transaction, the monitor steps in after the transaction timeout deadline to prevent it from dangling.

What happens if the client abandons a transaction?

The monitor waits for the duration of the transaction timeout, as requested by the developer or set by the mrt-duration configuration parameter, plus a safety interval (27s for clock skew) before it takes over and cleans up an abandoned transaction. When the monitor takes over, it assumes the client is gone and will not come back. If the client somehow does come back, any attempt it makes to continue the transaction with a read or write is rejected with an error code 122 AS_ERR_MRT_EXPIRED on a read/write attempt.

In a normal situation, the client sets a commit flag in the monitor record before it begins rolling provisional records forward. If the client went away before finishing to roll records forward, the monitor respects this flag and completes the job. An error code 124 AS_ERR_MRT_COMMITED is returned if a client tries to re-commit a transaction it had previously started to commit, and was completed by the monitor. In all other cases the monitor aborts the transaction, and rolls back the provisional changes. An error code 125 AS_ERR_MRT_ABORTED is returned if a client attempts a commit after the monitor has taken over and aborted the transaction.

What are the extra costs associated with transactions?

A monitor record tracks if any records are written, for the duration of the transaction.
A provisional record is created for each record that is written, for the duration of the transaction. These typically have an insignificant capacity impact.
Extra reads/writes:
- per-read command – one verify, which is lighter than a read as it does not read bins off the storage device; it checks the generation.
- per-write command – an extra write to add the digest to the monitor record, and one when the provisional record is committed or rolled back, similar to a record touch.
- if the transaction had any writes – one monitor record write to commit the transaction, and one durable delete removing the monitor record when done.
All deletes within a transaction must be durable deletes. As a result extra tombstones are created at the rate of transactions ended per-second and tombstone configuration, such as tomb-raider-period, should be considered.
No NSUP overhead. The only new background polling is of monitor records within the special <ERO~MRT set, which has a set index. There is no IO unless monitor records become active to finish transactions. This only occurs when clients abandon transactions.

What happens to existing transactions when the partition becomes unavailable?

Nothing surprising. Locked records remain locked until they are accessible again. A transaction involving unavailable records cannot be completed or rolled back until the records become available again. The operations will see an unavailable partition error.

What happens to existing transactions when the cluster fails?

Nothing surprising. Locked records remain locked when they come back. The monitor cleans up this transaction.

How can you block specific users from accessing the transactions feature?

Unless the namespace is configured with strong consistency (SC), no users can initiate transactions. Within an SC namespace, use disable-mrt-writes to temporarily or permanently block transactions.

To prevent specific users from initiating transactions, use RBAC to deny them write privileges to the <ERO~MRT monitor set within the namespace.

When doing a batch-write does the monitor get updated all at once?

Yes, a list of digests is added to the monitor record in one operation.

Can a transaction span multiple namespaces?

No. Transactions are restricted to the records in a single namespace.

Download

What is a hotfix?

A hotfix is a patch release of a specific server version that has no API changes and no regressions. For example, you can safely upgrade from server 5.5.0.4 to 5.5.0.25, or the latest hotfix for Database 5.5.0. It builds on the initial release of the server, applying layers of subsequent bug fixes.

How do I manually download Aerospike software?

Download the latest Aerospike Database from Aerospike Downloads. The available packages are limited to the supported database versions with the latest hotfix of each version, if available.

How do I automate database downloads?

To automate Aerospike Database downloads, use the artifact repository, which contains the latest and archived versions of the database. Do not use the URLs associated with the manual download page, as they could change.

The URLs of the artifacts do not change, and are easy to define programmatically. For example you can get

The newest version of Aerospike EE at https://download.aerospike.com/artifacts/aerospike-server-enterprise/latest/
The latest Aerospike EE 6.1.0.x release at https://download.aerospike.com/artifacts/aerospike-server-enterprise/6.1.0/
A specific release at https://download.aerospike.com/artifacts/aerospike-server-enterprise/6.0.0.8/

caution

A new package naming convention will affect download automation for new versions of the server, tools, Prometheus Exporter, C client.

Aerospike clusters are tolerant of heterogeneous database versions, so you can safely roll out hotfixes for the same database version. We recommend that you use the latest hotfix link for the database version you are deploying.

What is the public key for verifying Aerospike packages?

Aerospike Public Key download location: https://download.aerospike.com/artifacts/aerospike_public_key.asc

How do I validate an Aerospike Database package?

Here is the current list of GPG Signed packages.

Product	Signed
Enterprise and Community Database	5.7.0.30+, 6.0.0.14+, 6.1.0.12+, 6.2.0.6+, 6.3.0.0+, 6.4.0.0+, 7.0.0.0+
Federal Database	6.0.0.13+, 6.1.0.11+, 6.2.0.6+, 6.3.0.0+, 6.4.0.0+, 7.0.0.0+
Tools	8.2.0+
Prometheus Exporter	1.10.0
Aerospike Shared-Memory Tool	1.2.2+

The following steps validate the Database package. You can use the same steps to validate other Aerospike packages.

Verify checksums

Aerospike software packages are checksummed using the SHA-256 cryptographic hash function. You can verify the checksum using shasum, openssl, or certutil, for example:

shasum --check  aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256
aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz: OK

Using openssl:

openssl dgst -sha256 aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz
SHA256(aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz)= 3a1ea17a531e1cd8c4fc4838516164463b4b0ab67325b60ae76efd41a4b04797
# check if this matches the SHA256 checksum file
cat aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256
3a1ea17a531e1cd8c4fc4838516164463b4b0ab67325b60ae76efd41a4b04797  aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz

On Windows:

certutil -hashfile aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz sha256

Verify signatures

Import the Aerospike public key that you previously saved.

gpg --import aerospike_public_key.asc

Verify the signature of the checksum file using the public key.

gpg --verify aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256.asc aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256
gpg: Signature made Mon Mar 21 18:32:44 2022 PDT using RSA key F0E177055ABD1099
gpg: Good signature from "Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>" [unknown]
gpg: WARNING: This key is not certified with a trusted signature! There is no indication that the signature belongs to the owner.
Primary key fingerprint: 0504 6BC9 2786 D0DC FA11  F519 F0E1 7705 5ABD 1099

Authorize the key

The warning is expected since you have not been certified with the public key that Aerospike provided. To certify the key, perform the following:

gpg --list-keys
---------------------------------
pub   rsa4096 2021-11-30 [SC]
     05046BC92786D0DCFA11F519F0E177055ABD1099
uid   [ unknown] Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>

Edit the keys

gpg --edit-key 05046BC92786D0DCFA11F519F0E177055ABD1099
gpg (GnuPG) 2.3.4; Copyright (C) 2021 Free Software Foundation, Inc.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.

pub  rsa4096/F0E177055ABD1099
     created: 2021-11-30  expires: never       usage: SC  
     trust: unknown       validity: unknown
[ unknown] (1). Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>

gpg> trust
pub  rsa4096/F0E177055ABD1099
     created: 2021-11-30  expires: never       usage: SC  
     trust: unknown       validity: unknown
[ unknown] (1). Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>

Decide how far you trust this user to correctly verify other users' keys
(by looking at passports, checking fingerprints from different sources, or other information).

1 = I don't know or won't say
2 = I do NOT trust
3 = I trust marginally
4 = I trust fully
5 = I trust ultimately
m = back to the main menu

Your decision? 5

pub  rsa4096/F0E177055ABD1099
     created: 2021-11-30  expires: never       usage: SC  
     trust: ultimate      validity: ultimate
[ultimate] (1). Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>

gpg> save

Verify the key again to make sure the key is certified

gpg --verify aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256.asc aerospike-server-enterprise-5.7.0.12-ubuntu20.04.tgz.sha256
gpg: Signature made Mon Mar 21 18:32:44 2022 PDT using RSA key F0E177055ABD1099
gpg: Good signature from "Aerospike (Key Pair for signing artifact signature sidecars.) <artifact-sig@aerospike.com>" [ultimate]

Now, you received a valid signature from Aerospike.

Verify signature on RPM package (Redhat base Linux)

Aerospike uses GPG to sign server and tools binaries. Use these steps to verify the signature of the downloaded binaries.

Run:

sudo rpm --import aerospike_public_key.asc

rpm --checksig  <*.rpm>

Expected output:

digests signatures OK

Example:

$ rpm --checksig aerospike-server-enterprise-6.4.0.1-1.el8.x86_64.rpm 

aerospike-server-enterprise-6.4.0.1-1.el8.x86_64.rpm: digests signatures OK

Verify signature on DEB package (Debian, ubuntu linux)

Run:

dpkg-sig --verify <*.deb>

Expect output:

GOODSIG _gpgbuilder

Example:

$ dpkg-sig --verify aerospike-server-enterprise_6.4.0.1-1debian12_arm64.deb 

Processing aerospike-server-enterprise_6.4.0.1-1debian12_arm64.deb...

GOODSIG _gpgbuilder 05046BC92786D0DCFA11F519F0E177055ABD1099 1692152324

Data migration and synchronization

How long does a migration take?

Migrations vary depending on factors such as available network bandwidth, drive speed and the amount of data on each Aerospike cluster node. The rate of migration can be configured, as discussed in What is Migration?.

If I am synchronizing 2 datacenters using the Aerospike XDR product (available in the Enterprise Edition), what happens if the network connection is severed? Will I lose data or will it be stored somewhere?

You will not lose data. Aerospike XDR does not store a record's data for shipment to a remote destination. It records digests and Last Update Times(LUTs). It also persists records Last Ship Time (LST). In case of recovery from a failure, such as a network outage, XDR uses the LST to determine in a record needs to be shipped.

If I am synchronizing two data centers using Aerospike Enterprise Edition (EE) Cross Datacenter Replication (XDR), what happens if the network connection is severed?

You will not lose data. XDR continues operating as soon as the connection is reestablished. It may switch to a recovery process that compares the record last update time (LUT) with the partition last ship time (LST) to the remote destination, to determine if the record needs to be shipped. After the system catches up, XDR continues operating normally.

Definitions

What is Citrusleaf?

Citrusleaf is the former name of Aerospike. It was changed to Aerospike in the summer of 2012, but there are many labels that still refer to citrusleaf (cl).

What is the definition of [Bin, Cluster, Namespace, Record, Set, Digest]?

See the Glossary.

What is a Distributed Hash Table or DHT

The Distributed Hash Table is the accumulated information on where data is stored within the cluster. It includes the indexes that are distributed throughout the cluster and how they map to partitions (see “partition” and “partition map” below) and nodes.

What is the Generation Number?

When a record is written to a cluster, part of the metadata associated with the record is the generation. It is incremented each time the record is altered. The generation resolves any conflicts that may occur if two records have different values.

What is replication factor?

Every Aerospike namespace is configured with a replication factor (RF) that determines the number of copies of the data. The number of copies is referred to as the “replication factor.” A replication factor of 1 (RF1) means that the data is not replicated and does not have a hot backup. Most Aerospike customers use a replication factor of 2 (RF2).

What is the range of values allowed for the replication factor?

The minimum replication factor is 1 (no replication). The upper limit is the number of nodes (a copy of the data in each node).

What is a Partition?

“Partitions” are buckets of records that have been grouped together for the data distribution purpose.
In order to evenly distribute data between the Aerospike cluster nodes, all data is mapped to one of 4096 partitions, based on the digest of a RIPEMD-160 based hashing algorithm. In turn, each partition is mapped to a node in the cluster based on an agreed-upon partition map. Whenever the number of nodes in the cluster changes, the partitions get remapped and transferred to the appropriate node in a process called “migration.”

What is migration?

During normal operation, an Aerospike cluster has a number of copies of the data, as defined by the replication factor. This data is evenly distributed across the cluster in the master and replica partitions. If a node goes down, some portion of the data is no longer at the full replication factor. The Aerospike cluster responds by automatically promoting replica partitions with one fewer copy to master partitions. It is also creates empty replica partitions on other cluster nodes. It re-establishes a full replication factor by copying data between nodes to fill empty replicas. This data motion is referred to as a “fill migration.” A different migration process focuses on rebalancing the data by moving partitions around the newly-formed cluster.
There are configuration settings for how many threads to use for migration, how many partitions should be migrating in and out of the node at the same time, as well as, a setting delay fill migrations in the event of a planned rolling upgrade.

What is a set?

An Aerospike “set” is similar to a table in a relational database. One of the big differences is that with Aerospike, you do not need to predefine a schema. You may add bins (or columns) to one record in a set without needing to add them to any other record in the set.

What is CITRUSLEAF_EPOCH Time?

Aerospike compacts dates by subtracting out the CITRUSLEAF_EPOCH time. The epoch is taken as the second before 12:00:01 am, January 1, 2010 GMT. Time in the Citrusleaf epoch can be calculated by the following formula:

Current time - 1262304000 = Time in CITRUSLEAF_EPOCH

#define CITRUSLEAF_EPOCH 1262304000
struct timespec ts;
clock_gettime(CLOCK_REALTIME, &ts);
return ( ts.tv_sec - CITRUSLEAF_EPOCH );

What does the expiration date of a record returned from a query callback mean?

The expiration date field returned from a query is the actual expiry time (Citrusleaf epoch time) for that record, which is computed from the record TTL value and the time the record was written to the database. It is not mandatory for a record to have a TTL.
It is possible to configure a default TTL for a given namespace, any TTL set by a client supersedes the namespace default.

What is a write-master?

When data is written to the cluster, it is first be written to the master node for that record. This is referred to as a “write-master.” Statistics on these writes are available and are different than writes to a node for a replica copy. These replica writes are known as a “write-prole” (see “write-prole” below). Write-masters are used to determine how many unique records have been written to the cluster.

What is the difference between a write-master and a write-prole?

After a record has been written to the master for that record (see “write-master” above), there are subsequent writes to the nodes that have the replica. These are known as “write-proles”. If the replication factor has been set at 1, there are no write-proles.

General information​

Is Aerospike based on an Open Source product?​

Do I need a trial key for Aerospike Database?​

Can I use Community Edition (CE) and Enterprise Edition (EE) at the same time?​

What is the upgrade path from Aerospike CE to EE? Is downtime required during the upgrade?​

How is unique data counted?​

Hardware​

What hardware does Aerospike support?​

Which versions of ARM does Aerospike Database support?​

How do I load balance across the nodes in a cluster?​

Can I have mixed hardware configurations for nodes?​

After a power outage, should all the nodes be restarted at the same time?​

When a node goes down, how do you reconfigure the system to reroute traffic?​

Database​

Map key restrictions​

In what programming language is Aerospike written?​

What programming languages does Aerospike support?​

Can I run multiple instances of the Aerospike server on one machine?​

How do I decide how to separate data into namespaces and sets?​

How do I define a schema in Aerospike?​

What query language do you support?​

How does Aerospike distribute data/traffic?​

Do you support batch gets and puts?​

How do I find an API call to get the key for a given row?​

What is Aerospike Available Percent?​

If I need to make a configuration change, do I have to restart the server?​

If a node goes down during a read or write, what happens?​

How do I back up the database? Will this impact performance?​

Is there a way to delete all content from a namespace?​

Storage​

Have you tested any SSDs? Which ones do you recommend?​

Does Aerospike require the use of the TRIM command for flash/SSDs?​

Can I store data in RAM?​

Can I store data on hard disk rather than SSD?​

How do I calculate the amount of space needed in RAM and/or flash (SSD)?​

How does Aerospike handle reclamation of space?​

How do deletes work?​

Transactions​

Requirements for using transactions in Aerospike​

How do you configure transactions?​

How do transactions affect read and write commands in Database 8.0?​

Is the transaction record lock only done in the index or does it also require IO to data storage?​

What tracks the reads and writes of a transaction?​

What happens if the client abandons a transaction?​

What are the extra costs associated with transactions?​

What happens to existing transactions when the partition becomes unavailable?​

What happens to existing transactions when the cluster fails?​

How can you block specific users from accessing the transactions feature?​

When doing a batch-write does the monitor get updated all at once?​

Can a transaction span multiple namespaces?​

Download​

What is a hotfix?​

How do I manually download Aerospike software?​

How do I automate database downloads?​

What is the public key for verifying Aerospike packages?​

How do I validate an Aerospike Database package?​

Verify signature on RPM package (Redhat base Linux)​

Verify signature on DEB package (Debian, ubuntu linux)​

Data migration and synchronization​

How long does a migration take?​

If I am synchronizing 2 datacenters using the Aerospike XDR product (available in the Enterprise Edition), what happens if the network connection is severed? Will I lose data or will it be stored somewhere?​

If I am synchronizing two data centers using Aerospike Enterprise Edition (EE) Cross Datacenter Replication (XDR), what happens if the network connection is severed?​

Definitions​

What is Citrusleaf?​

What is the definition of [Bin, Cluster, Namespace, Record, Set, Digest]?​

What is a Distributed Hash Table or DHT​

What is the Generation Number?​

What is replication factor?​

What is the range of values allowed for the replication factor?​

What is a Partition?​

What is migration?​

What is a set?​

What is CITRUSLEAF_EPOCH Time?​

What does the expiration date of a record returned from a query callback mean?​

What is a write-master?​

What is the difference between a write-master and a write-prole?​