Secondary index
Jump to the Code block for a combined complete example.
Basic SI queries can employ the following index filters:
- Equality comparison against string or numeric indexes.
- Range comparison against numeric indexes. Range result sets are inclusive of lower and upper limits.
- Point-In-Region or Region-Contain-Point comparisons against geo indexes.
Filter Expressions can also be used with secondary index queries.
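To make the inclusive range semantics concrete, here is a minimal pure-Python sketch that models the equality and range filters over an in-memory list. It does not call Aerospike. The helper names `matches_equal` and `matches_between` and the sample values are hypothetical, chosen only for illustration.

```python
# Pure-Python model of the index filter semantics (not the Aerospike API).
# Helper names and sample values are hypothetical.

def matches_equal(value, target):
    """Equality filter: the indexed value must equal the target."""
    return value == target


def matches_between(value, lower, upper):
    """Range filter: both the lower and the upper limit are included."""
    return lower <= value <= upper


occurred_values = [20201231, 20210101, 20210615, 20211231, 20220101]

# Values equal to either limit are part of the result set
in_range = [v for v in occurred_values if matches_between(v, 20210101, 20211231)]
print(in_range)
```

Values equal to the lower or upper limit are kept, which mirrors the inclusive behavior described in the list above.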
Policies
See Basic Queries for query policy information.
Setup
The following examples will use the setup and record structure below to illustrate secondary index queries in an Aerospike database.
import aerospike
from aerospike_helpers import expressions as exp
from aerospike_helpers.operations import operations, map_operations
from aerospike import predicates as p
from aerospike import GeoJSON

# Define host configuration
config = {
    'hosts': [ ('127.0.0.1', 3000) ]
}
# Establishes a connection to the server
client = aerospike.client(config).connect()
The record structure:
Occurred: Integer
Reported: Integer
Posted: Integer
Report: Map
{
    shape: List,
    summary: String,
    city: String,
    state: String,
    duration: String
}
Location: GeoJSON
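As a rough illustration of that structure, one record's bins could look like the Python dictionary below. Every value is invented for this sketch, and the lowercase bin names are an assumption based on the names used in the query examples on this page (`occurred`, `report`, `location`).

```python
import json

# Hypothetical sample record; all values are invented for illustration.
# Lowercase bin names are an assumption based on the query examples.
sample_bins = {
    'occurred': 20210704,
    'reported': 20210705,
    'posted': 20210710,
    'report': {
        'shape': ['circle', 'light'],
        'summary': 'Bright object hovering over the ridge',
        'city': 'Denver',
        'state': 'CO',
        'duration': '5 minutes',
    },
    # GeoJSON point as a plain dict: coordinates are [longitude, latitude]
    'location': {'type': 'Point', 'coordinates': [-104.99, 39.74]},
}

print(json.dumps(sample_bins, indent=2))
```

The three date-like bins are plain integers, which is what allows a numeric index and a range filter to be applied to them.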
Create an index
The following command uses the Aerospike Admin (asadm) to create an integer index on the sandbox namespace, ufodata set, and occurred bin. This is the recommended way to create a secondary index.
asadm -e 'enable; manage sindex create numeric occurred_idx ns sandbox set ufodata bin occurred'
The Aerospike Client API can also be used to create a secondary index. The following example creates the same index as the example above.
# Create index
client.index_integer_create(
    'sandbox',       # namespace
    'ufodata',       # set name
    'occurred',      # bin name
    'occurred_idx',  # index name
    # optional policy
)
In this example, the IndexType is NUMERIC. Aerospike supports index types NUMERIC, STRING, GEO2DSPHERE, and BLOB as of server 7.0.
Remove an index
The following command uses the Aerospike Admin (asadm) to remove the index created above.
asadm -e 'enable; manage sindex delete occurred_idx ns sandbox set ufodata'
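When these asadm commands need to be generated from a script, for example for several bins, they can be assembled as strings. The sketch below only builds the command text in the same shape as the commands shown on this page and does not run asadm. The function names `asadm_sindex_create` and `asadm_sindex_delete` are hypothetical helpers, not part of any Aerospike tool.

```python
import shlex

# Hypothetical helpers that only format command strings; nothing is executed.

def asadm_sindex_create(index_type, index_name, ns, set_name, bin_name):
    """Build the asadm command that creates a secondary index."""
    manage = (
        f"enable; manage sindex create {index_type} {index_name} "
        f"ns {ns} set {set_name} bin {bin_name}"
    )
    # shlex.quote wraps the asadm command in single quotes for the shell
    return f"asadm -e {shlex.quote(manage)}"


def asadm_sindex_delete(index_name, ns, set_name):
    """Build the asadm command that removes a secondary index."""
    manage = f"enable; manage sindex delete {index_name} ns {ns} set {set_name}"
    return f"asadm -e {shlex.quote(manage)}"


create_cmd = asadm_sindex_create('numeric', 'occurred_idx', 'sandbox', 'ufodata', 'occurred')
delete_cmd = asadm_sindex_delete('occurred_idx', 'sandbox', 'ufodata')
print(create_cmd)
print(delete_cmd)
```

With the arguments above, the two printed strings match the create and delete commands used in this section.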
The Aerospike Client API can also be used to remove a secondary index. The following example removes the same index as the example above.
# Remove index
client.index_remove(
    'sandbox',       # namespace
    'occurred_idx',  # index name
    # optional policy
)
Query an index
The following example queries the sandbox namespace and ufodata set name, with an inclusive range filter on the occurred bin, returning records with a bin value between 20210101 and 20211231.
# Create the query
query = client.query('sandbox', 'ufodata')

# Set index filter
query.where(p.between('occurred', 20210101, 20211231))

# Create callback function
def record_set(record):
    (key, meta, bins) = record
    # Do something
    print('Key: {0} | Record: {1}'.format(key[2], bins))

# Execute the query
query.foreach(record_set)

# Close the connection to the server
client.close()
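The bounds 20210101 and 20211231 look like calendar dates encoded as YYYYMMDD integers. Assuming that is how the occurred bin is populated, the range limits can be derived from `datetime.date` values rather than typed by hand. The helper names `date_to_int` and `year_bounds` are hypothetical.

```python
from datetime import date

# Assumption: the 'occurred' bin stores dates as YYYYMMDD integers.
# Both helper names are hypothetical.

def date_to_int(d):
    """Encode a date as a YYYYMMDD integer, e.g. 2021-01-01 -> 20210101."""
    return d.year * 10000 + d.month * 100 + d.day


def year_bounds(year):
    """Return the inclusive (lower, upper) limits covering one calendar year."""
    return date_to_int(date(year, 1, 1)), date_to_int(date(year, 12, 31))


lower, upper = year_bounds(2021)
print(lower, upper)
```

The resulting pair could then be passed to `p.between('occurred', lower, upper)` in place of the literal values used in the query above.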
Query an index with a Filter Expression
The following example uses the geo region defined below.
# Create geo region
region = GeoJSON({
    'type': 'Polygon',
    'coordinates': [
        [
            [-109.061279296875, 36.97622678464096],
            [-102.01904296874999, 36.97622678464096],
            [-102.01904296874999, 41.0130657870063],
            [-109.061279296875, 41.0130657870063],
            [-109.061279296875, 36.97622678464096]
        ]
    ]
})
This example queries the same namespace and set name, while using the same index filter as the example above, but adds a Filter Expression to the query policy to only return records with a location bin value within the geo region specified above.
# Build the expression
expr = exp.CmpGeo(exp.GeoBin('location'), region).compile()

# Create the policy
query_policy = {'expressions': expr}

# Create the query
query = client.query('sandbox', 'ufodata')

# Set index filter
query.where(p.between('occurred', 20210101, 20211231))

# Create callback function
def record_set(record):
    (key, meta, bins) = record
    # Do something
    print('Key: {0} | Record: {1}'.format(key[2], bins))

# Execute the query
query.foreach(record_set, policy=query_policy)

# Close the connection to the server
client.close()
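To get an intuition for which points the geo Filter Expression would keep, the containment test can be approximated locally. The sketch below uses a planar ray-casting test on the polygon's outer ring. This is a swapped-in approximation for illustration and is not how the server evaluates geo comparisons. The function name `point_in_ring` and the two test points are hypothetical.

```python
# Planar ray-casting approximation of "point within region".
# Not the server's algorithm; function name and test points are hypothetical.

ring = [
    (-109.061279296875, 36.97622678464096),
    (-102.01904296874999, 36.97622678464096),
    (-102.01904296874999, 41.0130657870063),
    (-109.061279296875, 41.0130657870063),
    (-109.061279296875, 36.97622678464096),
]


def point_in_ring(lng, lat, ring):
    """Return True if (lng, lat) lies inside the closed ring."""
    inside = False
    # Walk consecutive vertex pairs; the ring repeats its first vertex at the end
    for (x1, y1), (x2, y2) in zip(ring, ring[1:]):
        # Only edges that straddle the point's latitude can be crossed
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            # Count crossings of a ray cast toward increasing longitude
            if lng < x_cross:
                inside = not inside
    return inside


inside_point = point_in_ring(-105.0, 39.0, ring)
outside_point = point_in_ring(-100.0, 39.0, ring)
print(inside_point, outside_point)
```

A record whose location resembles the first point would pass the filter, while one resembling the second would be dropped, subject to the planar approximation.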
Query an index then create a batch operation on the returned keys
Currently, Transaction and CDT operations are not available for basic queries. The following example shows how a basic query and a batch operation can be combined to use transaction operations on the results.
This example queries the same namespace and set name, while using the same index filter, geo region, and Filter Expression as above, but only returns record metadata, which is then used to create a batch operation to return only the city and state from the report map.
# Build the expression
expr = exp.CmpGeo(exp.GeoBin('location'), region).compile()

# Create the policy
query_policy = {'expressions': expr}

# Create the options
query_options = {'nobins': True}

# Create the query
query = client.query('sandbox', 'ufodata')

# Set index filter
query.where(p.between('occurred', 20210101, 20211231))

# Create callback function
keys = []
def record_set(record):
    (key, meta, bins) = record
    keys.append(key)

# Execute the query
query.foreach(record_set, policy=query_policy, options=query_options)

# Create map key list
mapKeys = ['city', 'state']

# Create operations
ops = [
    map_operations.map_get_by_key_list('report', mapKeys, aerospike.MAP_RETURN_VALUE),
    # This is the only way a read will work right now
    operations.touch()
]

# Get 'city' and 'state' from report map for each record
batchRecords = client.batch_operate(keys, ops)

# Access the records
for batchRecord in batchRecords.batch_records:
    record = batchRecord.record
    # Do something
    print('Record: ', record[2])

# Close the connection to the server
client.close()
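If a query returns many keys, it may be preferable to send them to the batch call in smaller groups rather than all at once. The sketch below is pure Python and only shows the splitting step. The helper name `chunked`, the chunk size, and the simplified key tuples are all hypothetical.

```python
# Hypothetical helper for splitting collected keys into smaller batches.

def chunked(items, size):
    """Yield consecutive slices of `items`, each with at most `size` entries."""
    if size < 1:
        raise ValueError('size must be at least 1')
    for start in range(0, len(items), size):
        yield items[start:start + size]


# Simplified stand-ins for the key tuples collected by the query callback
keys = [('sandbox', 'ufodata', pk) for pk in range(1, 8)]

batches = list(chunked(keys, 3))
print([len(batch) for batch in batches])
```

Each chunk could then be passed to `client.batch_operate(chunk, ops)` in a loop, in place of the single call shown above.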
Pagination
See Pagination for more information.
Code block
The following single code block executes a basic SI query.
import aerospike
from aerospike_helpers import expressions as exp
from aerospike_helpers.operations import operations, map_operations
from aerospike import predicates as p
from aerospike import GeoJSON

# Define host configuration
config = {
    'hosts': [ ('127.0.0.1', 3000) ]
}
# Establishes a connection to the server
client = aerospike.client(config).connect()

# Create geo region
region = GeoJSON({
    'type': 'Polygon',
    'coordinates': [
        [
            [-109.061279296875, 36.97622678464096],
            [-102.01904296874999, 36.97622678464096],
            [-102.01904296874999, 41.0130657870063],
            [-109.061279296875, 41.0130657870063],
            [-109.061279296875, 36.97622678464096]
        ]
    ]
})

# Build the expression
expr = exp.CmpGeo(exp.GeoBin('location'), region).compile()

# Create the policy
query_policy = {'expressions': expr}

# Create the query
query = client.query('sandbox', 'ufodata')

# Set index filter
query.where(p.between('occurred', 20210101, 20211231))

# Create callback function
def record_set(record):
    (key, meta, bins) = record
    # Do something
    print('Key: {0} | Record: {1}'.format(key[2], bins))

# Execute the query
query.foreach(record_set, policy=query_policy)

# Close the connection to the server
client.close()