IVF Vector Query with Rank Options

Overview

When performing vector similarity search using IVF (Inverted File) indexes, MatrixOne provides the BY RANK WITH OPTION clause to control how the rank function is applied during query execution. This feature allows you to optimize query performance and accuracy based on your specific use case.

Syntax

SELECT column_list
FROM table_name
ORDER BY distance_function(vector_column, query_vector) ASC
LIMIT k
BY RANK WITH OPTION 'mode = <mode>';

Parameters

Parameter	Description
`mode`	Controls when the rank function is applied. Valid values: `pre`, `force`, `post`

Mode Options

`mode = pre` (Pre-ranking Mode)

In pre-ranking mode, the IVF index is used to filter candidate vectors before applying the rank function. This mode:

Behavior: First uses IVF index to select candidate centroids, then applies ranking within those candidates
Performance: Fastest execution, as ranking is applied only to a subset of data
Accuracy: May miss some relevant results if they are in centroids not selected by the probe limit

Use Case: High-throughput scenarios where query speed is prioritized over perfect recall.

-- Example: Pre-ranking mode
SELECT id, content, l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') AS distance
FROM documents
ORDER BY l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') ASC
LIMIT 10
BY RANK WITH OPTION 'mode = pre';

`mode = force` (Force Mode)

In force mode, the system forces the use of the IVF index for ranking, regardless of the query optimizer's default decision. This mode:

Behavior: Explicitly enforces index usage and applies a strict ranking strategy
Performance: Optimized for scenarios where index usage is guaranteed to be beneficial
Accuracy: Balanced between speed and accuracy based on probe limit settings

Use Case: When you know the IVF index is well-suited for your query pattern and want to ensure it is used.

-- Example: Force mode
SELECT id, content, l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') AS distance
FROM documents
ORDER BY l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') ASC
LIMIT 10
BY RANK WITH OPTION 'mode = force';

`mode = post` (Post-ranking Mode)

In post-ranking mode, the rank function is applied after the IVF index has retrieved an expanded set of candidates. This mode:

Behavior: Retrieves more candidates from the IVF index, then re-ranks them using the exact distance computation
Performance: Slower than pre-ranking, but faster than full table scan
Accuracy: Higher accuracy, as the final ranking is based on exact distance calculations

Use Case: Scenarios requiring high recall where exact ranking is important.

-- Example: Post-ranking mode
SELECT id, content, l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') AS distance
FROM documents
ORDER BY l2_distance(embedding, '[1.0, 2.0, 3.0, 4.0]') ASC
LIMIT 10
BY RANK WITH OPTION 'mode = post';

Comparison of Modes

Mode	Index Usage	Ranking Strategy	Speed	Accuracy	Best For
`pre`	Uses IVF to filter first	Rank within IVF candidates	⚡ Fastest	Lower	High-throughput, latency-sensitive
`force`	Forces IVF index usage	Strict index-based ranking	🚀 Fast	Medium	Predictable index usage
`post`	Uses IVF for candidates	Re-ranks with exact distance	🐢 Slower	⭐ Highest	High-recall requirements

Complete Example

Setup

-- Enable IVF index feature
SET GLOBAL experimental_ivf_index = 1;

-- Create table with vector column
CREATE TABLE products (
    id BIGINT PRIMARY KEY,
    name VARCHAR(200),
    embedding VECF32(4)
);

-- Insert sample data
INSERT INTO products VALUES (1, 'Product A', '[0.1, 0.2, 0.3, 0.4]');
INSERT INTO products VALUES (2, 'Product B', '[0.5, 0.6, 0.7, 0.8]');
INSERT INTO products VALUES (3, 'Product C', '[0.2, 0.3, 0.4, 0.5]');
INSERT INTO products VALUES (4, 'Product D', '[0.9, 0.8, 0.7, 0.6]');
INSERT INTO products VALUES (5, 'Product E', '[0.3, 0.4, 0.5, 0.6]');

-- Create IVF index
CREATE INDEX idx_products_embedding 
USING IVFFLAT ON products(embedding) 
LISTS=2 OP_TYPE "vector_l2_ops";

-- Set probe limit for query
SET @PROBE_LIMIT = 1;

Query Examples

-- Fast approximate search (pre-ranking)
SELECT id, name, l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') AS distance
FROM products
ORDER BY l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') ASC
LIMIT 3
BY RANK WITH OPTION 'mode = pre';

-- Guaranteed index usage (force mode)
SELECT id, name, l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') AS distance
FROM products
ORDER BY l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') ASC
LIMIT 3
BY RANK WITH OPTION 'mode = force';

-- High-accuracy search (post-ranking)
SELECT id, name, l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') AS distance
FROM products
ORDER BY l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') ASC
LIMIT 3
BY RANK WITH OPTION 'mode = post';

Best Practices

Choosing the Right Mode

For real-time applications (e.g., search-as-you-type): Use mode = pre
For batch processing where accuracy matters: Use mode = post
When query optimizer makes suboptimal decisions: Use mode = force

Combining with Probe Limit

The @PROBE_LIMIT variable controls how many centroids are scanned. Combine it with rank options for optimal results:

-- High accuracy configuration
SET @PROBE_LIMIT = 10;

SELECT id, name, l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') AS distance
FROM products
ORDER BY l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') ASC
LIMIT 5
BY RANK WITH OPTION 'mode = post';

-- High speed configuration
SET @PROBE_LIMIT = 1;

SELECT id, name, l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') AS distance
FROM products
ORDER BY l2_distance(embedding, '[0.1, 0.2, 0.3, 0.4]') ASC
LIMIT 5
BY RANK WITH OPTION 'mode = pre';

Performance Tuning

Start with mode = pre for most use cases
Monitor recall rate and switch to mode = post if accuracy is insufficient
Use mode = force when you observe the query optimizer not using the index

Limitations

BY RANK WITH OPTION only works with IVF indexed vector columns
Currently supports only l2_distance metric
The mode option must be specified as a string literal