SQLite FTS5 performance on 2012 HDD-only server

I'm developing an app that uses SQLite's FTS5 module for search, and I want to know in advance what I can expect from it. Unfortunately, there are hardly any articles on the web that explicitly test the speed of FTS5 INSERT/SELECT queries (besides LLM-generated articles full of nonsense, but with good SEO), so this blog entry is all about that.

Hardware

Dedicated server straight from 2012, running on Debian 13.1:

CPU: Intel Xeon E3-1220 V2
RAM: 32GiB DDR3 1600 MHz
- 4 x Micron 18KSF1G72AZ-1G6P1 (8 GiB)
Storage: 7.3TB, RAID 0 (306 iops)
- 2 x Toshiba MC04ACA400E (3.6T)

Why use an HDD-only system from 2012 in 2025? That's a good question, moving on. ¹

Database

I wanted to test the FTS5 engine with a relatively small amount of data, so I took the Hacker News dataset (all posts from HN until 2024) and inserted it into one table. The numbers are:

28GB in size (unicode61, porter)
74GB in size (trigram)
41'400'000 rows, 4 columns (by, text, title, url)

What are Unicode61, Porter and Trigram?

Those are the tokenizers that ship by default with the SQLite3 FTS5 engine.

Porter is a wrapper around Unicode61 that adds stemming, so a search term like "correction" also matches related words such as "corrected" or "correcting". Trigram extends FTS5 to support general substring matching instead of the usual token matching, which is good for fuzzy search, at the cost of a database about 2.6 times the size (74GB vs 28GB here).

It's worth saying that the default tokenizers are optimized mostly for English – good luck searching anything that requires word segmentation support. If you want to support CJK, then use something like better-trigram. Testing this is out of scope for this blog entry.
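
To make the difference between the three tokenizers concrete, here is a minimal sketch using Python's sqlite3 module. The table names, column names and sample rows are made up for illustration and are not the schema used for the HN dataset. The trigram table is only created when the bundled SQLite is 3.34.0 or newer, since that is the release that added the tokenizer.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table names mapped to FTS5 tokenize= strings.
TOKENIZERS = {
    "hn_unicode61": "unicode61",
    "hn_porter": "porter unicode61",  # porter wraps unicode61
}
# The trigram tokenizer was added in SQLite 3.34.0.
if sqlite3.sqlite_version_info >= (3, 34, 0):
    TOKENIZERS["hn_trigram"] = "trigram"

for table, tokenizer in TOKENIZERS.items():
    conn.execute(
        f"CREATE VIRTUAL TABLE {table} USING fts5(title, text, tokenize='{tokenizer}')"
    )
    conn.executemany(
        f"INSERT INTO {table}(title, text) VALUES (?, ?)",
        [("a", "I corrected the typo"), ("b", "a minor correction")],
    )


def count_matches(table, query):
    """Return how many rows of `table` match the FTS5 `query`."""
    sql = f"SELECT count(*) FROM {table} WHERE {table} MATCH ?"
    return conn.execute(sql, (query,)).fetchone()[0]


for table in TOKENIZERS:
    # "correction" is a whole word, "rrect" is only a substring.
    print(table, count_matches(table, "correction"), count_matches(table, "rrect"))
```

On this toy data Unicode61 finds "correction" only in the row that contains that exact word, Porter finds both rows because both words stem to the same root, and Trigram is the only one that finds anything for the substring "rrect".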

SELECT queries performance with Unicode61 tokenizer

Let's start with a realistic scenario and then we will bench it for real.

I want to search for 'test' everywhere: in the user's nickname, the text, the URL and the title. Also, I don't want to limit myself with petty LIMIT clauses. Let's see:

SELECT * FROM hn('test');

That's 385345 results, a pretty good amount to display to a user on a single page. How long did that take?

real    5m51.913s

Come to think of it, maybe displaying 385345 results is suboptimal. Let's add a LIMIT after all:

SELECT * FROM hn('test') LIMIT 50;

real    0m0.027s
user    0m0.019s
sys     0m0.008s
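
For readers who would rather time this from Python than with the shell's time command, here is a rough sketch. It assumes a tiny in-memory table with made-up rows, so the absolute numbers say nothing about the real 28GB database. It does show that hn('test') is shorthand for WHERE hn MATCH 'test', and that LIMIT lets SQLite stop after the first 50 matching rows instead of materializing all of them.

```python
import sqlite3
import time

conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE hn USING fts5(title, text)")
conn.executemany(
    "INSERT INTO hn(title, text) VALUES (?, ?)",
    [(f"post {i}", "this is a test") for i in range(10_000)],
)


def timed(sql, params=()):
    """Run a query, fetch every row, and return (row_count, seconds)."""
    start = time.perf_counter()
    rows = conn.execute(sql, params).fetchall()
    return len(rows), time.perf_counter() - start


# The table-valued form and the MATCH form are two spellings of the same query.
print(timed("SELECT * FROM hn('test')"))
print(timed("SELECT * FROM hn WHERE hn MATCH ?", ("test",)))
# With LIMIT, only the first 50 matches are ever fetched.
print(timed("SELECT * FROM hn('test') LIMIT 50"))
```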

Much better! Let's also try some slightly more complicated queries:

-- This is equivalent to "Airbnb AND ruins AND everything"
SELECT * FROM hn('Airbnb ruins everything');

4 results
real    0m0.040s
user    0m0.024s
sys     0m0.016s

-- the phrase "crypto scam" must appear at the start of a column
SELECT * FROM hn('^crypto + scam');

13 results
real    0m0.114s
user    0m0.028s
sys     0m0.008s
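
The query syntax used above (implicit AND, phrases joined with +, and the ^ initial-token marker) is easier to see on a handful of rows. This is a small sketch with invented sample data and a hypothetical titles() helper, not output from the HN database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE hn USING fts5(title, text)")
conn.executemany(
    "INSERT INTO hn(title, text) VALUES (?, ?)",
    [
        ("Crypto scam exposed", "details inside"),
        ("Another crypto scam", "more details"),
        ("Airbnb ruins everything", "a rant"),
        ("Everything is fine", "Airbnb ruins nothing"),
    ],
)


def titles(query):
    """Return the titles of all rows matching the FTS5 query, in rowid order."""
    sql = "SELECT title FROM hn WHERE hn MATCH ? ORDER BY rowid"
    return [row[0] for row in conn.execute(sql, (query,))]


# Implicit AND: every term must appear somewhere in the row, in any column.
print(titles("Airbnb ruins everything"))
# Phrase anchored to the first token of a column.
print(titles("^crypto + scam"))
# The same phrase without the anchor matches anywhere.
print(titles('"crypto scam"'))
```

Note that the first query also returns "Everything is fine": the three terms are spread across its title and text columns, which is enough for an AND match.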

...you know what? That's pretty good for my case. But to find more results, we can utilize Porter and Trigram.

SELECT comparison between Unicode61, Porter and Trigram

Let's run the same kind of queries as before and also record the number of results for each tokenizer. Each query is run once, with no prior runs, so no caching is involved.

`SELECT * FROM db('query');`
Tokenizer	Results amount	Query duration (s)
Unicode61	97774 results	212.9179s
Porter	154188 results	244.6411s
Trigram	161511 results	333.1113s

`SELECT * FROM db('correction');`
Tokenizer	Results amount	Query duration (s)
Unicode61	33067 results	112.4054s
Porter	477944 results	422.8689s
Trigram	46222 results	203.8271s

`SELECT * FROM db('c pthreasd'); -- "c AND pthreasd", typo on purpose :)`
Tokenizer	Results amount	Query duration (s)
Unicode61	0 results	0.3662s
Porter	0 results	0.4749s
Trigram	0 results	31.0733s

`SELECT * FROM db('"c pthreads"');`
Tokenizer	Results amount	Query duration (s)
Unicode61	11 results	0.6669s
Porter	11 results	0.8576s
Trigram	10 results	15.5765s

`SELECT * FROM db('^somewhat longer text in here');`
Tokenizer	Results amount	Query duration (s)
Unicode61	1 result	15.0231s
Porter	1 result	16.0596s
Trigram	4 results	100.0801s

`SELECT * FROM db('NEAR(rust python, 5) performance*');`
Tokenizer	Results amount	Query duration (s)
Unicode61	985 results	5.6444s
Porter	846 results	5.8575s
Trigram	160 results	74.0494s
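
The last query combines a NEAR group with a prefix search, which is the least obvious syntax in this list. Here is a minimal sketch of how it behaves with the default tokenizer, on invented rows: NEAR(rust python, 5) requires at most 5 tokens between the two terms, and performance* matches any token starting with "performance".

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE hn USING fts5(text)")
conn.executemany(
    "INSERT INTO hn(text) VALUES (?)",
    [
        # rowid 1: terms are close together and "performance" is present
        ("rust beats python on performance",),
        # rowid 2: terms are close together but there is no performance* token
        ("rust and python are both fun",),
        # rowid 3: nine tokens separate "rust" and "python"
        ("rust is a language that many people who also write python praise for performance",),
        # rowid 4: "performances" satisfies the prefix search
        ("rust versus python performances",),
    ],
)


def matching_rowids(query):
    """Return the rowids of all rows matching the FTS5 query."""
    sql = "SELECT rowid FROM hn WHERE hn MATCH ? ORDER BY rowid"
    return [row[0] for row in conn.execute(sql, (query,))]


print(matching_rowids("NEAR(rust python, 5) performance*"))
```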

We can clearly see that Trigram struggles, but sometimes returns the most results. What if we run some queries that use Trigram's unique feature – that it can be queried by any sequence of characters, not only by complete tokens?

`SELECT * FROM hn WHERE text LIKE '%dawg%';`
Tokenizer	Results amount	Query duration (s)
Unicode61	781 results	470.1948s
Porter	781 results	381.8981s
Trigram	781 results	3.2189s

`SELECT * FROM hn WHERE title GLOB '*suit*';`
Tokenizer	Results amount	Query duration (s)
Unicode61	9248 results	562.7608s
Porter	9248 results	538.4104s
Trigram	9248 results	38.7541s

So while the Trigram database is about 2.6 times the size of the Unicode61 one and quite a bit slower when matching rows by tokens, it is much faster for LIKE and GLOB queries...

`SELECT * FROM hn WHERE title GLOB '*[^1-9]';`
Tokenizer	Results amount	Query duration (s)
Unicode61	4855683 results	584.2672s
Porter	4855683 results	582.6868s
Trigram	4855683 results	1410.1309s

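But not with character-class patterns like this one – with no run of at least three literal characters for the trigram index to look up, that's a linear scan for every party involved.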
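
Here is a small sketch of the three LIKE/GLOB queries on a toy table, with made-up rows. It falls back to unicode61 when the bundled SQLite predates the trigram tokenizer (3.34.0), in which case every query is a plain scan. The returned rows are the same either way; only the plan differs, and the exact EXPLAIN QUERY PLAN text varies between SQLite versions, so it is printed rather than relied on.

```python
import sqlite3

# The trigram tokenizer was added in SQLite 3.34.0.
HAS_TRIGRAM = sqlite3.sqlite_version_info >= (3, 34, 0)
tokenizer = "trigram" if HAS_TRIGRAM else "unicode61"

conn = sqlite3.connect(":memory:")
conn.execute(f"CREATE VIRTUAL TABLE hn USING fts5(title, text, tokenize='{tokenizer}')")
conn.executemany(
    "INSERT INTO hn(title, text) VALUES (?, ?)",
    [
        ("lawsuit filed", "sup dawg"),
        ("suite life", "nothing here"),
        ("room 7", "yo dawgs"),
    ],
)


def run(sql):
    """Return (rowids, plan lines) for a SELECT rowid query."""
    plan = [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]
    rowids = [row[0] for row in conn.execute(sql)]
    return rowids, plan


QUERIES = [
    # Contains literal runs of 3+ characters, so a trigram index can help.
    "SELECT rowid FROM hn WHERE text LIKE '%dawg%' ORDER BY rowid",
    "SELECT rowid FROM hn WHERE title GLOB '*suit*' ORDER BY rowid",
    # Only a wildcard and a character class: nothing for the index to look up.
    "SELECT rowid FROM hn WHERE title GLOB '*[^1-9]' ORDER BY rowid",
]
for sql in QUERIES:
    print(sql, *run(sql), sep="\n  ")
```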

Benchmarkin'

The results below were created with a small Python script that tries various PRAGMA combinations to check how they change INSERT speed as well as SELECT performance.

There are also quite a few caveats:

The performance measurements are specific to the Python sqlite3 standard library, which might not be representative of the underlying SQLite capabilities. I also didn't bother testing other languages or libraries.
SELECTs are run once per benchmark, which means SQLite3 can't cache the response. Every benchmark creates a new SQLite3 database.

Run settings:

Runs: 3
Dataset limit: 100000

We run every benchmark 3 times, with 100K rows in the database (around 5MB for unicode61, 15MB for trigram).

Query patterns:

QUERY_PATTERNS = {
    "simple_and": ["easy simple", "database performance optimization", "for some reason really long and"],
    "phrase_search": ['"machine learning"', 'ai + shit', '"data science"'],
    "prefix_search": ["python*", "data*", "web*"],
    "initial_token": ["^python", "^data", "^web"],
    "near_operator": ["NEAR(rust sqlite, 5)", "NEAR(react htmx, 3)", "NEAR(ml shit, 2)"],
    "column_filter": ["title: python", "text: database", "by: dawg"],
    "prefix_and_initial_token": ["^python*"],
    "complex": ["NEAR(rust python, 5) performance*", "^performance NEAR(rust python, 3)"]
}

These patterns are used to calculate the median SELECT timing.
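
The benchmark script itself is not shown here, so the following is only a sketch of how a median SELECT timing could be computed from a list of patterns. The median_query_ms() helper, the toy table and the shortened pattern list are all assumptions made for illustration.

```python
import sqlite3
import statistics
import time


def median_query_ms(conn, table, patterns):
    """Run each FTS5 pattern once and return the median wall time in ms."""
    timings = []
    for query in patterns:
        start = time.perf_counter()
        conn.execute(
            f"SELECT * FROM {table} WHERE {table} MATCH ?", (query,)
        ).fetchall()
        timings.append((time.perf_counter() - start) * 1000)
    return statistics.median(timings)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE VIRTUAL TABLE hn USING fts5(title, text)")
conn.executemany(
    "INSERT INTO hn(title, text) VALUES (?, ?)",
    [
        ("python tips", "machine learning with python"),
        ("web dev", "data science for the web"),
    ],
)

# A flattened subset of the pattern groups, one query per group.
patterns = ["python*", "^python", '"machine learning"', "title: python"]
print(f"{median_query_ms(conn, 'hn', patterns):.3f} ms")
```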

Sorting

Tables are sorted by Insert Time (s) – this is how much time was required to insert the whole dataset.

Maximum Performance Combinations

FTS5 table config: {'prefix': '2 3'}
FTS5 runtime config: {'pgsz': 60000, 'automerge': 16, 'crisismerge': 8}
Tokenizer used: porter
Transaction strategy: batch_500_commit_5000
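
As a rough sketch, this configuration could be applied like so. The table config goes into CREATE VIRTUAL TABLE, and the runtime options are set through FTS5's special INSERT INTO t(t, rank) commands. Reading batch_500_commit_5000 as "executemany() in batches of 500 rows, COMMIT every 5000 rows" is my assumption, and the insert_benchmark() helper and column names are hypothetical.

```python
import os
import sqlite3
import tempfile
import time

FTS5_RUNTIME_CONFIG = {"pgsz": 60000, "automerge": 16, "crisismerge": 8}


def insert_benchmark(rows, batch_size=500, commit_every=5000):
    """Insert (title, text) rows into a fresh FTS5 table.

    Returns (row_count, seconds spent inserting).
    """
    with tempfile.TemporaryDirectory() as tmp:
        conn = sqlite3.connect(os.path.join(tmp, "bench.db"))
        conn.execute(
            "CREATE VIRTUAL TABLE hn USING fts5("
            "title, text, tokenize='porter', prefix='2 3')"
        )
        # Runtime options are written with FTS5's special INSERT commands.
        for key, value in FTS5_RUNTIME_CONFIG.items():
            conn.execute("INSERT INTO hn(hn, rank) VALUES (?, ?)", (key, value))
        conn.commit()

        start = time.perf_counter()
        pending = 0
        for i in range(0, len(rows), batch_size):
            batch = rows[i:i + batch_size]
            conn.executemany("INSERT INTO hn(title, text) VALUES (?, ?)", batch)
            pending += len(batch)
            if pending >= commit_every:
                conn.commit()
                pending = 0
        conn.commit()  # flush whatever is left over
        elapsed = time.perf_counter() - start

        count = conn.execute("SELECT count(*) FROM hn").fetchone()[0]
        conn.close()
    return count, elapsed


sample = [(f"title {i}", "some text body") for i in range(12_000)]
count, elapsed = insert_benchmark(sample)
print(f"{count} rows in {elapsed:.2f}s")
```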

Config	Insert Time (s)	Insert rows/s (Mdn/P99/P95)	StdDev	Query Median (ms)	Safety
{'mmap_size': 28, 'page_size': 16384, 'synchronous': 'OFF', 'journal_mode': 'OFF', 'temp_store': 'FILE', 'threads': 1, 'cache_size': -8192, 'secure_delete': 'OFF', 'locking_mode': 'NORMAL'}	19.63	5094/5105/5104	±0.05	1.21	UNSAFE
{'mmap_size': 28, 'page_size': 16384, 'synchronous': 'NORMAL', 'journal_mode': 'WAL', 'temp_store': 'FILE', 'threads': 1, 'cache_size': -8192, 'secure_delete': 'OFF', 'locking_mode': 'NORMAL'}	22.14	4517/4525/4525	±0.06	1.55	SAFE
{'synchronous': 'EXTRA', 'cell_size_check': 'ON', 'foreign_keys': 'ON', 'mmap_size': 0, 'journal_mode': 'WAL', 'page_size': 16384, 'temp_store': 'FILE', 'cache_size': -8192, 'threads': 1}	24.61	4063/4070/4070	±0.08	1.31	EXTRA SAFE
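
For reference, here is a minimal sketch of applying one of these PRAGMA dicts from Python. It uses a subset of the SAFE row, and the open_with_pragmas() and read_pragma() helpers are hypothetical, not taken from the benchmark script. PRAGMA values cannot be bound as query parameters, so they are formatted into the statement, and page_size is applied first because it only takes effect before the database file is initialized.

```python
import os
import sqlite3
import tempfile

# Subset of the SAFE configuration; dict order is the order of application.
SAFE_PRAGMAS = {
    "page_size": 16384,       # must come before anything is written
    "journal_mode": "WAL",
    "synchronous": "NORMAL",
    "temp_store": "FILE",
    "cache_size": -8192,      # negative values are in KiB, so about 8 MiB
    "secure_delete": "OFF",
    "locking_mode": "NORMAL",
}


def open_with_pragmas(path, pragmas):
    """Open `path` and apply each PRAGMA in dict order; return the connection."""
    conn = sqlite3.connect(path)
    for name, value in pragmas.items():
        # PRAGMA arguments cannot be bound as parameters.
        conn.execute(f"PRAGMA {name} = {value}")
    return conn


def read_pragma(conn, name):
    """Return the current value of a PRAGMA as reported by SQLite."""
    return conn.execute(f"PRAGMA {name}").fetchone()[0]


with tempfile.TemporaryDirectory() as tmp:
    conn = open_with_pragmas(os.path.join(tmp, "bench.db"), SAFE_PRAGMAS)
    settings = {name: read_pragma(conn, name) for name in SAFE_PRAGMAS}
    conn.close()

print(settings)
```

Reading the values back is a cheap sanity check, since SQLite silently ignores PRAGMAs it cannot apply.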