notes

Storage

Always append new values to the log file. As a result this log file is immutable.
Maintain the hash table, where the value is offset in this log file.
To delete item append delete marker to the log file.
As long as you are able to store the hash table in memory - you will have tromendous performance.
Perform comprassion (removing overriden values) in background.
Pros:
- Only sequential IO
- As the workload is mostly IO bound - we could have the single thread and do not worry about concurrency.
Cons:
- Effective only if you are able to have the whole hashtable in memory.
- Range queires are not effective.

Sorted String Table - similar to log-structured storage, but stores keys in sorted order.
Pros:
- More efficient segments merge
- We do not need to store the whole hash map in memory, because we could pretty easyly find key in data file (as they are sorted, and we already have some keys in memory) and cache it for future access
Cons:
- We need to keep keys sorted:)
How to maintain keys sorted:
- We keep all the writes in some sorted structure in memory (e.g. AVL or red-black tree)
- Once memtable gets bigger than threshold we write it out to disk
- In order to serve a read request, try to find it in the memtable, then in the most recent disk segment, then the next recent and so on.
- Run merging and comprassion process in background
- To make writes durable maintaint separate log file on disk, where you add all the write operations sequentially.
Performance Optimization:
- Maintain Bloom-Filter in memory to store if the key exists in segment file or not.
Log-Structured Merge-Tree (LSM-Tree) - is data structure based on the above principles.

LSM-Tree advantages:
- Better write performance
  - Altough both LSM-Trees and B-Trees maintain WAL, in addition to this LSM-Trees have not a lot of big sequential IO operations (dump memtable to disk or merge segments). On the other side, B-Trees need to flush pages on disk (which is random IO).
- Write amplification is much less for LSM-Trees (because of only sequential IO)
- The nature of write operations is not so important for SSD disks, but write amplification is critically important due to erase by block operations.
LSM-Tree disadvantages:
- Worse read performance
  - In B-Tree keys are stored only in one place, do not need to traverse segments
- Compression events could unpredictably influence database performance
- Support only lookups by key and keys range, while B-Tree databases supports any kind of lookups.