Robj/indirection support #491

rtjohnso · 2022-11-29T22:56:21Z

No description provided.

…or blobs

…ssage sizes in splinter_test

- modify trunk_split_leaf to never split more than possible - tweak default test parameters - modify functionality test to use pareto-distributed message lengths

netlify · 2022-11-29T22:56:25Z

✅ Deploy Preview for splinterdb canceled.

Name	Link
🔨 Latest commit	`9db4655`
🔍 Latest deploy log	https://app.netlify.com/sites/splinterdb/deploys/6447980f1f7a760008f1207e

rtjohnso · 2022-11-30T08:16:13Z

Reviewing guide

This is a big PR, but it can be logically broken down into a few layers. Feel free to do reviews for each layer separately.

Recommended order to review files:

The `mini_allocator` layer:

mini_allocator.[hc]. The main thing is to add support for sub-page allocations, multi-page allocations, and extent sharing. It also adds support for having a different page type for each batch. It incidentally eliminates pinning support, since the pinning flag didn't actually do anything. Also reorg the mini_allocator struct to cleanly separate constant state (e.g. num_batches) from dynamic state (e.g. num_extents).
Optionally: check out routing_filter.c, where the only changes are to match the new mini_allocator APIs.
trunk.[hc] are also only minor changes, except that it also forces splits to never split more than the parent can handle.

The blob code:

blob.[hc]. Code for accessing blobs.
blob_build.[hc]. Code for building and cloning blobs. Blob building code is broken out into a separate file to avoid a circular dependency blob_build -> mini_allocator -> data_internal -> blob. If blob and blob_build were all in a single source file, this would be a cycle.
Detour to blob_test.c.

The data adapter layer:

data_internal.[hc] and data_blob_build.[hc]. Deblobify things before passing them to user-provided functions. Also various utilities for (de-)blobifying messages, merge_accumulators, etc. data_blob_build is broken out for the same circular dependency reason.

Uses of the data adapter layer to add blob support in the rest of the code:

shard_log.[hc]. Handles blobs. Also, has a function to build a blob in the log's mini_allocator. The intention here is that, when the user inserts a large value, the trunk will blobify it in the log's mini_allocator, and then insert the blob into the btree, and then insert a log record with the blob and the generation number from the btree.
btree.[hc]. Handles big values in insert and pack.
splinterdb.c. De-blobify things before returning them to the user.

The rest:

Miscellaneous other files and tests.

Fixes a bug where memtable_maybe_rotate_and_get_insert_lock would speculatively increment the memtable generation even when the next memtable was not yet ready. This would cause concurrent lookup threads to attempt to access that memtable, resulting in errors. This fix requires the insert threads to wait until the next memtable is ready before finalizing the current one.

…j/indirection-support

rtjohnso added 30 commits August 24, 2022 22:24

resume work on infrastructure for indirect keys/values

d8bff81

clean up and simplify indirect structures and iterator

4609ce8

finally have a good design

cfa2be7

indirect.[hc] compiles

dbef100

rename mini_alloc_multi

22bdb65

added needed functionality to mini_allocator

0d6b9e0

iterating on mini_allocator interface changes

ce4482e

starting on a unit test

54d608f

clean up some unit-test dependencies

a7fa0b0

fix NUM_INDIRECTION_BATCHES

be25007

bugfixes and test of indirection_clone

bc8387c

got unit tests (and all tests) passing

eac26c5

rename indirect to blob

f0a4af2

break blob into two parts to avoid circular dependency w/ mini_allocator

2e8207e

add alignment control to blob building

e31c9cf

plumb blobs through shard_log -- untested

4fca9d8

got shard_log test working (w/o new functionality)

072ba06

Merge remote-tracking branch 'origin/main' into robj/indirection-support

5879c42

fix stupid bug

e92ed83

Merge remote-tracking branch 'origin/main' into robj/indirection-support

fdc6910

fix cache_alloc/cache_get mess

94d4d05

more wiring of blobs

a9c7527

further improvement of blob page iter

85fe34b

log allocates blobs on page alignment

ddffffc

oh my god add missing files

f2b16b8

fix bug in alloc-mode blob iterator

3dc299d

fix up shard_log iterator enough for now

af00762

store cache in message

f1a86ac

adapt mini_allocator to multiple page-types, start on btree support f…

cf4ba69

…or blobs

btree inserts seem to be working

ab2e107

rtjohnso added 4 commits November 24, 2022 19:05

make btree_pack track materialized bytes, improve randomization of me…

ba6422b

…ssage sizes in splinter_test

typo

bd6cb75

Modify trunk_split_leaf to never split too much and tweaked some tests

c651ae9

- modify trunk_split_leaf to never split more than possible - tweak default test parameters - modify functionality test to use pareto-distributed message lengths

organize data-blob functions

3bb772e

rtjohnso added the dont merge label Nov 29, 2022

vmwclabot added the cla-not-required label Nov 29, 2022

rtjohnso and others added 16 commits November 30, 2022 02:16

various cleanups

17d647f

formatting

76fa1c6

Merge remote-tracking branch 'origin/main' into robj/indirection-support

7bb1570

Merge remote-tracking branch 'origin/main' into robj/indirection-support

a77c3a5

merged

8408a79

fix a bunch of bugs and finish self review

c9003ee

formatting

326f7bc

merge w/ origin/main

339d6f9

add assert

d351978

merge main

e8bab76

finish merge

38be83c

fix memtable race

81a9bb4

clang-format

5240e18

merge w/ new main

ba3a248

Merge remote-tracking branch 'origin/main' into robj/indirection-support

173ecf6

rtjohnso added the ok-to-test label Apr 25, 2023

rtjohnso added 5 commits April 24, 2023 21:49

Merge branch 'main' into robj/indirection-support

d6d951b

clang-format

ab5cd48

Merge remote-tracking branch 'origin/robj/memtable-race-fix' into rob…

6316437

…j/indirection-support

shrink some tests so they don't trigger trunk bugs

ba3c8c3

fix some debug-mode compilation issues

9db4655

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Robj/indirection support #491

Robj/indirection support #491

rtjohnso commented Nov 29, 2022

netlify bot commented Nov 29, 2022 •

edited

Loading

rtjohnso commented Nov 30, 2022 •

edited

Loading

Robj/indirection support #491

Are you sure you want to change the base?

Robj/indirection support #491

Conversation

rtjohnso commented Nov 29, 2022

netlify bot commented Nov 29, 2022 • edited Loading

✅ Deploy Preview for splinterdb canceled.

rtjohnso commented Nov 30, 2022 • edited Loading

Reviewing guide

The mini_allocator layer:

The blob code:

The data adapter layer:

Uses of the data adapter layer to add blob support in the rest of the code:

The rest:

netlify bot commented Nov 29, 2022 •

edited

Loading

rtjohnso commented Nov 30, 2022 •

edited

Loading

The `mini_allocator` layer: