
[hashset feature] Iterator bugfixes - handle empty hashset and change iteration order so rehashed buckets can be skipped #1147

Merged

Conversation


@SoftlyRaining SoftlyRaining commented Oct 10, 2024

Fixes 2 bugs in hashsetNext():

  • null dereference when iterating over a new, unused hashset
  • change the order of iteration so that skipping past already-rehashed buckets works correctly and won't miss elements

Minor optimization in hashsetScan and some renamed variables.


codecov bot commented Oct 10, 2024

Codecov Report

Attention: Patch coverage is 13.04348% with 20 lines in your changes missing coverage. Please review.

Project coverage is 70.33%. Comparing base (3038293) to head (60d557c).
Report is 1 commit behind head on hashset.

Files with missing lines Patch % Lines
src/hashset.c 13.04% 20 Missing ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           hashset    #1147      +/-   ##
===========================================
- Coverage    70.40%   70.33%   -0.07%     
===========================================
  Files          115      115              
  Lines        63780    63782       +2     
===========================================
- Hits         44907    44864      -43     
- Misses       18873    18918      +45     
Files with missing lines Coverage Δ
src/hashset.c 39.60% <13.04%> (-0.12%) ⬇️

... and 14 files with indirect coverage changes

Contributor

@zuiderkwast zuiderkwast left a comment


Didn't review the tests yet.

src/hashset.c
Comment on lines 1310 to 1311
if ((growing && !behind_rehash) || (!growing && !ahead_rehash)) {
/* Emit elements in smaller table */
Contributor


How about some more explicit comment to explain this condition?

Suggested change
if ((growing && !behind_rehash) || (!growing && !ahead_rehash)) {
/* Emit elements in smaller table */
/* Emit elements in smaller table, except if this table is the
* rehashing source table and the current index has already been
* rehashed, or the rehashing destination table and the current
* index has not yet been populated by rehashing. */
if ((growing && !behind_rehash) || (!growing && !ahead_rehash)) {

Can't it happen that we're shrinking and we're behind rehashing, but rehash put some elements before or at the current index?

If the big (rehashing source) table had a lot of empty buckets with 'everfull' set, I guess it can happen, because rehash doesn't follow probing chains like scan does.

If we make rehashing follow probing chains, are we safe here?

Contributor


Being able to always skip one of the tables is a very good thing, I believe, especially in things like expiry scanning. Eviction does sampling, which uses scan too. Therefore, to make sure it's safe to skip in the destination table, I think we need to make rehash follow probing chains. Then we can ensure that a key is not rehashed from the not-yet-scanned part of the source table to the already-scanned part of the destination table. Do you want to make this change to rehashStep in this PR?

The names like table_small are better than table0. I'm not sure that all the extra variables actually make it simpler though. Each variable adds a mental indirection, so if the names are not extremely clear and if they are just a trivial abstraction, then it might not be worth it.

I'm thinking that maybe instead of growing we can just use table_small == 0.

The names behind_rehash and ahead_rehash confuse me a little, because it can mean two things: Is the rehash ahead of cursor or is cursor ahead of rehash? I think already_rehashed and not_yet_rehashed are better names and I think it's enough to have one of them: already_rehashed = cursorIsLessThan(t->rehashIdx, cursor).

We don't need to mask the cursor and rehashIdx here, because cursor should already be masked with the small mask and, if we make rehashing follow probe chains, the rehashIdx will already be masked with the small mask in rehashStep too. Maybe we don't strictly have to rehash from both locations in the larger table to the same location in the smaller destination table at the same time, but if we do, we can guarantee that a cursor is either rehashed or not rehashed, never half-rehashed. Is this right?

Author


Hmm. Do you mean that rehashStep would continue until there can be no probe chains ahead of its cursor in either table? It would need to track the furthest bucket touched (due to probe chains in destination table) and continue until the rehash cursor has caught up and covered that bucket, otherwise we can't guarantee that the destination table is completely empty ahead of the rehash cursor.

I feel like that would make rehash steps too large and random, and we'd lose a lot of the benefit of incremental rehashing. I think it'd be better to not skip the destination bucket when scanning.

Author


Because scan already follows probe chains, I don't think we ever have to worry about rehash moving an unscanned item behind the scan cursor because of its probe chain becoming shorter like you described. If there was an element at the end of a probe chain like that, scan would have either followed the whole potential probe chain and covered the element, or it would be before the earliest bucket it could land in without a probe chain. :)

Contributor


Interesting. I am almost convinced. I will think about it during the night. Do you want to draw an ASCII picture or something to make it look more obvious?

Contributor


I agree that skipping in the new table is not worth the effort.

I still want to argue that it's safe to skip already rehashed indices in the old table though, because those have been rehashed = moved to the new table. (Sam can't be behind the rehash index in the old table, but he can be behind the (index corresponding to the) rehash index in the new table.)

Regarding skipping not-yet-rehashed indices in the new table, I agree it's possible but not really worth the complexity.

My take is that it's not as complex as you say though. Assume the new table is twice as large as the old table. Then each index x in the old table corresponds to two indices y1 and y2 in the new table. Note that these two indices are neighbours, though: we have y1 = x & mask_new and y2 = nextCursor(y1, mask_new). Therefore, any probe chain starting before these indices in the new table needs to involve y1. Thus, we only need to check y1 > (rehashIdx & mask_new) && new_table[y1]->everfull == 0 to know whether we can skip the remaining buckets in the new table in this scan call. (This allows us to skip remaining buckets in the new table when we're following a probe chain in the old table.)

Author

@SoftlyRaining SoftlyRaining Oct 15, 2024


> I still want to argue that it's safe to skip already rehashed indices in the old table though, because those have been rehashed = moved to the new table. (Sam can't be behind the rehash index in the old table, but he can be behind the (index corresponding to the) rehash index in the new table.)

Maybe I'm misunderstanding you, but I think I disagree. Sam can hash to a bucket behind the current rehash index while simultaneously being stored in a bucket ahead of the rehash index due to probe chaining. Suppose that when Sam gets rehashed, he is placed directly in the bucket he hashes to in the new table, which is behind the rehash index. Remember that rehashing does not follow probe chains and can stop between where Sam is stored and the bucket Sam hashes to!

I've redrawn the pictures so that the new table is twice as large as the old table:

      Sam (☺) hashes to this bucket in the old table
      |
old:  E E ☺ .
new:  .......
        |
        RehashIdx
        |
        cursor

       Sam (☺) hashes here in new table (one of the two new buckets that correspond to the old bucket he hashed to)
       |
old:  E E . .
new:  .☺.....
        |   |
        |   RehashIdx
        |
        cursor

Contributor


In your last picture, Sam's face is moved to the new table. In the new table we're not going to skip him. (Especially if we're not skipping anything in the new table.)

Author

@SoftlyRaining SoftlyRaining Oct 17, 2024


But why not? Sam is behind cursor now. Scan assumes it has already covered all buckets behind cursor including the bucket Sam moved to, and won't revisit them.

Contributor


Sorry, I didn't reconnect to your pictures properly.

> But why not? Sam is behind cursor now.

OK, if Sam was rehashed to behind the cursor, then he is not covered in this scan call. Agreed.

But then he must have been covered in the previous scan call. In the old table, he was ahead of the cursor, but scan follows the probe chain and covered him. If he hashes to a bucket behind the cursor but he is located in a bucket ahead of the cursor, there must be a probe chain between those two buckets, because probing is what placed him there.

So, in this scan call, we cover him in the old table:

      Sam (☺) hashes to this bucket in the old table
      |
old:  E E ☺ .
new:  .......
        |
        RehashIdx
        |
        cursor

The only way we could miss him is that rehashing was called from the scan callback, but we have prevented that by pausing rehashing during the scan call.

Contributor

@zuiderkwast zuiderkwast left a comment


Cool test case, rigging a race between scan and rehash.

Contributor

@zuiderkwast zuiderkwast left a comment


I want to merge the fix in the iterator and the tests.

We don't seem to agree about what we can skip in scan. Can you revert those changes for now? (We can keep the renamed variables, table0 to table_small, etc.)

Can you also update the PR title and description to match what's actually implemented?

@SoftlyRaining SoftlyRaining force-pushed the scan-while-rehashing-fix branch 3 times, most recently from c3a9d71 to 8f1e2eb Compare October 17, 2024 21:36
@SoftlyRaining SoftlyRaining changed the title [hashset feature] Fix scan skipping elements while hashset is shrink rehashing [hashset feature] Iterator bugfixes - handle unused hashset and change iteration order so rehashed buckets can be skipped Oct 17, 2024
@SoftlyRaining
Author

I've rebased onto the new squashed hashset branch and removed the hashsetScan() changes we're still discussing. (I did keep one small, simple optimization though.)

@zuiderkwast zuiderkwast changed the title [hashset feature] Iterator bugfixes - handle unused hashset and change iteration order so rehashed buckets can be skipped [hashset feature] Iterator bugfixes - handle empty hashset and change iteration order so rehashed buckets can be skipped Oct 18, 2024
@zuiderkwast zuiderkwast merged commit bfbf5b0 into valkey-io:hashset Oct 18, 2024
17 checks passed