Introduction of the PrefetchRateLimiter #13907

krhancoc · 2025-08-28T17:22:09Z

The multiscan operation uses the Prepare() function to prefetch and pin all the required blocks for its provided scan ranges. However, this can consume a large amount of memory. To restrict this, we introduce the PrefetchRateLimiter, which allows users to bound how much is prefetched so that we do not OOM the host.

Test Plan
Added additional tests.

facebook-github-bot · 2025-09-03T01:12:09Z

@krhancoc has imported this pull request. If you are a Meta employee, you can view this in D81548706.

facebook-github-bot · 2025-09-03T03:24:51Z

@krhancoc has imported this pull request. If you are a Meta employee, you can view this in D81548706.

facebook-github-bot · 2025-09-03T17:10:03Z

@krhancoc has imported this pull request. If you are a Meta employee, you can view this in D81548706.

anand1976 · 2025-09-03T23:25:32Z

table/block_based/block_based_table_iterator.cc

@@ -1345,6 +1410,14 @@ void BlockBasedTableIterator::FindBlockForwardInMultiScan() {
    }
    // Move to the next pinned data block
    ResetDataIter();
+    if (multi_scan_->prefetch_rate_limiter) {
+      size_t releasing =
+          multi_scan_->pinned_data_blocks[multi_scan_->cur_data_block_idx]


pinned_data_blocks[multi_scan_->cur_data_block_idx] would not be valid at this point I think, since it would've been transferred to the data block iter. Or am I missing something?

anand1976 · 2025-09-03T23:27:39Z

include/rocksdb/options.h

+  virtual bool release(size_t bytes) = 0;
+};
+
+class DefaultPrefetchRateLimiter : public PrefetchRateLimiter {


This need not be declared in a public header file. RocksDB typically just exposes an allocator, like NewDefaultPrefetchRateLimiter().
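A minimal sketch of the pattern suggested above, under stated assumptions: only `release(size_t)` is visible in this PR's diff, so the `acquire` signature, the byte-budget semantics, and a `NewDefaultPrefetchRateLimiter()` that takes a byte limit are hypothetical and used only for illustration. The public header would declare just the abstract class and the factory function, while the concrete class stays in a .cc file.

```cpp
#include <cstddef>
#include <memory>
#include <mutex>

// ---- What could live in the public header (sketch) ----
class PrefetchRateLimiter {
 public:
  virtual ~PrefetchRateLimiter() = default;
  // Hypothetical signature: tries to reserve `bytes` and returns the number
  // of bytes actually granted.
  virtual size_t acquire(size_t bytes, bool all_or_nothing) = 0;
  // Returns true if the bytes were handed back to the budget.
  virtual bool release(size_t bytes) = 0;
};

// Hypothetical factory; the only way callers obtain the default limiter.
std::shared_ptr<PrefetchRateLimiter> NewDefaultPrefetchRateLimiter(
    size_t max_bytes);

// ---- What could live in a .cc file (sketch) ----
namespace {
class DefaultPrefetchRateLimiter : public PrefetchRateLimiter {
 public:
  explicit DefaultPrefetchRateLimiter(size_t max_bytes)
      : max_bytes_(max_bytes), available_(max_bytes) {}

  size_t acquire(size_t bytes, bool all_or_nothing) override {
    std::lock_guard<std::mutex> lock(mu_);
    if (bytes > available_) {
      if (all_or_nothing) return 0;  // Request does not fit; grant nothing.
      bytes = available_;            // Otherwise grant whatever is left.
    }
    available_ -= bytes;
    return bytes;
  }

  bool release(size_t bytes) override {
    std::lock_guard<std::mutex> lock(mu_);
    // Reject a release larger than what is currently outstanding.
    if (bytes > max_bytes_ - available_) return false;
    available_ += bytes;
    return true;
  }

 private:
  std::mutex mu_;
  const size_t max_bytes_;
  size_t available_;
};
}  // namespace

std::shared_ptr<PrefetchRateLimiter> NewDefaultPrefetchRateLimiter(
    size_t max_bytes) {
  return std::make_shared<DefaultPrefetchRateLimiter>(max_bytes);
}
```

With this split, the default implementation can change freely without touching the public header.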

anand1976 · 2025-09-03T23:47:39Z

table/block_based/block_based_table_iterator.cc

+          multi_scan_->pinned_data_blocks[multi_scan_->cur_data_block_idx]
+              .GetValue()
+              ->size();
+      multi_scan_->prefetch_rate_limiter->release(releasing);


Ideally we'd do this in a cleanup function registered with block_iter_ (which is derived from Cleanable) so that the release happens whenever block_iter_ is reset.
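A rough sketch of this idea, not the PR's code. `MiniCleanable` is a stand-in that mimics the `RegisterCleanup(function, arg1, arg2)` / `Reset()` shape of `Cleanable`; `SketchLimiter`, `ReleaseCtx`, and `AttachRelease` are hypothetical names introduced for illustration. The point is that the bytes are returned whenever the iterator is reset or destroyed, so no call site has to remember to release.

```cpp
#include <cstddef>
#include <vector>

// Stand-in for Cleanable, reduced to what the sketch needs.
class MiniCleanable {
 public:
  using CleanupFunction = void (*)(void* arg1, void* arg2);

  MiniCleanable() = default;
  MiniCleanable(const MiniCleanable&) = delete;
  MiniCleanable& operator=(const MiniCleanable&) = delete;
  ~MiniCleanable() { Reset(); }

  void RegisterCleanup(CleanupFunction fn, void* arg1, void* arg2) {
    cleanups_.push_back({fn, arg1, arg2});
  }

  // Runs every registered cleanup once, then forgets them.
  void Reset() {
    std::vector<Entry> pending;
    pending.swap(cleanups_);
    for (const Entry& e : pending) e.fn(e.arg1, e.arg2);
  }

 private:
  struct Entry {
    CleanupFunction fn;
    void* arg1;
    void* arg2;
  };
  std::vector<Entry> cleanups_;
};

// Hypothetical limiter that only tracks outstanding bytes.
struct SketchLimiter {
  size_t outstanding = 0;
  bool release(size_t bytes) {
    if (bytes > outstanding) return false;
    outstanding -= bytes;
    return true;
  }
};

// Heap-allocated context passed through the void* cleanup argument.
struct ReleaseCtx {
  SketchLimiter* limiter;
  size_t bytes;
};

void ReleaseOnCleanup(void* arg1, void* /*arg2*/) {
  auto* ctx = static_cast<ReleaseCtx*>(arg1);
  ctx->limiter->release(ctx->bytes);
  delete ctx;  // The cleanup owns the context.
}

// Ties the release of `block_bytes` to the lifetime of the block iterator.
void AttachRelease(MiniCleanable* block_iter, SketchLimiter* limiter,
                   size_t block_bytes) {
  block_iter->RegisterCleanup(&ReleaseOnCleanup,
                              new ReleaseCtx{limiter, block_bytes}, nullptr);
}
```

In this sketch the limiter must outlive the iterator, since the cleanup dereferences it when it runs.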

anand1976 · 2025-09-03T23:53:26Z

table/block_based/block_based_table_iterator.cc

+      // Lets make sure we are rate limited on how many blocks to prepare
+      if (multiscan_opts->prefetch_rate_limiter) {
+        auto blocks = multiscan_opts->GetMutablePrefetchRateLimiter().acquire(
+            table_, index_iter_->value().handle.size(), true);


Write the last parameter as /*all_or_nothing=*/true (Google C++ style guide: https://google.github.io/styleguide/cppguide.html#Function_Argument_Comments)

anand1976 · 2025-09-04T17:31:48Z

It's not clear how the proposed interface would support prefetching across multiple levels. For example, say we have L1, L2 and L3, and L1's Prepare() gets called first. It could exhaust the prefetch quota, leaving L2 and L3 unable to do any prefetching. I understand that we want to keep the initial implementation simple and limited to single-level use cases for now, but the interface should be extensible to support other use cases in the future.
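To make the concern concrete, here is a small simulation under stated assumptions. `SharedBudget` and `PrefetchedBlocksPerLevel` are hypothetical and are not the PR's classes; the simulation assumes each level's Prepare() runs in order against one shared byte budget and requests whole blocks with all-or-nothing semantics.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical shared byte budget, standing in for a single limiter
// instance used by every level.
class SharedBudget {
 public:
  explicit SharedBudget(size_t max_bytes) : available_(max_bytes) {}

  // Grants up to `bytes`; grants nothing if `all_or_nothing` is set and the
  // request does not fit.
  size_t acquire(size_t bytes, bool all_or_nothing) {
    if (bytes > available_) {
      if (all_or_nothing) return 0;
      bytes = available_;
    }
    available_ -= bytes;
    return bytes;
  }

 private:
  size_t available_;
};

// Simulates Prepare() on each level in order. Every level wants
// `blocks_per_level` blocks of `block_bytes` (assumed non-zero) and stops at
// the first request the budget refuses.
std::vector<size_t> PrefetchedBlocksPerLevel(SharedBudget* budget,
                                             size_t num_levels,
                                             size_t blocks_per_level,
                                             size_t block_bytes) {
  std::vector<size_t> prefetched(num_levels, 0);
  for (size_t level = 0; level < num_levels; ++level) {
    for (size_t b = 0; b < blocks_per_level; ++b) {
      if (budget->acquire(block_bytes, /*all_or_nothing=*/true) == 0) break;
      ++prefetched[level];
    }
  }
  return prefetched;
}
```

With a budget of eight 4096-byte blocks and three levels that each want eight blocks, the first level takes the entire budget and the other two prefetch nothing, which is the starvation described above.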

Initial Design of PrefetchRateLimiter

b27217e

meta-cla bot added the CLA Signed label Aug 28, 2025

krhancoc added 10 commits August 28, 2025 10:35

Nits

f9ff96d

remove optional

a0cb28f

Remove default constructor

6d49421

format

f785f74

Nits

b164f7c

remove unneeded

fa5df87

format

677f7c9

Check scan_opts as well

7e5e964

Possibly invalid scan_opts

0d2f0e2

Format

63cdbed

linters

64e9780

krhancoc added 2 commits September 3, 2025 09:37

Also make sure to have proper barriers in tests

3d5aee2

linter

396b0d0

krhancoc requested review from anand1976 and cbi42 September 4, 2025 16:44

anand1976 reviewed Sep 4, 2025
