mmc: sdhci: fix max req size based on spec #5571

6by9 · 2023-08-10T20:06:47Z

THIS NEEDS A CORRECT SoB BEFORE BEING CONSIDERED FOR MERGE.

https://forums.raspberrypi.com/viewtopic.php?t=354518 for testing.

For almost 2 decades, the max allowed requests were limited to 512KB because of SDMA's max 512KiB boundary limit.

ADMA2 and ADMA3 do not have such limits and were effectively made so any kind of block count would not impose interrupt and managing stress to the host.

By limiting that to 512KiB, it effectively downgrades these DMA modes to SDMA.

Fix that by actually following the spec:
When ADMA is selected tuning mode is advised.
On lesser modes 4MiB transfers is selected as max, so re-tuning if timer trigger or if requested by host interrupt, can be done in time. Otherwise, the only limit is the variable size of types used. In this implementation, 16MiB is used as maximum since tests showed that after that point, there are diminishing returns.

Also 16MiB in worst case scenarios, when card is eMMC and its max speed is a generous 350MiB/s, will generate interrupts every 45ms on huge data transfers.

A new adma_get_req_limit sdhci host function was also introduced, to let vendors override imposed limits by the generic implementation if needed.

For example, on local tests with rigorous CPU/GPU burn-in tests and abrupt cut-offs to generate huge temperature changes (upwards/downwards) to the card, tested host was fine up to 128MB/s transfers on slow cards that used SDR104 bus timing without re-tuning.
In that case the 4MiB limit was overridden with a more than safe 8MiB value.

In all testing cases and boards, that change brought the following:

Depending on bus timing and eMMC/SD specs:

Max Read throughput increased by 2-20%
Max Write throughput increased by 50-200% Depending on CPU frequency and transfer sizes:
Reduced mmcqd cpu core usage by 4-50%

For almost 2 decades, the max allowed requests were limited to 512KB because of SDMA's max 512KiB boundary limit. ADMA2 and ADMA3 do not have such limits and were effectively made so any kind of block count would not impose interrupt and managing stress to the host. By limiting that to 512KiB, it effectively downgrades these DMA modes to SDMA. Fix that by actually following the spec: When ADMA is selected tuning mode is advised. On lesser modes 4MiB transfers is selected as max, so re-tuning if timer trigger or if requested by host interrupt, can be done in time. Otherwise, the only limit is the variable size of types used. In this implementation, 16MiB is used as maximum since tests showed that after that point, there are diminishing returns. Also 16MiB in worst case scenarios, when card is eMMC and its max speed is a generous 350MiB/s, will generate interrupts every 45ms on huge data transfers. A new `adma_get_req_limit` sdhci host function was also introduced, to let vendors override imposed limits by the generic implementation if needed. For example, on local tests with rigorous CPU/GPU burn-in tests and abrupt cut-offs to generate huge temperature changes (upwards/downwards) to the card, tested host was fine up to 128MB/s transfers on slow cards that used SDR104 bus timing without re-tuning. In that case the 4MiB limit was overridden with a more than safe 8MiB value. In all testing cases and boards, that change brought the following: Depending on bus timing and eMMC/SD specs: * Max Read throughput increased by 2-20% * Max Write throughput increased by 50-200% Depending on CPU frequency and transfer sizes: * Reduced mmcqd cpu core usage by 4-50% Signed-off-by: CTCaer <[email protected]>

6by9 · 2023-08-10T20:21:06Z

@CTCaer this is your patch from CTCaer/switch-l4t-kernel-4.9@fa86ebb but with the whitespace changes and custom vendor override hook removed (if required then they should be additional independent patches).

We've been asked to test and look at upstreaming it, but as per https://www.kernel.org/doc/html/latest/process/submitting-patches.html#sign-your-work-the-developer-s-certificate-of-origin that requires a genuine name alongside the email address. Is there any chance you could provide such? Replying via this PR is generally considered sufficient as it is a public record, and I can then rewrite the commit and re-push.

Thanks.

lategoodbye · 2023-09-11T22:10:17Z

Looks interesting. Please drop the following part from the commit message:

A new adma_get_req_limit sdhci host function was also introduced, to let
vendors override imposed limits by the generic implementation if needed.

lategoodbye · 2023-09-11T22:37:18Z

drivers/mmc/host/sdhci.c

+		if (host->tuning_mode != SDHCI_TUNING_MODE_3)
+			mmc->max_req_size = 4194304;
+		else
+			mmc->max_req_size = 16777216;


There are defines which are easier to read: SZ_4M and SZ_16M

theofficialgman · 2023-09-28T12:50:54Z

Sorry, the referenced user isn't interested in merging this way due to the public nature of the PR.

I wrote in the original forum thread how it should be done but that was not followed.
Looks like I will have to contact an upstream maintainer who hopefully is more competent.

6by9 · 2023-09-28T13:01:05Z

Sorry, the referenced user isn't interested in merging this way due to the public nature of the PR.

It's an open source project, so how can it not be public?
All that was being requested was a Signed-off-by tag - that is mandatory for any contribution to mainline.

theofficialgman · 2023-09-28T13:16:23Z

Sorry, the referenced user isn't interested in merging this way due to the public nature of the PR.

It's an open source project, so how can it not be public? All that was being requested was a Signed-off-by tag - that is mandatory for any contribution to mainline.

@6by9 no you are missing the point and context. the commit is written with a signed-off-by tag already under the users pseudonym. The user requested that they be contacted via that email address directly (how it is done in mainline to resolve such issues) so that they could either A: change the commit and place it under a real name without reference to the original commit (as the author, they can do that) or B: give authorship writes to someone else entirely.

A/B keep the user pseudonym CTCaer's anonymity. Doing the change here as part of a PR that is easily searchable does not.

lategoodbye reviewed Sep 11, 2023

View reviewed changes

6by9 closed this Sep 28, 2023

theofficialgman mentioned this pull request Nov 12, 2023

Raspberry Pi 5 model B geerlingguy/sbc-reviews#21

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mmc: sdhci: fix max req size based on spec #5571

mmc: sdhci: fix max req size based on spec #5571

6by9 commented Aug 10, 2023

6by9 commented Aug 10, 2023

lategoodbye commented Sep 11, 2023 •

edited

Loading

lategoodbye Sep 11, 2023

theofficialgman commented Sep 28, 2023 •

edited

Loading

6by9 commented Sep 28, 2023

theofficialgman commented Sep 28, 2023 •

edited

Loading

mmc: sdhci: fix max req size based on spec #5571

mmc: sdhci: fix max req size based on spec #5571

Conversation

6by9 commented Aug 10, 2023

6by9 commented Aug 10, 2023

lategoodbye commented Sep 11, 2023 • edited Loading

lategoodbye Sep 11, 2023

Choose a reason for hiding this comment

theofficialgman commented Sep 28, 2023 • edited Loading

6by9 commented Sep 28, 2023

theofficialgman commented Sep 28, 2023 • edited Loading

lategoodbye commented Sep 11, 2023 •

edited

Loading

theofficialgman commented Sep 28, 2023 •

edited

Loading

theofficialgman commented Sep 28, 2023 •

edited

Loading