Decode high duty-cycle CRSF frames using frame marker rather than timeouts #26183

andyp1per · 2024-02-10T15:03:39Z

This fixes an issue with CRSF and ELRS where fast update rates and/or high CPU load can lead to radio failsafes due to scheduling delays preventing full parsing of frames and then the re-synchronization taking too long. The approach is to look for the CRSF frame marker (0xC8) and parse from that point. If parsing fails then the next frame marker is searched for and intermediate bytes are dropped. This means that re-synchronization should take place in a single frame.

tridge

this looks like it should work, but it seems a bit awkward.
might be easier if you had a union on Frame with a uint8_t array?

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp

andyp1per · 2024-02-11T09:14:08Z

might be easier if you had a union on Frame with a uint8_t array?

Maybe, but that then makes it a very large change as all the references to _frame have to be updated. I think it would be better to do that as a separate NFC PR.

andyp1per · 2024-02-11T15:03:28Z

Flew this on the copter that was having multiple RC failsafes - clean as a whistle, no issues at all and flew great. I think this is a good solution and should definitely go in 4.5. Should be possibly considered for 4.4 after further user testing.

tridge

i would prefer the union/memmove approach, it is just much clear

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp

timtuxworth · 2024-02-13T00:04:08Z

@andyp1per I think you might only be using Crossfire, should this be tested with Express LRS (which also uses CRSF)?

andyp1per · 2024-02-13T17:25:58Z

Tested on ELRS at 250Hz 1:2 and 500Hz 1:2 without issue

andyp1per · 2024-02-13T17:36:04Z

i would prefer the union/memmove approach, it is just much clear

Upon reflection I don't agree with this. If I was going to do a union I was only going to do that to make the maths a bit easier, I wasn't planning on doing a memmove. The problem is if we do a memmove we still have to process the bytes to see if the new start makes any sense or not. The new code would then have to have a separate parser to look at frames rather than bytes and we still need to do byte-based processing in the happy path. We still don't know when we have got a frame until we have finally read the CRC and validated it. We don't even know that we have enough bytes to process - in fact quite likely that we don't since that is the general problem that cause the failsafes in the first place. memmove also adds in the overhead of the copy as well as processing the data. So although I agree its a little hard to get your head around at first I think the approach I have implemented is actually the most efficient and the best for the kind of data stream we are dealing with.

tridge · 2024-02-14T22:06:19Z

@andyp1per valgrind is not happy with the decoder:

tridge · 2024-02-14T22:11:07Z

The problem is if we do a memmove we still have to process the bytes to see if the new start makes any sense or not

no, you don't. The basis of a union based decoder is this:

a function which decodes a frame buffer. No shifts, no reading inside the fn, just takes an array of bytes and returns success/fail and the results on success
an outside fn which creates does the reading, filling in a union byte array. When it has the right header and length it calls the frame decoder. If it succeeds then we have a frame done and can zero the input buffer. On fail it uses memchr to find a header byte. If no header byte it zeros the buffer. If it finds a header byte it does a memmove and then sets the length of the input buffer to the bytes starting at the header it found. It then is in a state just as if the bad input bytes never existed

tridge · 2024-02-15T00:31:32Z

@andyp1per note that to use valgrind, you typically build for the linux target with:

./waf configure --board linux --debug
./waf --target examples/RCProtocolTest
valgrind --soname-synonyms=somalloc=nouserintercepts -q build/linux/examples/RCProtocolTest

andyp1per · 2024-02-15T09:34:14Z

@andyp1per valgrind is not happy with the decoder:

I am unable to get valgrind to complain, would be good to understand what else I might need to do.

tridge · 2024-02-15T21:34:02Z

@andyp1per can you have a look at this branch? I've added a commit that reworks the parser using the more conventional memchr/memmove approach for re-sync. I think it makes the code clearer
https://github.com/tridge/ardupilot/commits/pr-high-duty-rc/

tridge · 2024-02-16T00:51:44Z

@andyp1per I've updated that branch some more to fix the test code under SITL and moved the crc calculation to be done once we have the full frame, which makes the skip to next frame work properly

tridge · 2024-02-16T02:40:45Z

@andyp1per note that the CRSF4 test fails with your PR as-is

andyp1per · 2024-02-17T16:28:20Z

@tridge I have incorporated your changes into my PR, fixed up, reviewed and tested. Seems to be working well - thanks.

tridge · 2024-02-17T21:03:01Z

@peterbarker would you mind checking the parser for any logic errors that could cause a crash?

peterbarker · 2024-02-19T03:17:29Z

@andyp1per can we remove AP_SerialProtocol_CRSF?

It's been broken for 18 months and nobody noticed.

We could save bytes and code complexity by removing it.

9b8ea84 was the commit which killed it AFAICS.

tridge · 2024-02-19T03:27:38Z

@andyp1per to expand on the comment from peter, the AP_RCProtocol_CRSF::update() function calls process_byte() with invalid arguments (passes in micros as the byte). The if (_uart) section of update() is only used when SERIALn_PROTOCOL is set to CRSF as opposed to RC. I don't see the point of setting it to CRSF, what is that meant to do?

andyp1per · 2024-02-19T08:23:46Z

I'll fix. Most of the TBS kit supports direct CRSF control without RC frames - I have many. This was the original implementation and I don't want to lose it.

andyp1per · 2024-02-19T10:11:27Z

Fixed and tested

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp

tridge · 2024-02-19T20:23:43Z

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp

@@ -299,7 +349,7 @@ void AP_RCProtocol_CRSF::update(void)
        for (uint8_t i = 0; i < n; i++) {
            int16_t b = _uart->read();
            if (b >= 0) {
-                _process_byte(AP_HAL::micros(), uint8_t(b));


could you explain what the "standalone" mode is for? It seems to be setup by setting SERIALn_PROTOCOL to CRSF instead of RCIN, but I don't see how it fits in. It has clearly been broken for a long time so I wonder if it is really needed at all?

Pretty much all TBS kit supports CRSF control without RC packets. e.g. like SmartAudio for VTX or I have a TBS VTX that supports OSD natively via CRSF - the standalone mode allows you to support these devices. The format is pretty much the same, its just how you integrate that is not - you can't do it via the RC protocol because that essentially assumes that you have data always coming in that you need to reply to. These devices are genuine full-duplex so the FC is often initiating the comms. This was the original CRSF support, I have several copters using it I just haven't flown them for a while - I'm not planning on losing it.

tridge · 2024-02-21T03:32:00Z

@andyp1per I've force pushed a fix for the trailing bytes error

fixed RCProtocolTest on SITL and make it pass/fail with an exit code Co-authored-by: Andrew Tridgell <[email protected]>

… rather than timeouts Co-authored-by: Andrew Tridgell <[email protected]>

andyp1per · 2024-02-21T14:52:05Z

~~Failsafe's not current working - will need to investigate~~

andyp1per · 2024-02-21T20:17:51Z

Squashed, cherry-picked @peterbarker's commit in as a separate commit and re-tested. All looks good. Happy for this to go in now @tridge

andyp1per added the BUG label Feb 10, 2024

andyp1per force-pushed the pr-high-duty-rc branch from d1ba6f0 to 4d20ff1 Compare February 10, 2024 21:26

tridge requested changes Feb 11, 2024

View reviewed changes

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp Outdated Show resolved Hide resolved

andyp1per force-pushed the pr-high-duty-rc branch from 4d20ff1 to 7fdaff0 Compare February 11, 2024 09:12

andyp1per requested a review from tridge February 11, 2024 09:14

andyp1per added the DevCallTopic label Feb 11, 2024

andyp1per changed the title ~~AP_RCProtocol: decode high duty-cycle CRSF frames using frame marker rather than timeouts~~ Decode high duty-cycle CRSF frames using frame marker rather than timeouts Feb 11, 2024

CraigElder removed the DevCallTopic label Feb 12, 2024

tridge approved these changes Feb 12, 2024

View reviewed changes

tridge requested changes Feb 13, 2024

View reviewed changes

libraries/AP_RCProtocol/AP_RCProtocol_CRSF.cpp Outdated Show resolved Hide resolved

tridge added the DevCallEU label Feb 13, 2024

tridge removed the DevCallEU label Feb 14, 2024

tridge mentioned this pull request Feb 14, 2024

AP_RCProtocol: fixed valgrind on RCProtocolTest example #26228

Closed

andyp1per force-pushed the pr-high-duty-rc branch 2 times, most recently from 7f9f01b to 30a7138 Compare February 17, 2024 16:27

andyp1per requested a review from tridge February 17, 2024 16:28

tridge approved these changes Feb 17, 2024

View reviewed changes

andyp1per force-pushed the pr-high-duty-rc branch from ea73cd5 to c5a2a10 Compare February 19, 2024 11:50

tridge requested changes Feb 19, 2024

View reviewed changes

tridge added the DevCallEU label Feb 19, 2024

tridge force-pushed the pr-high-duty-rc branch 2 times, most recently from a63276d to a85fbc8 Compare February 21, 2024 03:24

tridge removed the DevCallEU label Feb 21, 2024

AP_RCProtocol: add tests for CRSF and fix protocol test

aab580f

fixed RCProtocolTest on SITL and make it pass/fail with an exit code Co-authored-by: Andrew Tridgell <[email protected]>

andyp1per force-pushed the pr-high-duty-rc branch from a85fbc8 to 7e57c78 Compare February 21, 2024 14:26

andyp1per and others added 2 commits February 21, 2024 14:31

AP_RCProtocol: decode high duty-cycle CRSF frames using frame markers…

c051bbe

… rather than timeouts Co-authored-by: Andrew Tridgell <[email protected]>

AP_RCProtocol: CRSF: use subtraction with times, not time+timedelta

b464712

andyp1per force-pushed the pr-high-duty-rc branch from 7e57c78 to b464712 Compare February 21, 2024 14:32

andyp1per requested a review from tridge February 21, 2024 20:16

tridge approved these changes Feb 22, 2024

View reviewed changes

tridge merged commit b19f8ed into ArduPilot:master Feb 22, 2024
92 checks passed

andyp1per deleted the pr-high-duty-rc branch February 22, 2024 09:47

andyp1per mentioned this pull request Feb 22, 2024

AP_RCProtocol: CRSF: use subtraction with times, not time+timedelta #26274

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decode high duty-cycle CRSF frames using frame marker rather than timeouts #26183

Decode high duty-cycle CRSF frames using frame marker rather than timeouts #26183

andyp1per commented Feb 10, 2024 •

edited

Loading

tridge left a comment

andyp1per commented Feb 11, 2024

andyp1per commented Feb 11, 2024

tridge left a comment

timtuxworth commented Feb 13, 2024 •

edited

Loading

andyp1per commented Feb 13, 2024

andyp1per commented Feb 13, 2024

tridge commented Feb 14, 2024

tridge commented Feb 14, 2024

tridge commented Feb 15, 2024

andyp1per commented Feb 15, 2024

tridge commented Feb 15, 2024

tridge commented Feb 16, 2024

tridge commented Feb 16, 2024

andyp1per commented Feb 17, 2024

tridge commented Feb 17, 2024

peterbarker commented Feb 19, 2024

tridge commented Feb 19, 2024

andyp1per commented Feb 19, 2024 •

edited

Loading

andyp1per commented Feb 19, 2024

tridge Feb 19, 2024

andyp1per Feb 20, 2024

tridge commented Feb 21, 2024

andyp1per commented Feb 21, 2024 •

edited

Loading

andyp1per commented Feb 21, 2024

Decode high duty-cycle CRSF frames using frame marker rather than timeouts #26183

Decode high duty-cycle CRSF frames using frame marker rather than timeouts #26183

Conversation

andyp1per commented Feb 10, 2024 • edited Loading

tridge left a comment

Choose a reason for hiding this comment

andyp1per commented Feb 11, 2024

andyp1per commented Feb 11, 2024

tridge left a comment

Choose a reason for hiding this comment

timtuxworth commented Feb 13, 2024 • edited Loading

andyp1per commented Feb 13, 2024

andyp1per commented Feb 13, 2024

tridge commented Feb 14, 2024

tridge commented Feb 14, 2024

tridge commented Feb 15, 2024

andyp1per commented Feb 15, 2024

tridge commented Feb 15, 2024

tridge commented Feb 16, 2024

tridge commented Feb 16, 2024

andyp1per commented Feb 17, 2024

tridge commented Feb 17, 2024

peterbarker commented Feb 19, 2024

tridge commented Feb 19, 2024

andyp1per commented Feb 19, 2024 • edited Loading

andyp1per commented Feb 19, 2024

tridge Feb 19, 2024

Choose a reason for hiding this comment

andyp1per Feb 20, 2024

Choose a reason for hiding this comment

tridge commented Feb 21, 2024

andyp1per commented Feb 21, 2024 • edited Loading

andyp1per commented Feb 21, 2024

andyp1per commented Feb 10, 2024 •

edited

Loading

timtuxworth commented Feb 13, 2024 •

edited

Loading

andyp1per commented Feb 19, 2024 •

edited

Loading

andyp1per commented Feb 21, 2024 •

edited

Loading