-
Notifications
You must be signed in to change notification settings - Fork 208
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Raspberry Pi 2B fail to boot occasionally CPUx: failed to come online #253
Comments
I posted about this several times in the forum "Moving Linux Kernel to 5.10" thread. First thought it was something new with the 5.10.x kernel, which I was testing at the time, until I saw it with Pi2 and 5.4.83-v7+ kernel. |
Have a read through this issue for some history: #232 So far it seems like a problem in the CPUs that only appears before the caches are enabled. There is nothing wrong with the code being executed, but sometimes it doesn't work as it should. Code placement might be a factor, otherwise I can think of no explanation why some builds are affected and not others. The fact that the failure is probabilistic rather than guaranteed only makes it harder to diagnose. |
Hard to diagnose indeed. I am also using a 4GB Pi4 with the official Raspbian Buster and all the updates installed as a secondary desktop without any problem so far. On my Pi2 B I can confirm that:
I can provide any other information i.e. log files etc, should you need it. |
|
Oh good. Having just fixed an interesting I2C bug I was looking for another rabbit hole to disappear down. |
That's interesting. Updated to kernel 5.10.11-v7+ on Thursday and the freeze problem after boot seems to be fixed. I made around 30 reboots from ssh without issue, but I also noticed the line Feb 5 13:37:17 PI kernel: [ 1.040913] CPU2: failed to come online As I am unaware of Raspberry Pi internals such as revision numbers, variants etc that may be relevant to the issue, I am posting some information from /proc/cpuinfo which applies to both devices I own: Hardware : BCM2835 CPU architecture: 7 I will repeat the boot test from the console during the weekend checking kern.log for each reboot. |
Kernel 5.10.11-v7+ made it harder to reproduce. Here are my results:
As a workaround for (2) and based on (3), I use a small shell script to check the number CPU cores on /proc/cpuinfo on every boot. If this number is less than 4, the script reboots the Pi. |
Kernel version:
Linux P1 5.4.83-v7+ #1379 SMP Mon Dec 14 13:08:57 GMT 2020 armv7l GNU/Linux
The first Pi runs the "lite" Raspbian Buster image and the second the "desktop and recommended software". Both Raspberries boot to the console and all the installed packages come from Raspbian repositories.
Since the last December however, both Pis occasionally fail to boot. The network does not come up and when I connect a monitor, the devices freeze to the login: prompt. When I unplug/plug the Pis, everything is back to normal.
While investigating this issue I found that on every unsuccessful boot a CPU core does not come up. The following is logged in kern.log:
On an unsuccessful boot:
Jan 18 21:42:08 PI kernel: [ 0.007635] smp: Bringing up secondary CPUs ...
Jan 18 21:42:08 PI kernel: [ 1.040987] CPU1: failed to come online
Jan 18 21:42:08 PI kernel: [ 1.042804] CPU2: update cpu_capacity 1024
Jan 18 21:42:08 PI kernel: [ 1.042816] CPU2: thread -1, cpu 2, socket 15, mpidr 80000f02
Jan 18 21:42:08 PI kernel: [ 1.044511] CPU3: update cpu_capacity 1024
Jan 18 21:42:08 PI kernel: [ 1.044524] CPU3: thread -1, cpu 3, socket 15, mpidr 80000f03
Jan 18 21:42:08 PI kernel: [ 1.044740] smp: Brought up 1 node, 3 CPUs
Jan 18 21:42:08 PI kernel: [ 1.044866] SMP: Total of 3 processors activated (115.20 BogoMIPS).
On a successful boot:
Jan 19 17:00:46 PI kernel: [ 0.007643] smp: Bringing up secondary CPUs ...
Jan 19 17:00:46 PI kernel: [ 0.009263] CPU1: update cpu_capacity 1024
Jan 19 17:00:46 PI kernel: [ 0.009276] CPU1: thread -1, cpu 1, socket 15, mpidr 80000f01
Jan 19 17:00:46 PI kernel: [ 0.011320] CPU2: update cpu_capacity 1024
Jan 19 17:00:46 PI kernel: [ 0.011333] CPU2: thread -1, cpu 2, socket 15, mpidr 80000f02
Jan 19 17:00:46 PI kernel: [ 0.012983] CPU3: update cpu_capacity 1024
Jan 19 17:00:46 PI kernel: [ 0.012995] CPU3: thread -1, cpu 3, socket 15, mpidr 80000f03
Jan 19 17:00:46 PI kernel: [ 0.013205] smp: Brought up 1 node, 4 CPUs
Jan 19 17:00:46 PI kernel: [ 0.013333] SMP: Total of 4 processors activated (153.60 BogoMIPS).
I don't know if this issue is related to issue #232 "CPU1: failed to come online with 5.4.51-v7l+" but I had not such problems with kernel 5.4.51.
Thank you in advance and hope you are all well and safe!
The text was updated successfully, but these errors were encountered: