
Linux kernel bugs

Brice Goglin edited this page Mar 2, 2018 · 24 revisions

invalid L3 cpuset on AMD 24-core EPYC

****************************************************************************
* hwloc 1.11.8 has encountered what looks like an error from the operating system.
*
* L3 (cpuset 0x60000060) intersects with NUMANode (P#0 cpuset 0x3f00003f nodeset 0x00000001) without inclusion!
* Error occurred in topology.c line 1088
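
To make the message concrete, here is a minimal sketch of what "intersects without inclusion" means for the two cpusets quoted above. It is not hwloc's actual check from topology.c; the helper names `parse_cpuset` and `classify` are hypothetical, and the parser only handles single-word masks such as the ones in this report.

```python
def parse_cpuset(text):
    """Parse a single-word hexadecimal cpuset string such as '0x60000060'."""
    return int(text, 16)


def classify(a, b):
    """Describe how two cpusets (given as integer bitmasks) relate."""
    common = a & b
    if common == 0:
        return "disjoint"
    if common == a or common == b:
        # One set is entirely contained in the other (or they are equal).
        return "included"
    return "intersects without inclusion"


if __name__ == "__main__":
    l3 = parse_cpuset("0x60000060")    # L3 cpuset from the error message
    numa = parse_cpuset("0x3f00003f")  # NUMANode P#0 cpuset from the error message
    print(classify(l3, numa))          # intersects without inclusion
    print(hex(l3 & numa))              # 0x20000020
```

The L3 mask covers CPUs 5, 6, 29 and 30, while the NUMA node covers CPUs 0-5 and 24-29. Only CPUs 5 and 29 are shared, so neither object contains the other, which is not a valid hierarchy.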

Fixed in Linux 4.14 by commit

commit 2b83809a5e6d619a780876fcaf68cdc42b50d28c
Author: Suravee Suthikulpanit <[email protected]>
Date:   Mon Jul 31 10:51:59 2017 +0200

    x86/cpu/amd: Derive L3 shared_cpu_map from cpu_llc_shared_mask
    
    For systems with X86_FEATURE_TOPOEXT, current logic uses the APIC ID
    to calculate shared_cpu_map. However, APIC IDs are not guaranteed to
    be contiguous for cores across different L3s (e.g. family17h system
    w/ downcore configuration). This breaks the logic, and results in an
    incorrect L3 shared_cpu_map.
    
    Instead, always use the previously calculated cpu_llc_shared_mask of
    each CPU to derive the L3 shared_cpu_map.

Backported to upstream 4.13.16.
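
One way to check whether a running kernel still reports an inconsistent L3 is to compare the L3 `shared_cpu_map` files with the NUMA node `cpumap` files in sysfs. The sketch below assumes the usual layout under /sys/devices/system; the function names `parse_sysfs_cpumask` and `l3_conflicts` are hypothetical, and this is not how hwloc itself performs the check.

```python
from pathlib import Path


def parse_sysfs_cpumask(text):
    """Parse a sysfs cpumask such as '00000000,60000060' into an integer.

    sysfs prints the mask as comma-separated 32-bit hex words, most
    significant word first, so removing the commas gives one hex number.
    """
    return int(text.strip().replace(",", ""), 16)


def l3_conflicts(sysfs_root="/sys/devices/system"):
    """Return (l3_mask, node_name, node_mask) for each L3/NUMA conflict."""
    root = Path(sysfs_root)
    nodes = {}
    for node_dir in root.glob("node/node[0-9]*"):
        cpumap = node_dir / "cpumap"
        if cpumap.exists():
            nodes[node_dir.name] = parse_sysfs_cpumask(cpumap.read_text())

    conflicts = []
    seen = set()
    for cache in root.glob("cpu/cpu[0-9]*/cache/index[0-9]*"):
        level_file = cache / "level"
        map_file = cache / "shared_cpu_map"
        if not (level_file.exists() and map_file.exists()):
            continue
        if level_file.read_text().strip() != "3":
            continue
        l3 = parse_sysfs_cpumask(map_file.read_text())
        if l3 in seen:  # every CPU sharing this L3 reports the same mask
            continue
        seen.add(l3)
        for name, node in sorted(nodes.items()):
            common = l3 & node
            if common and common != l3 and common != node:
                conflicts.append((hex(l3), name, hex(node)))
    return conflicts


if __name__ == "__main__":
    for l3, name, node in l3_conflicts():
        print(f"L3 {l3} intersects {name} ({node}) without inclusion")
```

On an affected kernel this would be expected to list the offending L3 masks; an empty result only means this particular inconsistency was not found.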

split Packages on Intel Xeon E5 v3/v4 with Cluster-on-Die enabled

Each dual-NUMA package is reported as two single-NUMA packages.

Fixed in Linux 3.18 by commit

commit cebf15eb09a2fd2fa73ee4faa9c4d2f813cf0f09
Author: Dave Hansen <[email protected]>
Date:   Thu Sep 18 12:33:34 2014 -0700

    x86, sched: Add new topology for multi-NUMA-node CPUs

invalid L3 cpuset on AMD 12-core Opteron 6200/6300 (Bulldozer and Piledriver)

****************************************************************************
* Hwloc has encountered what looks like an error from the operating system.
*
* object (L3 cpuset 0x000003f0) intersection without inclusion!

The fix was NEVER pushed to the Linux kernel. To work around the issue, use hwloc 1.11.2 and set HWLOC_COMPONENTS=x86 in your environment.
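
When hwloc tools are launched from a script, the variable can be set for the child process only instead of the whole session. The following is a minimal sketch, assuming that `lstopo-no-graphics` or `lstopo` from hwloc is installed and on the PATH; the helper names `hwloc_x86_env` and `run_lstopo` are hypothetical.

```python
import os
import shutil
import subprocess


def hwloc_x86_env(base_env=None):
    """Return a copy of the environment with HWLOC_COMPONENTS=x86 set."""
    env = dict(os.environ if base_env is None else base_env)
    env["HWLOC_COMPONENTS"] = "x86"
    return env


def run_lstopo():
    """Run lstopo with the workaround applied; return its output or None."""
    lstopo = shutil.which("lstopo-no-graphics") or shutil.which("lstopo")
    if lstopo is None:
        return None  # hwloc command-line tools are not installed
    result = subprocess.run(
        [lstopo],
        env=hwloc_x86_env(),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


if __name__ == "__main__":
    output = run_lstopo()
    print(output if output is not None else "lstopo not found")
```

Building a separate environment dictionary leaves the parent process untouched, so other programs started from the same script are not affected by the workaround.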
