Changelog in Linux kernel 6.12.64

ACPI: CPPC: Fix missing PCC check for guaranteed_perf [+ + +]

Author: Pengjie Zhang <[email protected]>
Date:   Wed Dec 10 21:22:27 2025 +0800

    ACPI: CPPC: Fix missing PCC check for guaranteed_perf
    
    commit 6ea3a44cef28add2d93b1ef119d84886cb1e3c9b upstream.
    
    The current implementation overlooks the 'guaranteed_perf'
    register in this check.
    
    If the Guaranteed Performance register is located in the PCC
    subspace, the function currently attempts to read it without
    acquiring the lock and without sending the CMD_READ doorbell
    to the firmware. This can result in reading stale data.
    
    Fixes: 29523f095397 ("ACPI / CPPC: Add support for guaranteed performance")
    Signed-off-by: Pengjie Zhang <[email protected]>
    Cc: 4.20+ <[email protected]> # 4.20+
    [ rjw: Subject and changelog edits ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ACPI: fan: Workaround for 64-bit firmware bug [+ + +]

Author: Armin Wolf <[email protected]>
Date:   Wed Oct 8 01:41:45 2025 +0200

    ACPI: fan: Workaround for 64-bit firmware bug
    
    [ Upstream commit 2e00f7a4bb0ac25ec7477b55fe482da39fb4dce8 ]
    
    Some firmware implementations use the "Ones" ASL opcode to produce
    an integer with all bits set in order to indicate missing speed or
    power readings. This however only works when using 32-bit integers,
    as the ACPI spec requires a 32-bit integer (0xFFFFFFFF) to be
    returned for missing speed/power readings. With 64-bit integers the
    "Ones" opcode produces a 64-bit integer with all bits set, violating
    the ACPI spec regarding the placeholder value for missing readings.
    
    Work around such buggy firmware implementation by also checking for
    64-bit integers with all bits set when reading _FST.
    
    Signed-off-by: Armin Wolf <[email protected]>
    [ rjw: Typo fix in the changelog ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ACPI: PCC: Fix race condition by removing static qualifier [+ + +]

Author: Pengjie Zhang <[email protected]>
Date:   Wed Dec 10 21:26:34 2025 +0800

    ACPI: PCC: Fix race condition by removing static qualifier
    
    commit f103fa127c93016bcd89b05d8e11dc1a84f6990d upstream.
    
    Local variable 'ret' in acpi_pcc_address_space_setup() is currently
    declared as 'static'. This can lead to race conditions in a
    multithreaded environment.
    
    Remove the 'static' qualifier to ensure that 'ret' will be allocated
    directly on the stack as a local variable.
    
    Fixes: a10b1c99e2dc ("ACPI: PCC: Setup PCC Opregion handler only if platform interrupt is available")
    Signed-off-by: Pengjie Zhang <[email protected]>
    Reviewed-by: Sudeep Holla <[email protected]>
    Acked-by: [email protected]
    Cc: 6.2+ <[email protected]> # 6.2+
    [ rjw: Changelog edits ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ACPI: property: Use ACPI functions in acpi_graph_get_next_endpoint() only [+ + +]

Author: Sakari Ailus <[email protected]>
Date:   Wed Oct 1 13:43:19 2025 +0300

    ACPI: property: Use ACPI functions in acpi_graph_get_next_endpoint() only
    
    [ Upstream commit 5d010473cdeaabf6a2d3a9e2aed2186c1b73c213 ]
    
    Calling fwnode_get_next_child_node() in ACPI implementation of the fwnode
    property API is somewhat problematic as the latter is used in the
    impelementation of the former. Instead of using
    fwnode_get_next_child_node() in acpi_graph_get_next_endpoint(), call
    acpi_get_next_subnode() directly instead.
    
    Signed-off-by: Sakari Ailus <[email protected]>
    Reviewed-by: Laurent Pinchart <[email protected]>
    Reviewed-by: Jonathan Cameron <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ACPICA: Avoid walking the Namespace if start_node is NULL [+ + +]

Author: Cryolitia PukNgae <[email protected]>
Date:   Tue Nov 25 16:14:38 2025 +0800

    ACPICA: Avoid walking the Namespace if start_node is NULL
    
    [ Upstream commit 9d6c58dae8f6590c746ac5d0012ffe14a77539f0 ]
    
    Although commit 0c9992315e73 ("ACPICA: Avoid walking the ACPI Namespace
    if it is not there") fixed the situation when both start_node and
    acpi_gbl_root_node are NULL, the Linux kernel mainline now still crashed
    on Honor Magicbook 14 Pro [1].
    
    That happens due to the access to the member of parent_node in
    acpi_ns_get_next_node().  The NULL pointer dereference will always
    happen, no matter whether or not the start_node is equal to
    ACPI_ROOT_OBJECT, so move the check of start_node being NULL
    out of the if block.
    
    Unfortunately, all the attempts to contact Honor have failed, they
    refused to provide any technical support for Linux.
    
    The bad DSDT table's dump could be found on GitHub [2].
    
    DMI: HONOR FMB-P/FMB-P-PCB, BIOS 1.13 05/08/2025
    
    Link: https://github.com/acpica/acpica/commit/1c1b57b9eba4554cb132ee658dd942c0210ed20d
    Link: https://gist.github.com/Cryolitia/a860ffc97437dcd2cd988371d5b73ed7 [1]
    Link: https://github.com/denis-bb/honor-fmb-p-dsdt [2]
    Signed-off-by: Cryolitia PukNgae <[email protected]>
    Reviewed-by: WangYuli <[email protected]>
    [ rjw: Subject adjustment, changelog edits ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda: cs35l41: Fix NULL pointer dereference in cs35l41_hda_read_acpi() [+ + +]

Author: Denis Arefev <[email protected]>
Date:   Tue Dec 16 06:00:34 2025 -0500

    ALSA: hda: cs35l41: Fix NULL pointer dereference in cs35l41_hda_read_acpi()
    
    [ Upstream commit c34b04cc6178f33c08331568c7fd25c5b9a39f66 ]
    
    The acpi_get_first_physical_node() function can return NULL, in which
    case the get_device() function also returns NULL, but this value is
    then dereferenced without checking,so add a check to prevent a crash.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: 7b2f3eb492da ("ALSA: hda: cs35l41: Add support for CS35L41 in HDA systems")
    Cc: [email protected]
    Signed-off-by: Denis Arefev <[email protected]>
    Reviewed-by: Richard Fitzgerald <[email protected]>
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [ sound/hda/codecs/side-codecs/ -> sound/pci/hda/ ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ALSA: pcmcia: Fix resource leak in snd_pdacf_probe error path [+ + +]

Author: Haotian Zhang <[email protected]>
Date:   Mon Dec 15 17:04:33 2025 +0800

    ALSA: pcmcia: Fix resource leak in snd_pdacf_probe error path
    
    [ Upstream commit 5032347c04ba7ff9ba878f262e075d745c06a2a8 ]
    
    When pdacf_config() fails, snd_pdacf_probe() returns the error code
    directly without freeing the sound card resources allocated by
    snd_card_new(), which leads to a memory leak.
    
    Add proper error handling to free the sound card and clear the card
    list entry when pdacf_config() fails.
    
    Fixes: 15b99ac17295 ("[PATCH] pcmcia: add return value to _config() functions")
    Suggested-by: Takashi Iwai <[email protected]>
    Signed-off-by: Haotian Zhang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: usb-mixer: us16x08: validate meter packet indices [+ + +]

Author: Shipei Qu <[email protected]>
Date:   Wed Dec 17 10:46:30 2025 +0800

    ALSA: usb-mixer: us16x08: validate meter packet indices
    
    [ Upstream commit 5526c1c6ba1d0913c7dfcbbd6fe1744ea7c55f1e ]
    
    get_meter_levels_from_urb() parses the 64-byte meter packets sent by
    the device and fills the per-channel arrays meter_level[],
    comp_level[] and master_level[] in struct snd_us16x08_meter_store.
    
    Currently the function derives the channel index directly from the
    meter packet (MUB2(meter_urb, s) - 1) and uses it to index those
    arrays without validating the range. If the packet contains a
    negative or out-of-range channel number, the driver may write past
    the end of these arrays.
    
    Introduce a local channel variable and validate it before updating the
    arrays. We reject negative indices, limit meter_level[] and
    comp_level[] to SND_US16X08_MAX_CHANNELS, and guard master_level[]
    updates with ARRAY_SIZE(master_level).
    
    Fixes: d2bb390a2081 ("ALSA: usb-audio: Tascam US-16x08 DSP mixer quirk")
    Reported-by: DARKNAVY (@DarkNavyOrg) <[email protected]>
    Closes: https://lore.kernel.org/[email protected]
    Signed-off-by: Shipei Qu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: vxpocket: Fix resource leak in vxpocket_probe error path [+ + +]

Author: Haotian Zhang <[email protected]>
Date:   Mon Dec 15 12:26:52 2025 +0800

    ALSA: vxpocket: Fix resource leak in vxpocket_probe error path
    
    [ Upstream commit 2a03b40deacbd293ac9aed0f9b11197dad54fe5f ]
    
    When vxpocket_config() fails, vxpocket_probe() returns the error code
    directly without freeing the sound card resources allocated by
    snd_card_new(), which leads to a memory leak.
    
    Add proper error handling to free the sound card and clear the
    allocation bit when vxpocket_config() fails.
    
    Fixes: 15b99ac17295 ("[PATCH] pcmcia: add return value to _config() functions")
    Signed-off-by: Haotian Zhang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: wavefront: Clear substream pointers on close [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Tue Dec 16 05:59:26 2025 -0500

    ALSA: wavefront: Clear substream pointers on close
    
    [ Upstream commit e11c5c13ce0ab2325d38fe63500be1dd88b81e38 ]
    
    Clear substream pointers in close functions to avoid leaving dangling
    pointers, helping to improve code safety and
    prevents potential issues.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Junrui Luo <[email protected]>
    Link: https://patch.msgid.link/SYBPR01MB7881DF762CAB45EE42F6D812AFC2A@SYBPR01MB7881.ausprd01.prod.outlook.com
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ALSA: wavefront: Use guard() for spin locks [+ + +]

Author: Takashi Iwai <[email protected]>
Date:   Tue Dec 16 05:59:25 2025 -0500

    ALSA: wavefront: Use guard() for spin locks
    
    [ Upstream commit 4b97f8e614ba46a50bd181d40b5a1424411a211a ]
    
    Clean up the code using guard() for spin locks.
    
    Merely code refactoring, and no behavior change.
    
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Stable-dep-of: e11c5c13ce0a ("ALSA: wavefront: Clear substream pointers on close")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

amba: tegra-ahb: Fix device leak on SMMU enable [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Sep 25 17:00:07 2025 +0200

    amba: tegra-ahb: Fix device leak on SMMU enable
    
    commit 500e1368e46928f4b2259612dcabb6999afae2a6 upstream.
    
    Make sure to drop the reference taken to the AHB platform device when
    looking up its driver data while enabling the SMMU.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away.
    
    Fixes: 89c788bab1f0 ("ARM: tegra: Add SMMU enabler in AHB")
    Cc: [email protected]      # 3.5
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Thierry Reding <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

amd-xgbe: reset retries and mode on RX adapt failures [+ + +]

Author: Raju Rangoju <[email protected]>
Date:   Mon Dec 15 20:47:28 2025 +0530

    amd-xgbe: reset retries and mode on RX adapt failures
    
    [ Upstream commit df60c332caf95d70f967aeace826e7e2f0847361 ]
    
    During the stress tests, early RX adaptation handshakes can fail, such
    as missing the RX_ADAPT ACK or not receiving a coefficient update before
    block lock is established. Continuing to retry RX adaptation in this
    state is often ineffective if the current mode selection is not viable.
    
    Resetting the RX adaptation retry counter when an RX_ADAPT request fails
    to receive ACK or a coefficient update prior to block lock, and clearing
    mode_set so the next bring-up performs a fresh mode selection rather
    than looping on a likely invalid configuration.
    
    Fixes: 4f3b20bfbb75 ("amd-xgbe: add support for rx-adaptation")
    Signed-off-by: Raju Rangoju <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Reviewed-by: Shyam Sundar S K <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arm64: dts: ti: k3-j721e-sk: Fix pinmux for pin Y1 used by power regulator [+ + +]

Author: Siddharth Vadapalli <[email protected]>
Date:   Wed Nov 19 21:31:05 2025 +0530

    arm64: dts: ti: k3-j721e-sk: Fix pinmux for pin Y1 used by power regulator
    
    commit 51f89c488f2ecc020f82bfedd77482584ce8027a upstream.
    
    The SoC pin Y1 is incorrectly defined in the WKUP Pinmux device-tree node
    (pinctrl@4301c000) leading to the following silent failure:
    
        pinctrl-single 4301c000.pinctrl: mux offset out of range: 0x1dc (0x178)
    
    According to the datasheet for the J721E SoC [0], the pin Y1 belongs to the
    MAIN Pinmux device-tree node (pinctrl@11c000). This is confirmed by the
    address of the pinmux register for it on page 142 of the datasheet which is
    0x00011C1DC.
    
    Hence fix it.
    
    [0]: https://www.ti.com/lit/ds/symlink/tda4vm.pdf
    
    Fixes: 97b67cc102dc ("arm64: dts: ti: k3-j721e-sk: Add DT nodes for power regulators")
    Cc: [email protected]
    Signed-off-by: Siddharth Vadapalli <[email protected]>
    Reviewed-by: Yemike Abhilash Chandra <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Vignesh Raghavendra <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

arm64: kdump: Fix elfcorehdr overlap caused by reserved memory processing reorder [+ + +]

Author: Jianpeng Chang <[email protected]>
Date:   Fri Dec 5 09:59:34 2025 +0800

    arm64: kdump: Fix elfcorehdr overlap caused by reserved memory processing reorder
    
    [ Upstream commit 3e8ade58b71b48913d21b647b2089e03e81f117e ]
    
    Commit 8a6e02d0c00e ("of: reserved_mem: Restructure how the reserved
    memory regions are processed") changed the processing order of reserved
    memory regions, causing elfcorehdr to overlap with dynamically allocated
    reserved memory regions during kdump kernel boot.
    
    The issue occurs because:
    1. kexec-tools allocates elfcorehdr in the last crashkernel reserved
       memory region and passes it to the second kernel
    2. The problematic commit moved dynamic reserved memory allocation
       (like bman-fbpr) to occur during fdt_scan_reserved_mem(), before
       elfcorehdr reservation in fdt_reserve_elfcorehdr()
    3. bman-fbpr with 16MB alignment requirement can get allocated at
       addresses that overlap with the elfcorehdr location
    4. When fdt_reserve_elfcorehdr() tries to reserve elfcorehdr memory,
       overlap detection identifies the conflict and skips reservation
    5. kdump kernel fails with "Unable to handle kernel paging request"
       because elfcorehdr memory is not properly reserved
    
    The boot log:
    Before 8a6e02d0c00e:
      OF: fdt: Reserving 1 KiB of memory at 0xf4fff000 for elfcorehdr
      OF: reserved mem: 0xf3000000..0xf3ffffff bman-fbpr
    
    After 8a6e02d0c00e:
      OF: reserved mem: 0xf4000000..0xf4ffffff bman-fbpr
      OF: fdt: elfcorehdr is overlapped
    
    Fix this by ensuring elfcorehdr reservation occurs before dynamic
    reserved memory allocation.
    
    Fixes: 8a6e02d0c00e ("of: reserved_mem: Restructure how the reserved memory regions are processed")
    Signed-off-by: Jianpeng Chang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rob Herring (Arm) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arm64: Revamp HCR_EL2.E2H RES1 detection [+ + +]

Author: Marc Zyngier <[email protected]>
Date:   Fri Dec 19 10:21:23 2025 +0000

    arm64: Revamp HCR_EL2.E2H RES1 detection
    
    [ Upstream commit ca88ecdce5f51874a7c151809bd2c936ee0d3805 ]
    
    We currently have two ways to identify CPUs that only implement FEAT_VHE
    and not FEAT_E2H0:
    
    - either they advertise it via ID_AA64MMFR4_EL1.E2H0,
    - or the HCR_EL2.E2H bit is RAO/WI
    
    However, there is a third category of "cpus" that fall between these
    two cases: on CPUs that do not implement FEAT_FGT, it is IMPDEF whether
    an access to ID_AA64MMFR4_EL1 can trap to EL2 when the register value
    is zero.
    
    A consequence of this is that on systems such as Neoverse V2, a NV
    guest cannot reliably detect that it is in a VHE-only configuration
    (E2H is writable, and ID_AA64MMFR0_EL1 is 0), despite the hypervisor's
    best effort to repaint the id register.
    
    Replace the RAO/WI test by a sequence that makes use of the VHE
    register remnapping between EL1 and EL2 to detect this situation,
    and work out whether we get the VHE behaviour even after having
    set HCR_EL2.E2H to 0.
    
    This solves the NV problem, and provides a more reliable acid test
    for CPUs that do not completely follow the letter of the architecture
    while providing a RES1 behaviour for HCR_EL2.E2H.
    
    Suggested-by: Mark Rutland <[email protected]>
    Acked-by: Mark Rutland <[email protected]>
    Acked-by: Catalin Marinas <[email protected]>
    Reviewed-by: Oliver Upton <[email protected]>
    Tested-by: Jan Kotas <[email protected]>
    Signed-off-by: Marc Zyngier <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Marc Zyngier <[email protected]>
    Signed-off-by: Wei-Lin Chang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ARM: dts: microchip: sama5d2: fix spi flexcom fifo size to 32 [+ + +]

Author: Nicolas Ferre <[email protected]>
Date:   Fri Nov 14 15:02:25 2025 +0100

    ARM: dts: microchip: sama5d2: fix spi flexcom fifo size to 32
    
    commit 7d5864dc5d5ea6a35983dd05295fb17f2f2f44ce upstream.
    
    Unlike standalone spi peripherals, on sama5d2, the flexcom spi have fifo
    size of 32 data. Fix flexcom/spi nodes where this property is wrong.
    
    Fixes: 6b9a3584c7ed ("ARM: dts: at91: sama5d2: Add missing flexcom definitions")
    Cc: [email protected] # 5.8+
    Signed-off-by: Nicolas Ferre <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Claudiu Beznea <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ARM: dts: microchip: sama7g5: fix uart fifo size to 32 [+ + +]

Author: Nicolas Ferre <[email protected]>
Date:   Wed Dec 31 16:06:00 2025 -0500

    ARM: dts: microchip: sama7g5: fix uart fifo size to 32
    
    [ Upstream commit 5654889a94b0de5ad6ceae3793e7f5e0b61b50b6 ]
    
    On some flexcom nodes related to uart, the fifo sizes were wrong: fix
    them to 32 data.
    
    Fixes: 7540629e2fc7 ("ARM: dts: at91: add sama7g5 SoC DT and sama7g5-ek")
    Cc: [email protected] # 5.15+
    Signed-off-by: Nicolas Ferre <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Claudiu Beznea <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: ak4458: remove the reset operation in probe and remove [+ + +]

Author: Shengjiu Wang <[email protected]>
Date:   Tue Dec 16 15:02:01 2025 +0800

    ASoC: ak4458: remove the reset operation in probe and remove
    
    [ Upstream commit 00b960a83c764208b0623089eb70af3685e3906f ]
    
    The reset_control handler has the reference count for usage, as there is
    reset operation in runtime suspend and resume, then reset operation in
    probe() would cause the reference count of reset not balanced.
    
    Previously add reset operation in probe and remove is to fix the compile
    issue with !CONFIG_PM, as the driver has been update to use
    RUNTIME_PM_OPS(), so that change can be reverted.
    
    Fixes: 1e0dff741b0a ("ASoC: ak4458: remove "reset-gpios" property handler")
    Signed-off-by: Shengjiu Wang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: codecs: lpass-tx-macro: fix SM6115 support [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Fri Oct 31 12:06:58 2025 +0000

    ASoC: codecs: lpass-tx-macro: fix SM6115 support
    
    commit 7c63b5a8ed972a2c8c03d984f6a43349007cea93 upstream.
    
    SM6115 does have soundwire controller in tx. For some reason
    we ended up with this incorrect patch.
    
    Fix this by adding the flag to reflect this in SoC data.
    
    Fixes: 510c46884299 ("ASoC: codecs: lpass-tx-macro: Add SM6115 support")
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: codecs: wcd939x: fix regmap leak on probe failure [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Nov 27 14:50:57 2025 +0100

    ASoC: codecs: wcd939x: fix regmap leak on probe failure
    
    commit 86dc090f737953f16f8dc60c546ae7854690d4f6 upstream.
    
    The soundwire regmap that may be allocated during probe is not freed on
    late probe failures.
    
    Add the missing error handling.
    
    Fixes: be2af391cea0 ("ASoC: codecs: Add WCD939x Soundwire devices driver")
    Cc: [email protected]      # 6.9
    Cc: Neil Armstrong <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: qcom: q6adm: the the copp device only during last instance [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Thu Oct 23 11:24:26 2025 +0100

    ASoC: qcom: q6adm: the the copp device only during last instance
    
    commit 74cc4f3ea4e99262ba0d619c6a4ee33e2cd47f65 upstream.
    
    A matching Common object post processing instance is normally resused
    across multiple streams. However currently we close this on DSP
    even though there is a refcount on this copp object, this can result in
    below error.
    
    q6routing ab00000.remoteproc:glink-edge:apr:service@8:routing: Found Matching Copp 0x0
    qcom-q6adm aprsvc:service:4:8: cmd = 0x10325 return error = 0x2
    q6routing ab00000.remoteproc:glink-edge:apr:service@8:routing: DSP returned error[2]
    q6routing ab00000.remoteproc:glink-edge:apr:service@8:routing: Found Matching Copp 0x0
    qcom-q6adm aprsvc:service:4:8: cmd = 0x10325 return error = 0x2
    q6routing ab00000.remoteproc:glink-edge:apr:service@8:routing: DSP returned error[2]
    qcom-q6adm aprsvc:service:4:8: cmd = 0x10327 return error = 0x2
    qcom-q6adm aprsvc:service:4:8: DSP returned error[2]
    qcom-q6adm aprsvc:service:4:8: Failed to close copp -22
    qcom-q6adm aprsvc:service:4:8: cmd = 0x10327 return error = 0x2
    qcom-q6adm aprsvc:service:4:8: DSP returned error[2]
    qcom-q6adm aprsvc:service:4:8: Failed to close copp -22
    
    Fix this by addressing moving the adm_close to copp_kref destructor
    callback.
    
    Fixes: 7b20b2be51e1 ("ASoC: qdsp6: q6adm: Add q6adm driver")
    Cc: [email protected]
    Reported-by: Martino Facchin <[email protected]>
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Tested-by: Alexey Klimov <[email protected]> # RB5, RB3
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: qcom: q6apm-dai: set flags to reflect correct operation of appl_ptr [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Thu Oct 23 11:24:25 2025 +0100

    ASoC: qcom: q6apm-dai: set flags to reflect correct operation of appl_ptr
    
    commit 950a4e5788fc7dc6e8e93614a7d4d0449c39fb8d upstream.
    
    Driver does not expect the appl_ptr to move backward and requires
    explict sync. Make sure that the userspace does not do appl_ptr rewinds
    by specifying the correct flags in pcm_info.
    
    Without this patch, the result could be a forever loop as current logic assumes
    that appl_ptr can only move forward.
    
    Fixes: 3d4a4411aa8b ("ASoC: q6apm-dai: schedule all available frames to avoid dsp under-runs")
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Tested-by: Alexey Klimov <[email protected]> # RB5, RB3
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: qcom: q6asm-dai: perform correct state check before closing [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Thu Oct 23 11:24:28 2025 +0100

    ASoC: qcom: q6asm-dai: perform correct state check before closing
    
    commit bfbb12dfa144d45575bcfe139a71360b3ce80237 upstream.
    
    Do not stop a q6asm stream if its not started, this can result in
    unnecessary dsp command which will timeout anyway something like below:
    
    q6asm-dai ab00000.remoteproc:glink-edge:apr:service@7:dais: CMD 10bcd timeout
    
    Fix this by correctly checking the state.
    
    Fixes: 2a9e92d371db ("ASoC: qdsp6: q6asm: Add q6asm dai driver")
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Tested-by: Alexey Klimov <[email protected]> # RB5, RB3
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: qcom: qdsp6: q6asm-dai: set 10 ms period and buffer alignment. [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Thu Oct 23 11:24:27 2025 +0100

    ASoC: qcom: qdsp6: q6asm-dai: set 10 ms period and buffer alignment.
    
    commit 81c53b52de21b8d5a3de55ebd06b6bf188bf7efd upstream.
    
    DSP expects the periods to be aligned to fragment sizes, currently
    setting up to hw constriants on periods bytes is not going to work
    correctly as we can endup with periods sizes aligned to 32 bytes however
    not aligned to fragment size.
    
    Update the constriants to use fragment size, and also set at step of
    10ms for period size to accommodate DSP requirements of 10ms latency.
    
    Fixes: 2a9e92d371db ("ASoC: qdsp6: q6asm: Add q6asm dai driver")
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Tested-by: Alexey Klimov <[email protected]> # RB5, RB3
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: qcom: sdw: fix memory leak for sdw_stream_runtime [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Mon Jan 5 10:10:08 2026 -0500

    ASoC: qcom: sdw: fix memory leak for sdw_stream_runtime
    
    [ Upstream commit bcba17279327c6e85dee6a97014dc642e2dc93cc ]
    
    For some reason we endedup allocating sdw_stream_runtime for every cpu dai,
    this has two issues.
    1. we never set snd_soc_dai_set_stream for non soundwire dai, which
       means there is no way that we can free this, resulting in memory leak
    2. startup and shutdown callbacks can be called without
       hw_params callback called. This combination results in memory leak
    because machine driver sruntime array pointer is only set in hw_params
    callback.
    
    Fix this by
     1. adding a helper function to get sdw_runtime for substream
    which can be used by shutdown callback to get hold of sruntime to free.
     2. only allocate sdw_runtime for soundwire dais.
    
    Fixes: d32bac9cb09c ("ASoC: qcom: Add helper for allocating Soundwire stream runtime")
    Cc: Krzysztof Kozlowski <[email protected]>
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Tested-by: Steev Klimaszewski <[email protected]> # Thinkpad X13s
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: renesas: rz-ssi: Fix channel swap issue in full duplex mode [+ + +]

Author: Biju Das <[email protected]>
Date:   Mon Jan 5 15:33:04 2026 +0000

    ASoC: renesas: rz-ssi: Fix channel swap issue in full duplex mode
    
    [ Upstream commit 52a525011cb8e293799a085436f026f2958403f9 ]
    
    The full duplex audio starts with half duplex mode and then switch to
    full duplex mode (another FIFO reset) when both playback/capture
    streams available leading to random audio left/right channel swap
    issue. Fix this channel swap issue by detecting the full duplex
    condition by populating struct dup variable in startup() callback
    and synchronize starting both the play and capture at the same time
    in rz_ssi_start().
    
    Cc: [email protected]
    Fixes: 4f8cd05a4305 ("ASoC: sh: rz-ssi: Add full duplex support")
    Co-developed-by: Tony Tang <[email protected]>
    Signed-off-by: Tony Tang <[email protected]>
    Reviewed-by: Kuninori Morimoto <[email protected]>
    Signed-off-by: Biju Das <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: renesas: rz-ssi: Fix rz_ssi_priv::hw_params_cache::sample_width [+ + +]

Author: Biju Das <[email protected]>
Date:   Mon Jan 5 09:47:43 2026 -0500

    ASoC: renesas: rz-ssi: Fix rz_ssi_priv::hw_params_cache::sample_width
    
    [ Upstream commit 2bae7beda19f3b2dc6ab2062c94df19c27923712 ]
    
    The strm->sample_width is not filled during rz_ssi_dai_hw_params(). This
    wrong value is used for caching sample_width in struct hw_params_cache.
    Fix this issue by replacing 'strm->sample_width'->'params_width(params)'
    in rz_ssi_dai_hw_params(). After this drop the variable sample_width
    from struct rz_ssi_stream as it is unused.
    
    Cc: [email protected]
    Fixes: 4f8cd05a4305 ("ASoC: sh: rz-ssi: Add full duplex support")
    Reviewed-by: Kuninori Morimoto <[email protected]>
    Signed-off-by: Biju Das <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: stm32: sai: fix clk prepare imbalance on probe failure [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Nov 24 11:49:06 2025 +0100

    ASoC: stm32: sai: fix clk prepare imbalance on probe failure
    
    commit 312ec2f0d9d1a5656f76d770bbf1d967e9289aa7 upstream.
    
    Make sure to unprepare the parent clock also on probe failures (e.g.
    probe deferral).
    
    Fixes: a14bf98c045b ("ASoC: stm32: sai: fix possible circular locking")
    Cc: [email protected]      # 5.5
    Cc: Olivier Moysan <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: olivier moysan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: stm32: sai: fix device leak on probe [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Nov 24 11:49:05 2025 +0100

    ASoC: stm32: sai: fix device leak on probe
    
    commit e26ff429eaf10c4ef1bc3dabd9bf27eb54b7e1f4 upstream.
    
    Make sure to drop the reference taken when looking up the sync provider
    device and its driver data during DAI probe on probe failures and on
    unbind.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away so there is no point in keeping the reference.
    
    Fixes: 7dd0d835582f ("ASoC: stm32: sai: simplify sync modes management")
    Fixes: 1c3816a19487 ("ASoC: stm32: sai: add missing put_device()")
    Cc: [email protected]      # 4.16: 1c3816a19487
    Cc: olivier moysan <[email protected]>
    Cc: Wen Yang <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: olivier moysan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ASoC: stm32: sai: fix OF node leak on probe [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Nov 24 11:49:07 2025 +0100

    ASoC: stm32: sai: fix OF node leak on probe
    
    commit 23261f0de09427367e99f39f588e31e2856a690e upstream.
    
    The reference taken to the sync provider OF node when probing the
    platform device is currently only dropped if the set_sync() callback
    fails during DAI probe.
    
    Make sure to drop the reference on platform probe failures (e.g. probe
    deferral) and on driver unbind.
    
    This also avoids a potential use-after-free in case the DAI is ever
    reprobed without first rebinding the platform driver.
    
    Fixes: 5914d285f6b7 ("ASoC: stm32: sai: Add synchronization support")
    Fixes: d4180b4c02e7 ("ASoC: stm32: sai: fix set_sync service")
    Cc: Olivier Moysan <[email protected]>
    Cc: [email protected]      # 4.16: d4180b4c02e7
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: olivier moysan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

blk-mq: skip CPU offline notify on unmapped hctx [+ + +]

Author: Cong Zhang <[email protected]>
Date:   Tue Dec 30 17:17:05 2025 +0800

    blk-mq: skip CPU offline notify on unmapped hctx
    
    [ Upstream commit 10845a105bbcb030647a729f1716c2309da71d33 ]
    
    If an hctx has no software ctx mapped, blk_mq_map_swqueue() never
    allocates tags and leaves hctx->tags NULL. The CPU hotplug offline
    notifier can still run for that hctx, return early since hctx cannot
    hold any requests.
    
    Signed-off-by: Cong Zhang <[email protected]>
    Fixes: bf0beec0607d ("blk-mq: drain I/O when all CPUs in a hctx are offline")
    Reviewed-by: Ming Lei <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

block: Clear BLK_ZONE_WPLUG_PLUGGED when aborting plugged BIOs [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Thu Dec 4 19:59:52 2025 +0900

    block: Clear BLK_ZONE_WPLUG_PLUGGED when aborting plugged BIOs
    
    commit 552c1149af7ac0cffab6fccd13feeaf816dd1f53 upstream.
    
    Commit fe0418eb9bd6 ("block: Prevent potential deadlocks in zone write
    plug error recovery") added a WARN check in disk_put_zone_wplug() to
    verify that when the last reference to a zone write plug is dropped,
    this zone write plug does not have the BLK_ZONE_WPLUG_PLUGGED flag set,
    that is, that it is not plugged.
    
    However, the function disk_zone_wplug_abort(), which is called for zone
    reset and zone finish operations, does not clear this flag after
    emptying a zone write plug BIO list. This can result in the
    disk_put_zone_wplug() warning to trigger if the user (erroneously as
    that is bad pratcice) issues zone reset or zone finish operations while
    the target zone still has plugged BIOs.
    
    Modify disk_put_zone_wplug() to clear the BLK_ZONE_WPLUG_PLUGGED flag.
    And while at it, also add a lockdep annotation to ensure that this
    function is called with the zone write plug spinlock held.
    
    Fixes: fe0418eb9bd6 ("block: Prevent potential deadlocks in zone write plug error recovery")
    Cc: [email protected]
    Signed-off-by: Damien Le Moal <[email protected]>
    Reviewed-by: Niklas Cassel <[email protected]>
    Reviewed-by: Johannes Thumshirn <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

block: fix NULL pointer dereference in blk_zone_reset_all_bio_endio() [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Thu Nov 13 22:40:26 2025 +0900

    block: fix NULL pointer dereference in blk_zone_reset_all_bio_endio()
    
    commit c2b8d20628ca789640f64074a642f9440eefc623 upstream.
    
    For zoned block devices that do not need zone write plugs (e.g. most
    device mapper devices that support zones), the disk hash table of zone
    write plugs is NULL. For such devices, blk_zone_reset_all_bio_endio()
    should not attempt to scan this has table as that causes a NULL pointer
    dereference.
    
    Fix this by checking that the disk does have zone write plugs using the
    atomic counter. This is equivalent to checking for a non-NULL hash table
    but has the advantage to also speed up the execution of
    blk_zone_reset_all_bio_endio() for devices that do use zone write plugs
    but do not have any plug in the hash table (e.g. a disk with only full
    zones).
    
    Fixes: efae226c2ef1 ("block: handle zone management operations completions")
    Reported-by: Shin'ichiro Kawasaki <[email protected]>
    Signed-off-by: Damien Le Moal <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

block: freeze queue when updating zone resources [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Wed Dec 31 18:40:07 2025 -0500

    block: freeze queue when updating zone resources
    
    [ Upstream commit bba4322e3f303b2d656e748be758320b567f046f ]
    
    Modify disk_update_zone_resources() to freeze the device queue before
    updating the number of zones, zone capacity and other zone related
    resources. The locking order resulting from the call to
    queue_limits_commit_update_frozen() is preserved, that is, the queue
    limits lock is first taken by calling queue_limits_start_update() before
    freezing the queue, and the queue is unfrozen after executing
    queue_limits_commit_update(), which replaces the call to
    queue_limits_commit_update_frozen().
    
    This change ensures that there are no in-flights I/Os when the zone
    resources are updated due to a zone revalidation. In case of error when
    the limits are applied, directly call disk_free_zone_resources() from
    disk_update_zone_resources() while the disk queue is still frozen to
    avoid needing to freeze & unfreeze the queue again in
    blk_revalidate_disk_zones(), thus simplifying that function code a
    little.
    
    Fixes: 0b83c86b444a ("block: Prevent potential deadlock in blk_revalidate_disk_zones()")
    Cc: [email protected]
    Signed-off-by: Damien Le Moal <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Johannes Thumshirn <[email protected]>
    Reviewed-by: Chaitanya Kulkarni <[email protected]>
    Reviewed-by: Hannes Reinecke <[email protected]>
    Reviewed-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    [ adapted blk_mq_freeze_queue/unfreeze_queue calls to single-argument void API ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

block: handle zone management operations completions [+ + +]

Author: Damien Le Moal <[email protected]>
Date:   Mon Jan 5 10:11:02 2026 -0500

    block: handle zone management operations completions
    
    [ Upstream commit efae226c2ef19528ffd81d29ba0eecf1b0896ca2 ]
    
    The functions blk_zone_wplug_handle_reset_or_finish() and
    blk_zone_wplug_handle_reset_all() both modify the zone write pointer
    offset of zone write plugs that are the target of a reset, reset all or
    finish zone management operation. However, these functions do this
    modification before the BIO is executed. So if the zone operation fails,
    the modified zone write pointer offsets become invalid.
    
    Avoid this by modifying the zone write pointer offset of a zone write
    plug that is the target of a zone management operation when the
    operation completes. To do so, modify blk_zone_bio_endio() to call the
    new function blk_zone_mgmt_bio_endio() which in turn calls the functions
    blk_zone_reset_all_bio_endio(), blk_zone_reset_bio_endio() or
    blk_zone_finish_bio_endio() depending on the operation of the completed
    BIO, to modify a zone write plug write pointer offset accordingly.
    These functions are called only if the BIO execution was successful.
    
    Fixes: dd291d77cc90 ("block: Introduce zone write plugging")
    Cc: [email protected]
    Signed-off-by: Damien Le Moal <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Johannes Thumshirn <[email protected]>
    Reviewed-by: Chaitanya Kulkarni <[email protected]>
    Reviewed-by: Hannes Reinecke <[email protected]>
    Reviewed-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    [ adapted bdev_zone_is_seq() check to disk_zone_is_conv() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

block: rate-limit capacity change info log [+ + +]

Author: Li Chen <[email protected]>
Date:   Mon Nov 17 13:34:07 2025 +0800

    block: rate-limit capacity change info log
    
    commit 3179a5f7f86bcc3acd5d6fb2a29f891ef5615852 upstream.
    
    loop devices under heavy stress-ng loop streessor can trigger many
    capacity change events in a short time. Each event prints an info
    message from set_capacity_and_notify(), flooding the console and
    contributing to soft lockups on slow consoles.
    
    Switch the printk in set_capacity_and_notify() to
    pr_info_ratelimited() so frequent capacity changes do not spam
    the log while still reporting occasional changes.
    
    Cc: [email protected]
    Signed-off-by: Li Chen <[email protected]>
    Reviewed-by: Chaitanya Kulkarni <[email protected]>
    Reviewed-by: Bart Van Assche <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

block: rnbd-clt: Fix leaked ID in init_dev() [+ + +]

Author: Thomas Fourier <[email protected]>
Date:   Wed Dec 17 10:36:48 2025 +0100

    block: rnbd-clt: Fix leaked ID in init_dev()
    
    [ Upstream commit c9b5645fd8ca10f310e41b07540f98e6a9720f40 ]
    
    If kstrdup() fails in init_dev(), then the newly allocated ID is lost.
    
    Fixes: 64e8a6ece1a5 ("block/rnbd-clt: Dynamically alloc buffer for pathname & blk_symlink_name")
    Signed-off-by: Thomas Fourier <[email protected]>
    Acked-by: Jack Wang <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

block: rnbd-clt: Fix signedness bug in init_dev() [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Sat Dec 20 11:46:10 2025 +0300

    block: rnbd-clt: Fix signedness bug in init_dev()
    
    [ Upstream commit 1ddb815fdfd45613c32e9bd1f7137428f298e541 ]
    
    The "dev->clt_device_id" variable is set using ida_alloc_max() which
    returns an int and in particular it returns negative error codes.
    Change the type from u32 to int to fix the error checking.
    
    Fixes: c9b5645fd8ca ("block: rnbd-clt: Fix leaked ID in init_dev()")
    Signed-off-by: Dan Carpenter <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: Add new VID/PID 0x0489/0xE12F for RTL8852BE-VT [+ + +]

Author: Max Chou <[email protected]>
Date:   Wed Nov 5 13:50:39 2025 +0800

    Bluetooth: btusb: Add new VID/PID 0x0489/0xE12F for RTL8852BE-VT
    
    [ Upstream commit 32caa197b9b603e20f49fd3a0dffecd0cd620499 ]
    
    Add the support ID(0x0489, 0xE12F) to usb_device_id table for
    Realtek RTL8852BE-VT.
    
    The device info from /sys/kernel/debug/usb/devices as below.
    
    T:  Bus=04 Lev=02 Prnt=02 Port=05 Cnt=01 Dev#= 86 Spd=12   MxCh= 0
    D:  Ver= 1.00 Cls=e0(wlcon) Sub=01 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=0489 ProdID=e12f Rev= 0.00
    S:  Manufacturer=Realtek
    S:  Product=Bluetooth Radio
    S:  SerialNumber=00e04c000001
    C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=500mA
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=1ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    E:  Ad=82(I) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    
    Signed-off-by: Max Chou <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: Add new VID/PID 13d3/3533 for RTL8821CE [+ + +]

Author: Gongwei Li <[email protected]>
Date:   Wed Nov 19 15:33:38 2025 +0800

    Bluetooth: btusb: Add new VID/PID 13d3/3533 for RTL8821CE
    
    [ Upstream commit 525459da4bd62a81142fea3f3d52188ceb4d8907 ]
    
    Add VID 13d3 & PID 3533 for Realtek RTL8821CE USB Bluetooth chip.
    
    The information in /sys/kernel/debug/usb/devices about the Bluetooth
    device is listed as the below.
    
    T:  Bus=01 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#=  2 Spd=12   MxCh= 0
    D:  Ver= 1.10 Cls=e0(wlcon) Sub=01 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=13d3 ProdID=3533 Rev= 1.10
    S:  Manufacturer=Realtek
    S:  Product=Bluetooth Radio
    S:  SerialNumber=00e04c000001
    C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=500mA
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=1ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    E:  Ad=82(I) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    
    Signed-off-by: Gongwei Li <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: Add new VID/PID 2b89/6275 for RTL8761BUV [+ + +]

Author: Chingbin Li <[email protected]>
Date:   Mon Oct 6 16:46:47 2025 +0800

    Bluetooth: btusb: Add new VID/PID 2b89/6275 for RTL8761BUV
    
    [ Upstream commit 8dbbb5423c0802ec21266765de80fd491868fab1 ]
    
    Add VID 2b89 & PID 6275 for Realtek RTL8761BUV USB Bluetooth chip.
    
    The information in /sys/kernel/debug/usb/devices about the Bluetooth
    device is listed as the below.
    
    T:  Bus=01 Lev=01 Prnt=01 Port=02 Cnt=01 Dev#=  6 Spd=12   MxCh= 0
    D:  Ver= 1.10 Cls=e0(wlcon) Sub=01 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=2b89 ProdID=6275 Rev= 2.00
    S:  Manufacturer=Realtek
    S:  Product=Bluetooth Radio
    S:  SerialNumber=00E04C239987
    C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=500mA
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=1ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    E:  Ad=82(I) Atr=02(Bulk) MxPS=  64 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    
    Signed-off-by: Chingbin Li <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: MT7920: Add VID/PID 0489/e135 [+ + +]

Author: Chris Lu <[email protected]>
Date:   Wed Oct 15 11:31:49 2025 +0800

    Bluetooth: btusb: MT7920: Add VID/PID 0489/e135
    
    [ Upstream commit c126f98c011f5796ba118ef2093122d02809d30d ]
    
    Add VID 0489 & PID e135 for MediaTek MT7920 USB Bluetooth chip.
    
    The information in /sys/kernel/debug/usb/devices about the Bluetooth
    device is listed as the below.
    
    T:  Bus=06 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#=  2 Spd=480  MxCh= 0
    D:  Ver= 2.10 Cls=ef(misc ) Sub=02 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=0489 ProdID=e135 Rev= 1.00
    S:  Manufacturer=MediaTek Inc.
    S:  Product=Wireless_Device
    S:  SerialNumber=000000000
    C:* #Ifs= 3 Cfg#= 1 Atr=e0 MxPwr=100mA
    A:  FirstIf#= 0 IfCount= 3 Cls=e0(wlcon) Sub=01 Prot=01
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=125us
    E:  Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    I:  If#= 1 Alt= 6 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    I:* If#= 2 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=(none)
    E:  Ad=8a(I) Atr=03(Int.) MxPS=  64 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS=  64 Ivl=125us
    I:  If#= 2 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=(none)
    E:  Ad=8a(I) Atr=03(Int.) MxPS=  64 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS=  64 Ivl=125us
    
    Signed-off-by: Chris Lu <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: MT7922: Add VID/PID 0489/e170 [+ + +]

Author: Chris Lu <[email protected]>
Date:   Wed Oct 15 11:31:50 2025 +0800

    Bluetooth: btusb: MT7922: Add VID/PID 0489/e170
    
    [ Upstream commit 5a6700a31c953af9a17a7e2681335f31d922614d ]
    
    Add VID 0489 & PID e170 for MediaTek MT7922 USB Bluetooth chip.
    
    The information in /sys/kernel/debug/usb/devices about the Bluetooth
    device is listed as the below.
    
    T:  Bus=06 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#=  2 Spd=480  MxCh= 0
    D:  Ver= 2.10 Cls=ef(misc ) Sub=02 Prot=01 MxPS=64 #Cfgs=  1
    P:  Vendor=0489 ProdID=e170 Rev= 1.00
    S:  Manufacturer=MediaTek Inc.
    S:  Product=Wireless_Device
    S:  SerialNumber=000000000
    C:* #Ifs= 3 Cfg#= 1 Atr=e0 MxPwr=100mA
    A:  FirstIf#= 0 IfCount= 3 Cls=e0(wlcon) Sub=01 Prot=01
    I:* If#= 0 Alt= 0 #EPs= 3 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=81(I) Atr=03(Int.) MxPS=  16 Ivl=125us
    E:  Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    E:  Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms
    I:* If#= 1 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   0 Ivl=1ms
    I:  If#= 1 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=   9 Ivl=1ms
    I:  If#= 1 Alt= 2 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  17 Ivl=1ms
    I:  If#= 1 Alt= 3 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  25 Ivl=1ms
    I:  If#= 1 Alt= 4 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  33 Ivl=1ms
    I:  If#= 1 Alt= 5 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  49 Ivl=1ms
    I:  If#= 1 Alt= 6 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=btusb
    E:  Ad=83(I) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    E:  Ad=03(O) Atr=01(Isoc) MxPS=  63 Ivl=1ms
    I:* If#= 2 Alt= 0 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=(none)
    E:  Ad=8a(I) Atr=03(Int.) MxPS=  64 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS=  64 Ivl=125us
    I:  If#= 2 Alt= 1 #EPs= 2 Cls=e0(wlcon) Sub=01 Prot=01 Driver=(none)
    E:  Ad=8a(I) Atr=03(Int.) MxPS= 512 Ivl=125us
    E:  Ad=0a(O) Atr=03(Int.) MxPS= 512 Ivl=125us
    
    Signed-off-by: Chris Lu <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Bluetooth: btusb: revert use of devm_kzalloc in btusb [+ + +]

Author: Raphael Pinsonneault-Thibeault <[email protected]>
Date:   Wed Dec 10 11:02:28 2025 -0500

    Bluetooth: btusb: revert use of devm_kzalloc in btusb
    
    [ Upstream commit 252714f1e8bdd542025b16321c790458014d6880 ]
    
    This reverts commit 98921dbd00c4e ("Bluetooth: Use devm_kzalloc in
    btusb.c file").
    
    In btusb_probe(), we use devm_kzalloc() to allocate the btusb data. This
    ties the lifetime of all the btusb data to the binding of a driver to
    one interface, INTF. In a driver that binds to other interfaces, ISOC
    and DIAG, this is an accident waiting to happen.
    
    The issue is revealed in btusb_disconnect(), where calling
    usb_driver_release_interface(&btusb_driver, data->intf) will have devm
    free the data that is also being used by the other interfaces of the
    driver that may not be released yet.
    
    To fix this, revert the use of devm and go back to freeing memory
    explicitly.
    
    Fixes: 98921dbd00c4e ("Bluetooth: Use devm_kzalloc in btusb.c file")
    Signed-off-by: Raphael Pinsonneault-Thibeault <[email protected]>
    Signed-off-by: Luiz Augusto von Dentz <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bnxt_en: Fix XDP_TX path [+ + +]

Author: Michael Chan <[email protected]>
Date:   Tue Dec 2 16:30:24 2025 -0800

    bnxt_en: Fix XDP_TX path
    
    [ Upstream commit 0373d5c387f24de749cc22e694a14b3a7c7eb515 ]
    
    For XDP_TX action in bnxt_rx_xdp(), clearing of the event flags is not
    correct.  __bnxt_poll_work() -> bnxt_rx_pkt() -> bnxt_rx_xdp() may be
    looping within NAPI and some event flags may be set in earlier
    iterations.  In particular, if BNXT_TX_EVENT is set earlier indicating
    some XDP_TX packets are ready and pending, it will be cleared if it is
    XDP_TX action again.  Normally, we will set BNXT_TX_EVENT again when we
    successfully call __bnxt_xmit_xdp().  But if the TX ring has no more
    room, the flag will not be set.  This will cause the TX producer to be
    ahead but the driver will not hit the TX doorbell.
    
    For multi-buf XDP_TX, there is no need to clear the event flags and set
    BNXT_AGG_EVENT.  The BNXT_AGG_EVENT flag should have been set earlier in
    bnxt_rx_pkt().
    
    The visible symptom of this is that the RX ring associated with the
    TX XDP ring will eventually become empty and all packets will be dropped.
    Because this condition will cause the driver to not refill the RX ring
    seeing that the TX ring has forever pending XDP_TX packets.
    
    The fix is to only clear BNXT_RX_EVENT when we have successfully
    called __bnxt_xmit_xdp().
    
    Fixes: 7f0a168b0441 ("bnxt_en: Add completion ring pointer in TX and RX ring structures")
    Reported-by: Pavel Dubovitsky <[email protected]>
    Reviewed-by: Andy Gospodarek <[email protected]>
    Reviewed-by: Pavan Chebbi <[email protected]>
    Reviewed-by: Kalesh AP <[email protected]>
    Signed-off-by: Michael Chan <[email protected]>
    Reviewed-by: Jacob Keller <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

bpf, arm64: Do not audit capability check in do_jit() [+ + +]

Author: Ondrej Mosnacek <[email protected]>
Date:   Thu Dec 4 13:59:16 2025 +0100

    bpf, arm64: Do not audit capability check in do_jit()
    
    [ Upstream commit 189e5deb944a6f9c7992355d60bffd8ec2e54a9c ]
    
    Analogically to the x86 commit 881a9c9cb785 ("bpf: Do not audit
    capability check in do_jit()"), change the capable() call to
    ns_capable_noaudit() in order to avoid spurious SELinux denials in audit
    log.
    
    The commit log from that commit applies here as well:
    """
    The failure of this check only results in a security mitigation being
    applied, slightly affecting performance of the compiled BPF program. It
    doesn't result in a failed syscall, an thus auditing a failed LSM
    permission check for it is unwanted. For example with SELinux, it causes
    a denial to be reported for confined processes running as root, which
    tends to be flagged as a problem to be fixed in the policy. Yet
    dontauditing or allowing CAP_SYS_ADMIN to the domain may not be
    desirable, as it would allow/silence also other checks - either going
    against the principle of least privilege or making debugging potentially
    harder.
    
    Fix it by changing it from capable() to ns_capable_noaudit(), which
    instructs the LSMs to not audit the resulting denials.
    """
    
    Fixes: f300769ead03 ("arm64: bpf: Only mitigate cBPF programs loaded by unprivileged users")
    Signed-off-by: Ondrej Mosnacek <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

broadcom: b44: prevent uninitialized value usage [+ + +]

Author: Alexey Simakov <[email protected]>
Date:   Fri Dec 5 18:58:16 2025 +0300

    broadcom: b44: prevent uninitialized value usage
    
    [ Upstream commit 50b3db3e11864cb4e18ff099cfb38e11e7f87a68 ]
    
    On execution path with raised B44_FLAG_EXTERNAL_PHY, b44_readphy()
    leaves bmcr value uninitialized and it is used later in the code.
    
    Add check of this flag at the beginning of the b44_nway_reset() and
    exit early of the function with restarting autonegotiation if an
    external PHY is used.
    
    Fixes: 753f492093da ("[B44]: port to native ssb support")
    Reviewed-by: Jonas Gorski <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Signed-off-by: Alexey Simakov <[email protected]>
    Reviewed-by: Michael Chan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: do not skip logging new dentries when logging a new name [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Wed Dec 3 17:02:00 2025 +0000

    btrfs: do not skip logging new dentries when logging a new name
    
    [ Upstream commit 5630f7557de61264ccb4f031d4734a1a97eaed16 ]
    
    When we are logging a directory and the log context indicates that we
    are logging a new name for some other file (that is or was inside that
    directory), we skip logging the inodes for new dentries in the directory.
    
    This is ok most of the time, but if after the rename or link operation
    that triggered the logging of that directory, we have an explicit fsync
    of that directory without the directory inode being evicted and reloaded,
    we end up never logging the inodes for the new dentries that we found
    during the new name logging, as the next directory fsync will only process
    dentries that were added after the last time we logged the directory (we
    are doing an incremental directory logging).
    
    So make sure we always log new dentries for a directory even if we are
    in a context of logging a new name.
    
    We started skipping logging inodes for new dentries as of commit
    c48792c6ee7a ("btrfs: do not log new dentries when logging that a new name
    exists") and it was fine back then, because when logging a directory we
    always iterated over all the directory entries (for leaves changed in the
    current transaction) so a subsequent fsync would always log anything that
    was previously skipped while logging a directory when logging a new name
    (with btrfs_log_new_name()). But later support for incrementally logging
    a directory was added in commit dc2872247ec0 ("btrfs: keep track of the
    last logged keys when logging a directory"), to avoid checking all dir
    items every time we log a directory, so the check to skip dentry logging
    added in the first commit should have been removed when the incremental
    support for logging a directory was added.
    
    A test case for fstests will follow soon.
    
    Reported-by: Vyacheslav Kovalevsky <[email protected]>
    Link: https://lore.kernel.org/linux-btrfs/[email protected]/
    Fixes: dc2872247ec0 ("btrfs: keep track of the last logged keys when logging a directory")
    Reviewed-by: Boris Burkov <[email protected]>
    Signed-off-by: Filipe Manana <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: don't log conflicting inode if it's a dir moved in the current transaction [+ + +]

Author: Filipe Manana <[email protected]>
Date:   Thu Nov 27 16:35:59 2025 +0000

    btrfs: don't log conflicting inode if it's a dir moved in the current transaction
    
    commit 266273eaf4d99475f1ae57f687b3e42bc71ec6f0 upstream.
    
    We can't log a conflicting inode if it's a directory and it was moved
    from one parent directory to another parent directory in the current
    transaction, as this can result an attempt to have a directory with
    two hard links during log replay, one for the old parent directory and
    another for the new parent directory.
    
    The following scenario triggers that issue:
    
    1) We have directories "dir1" and "dir2" created in a past transaction.
       Directory "dir1" has inode A as its parent directory;
    
    2) We move "dir1" to some other directory;
    
    3) We create a file with the name "dir1" in directory inode A;
    
    4) We fsync the new file. This results in logging the inode of the new file
       and the inode for the directory "dir1" that was previously moved in the
       current transaction. So the log tree has the INODE_REF item for the
       new location of "dir1";
    
    5) We move the new file to some other directory. This results in updating
       the log tree to included the new INODE_REF for the new location of the
       file and removes the INODE_REF for the old location. This happens
       during the rename when we call btrfs_log_new_name();
    
    6) We fsync the file, and that persists the log tree changes done in the
       previous step (btrfs_log_new_name() only updates the log tree in
       memory);
    
    7) We have a power failure;
    
    8) Next time the fs is mounted, log replay happens and when processing
       the inode for directory "dir1" we find a new INODE_REF and add that
       link, but we don't remove the old link of the inode since we have
       not logged the old parent directory of the directory inode "dir1".
    
    As a result after log replay finishes when we trigger writeback of the
    subvolume tree's extent buffers, the tree check will detect that we have
    a directory a hard link count of 2 and we get a mount failure.
    The errors and stack traces reported in dmesg/syslog are like this:
    
       [ 3845.729764] BTRFS info (device dm-0): start tree-log replay
       [ 3845.730304] page: refcount:3 mapcount:0 mapping:000000005c8a3027 index:0x1d00 pfn:0x11510c
       [ 3845.731236] memcg:ffff9264c02f4e00
       [ 3845.731751] aops:btree_aops [btrfs] ino:1
       [ 3845.732300] flags: 0x17fffc00000400a(uptodate|private|writeback|node=0|zone=2|lastcpupid=0x1ffff)
       [ 3845.733346] raw: 017fffc00000400a 0000000000000000 dead000000000122 ffff9264d978aea8
       [ 3845.734265] raw: 0000000000001d00 ffff92650e6d4738 00000003ffffffff ffff9264c02f4e00
       [ 3845.735305] page dumped because: eb page dump
       [ 3845.735981] BTRFS critical (device dm-0): corrupt leaf: root=5 block=30408704 slot=6 ino=257, invalid nlink: has 2 expect no more than 1 for dir
       [ 3845.737786] BTRFS info (device dm-0): leaf 30408704 gen 10 total ptrs 17 free space 14881 owner 5
       [ 3845.737789] BTRFS info (device dm-0): refs 4 lock_owner 0 current 30701
       [ 3845.737792]       item 0 key (256 INODE_ITEM 0) itemoff 16123 itemsize 160
       [ 3845.737794]               inode generation 3 transid 9 size 16 nbytes 16384
       [ 3845.737795]               block group 0 mode 40755 links 1 uid 0 gid 0
       [ 3845.737797]               rdev 0 sequence 2 flags 0x0
       [ 3845.737798]               atime 1764259517.0
       [ 3845.737800]               ctime 1764259517.572889464
       [ 3845.737801]               mtime 1764259517.572889464
       [ 3845.737802]               otime 1764259517.0
       [ 3845.737803]       item 1 key (256 INODE_REF 256) itemoff 16111 itemsize 12
       [ 3845.737805]               index 0 name_len 2
       [ 3845.737807]       item 2 key (256 DIR_ITEM 2363071922) itemoff 16077 itemsize 34
       [ 3845.737808]               location key (257 1 0) type 2
       [ 3845.737810]               transid 9 data_len 0 name_len 4
       [ 3845.737811]       item 3 key (256 DIR_ITEM 2676584006) itemoff 16043 itemsize 34
       [ 3845.737813]               location key (258 1 0) type 2
       [ 3845.737814]               transid 9 data_len 0 name_len 4
       [ 3845.737815]       item 4 key (256 DIR_INDEX 2) itemoff 16009 itemsize 34
       [ 3845.737816]               location key (257 1 0) type 2
       [ 3845.737818]               transid 9 data_len 0 name_len 4
       [ 3845.737819]       item 5 key (256 DIR_INDEX 3) itemoff 15975 itemsize 34
       [ 3845.737820]               location key (258 1 0) type 2
       [ 3845.737821]               transid 9 data_len 0 name_len 4
       [ 3845.737822]       item 6 key (257 INODE_ITEM 0) itemoff 15815 itemsize 160
       [ 3845.737824]               inode generation 9 transid 10 size 6 nbytes 0
       [ 3845.737825]               block group 0 mode 40755 links 2 uid 0 gid 0
       [ 3845.737826]               rdev 0 sequence 1 flags 0x0
       [ 3845.737827]               atime 1764259517.572889464
       [ 3845.737828]               ctime 1764259517.572889464
       [ 3845.737830]               mtime 1764259517.572889464
       [ 3845.737831]               otime 1764259517.572889464
       [ 3845.737832]       item 7 key (257 INODE_REF 256) itemoff 15801 itemsize 14
       [ 3845.737833]               index 2 name_len 4
       [ 3845.737834]       item 8 key (257 INODE_REF 258) itemoff 15787 itemsize 14
       [ 3845.737836]               index 2 name_len 4
       [ 3845.737837]       item 9 key (257 DIR_ITEM 2507850652) itemoff 15754 itemsize 33
       [ 3845.737838]               location key (259 1 0) type 1
       [ 3845.737839]               transid 10 data_len 0 name_len 3
       [ 3845.737840]       item 10 key (257 DIR_INDEX 2) itemoff 15721 itemsize 33
       [ 3845.737842]               location key (259 1 0) type 1
       [ 3845.737843]               transid 10 data_len 0 name_len 3
       [ 3845.737844]       item 11 key (258 INODE_ITEM 0) itemoff 15561 itemsize 160
       [ 3845.737846]               inode generation 9 transid 10 size 8 nbytes 0
       [ 3845.737847]               block group 0 mode 40755 links 1 uid 0 gid 0
       [ 3845.737848]               rdev 0 sequence 1 flags 0x0
       [ 3845.737849]               atime 1764259517.572889464
       [ 3845.737850]               ctime 1764259517.572889464
       [ 3845.737851]               mtime 1764259517.572889464
       [ 3845.737852]               otime 1764259517.572889464
       [ 3845.737853]       item 12 key (258 INODE_REF 256) itemoff 15547 itemsize 14
       [ 3845.737855]               index 3 name_len 4
       [ 3845.737856]       item 13 key (258 DIR_ITEM 1843588421) itemoff 15513 itemsize 34
       [ 3845.737857]               location key (257 1 0) type 2
       [ 3845.737858]               transid 10 data_len 0 name_len 4
       [ 3845.737860]       item 14 key (258 DIR_INDEX 2) itemoff 15479 itemsize 34
       [ 3845.737861]               location key (257 1 0) type 2
       [ 3845.737862]               transid 10 data_len 0 name_len 4
       [ 3845.737863]       item 15 key (259 INODE_ITEM 0) itemoff 15319 itemsize 160
       [ 3845.737865]               inode generation 10 transid 10 size 0 nbytes 0
       [ 3845.737866]               block group 0 mode 100600 links 1 uid 0 gid 0
       [ 3845.737867]               rdev 0 sequence 2 flags 0x0
       [ 3845.737868]               atime 1764259517.580874966
       [ 3845.737869]               ctime 1764259517.586121869
       [ 3845.737870]               mtime 1764259517.580874966
       [ 3845.737872]               otime 1764259517.580874966
       [ 3845.737873]       item 16 key (259 INODE_REF 257) itemoff 15306 itemsize 13
       [ 3845.737874]               index 2 name_len 3
       [ 3845.737875] BTRFS error (device dm-0): block=30408704 write time tree block corruption detected
       [ 3845.739448] ------------[ cut here ]------------
       [ 3845.740092] WARNING: CPU: 5 PID: 30701 at fs/btrfs/disk-io.c:335 btree_csum_one_bio+0x25a/0x270 [btrfs]
       [ 3845.741439] Modules linked in: btrfs dm_flakey crc32c_cryptoapi (...)
       [ 3845.750626] CPU: 5 UID: 0 PID: 30701 Comm: mount Tainted: G        W           6.18.0-rc6-btrfs-next-218+ #1 PREEMPT(full)
       [ 3845.752414] Tainted: [W]=WARN
       [ 3845.752828] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.2-0-gea1b7a073390-prebuilt.qemu.org 04/01/2014
       [ 3845.754499] RIP: 0010:btree_csum_one_bio+0x25a/0x270 [btrfs]
       [ 3845.755460] Code: 31 f6 48 89 (...)
       [ 3845.758685] RSP: 0018:ffffa8d9c5677678 EFLAGS: 00010246
       [ 3845.759450] RAX: 0000000000000000 RBX: ffff92650e6d4738 RCX: 0000000000000000
       [ 3845.760309] RDX: 0000000000000000 RSI: ffffffff9aab45b9 RDI: ffff9264c4748000
       [ 3845.761239] RBP: ffff9264d4324000 R08: 0000000000000000 R09: ffffa8d9c5677468
       [ 3845.762607] R10: ffff926bdc1fffa8 R11: 0000000000000003 R12: ffffa8d9c5677680
       [ 3845.764099] R13: 0000000000004000 R14: ffff9264dd624000 R15: ffff9264d978aba8
       [ 3845.765094] FS:  00007f751fa5a840(0000) GS:ffff926c42a82000(0000) knlGS:0000000000000000
       [ 3845.766226] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
       [ 3845.766970] CR2: 0000558df1815380 CR3: 000000010ed88003 CR4: 0000000000370ef0
       [ 3845.768009] Call Trace:
       [ 3845.768392]  <TASK>
       [ 3845.768714]  btrfs_submit_bbio+0x6ee/0x7f0 [btrfs]
       [ 3845.769640]  ? write_one_eb+0x28e/0x340 [btrfs]
       [ 3845.770588]  btree_write_cache_pages+0x2f0/0x550 [btrfs]
       [ 3845.771286]  ? alloc_extent_state+0x19/0x100 [btrfs]
       [ 3845.771967]  ? merge_next_state+0x1a/0x90 [btrfs]
       [ 3845.772586]  ? set_extent_bit+0x233/0x8b0 [btrfs]
       [ 3845.773198]  ? xas_load+0x9/0xc0
       [ 3845.773589]  ? xas_find+0x14d/0x1a0
       [ 3845.773969]  do_writepages+0xc6/0x160
       [ 3845.774367]  filemap_fdatawrite_wbc+0x48/0x60
       [ 3845.775003]  __filemap_fdatawrite_range+0x5b/0x80
       [ 3845.775902]  btrfs_write_marked_extents+0x61/0x170 [btrfs]
       [ 3845.776707]  btrfs_write_and_wait_transaction+0x4e/0xc0 [btrfs]
       [ 3845.777379]  ? _raw_spin_unlock_irqrestore+0x23/0x40
       [ 3845.777923]  btrfs_commit_transaction+0x5ea/0xd20 [btrfs]
       [ 3845.778551]  ? _raw_spin_unlock+0x15/0x30
       [ 3845.778986]  ? release_extent_buffer+0x34/0x160 [btrfs]
       [ 3845.779659]  btrfs_recover_log_trees+0x7a3/0x7c0 [btrfs]
       [ 3845.780416]  ? __pfx_replay_one_buffer+0x10/0x10 [btrfs]
       [ 3845.781499]  open_ctree+0x10bb/0x15f0 [btrfs]
       [ 3845.782194]  btrfs_get_tree.cold+0xb/0x16c [btrfs]
       [ 3845.782764]  ? fscontext_read+0x15c/0x180
       [ 3845.783202]  ? rw_verify_area+0x50/0x180
       [ 3845.783667]  vfs_get_tree+0x25/0xd0
       [ 3845.784047]  vfs_cmd_create+0x59/0xe0
       [ 3845.784458]  __do_sys_fsconfig+0x4f6/0x6b0
       [ 3845.784914]  do_syscall_64+0x50/0x1220
       [ 3845.785340]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
       [ 3845.785980] RIP: 0033:0x7f751fc7f4aa
       [ 3845.786759] Code: 73 01 c3 48 (...)
       [ 3845.789951] RSP: 002b:00007ffcdba45dc8 EFLAGS: 00000246 ORIG_RAX: 00000000000001af
       [ 3845.791402] RAX: ffffffffffffffda RBX: 000055ccc8291c20 RCX: 00007f751fc7f4aa
       [ 3845.792688] RDX: 0000000000000000 RSI: 0000000000000006 RDI: 0000000000000003
       [ 3845.794308] RBP: 000055ccc8292120 R08: 0000000000000000 R09: 0000000000000000
       [ 3845.795829] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
       [ 3845.797183] R13: 00007f751fe11580 R14: 00007f751fe1326c R15: 00007f751fdf8a23
       [ 3845.798633]  </TASK>
       [ 3845.799067] ---[ end trace 0000000000000000 ]---
       [ 3845.800215] BTRFS: error (device dm-0) in btrfs_commit_transaction:2553: errno=-5 IO failure (Error while writing out transaction)
       [ 3845.801860] BTRFS warning (device dm-0 state E): Skipping commit of aborted transaction.
       [ 3845.802815] BTRFS error (device dm-0 state EA): Transaction aborted (error -5)
       [ 3845.803728] BTRFS: error (device dm-0 state EA) in cleanup_transaction:2036: errno=-5 IO failure
       [ 3845.805374] BTRFS: error (device dm-0 state EA) in btrfs_replay_log:2083: errno=-5 IO failure (Failed to recover log tree)
       [ 3845.807919] BTRFS error (device dm-0 state EA): open_ctree failed: -5
    
    Fix this by never logging a conflicting inode that is a directory and was
    moved in the current transaction (its last_unlink_trans equals the current
    transaction) and instead fallback to a transaction commit.
    
    A test case for fstests will follow soon.
    
    Reported-by: Vyacheslav Kovalevsky <[email protected]>
    Link: https://lore.kernel.org/linux-btrfs/[email protected]/
    CC: [email protected] # 6.1+
    Signed-off-by: Filipe Manana <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: don't rewrite ret from inode_permission [+ + +]

Author: Josef Bacik <[email protected]>
Date:   Mon Dec 29 16:39:04 2025 -0500

    btrfs: don't rewrite ret from inode_permission
    
    [ Upstream commit 0185c2292c600993199bc6b1f342ad47a9e8c678 ]
    
    In our user safe ino resolve ioctl we'll just turn any ret into -EACCES
    from inode_permission().  This is redundant, and could potentially be
    wrong if we had an ENOMEM in the security layer or some such other
    error, so simply return the actual return value.
    
    Note: The patch was taken from v5 of fscrypt patchset
    (https://lore.kernel.org/linux-btrfs/[email protected]/)
    which was handled over time by various people: Omar Sandoval, Sweet Tea
    Dorminy, Josef Bacik.
    
    Fixes: 23d0b79dfaed ("btrfs: Add unprivileged version of ino_lookup ioctl")
    CC: [email protected] # 5.4+
    Reviewed-by: Johannes Thumshirn <[email protected]>
    Signed-off-by: Josef Bacik <[email protected]>
    Signed-off-by: Daniel Vacek <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    [ add note ]
    Signed-off-by: David Sterba <[email protected]>
    [ Adjust context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

btrfs: fix a potential path leak in print_data_reloc_error() [+ + +]

Author: Qu Wenruo <[email protected]>
Date:   Tue Nov 25 18:49:56 2025 +1030

    btrfs: fix a potential path leak in print_data_reloc_error()
    
    [ Upstream commit 313ef70a9f0f637a09d9ef45222f5bdcf30a354b ]
    
    Inside print_data_reloc_error(), if extent_from_logical() failed we
    return immediately.
    
    However there are the following cases where extent_from_logical() can
    return error but still holds a path:
    
    - btrfs_search_slot() returned 0
    
    - No backref item found in extent tree
    
    - No flags_ret provided
      This is not possible in this call site though.
    
    So for the above two cases, we can return without releasing the path,
    causing extent buffer leaks.
    
    Fixes: b9a9a85059cd ("btrfs: output affected files when relocation fails")
    Signed-off-by: Qu Wenruo <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: fix memory leak of fs_devices in degraded seed device path [+ + +]

Author: Deepanshu Kartikey <[email protected]>
Date:   Wed Dec 10 18:58:07 2025 +0530

    btrfs: fix memory leak of fs_devices in degraded seed device path
    
    [ Upstream commit b57f2ddd28737db6ff0e9da8467f0ab9d707e997 ]
    
    In open_seed_devices(), when find_fsid() fails and we're in DEGRADED
    mode, a new fs_devices is allocated via alloc_fs_devices() but is never
    added to the seed_list before returning. This contrasts with the normal
    path where fs_devices is properly added via list_add().
    
    If any error occurs later in read_one_dev() or btrfs_read_chunk_tree(),
    the cleanup code iterates seed_list to free seed devices, but this
    orphaned fs_devices is never found and never freed, causing a memory
    leak. Any devices allocated via add_missing_dev() and attached to this
    fs_devices are also leaked.
    
    Fix this by adding the newly allocated fs_devices to seed_list in the
    degraded path, consistent with the normal path.
    
    Fixes: 5f37583569442 ("Btrfs: move the missing device to its own fs device list")
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=eadd98df8bceb15d7fed
    Tested-by: [email protected]
    Reviewed-by: Qu Wenruo <[email protected]>
    Signed-off-by: Deepanshu Kartikey <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

btrfs: scrub: always update btrfs_scrub_progress::last_physical [+ + +]

Author: Qu Wenruo <[email protected]>
Date:   Mon Nov 3 12:51:09 2025 +1030

    btrfs: scrub: always update btrfs_scrub_progress::last_physical
    
    [ Upstream commit 54df8b80cc63aa0f22c4590cad11542731ed43ff ]
    
    [BUG]
    When a scrub failed immediately without any byte scrubbed, the returned
    btrfs_scrub_progress::last_physical will always be 0, even if there is a
    non-zero @start passed into btrfs_scrub_dev() for resume cases.
    
    This will reset the progress and make later scrub resume start from the
    beginning.
    
    [CAUSE]
    The function btrfs_scrub_dev() accepts a @progress parameter to copy its
    updated progress to the caller, there are cases where we either don't
    touch progress::last_physical at all or copy 0 into last_physical:
    
    - last_physical not updated at all
      If some error happened before scrubbing any super block or chunk, we
      will not copy the progress, leaving the @last_physical untouched.
    
      E.g. failed to allocate @sctx, scrubbing a missing device or even
      there is already a running scrub and so on.
    
      All those cases won't touch @progress at all, resulting the
      last_physical untouched and will be left as 0 for most cases.
    
    - Error out before scrubbing any bytes
      In those case we allocated @sctx, and sctx->stat.last_physical is all
      zero (initialized by kvzalloc()).
      Unfortunately some critical errors happened during
      scrub_enumerate_chunks() or scrub_supers() before any stripe is really
      scrubbed.
    
      In that case although we will copy sctx->stat back to @progress, since
      no byte is really scrubbed, last_physical will be overwritten to 0.
    
    [FIX]
    Make sure the parameter @progress always has its @last_physical member
    updated to @start parameter inside btrfs_scrub_dev().
    
    At the very beginning of the function, set @progress->last_physical to
    @start, so that even if we error out without doing progress copying,
    last_physical is still at @start.
    
    Then after we got @sctx allocated, set sctx->stat.last_physical to
    @start, this will make sure even if we didn't get any byte scrubbed, at
    the progress copying stage the @last_physical is not left as zero.
    
    This should resolve the resume progress reset problem.
    
    Signed-off-by: Qu Wenruo <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

caif: fix integer underflow in cffrml_receive() [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Thu Dec 4 21:30:47 2025 +0800

    caif: fix integer underflow in cffrml_receive()
    
    [ Upstream commit 8a11ff0948b5ad09b71896b7ccc850625f9878d1 ]
    
    The cffrml_receive() function extracts a length field from the packet
    header and, when FCS is disabled, subtracts 2 from this length without
    validating that len >= 2.
    
    If an attacker sends a malicious packet with a length field of 0 or 1
    to an interface with FCS disabled, the subtraction causes an integer
    underflow.
    
    This can lead to memory exhaustion and kernel instability, potential
    information disclosure if padding contains uninitialized kernel memory.
    
    Fix this by validating that len >= 2 before performing the subtraction.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: b482cd2053e3 ("net-caif: add CAIF core protocol stack")
    Signed-off-by: Junrui Luo <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/SYBPR01MB7881511122BAFEA8212A1608AFA6A@SYBPR01MB7881.ausprd01.prod.outlook.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

can: gs_usb: gs_can_open(): fix error handling [+ + +]

Author: Marc Kleine-Budde <[email protected]>
Date:   Mon Dec 1 19:26:38 2025 +0100

    can: gs_usb: gs_can_open(): fix error handling
    
    commit 3e54d3b4a8437b6783d4145c86962a2aa51022f3 upstream.
    
    Commit 2603be9e8167 ("can: gs_usb: gs_can_open(): improve error handling")
    added missing error handling to the gs_can_open() function.
    
    The driver uses 2 USB anchors to track the allocated URBs: the TX URBs in
    struct gs_can::tx_submitted for each netdev and the RX URBs in struct
    gs_usb::rx_submitted for the USB device. gs_can_open() allocates the RX
    URBs, while TX URBs are allocated during gs_can_start_xmit().
    
    The cleanup in gs_can_open() kills all anchored dev->tx_submitted
    URBs (which is not necessary since the netdev is not yet registered), but
    misses the parent->rx_submitted URBs.
    
    Fix the problem by killing the rx_submitted instead of the tx_submitted.
    
    Fixes: 2603be9e8167 ("can: gs_usb: gs_can_open(): improve error handling")
    Cc: [email protected]
    Link: https://patch.msgid.link/20251210-gs_usb-fix-error-handling-v1-1-d6a5a03f10bb@pengutronix.de
    Signed-off-by: Marc Kleine-Budde <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

char: applicom: fix NULL pointer dereference in ac_ioctl [+ + +]

Author: Tianchu Chen <[email protected]>
Date:   Fri Nov 28 15:53:23 2025 +0800

    char: applicom: fix NULL pointer dereference in ac_ioctl
    
    commit 82d12088c297fa1cef670e1718b3d24f414c23f7 upstream.
    
    Discovered by Atuin - Automated Vulnerability Discovery Engine.
    
    In ac_ioctl, the validation of IndexCard and the check for a valid
    RamIO pointer are skipped when cmd is 6. However, the function
    unconditionally executes readb(apbs[IndexCard].RamIO + VERS) at the
    end.
    
    If cmd is 6, IndexCard may reference a board that does not exist
    (where RamIO is NULL), leading to a NULL pointer dereference.
    
    Fix this by skipping the readb access when cmd is 6, as this
    command is a global information query and does not target a specific
    board context.
    
    Signed-off-by: Tianchu Chen <[email protected]>
    Acked-by: Arnd Bergmann <[email protected]>
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cifs: Fix memory and information leak in smb3_reconfigure() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Wed Dec 24 15:21:42 2025 +0000

    cifs: Fix memory and information leak in smb3_reconfigure()
    
    [ Upstream commit cb6d5aa9c0f10074f1ad056c3e2278ad2cc7ec8d ]
    
    In smb3_reconfigure(), if smb3_sync_session_ctx_passwords() fails, the
    function returns immediately without freeing and erasing the newly
    allocated new_password and new_password2. This causes both a memory leak
    and a potential information leak.
    
    Fix this by calling kfree_sensitive() on both password buffers before
    returning in this error case.
    
    Fixes: 0f0e357902957 ("cifs: during remount, make sure passwords are in sync")
    Signed-off-by: Zilin Guan <[email protected]>
    Reviewed-by: ChenXiaoSong <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: mvebu: cp110 add CLK_IGNORE_UNUSED to pcie_x10, pcie_x11 & pcie_x4 [+ + +]

Author: Josua Mayer <[email protected]>
Date:   Thu Oct 30 16:16:26 2025 +0100

    clk: mvebu: cp110 add CLK_IGNORE_UNUSED to pcie_x10, pcie_x11 & pcie_x4
    
    [ Upstream commit f0e6bc0c3ef4b4afb299bd6912586cafd5d864e9 ]
    
    CP110 based platforms rely on the bootloader for pci port
    initialization.
    TF-A actively prevents non-uboot re-configuration of pci lanes, and many
    boards do not have software control over the pci card reset.
    
    If a pci port had link at boot-time and the clock is stopped at a later
    point, the link fails and can not be recovered.
    
    PCI controller driver probe - and by extension ownership of a driver for
    the pci clocks - may be delayed especially on large modular kernels,
    causing the clock core to start disabling unused clocks.
    
    Add the CLK_IGNORE_UNUSED flag to the three pci port's clocks to ensure
    they are not stopped before the pci controller driver has taken
    ownership and tested for an existing link.
    
    This fixes failed pci link detection when controller driver probes late,
    e.g. with arm64 defconfig and CONFIG_PHY_MVEBU_CP110_COMPHY=m.
    
    Closes: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Josua Mayer <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Signed-off-by: Gregory CLEMENT <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: qcom: dispcc-sm7150: Fix dispcc_mdss_pclk0_clk_src [+ + +]

Author: Jens Reidel <[email protected]>
Date:   Fri Sep 19 14:34:32 2025 +0200

    clk: qcom: dispcc-sm7150: Fix dispcc_mdss_pclk0_clk_src
    
    [ Upstream commit e3c13e0caa8ceb7dec1a7c4fcfd9dbef56a69fbe ]
    
    Set CLK_OPS_PARENT_ENABLE to ensure the parent gets prepared and enabled
    when switching to it, fixing an "rcg didn't update its configuration"
    warning.
    
    Signed-off-by: Jens Reidel <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: samsung: exynos-clkout: Assign .num before accessing .hws [+ + +]

Author: Nathan Chancellor <[email protected]>
Date:   Mon Nov 24 12:11:06 2025 -0700

    clk: samsung: exynos-clkout: Assign .num before accessing .hws
    
    commit cf33f0b7df13685234ccea7be7bfe316b60db4db upstream.
    
    Commit f316cdff8d67 ("clk: Annotate struct clk_hw_onecell_data with
    __counted_by") annotated the hws member of 'struct clk_hw_onecell_data'
    with __counted_by, which informs the bounds sanitizer (UBSAN_BOUNDS)
    about the number of elements in .hws[], so that it can warn when .hws[]
    is accessed out of bounds. As noted in that change, the __counted_by
    member must be initialized with the number of elements before the first
    array access happens, otherwise there will be a warning from each access
    prior to the initialization because the number of elements is zero. This
    occurs in exynos_clkout_probe() due to .num being assigned after .hws[]
    has been accessed:
    
      UBSAN: array-index-out-of-bounds in drivers/clk/samsung/clk-exynos-clkout.c:178:18
      index 0 is out of range for type 'clk_hw *[*]'
    
    Move the .num initialization to before the first access of .hws[],
    clearing up the warning.
    
    Cc: [email protected]
    Fixes: f316cdff8d67 ("clk: Annotate struct clk_hw_onecell_data with __counted_by")
    Reported-by: Jochen Sprickerhof <[email protected]>
    Closes: https://lore.kernel.org/[email protected]/
    Tested-by: Jochen Sprickerhof <[email protected]>
    Signed-off-by: Nathan Chancellor <[email protected]>
    Reviewed-by: Kees Cook <[email protected]>
    Reviewed-by: Sam Protsenko <[email protected]>
    Reviewed-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Stephen Boyd <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

compiler_types.h: add "auto" as a macro for "__auto_type" [+ + +]

Author: H. Peter Anvin <[email protected]>
Date:   Fri Jul 18 11:35:00 2025 -0700

    compiler_types.h: add "auto" as a macro for "__auto_type"
    
    commit 2fb6915fa22dc5524d704afba58a13305dd9f533 upstream.
    
    "auto" was defined as a keyword back in the K&R days, but as a storage
    type specifier.  No one ever used it, since it was and is the default
    storage type for local variables.
    
    C++11 recycled the keyword to allow a type to be declared based on the
    type of an initializer.  This was finally adopted into standard C in
    C23.
    
    gcc and clang provide the "__auto_type" alias keyword as an extension
    for pre-C23, however, there is no reason to pollute the bulk of the
    source base with this temporary keyword; instead define "auto" as a
    macro unless the compiler is running in C23+ mode.
    
    This macro is added in <linux/compiler_types.h> because that header is
    included in some of the tools headers, wheres <linux/compiler.h> is
    not as it has a bunch of very kernel-specific things in it.
    
    [ Cc: stable to reduce potential backporting burden. ]
    
    Signed-off-by: H. Peter Anvin (Intel) <[email protected]>
    Acked-by: Miguel Ojeda <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cpufreq: dt-platdev: Add JH7110S SOC to the allowlist [+ + +]

Author: Hal Feng <[email protected]>
Date:   Thu Oct 16 16:00:48 2025 +0800

    cpufreq: dt-platdev: Add JH7110S SOC to the allowlist
    
    [ Upstream commit 6e7970cab51d01b8f7c56f120486c571c22e1b80 ]
    
    Add the compatible strings for supporting the generic
    cpufreq driver on the StarFive JH7110S SoC.
    
    Signed-off-by: Hal Feng <[email protected]>
    Reviewed-by: Heinrich Schuchardt <[email protected]>
    Signed-off-by: Viresh Kumar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

cpufreq: nforce2: fix reference count leak in nforce2 [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Mon Oct 27 23:04:45 2025 +0800

    cpufreq: nforce2: fix reference count leak in nforce2
    
    commit 9600156bb99852c216a2128cdf9f114eb67c350f upstream.
    
    There are two reference count leaks in this driver:
    
    1. In nforce2_fsb_read(): pci_get_subsys() increases the reference count
       of the PCI device, but pci_dev_put() is never called to release it,
       thus leaking the reference.
    
    2. In nforce2_detect_chipset(): pci_get_subsys() gets a reference to the
       nforce2_dev which is stored in a global variable, but the reference
       is never released when the module is unloaded.
    
    Fix both by:
    - Adding pci_dev_put(nforce2_sub5) in nforce2_fsb_read() after reading
      the configuration.
    - Adding pci_dev_put(nforce2_dev) in nforce2_exit() to release the
      global device reference.
    
    Found via static analysis.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Miaoqian Lin <[email protected]>
    Signed-off-by: Viresh Kumar <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cpufreq: s5pv210: fix refcount leak [+ + +]

Author: Shuhao Fu <[email protected]>
Date:   Mon Oct 6 03:31:17 2025 +0800

    cpufreq: s5pv210: fix refcount leak
    
    [ Upstream commit 2de5cb96060a1664880d65b120e59485a73588a8 ]
    
    In function `s5pv210_cpu_init`, a possible refcount inconsistency has
    been identified, causing a resource leak.
    
    Why it is a bug:
    1. For every clk_get, there should be a matching clk_put on every
    successive error handling path.
    2. After calling `clk_get(dmc1_clk)`, variable `dmc1_clk` will not be
    freed even if any error happens.
    
    How it is fixed: For every failed path, an extra goto label is added to
    ensure `dmc1_clk` will be freed regardlessly.
    
    Signed-off-by: Shuhao Fu <[email protected]>
    Signed-off-by: Viresh Kumar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

cpuidle: governors: teo: Drop misguided target residency check [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Thu Nov 13 14:24:31 2025 +0100

    cpuidle: governors: teo: Drop misguided target residency check
    
    commit a03b2011808ab02ccb7ab6b573b013b77fbb5921 upstream.
    
    When the target residency of the current candidate idle state is
    greater than the expected time till the closest timer (the sleep
    length), it does not matter whether or not the tick has already been
    stopped or if it is going to be stopped.  The closest timer will
    trigger anyway at its due time, so if an idle state with target
    residency above the sleep length is selected, energy will be wasted
    and there may be excess latency.
    
    Of course, if the closest timer were canceled before it could trigger,
    a deeper idle state would be more suitable, but this is not expected
    to happen (generally speaking, hrtimers are not expected to be
    canceled as a rule).
    
    Accordingly, the teo_state_ok() check done in that case causes energy to
    be wasted more often than it allows any energy to be saved (if it allows
    any energy to be saved at all), so drop it and let the governor use the
    teo_find_shallower_state() return value as the new candidate idle state
    index.
    
    Fixes: 21d28cd2fa5f ("cpuidle: teo: Do not call tick_nohz_get_sleep_length() upfront")
    Cc: All applicable <[email protected]>
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Reviewed-by: Christian Loehle <[email protected]>
    Tested-by: Christian Loehle <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

cpuidle: menu: Use residency threshold in polling state override decisions [+ + +]

Author: Aboorva Devarajan <[email protected]>
Date:   Mon Oct 6 07:09:54 2025 +0530

    cpuidle: menu: Use residency threshold in polling state override decisions
    
    [ Upstream commit 07d815701274d156ad8c7c088a52e01642156fb8 ]
    
    On virtualized PowerPC (pseries) systems, where only one polling state
    (Snooze) and one deep state (CEDE) are available, selecting CEDE when
    the predicted idle duration is less than the target residency of CEDE
    state can hurt performance. In such cases, the entry/exit overhead of
    CEDE outweighs the power savings, leading to unnecessary state
    transitions and higher latency.
    
    Menu governor currently contains a special-case rule that prioritizes
    the first non-polling state over polling, even when its target residency
    is much longer than the predicted idle duration. On PowerPC/pseries,
    where the gap between the polling state (Snooze) and the first non-polling
    state (CEDE) is large, this behavior causes performance regressions.
    
    Refine that special case by adding an extra requirement: the first
    non-polling state can only be chosen if its target residency is below
    the defined RESIDENCY_THRESHOLD_NS. If this condition is not satisfied,
    polling is allowed instead, avoiding suboptimal non-polling state
    entries.
    
    This change is limited to the single special-case rule for the first
    non-polling state. The general non-polling state selection logic in the
    menu governor remains unchanged.
    
    Performance improvement observed with pgbench on PowerPC (pseries)
    system:
    +---------------------------+------------+------------+------------+
    | Metric                    | Baseline   | Patched    | Change (%) |
    +---------------------------+------------+------------+------------+
    | Transactions/sec (TPS)    | 495,210    | 536,982    | +8.45%     |
    | Avg latency (ms)          | 0.163      | 0.150      | -7.98%     |
    +---------------------------+------------+------------+------------+
    
    CPUIdle state usage:
    +--------------+--------------+-------------+
    | Metric       | Baseline     | Patched     |
    +--------------+--------------+-------------+
    | Total usage  | 12,735,820   | 13,918,442  |
    | Above usage  | 11,401,520   | 1,598,210   |
    | Below usage  | 20,145       | 702,395     |
    +--------------+--------------+-------------+
    
    Above/Total and Below/Total usage percentages:
    +------------------------+-----------+---------+
    | Metric                 | Baseline  | Patched |
    +------------------------+-----------+---------+
    | Above % (Above/Total)  | 89.56%    | 11.49%  |
    | Below % (Below/Total)  | 0.16%     | 5.05%   |
    | Total cpuidle miss (%) | 89.72%    | 16.54%  |
    +------------------------+-----------+---------+
    
    The results indicate that restricting CEDE selection to cases where
    its residency matches the predicted idle time reduces mispredictions,
    lowers unnecessary state transitions, and improves overall throughput.
    
    Reviewed-by: Christian Loehle <[email protected]>
    Signed-off-by: Aboorva Devarajan <[email protected]>
    [ rjw: Changelog edits, rebase ]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

crypto: af_alg - zero initialize memory allocated via sock_kmalloc [+ + +]

Author: Shivani Agarwal <[email protected]>
Date:   Tue Sep 23 23:01:48 2025 -0700

    crypto: af_alg - zero initialize memory allocated via sock_kmalloc
    
    commit 6f6e309328d53a10c0fe1f77dec2db73373179b6 upstream.
    
    Several crypto user API contexts and requests allocated with
    sock_kmalloc() were left uninitialized, relying on callers to
    set fields explicitly. This resulted in the use of uninitialized
    data in certain error paths or when new fields are added in the
    future.
    
    The ACVP patches also contain two user-space interface files:
    algif_kpp.c and algif_akcipher.c. These too rely on proper
    initialization of their context structures.
    
    A particular issue has been observed with the newly added
    'inflight' variable introduced in af_alg_ctx by commit:
    
      67b164a871af ("crypto: af_alg - Disallow multiple in-flight AIO requests")
    
    Because the context is not memset to zero after allocation,
    the inflight variable has contained garbage values. As a result,
    af_alg_alloc_areq() has incorrectly returned -EBUSY randomly when
    the garbage value was interpreted as true:
    
      https://github.com/gregkh/linux/blame/master/crypto/af_alg.c#L1209
    
    The check directly tests ctx->inflight without explicitly
    comparing against true/false. Since inflight is only ever set to
    true or false later, an uninitialized value has triggered
    -EBUSY failures. Zero-initializing memory allocated with
    sock_kmalloc() ensures inflight and other fields start in a known
    state, removing random issues caused by uninitialized data.
    
    Fixes: fe869cdb89c9 ("crypto: algif_hash - User-space interface for hash operations")
    Fixes: 5afdfd22e6ba ("crypto: algif_rng - add random number generator support")
    Fixes: 2d97591ef43d ("crypto: af_alg - consolidation of duplicate code")
    Fixes: 67b164a871af ("crypto: af_alg - Disallow multiple in-flight AIO requests")
    Cc: [email protected]
    Signed-off-by: Shivani Agarwal <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

crypto: caam - Add check for kcalloc() in test_len() [+ + +]

Author: Guangshuo Li <[email protected]>
Date:   Tue Sep 23 20:44:18 2025 +0800

    crypto: caam - Add check for kcalloc() in test_len()
    
    commit 7cf6e0b69b0d90ab042163e5bbddda0dfcf8b6a7 upstream.
    
    As kcalloc() may fail, check its return value to avoid a NULL pointer
    dereference when passing the buffer to rng->read(). On allocation
    failure, log the error and return since test_len() returns void.
    
    Fixes: 2be0d806e25e ("crypto: caam - add a test for the RNG")
    Cc: [email protected]
    Signed-off-by: Guangshuo Li <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

crypto: seqiv - Do not use req->iv after crypto_aead_encrypt [+ + +]

Author: Herbert Xu <[email protected]>
Date:   Wed Dec 17 14:15:41 2025 +0800

    crypto: seqiv - Do not use req->iv after crypto_aead_encrypt
    
    [ Upstream commit 50fdb78b7c0bcc550910ef69c0984e751cac72fa ]
    
    As soon as crypto_aead_encrypt is called, the underlying request
    may be freed by an asynchronous completion.  Thus dereferencing
    req->iv after it returns is invalid.
    
    Instead of checking req->iv against info, create a new variable
    unaligned_info and use it for that purpose instead.
    
    Fixes: 0a270321dbf9 ("[CRYPTO] seqiv: Add Sequence Number IV Generator")
    Reported-by: Xiumei Mu <[email protected]>
    Reported-by: Xin Long <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

dm-bufio: align write boundary on physical block size [+ + +]

Author: Mikulas Patocka <[email protected]>
Date:   Mon Oct 20 14:48:13 2025 +0200

    dm-bufio: align write boundary on physical block size
    
    commit d0ac06ae53be0cdb61f5fe6b62d25d3317c51657 upstream.
    
    There may be devices with physical block size larger than 4k.
    
    If dm-bufio sends I/O that is not aligned on physical block size,
    performance is degraded.
    
    The 4k minimum alignment limit is there because some SSDs report logical
    and physical block size 512 despite having 4k internally - so dm-bufio
    shouldn't send I/Os not aligned on 4k boundary, because they perform
    badly (the SSD does read-modify-write for them).
    
    Signed-off-by: Mikulas Patocka <[email protected]>
    Reported-by: Uladzislau Rezki (Sony) <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dm-ebs: Mark full buffer dirty even on partial write [+ + +]

Author: Uladzislau Rezki (Sony) <[email protected]>
Date:   Mon Nov 17 11:59:45 2025 +0100

    dm-ebs: Mark full buffer dirty even on partial write
    
    commit 7fa3e7d114abc9cc71cc35d768e116641074ddb4 upstream.
    
    When performing a read-modify-write(RMW) operation, any modification
    to a buffered block must cause the entire buffer to be marked dirty.
    
    Marking only a subrange as dirty is incorrect because the underlying
    device block size(ubs) defines the minimum read/write granularity. A
    lower device can perform I/O only on regions which are fully aligned
    and sized to ubs.
    
    This change ensures that write-back operations always occur in full
    ubs-sized chunks, matching the intended emulation semantics of the
    EBS target.
    
    As for user space visible impact, submitting sub-ubs and misaligned
    I/O for devices which are tuned to ubs sizes only, will reject such
    requests, therefore it can lead to losing data. Example:
    
    1) Create a 8K nvme device in qemu by adding
    
    -device nvme,drive=drv0,serial=foo,logical_block_size=8192,physical_block_size=8192
    
    2) Setup dm-ebs to emulate 512B to 8K mapping
    
    urezki@pc638:~/bin$ cat dmsetup.sh
    
    lower=/dev/nvme0n1
    len=$(blockdev --getsz "$lower")
    
    echo "0 $len ebs $lower 0 1 16" | dmsetup create nvme-8k
    urezki@pc638:~/bin$
    
    offset 0, ebs=1 and ubs=16(in sectors).
    
    3) Create an ext4 filesystem(default 4K block size)
    
    urezki@pc638:~/bin$ sudo mkfs.ext4 -F /dev/dm-0
    mke2fs 1.47.0 (5-Feb-2023)
    Discarding device blocks: done
    Creating filesystem with 2072576 4k blocks and 518144 inodes
    Filesystem UUID: bd0b6ca6-0506-4e31-86da-8d22c9d50b63
    Superblock backups stored on blocks:
            32768, 98304, 163840, 229376, 294912, 819200, 884736, 1605632
    
    Allocating group tables: done
    Writing inode tables: done
    Creating journal (16384 blocks): done
    Writing superblocks and filesystem accounting information: mkfs.ext4: Input/output error while writing out and closing file system
    urezki@pc638:~/bin$ dmesg
    
    <snip>
    [ 1618.875449] buffer_io_error: 1028 callbacks suppressed
    [ 1618.875456] Buffer I/O error on dev dm-0, logical block 0, lost async page write
    [ 1618.875527] Buffer I/O error on dev dm-0, logical block 1, lost async page write
    [ 1618.875602] Buffer I/O error on dev dm-0, logical block 2, lost async page write
    [ 1618.875620] Buffer I/O error on dev dm-0, logical block 3, lost async page write
    [ 1618.875639] Buffer I/O error on dev dm-0, logical block 4, lost async page write
    [ 1618.894316] Buffer I/O error on dev dm-0, logical block 5, lost async page write
    [ 1618.894358] Buffer I/O error on dev dm-0, logical block 6, lost async page write
    [ 1618.894380] Buffer I/O error on dev dm-0, logical block 7, lost async page write
    [ 1618.894405] Buffer I/O error on dev dm-0, logical block 8, lost async page write
    [ 1618.894427] Buffer I/O error on dev dm-0, logical block 9, lost async page write
    <snip>
    
    Many I/O errors because the lower 8K device rejects sub-ubs/misaligned
    requests.
    
    with a patch:
    
    urezki@pc638:~/bin$ sudo mkfs.ext4 -F /dev/dm-0
    mke2fs 1.47.0 (5-Feb-2023)
    Discarding device blocks: done
    Creating filesystem with 2072576 4k blocks and 518144 inodes
    Filesystem UUID: 9b54f44f-ef55-4bd4-9e40-c8b775a616ac
    Superblock backups stored on blocks:
            32768, 98304, 163840, 229376, 294912, 819200, 884736, 1605632
    
    Allocating group tables: done
    Writing inode tables: done
    Creating journal (16384 blocks): done
    Writing superblocks and filesystem accounting information: done
    
    urezki@pc638:~/bin$ sudo mount /dev/dm-0 /mnt/
    urezki@pc638:~/bin$ ls -al /mnt/
    total 24
    drwxr-xr-x  3 root root  4096 Oct 17 15:13 .
    drwxr-xr-x 19 root root  4096 Jul 10 19:42 ..
    drwx------  2 root root 16384 Oct 17 15:13 lost+found
    urezki@pc638:~/bin$
    
    After this change: mkfs completes; mount succeeds.
    
    Signed-off-by: Uladzislau Rezki (Sony) <[email protected]>
    Signed-off-by: Mikulas Patocka <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Fix scratch registers offsets for DCN35 [+ + +]

Author: Ray Wu <[email protected]>
Date:   Fri Nov 28 08:58:13 2025 +0800

    drm/amd/display: Fix scratch registers offsets for DCN35
    
    commit 69741d9ccc7222e6b6f138db67b012ecc0d72542 upstream.
    
    [Why]
    Different platforms use differnet NBIO header files,
    causing display code to use differnt offset and read
    wrong accelerated status.
    
    [How]
    - Unified NBIO offset header file across platform.
    - Correct scratch registers offsets to proper locations.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4667
    Cc: Mario Limonciello <[email protected]>
    Cc: Alex Deucher <[email protected]>
    Reviewed-by: Mario Limonciello <[email protected]>
    Signed-off-by: Ray Wu <[email protected]>
    Signed-off-by: Chenyu Chen <[email protected]>
    Tested-by: Daniel Wheeler <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 49a63bc8eda0304ba307f5ba68305f936174f72d)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Fix scratch registers offsets for DCN351 [+ + +]

Author: Ray Wu <[email protected]>
Date:   Fri Nov 28 09:14:09 2025 +0800

    drm/amd/display: Fix scratch registers offsets for DCN351
    
    commit fd62aa13d3ee0f21c756a40a7c2f900f98992d6a upstream.
    
    [Why]
    Different platforms use different NBIO header files,
    causing display code to use differnt offset and read
    wrong accelerated status.
    
    [How]
    - Unified NBIO offset header file across platform.
    - Correct scratch registers offsets to proper locations.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4667
    Cc: Mario Limonciello <[email protected]>
    Cc: Alex Deucher <[email protected]>
    Reviewed-by: Mario Limonciello <[email protected]>
    Signed-off-by: Ray Wu <[email protected]>
    Signed-off-by: Chenyu Chen <[email protected]>
    Tested-by: Daniel Wheeler <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 576e032e909c8a6bb3d907b4ef5f6abe0f644199)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amd/display: Use GFP_ATOMIC in dc_create_plane_state() [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Tue Nov 11 11:17:22 2025 -0500

    drm/amd/display: Use GFP_ATOMIC in dc_create_plane_state()
    
    commit 3c41114dcdabb7b25f5bc33273c6db9c7af7f4a7 upstream.
    
    This can get called from an atomic context.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4470
    Reviewed-by: Harry Wentland <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 8acdad9344cc7b4e7bc01f0dfea80093eb3768db)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/gmc11: add amdgpu_vm_handle_fault() handling [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Thu Nov 13 15:55:19 2025 -0500

    drm/amdgpu/gmc11: add amdgpu_vm_handle_fault() handling
    
    commit 3f2289b56cd98f5741056bdb6e521324eff07ce5 upstream.
    
    We need to call amdgpu_vm_handle_fault() on page fault
    on all gfx9 and newer parts to properly update the
    page tables, not just for recoverable page faults.
    
    Cc: [email protected]
    Reviewed-by: Timur Kristóf <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu/gmc12: add amdgpu_vm_handle_fault() handling [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Thu Nov 13 15:57:43 2025 -0500

    drm/amdgpu/gmc12: add amdgpu_vm_handle_fault() handling
    
    commit ff28ff98db6a8eeb469e02fb8bd1647b353232a9 upstream.
    
    We need to call amdgpu_vm_handle_fault() on page fault
    on all gfx9 and newer parts to properly update the
    page tables, not just for recoverable page faults.
    
    Cc: [email protected]
    Reviewed-by: Timur Kristóf <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdgpu: add missing lock to amdgpu_ttm_access_memory_sdma [+ + +]

Author: Pierre-Eric Pelloux-Prayer <[email protected]>
Date:   Tue Nov 25 10:48:39 2025 +0100

    drm/amdgpu: add missing lock to amdgpu_ttm_access_memory_sdma
    
    commit 4fa944255be521b1bbd9780383f77206303a3a5c upstream.
    
    Users of ttm entities need to hold the gtt_window_lock before using them
    to guarantee proper ordering of jobs.
    
    Cc: [email protected]
    Fixes: cb5cc4f573e1 ("drm/amdgpu: improve debug VRAM access performance using sdma")
    Signed-off-by: Pierre-Eric Pelloux-Prayer <[email protected]>
    Reviewed-by: Christian König <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdkfd: bump minimum vgpr size for gfx1151 [+ + +]

Author: Jonathan Kim <[email protected]>
Date:   Fri Dec 5 14:41:08 2025 -0500

    drm/amdkfd: bump minimum vgpr size for gfx1151
    
    commit cf326449637a566ba98fb82c47d46cd479608c88 upstream.
    
    GFX1151 has 1.5x the number of available physical VGPRs per SIMD.
    Bump total memory availability for acquire checks on queue creation.
    
    Signed-off-by: Jonathan Kim <[email protected]>
    Reviewed-by: Mario Limonciello <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit b42f3bf9536c9b710fd1d4deb7d1b0dc819dc72d)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdkfd: Export the cwsr_size and ctl_stack_size to userspace [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Fri Dec 5 12:41:58 2025 -0600

    drm/amdkfd: Export the cwsr_size and ctl_stack_size to userspace
    
    commit 8fc2796dea6f1210e1a01573961d5836a7ce531e upstream.
    
    This is important for userspace to avoid hardcoding VGPR size.
    
    Reviewed-by: Kent Russell <[email protected]>
    Signed-off-by: Mario Limonciello <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 71776e0965f9f730af19c5f548827f2a7c91f5a8)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/amdkfd: Trap handler support for expert scheduling mode [+ + +]

Author: Jay Cornwall <[email protected]>
Date:   Fri Nov 14 14:32:42 2025 -0600

    drm/amdkfd: Trap handler support for expert scheduling mode
    
    commit b7851f8c66191cd23a0a08bd484465ad74bbbb7d upstream.
    
    The trap may be entered with dependency checking disabled.
    Wait for dependency counters and save/restore scheduling mode.
    
    v2:
    
    Use ttmp1 instead of ttmp11. ttmp11 is not zero-initialized.
    While the trap handler does zero this field before use, a user-mode
    second-level trap handler could not rely on this being zero when
    using an older kernel mode driver.
    
    v3:
    
    Use ttmp11 primarily but copy to ttmp1 before jumping to the
    second level trap handler. ttmp1 is inspectable by a debugger.
    Unexpected bits in the unused space may regress existing software.
    
    Signed-off-by: Jay Cornwall <[email protected]>
    Reviewed-by: Lancelot Six <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 423888879412e94725ca2bdccd89414887d98e31)
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/buddy: Optimize free block management with RB tree [+ + +]

Author: Arunpravin Paneer Selvam <[email protected]>
Date:   Mon Oct 6 15:21:22 2025 +0530

    drm/buddy: Optimize free block management with RB tree
    
    commit c178e534fff1d5a74da80ea03b20e2b948a00113 upstream.
    
    Replace the freelist (O(n)) used for free block management with a
    red-black tree, providing more efficient O(log n) search, insert,
    and delete operations. This improves scalability and performance
    when managing large numbers of free blocks per order (e.g., hundreds
    or thousands).
    
    In the VK-CTS memory stress subtest, the buddy manager merges
    fragmented memory and inserts freed blocks into the freelist. Since
    freelist insertion is O(n), this becomes a bottleneck as fragmentation
    increases. Benchmarking shows list_insert_sorted() consumes ~52.69% CPU
    with the freelist, compared to just 0.03% with the RB tree
    (rbtree_insert.isra.0), despite performing the same sorted insert.
    
    This also improves performance in heavily fragmented workloads,
    such as games or graphics tests that stress memory.
    
    As the buddy allocator evolves with new features such as clear-page
    tracking, the resulting fragmentation and complexity have grown.
    These RB-tree based design changes are introduced to address that
    growth and ensure the allocator continues to perform efficiently
    under fragmented conditions.
    
    The RB tree implementation with separate clear/dirty trees provides:
    - O(n log n) aggregate complexity for all operations instead of O(n^2)
    - Elimination of soft lockups and system instability
    - Improved code maintainability and clarity
    - Better scalability for large memory systems
    - Predictable performance under fragmentation
    
    v3(Matthew):
      - Remove RB_EMPTY_NODE check in force_merge function.
      - Rename rb for loop macros to have less generic names and move to
        .c file.
      - Make the rb node rb and link field as union.
    
    v4(Jani Nikula):
      - The kernel-doc comment should be "/**"
      - Move all the rbtree macros to rbtree.h and add parens to ensure
        correct precedence.
    
    v5:
      - Remove the inline in a .c file (Jani Nikula).
    
    v6(Peter Zijlstra):
      - Add rb_add() function replacing the existing rbtree_insert() code.
    
    v7:
      - A full walk iteration in rbtree is slower than the list (Peter Zijlstra).
      - The existing rbtree_postorder_for_each_entry_safe macro should be used
        in scenarios where traversal order is not a critical factor (Christian).
    
    v8(Matthew):
      - Remove the rbtree_is_empty() check in this patch as well.
    
    Cc: [email protected]
    Fixes: a68c7eaa7a8f ("drm/amdgpu: Enable clear page functionality")
    Signed-off-by: Arunpravin Paneer Selvam <[email protected]>
    Reviewed-by: Matthew Auld <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/buddy: Separate clear and dirty free block trees [+ + +]

Author: Arunpravin Paneer Selvam <[email protected]>
Date:   Mon Oct 6 15:21:23 2025 +0530

    drm/buddy: Separate clear and dirty free block trees
    
    commit d4cd665c98c144dd6ad5d66d30396e13d23118c9 upstream.
    
    Maintain two separate RB trees per order - one for clear (zeroed) blocks
    and another for dirty (uncleared) blocks. This separation improves
    code clarity and makes it more obvious which tree is being searched
    during allocation. It also improves scalability and efficiency when
    searching for a specific type of block, avoiding unnecessary checks
    and making the allocator more predictable under fragmentation.
    
    The changes have been validated using the existing drm_buddy_test
    KUnit test cases, along with selected graphics workloads,
    to ensure correctness and avoid regressions.
    
    v2: Missed adding the suggested-by tag. Added it in v2.
    
    v3(Matthew):
      - Remove the double underscores from the internal functions.
      - Rename the internal functions to have less generic names.
      - Fix the error handling code.
      - Pass tree argument for the tree macro.
      - Use the existing dirty/free bit instead of new tree field.
      - Make free_trees[] instead of clear_tree and dirty_tree for
        more cleaner approach.
    
    v4:
      - A bug was reported by Intel CI and it is fixed by
        Matthew Auld.
      - Replace the get_root function with
        &mm->free_trees[tree][order] (Matthew)
      - Remove the unnecessary rbtree_is_empty() check (Matthew)
      - Remove the unnecessary get_tree_for_flags() function.
      - Rename get_tree_for_block() name with get_block_tree() for more
        clarity.
    
    v5(Jani Nikula):
      - Don't use static inline in .c files.
      - enum free_tree and enumerator names are quite generic for a header
        and usage and the whole enum should be an implementation detail.
    
    v6:
      - Rewrite the __force_merge() function using the rb_last() and rb_prev().
    
    v7(Matthew):
      - Replace the open-coded tree iteration for loops with the
        for_each_free_tree() macro throughout the code.
      - Fixed out_free_roots to prevent double decrement of i,
        addressing potential crash.
      - Replaced enum drm_buddy_free_tree with unsigned int
        in for_each_free_tree loops.
    
    Cc: [email protected]
    Fixes: a68c7eaa7a8f ("drm/amdgpu: Enable clear page functionality")
    Signed-off-by: Arunpravin Paneer Selvam <[email protected]>
    Suggested-by: Matthew Auld <[email protected]>
    Reviewed-by: Matthew Auld <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4260
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/displayid: add quirk to ignore DisplayID checksum errors [+ + +]

Author: Jani Nikula <[email protected]>
Date:   Wed Dec 31 11:29:26 2025 -0500

    drm/displayid: add quirk to ignore DisplayID checksum errors
    
    [ Upstream commit 83cbb4d33dc22b0ca1a4e85c6e892c9b729e28d4 ]
    
    Add a mechanism for DisplayID specific quirks, and add the first quirk
    to ignore DisplayID section checksum errors.
    
    It would be quite inconvenient to pass existing EDID quirks from
    drm_edid.c for DisplayID parsing. Not all places doing DisplayID
    iteration have the quirks readily available, and would have to pass it
    in all places. Simply add a separate array of DisplayID specific EDID
    quirks. We do end up checking it every time we iterate DisplayID blocks,
    but hopefully the number of quirks remains small.
    
    There are a few laptop models with DisplayID checksum failures, leading
    to higher refresh rates only present in the DisplayID blocks being
    ignored. Add a quirk for the panel in the machines.
    
    Reported-by: Tiago Martins Araújo <[email protected]>
    Closes: https://lore.kernel.org/r/CACRbrPGvLP5LANXuFi6z0S7XMbAG4X5y2YOLBDxfOVtfGGqiKQ@mail.gmail.com
    Closes: https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/14703
    Acked-by: Alex Deucher <[email protected]>
    Tested-by: Tiago Martins Araújo <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/c04d81ae648c5f21b3f5b7953f924718051f2798.1761681968.git.jani.nikula@intel.com
    Signed-off-by: Jani Nikula <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/displayid: pass iter to drm_find_displayid_extension() [+ + +]

Author: Jani Nikula <[email protected]>
Date:   Tue Oct 28 22:07:25 2025 +0200

    drm/displayid: pass iter to drm_find_displayid_extension()
    
    commit 520f37c30992fd0c212a34fbe99c062b7a3dc52e upstream.
    
    It's more convenient to pass iter than a handful of its members to
    drm_find_displayid_extension(), especially as we're about to add another
    member.
    
    Rename the function find_next_displayid_extension() while at it, to be
    more descriptive.
    
    Cc: Tiago Martins Araújo <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Tested-by: Tiago Martins Araújo <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/3837ae7f095e77a082ac2422ce2fac96c4f9373d.1761681968.git.jani.nikula@intel.com
    Signed-off-by: Jani Nikula <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/edid: add DRM_EDID_IDENT_INIT() to initialize struct drm_edid_ident [+ + +]

Author: Jani Nikula <[email protected]>
Date:   Tue Oct 28 22:07:26 2025 +0200

    drm/edid: add DRM_EDID_IDENT_INIT() to initialize struct drm_edid_ident
    
    commit 8b61583f993589a64c061aa91b44f5bd350d90a5 upstream.
    
    Add a convenience helper for initializing struct drm_edid_ident.
    
    Cc: Tiago Martins Araújo <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Tested-by: Tiago Martins Araújo <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/710b2ac6a211606ec1f90afa57b79e8c7375a27e.1761681968.git.jani.nikula@intel.com
    Signed-off-by: Jani Nikula <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/gma500: Remove unused helper psb_fbdev_fb_setcolreg() [+ + +]

Author: Thomas Zimmermann <[email protected]>
Date:   Mon Sep 29 10:23:23 2025 +0200

    drm/gma500: Remove unused helper psb_fbdev_fb_setcolreg()
    
    commit be729f9de6c64240645dc80a24162ac4d3fe00a8 upstream.
    
    Remove psb_fbdev_fb_setcolreg(), which hasn't been called in almost
    a decade.
    
    Gma500 commit 4d8d096e9ae8 ("gma500: introduce the framebuffer support
    code") added the helper psb_fbdev_fb_setcolreg() for setting the fbdev
    palette via fbdev's fb_setcolreg callback. Later
    commit 3da6c2f3b730 ("drm/gma500: use DRM_FB_HELPER_DEFAULT_OPS for
    fb_ops") set several default helpers for fbdev emulation, including
    fb_setcmap.
    
    The fbdev subsystem always prefers fb_setcmap over fb_setcolreg. [1]
    Hence, the gma500 code is no longer in use and gma500 has been using
    drm_fb_helper_setcmap() for several years without issues.
    
    Fixes: 3da6c2f3b730 ("drm/gma500: use DRM_FB_HELPER_DEFAULT_OPS for fb_ops")
    Cc: Patrik Jakobsson <[email protected]>
    Cc: Stefan Christ <[email protected]>
    Cc: Daniel Vetter <[email protected]>
    Cc: [email protected]
    Cc: <[email protected]> # v4.10+
    Link: https://elixir.bootlin.com/linux/v6.16.9/source/drivers/video/fbdev/core/fbcmap.c#L246 # [1]
    Signed-off-by: Thomas Zimmermann <[email protected]>
    Acked-by: Patrik Jakobsson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915/gem: Zero-initialize the eb.vma array in i915_gem_do_execbuffer [+ + +]

Author: Krzysztof Niemiec <[email protected]>
Date:   Tue Dec 16 19:09:01 2025 +0100

    drm/i915/gem: Zero-initialize the eb.vma array in i915_gem_do_execbuffer
    
    commit 4fe2bd195435e71c117983d87f278112c5ab364c upstream.
    
    Initialize the eb.vma array with values of 0 when the eb structure is
    first set up. In particular, this sets the eb->vma[i].vma pointers to
    NULL, simplifying cleanup and getting rid of the bug described below.
    
    During the execution of eb_lookup_vmas(), the eb->vma array is
    successively filled up with struct eb_vma objects. This process includes
    calling eb_add_vma(), which might fail; however, even in the event of
    failure, eb->vma[i].vma is set for the currently processed buffer.
    
    If eb_add_vma() fails, eb_lookup_vmas() returns with an error, which
    prompts a call to eb_release_vmas() to clean up the mess. Since
    eb_lookup_vmas() might fail during processing any (possibly not first)
    buffer, eb_release_vmas() checks whether a buffer's vma is NULL to know
    at what point did the lookup function fail.
    
    In eb_lookup_vmas(), eb->vma[i].vma is set to NULL if either the helper
    function eb_lookup_vma() or eb_validate_vma() fails. eb->vma[i+1].vma is
    set to NULL in case i915_gem_object_userptr_submit_init() fails; the
    current one needs to be cleaned up by eb_release_vmas() at this point,
    so the next one is set. If eb_add_vma() fails, neither the current nor
    the next vma is set to NULL, which is a source of a NULL deref bug
    described in the issue linked in the Closes tag.
    
    When entering eb_lookup_vmas(), the vma pointers are set to the slab
    poison value, instead of NULL. This doesn't matter for the actual
    lookup, since it gets overwritten anyway, however the eb_release_vmas()
    function only recognizes NULL as the stopping value, hence the pointers
    are being set to NULL as they go in case of intermediate failure. This
    patch changes the approach to filling them all with NULL at the start
    instead, rather than handling that manually during failure.
    
    Reported-by: Gangmin Kim <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/15062
    Fixes: 544460c33821 ("drm/i915: Multi-BB execbuf")
    Cc: [email protected] # 5.16.x
    Signed-off-by: Krzysztof Niemiec <[email protected]>
    Reviewed-by: Janusz Krzysztofik <[email protected]>
    Reviewed-by: Krzysztof Karas <[email protected]>
    Reviewed-by: Andi Shyti <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    (cherry picked from commit 08889b706d4f0b8d2352b7ca29c2d8df4d0787cd)
    Signed-off-by: Jani Nikula <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/i915: Fix format string truncation warning [+ + +]

Author: Ard Biesheuvel <[email protected]>
Date:   Fri Dec 5 12:35:01 2025 +0100

    drm/i915: Fix format string truncation warning
    
    commit 1c7f9e528f8f488b060b786bfb90b40540854db3 upstream.
    
    GCC notices that the 16-byte uabi_name field could theoretically be too
    small for the formatted string if the instance number exceeds 100.
    
    So grow the field to 20 bytes.
    
    drivers/gpu/drm/i915/intel_memory_region.c: In function ‘intel_memory_region_create’:
    drivers/gpu/drm/i915/intel_memory_region.c:273:61: error: ‘%u’ directive output may be truncated writing between 1 and 5 bytes into a region of size between 3 and 11 [-Werror=format-truncation=]
      273 |         snprintf(mem->uabi_name, sizeof(mem->uabi_name), "%s%u",
          |                                                             ^~
    drivers/gpu/drm/i915/intel_memory_region.c:273:58: note: directive argument in the range [0, 65535]
      273 |         snprintf(mem->uabi_name, sizeof(mem->uabi_name), "%s%u",
          |                                                          ^~~~~~
    drivers/gpu/drm/i915/intel_memory_region.c:273:9: note: ‘snprintf’ output between 7 and 19 bytes into a destination of size 16
      273 |         snprintf(mem->uabi_name, sizeof(mem->uabi_name), "%s%u",
          |         ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
      274 |                  intel_memory_type_str(type), instance);
          |                  ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    
    Fixes: 3b38d3515753 ("drm/i915: Add stable memory region names")
    Cc: <[email protected]> # v6.8+
    Signed-off-by: Ard Biesheuvel <[email protected]>
    Signed-off-by: Tvrtko Ursulin <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    (cherry picked from commit 18476087f1a18dc279d200d934ad94fba1fb51d5)
    Signed-off-by: Jani Nikula <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/imagination: Disallow exporting of PM/FW protected objects [+ + +]

Author: Alessio Belle <[email protected]>
Date:   Mon Dec 8 09:11:00 2025 +0000

    drm/imagination: Disallow exporting of PM/FW protected objects
    
    commit 6b991ad8dc3abfe5720fc2e9ee96be63ae43e362 upstream.
    
    These objects are meant to be used by the GPU firmware or by the PM unit
    within the GPU, in which case they may contain physical addresses.
    
    This adds a layer of protection against exposing potentially exploitable
    information outside of the driver.
    
    Fixes: ff5f643de0bf ("drm/imagination: Add GEM and VM related code")
    Signed-off-by: Alessio Belle <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Matt Coster <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/me/gsc: mei interrupt top half should be in irq disabled context [+ + +]

Author: Junxiao Chang <[email protected]>
Date:   Fri Nov 7 11:31:52 2025 +0800

    drm/me/gsc: mei interrupt top half should be in irq disabled context
    
    [ Upstream commit 17445af7dcc7d645b6fb8951fd10c8b72cc7f23f ]
    
    MEI GSC interrupt comes from i915 or xe driver. It has top half and
    bottom half. Top half is called from i915/xe interrupt handler. It
    should be in irq disabled context.
    
    With RT kernel(PREEMPT_RT enabled), by default IRQ handler is in
    threaded IRQ. MEI GSC top half might be in threaded IRQ context.
    generic_handle_irq_safe API could be called from either IRQ or
    process context, it disables local IRQ then calls MEI GSC interrupt
    top half.
    
    This change fixes B580 GPU boot issue with RT enabled.
    
    Fixes: e02cea83d32d ("drm/xe/gsc: add Battlemage support")
    Tested-by: Baoli Zhang <[email protected]>
    Signed-off-by: Junxiao Chang <[email protected]>
    Reviewed-by: Sebastian Andrzej Siewior <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Maarten Lankhorst <[email protected]>
    (cherry picked from commit 3efadf028783a49ab2941294187c8b6dd86bf7da)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/mediatek: Fix device node reference leak in mtk_dp_dt_parse() [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Wed Oct 29 15:23:06 2025 +0800

    drm/mediatek: Fix device node reference leak in mtk_dp_dt_parse()
    
    commit a846505a193d7492ad3531e33cacfca31e4bcdd1 upstream.
    
    The function mtk_dp_dt_parse() calls of_graph_get_endpoint_by_regs()
    to get the endpoint device node, but fails to call of_node_put() to release
    the reference when the function returns. This results in a device node
    reference leak.
    
    Fix this by adding the missing of_node_put() call before returning from
    the function.
    
    Found via static analysis and code review.
    
    Fixes: f70ac097a2cf ("drm/mediatek: Add MT8195 Embedded DisplayPort driver")
    Cc: [email protected]
    Signed-off-by: Miaoqian Lin <[email protected]>
    Reviewed-by: Markus Schneider-Pargmann <[email protected]>
    Reviewed-by: CK Hu <[email protected]>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/mediatek: Fix probe device leaks [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Tue Sep 23 17:23:38 2025 +0200

    drm/mediatek: Fix probe device leaks
    
    commit 2a2a04be8e869a19c9f950b89b1e05832a0f7ec7 upstream.
    
    Make sure to drop the reference taken to each component device during
    probe on probe failure (e.g. probe deferral) and on driver unbind.
    
    Fixes: 6ea6f8276725 ("drm/mediatek: Use correct device pointer to get CMDQ client register")
    Cc: [email protected]      # 5.12
    Cc: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/mediatek: Fix probe memory leak [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Tue Sep 23 17:23:37 2025 +0200

    drm/mediatek: Fix probe memory leak
    
    commit 5e49200593f331cd0629b5376fab9192f698e8ef upstream.
    
    The Mediatek DRM driver allocates private data for components without a
    platform driver but as the lifetime is tied to each component device,
    the memory is never freed.
    
    Tie the allocation lifetime to the DRM platform device so that the
    memory is released on probe failure (e.g. probe deferral) and when the
    driver is unbound.
    
    Fixes: c0d36de868a6 ("drm/mediatek: Move clk info from struct mtk_ddp_comp to sub driver private data")
    Cc: [email protected]      # 5.12
    Cc: CK Hu <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/mediatek: Fix probe resource leaks [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Tue Sep 23 17:23:36 2025 +0200

    drm/mediatek: Fix probe resource leaks
    
    commit 07c7c640a8eb9e196f357d15d88a59602a947197 upstream.
    
    Make sure to unmap and release the component iomap and clock on probe
    failure (e.g. probe deferral) and on driver unbind.
    
    Note that unlike of_iomap(), devm_of_iomap() also checks whether the
    region is already mapped.
    
    Fixes: 119f5173628a ("drm/mediatek: Add DRM Driver for Mediatek SoC MT8173.")
    Cc: [email protected]      # 4.7
    Cc: CK Hu <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/mgag200: Fix big-endian support [+ + +]

Author: René Rebe <[email protected]>
Date:   Mon Dec 8 14:18:27 2025 +0100

    drm/mgag200: Fix big-endian support
    
    commit 6cb31fba137d45e682ce455b8ea364f44d5d4f98 upstream.
    
    Unlike the original, deleted Matrox mga driver, the new mgag200 driver
    has the XRGB frame-buffer byte swapped on big-endian "RISC"
    systems. Fix by enabling byte swapping "PowerPC" OPMODE for any
    __BIG_ENDIAN config.
    
    Fixes: 414c45310625 ("mgag200: initial g200se driver (v2)")
    Signed-off-by: René Rebe <[email protected]>
    Cc: [email protected]
    Reviewed-by: Thomas Zimmermann <[email protected]>
    Signed-off-by: Thomas Zimmermann <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/msm/a6xx: Fix out of bound IO access in a6xx_get_gmu_registers [+ + +]

Author: Akhil P Oommen <[email protected]>
Date:   Tue Nov 18 14:20:28 2025 +0530

    drm/msm/a6xx: Fix out of bound IO access in a6xx_get_gmu_registers
    
    commit 779b68a5bf2764c8ed3aa800e41ba0d5d007e1e7 upstream.
    
    REG_A6XX_GMU_AO_AHB_FENCE_CTRL register falls under GMU's register
    range. So, use gmu_write() routines to write to this register.
    
    Fixes: 1707add81551 ("drm/msm/a6xx: Add a6xx gpu state")
    Cc: [email protected]
    Signed-off-by: Akhil P Oommen <[email protected]>
    Reviewed-by: Konrad Dybcio <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/688993/
    Message-ID: <[email protected]>
    Signed-off-by: Rob Clark <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/msm/dpu: Add missing NULL pointer check for pingpong interface [+ + +]

Author: Nikolay Kuratov <[email protected]>
Date:   Thu Dec 11 12:36:30 2025 +0300

    drm/msm/dpu: Add missing NULL pointer check for pingpong interface
    
    commit 88733a0b64872357e5ecd82b7488121503cb9cc6 upstream.
    
    It is checked almost always in dpu_encoder_phys_wb_setup_ctl(), but in a
    single place the check is missing.
    Also use convenient locals instead of phys_enc->* where available.
    
    Cc: [email protected]
    Fixes: d7d0e73f7de33 ("drm/msm/dpu: introduce the dpu_encoder_phys_* for writeback")
    Signed-off-by: Nikolay Kuratov <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/693860/
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/nouveau/dispnv50: Don't call drm_atomic_get_crtc_state() in prepare_fb [+ + +]

Author: Lyude Paul <[email protected]>
Date:   Thu Dec 11 14:02:54 2025 -0500

    drm/nouveau/dispnv50: Don't call drm_atomic_get_crtc_state() in prepare_fb
    
    commit 560271e10b2c86e95ea35afa9e79822e4847f07a upstream.
    
    Since we recently started warning about uses of this function after the
    atomic check phase completes, we've started getting warnings about this in
    nouveau. It appears a misplaced drm_atomic_get_crtc_state() call has been
    hiding in our .prepare_fb callback for a while.
    
    So, fix this by adding a new nv50_head_atom_get_new() function and use that
    in our .prepare_fb callback instead.
    
    Signed-off-by: Lyude Paul <[email protected]>
    Reviewed-by: Dave Airlie <[email protected]>
    Fixes: 1590700d94ac ("drm/nouveau/kms/nv50-: split each resource type into their own source files")
    Cc: <[email protected]> # v4.18+
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/panel: sony-td4353-jdi: Enable prepare_prev_first [+ + +]

Author: Marijn Suijten <[email protected]>
Date:   Sun Nov 30 23:40:05 2025 +0100

    drm/panel: sony-td4353-jdi: Enable prepare_prev_first
    
    [ Upstream commit 2b973ca48ff3ef1952091c8f988d7796781836c8 ]
    
    The DSI host must be enabled before our prepare function can run, which
    has to send its init sequence over DSI.  Without enabling the host first
    the panel will not probe.
    
    Fixes: 9e15123eca79 ("drm/msm/dsi: Stop unconditionally powering up DSI hosts at modeset")
    Signed-off-by: Marijn Suijten <[email protected]>
    Reviewed-by: Douglas Anderson <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Reviewed-by: Martin Botka <[email protected]>
    Signed-off-by: Douglas Anderson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

drm/panthor: Flush shmem writes before mapping buffers CPU-uncached [+ + +]

Author: Boris Brezillon <[email protected]>
Date:   Fri Jan 2 12:37:23 2026 -0800

    drm/panthor: Flush shmem writes before mapping buffers CPU-uncached
    
    [ Upstream commit 576c930e5e7dcb937648490611a83f1bf0171048 ]
    
    The shmem layer zeroes out the new pages using cached mappings, and if
    we don't CPU-flush we might leave dirty cachelines behind, leading to
    potential data leaks and/or asynchronous buffer corruption when dirty
    cachelines are evicted.
    
    Fixes: 8a1cc07578bf ("drm/panthor: Add GEM logical block")
    Signed-off-by: Boris Brezillon <[email protected]>
    Reviewed-by: Steven Price <[email protected]>
    Reviewed-by: Liviu Dudau <[email protected]>
    Signed-off-by: Steven Price <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [Harshit: Resolve conflicts due to missing commit: fe69a3918084
    ("drm/panthor: Fix UAF in panthor_gem_create_with_handle() debugfs
    code") in 6.12.y]
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/ttm: Avoid NULL pointer deref for evicted BOs [+ + +]

Author: Simon Richter <[email protected]>
Date:   Tue Oct 14 01:11:33 2025 +0900

    drm/ttm: Avoid NULL pointer deref for evicted BOs
    
    commit 491adc6a0f9903c32b05f284df1148de39e8e644 upstream.
    
    It is possible for a BO to exist that is not currently associated with a
    resource, e.g. because it has been evicted.
    
    When devcoredump tries to read the contents of all BOs for dumping, we need
    to expect this as well -- in this case, ENODATA is recorded instead of the
    buffer contents.
    
    Fixes: 7d08df5d0bd3 ("drm/ttm: Add ttm_bo_access")
    Fixes: 09ac4fcb3f25 ("drm/ttm: Implement vm_operations_struct.access v2")
    Cc: stable <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/6271
    Signed-off-by: Simon Richter <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Reviewed-by: Shuicheng Lin <[email protected]>
    Reviewed-by: Christian König <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe/bo: Don't include the CCS metadata in the dma-buf sg-table [+ + +]

Author: Thomas Hellström <[email protected]>
Date:   Tue Dec 9 21:49:20 2025 +0100

    drm/xe/bo: Don't include the CCS metadata in the dma-buf sg-table
    
    commit 449bcd5d45eb4ce26740f11f8601082fe734bed2 upstream.
    
    Some Xe bos are allocated with extra backing-store for the CCS
    metadata. It's never been the intention to share the CCS metadata
    when exporting such bos as dma-buf. Don't include it in the
    dma-buf sg-table.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Cc: Rodrigo Vivi <[email protected]>
    Cc: Matthew Brost <[email protected]>
    Cc: Maarten Lankhorst <[email protected]>
    Cc: <[email protected]> # v6.8+
    Signed-off-by: Thomas Hellström <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Reviewed-by: Karol Wachowski <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit a4ebfb9d95d78a12512b435a698ee6886d712571)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe/oa: Disallow 0 OA property values [+ + +]

Author: Ashutosh Dixit <[email protected]>
Date:   Thu Dec 11 22:18:49 2025 -0800

    drm/xe/oa: Disallow 0 OA property values
    
    commit 3595114bc31d1eb5e1996164c901485c1ffac6f7 upstream.
    
    An OA property value of 0 is invalid and will cause a NPD.
    
    Reported-by: Peter Senna Tschudin <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/6452
    Fixes: cc4e6994d5a2 ("drm/xe/oa: Move functions up so they can be reused for config ioctl")
    Cc: [email protected]
    Signed-off-by: Ashutosh Dixit <[email protected]>
    Reviewed-by: Harish Chegondi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 7a100e6ddcc47c1f6ba7a19402de86ce24790621)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe/oa: Fix potential UAF in xe_oa_add_config_ioctl() [+ + +]

Author: Sanjay Yadav <[email protected]>
Date:   Tue Nov 18 17:19:00 2025 +0530

    drm/xe/oa: Fix potential UAF in xe_oa_add_config_ioctl()
    
    commit dcb171931954c51a1a7250d558f02b8f36570783 upstream.
    
    In xe_oa_add_config_ioctl(), we accessed oa_config->id after dropping
    metrics_lock. Since this lock protects the lifetime of oa_config, an
    attacker could guess the id and call xe_oa_remove_config_ioctl() with
    perfect timing, freeing oa_config before we dereference it, leading to
    a potential use-after-free.
    
    Fix this by caching the id in a local variable while holding the lock.
    
    v2: (Matt A)
    - Dropped mutex_unlock(&oa->metrics_lock) ordering change from
      xe_oa_remove_config_ioctl()
    
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/6614
    Fixes: cdf02fe1a94a7 ("drm/xe/oa/uapi: Add/remove OA config perf ops")
    Cc: <[email protected]> # v6.11+
    Suggested-by: Matthew Auld <[email protected]>
    Signed-off-by: Sanjay Yadav <[email protected]>
    Reviewed-by: Matthew Auld <[email protected]>
    Signed-off-by: Matthew Auld <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 28aeaed130e8e587fd1b73b6d66ca41ccc5a1a31)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe/oa: Limit num_syncs to prevent oversized allocations [+ + +]

Author: Shuicheng Lin <[email protected]>
Date:   Fri Dec 5 23:47:18 2025 +0000

    drm/xe/oa: Limit num_syncs to prevent oversized allocations
    
    [ Upstream commit f8dd66bfb4e184c71bd26418a00546ebe7f5c17a ]
    
    The OA open parameters did not validate num_syncs, allowing
    userspace to pass arbitrarily large values, potentially
    leading to excessive allocations.
    
    Add check to ensure that num_syncs does not exceed DRM_XE_MAX_SYNCS,
    returning -EINVAL when the limit is violated.
    
    v2: use XE_IOCTL_DBG() and drop duplicated check. (Ashutosh)
    
    Fixes: c8507a25cebd ("drm/xe/oa/uapi: Define and parse OA sync properties")
    Cc: Matthew Brost <[email protected]>
    Cc: Ashutosh Dixit <[email protected]>
    Signed-off-by: Shuicheng Lin <[email protected]>
    Reviewed-by: Ashutosh Dixit <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit e057b2d2b8d815df3858a87dffafa2af37e5945b)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/xe: Adjust long-running workload timeslices to reasonable values [+ + +]

Author: Matthew Brost <[email protected]>
Date:   Fri Dec 12 10:28:41 2025 -0800

    drm/xe: Adjust long-running workload timeslices to reasonable values
    
    commit 6f0f404bd289d79a260b634c5b3f4d330b13472c upstream.
    
    A 10ms timeslice for long-running workloads is far too long and causes
    significant jitter in benchmarks when the system is shared. Adjust the
    value to 5ms for preempt-fencing VMs, as the resume step there is quite
    costly as memory is moved around, and set it to zero for pagefault VMs,
    since switching back to pagefault mode after dma-fence mode is
    relatively fast.
    
    Also change min_run_period_ms to 'unsiged int' type rather than 's64' as
    only positive values make sense.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Cc: [email protected]
    Signed-off-by: Matthew Brost <[email protected]>
    Reviewed-by: Thomas Hellström <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 33a5abd9a68394aa67f9618b20eee65ee8702ff4)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe: Drop preempt-fences when destroying imported dma-bufs. [+ + +]

Author: Thomas Hellström <[email protected]>
Date:   Wed Dec 17 10:34:41 2025 +0100

    drm/xe: Drop preempt-fences when destroying imported dma-bufs.
    
    commit fe3ccd24138fd391ae8e32289d492c85f67770fc upstream.
    
    When imported dma-bufs are destroyed, TTM is not fully
    individualizing the dma-resv, but it *is* copying the fences that
    need to be waited for before declaring idle. So in the case where
    the bo->resv != bo->_resv we can still drop the preempt-fences, but
    make sure we do that on bo->_resv which contains the fence-pointer
    copy.
    
    In the case where the copying fails, bo->_resv will typically not
    contain any fences pointers at all, so there will be nothing to
    drop. In that case, TTM would have ensured all fences that would
    have been copied are signaled, including any remaining preempt
    fences.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Fixes: fa0af721bd1f ("drm/ttm: test private resv obj on release/destroy")
    Cc: Matthew Brost <[email protected]>
    Cc: <[email protected]> # v6.16+
    Signed-off-by: Thomas Hellström <[email protected]>
    Tested-by: Matthew Brost <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 425fe550fb513b567bd6d01f397d274092a9c274)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

drm/xe: Limit num_syncs to prevent oversized allocations [+ + +]

Author: Shuicheng Lin <[email protected]>
Date:   Fri Dec 5 23:47:17 2025 +0000

    drm/xe: Limit num_syncs to prevent oversized allocations
    
    [ Upstream commit 8e461304009135270e9ccf2d7e2dfe29daec9b60 ]
    
    The exec and vm_bind ioctl allow userspace to specify an arbitrary
    num_syncs value. Without bounds checking, a very large num_syncs
    can force an excessively large allocation, leading to kernel warnings
    from the page allocator as below.
    
    Introduce DRM_XE_MAX_SYNCS (set to 1024) and reject any request
    exceeding this limit.
    
    "
    ------------[ cut here ]------------
    WARNING: CPU: 0 PID: 1217 at mm/page_alloc.c:5124 __alloc_frozen_pages_noprof+0x2f8/0x2180 mm/page_alloc.c:5124
    ...
    Call Trace:
     <TASK>
     alloc_pages_mpol+0xe4/0x330 mm/mempolicy.c:2416
     ___kmalloc_large_node+0xd8/0x110 mm/slub.c:4317
     __kmalloc_large_node_noprof+0x18/0xe0 mm/slub.c:4348
     __do_kmalloc_node mm/slub.c:4364 [inline]
     __kmalloc_noprof+0x3d4/0x4b0 mm/slub.c:4388
     kmalloc_noprof include/linux/slab.h:909 [inline]
     kmalloc_array_noprof include/linux/slab.h:948 [inline]
     xe_exec_ioctl+0xa47/0x1e70 drivers/gpu/drm/xe/xe_exec.c:158
     drm_ioctl_kernel+0x1f1/0x3e0 drivers/gpu/drm/drm_ioctl.c:797
     drm_ioctl+0x5e7/0xc50 drivers/gpu/drm/drm_ioctl.c:894
     xe_drm_ioctl+0x10b/0x170 drivers/gpu/drm/xe/xe_device.c:224
     vfs_ioctl fs/ioctl.c:51 [inline]
     __do_sys_ioctl fs/ioctl.c:598 [inline]
     __se_sys_ioctl fs/ioctl.c:584 [inline]
     __x64_sys_ioctl+0x18b/0x210 fs/ioctl.c:584
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xbb/0x380 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    ...
    "
    
    v2: Add "Reported-by" and Cc stable kernels.
    v3: Change XE_MAX_SYNCS from 64 to 1024. (Matt & Ashutosh)
    v4: s/XE_MAX_SYNCS/DRM_XE_MAX_SYNCS/ (Matt)
    v5: Do the check at the top of the exec func. (Matt)
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Reported-by: Koen Koning <[email protected]>
    Reported-by: Peter Senna Tschudin <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/6450
    Cc: <[email protected]> # v6.12+
    Cc: Matthew Brost <[email protected]>
    Cc: Michal Mrozek <[email protected]>
    Cc: Carl Zhang <[email protected]>
    Cc: José Roberto de Souza <[email protected]>
    Cc: Lionel Landwerlin <[email protected]>
    Cc: Ivan Briano <[email protected]>
    Cc: Thomas Hellström <[email protected]>
    Cc: Ashutosh Dixit <[email protected]>
    Signed-off-by: Shuicheng Lin <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit b07bac9bd708ec468cd1b8a5fe70ae2ac9b0a11c)
    Signed-off-by: Thomas Hellström <[email protected]>
    Stable-dep-of: f8dd66bfb4e1 ("drm/xe/oa: Limit num_syncs to prevent oversized allocations")
    Signed-off-by: Sasha Levin <[email protected]>

drm/xe: Restore engine registers before restarting schedulers after GT reset [+ + +]

Author: Jan Maslak <[email protected]>
Date:   Wed Dec 10 15:56:18 2025 +0100

    drm/xe: Restore engine registers before restarting schedulers after GT reset
    
    [ Upstream commit eed5b815fa49c17d513202f54e980eb91955d3ed ]
    
    During GT reset recovery in do_gt_restart(), xe_uc_start() was called
    before xe_reg_sr_apply_mmio() restored engine-specific registers. This
    created a race window where the scheduler could run jobs before hardware
    state was fully restored.
    
    This caused failures in eudebug tests (xe_exec_sip_eudebug@breakpoint-
    waitsip-*) where TD_CTL register (containing TD_CTL_GLOBAL_DEBUG_ENABLE)
    wasn't restored before jobs started executing. Breakpoints would fail to
    trigger SIP entry because the debug enable bit wasn't set yet.
    
    Fix by moving xe_uc_start() after all MMIO register restoration,
    including engine registers and CCS mode configuration, ensuring all
    hardware state is fully restored before any jobs can be scheduled.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Signed-off-by: Jan Maslak <[email protected]>
    Reviewed-by: Jonathan Cavitt <[email protected]>
    Reviewed-by: Matthew Brost <[email protected]>
    Signed-off-by: Matthew Brost <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit 825aed0328588b2837636c1c5a0c48795d724617)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/xe: Use usleep_range for accurate long-running workload timeslicing [+ + +]

Author: Matthew Brost <[email protected]>
Date:   Fri Dec 12 10:28:42 2025 -0800

    drm/xe: Use usleep_range for accurate long-running workload timeslicing
    
    commit 80f9c601d9c4d26f00356c0a9c461650e7089273 upstream.
    
    msleep is not very accurate in terms of how long it actually sleeps,
    whereas usleep_range is precise. Replace the timeslice sleep for
    long-running workloads with the more accurate usleep_range to avoid
    jitter if the sleep period is less than 20ms.
    
    Fixes: dd08ebf6c352 ("drm/xe: Introduce a new DRM driver for Intel GPUs")
    Cc: [email protected]
    Signed-off-by: Matthew Brost <[email protected]>
    Reviewed-by: Thomas Hellström <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    (cherry picked from commit ca415c4d4c17ad676a2c8981e1fcc432221dce79)
    Signed-off-by: Thomas Hellström <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: mmc: sdhci-of-aspeed: Switch ref to sdhci-common.yaml [+ + +]

Author: Andrew Jeffery <[email protected]>
Date:   Thu Dec 11 17:45:48 2025 +0900

    dt-bindings: mmc: sdhci-of-aspeed: Switch ref to sdhci-common.yaml
    
    commit ed724ea1b82a800af4704311cb89e5ef1b4ea7ac upstream.
    
    Enable use of common SDHCI-related properties such as sdhci-caps-mask as
    found in the AST2600 EVB DTS.
    
    Cc: [email protected] # v6.2+
    Signed-off-by: Andrew Jeffery <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sc7280: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:45 2025 +0100

    dt-bindings: PCI: qcom,pcie-sc7280: Add missing required power-domains and resets
    
    commit ef99c2efeacac7758cc8c2d00e3200100a4da16c upstream.
    
    Commit 756485bfbb85 ("dt-bindings: PCI: qcom,pcie-sc7280: Move SC7280 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: 756485bfbb85 ("dt-bindings: PCI: qcom,pcie-sc7280: Move SC7280 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-2-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sc8280xp: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:46 2025 +0100

    dt-bindings: PCI: qcom,pcie-sc8280xp: Add missing required power-domains and resets
    
    commit ea551601404d286813aef6819ddf0bf1d7d69a24 upstream.
    
    Commit c007a5505504 ("dt-bindings: PCI: qcom,pcie-sc8280xp: Move
    SC8280XP to dedicated schema") move the device schema to separate file,
    but it missed a "if:not:...then:" clause in the original binding which
    was requiring power-domains and resets for this particular chip.
    
    Fixes: c007a5505504 ("dt-bindings: PCI: qcom,pcie-sc8280xp: Move SC8280XP to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-3-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sm8150: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:47 2025 +0100

    dt-bindings: PCI: qcom,pcie-sm8150: Add missing required power-domains and resets
    
    commit 31cb432b62fb796e0c1084542ba39311d2f716d5 upstream.
    
    Commit 51bc04d5b49d ("dt-bindings: PCI: qcom,pcie-sm8150: Move SM8150 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: 51bc04d5b49d ("dt-bindings: PCI: qcom,pcie-sm8150: Move SM8150 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-4-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sm8250: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:48 2025 +0100

    dt-bindings: PCI: qcom,pcie-sm8250: Add missing required power-domains and resets
    
    commit 2620c6bcd8c141b79ff2afe95dc814dfab644f63 upstream.
    
    Commit 4891b66185c1 ("dt-bindings: PCI: qcom,pcie-sm8250: Move SM8250 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: 4891b66185c1 ("dt-bindings: PCI: qcom,pcie-sm8250: Move SM8250 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-5-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sm8350: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:49 2025 +0100

    dt-bindings: PCI: qcom,pcie-sm8350: Add missing required power-domains and resets
    
    commit 012ba0d5f02e1f192eda263b5f9f826e47d607bb upstream.
    
    Commit 2278b8b54773 ("dt-bindings: PCI: qcom,pcie-sm8350: Move SM8350 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: 2278b8b54773 ("dt-bindings: PCI: qcom,pcie-sm8350: Move SM8350 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-6-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sm8450: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:50 2025 +0100

    dt-bindings: PCI: qcom,pcie-sm8450: Add missing required power-domains and resets
    
    commit 667facc4000c49a7c280097ef6638f133bcb1e59 upstream.
    
    Commit 88c9b3af4e31 ("dt-bindings: PCI: qcom,pcie-sm8450: Move SM8450 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: 88c9b3af4e31 ("dt-bindings: PCI: qcom,pcie-sm8450: Move SM8450 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-7-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

dt-bindings: PCI: qcom,pcie-sm8550: Add missing required power-domains and resets [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 30 09:50:51 2025 +0100

    dt-bindings: PCI: qcom,pcie-sm8550: Add missing required power-domains and resets
    
    commit e60c6f34b9f3a83f96006243c0ef96c134520257 upstream.
    
    Commit b8d3404058a6 ("dt-bindings: PCI: qcom,pcie-sm8550: Move SM8550 to
    dedicated schema") move the device schema to separate file, but it
    missed a "if:not:...then:" clause in the original binding which was
    requiring power-domains and resets for this particular chip.
    
    Fixes: b8d3404058a6 ("dt-bindings: PCI: qcom,pcie-sm8550: Move SM8550 to dedicated schema")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/20251030-dt-bindings-pci-qcom-fixes-power-domains-v2-8-28c1f11599fe@linaro.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

e1000: fix OOB in e1000_tbi_should_accept() [+ + +]

Author: Guangshuo Li <[email protected]>
Date:   Mon Dec 1 11:40:58 2025 +0800

    e1000: fix OOB in e1000_tbi_should_accept()
    
    commit 9c72a5182ed92904d01057f208c390a303f00a0f upstream.
    
    In e1000_tbi_should_accept() we read the last byte of the frame via
    'data[length - 1]' to evaluate the TBI workaround. If the descriptor-
    reported length is zero or larger than the actual RX buffer size, this
    read goes out of bounds and can hit unrelated slab objects. The issue
    is observed from the NAPI receive path (e1000_clean_rx_irq):
    
    ==================================================================
    BUG: KASAN: slab-out-of-bounds in e1000_tbi_should_accept+0x610/0x790
    Read of size 1 at addr ffff888014114e54 by task sshd/363
    
    CPU: 0 PID: 363 Comm: sshd Not tainted 5.18.0-rc1 #1
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.12.0-59-gc9ba5276e321-prebuilt.qemu.org 04/01/2014
    Call Trace:
     <IRQ>
     dump_stack_lvl+0x5a/0x74
     print_address_description+0x7b/0x440
     print_report+0x101/0x200
     kasan_report+0xc1/0xf0
     e1000_tbi_should_accept+0x610/0x790
     e1000_clean_rx_irq+0xa8c/0x1110
     e1000_clean+0xde2/0x3c10
     __napi_poll+0x98/0x380
     net_rx_action+0x491/0xa20
     __do_softirq+0x2c9/0x61d
     do_softirq+0xd1/0x120
     </IRQ>
     <TASK>
     __local_bh_enable_ip+0xfe/0x130
     ip_finish_output2+0x7d5/0xb00
     __ip_queue_xmit+0xe24/0x1ab0
     __tcp_transmit_skb+0x1bcb/0x3340
     tcp_write_xmit+0x175d/0x6bd0
     __tcp_push_pending_frames+0x7b/0x280
     tcp_sendmsg_locked+0x2e4f/0x32d0
     tcp_sendmsg+0x24/0x40
     sock_write_iter+0x322/0x430
     vfs_write+0x56c/0xa60
     ksys_write+0xd1/0x190
     do_syscall_64+0x43/0x90
     entry_SYSCALL_64_after_hwframe+0x44/0xae
    RIP: 0033:0x7f511b476b10
    Code: 73 01 c3 48 8b 0d 88 d3 2b 00 f7 d8 64 89 01 48 83 c8 ff c3 66 0f 1f 44 00 00 83 3d f9 2b 2c 00 00 75 10 b8 01 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 31 c3 48 83 ec 08 e8 8e 9b 01 00 48 89 04 24
    RSP: 002b:00007ffc9211d4e8 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
    RAX: ffffffffffffffda RBX: 0000000000004024 RCX: 00007f511b476b10
    RDX: 0000000000004024 RSI: 0000559a9385962c RDI: 0000000000000003
    RBP: 0000559a9383a400 R08: fffffffffffffff0 R09: 0000000000004f00
    R10: 0000000000000070 R11: 0000000000000246 R12: 0000000000000000
    R13: 00007ffc9211d57f R14: 0000559a9347bde7 R15: 0000000000000003
     </TASK>
    Allocated by task 1:
     __kasan_krealloc+0x131/0x1c0
     krealloc+0x90/0xc0
     add_sysfs_param+0xcb/0x8a0
     kernel_add_sysfs_param+0x81/0xd4
     param_sysfs_builtin+0x138/0x1a6
     param_sysfs_init+0x57/0x5b
     do_one_initcall+0x104/0x250
     do_initcall_level+0x102/0x132
     do_initcalls+0x46/0x74
     kernel_init_freeable+0x28f/0x393
     kernel_init+0x14/0x1a0
     ret_from_fork+0x22/0x30
    The buggy address belongs to the object at ffff888014114000
     which belongs to the cache kmalloc-2k of size 2048
    The buggy address is located 1620 bytes to the right of
     2048-byte region [ffff888014114000, ffff888014114800]
    The buggy address belongs to the physical page:
    page:ffffea0000504400 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x14110
    head:ffffea0000504400 order:3 compound_mapcount:0 compound_pincount:0
    flags: 0x100000000010200(slab|head|node=0|zone=1)
    raw: 0100000000010200 0000000000000000 dead000000000001 ffff888013442000
    raw: 0000000000000000 0000000000080008 00000001ffffffff 0000000000000000
    page dumped because: kasan: bad access detected
    ==================================================================
    
    This happens because the TBI check unconditionally dereferences the last
    byte without validating the reported length first:
    
            u8 last_byte = *(data + length - 1);
    
    Fix by rejecting the frame early if the length is zero, or if it exceeds
    adapter->rx_buffer_len. This preserves the TBI workaround semantics for
    valid frames and prevents touching memory beyond the RX buffer.
    
    Fixes: 2037110c96d5 ("e1000: move tbi workaround code into helper function")
    Cc: [email protected]
    Signed-off-by: Guangshuo Li <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

erofs: fix unexpected EIO under memory pressure [+ + +]

Author: Junbeom Yeom <[email protected]>
Date:   Mon Dec 29 15:21:42 2025 -0500

    erofs: fix unexpected EIO under memory pressure
    
    [ Upstream commit 4012d78562193ef5eb613bad4b0c0fa187637cfe ]
    
    erofs readahead could fail with ENOMEM under the memory pressure because
    it tries to alloc_page with GFP_NOWAIT | GFP_NORETRY, while GFP_KERNEL
    for a regular read. And if readahead fails (with non-uptodate folios),
    the original request will then fall back to synchronous read, and
    `.read_folio()` should return appropriate errnos.
    
    However, in scenarios where readahead and read operations compete,
    read operation could return an unintended EIO because of an incorrect
    error propagation.
    
    To resolve this, this patch modifies the behavior so that, when the
    PCL is for read(which means pcl.besteffort is true), it attempts actual
    decompression instead of propagating the privios error except initial EIO.
    
    - Page size: 4K
    - The original size of FileA: 16K
    - Compress-ratio per PCL: 50% (Uncompressed 8K -> Compressed 4K)
    [page0, page1] [page2, page3]
    [PCL0]---------[PCL1]
    
    - functions declaration:
      . pread(fd, buf, count, offset)
      . readahead(fd, offset, count)
    - Thread A tries to read the last 4K
    - Thread B tries to do readahead 8K from 4K
    - RA, besteffort == false
    - R, besteffort == true
    
            <process A>                   <process B>
    
    pread(FileA, buf, 4K, 12K)
      do readahead(page3) // failed with ENOMEM
      wait_lock(page3)
        if (!uptodate(page3))
          goto do_read
                                   readahead(FileA, 4K, 8K)
                                   // Here create PCL-chain like below:
                                   // [null, page1] [page2, null]
                                   //   [PCL0:RA]-----[PCL1:RA]
    ...
      do read(page3)        // found [PCL1:RA] and add page3 into it,
                            // and then, change PCL1 from RA to R
    ...
                                   // Now, PCL-chain is as below:
                                   // [null, page1] [page2, page3]
                                   //   [PCL0:RA]-----[PCL1:R]
    
                                     // try to decompress PCL-chain...
                                     z_erofs_decompress_queue
                                       err = 0;
    
                                       // failed with ENOMEM, so page 1
                                       // only for RA will not be uptodated.
                                       // it's okay.
                                       err = decompress([PCL0:RA], err)
    
                                       // However, ENOMEM propagated to next
                                       // PCL, even though PCL is not only
                                       // for RA but also for R. As a result,
                                       // it just failed with ENOMEM without
                                       // trying any decompression, so page2
                                       // and page3 will not be uptodated.
                    ** BUG HERE ** --> err = decompress([PCL1:R], err)
    
                                       return err as ENOMEM
    ...
        wait_lock(page3)
          if (!uptodate(page3))
            return EIO      <-- Return an unexpected EIO!
    ...
    
    Fixes: 2349d2fa02db ("erofs: sunset unneeded NOFAILs")
    Cc: [email protected]
    Reviewed-by: Jaewook Kim <[email protected]>
    Reviewed-by: Sungjong Seo <[email protected]>
    Signed-off-by: Junbeom Yeom <[email protected]>
    Reviewed-by: Gao Xiang <[email protected]>
    Signed-off-by: Gao Xiang <[email protected]>
    [ Adjust context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ethtool: Avoid overflowing userspace buffer on stats query [+ + +]

Author: Gal Pressman <[email protected]>
Date:   Mon Dec 8 14:19:01 2025 +0200

    ethtool: Avoid overflowing userspace buffer on stats query
    
    [ Upstream commit 7b07be1ff1cb6c49869910518650e8d0abc7d25f ]
    
    The ethtool -S command operates across three ioctl calls:
    ETHTOOL_GSSET_INFO for the size, ETHTOOL_GSTRINGS for the names, and
    ETHTOOL_GSTATS for the values.
    
    If the number of stats changes between these calls (e.g., due to device
    reconfiguration), userspace's buffer allocation will be incorrect,
    potentially leading to buffer overflow.
    
    Drivers are generally expected to maintain stable stat counts, but some
    drivers (e.g., mlx5, bnx2x, bna, ksz884x) use dynamic counters, making
    this scenario possible.
    
    Some drivers try to handle this internally:
    - bnad_get_ethtool_stats() returns early in case stats.n_stats is not
      equal to the driver's stats count.
    - micrel/ksz884x also makes sure not to write anything beyond
      stats.n_stats and overflow the buffer.
    
    However, both use stats.n_stats which is already assigned with the value
    returned from get_sset_count(), hence won't solve the issue described
    here.
    
    Change ethtool_get_strings(), ethtool_get_stats(),
    ethtool_get_phy_stats() to not return anything in case of a mismatch
    between userspace's size and get_sset_size(), to prevent buffer
    overflow.
    The returned n_stats value will be equal to zero, to reflect that
    nothing has been returned.
    
    This could result in one of two cases when using upstream ethtool,
    depending on when the size change is detected:
    1. When detected in ethtool_get_strings():
        # ethtool -S eth2
        no stats available
    
    2. When detected in get stats, all stats will be reported as zero.
    
    Both cases are presumably transient, and a subsequent ethtool call
    should succeed.
    
    Other than the overflow avoidance, these two cases are very evident (no
    output/cleared stats), which is arguably better than presenting
    incorrect/shifted stats.
    I also considered returning an error instead of a "silent" response, but
    that seems more destructive towards userspace apps.
    
    Notes:
    - This patch does not claim to fix the inherent race, it only makes sure
      that we do not overflow the userspace buffer, and makes for a more
      predictable behavior.
    
    - RTNL lock is held during each ioctl, the race window exists between
      the separate ioctl calls when the lock is released.
    
    - Userspace ethtool always fills stats.n_stats, but it is likely that
      these stats ioctls are implemented in other userspace applications
      which might not fill it. The added code checks that it's not zero,
      to prevent any regressions.
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Reviewed-by: Dragos Tatulea <[email protected]>
    Reviewed-by: Tariq Toukan <[email protected]>
    Signed-off-by: Gal Pressman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

exfat: fix remount failure in different process environments [+ + +]

Author: Yuezhang Mo <[email protected]>
Date:   Fri Nov 28 17:51:10 2025 +0800

    exfat: fix remount failure in different process environments
    
    [ Upstream commit 51fc7b4ce10ccab8ea5e4876bcdc42cf5202a0ef ]
    
    The kernel test robot reported that the exFAT remount operation
    failed. The reason for the failure was that the process's umask
    is different between mount and remount, causing fs_fmask and
    fs_dmask are changed.
    
    Potentially, both gid and uid may also be changed. Therefore, when
    initializing fs_context for remount, inherit these mount options
    from the options used during mount.
    
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-lkp/[email protected]
    Signed-off-by: Yuezhang Mo <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

exfat: zero out post-EOF page cache on file extension [+ + +]

Author: Yuezhang Mo <[email protected]>
Date:   Mon Oct 27 17:03:41 2025 +0800

    exfat: zero out post-EOF page cache on file extension
    
    [ Upstream commit 4e163c39dd4e70fcdce948b8774d96e0482b4a11 ]
    
    xfstests generic/363 was failing due to unzeroed post-EOF page
    cache that allowed mmap writes beyond EOF to become visible
    after file extension.
    
    For example, in following xfs_io sequence, 0x22 should not be
    written to the file but would become visible after the extension:
    
      xfs_io -f -t -c "pwrite -S 0x11 0 8" \
        -c "mmap 0 4096" \
        -c "mwrite -S 0x22 32 32" \
        -c "munmap" \
        -c "pwrite -S 0x33 512 32" \
        $testfile
    
    This violates the expected behavior where writes beyond EOF via
    mmap should not persist after the file is extended. Instead, the
    extended region should contain zeros.
    
    Fix this by using truncate_pagecache() to truncate the page cache
    after the current EOF when extending the file.
    
    Signed-off-by: Yuezhang Mo <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ext4: align max orphan file size with e2fsprogs limit [+ + +]

Author: Baokun Li <[email protected]>
Date:   Thu Nov 20 21:42:33 2025 +0800

    ext4: align max orphan file size with e2fsprogs limit
    
    commit 7c11c56eb32eae96893eebafdbe3decadefe88ad upstream.
    
    Kernel commit 0a6ce20c1564 ("ext4: verify orphan file size is not too big")
    limits the maximum supported orphan file size to 8 << 20.
    
    However, in e2fsprogs, the orphan file size is set to 32–512 filesystem
    blocks when creating a filesystem.
    
    With 64k block size, formatting an ext4 fs >32G gives an orphan file bigger
    than the kernel allows, so mount prints an error and fails:
    
        EXT4-fs (vdb): orphan file too big: 8650752
        EXT4-fs (vdb): mount failed
    
    To prevent this issue and allow previously created 64KB filesystems to
    mount, we updates the maximum allowed orphan file size in the kernel to
    512 filesystem blocks.
    
    Fixes: 0a6ce20c1564 ("ext4: verify orphan file size is not too big")
    Signed-off-by: Baokun Li <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: clear i_state_flags when alloc inode [+ + +]

Author: Haibo Chen <[email protected]>
Date:   Tue Nov 4 16:12:24 2025 +0800

    ext4: clear i_state_flags when alloc inode
    
    commit 4091c8206cfd2e3bb529ef260887296b90d9b6a2 upstream.
    
    i_state_flags used on 32-bit archs, need to clear this flag when
    alloc inode.
    Find this issue when umount ext4, sometimes track the inode as orphan
    accidently, cause ext4 mesg dump.
    
    Fixes: acf943e9768e ("ext4: fix checks for orphan inodes")
    Signed-off-by: Haibo Chen <[email protected]>
    Reviewed-by: Baokun Li <[email protected]>
    Reviewed-by: Zhang Yi <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: fix incorrect group number assertion in mb_check_buddy [+ + +]

Author: Yongjian Sun <[email protected]>
Date:   Thu Nov 6 14:06:13 2025 +0800

    ext4: fix incorrect group number assertion in mb_check_buddy
    
    commit 3f7a79d05c692c7cfec70bf104b1b3c3d0ce6247 upstream.
    
    When the MB_CHECK_ASSERT macro is enabled, an assertion failure can
    occur in __mb_check_buddy when checking preallocated blocks (pa) in
    a block group:
    
    Assertion failure in mb_free_blocks() : "groupnr == e4b->bd_group"
    
    This happens when a pa at the very end of a block group (e.g.,
    pa_pstart=32765, pa_len=3 in a group of 32768 blocks) becomes
    exhausted - its pa_pstart is advanced by pa_len to 32768, which
    lies in the next block group. If this exhausted pa (with pa_len == 0)
    is still in the bb_prealloc_list during the buddy check, the assertion
    incorrectly flags it as belonging to the wrong group. A possible
    sequence is as follows:
    
    ext4_mb_new_blocks
      ext4_mb_release_context
        pa->pa_pstart += EXT4_C2B(sbi, ac->ac_b_ex.fe_len)
        pa->pa_len -= ac->ac_b_ex.fe_len
    
                             __mb_check_buddy
                               for each pa in group
                                 ext4_get_group_no_and_offset
                                 MB_CHECK_ASSERT(groupnr == e4b->bd_group)
    
    To fix this, we modify the check to skip block group validation for
    exhausted preallocations (where pa_len == 0). Such entries are in a
    transitional state and will be removed from the list soon, so they
    should not trigger an assertion. This change prevents the false
    positive while maintaining the integrity of the checks for active
    allocations.
    
    Fixes: c9de560ded61f ("ext4: Add multi block allocator for ext4")
    Signed-off-by: Yongjian Sun <[email protected]>
    Reviewed-by: Baokun Li <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: fix string copying in parse_apply_sb_mount_options() [+ + +]

Author: Fedor Pchelkin <[email protected]>
Date:   Sat Nov 1 19:04:28 2025 +0300

    ext4: fix string copying in parse_apply_sb_mount_options()
    
    commit ee5a977b4e771cc181f39d504426dbd31ed701cc upstream.
    
    strscpy_pad() can't be used to copy a non-NUL-term string into a NUL-term
    string of possibly bigger size.  Commit 0efc5990bca5 ("string.h: Introduce
    memtostr() and memtostr_pad()") provides additional information in that
    regard.  So if this happens, the following warning is observed:
    
    strnlen: detected buffer overflow: 65 byte read of buffer size 64
    WARNING: CPU: 0 PID: 28655 at lib/string_helpers.c:1032 __fortify_report+0x96/0xc0 lib/string_helpers.c:1032
    Modules linked in:
    CPU: 0 UID: 0 PID: 28655 Comm: syz-executor.3 Not tainted 6.12.54-syzkaller-00144-g5f0270f1ba00 #0
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    RIP: 0010:__fortify_report+0x96/0xc0 lib/string_helpers.c:1032
    Call Trace:
     <TASK>
     __fortify_panic+0x1f/0x30 lib/string_helpers.c:1039
     strnlen include/linux/fortify-string.h:235 [inline]
     sized_strscpy include/linux/fortify-string.h:309 [inline]
     parse_apply_sb_mount_options fs/ext4/super.c:2504 [inline]
     __ext4_fill_super fs/ext4/super.c:5261 [inline]
     ext4_fill_super+0x3c35/0xad00 fs/ext4/super.c:5706
     get_tree_bdev_flags+0x387/0x620 fs/super.c:1636
     vfs_get_tree+0x93/0x380 fs/super.c:1814
     do_new_mount fs/namespace.c:3553 [inline]
     path_mount+0x6ae/0x1f70 fs/namespace.c:3880
     do_mount fs/namespace.c:3893 [inline]
     __do_sys_mount fs/namespace.c:4103 [inline]
     __se_sys_mount fs/namespace.c:4080 [inline]
     __x64_sys_mount+0x280/0x300 fs/namespace.c:4080
     do_syscall_x64 arch/x86/entry/common.c:52 [inline]
     do_syscall_64+0x64/0x140 arch/x86/entry/common.c:83
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    Since userspace is expected to provide s_mount_opts field to be at most 63
    characters long with the ending byte being NUL-term, use a 64-byte buffer
    which matches the size of s_mount_opts, so that strscpy_pad() does its job
    properly.  Return with error if the user still managed to provide a
    non-NUL-term string here.
    
    Found by Linux Verification Center (linuxtesting.org) with Syzkaller.
    
    Fixes: 8ecb790ea8c3 ("ext4: avoid potential buffer over-read in parse_apply_sb_mount_options()")
    Cc: [email protected]
    Signed-off-by: Fedor Pchelkin <[email protected]>
    Reviewed-by: Baokun Li <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: xattr: fix null pointer deref in ext4_raw_inode() [+ + +]

Author: Karina Yankevich <[email protected]>
Date:   Wed Oct 22 12:32:53 2025 +0300

    ext4: xattr: fix null pointer deref in ext4_raw_inode()
    
    commit b97cb7d6a051aa6ebd57906df0e26e9e36c26d14 upstream.
    
    If ext4_get_inode_loc() fails (e.g. if it returns -EFSCORRUPTED),
    iloc.bh will remain set to NULL. Since ext4_xattr_inode_dec_ref_all()
    lacks error checking, this will lead to a null pointer dereference
    in ext4_raw_inode(), called right after ext4_get_inode_loc().
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: c8e008b60492 ("ext4: ignore xattrs past end")
    Cc: [email protected]
    Signed-off-by: Karina Yankevich <[email protected]>
    Reviewed-by: Sergey Shtylyov <[email protected]>
    Reviewed-by: Baokun Li <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: add timeout in f2fs_enable_checkpoint() [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 11:22:52 2025 -0500

    f2fs: add timeout in f2fs_enable_checkpoint()
    
    [ Upstream commit 4bc347779698b5e67e1514bab105c2c083e55502 ]
    
    During f2fs_enable_checkpoint() in remount(), if we flush a large
    amount of dirty pages into slow device, it may take long time which
    will block write IO, let's add a timeout machanism during dirty
    pages flush to avoid long time block in f2fs_enable_checkpoint().
    
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Stable-dep-of: be112e7449a6 ("f2fs: fix to propagate error from f2fs_enable_checkpoint()")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: clear SBI_POR_DOING before initing inmem curseg [+ + +]

Author: Sheng Yong <[email protected]>
Date:   Tue Dec 30 11:22:51 2025 -0500

    f2fs: clear SBI_POR_DOING before initing inmem curseg
    
    [ Upstream commit f88c7904b5c7e35ab8037e2a59e10d80adf6fd7e ]
    
    SBI_POR_DOING can be cleared after recovery is completed, so that
    changes made before recovery can be persistent, and subsequent
    errors can be recorded into cp/sb.
    
    Signed-off-by: Song Feng <[email protected]>
    Signed-off-by: Yongpeng Yang <[email protected]>
    Signed-off-by: Sheng Yong <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Stable-dep-of: be112e7449a6 ("f2fs: fix to propagate error from f2fs_enable_checkpoint()")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: drop inode from the donation list when the last file is closed [+ + +]

Author: Jaegeuk Kim <[email protected]>
Date:   Tue Dec 30 11:15:26 2025 -0500

    f2fs: drop inode from the donation list when the last file is closed
    
    [ Upstream commit 078cad8212ce4f4ebbafcc0936475b8215e1ca2a ]
    
    Let's drop the inode from the donation list when there is no other
    open file.
    
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Stable-dep-of: 10b591e7fb7c ("f2fs: fix to avoid updating compression context during writeback")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: dump more information for f2fs_{enable,disable}_checkpoint() [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 11:22:53 2025 -0500

    f2fs: dump more information for f2fs_{enable,disable}_checkpoint()
    
    [ Upstream commit 80b6d1d2535a343e43d658777a46f1ebce8f3413 ]
    
    Changes as below:
    - print more logs for f2fs_{enable,disable}_checkpoint()
    - account and dump time stats for f2fs_enable_checkpoint()
    
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Stable-dep-of: be112e7449a6 ("f2fs: fix to propagate error from f2fs_enable_checkpoint()")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: ensure node page reads complete before f2fs_put_super() finishes [+ + +]

Author: Jan Prusakowski <[email protected]>
Date:   Mon Oct 6 10:46:15 2025 +0200

    f2fs: ensure node page reads complete before f2fs_put_super() finishes
    
    commit 297baa4aa263ff8f5b3d246ee16a660d76aa82c4 upstream.
    
    Xfstests generic/335, generic/336 sometimes crash with the following message:
    
    F2FS-fs (dm-0): detect filesystem reference count leak during umount, type: 9, count: 1
    ------------[ cut here ]------------
    kernel BUG at fs/f2fs/super.c:1939!
    Oops: invalid opcode: 0000 [#1] SMP NOPTI
    CPU: 1 UID: 0 PID: 609351 Comm: umount Tainted: G        W           6.17.0-rc5-xfstests-g9dd1835ecda5 #1 PREEMPT(none)
    Tainted: [W]=WARN
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    RIP: 0010:f2fs_put_super+0x3b3/0x3c0
    Call Trace:
     <TASK>
     generic_shutdown_super+0x7e/0x190
     kill_block_super+0x1a/0x40
     kill_f2fs_super+0x9d/0x190
     deactivate_locked_super+0x30/0xb0
     cleanup_mnt+0xba/0x150
     task_work_run+0x5c/0xa0
     exit_to_user_mode_loop+0xb7/0xc0
     do_syscall_64+0x1ae/0x1c0
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
     </TASK>
    ---[ end trace 0000000000000000 ]---
    
    It appears that sometimes it is possible that f2fs_put_super() is called before
    all node page reads are completed.
    Adding a call to f2fs_wait_on_all_pages() for F2FS_RD_NODE fixes the problem.
    
    Cc: [email protected]
    Fixes: 20872584b8c0b ("f2fs: fix to drop all dirty meta/node pages during umount()")
    Signed-off-by: Jan Prusakowski <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix age extent cache insertion skip on counter overflow [+ + +]

Author: Xiaole He <[email protected]>
Date:   Mon Oct 27 17:23:41 2025 +0800

    f2fs: fix age extent cache insertion skip on counter overflow
    
    commit 27bf6a637b7613fc85fa6af468b7d612d78cd5c0 upstream.
    
    The age extent cache uses last_blocks (derived from
    allocated_data_blocks) to determine data age. However, there's a
    conflict between the deletion
    marker (last_blocks=0) and legitimate last_blocks=0 cases when
    allocated_data_blocks overflows to 0 after reaching ULLONG_MAX.
    
    In this case, valid extents are incorrectly skipped due to the
    "if (!tei->last_blocks)" check in __update_extent_tree_range().
    
    This patch fixes the issue by:
    1. Reserving ULLONG_MAX as an invalid/deletion marker
    2. Limiting allocated_data_blocks to range [0, ULLONG_MAX-1]
    3. Using F2FS_EXTENT_AGE_INVALID for deletion scenarios
    4. Adjusting overflow age calculation from ULLONG_MAX to (ULLONG_MAX-1)
    
    Reproducer (using a patched kernel with allocated_data_blocks
    initialized to ULLONG_MAX - 3 for quick testing):
    
    Step 1: Mount and check initial state
      # dd if=/dev/zero of=/tmp/test.img bs=1M count=100
      # mkfs.f2fs -f /tmp/test.img
      # mkdir -p /mnt/f2fs_test
      # mount -t f2fs -o loop,age_extent_cache /tmp/test.img /mnt/f2fs_test
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 18446744073709551612 # ULLONG_MAX - 3
      Inner Struct Count: tree: 1(0), node: 0
    
    Step 2: Create files and write data to trigger overflow
      # touch /mnt/f2fs_test/{1,2,3,4}.txt; sync
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 18446744073709551613 # ULLONG_MAX - 2
      Inner Struct Count: tree: 5(0), node: 1
    
      # dd if=/dev/urandom of=/mnt/f2fs_test/1.txt bs=4K count=1; sync
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 18446744073709551614 # ULLONG_MAX - 1
      Inner Struct Count: tree: 5(0), node: 2
    
      # dd if=/dev/urandom of=/mnt/f2fs_test/2.txt bs=4K count=1; sync
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 18446744073709551615 # ULLONG_MAX
      Inner Struct Count: tree: 5(0), node: 3
    
      # dd if=/dev/urandom of=/mnt/f2fs_test/3.txt bs=4K count=1; sync
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 0 # Counter overflowed!
      Inner Struct Count: tree: 5(0), node: 4
    
    Step 3: Trigger the bug - next write should create node but gets skipped
      # dd if=/dev/urandom of=/mnt/f2fs_test/4.txt bs=4K count=1; sync
      # cat /sys/kernel/debug/f2fs/status | grep -A 4 "Block Age"
      Allocated Data Blocks: 1
      Inner Struct Count: tree: 5(0), node: 4
    
      Expected: node: 5 (new extent node for 4.txt)
      Actual: node: 4 (extent insertion was incorrectly skipped due to
      last_blocks = allocated_data_blocks = 0 in __get_new_block_age)
    
    After this fix, the extent node is correctly inserted and node count
    becomes 5 as expected.
    
    Fixes: 71644dff4811 ("f2fs: add block_age-based extent cache")
    Cc: [email protected]
    Signed-off-by: Xiaole He <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix return value of f2fs_recover_fsync_data() [+ + +]

Author: Chao Yu <[email protected]>
Date:   Wed Nov 5 14:50:22 2025 +0800

    f2fs: fix return value of f2fs_recover_fsync_data()
    
    commit 01fba45deaddcce0d0b01c411435d1acf6feab7b upstream.
    
    With below scripts, it will trigger panic in f2fs:
    
    mkfs.f2fs -f /dev/vdd
    mount /dev/vdd /mnt/f2fs
    touch /mnt/f2fs/foo
    sync
    echo 111 >> /mnt/f2fs/foo
    f2fs_io fsync /mnt/f2fs/foo
    f2fs_io shutdown 2 /mnt/f2fs
    umount /mnt/f2fs
    mount -o ro,norecovery /dev/vdd /mnt/f2fs
    or
    mount -o ro,disable_roll_forward /dev/vdd /mnt/f2fs
    
    F2FS-fs (vdd): f2fs_recover_fsync_data: recovery fsync data, check_only: 0
    F2FS-fs (vdd): Mounted with checkpoint version = 7f5c361f
    F2FS-fs (vdd): Stopped filesystem due to reason: 0
    F2FS-fs (vdd): f2fs_recover_fsync_data: recovery fsync data, check_only: 1
    Filesystem f2fs get_tree() didn't set fc->root, returned 1
    ------------[ cut here ]------------
    kernel BUG at fs/super.c:1761!
    Oops: invalid opcode: 0000 [#1] SMP PTI
    CPU: 3 UID: 0 PID: 722 Comm: mount Not tainted 6.18.0-rc2+ #721 PREEMPT(voluntary)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    RIP: 0010:vfs_get_tree.cold+0x18/0x1a
    Call Trace:
     <TASK>
     fc_mount+0x13/0xa0
     path_mount+0x34e/0xc50
     __x64_sys_mount+0x121/0x150
     do_syscall_64+0x84/0x800
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
    RIP: 0033:0x7fa6cc126cfe
    
    The root cause is we missed to handle error number returned from
    f2fs_recover_fsync_data() when mounting image w/ ro,norecovery or
    ro,disable_roll_forward mount option, result in returning a positive
    error number to vfs_get_tree(), fix it.
    
    Cc: [email protected]
    Fixes: 6781eabba1bd ("f2fs: give -EINVAL for norecovery and rw mount")
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to avoid potential deadlock [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Oct 14 19:47:35 2025 +0800

    f2fs: fix to avoid potential deadlock
    
    commit ca8b201f28547e28343a6f00a6e91fa8c09572fe upstream.
    
    As Jiaming Zhang and syzbot reported, there is potential deadlock in
    f2fs as below:
    
    Chain exists of:
      &sbi->cp_rwsem --> fs_reclaim --> sb_internal#2
    
     Possible unsafe locking scenario:
    
           CPU0                    CPU1
           ----                    ----
      rlock(sb_internal#2);
                                   lock(fs_reclaim);
                                   lock(sb_internal#2);
      rlock(&sbi->cp_rwsem);
    
     *** DEADLOCK ***
    
    3 locks held by kswapd0/73:
     #0: ffffffff8e247a40 (fs_reclaim){+.+.}-{0:0}, at: balance_pgdat mm/vmscan.c:7015 [inline]
     #0: ffffffff8e247a40 (fs_reclaim){+.+.}-{0:0}, at: kswapd+0x951/0x2800 mm/vmscan.c:7389
     #1: ffff8880118400e0 (&type->s_umount_key#50){.+.+}-{4:4}, at: super_trylock_shared fs/super.c:562 [inline]
     #1: ffff8880118400e0 (&type->s_umount_key#50){.+.+}-{4:4}, at: super_cache_scan+0x91/0x4b0 fs/super.c:197
     #2: ffff888011840610 (sb_internal#2){.+.+}-{0:0}, at: f2fs_evict_inode+0x8d9/0x1b60 fs/f2fs/inode.c:890
    
    stack backtrace:
    CPU: 0 UID: 0 PID: 73 Comm: kswapd0 Not tainted syzkaller #0 PREEMPT(full)
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
    Call Trace:
     <TASK>
     dump_stack_lvl+0x189/0x250 lib/dump_stack.c:120
     print_circular_bug+0x2ee/0x310 kernel/locking/lockdep.c:2043
     check_noncircular+0x134/0x160 kernel/locking/lockdep.c:2175
     check_prev_add kernel/locking/lockdep.c:3165 [inline]
     check_prevs_add kernel/locking/lockdep.c:3284 [inline]
     validate_chain+0xb9b/0x2140 kernel/locking/lockdep.c:3908
     __lock_acquire+0xab9/0xd20 kernel/locking/lockdep.c:5237
     lock_acquire+0x120/0x360 kernel/locking/lockdep.c:5868
     down_read+0x46/0x2e0 kernel/locking/rwsem.c:1537
     f2fs_down_read fs/f2fs/f2fs.h:2278 [inline]
     f2fs_lock_op fs/f2fs/f2fs.h:2357 [inline]
     f2fs_do_truncate_blocks+0x21c/0x10c0 fs/f2fs/file.c:791
     f2fs_truncate_blocks+0x10a/0x300 fs/f2fs/file.c:867
     f2fs_truncate+0x489/0x7c0 fs/f2fs/file.c:925
     f2fs_evict_inode+0x9f2/0x1b60 fs/f2fs/inode.c:897
     evict+0x504/0x9c0 fs/inode.c:810
     f2fs_evict_inode+0x1dc/0x1b60 fs/f2fs/inode.c:853
     evict+0x504/0x9c0 fs/inode.c:810
     dispose_list fs/inode.c:852 [inline]
     prune_icache_sb+0x21b/0x2c0 fs/inode.c:1000
     super_cache_scan+0x39b/0x4b0 fs/super.c:224
     do_shrink_slab+0x6ef/0x1110 mm/shrinker.c:437
     shrink_slab_memcg mm/shrinker.c:550 [inline]
     shrink_slab+0x7ef/0x10d0 mm/shrinker.c:628
     shrink_one+0x28a/0x7c0 mm/vmscan.c:4955
     shrink_many mm/vmscan.c:5016 [inline]
     lru_gen_shrink_node mm/vmscan.c:5094 [inline]
     shrink_node+0x315d/0x3780 mm/vmscan.c:6081
     kswapd_shrink_node mm/vmscan.c:6941 [inline]
     balance_pgdat mm/vmscan.c:7124 [inline]
     kswapd+0x147c/0x2800 mm/vmscan.c:7389
     kthread+0x70e/0x8a0 kernel/kthread.c:463
     ret_from_fork+0x4bc/0x870 arch/x86/kernel/process.c:158
     ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:245
     </TASK>
    
    The root cause is deadlock among four locks as below:
    
    kswapd
    - fs_reclaim                            --- Lock A
     - shrink_one
      - evict
       - f2fs_evict_inode
        - sb_start_intwrite                 --- Lock B
    
    - iput
     - evict
      - f2fs_evict_inode
       - sb_start_intwrite                  --- Lock B
       - f2fs_truncate
        - f2fs_truncate_blocks
         - f2fs_do_truncate_blocks
          - f2fs_lock_op                    --- Lock C
    
    ioctl
    - f2fs_ioc_commit_atomic_write
     - f2fs_lock_op                         --- Lock C
      - __f2fs_commit_atomic_write
       - __replace_atomic_write_block
        - f2fs_get_dnode_of_data
         - __get_node_folio
          - f2fs_check_nid_range
           - f2fs_handle_error
            - f2fs_record_errors
             - f2fs_down_write              --- Lock D
    
    open
    - do_open
     - do_truncate
      - security_inode_need_killpriv
       - f2fs_getxattr
        - lookup_all_xattrs
         - f2fs_handle_error
          - f2fs_record_errors
           - f2fs_down_write                --- Lock D
            - f2fs_commit_super
             - read_mapping_folio
              - filemap_alloc_folio_noprof
               - prepare_alloc_pages
                - fs_reclaim_acquire        --- Lock A
    
    In order to avoid such deadlock, we need to avoid grabbing sb_lock in
    f2fs_handle_error(), so, let's use asynchronous method instead:
    - remove f2fs_handle_error() implementation
    - rename f2fs_handle_error_async() to f2fs_handle_error()
    - spread f2fs_handle_error()
    
    Fixes: 95fa90c9e5a7 ("f2fs: support recording errors into superblock")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/linux-f2fs-devel/[email protected]
    Reported-by: Jiaming Zhang <[email protected]>
    Closes: https://lore.kernel.org/lkml/CANypQFa-Gy9sD-N35o3PC+FystOWkNuN8pv6S75HLT0ga-Tzgw@mail.gmail.com
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to avoid updating compression context during writeback [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 11:15:27 2025 -0500

    f2fs: fix to avoid updating compression context during writeback
    
    [ Upstream commit 10b591e7fb7cdc8c1e53e9c000dc0ef7069aaa76 ]
    
    Bai, Shuangpeng <[email protected]> reported a bug as below:
    
    Oops: divide error: 0000 [#1] SMP KASAN PTI
    CPU: 0 UID: 0 PID: 11441 Comm: syz.0.46 Not tainted 6.17.0 #1 PREEMPT(full)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.15.0-1 04/01/2014
    RIP: 0010:f2fs_all_cluster_page_ready+0x106/0x550 fs/f2fs/compress.c:857
    Call Trace:
     <TASK>
     f2fs_write_cache_pages fs/f2fs/data.c:3078 [inline]
     __f2fs_write_data_pages fs/f2fs/data.c:3290 [inline]
     f2fs_write_data_pages+0x1c19/0x3600 fs/f2fs/data.c:3317
     do_writepages+0x38e/0x640 mm/page-writeback.c:2634
     filemap_fdatawrite_wbc mm/filemap.c:386 [inline]
     __filemap_fdatawrite_range mm/filemap.c:419 [inline]
     file_write_and_wait_range+0x2ba/0x3e0 mm/filemap.c:794
     f2fs_do_sync_file+0x6e6/0x1b00 fs/f2fs/file.c:294
     generic_write_sync include/linux/fs.h:3043 [inline]
     f2fs_file_write_iter+0x76e/0x2700 fs/f2fs/file.c:5259
     new_sync_write fs/read_write.c:593 [inline]
     vfs_write+0x7e9/0xe00 fs/read_write.c:686
     ksys_write+0x19d/0x2d0 fs/read_write.c:738
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xf7/0x470 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    
    The bug was triggered w/ below race condition:
    
    fsync                           setattr                 ioctl
    - f2fs_do_sync_file
     - file_write_and_wait_range
      - f2fs_write_cache_pages
      : inode is non-compressed
      : cc.cluster_size =
        F2FS_I(inode)->i_cluster_size = 0
       - tag_pages_for_writeback
                                    - f2fs_setattr
                                     - truncate_setsize
                                     - f2fs_truncate
                                                            - f2fs_fileattr_set
                                                             - f2fs_setflags_common
                                                              - set_compress_context
                                                              : F2FS_I(inode)->i_cluster_size = 4
                                                              : set_inode_flag(inode, FI_COMPRESSED_FILE)
       - f2fs_compressed_file
       : return true
       - f2fs_all_cluster_page_ready
       : "pgidx % cc->cluster_size" trigger dividing 0 issue
    
    Let's change as below to fix this issue:
    - introduce a new atomic type variable .writeback in structure f2fs_inode_info
    to track the number of threads which calling f2fs_write_cache_pages().
    - use .i_sem lock to protect .writeback update.
    - check .writeback before update compression context in f2fs_setflags_common()
    to avoid race w/ ->writepages.
    
    Fixes: 4c8ff7095bef ("f2fs: support data compression")
    Cc: [email protected]
    Reported-by: Bai, Shuangpeng <[email protected]>
    Tested-by: Bai, Shuangpeng <[email protected]>
    Closes: https://lore.kernel.org/lkml/[email protected]
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to avoid updating zero-sized extent in extent cache [+ + +]

Author: Chao Yu <[email protected]>
Date:   Mon Oct 20 10:42:12 2025 +0800

    f2fs: fix to avoid updating zero-sized extent in extent cache
    
    commit 7c37c79510329cd951a4dedf3f7bf7e2b18dccec upstream.
    
    As syzbot reported:
    
    F2FS-fs (loop0): __update_extent_tree_range: extent len is zero, type: 0, extent [0, 0, 0], age [0, 0]
    ------------[ cut here ]------------
    kernel BUG at fs/f2fs/extent_cache.c:678!
    Oops: invalid opcode: 0000 [#1] SMP KASAN NOPTI
    CPU: 0 UID: 0 PID: 5336 Comm: syz.0.0 Not tainted syzkaller #0 PREEMPT(full)
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
    RIP: 0010:__update_extent_tree_range+0x13bc/0x1500 fs/f2fs/extent_cache.c:678
    Call Trace:
     <TASK>
     f2fs_update_read_extent_cache_range+0x192/0x3e0 fs/f2fs/extent_cache.c:1085
     f2fs_do_zero_range fs/f2fs/file.c:1657 [inline]
     f2fs_zero_range+0x10c1/0x1580 fs/f2fs/file.c:1737
     f2fs_fallocate+0x583/0x990 fs/f2fs/file.c:2030
     vfs_fallocate+0x669/0x7e0 fs/open.c:342
     ioctl_preallocate fs/ioctl.c:289 [inline]
     file_ioctl+0x611/0x780 fs/ioctl.c:-1
     do_vfs_ioctl+0xb33/0x1430 fs/ioctl.c:576
     __do_sys_ioctl fs/ioctl.c:595 [inline]
     __se_sys_ioctl+0x82/0x170 fs/ioctl.c:583
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xfa/0x3b0 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    RIP: 0033:0x7f07bc58eec9
    
    In error path of f2fs_zero_range(), it may add a zero-sized extent
    into extent cache, it should be avoided.
    
    Fixes: 6e9619499f53 ("f2fs: support in batch fzero in dnode page")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/linux-f2fs-devel/[email protected]
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to detect recoverable inode during dryrun of find_fsync_dnodes() [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 13:32:19 2025 -0500

    f2fs: fix to detect recoverable inode during dryrun of find_fsync_dnodes()
    
    [ Upstream commit 68d05693f8c031257a0822464366e1c2a239a512 ]
    
    mkfs.f2fs -f /dev/vdd
    mount /dev/vdd /mnt/f2fs
    touch /mnt/f2fs/foo
    sync            # avoid CP_UMOUNT_FLAG in last f2fs_checkpoint.ckpt_flags
    touch /mnt/f2fs/bar
    f2fs_io fsync /mnt/f2fs/bar
    f2fs_io shutdown 2 /mnt/f2fs
    umount /mnt/f2fs
    blockdev --setro /dev/vdd
    mount /dev/vdd /mnt/f2fs
    mount: /mnt/f2fs: WARNING: source write-protected, mounted read-only.
    
    For the case if we create and fsync a new inode before sudden power-cut,
    without norecovery or disable_roll_forward mount option, the following
    mount will succeed w/o recovering last fsynced inode.
    
    The problem here is that we only check inode_list list after
    find_fsync_dnodes() in f2fs_recover_fsync_data() to find out whether
    there is recoverable data in the iamge, but there is a missed case, if
    last fsynced inode is not existing in last checkpoint, then, we will
    fail to get its inode due to nat of inode node is not existing in last
    checkpoint, so the inode won't be linked in inode_list.
    
    Let's detect such case in dyrun mode to fix this issue.
    
    After this change, mount will fail as expected below:
    mount: /mnt/f2fs: cannot mount /dev/vdd read-only.
           dmesg(1) may have more information after failed mount system call.
    demsg:
    F2FS-fs (vdd): Need to recover fsync data, but write access unavailable, please try mount w/ disable_roll_forward or norecovery
    
    Cc: [email protected]
    Fixes: 6781eabba1bd ("f2fs: give -EINVAL for norecovery and rw mount")
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    [ folio => page ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix to propagate error from f2fs_enable_checkpoint() [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 11:22:54 2025 -0500

    f2fs: fix to propagate error from f2fs_enable_checkpoint()
    
    [ Upstream commit be112e7449a6e1b54aa9feac618825d154b3a5c7 ]
    
    In order to let userspace detect such error rather than suffering
    silent failure.
    
    Fixes: 4354994f097d ("f2fs: checkpoint disabling")
    Cc: [email protected]
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: fix uninitialized one_time_gc in victim_sel_policy [+ + +]

Author: Xiaole He <[email protected]>
Date:   Wed Oct 29 13:18:07 2025 +0800

    f2fs: fix uninitialized one_time_gc in victim_sel_policy
    
    commit 392711ef18bff524a873b9c239a73148c5432262 upstream.
    
    The one_time_gc field in struct victim_sel_policy is conditionally
    initialized but unconditionally read, leading to undefined behavior
    that triggers UBSAN warnings.
    
    In f2fs_get_victim() at fs/f2fs/gc.c:774, the victim_sel_policy
    structure is declared without initialization:
    
        struct victim_sel_policy p;
    
    The field p.one_time_gc is only assigned when the 'one_time' parameter
    is true (line 789):
    
        if (one_time) {
            p.one_time_gc = one_time;
            ...
        }
    
    However, this field is unconditionally read in subsequent get_gc_cost()
    at line 395:
    
        if (p->one_time_gc && (valid_thresh_ratio < 100) && ...)
    
    When one_time is false, p.one_time_gc contains uninitialized stack
    memory. Hence p.one_time_gc is an invalid bool value.
    
    UBSAN detects this invalid bool value:
    
        UBSAN: invalid-load in fs/f2fs/gc.c:395:7
        load of value 77 is not a valid value for type '_Bool'
        CPU: 3 UID: 0 PID: 1297 Comm: f2fs_gc-252:16 Not tainted 6.18.0-rc3
        #5 PREEMPT(voluntary)
        Hardware name: OpenStack Foundation OpenStack Nova,
        BIOS 1.13.0-1ubuntu1.1 04/01/2014
        Call Trace:
         <TASK>
         dump_stack_lvl+0x70/0x90
         dump_stack+0x14/0x20
         __ubsan_handle_load_invalid_value+0xb3/0xf0
         ? dl_server_update+0x2e/0x40
         ? update_curr+0x147/0x170
         f2fs_get_victim.cold+0x66/0x134 [f2fs]
         ? sched_balance_newidle+0x2ca/0x470
         ? finish_task_switch.isra.0+0x8d/0x2a0
         f2fs_gc+0x2ba/0x8e0 [f2fs]
         ? _raw_spin_unlock_irqrestore+0x12/0x40
         ? __timer_delete_sync+0x80/0xe0
         ? timer_delete_sync+0x14/0x20
         ? schedule_timeout+0x82/0x100
         gc_thread_func+0x38b/0x860 [f2fs]
         ? gc_thread_func+0x38b/0x860 [f2fs]
         ? __pfx_autoremove_wake_function+0x10/0x10
         kthread+0x10b/0x220
         ? __pfx_gc_thread_func+0x10/0x10 [f2fs]
         ? _raw_spin_unlock_irq+0x12/0x40
         ? __pfx_kthread+0x10/0x10
         ret_from_fork+0x11a/0x160
         ? __pfx_kthread+0x10/0x10
         ret_from_fork_asm+0x1a/0x30
         </TASK>
    
    This issue is reliably reproducible with the following steps on a
    100GB SSD /dev/vdb:
    
        mkfs.f2fs -f /dev/vdb
        mount /dev/vdb /mnt/f2fs_test
        fio --name=gc --directory=/mnt/f2fs_test --rw=randwrite \
            --bs=4k --size=8G --numjobs=12 --fsync=4 --runtime=10 \
            --time_based
        echo 1 > /sys/fs/f2fs/vdb/gc_urgent
    
    The uninitialized value causes incorrect GC victim selection, leading
    to unpredictable garbage collection behavior.
    
    Fix by zero-initializing the entire victim_sel_policy structure to
    ensure all fields have defined values.
    
    Fixes: e791d00bd06c ("f2fs: add valid block ratio not to do excessive GC for one time GC")
    Cc: [email protected]
    Signed-off-by: Xiaole He <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: invalidate dentry cache on failed whiteout creation [+ + +]

Author: Deepanshu Kartikey <[email protected]>
Date:   Mon Oct 27 18:36:34 2025 +0530

    f2fs: invalidate dentry cache on failed whiteout creation
    
    commit d33f89b34aa313f50f9a512d58dd288999f246b0 upstream.
    
    F2FS can mount filesystems with corrupted directory depth values that
    get runtime-clamped to MAX_DIR_HASH_DEPTH. When RENAME_WHITEOUT
    operations are performed on such directories, f2fs_rename performs
    directory modifications (updating target entry and deleting source
    entry) before attempting to add the whiteout entry via f2fs_add_link.
    
    If f2fs_add_link fails due to the corrupted directory structure, the
    function returns an error to VFS, but the partial directory
    modifications have already been committed to disk. VFS assumes the
    entire rename operation failed and does not update the dentry cache,
    leaving stale mappings.
    
    In the error path, VFS does not call d_move() to update the dentry
    cache. This results in new_dentry still pointing to the old inode
    (new_inode) which has already had its i_nlink decremented to zero.
    The stale cache causes subsequent operations to incorrectly reference
    the freed inode.
    
    This causes subsequent operations to use cached dentry information that
    no longer matches the on-disk state. When a second rename targets the
    same entry, VFS attempts to decrement i_nlink on the stale inode, which
    may already have i_nlink=0, triggering a WARNING in drop_nlink().
    
    Example sequence:
    1. First rename (RENAME_WHITEOUT): file2 → file1
       - f2fs updates file1 entry on disk (points to inode 8)
       - f2fs deletes file2 entry on disk
       - f2fs_add_link(whiteout) fails (corrupted directory)
       - Returns error to VFS
       - VFS does not call d_move() due to error
       - VFS cache still has: file1 → inode 7 (stale!)
       - inode 7 has i_nlink=0 (already decremented)
    
    2. Second rename: file3 → file1
       - VFS uses stale cache: file1 → inode 7
       - Tries to drop_nlink on inode 7 (i_nlink already 0)
       - WARNING in drop_nlink()
    
    Fix this by explicitly invalidating old_dentry and new_dentry when
    f2fs_add_link fails during whiteout creation. This forces VFS to
    refresh from disk on subsequent operations, ensuring cache consistency
    even when the rename partially succeeds.
    
    Reproducer:
    1. Mount F2FS image with corrupted i_current_depth
    2. renameat2(file2, file1, RENAME_WHITEOUT)
    3. renameat2(file3, file1, 0)
    4. System triggers WARNING in drop_nlink()
    
    Fixes: 7e01e7ad746b ("f2fs: support RENAME_WHITEOUT")
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=632cf32276a9a564188d
    Suggested-by: Chao Yu <[email protected]>
    Link: https://lore.kernel.org/all/[email protected]/ [v1]
    Cc: [email protected]
    Signed-off-by: Deepanshu Kartikey <[email protected]>
    Reviewed-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

f2fs: use global inline_xattr_slab instead of per-sb slab cache [+ + +]

Author: Chao Yu <[email protected]>
Date:   Tue Dec 30 13:06:25 2025 -0500

    f2fs: use global inline_xattr_slab instead of per-sb slab cache
    
    [ Upstream commit 1f27ef42bb0b7c0740c5616ec577ec188b8a1d05 ]
    
    As Hong Yun reported in mailing list:
    
    loop7: detected capacity change from 0 to 131072
    ------------[ cut here ]------------
    kmem_cache of name 'f2fs_xattr_entry-7:7' already exists
    WARNING: CPU: 0 PID: 24426 at mm/slab_common.c:110 kmem_cache_sanity_check mm/slab_common.c:109 [inline]
    WARNING: CPU: 0 PID: 24426 at mm/slab_common.c:110 __kmem_cache_create_args+0xa6/0x320 mm/slab_common.c:307
    CPU: 0 UID: 0 PID: 24426 Comm: syz.7.1370 Not tainted 6.17.0-rc4 #1 PREEMPT(full)
    Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1ubuntu1.1 04/01/2014
    RIP: 0010:kmem_cache_sanity_check mm/slab_common.c:109 [inline]
    RIP: 0010:__kmem_cache_create_args+0xa6/0x320 mm/slab_common.c:307
    Call Trace:
     __kmem_cache_create include/linux/slab.h:353 [inline]
     f2fs_kmem_cache_create fs/f2fs/f2fs.h:2943 [inline]
     f2fs_init_xattr_caches+0xa5/0xe0 fs/f2fs/xattr.c:843
     f2fs_fill_super+0x1645/0x2620 fs/f2fs/super.c:4918
     get_tree_bdev_flags+0x1fb/0x260 fs/super.c:1692
     vfs_get_tree+0x43/0x140 fs/super.c:1815
     do_new_mount+0x201/0x550 fs/namespace.c:3808
     do_mount fs/namespace.c:4136 [inline]
     __do_sys_mount fs/namespace.c:4347 [inline]
     __se_sys_mount+0x298/0x2f0 fs/namespace.c:4324
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0x8e/0x3a0 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    The bug can be reproduced w/ below scripts:
    - mount /dev/vdb /mnt1
    - mount /dev/vdc /mnt2
    - umount /mnt1
    - mounnt /dev/vdb /mnt1
    
    The reason is if we created two slab caches, named f2fs_xattr_entry-7:3
    and f2fs_xattr_entry-7:7, and they have the same slab size. Actually,
    slab system will only create one slab cache core structure which has
    slab name of "f2fs_xattr_entry-7:3", and two slab caches share the same
    structure and cache address.
    
    So, if we destroy f2fs_xattr_entry-7:3 cache w/ cache address, it will
    decrease reference count of slab cache, rather than release slab cache
    entirely, since there is one more user has referenced the cache.
    
    Then, if we try to create slab cache w/ name "f2fs_xattr_entry-7:3" again,
    slab system will find that there is existed cache which has the same name
    and trigger the warning.
    
    Let's changes to use global inline_xattr_slab instead of per-sb slab cache
    for fixing.
    
    Fixes: a999150f4fe3 ("f2fs: use kmem_cache pool during inline xattr lookups")
    Cc: [email protected]
    Reported-by: Hong Yun <[email protected]>
    Tested-by: Hong Yun <[email protected]>
    Signed-off-by: Chao Yu <[email protected]>
    Signed-off-by: Jaegeuk Kim <[email protected]>
    [ folio => page ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fbdev: gbefb: fix to use physical address instead of dma address [+ + +]

Author: Rene Rebe <[email protected]>
Date:   Fri Nov 14 16:00:42 2025 +0100

    fbdev: gbefb: fix to use physical address instead of dma address
    
    commit e3f44742bbb10537fe53d83d20dea2a7c167674d upstream.
    
    While debuggigng why X would not start on mips64 Sgi/O2 I found the
    phys adress being off. Turns out the gbefb passed the internal
    dma_addr as phys. May be broken pre git history. Fix by converting
    dma_to_phys.
    
    Signed-off-by: René Rebe <[email protected]>
    Cc: <[email protected]> # v4.0+
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fbdev: pxafb: Fix multiple clamped values in pxafb_adjust_timing [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Tue Dec 2 19:15:32 2025 +0100

    fbdev: pxafb: Fix multiple clamped values in pxafb_adjust_timing
    
    commit 0155e868cbc111846cc2809c1546ea53810a56ae upstream.
    
    The variables were never clamped because the return value of clamp_val()
    was not used. Fix this by assigning the clamped values, and use clamp()
    instead of clamp_val().
    
    Cc: [email protected]
    Fixes: 3f16ff608a75 ("[ARM] pxafb: cleanup of the timing checking code")
    Signed-off-by: Thorsten Blum <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fbdev: tcx.c fix mem_map to correct smem_start offset [+ + +]

Author: René Rebe <[email protected]>
Date:   Thu Nov 20 14:24:00 2025 +0100

    fbdev: tcx.c fix mem_map to correct smem_start offset
    
    commit 35fa2b4bf96415b88d7edaa5cf8af5185d9ce76e upstream.
    
    403ae52ac047 ("sparc: fix drivers/video/tcx.c warning") changed the
    physbase initializing breaking the user-space mmap, e.g. for Xorg
    entirely.
    
    Fix fbdev mmap table so the sbus mmap helper work correctly, and
    not try to map vastly (physbase) offset memory.
    
    Fixes: 403ae52ac047 ("sparc: fix drivers/video/tcx.c warning")
    Cc: <[email protected]>
    Signed-off-by: René Rebe <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fgraph: Check ftrace_pids_enabled on registration for early filtering [+ + +]

Author: Shengming Hu <[email protected]>
Date:   Wed Nov 26 17:33:31 2025 +0800

    fgraph: Check ftrace_pids_enabled on registration for early filtering
    
    commit 1650a1b6cb1ae6cb99bb4fce21b30ebdf9fc238e upstream.
    
    When registering ftrace_graph, check if ftrace_pids_enabled is active.
    If enabled, assign entryfunc to fgraph_pid_func to ensure filtering
    is performed before executing the saved original entry function.
    
    Cc: [email protected]
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: df3ec5da6a1e7 ("function_graph: Add pid tracing back to function graph tracer")
    Signed-off-by: Shengming Hu <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fgraph: Initialize ftrace_ops->private for function graph ops [+ + +]

Author: Shengming Hu <[email protected]>
Date:   Wed Nov 26 17:29:26 2025 +0800

    fgraph: Initialize ftrace_ops->private for function graph ops
    
    commit b5d6d3f73d0bac4a7e3a061372f6da166fc6ee5c upstream.
    
    The ftrace_pids_enabled(op) check relies on op->private being properly
    initialized, but fgraph_ops's underlying ftrace_ops->private was left
    uninitialized. This caused ftrace_pids_enabled() to always return false,
    effectively disabling PID filtering for function graph tracing.
    
    Fix this by copying src_ops->private to dst_ops->private in
    fgraph_init_ops(), ensuring PID filter state is correctly propagated.
    
    Cc: [email protected]
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Cc: <[email protected]>
    Fixes: c132be2c4fcc1 ("function_graph: Have the instances use their own ftrace_ops for filtering")
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Shengming Hu <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

firewire: nosy: Fix dma_free_coherent() size [+ + +]

Author: Thomas Fourier <[email protected]>
Date:   Tue Dec 16 17:54:18 2025 +0100

    firewire: nosy: Fix dma_free_coherent() size
    
    [ Upstream commit c48c0fd0e19684b6ecdb4108a429e3a4e73f5e21 ]
    
    It looks like the buffer allocated and mapped in add_card() is done
    with size RCV_BUFFER_SIZE which is 16 KB and 4KB.
    
    Fixes: 286468210d83 ("firewire: new driver: nosy - IEEE 1394 traffic sniffer")
    Co-developed-by: Thomas Fourier <[email protected]>
    Signed-off-by: Thomas Fourier <[email protected]>
    Co-developed-by: Christophe JAILLET <[email protected]>
    Signed-off-by: Christophe JAILLET <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Takashi Sakamoto <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

firmware: imx: scu-irq: Init workqueue before request mbox channel [+ + +]

Author: Peng Fan <[email protected]>
Date:   Fri Oct 17 09:56:26 2025 +0800

    firmware: imx: scu-irq: Init workqueue before request mbox channel
    
    [ Upstream commit 81fb53feb66a3aefbf6fcab73bb8d06f5b0c54ad ]
    
    With mailbox channel requested, there is possibility that interrupts may
    come in, so need to make sure the workqueue is initialized before
    the queue is scheduled by mailbox rx callback.
    
    Reviewed-by: Frank Li <[email protected]>
    Signed-off-by: Peng Fan <[email protected]>
    Signed-off-by: Shawn Guo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

firmware: stratix10-svc: Add mutex in stratix10 memory management [+ + +]

Author: Mahesh Rao <[email protected]>
Date:   Mon Oct 27 22:54:40 2025 +0800

    firmware: stratix10-svc: Add mutex in stratix10 memory management
    
    commit 85f96cbbbc67b59652b2c1ec394b8ddc0ddf1b0b upstream.
    
    Add mutex lock to stratix10_svc_allocate_memory and
    stratix10_svc_free_memory for thread safety. This prevents race
    conditions and ensures proper synchronization during memory operations.
    This is required for parallel communication with the Stratix10 service
    channel.
    
    Fixes: 7ca5ce896524f ("firmware: add Intel Stratix10 service layer driver")
    Cc: [email protected]
    Signed-off-by: Mahesh Rao <[email protected]>
    Reviewed-by: Matthew Gerlach <[email protected]>
    Signed-off-by: Dinh Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fjes: Add missing iounmap in fjes_hw_init() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Thu Dec 11 15:37:56 2025 +0800

    fjes: Add missing iounmap in fjes_hw_init()
    
    commit 15ef641a0c6728d25a400df73922e80ab2cf029c upstream.
    
    In error paths, add fjes_hw_iounmap() to release the
    resource acquired by fjes_hw_iomap(). Add a goto label
    to do so.
    
    Fixes: 8cdc3f6c5d22 ("fjes: Hardware initialization routine")
    Cc: [email protected]
    Signed-off-by: Haoxiang Li <[email protected]>
    Signed-off-by: Simon Horman <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

floppy: fix for PAGE_SIZE != 4KB [+ + +]

Author: Rene Rebe <[email protected]>
Date:   Fri Nov 14 14:41:27 2025 +0100

    floppy: fix for PAGE_SIZE != 4KB
    
    commit 82d20481024cbae2ea87fe8b86d12961bfda7169 upstream.
    
    For years I wondered why the floppy driver does not just work on
    sparc64, e.g:
    
    root@SUNW_375_0066:# disktype /dev/fd0
    disktype: Can't open /dev/fd0: No such device or address
    
    [  525.341906] disktype: attempt to access beyond end of device
    fd0: rw=0, sector=0, nr_sectors = 16 limit=8
    [  525.341991] floppy: error 10 while reading block 0
    
    Turns out floppy.c __floppy_read_block_0 tries to read one page for
    the first test read to determine the disk size and thus fails if that
    is greater than 4k. Adjust minimum MAX_DISK_SIZE to PAGE_SIZE to fix
    floppy on sparc64 and likely all other PAGE_SIZE != 4KB configs.
    
    Cc: [email protected]
    Signed-off-by: René Rebe <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fs/ntfs3: check for shutdown in fsync [+ + +]

Author: Konstantin Komarov <[email protected]>
Date:   Thu Nov 6 16:17:19 2025 +0300

    fs/ntfs3: check for shutdown in fsync
    
    [ Upstream commit 1b2ae190ea43bebb8c73d21f076addc8a8c71849 ]
    
    Ensure fsync() returns -EIO when the ntfs3 filesystem is in forced
    shutdown, instead of silently succeeding via generic_file_fsync().
    
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fs/ntfs3: fix mount failure for sparse runs in run_unpack() [+ + +]

Author: Konstantin Komarov <[email protected]>
Date:   Thu Sep 18 13:35:24 2025 +0300

    fs/ntfs3: fix mount failure for sparse runs in run_unpack()
    
    commit 801f614ba263cb37624982b27b4c82f3c3c597a9 upstream.
    
    Some NTFS volumes failed to mount because sparse data runs were not
    handled correctly during runlist unpacking. The code performed arithmetic
    on the special SPARSE_LCN64 marker, leading to invalid LCN values and
    mount errors.
    
    Add an explicit check for the case described above, marking the run as
    sparse without applying arithmetic.
    
    Fixes: 736fc7bf5f68 ("fs: ntfs3: Fix integer overflow in run_unpack()")
    Cc: [email protected]
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fs/ntfs3: Support timestamps prior to epoch [+ + +]

Author: Konstantin Komarov <[email protected]>
Date:   Mon Sep 1 11:48:48 2025 +0300

    fs/ntfs3: Support timestamps prior to epoch
    
    [ Upstream commit 5180138604323895b5c291eca6aa7c20be494ade ]
    
    Before it used an unsigned 64-bit type, which prevented proper handling
    of timestamps earlier than 1970-01-01. Switch to a signed 64-bit type to
    support pre-epoch timestamps. The issue was caught by xfstests.
    
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fsnotify: do not generate ACCESS/MODIFY events on child for special files [+ + +]

Author: Amir Goldstein <[email protected]>
Date:   Sun Dec 7 11:44:55 2025 +0100

    fsnotify: do not generate ACCESS/MODIFY events on child for special files
    
    commit 635bc4def026a24e071436f4f356ea08c0eed6ff upstream.
    
    inotify/fanotify do not allow users with no read access to a file to
    subscribe to events (e.g. IN_ACCESS/IN_MODIFY), but they do allow the
    same user to subscribe for watching events on children when the user
    has access to the parent directory (e.g. /dev).
    
    Users with no read access to a file but with read access to its parent
    directory can still stat the file and see if it was accessed/modified
    via atime/mtime change.
    
    The same is not true for special files (e.g. /dev/null). Users will not
    generally observe atime/mtime changes when other users read/write to
    special files, only when someone sets atime/mtime via utimensat().
    
    Align fsnotify events with this stat behavior and do not generate
    ACCESS/MODIFY events to parent watchers on read/write of special files.
    The events are still generated to parent watchers on utimensat(). This
    closes some side-channels that could be possibly used for information
    exfiltration [1].
    
    [1] https://snee.la/pdf/pubs/file-notification-attacks.pdf
    
    Reported-by: Sudheendra Raghav Neela <[email protected]>
    CC: [email protected]
    Signed-off-by: Amir Goldstein <[email protected]>
    Signed-off-by: Jan Kara <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fuse: Always flush the page cache before FOPEN_DIRECT_IO write [+ + +]

Author: Bernd Schubert <[email protected]>
Date:   Thu Oct 23 00:21:18 2025 +0200

    fuse: Always flush the page cache before FOPEN_DIRECT_IO write
    
    [ Upstream commit 1ce120dcefc056ce8af2486cebbb77a458aad4c3 ]
    
    This was done as condition on direct_io_allow_mmap, but I believe
    this is not right, as a file might be open two times - once with
    write-back enabled another time with FOPEN_DIRECT_IO.
    
    Signed-off-by: Bernd Schubert <[email protected]>
    Signed-off-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fuse: fix readahead reclaim deadlock [+ + +]

Author: Joanne Koong <[email protected]>
Date:   Fri Oct 10 15:07:38 2025 -0700

    fuse: fix readahead reclaim deadlock
    
    commit bd5603eaae0aabf527bfb3ce1bb07e979ce5bd50 upstream.
    
    Commit e26ee4efbc79 ("fuse: allocate ff->release_args only if release is
    needed") skips allocating ff->release_args if the server does not
    implement open. However in doing so, fuse_prepare_release() now skips
    grabbing the reference on the inode, which makes it possible for an
    inode to be evicted from the dcache while there are inflight readahead
    requests. This causes a deadlock if the server triggers reclaim while
    servicing the readahead request and reclaim attempts to evict the inode
    of the file being read ahead. Since the folio is locked during
    readahead, when reclaim evicts the fuse inode and fuse_evict_inode()
    attempts to remove all folios associated with the inode from the page
    cache (truncate_inode_pages_range()), reclaim will block forever waiting
    for the lock since readahead cannot relinquish the lock because it is
    itself blocked in reclaim:
    
    >>> stack_trace(1504735)
     folio_wait_bit_common (mm/filemap.c:1308:4)
     folio_lock (./include/linux/pagemap.h:1052:3)
     truncate_inode_pages_range (mm/truncate.c:336:10)
     fuse_evict_inode (fs/fuse/inode.c:161:2)
     evict (fs/inode.c:704:3)
     dentry_unlink_inode (fs/dcache.c:412:3)
     __dentry_kill (fs/dcache.c:615:3)
     shrink_kill (fs/dcache.c:1060:12)
     shrink_dentry_list (fs/dcache.c:1087:3)
     prune_dcache_sb (fs/dcache.c:1168:2)
     super_cache_scan (fs/super.c:221:10)
     do_shrink_slab (mm/shrinker.c:435:9)
     shrink_slab (mm/shrinker.c:626:10)
     shrink_node (mm/vmscan.c:5951:2)
     shrink_zones (mm/vmscan.c:6195:3)
     do_try_to_free_pages (mm/vmscan.c:6257:3)
     do_swap_page (mm/memory.c:4136:11)
     handle_pte_fault (mm/memory.c:5562:10)
     handle_mm_fault (mm/memory.c:5870:9)
     do_user_addr_fault (arch/x86/mm/fault.c:1338:10)
     handle_page_fault (arch/x86/mm/fault.c:1481:3)
     exc_page_fault (arch/x86/mm/fault.c:1539:2)
     asm_exc_page_fault+0x22/0x27
    
    Fix this deadlock by allocating ff->release_args and grabbing the
    reference on the inode when preparing the file for release even if the
    server does not implement open. The inode reference will be dropped when
    the last reference on the fuse file is dropped (see fuse_file_put() ->
    fuse_release_end()).
    
    Fixes: e26ee4efbc79 ("fuse: allocate ff->release_args only if release is needed")
    Cc: [email protected]
    Signed-off-by: Joanne Koong <[email protected]>
    Reported-by: Omar Sandoval <[email protected]>
    Signed-off-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fuse: Invalidate the page cache after FOPEN_DIRECT_IO write [+ + +]

Author: Bernd Schubert <[email protected]>
Date:   Thu Oct 23 00:21:17 2025 +0200

    fuse: Invalidate the page cache after FOPEN_DIRECT_IO write
    
    [ Upstream commit b359af8275a982a458e8df6c6beab1415be1f795 ]
    
    generic_file_direct_write() also does this and has a large
    comment about.
    
    Reproducer here is xfstest's generic/209, which is exactly to
    have competing DIO write and cached IO read.
    
    Signed-off-by: Bernd Schubert <[email protected]>
    Signed-off-by: Miklos Szeredi <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

genalloc.h: fix htmldocs warning [+ + +]

Author: Andrew Morton <[email protected]>
Date:   Thu Nov 27 10:39:24 2025 -0800

    genalloc.h: fix htmldocs warning
    
    [ Upstream commit 5393802c94e0ab1295c04c94c57bcb00222d4674 ]
    
    WARNING: include/linux/genalloc.h:52 function parameter 'start_addr' not described in 'genpool_algo_t'
    
    Fixes: 52fbf1134d47 ("lib/genalloc.c: fix allocation of aligned buffer from non-aligned chunk")
    Reported-by: Stephen Rothwell <[email protected]>
    Closes: https://lkml.kernel.org/r/[email protected]
    Acked-by: Randy Dunlap <[email protected]>
    Tested-by: Randy Dunlap <[email protected]>
    Cc: Alexey Skidanov <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

gfs2: Fix "gfs2: Switch to wait_event in gfs2_quotad" [+ + +]

Author: Andreas Gruenbacher <[email protected]>
Date:   Wed Nov 26 23:27:14 2025 +0000

    gfs2: Fix "gfs2: Switch to wait_event in gfs2_quotad"
    
    [ Upstream commit dff1fb6d8b7abe5b1119fa060f5d6b3370bf10ac ]
    
    Commit e4a8b5481c59a ("gfs2: Switch to wait_event in gfs2_quotad") broke
    cyclic statfs syncing, so the numbers reported by "df" could easily get
    completely out of sync with reality.  Fix this by reverting part of
    commit e4a8b5481c59a for now.
    
    A follow-up commit will clean this code up later.
    
    Signed-off-by: Andreas Gruenbacher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

gfs2: fix freeze error handling [+ + +]

Author: Alexey Velichayshiy <[email protected]>
Date:   Mon Dec 29 17:33:32 2025 -0500

    gfs2: fix freeze error handling
    
    [ Upstream commit 4cfc7d5a4a01d2133b278cdbb1371fba1b419174 ]
    
    After commit b77b4a4815a9 ("gfs2: Rework freeze / thaw logic"),
    the freeze error handling is broken because gfs2_do_thaw()
    overwrites the 'error' variable, causing incorrect processing
    of the original freeze error.
    
    Fix this by calling gfs2_do_thaw() when gfs2_lock_fs_check_clean()
    fails but ignoring its return value to preserve the original
    freeze error for proper reporting.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: b77b4a4815a9 ("gfs2: Rework freeze / thaw logic")
    Cc: [email protected] # v6.5+
    Signed-off-by: Alexey Velichayshiy <[email protected]>
    Signed-off-by: Andreas Gruenbacher <[email protected]>
    [ gfs2_do_thaw() only takes 2 params ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gfs2: fix remote evict for read-only filesystems [+ + +]

Author: Andreas Gruenbacher <[email protected]>
Date:   Wed Nov 19 12:14:24 2025 +0000

    gfs2: fix remote evict for read-only filesystems
    
    [ Upstream commit 64c10ed9274bc46416f502afea48b4ae11279669 ]
    
    When a node tries to delete an inode, it first requests exclusive access
    to the iopen glock.  This triggers demote requests on all remote nodes
    currently holding the iopen glock.  To satisfy those requests, the
    remote nodes evict the inode in question, or they poke the corresponding
    inode glock to signal that the inode is still in active use.
    
    This behavior doesn't depend on whether or not a filesystem is
    read-only, so remove the incorrect read-only check.
    
    Signed-off-by: Andreas Gruenbacher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

gfs2: Fix use of bio_chain [+ + +]

Author: Andreas Gruenbacher <[email protected]>
Date:   Sun Nov 30 21:19:52 2025 +0000

    gfs2: Fix use of bio_chain
    
    [ Upstream commit 8a157e0a0aa5143b5d94201508c0ca1bb8cfb941 ]
    
    In gfs2_chain_bio(), the call to bio_chain() has its arguments swapped.
    The result is leaked bios and incorrect synchronization (only the last
    bio will actually be waited for).  This code is only used during mount
    and filesystem thaw, so the bug normally won't be noticeable.
    
    Reported-by: Stephen Zhang <[email protected]>
    Signed-off-by: Andreas Gruenbacher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

gpio: regmap: Fix memleak in error path in gpio_regmap_register() [+ + +]

Author: Wentao Guan <[email protected]>
Date:   Thu Dec 4 18:13:04 2025 +0800

    gpio: regmap: Fix memleak in error path in gpio_regmap_register()
    
    commit 52721cfc78c76b09c66e092b52617006390ae96a upstream.
    
    Call gpiochip_remove() to free the resources allocated by
    gpiochip_add_data() in error path.
    
    Fixes: 553b75d4bfe9 ("gpio: regmap: Allow to allocate regmap-irq device")
    Fixes: ae495810cffe ("gpio: regmap: add the .fixed_direction_output configuration parameter")
    CC: [email protected]
    Co-developed-by: WangYuli <[email protected]>
    Signed-off-by: WangYuli <[email protected]>
    Signed-off-by: Wentao Guan <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [Bartosz: reworked the commit message]
    Signed-off-by: Bartosz Golaszewski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Add a quirk for Acer Nitro V15 [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Wed Dec 31 09:32:16 2025 -0500

    gpiolib: acpi: Add a quirk for Acer Nitro V15
    
    [ Upstream commit 9ab29ed505557bd106e292184fa4917955eb8e6e ]
    
    It is reported that on Acer Nitro V15 suspend only works properly if the
    keyboard backlight is turned off. In looking through the issue Acer Nitro
    V15 has a GPIO (#8) specified in _AEI but it has no matching notify device
    in _EVT. The values for GPIO #8 change as keyboard backlight is turned on
    and off.
    
    This makes it seem that GPIO #8 is actually supposed to be solely for
    keyboard backlight.  Turning off the interrupt for this GPIO fixes the issue.
    Add a quirk that does just that.
    
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4169
    Signed-off-by: Mario Limonciello <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Add acpi_gpio_need_run_edge_events_on_boot() getter [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Wed Dec 31 09:32:14 2025 -0500

    gpiolib: acpi: Add acpi_gpio_need_run_edge_events_on_boot() getter
    
    [ Upstream commit 5666a8777add09d1167de308df2147983486a0af ]
    
    Add acpi_gpio_need_run_edge_events_on_boot() getter which moves
    towards isolating the GPIO ACPI and quirk APIs. It will helps
    splitting them completely in the next changes.
    
    No functional changes.
    
    Reviewed-by: Hans de Goede <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Add quirk for ASUS ProArt PX13 [+ + +]

Author: Mario Limonciello (AMD) <[email protected]>
Date:   Wed Dec 31 09:32:17 2025 -0500

    gpiolib: acpi: Add quirk for ASUS ProArt PX13
    
    [ Upstream commit 23800ad1265f10c2bc6f42154ce4d20e59f2900e ]
    
    The ASUS ProArt PX13 has a spurious wakeup event from the touchpad
    a few moments after entering hardware sleep.  This can be avoided
    by preventing the touchpad from being a wake source.
    
    Add to the wakeup ignore list.
    
    Reported-by: Amit Chaudhari <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4482
    Tested-by: Amit Chaudhari <[email protected]>
    Signed-off-by: Mario Limonciello (AMD) <[email protected]>
    Reviewed-by: Mika Westerberg <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Linus Walleij <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Add quirk for Dell Precision 7780 [+ + +]

Author: Askar Safin <[email protected]>
Date:   Wed Dec 31 09:32:18 2025 -0500

    gpiolib: acpi: Add quirk for Dell Precision 7780
    
    [ Upstream commit 2d967310c49ed93ac11cef408a55ddf15c3dd52e ]
    
    Dell Precision 7780 often wakes up on its own from suspend. Sometimes
    wake up happens immediately (i. e. within 7 seconds), sometimes it happens
    after, say, 30 minutes.
    
    Fixes: 1796f808e4bb ("HID: i2c-hid: acpi: Stop setting wakeup_capable")
    Link: https://lore.kernel.org/linux-i2c/[email protected]/
    Cc: [email protected]
    Reviewed-by: Andy Shevchenko <[email protected]>
    Signed-off-by: Askar Safin <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bartosz Golaszewski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Handle deferred list via new API [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Wed Dec 31 09:32:13 2025 -0500

    gpiolib: acpi: Handle deferred list via new API
    
    [ Upstream commit a594877663d1e3d5cf57ec8af739582fc5c47cec ]
    
    Introduce a new API and handle deferred list via it which moves
    towards isolating the GPIO ACPI and quirk APIs. It will helps
    splitting them completely in the next changes.
    
    No functional changes.
    
    Reviewed-by: Hans de Goede <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Move quirks to a separate file [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Wed Dec 31 09:32:15 2025 -0500

    gpiolib: acpi: Move quirks to a separate file
    
    [ Upstream commit 92dc572852ddcae687590cb159189004d58e382e ]
    
    The gpiolib-acpi.c is huge enough even without DMI quirks.
    Move them to a separate file for a better maintenance.
    
    No functional change intended.
    
    Reviewed-by: Hans de Goede <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gpiolib: acpi: Switch to use enum in acpi_gpio_in_ignore_list() [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Wed Dec 31 09:32:12 2025 -0500

    gpiolib: acpi: Switch to use enum in acpi_gpio_in_ignore_list()
    
    [ Upstream commit b24fd5bc8e6d6b6006db65b5956c2c2cd0ee5a7b ]
    
    Switch to use enum instead of pointers in acpi_gpio_in_ignore_list()
    which moves towards isolating the GPIO ACPI and quirk APIs. It will
    helps splitting them completely in the next changes.
    
    No functional changes.
    
    Reviewed-by: Hans de Goede <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Stable-dep-of: 2d967310c49e ("gpiolib: acpi: Add quirk for Dell Precision 7780")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

gve: defer interrupt enabling until NAPI registration [+ + +]

Author: Ankit Garg <[email protected]>
Date:   Fri Dec 19 10:29:45 2025 +0000

    gve: defer interrupt enabling until NAPI registration
    
    commit 3d970eda003441f66551a91fda16478ac0711617 upstream.
    
    Currently, interrupts are automatically enabled immediately upon
    request. This allows interrupt to fire before the associated NAPI
    context is fully initialized and cause failures like below:
    
    [    0.946369] Call Trace:
    [    0.946369]  <IRQ>
    [    0.946369]  __napi_poll+0x2a/0x1e0
    [    0.946369]  net_rx_action+0x2f9/0x3f0
    [    0.946369]  handle_softirqs+0xd6/0x2c0
    [    0.946369]  ? handle_edge_irq+0xc1/0x1b0
    [    0.946369]  __irq_exit_rcu+0xc3/0xe0
    [    0.946369]  common_interrupt+0x81/0xa0
    [    0.946369]  </IRQ>
    [    0.946369]  <TASK>
    [    0.946369]  asm_common_interrupt+0x22/0x40
    [    0.946369] RIP: 0010:pv_native_safe_halt+0xb/0x10
    
    Use the `IRQF_NO_AUTOEN` flag when requesting interrupts to prevent auto
    enablement and explicitly enable the interrupt in NAPI initialization
    path (and disable it during NAPI teardown).
    
    This ensures that interrupt lifecycle is strictly coupled with
    readiness of NAPI context.
    
    Cc: [email protected]
    Fixes: 1dfc2e46117e ("gve: Refactor napi add and remove functions")
    Signed-off-by: Ankit Garg <[email protected]>
    Reviewed-by: Jordan Rhee <[email protected]>
    Reviewed-by: Joshua Washington <[email protected]>
    Signed-off-by: Harshitha Ramamurthy <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hfsplus: fix missing hfs_bnode_get() in __hfs_bnode_create [+ + +]

Author: Yang Chenzhi <[email protected]>
Date:   Fri Aug 29 17:39:12 2025 +0800

    hfsplus: fix missing hfs_bnode_get() in __hfs_bnode_create
    
    [ Upstream commit 152af114287851583cf7e0abc10129941f19466a ]
    
    When sync() and link() are called concurrently, both threads may
    enter hfs_bnode_find() without finding the node in the hash table
    and proceed to create it.
    
    Thread A:
      hfsplus_write_inode()
        -> hfsplus_write_system_inode()
          -> hfs_btree_write()
            -> hfs_bnode_find(tree, 0)
              -> __hfs_bnode_create(tree, 0)
    
    Thread B:
      hfsplus_create_cat()
        -> hfs_brec_insert()
          -> hfs_bnode_split()
            -> hfs_bmap_alloc()
              -> hfs_bnode_find(tree, 0)
                -> __hfs_bnode_create(tree, 0)
    
    In this case, thread A creates the bnode, sets refcnt=1, and hashes it.
    Thread B also tries to create the same bnode, notices it has already
    been inserted, drops its own instance, and uses the hashed one without
    getting the node.
    
    ```
    
            node2 = hfs_bnode_findhash(tree, cnid);
            if (!node2) {                                 <- Thread A
                    hash = hfs_bnode_hash(cnid);
                    node->next_hash = tree->node_hash[hash];
                    tree->node_hash[hash] = node;
                    tree->node_hash_cnt++;
            } else {                                      <- Thread B
                    spin_unlock(&tree->hash_lock);
                    kfree(node);
                    wait_event(node2->lock_wq,
                            !test_bit(HFS_BNODE_NEW, &node2->flags));
                    return node2;
            }
    ```
    
    However, hfs_bnode_find() requires each call to take a reference.
    Here both threads end up setting refcnt=1. When they later put the node,
    this triggers:
    
    BUG_ON(!atomic_read(&node->refcnt))
    
    In this scenario, Thread B in fact finds the node in the hash table
    rather than creating a new one, and thus must take a reference.
    
    Fix this by calling hfs_bnode_get() when reusing a bnode newly created by
    another thread to ensure the refcount is updated correctly.
    
    A similar bug was fixed in HFS long ago in commit
    a9dc087fd3c4 ("fix missing hfs_bnode_get() in __hfs_bnode_create")
    but the same issue remained in HFS+ until now.
    
    Reported-by: [email protected]
    Signed-off-by: Yang Chenzhi <[email protected]>
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hfsplus: fix volume corruption issue for generic/070 [+ + +]

Author: Viacheslav Dubeyko <[email protected]>
Date:   Fri Oct 31 17:12:30 2025 -0700

    hfsplus: fix volume corruption issue for generic/070
    
    [ Upstream commit ed490f36f439b877393c12a2113601e4145a5a56 ]
    
    The xfstests' test-case generic/070 leaves HFS+ volume
    in corrupted state:
    
    sudo ./check generic/070
    FSTYP -- hfsplus
    PLATFORM -- Linux/x86_64 hfsplus-testing-0001 6.17.0-rc1+ #4 SMP PREEMPT_DYNAMIC Wed Oct 1 15:02:44 PDT 2025
    MKFS_OPTIONS -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/070 _check_generic_filesystem: filesystem on /dev/loop50 is inconsistent
    (see xfstests-dev/results//generic/070.full for details)
    
    Ran: generic/070
    Failures: generic/070
    Failed 1 of 1 tests
    
    sudo fsck.hfsplus -d /dev/loop50
    ** /dev/loop50
    Using cacheBlockSize=32K cacheTotalBlock=1024 cacheSize=32768K.
    Executing fsck_hfs (version 540.1-Linux).
    ** Checking non-journaled HFS Plus Volume.
    The volume name is test
    ** Checking extents overflow file.
    Unused node is not erased (node = 1)
    ** Checking catalog file.
    ** Checking multi-linked files.
    ** Checking catalog hierarchy.
    ** Checking extended attributes file.
    ** Checking volume bitmap.
    ** Checking volume information.
    Verify Status: VIStat = 0x0000, ABTStat = 0x0000 EBTStat = 0x0004
    CBTStat = 0x0000 CatStat = 0x00000000
    ** Repairing volume.
    ** Rechecking volume.
    ** Checking non-journaled HFS Plus Volume.
    The volume name is test
    ** Checking extents overflow file.
    ** Checking catalog file.
    ** Checking multi-linked files.
    ** Checking catalog hierarchy.
    ** Checking extended attributes file.
    ** Checking volume bitmap.
    ** Checking volume information.
    ** The volume test was repaired successfully.
    
    It is possible to see that fsck.hfsplus detected not
    erased and unused node for the case of extents overflow file.
    The HFS+ logic has special method that defines if the node
    should be erased:
    
    bool hfs_bnode_need_zeroout(struct hfs_btree *tree)
    {
            struct super_block *sb = tree->inode->i_sb;
            struct hfsplus_sb_info *sbi = HFSPLUS_SB(sb);
            const u32 volume_attr = be32_to_cpu(sbi->s_vhdr->attributes);
    
            return tree->cnid == HFSPLUS_CAT_CNID &&
                    volume_attr & HFSPLUS_VOL_UNUSED_NODE_FIX;
    }
    
    However, it is possible to see that this method works
    only for the case of catalog file. But debugging of the issue
    has shown that HFSPLUS_VOL_UNUSED_NODE_FIX attribute has been
    requested for the extents overflow file too:
    
    catalog file
    kernel: hfsplus: node 4, num_recs 0, flags 0x10
    kernel: hfsplus: tree->cnid 4, volume_attr 0x80000800
    
    extents overflow file
    kernel: hfsplus: node 1, num_recs 0, flags 0x10
    kernel: hfsplus: tree->cnid 3, volume_attr 0x80000800
    
    This patch modifies the hfs_bnode_need_zeroout() by checking
    only volume_attr but not the b-tree ID because node zeroing
    can be requested for all HFS+ b-tree types.
    
    sudo ./check generic/070
    FSTYP         -- hfsplus
    PLATFORM      -- Linux/x86_64 hfsplus-testing-0001 6.18.0-rc3+ #79 SMP PREEMPT_DYNAMIC Fri Oct 31 16:07:42 PDT 2025
    MKFS_OPTIONS  -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/070 33s ...  34s
    Ran: generic/070
    Passed all 1 tests
    
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    cc: John Paul Adrian Glaubitz <[email protected]>
    cc: Yangtao Li <[email protected]>
    cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hfsplus: fix volume corruption issue for generic/073 [+ + +]

Author: Viacheslav Dubeyko <[email protected]>
Date:   Wed Nov 12 15:25:23 2025 -0800

    hfsplus: fix volume corruption issue for generic/073
    
    [ Upstream commit 24e17a29cf7537f0947f26a50f85319abd723c6c ]
    
    The xfstests' test-case generic/073 leaves HFS+ volume
    in corrupted state:
    
    sudo ./check generic/073
    FSTYP -- hfsplus
    PLATFORM -- Linux/x86_64 hfsplus-testing-0001 6.17.0-rc1+ #4 SMP PREEMPT_DYNAMIC Wed Oct 1 15:02:44 PDT 2025
    MKFS_OPTIONS -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/073 _check_generic_filesystem: filesystem on /dev/loop51 is inconsistent
    (see XFSTESTS-2/xfstests-dev/results//generic/073.full for details)
    
    Ran: generic/073
    Failures: generic/073
    Failed 1 of 1 tests
    
    sudo fsck.hfsplus -d /dev/loop51
    ** /dev/loop51
    Using cacheBlockSize=32K cacheTotalBlock=1024 cacheSize=32768K.
    Executing fsck_hfs (version 540.1-Linux).
    ** Checking non-journaled HFS Plus Volume.
    The volume name is untitled
    ** Checking extents overflow file.
    ** Checking catalog file.
    ** Checking multi-linked files.
    ** Checking catalog hierarchy.
    Invalid directory item count
    (It should be 1 instead of 0)
    ** Checking extended attributes file.
    ** Checking volume bitmap.
    ** Checking volume information.
    Verify Status: VIStat = 0x0000, ABTStat = 0x0000 EBTStat = 0x0000
    CBTStat = 0x0000 CatStat = 0x00004000
    ** Repairing volume.
    ** Rechecking volume.
    ** Checking non-journaled HFS Plus Volume.
    The volume name is untitled
    ** Checking extents overflow file.
    ** Checking catalog file.
    ** Checking multi-linked files.
    ** Checking catalog hierarchy.
    ** Checking extended attributes file.
    ** Checking volume bitmap.
    ** Checking volume information.
    ** The volume untitled was repaired successfully.
    
    The test is doing these steps on final phase:
    
    mv $SCRATCH_MNT/testdir_1/bar $SCRATCH_MNT/testdir_2/bar
    $XFS_IO_PROG -c "fsync" $SCRATCH_MNT/testdir_1
    $XFS_IO_PROG -c "fsync" $SCRATCH_MNT/foo
    
    So, we move file bar from testdir_1 into testdir_2 folder. It means that HFS+
    logic decrements the number of entries in testdir_1 and increments number of
    entries in testdir_2. Finally, we do fsync only for testdir_1 and foo but not
    for testdir_2. As a result, this is the reason why fsck.hfsplus detects the
    volume corruption afterwards.
    
    This patch fixes the issue by means of adding the
    hfsplus_cat_write_inode() call for old_dir and new_dir in
    hfsplus_rename() after the successful ending of
    hfsplus_rename_cat(). This method makes modification of in-core
    inode objects for old_dir and new_dir but it doesn't save these
    modifications in Catalog File's entries. It was expected that
    hfsplus_write_inode() will save these modifications afterwards.
    However, because generic/073 does fsync only for testdir_1 and foo
    then testdir_2 modification hasn't beed saved into Catalog File's
    entry and it was flushed without this modification. And it was
    detected by fsck.hfsplus. Now, hfsplus_rename() stores in Catalog
    File all modified entries and correct state of Catalog File will
    be flushed during hfsplus_file_fsync() call. Finally, it makes
    fsck.hfsplus happy.
    
    sudo ./check generic/073
    FSTYP         -- hfsplus
    PLATFORM      -- Linux/x86_64 hfsplus-testing-0001 6.18.0-rc3+ #93 SMP PREEMPT_DYNAMIC Wed Nov 12 14:37:49 PST 2025
    MKFS_OPTIONS  -- /dev/loop51
    MOUNT_OPTIONS -- /dev/loop51 /mnt/scratch
    
    generic/073 32s ...  32s
    Ran: generic/073
    Passed all 1 tests
    
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    cc: John Paul Adrian Glaubitz <[email protected]>
    cc: Yangtao Li <[email protected]>
    cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hfsplus: Verify inode mode when loading from disk [+ + +]

Author: Tetsuo Handa <[email protected]>
Date:   Sat Nov 15 18:18:54 2025 +0900

    hfsplus: Verify inode mode when loading from disk
    
    [ Upstream commit 005d4b0d33f6b4a23d382b7930f7a96b95b01f39 ]
    
    syzbot is reporting that S_IFMT bits of inode->i_mode can become bogus when
    the S_IFMT bits of the 16bits "mode" field loaded from disk are corrupted.
    
    According to [1], the permissions field was treated as reserved in Mac OS
    8 and 9. According to [2], the reserved field was explicitly initialized
    with 0, and that field must remain 0 as long as reserved. Therefore, when
    the "mode" field is not 0 (i.e. no longer reserved), the file must be
    S_IFDIR if dir == 1, and the file must be one of S_IFREG/S_IFLNK/S_IFCHR/
    S_IFBLK/S_IFIFO/S_IFSOCK if dir == 0.
    
    Reported-by: syzbot <[email protected]>
    Closes: https://syzkaller.appspot.com/bug?extid=895c23f6917da440ed0d
    Link: https://developer.apple.com/library/archive/technotes/tn/tn1150.html#HFSPlusPermissions [1]
    Link: https://developer.apple.com/library/archive/technotes/tn/tn1150.html#ReservedAndPadFields [2]
    Signed-off-by: Tetsuo Handa <[email protected]>
    Reviewed-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Viacheslav Dubeyko <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

HID: input: map HID_GD_Z to ABS_DISTANCE for stylus/pen [+ + +]

Author: Ping Cheng <[email protected]>
Date:   Mon Oct 27 13:37:42 2025 -0700

    HID: input: map HID_GD_Z to ABS_DISTANCE for stylus/pen
    
    commit 7953794f741e94d30df9dafaaa4c031c85b891d6 upstream.
    
    HID_GD_Z is mapped to ABS_Z for stylus and pen in hid-input.c. But HID_GD_Z
    should be used to report ABS_DISTANCE for stylus and pen as described at:
    Documentation/input/event-codes.rst#n226
    
    * ABS_DISTANCE:
    
      - Used to describe the distance of a tool from an interaction surface. This
        event should only be emitted while the tool is hovering, meaning in close
        proximity of the device and while the value of the BTN_TOUCH code is 0. If
        the input device may be used freely in three dimensions, consider ABS_Z
        instead.
      - BTN_TOOL_<name> should be set to 1 when the tool comes into detectable
        proximity and set to 0 when the tool leaves detectable proximity.
        BTN_TOOL_<name> signals the type of tool that is currently detected by the
        hardware and is otherwise independent of ABS_DISTANCE and/or BTN_TOUCH.
    
    This patch makes the correct mapping. The ABS_DISTANCE is currently not mapped
    by any HID usage in hid-generic driver.
    
    Signed-off-by: Ping Cheng <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jiri Kosina <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

HID: logitech-dj: Remove duplicate error logging [+ + +]

Author: Hans de Goede <[email protected]>
Date:   Sat Nov 8 22:03:18 2025 +0100

    HID: logitech-dj: Remove duplicate error logging
    
    commit ca389a55d8b2d86a817433bf82e0602b68c4d541 upstream.
    
    logi_dj_recv_query_paired_devices() and logi_dj_recv_switch_to_dj_mode()
    both have 2 callers which all log an error if the function fails. Move
    the error logging to inside these 2 functions to remove the duplicated
    error logging in the callers.
    
    While at it also move the logi_dj_recv_send_report() call error handling
    in logi_dj_recv_switch_to_dj_mode() to directly after the call. That call
    only fails if the report cannot be found and in that case it does nothing,
    so the msleep() is not necessary on failures.
    
    Fixes: 6f20d3261265 ("HID: logitech-dj: Fix error handling in logi_dj_recv_switch_to_dj_mode()")
    Cc: [email protected]
    Signed-off-by: Hans de Goede <[email protected]>
    Signed-off-by: Jiri Kosina <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hrtimers: Introduce hrtimer_update_function() [+ + +]

Author: Nam Cao <[email protected]>
Date:   Tue Dec 30 10:55:54 2025 -0500

    hrtimers: Introduce hrtimer_update_function()
    
    [ Upstream commit 8f02e3563bb5824eb01c94f2c75f1dcee2d05625 ]
    
    Some users of hrtimer need to change the callback function after the
    initial setup. They write to hrtimer::function directly.
    
    That's not safe under all circumstances as the write is lockless and a
    concurrent timer expiry might end up using the wrong function pointer.
    
    Introduce hrtimer_update_function(), which also performs runtime checks
    whether it is safe to modify the callback.
    
    This allows to make hrtimer::function private once all users are converted.
    
    Signed-off-by: Nam Cao <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://lore.kernel.org/all/20a937b0ae09ad54b5b6d86eabead7c570f1b72e.1730386209.git.namcao@linutronix.de
    Stable-dep-of: 267ee93c417e ("serial: xilinx_uartps: fix rs485 delay_rts_after_send")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hrtimers: Make hrtimer_update_function() less expensive [+ + +]

Author: Thomas Gleixner <[email protected]>
Date:   Fri Feb 7 22:16:09 2025 +0100

    hrtimers: Make hrtimer_update_function() less expensive
    
    commit 2ea97b76d6712bfb0408e5b81ffd7bc4551d3153 upstream.
    
    The sanity checks in hrtimer_update_function() are expensive for high
    frequency usage like in the io/uring code due to locking.
    
    Hide the sanity checks behind CONFIG_PROVE_LOCKING, which has a decent
    chance to be enabled on a regular basis for testing.
    
    Fixes: 8f02e3563bb5 ("hrtimers: Introduce hrtimer_update_function()")
    Reported-by: Jens Axboe <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://lore.kernel.org/all/87ikpllali.ffs@tglx
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hsr: hold rcu and dev lock for hsr_get_port_ndev [+ + +]

Author: Hangbin Liu <[email protected]>
Date:   Thu Dec 18 10:37:36 2025 -0500

    hsr: hold rcu and dev lock for hsr_get_port_ndev
    
    [ Upstream commit 847748fc66d08a89135a74e29362a66ba4e3ab15 ]
    
    hsr_get_port_ndev calls hsr_for_each_port, which need to hold rcu lock.
    On the other hand, before return the port device, we need to hold the
    device reference to avoid UaF in the caller function.
    
    Suggested-by: Paolo Abeni <[email protected]>
    Fixes: 9c10dd8eed74 ("net: hsr: Create and export hsr_get_port_ndev()")
    Signed-off-by: Hangbin Liu <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    [ Drop multicast filtering changes ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (dell-smm) Limit fan multiplier to avoid overflow [+ + +]

Author: Denis Sergeev <[email protected]>
Date:   Tue Dec 9 09:37:06 2025 +0300

    hwmon: (dell-smm) Limit fan multiplier to avoid overflow
    
    [ Upstream commit 46c28bbbb150b80827e4bcbea231560af9d16854 ]
    
    The fan nominal speed returned by SMM is limited to 16 bits, but the
    driver allows the fan multiplier to be set via a module parameter.
    
    Clamp the computed fan multiplier so that fan_nominal_speed *
    i8k_fan_mult always fits into a signed 32-bit integer and refuse to
    initialize the driver if the value is too large.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: 20bdeebc88269 ("hwmon: (dell-smm) Introduce helper function for data init")
    Signed-off-by: Denis Sergeev <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (ibmpex) fix use-after-free in high/low store [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Wed Dec 10 17:48:08 2025 +0800

    hwmon: (ibmpex) fix use-after-free in high/low store
    
    [ Upstream commit 6946c726c3f4c36f0f049e6f97e88c510b15f65d ]
    
    The ibmpex_high_low_store() function retrieves driver data using
    dev_get_drvdata() and uses it without validation. This creates a race
    condition where the sysfs callback can be invoked after the data
    structure is freed, leading to use-after-free.
    
    Fix by adding a NULL check after dev_get_drvdata(), and reordering
    operations in the deletion path to prevent TOCTOU.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: 57c7c3a0fdea ("hwmon: IBM power meter driver")
    Signed-off-by: Junrui Luo <[email protected]>
    Link: https://lore.kernel.org/r/MEYPR01MB7886BE2F51BFE41875B74B60AFA0A@MEYPR01MB7886.ausprd01.prod.outlook.com
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (ltc4282): Fix reset_history file permissions [+ + +]

Author: Nuno Sá <[email protected]>
Date:   Fri Dec 19 16:11:05 2025 +0000

    hwmon: (ltc4282): Fix reset_history file permissions
    
    [ Upstream commit b3db91c3bfea69a6c6258fea508f25a59c0feb1a ]
    
    The reset_history attributes are write only. Hence don't report them as
    readable just to return -EOPNOTSUPP later on.
    
    Fixes: cbc29538dbf7 ("hwmon: Add driver for LTC4282")
    Signed-off-by: Nuno Sá <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (max16065) Use local variable to avoid TOCTOU [+ + +]

Author: Gui-Dong Han <[email protected]>
Date:   Fri Nov 28 20:47:09 2025 +0800

    hwmon: (max16065) Use local variable to avoid TOCTOU
    
    commit b8d5acdcf525f44e521ca4ef51dce4dac403dab4 upstream.
    
    In max16065_current_show, data->curr_sense is read twice: once for the
    error check and again for the calculation. Since
    i2c_smbus_read_byte_data returns negative error codes on failure, if the
    data changes to an error code between the check and the use, ADC_TO_CURR
    results in an incorrect calculation.
    
    Read data->curr_sense into a local variable to ensure consistency. Note
    that data->curr_gain is constant and safe to access directly.
    
    This aligns max16065_current_show with max16065_input_show, which
    already uses a local variable for the same reason.
    
    Link: https://lore.kernel.org/all/CALbr=LYJ_ehtp53HXEVkSpYoub+XYSTU8Rg=o1xxMJ8=5z8B-g@mail.gmail.com/
    Fixes: f5bae2642e3d ("hwmon: Driver for MAX16065 System Manager and compatibles")
    Cc: [email protected]
    Signed-off-by: Gui-Dong Han <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (max6697) fix regmap leak on probe failure [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Nov 27 14:43:51 2025 +0100

    hwmon: (max6697) fix regmap leak on probe failure
    
    commit 02f0ad8e8de8cf5344f8f0fa26d9529b8339da47 upstream.
    
    The i2c regmap allocated during probe is never freed.
    
    Switch to using the device managed allocator so that the regmap is
    released on probe failures (e.g. probe deferral) and on driver unbind.
    
    Fixes: 3a2a8cc3fe24 ("hwmon: (max6697) Convert to use regmap")
    Cc: [email protected]      # 6.12
    Cc: Guenter Roeck <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (tmp401) fix overflow caused by default conversion rate value [+ + +]

Author: Alexey Simakov <[email protected]>
Date:   Thu Dec 11 19:43:43 2025 +0300

    hwmon: (tmp401) fix overflow caused by default conversion rate value
    
    [ Upstream commit 82f2aab35a1ab2e1460de06ef04c726460aed51c ]
    
    The driver computes conversion intervals using the formula:
    
        interval = (1 << (7 - rate)) * 125ms
    
    where 'rate' is the sensor's conversion rate register value. According to
    the datasheet, the power-on reset value of this register is 0x8, which
    could be assigned to the register, after handling i2c general call.
    Using this default value causes a result greater than the bit width of
    left operand and an undefined behaviour in the calculation above, since
    shifting by values larger than the bit width is undefined behaviour as
    per C language standard.
    
    Limit the maximum usable 'rate' value to 7 to prevent undefined
    behaviour in calculations.
    
    Found by Linux Verification Center (linuxtesting.org) with Svace.
    
    Note (groeck):
        This does not matter in practice unless someone overwrites the chip
        configuration from outside the driver while the driver is loaded.
        The conversion time register is initialized with a value of 5 (500ms)
        when the driver is loaded, and the driver never writes a bad value.
    
    Fixes: ca53e7640de7 ("hwmon: (tmp401) Convert to _info API")
    Signed-off-by: Alexey Simakov <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (w83791d) Convert macros to functions to avoid TOCTOU [+ + +]

Author: Gui-Dong Han <[email protected]>
Date:   Wed Dec 3 02:01:05 2025 +0800

    hwmon: (w83791d) Convert macros to functions to avoid TOCTOU
    
    commit 670d7ef945d3a84683594429aea6ab2cdfa5ceb4 upstream.
    
    The macro FAN_FROM_REG evaluates its arguments multiple times. When used
    in lockless contexts involving shared driver data, this leads to
    Time-of-Check to Time-of-Use (TOCTOU) race conditions, potentially
    causing divide-by-zero errors.
    
    Convert the macro to a static function. This guarantees that arguments
    are evaluated only once (pass-by-value), preventing the race
    conditions.
    
    Additionally, in store_fan_div, move the calculation of the minimum
    limit inside the update lock. This ensures that the read-modify-write
    sequence operates on consistent data.
    
    Adhere to the principle of minimal changes by only converting macros
    that evaluate arguments multiple times and are used in lockless
    contexts.
    
    Link: https://lore.kernel.org/all/CALbr=LYJ_ehtp53HXEVkSpYoub+XYSTU8Rg=o1xxMJ8=5z8B-g@mail.gmail.com/
    Fixes: 9873964d6eb2 ("[PATCH] HWMON: w83791d: New hardware monitoring driver for the Winbond W83791D")
    Cc: [email protected]
    Signed-off-by: Gui-Dong Han <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

hwmon: (w83l786ng) Convert macros to functions to avoid TOCTOU [+ + +]

Author: Gui-Dong Han <[email protected]>
Date:   Fri Nov 28 20:38:16 2025 +0800

    hwmon: (w83l786ng) Convert macros to functions to avoid TOCTOU
    
    commit 07272e883fc61574b8367d44de48917f622cdd83 upstream.
    
    The macros FAN_FROM_REG and TEMP_FROM_REG evaluate their arguments
    multiple times. When used in lockless contexts involving shared driver
    data, this causes Time-of-Check to Time-of-Use (TOCTOU) race
    conditions.
    
    Convert the macros to static functions. This guarantees that arguments
    are evaluated only once (pass-by-value), preventing the race
    conditions.
    
    Adhere to the principle of minimal changes by only converting macros
    that evaluate arguments multiple times and are used in lockless
    contexts.
    
    Link: https://lore.kernel.org/all/CALbr=LYJ_ehtp53HXEVkSpYoub+XYSTU8Rg=o1xxMJ8=5z8B-g@mail.gmail.com/
    Fixes: 85f03bccd6e0 ("hwmon: Add support for Winbond W83L786NG/NR")
    Cc: [email protected]
    Signed-off-by: Gui-Dong Han <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: amd-mp2: fix reference leak in MP2 PCI device [+ + +]

Author: Ma Ke <[email protected]>
Date:   Wed Oct 22 17:54:02 2025 +0800

    i2c: amd-mp2: fix reference leak in MP2 PCI device
    
    commit a6ee6aac66fb394b7f6e6187c73bdcd873f2d139 upstream.
    
    In i2c_amd_probe(), amd_mp2_find_device() utilizes
    driver_find_next_device() which internally calls driver_find_device()
    to locate the matching device. driver_find_device() increments the
    reference count of the found device by calling get_device(), but
    amd_mp2_find_device() fails to call put_device() to decrement the
    reference count before returning. This results in a reference count
    leak of the PCI device each time i2c_amd_probe() is executed, which
    may prevent the device from being properly released and cause a memory
    leak.
    
    Found by code review.
    
    Cc: [email protected]
    Fixes: 529766e0a011 ("i2c: Add drivers for the AMD PCIe MP2 I2C controller")
    Signed-off-by: Ma Ke <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

i2c: designware: Disable SMBus interrupts to prevent storms from mis-configured firmware [+ + +]

Author: Jinhui Guo <[email protected]>
Date:   Tue Oct 21 15:57:14 2025 +0800

    i2c: designware: Disable SMBus interrupts to prevent storms from mis-configured firmware
    
    [ Upstream commit d3429178ee51dd7155445d15a5ab87a45fae3c73 ]
    
    When probing the I2C master, disable SMBus interrupts to prevent
    storms caused by broken firmware mis-configuring IC_SMBUS=1; the
    handler never services them and a mis-configured SMBUS Master
    extend-clock timeout or SMBUS Slave extend-clock timeout can
    flood the CPU.
    
    Signed-off-by: Jinhui Guo <[email protected]>
    Reviewed-by: Andy Shevchenko <[email protected]>
    Acked-by: Mika Westerberg <[email protected]>
    Signed-off-by: Andi Shyti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

i40e: fix scheduling in set_rx_mode [+ + +]

Author: Przemyslaw Korba <[email protected]>
Date:   Thu Nov 20 13:07:28 2025 +0100

    i40e: fix scheduling in set_rx_mode
    
    [ Upstream commit be43abc5514167cc129a8d8e9727b89b8e1d9719 ]
    
    Add service task schedule to set_rx_mode.
    In some cases there are error messages printed out in PTP application
    (ptp4l):
    
    ptp4l[13848.762]: port 1 (ens2f3np3): received SYNC without timestamp
    ptp4l[13848.825]: port 1 (ens2f3np3): received SYNC without timestamp
    ptp4l[13848.887]: port 1 (ens2f3np3): received SYNC without timestamp
    
    This happens when service task would not run immediately after
    set_rx_mode, and we need it for setup tasks. This service task checks, if
    PTP RX packets are hung in firmware, and propagate correct settings such
    as multicast address for IEEE 1588 Precision Time Protocol.
    RX timestamping depends on some of these filters set. Bug happens only
    with high PTP packets frequency incoming, and not every run since
    sometimes service task is being ran from a different place immediately
    after starting ptp4l.
    
    Fixes: 0e4425ed641f ("i40e: fix: do not sleep in netdev_ops")
    Reviewed-by: Grzegorz Nitka <[email protected]>
    Reviewed-by: Jacob Keller <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Signed-off-by: Przemyslaw Korba <[email protected]>
    Tested-by: Rinitha S <[email protected]> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

i40e: validate ring_len parameter against hardware-specific values [+ + +]

Author: Gregory Herrero <[email protected]>
Date:   Fri Dec 12 22:06:43 2025 +0100

    i40e: validate ring_len parameter against hardware-specific values
    
    [ Upstream commit 69942834215323cd9131db557091b4dec43f19c5 ]
    
    The maximum number of descriptors supported by the hardware is
    hardware-dependent and can be retrieved using
    i40e_get_max_num_descriptors(). Move this function to a shared header
    and use it when checking for valid ring_len parameter rather than using
    hardcoded value.
    
    By fixing an over-acceptance issue, behavior change could be seen where
    ring_len could now be rejected while configuring rx and tx queues if its
    size is larger than the hardware-dependent maximum number of
    descriptors.
    
    Fixes: 55d225670def ("i40e: add validation for ring_len param")
    Signed-off-by: Gregory Herrero <[email protected]>
    Tested-by: Rafal Romanowski <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iavf: fix off-by-one issues in iavf_config_rss_reg() [+ + +]

Author: Kohei Enju <[email protected]>
Date:   Sun Oct 26 01:58:50 2025 +0900

    iavf: fix off-by-one issues in iavf_config_rss_reg()
    
    [ Upstream commit 6daa2893f323981c7894c68440823326e93a7d61 ]
    
    There are off-by-one bugs when configuring RSS hash key and lookup
    table, causing out-of-bounds reads to memory [1] and out-of-bounds
    writes to device registers.
    
    Before commit 43a3d9ba34c9 ("i40evf: Allow PF driver to configure RSS"),
    the loop upper bounds were:
        i <= I40E_VFQF_{HKEY,HLUT}_MAX_INDEX
    which is safe since the value is the last valid index.
    
    That commit changed the bounds to:
        i <= adapter->rss_{key,lut}_size / 4
    where `rss_{key,lut}_size / 4` is the number of dwords, so the last
    valid index is `(rss_{key,lut}_size / 4) - 1`. Therefore, using `<=`
    accesses one element past the end.
    
    Fix the issues by using `<` instead of `<=`, ensuring we do not exceed
    the bounds.
    
    [1] KASAN splat about rss_key_size off-by-one
      BUG: KASAN: slab-out-of-bounds in iavf_config_rss+0x619/0x800
      Read of size 4 at addr ffff888102c50134 by task kworker/u8:6/63
    
      CPU: 0 UID: 0 PID: 63 Comm: kworker/u8:6 Not tainted 6.18.0-rc2-enjuk-tnguy-00378-g3005f5b77652-dirty #156 PREEMPT(voluntary)
      Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
      Workqueue: iavf iavf_watchdog_task
      Call Trace:
       <TASK>
       dump_stack_lvl+0x6f/0xb0
       print_report+0x170/0x4f3
       kasan_report+0xe1/0x1a0
       iavf_config_rss+0x619/0x800
       iavf_watchdog_task+0x2be7/0x3230
       process_one_work+0x7fd/0x1420
       worker_thread+0x4d1/0xd40
       kthread+0x344/0x660
       ret_from_fork+0x249/0x320
       ret_from_fork_asm+0x1a/0x30
       </TASK>
    
      Allocated by task 63:
       kasan_save_stack+0x30/0x50
       kasan_save_track+0x14/0x30
       __kasan_kmalloc+0x7f/0x90
       __kmalloc_noprof+0x246/0x6f0
       iavf_watchdog_task+0x28fc/0x3230
       process_one_work+0x7fd/0x1420
       worker_thread+0x4d1/0xd40
       kthread+0x344/0x660
       ret_from_fork+0x249/0x320
       ret_from_fork_asm+0x1a/0x30
    
      The buggy address belongs to the object at ffff888102c50100
       which belongs to the cache kmalloc-64 of size 64
      The buggy address is located 0 bytes to the right of
       allocated 52-byte region [ffff888102c50100, ffff888102c50134)
    
      The buggy address belongs to the physical page:
      page: refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x102c50
      flags: 0x200000000000000(node=0|zone=2)
      page_type: f5(slab)
      raw: 0200000000000000 ffff8881000418c0 dead000000000122 0000000000000000
      raw: 0000000000000000 0000000080200020 00000000f5000000 0000000000000000
      page dumped because: kasan: bad access detected
    
      Memory state around the buggy address:
       ffff888102c50000: 00 00 00 00 00 00 00 fc fc fc fc fc fc fc fc fc
       ffff888102c50080: 00 00 00 00 00 00 00 fc fc fc fc fc fc fc fc fc
      >ffff888102c50100: 00 00 00 00 00 00 04 fc fc fc fc fc fc fc fc fc
                                           ^
       ffff888102c50180: 00 00 00 00 00 00 00 00 fc fc fc fc fc fc fc fc
       ffff888102c50200: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
    
    Fixes: 43a3d9ba34c9 ("i40evf: Allow PF driver to configure RSS")
    Signed-off-by: Kohei Enju <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Reviewed-by: Przemek Kitszel <[email protected]>
    Tested-by: Rafal Romanowski <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

idpf: add support for SW triggered interrupts [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:40 2025 -0800

    idpf: add support for SW triggered interrupts
    
    [ Upstream commit 93433c1d919775f8ac0f7893692f42e6731a5373 ]
    
    SW triggered interrupts are guaranteed to fire after their timer
    expires, unlike Tx and Rx interrupts which will only fire after the
    timer expires _and_ a descriptor write back is available to be processed
    by the driver.
    
    Add the necessary fields, defines, and initializations to enable a SW
    triggered interrupt in the vector's dyn_ctl register.
    
    Reviewed-by: Madhu Chittim <[email protected]>
    Signed-off-by: Joshua Hay <[email protected]>
    Tested-by: Krishneil Singh <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: add support for Tx refillqs in flow scheduling mode [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:42 2025 -0800

    idpf: add support for Tx refillqs in flow scheduling mode
    
    [ Upstream commit cb83b559bea39f207ee214ee2972657e8576ed18 ]
    
    Changes from original commit:
    - Adjusted idpf_tx_queue assert size to align with 6.12 struct definition
    
    In certain production environments, it is possible for completion tags
    to collide, meaning N packets with the same completion tag are in flight
    at the same time. In this environment, any given Tx queue is effectively
    used to send both slower traffic and higher throughput traffic
    simultaneously. This is the result of a customer's specific
    configuration in the device pipeline, the details of which Intel cannot
    provide. This configuration results in a small number of out-of-order
    completions, i.e., a small number of packets in flight. The existing
    guardrails in the driver only protect against a large number of packets
    in flight. The slower flow completions are delayed which causes the
    out-of-order completions. The fast flow will continue sending traffic
    and generating tags. Because tags are generated on the fly, the fast
    flow eventually uses the same tag for a packet that is still in flight
    from the slower flow. The driver has no idea which packet it should
    clean when it processes the completion with that tag, but it will look
    for the packet on the buffer ring before the hash table.  If the slower
    flow packet completion is processed first, it will end up cleaning the
    fast flow packet on the ring prematurely. This leaves the descriptor
    ring in a bad state resulting in a crash or Tx timeout.
    
    In summary, generating a tag when a packet is sent can lead to the same
    tag being associated with multiple packets. This can lead to resource
    leaks, crashes, and/or Tx timeouts.
    
    Before we can replace the tag generation, we need a new mechanism for
    the send path to know what tag to use next. The driver will allocate and
    initialize a refillq for each TxQ with all of the possible free tag
    values. During send, the driver grabs the next free tag from the refillq
    from next_to_clean. While cleaning the packet, the clean routine posts
    the tag back to the refillq's next_to_use to indicate that it is now
    free to use.
    
    This mechanism works exactly the same way as the existing Rx refill
    queues, which post the cleaned buffer IDs back to the buffer queue to be
    reposted to HW. Since we're using the refillqs for both Rx and Tx now,
    genericize some of the existing refillq support.
    
    Note: the refillqs will not be used yet. This is only demonstrating how
    they will be used to pass free tags back to the send path.
    
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: improve when to set RE bit logic [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:43 2025 -0800

    idpf: improve when to set RE bit logic
    
    [ Upstream commit f2d18e16479cac7a708d77cbfb4220a9114a71fc ]
    
    Track the gap between next_to_use and the last RE index. Set RE again
    if the gap is large enough to ensure RE bit is set frequently. This is
    critical before removing the stashing mechanisms because the
    opportunistic descriptor ring cleaning from the out-of-order completions
    will go away. Previously the descriptors would be "cleaned" by both the
    descriptor (RE) completion and the out-of-order completions. Without the
    latter, we must ensure the RE bit is set more frequently. Otherwise,
    it's theoretically possible for the descriptor ring next_to_clean to
    never advance.  The previous implementation was dependent on the start
    of a packet falling on a 64th index in the descriptor ring, which is not
    guaranteed with large packets.
    
    Signed-off-by: Luigi Rizzo <[email protected]>
    Signed-off-by: Brian Vazquez <[email protected]>
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: reduce mbx_task schedule delay to 300us [+ + +]

Author: Brian Vazquez <[email protected]>
Date:   Mon Nov 10 20:58:37 2025 +0000

    idpf: reduce mbx_task schedule delay to 300us
    
    [ Upstream commit b3d6bbae1d6d5638a4ab702ab195476787cde857 ]
    
    During the IDPF init phase, the mailbox runs in poll mode until it is
    configured to properly handle interrupts. The previous delay of 300ms is
    excessively long for the mailbox polling mechanism, which causes a slow
    initialization of ~2s:
    
    echo 0000:06:12.4 > /sys/bus/pci/drivers/idpf/bind
    
    [   52.444239] idpf 0000:06:12.4: enabling device (0000 -> 0002)
    [   52.485005] idpf 0000:06:12.4: Device HW Reset initiated
    [   54.177181] idpf 0000:06:12.4: PTP init failed, err=-EOPNOTSUPP
    [   54.206177] idpf 0000:06:12.4: Minimum RX descriptor support not provided, using the default
    [   54.206182] idpf 0000:06:12.4: Minimum TX descriptor support not provided, using the default
    
    Changing the delay to 300us avoids the delays during the initial mailbox
    transactions, making the init phase much faster:
    
    [   83.342590] idpf 0000:06:12.4: enabling device (0000 -> 0002)
    [   83.384402] idpf 0000:06:12.4: Device HW Reset initiated
    [   83.518323] idpf 0000:06:12.4: PTP init failed, err=-EOPNOTSUPP
    [   83.547430] idpf 0000:06:12.4: Minimum RX descriptor support not provided, using the default
    [   83.547435] idpf 0000:06:12.4: Minimum TX descriptor support not provided, using the default
    
    Fixes: 4930fbf419a7 ("idpf: add core init and interrupt request")
    Signed-off-by: Brian Vazquez <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

idpf: remove obsolete stashing code [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:47 2025 -0800

    idpf: remove obsolete stashing code
    
    [ Upstream commit 6c4e68480238274f84aa50d54da0d9e262df6284 ]
    
    Changes from original commit:
    - Adjusted idpf_tx_queue assert size to align with 6.12 struct definition
    
    With the new Tx buffer management scheme, there is no need for all of
    the stashing mechanisms, the hash table, the reserve buffer stack, etc.
    Remove all of that.
    
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: replace flow scheduling buffer ring with buffer pool [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:45 2025 -0800

    idpf: replace flow scheduling buffer ring with buffer pool
    
    [ Upstream commit 5f417d551324d2894168b362f2429d120ab06243 ]
    
    Replace the TxQ buffer ring with one large pool/array of buffers (only
    for flow scheduling). This eliminates the tag generation and makes it
    impossible for a tag to be associated with more than one packet.
    
    The completion tag passed to HW through the descriptor is the index into
    the array. That same completion tag is posted back to the driver in the
    completion descriptor, and used to index into the array to quickly
    retrieve the buffer during cleaning.  In this way, the tags are treated
    as a fix sized resource. If all tags are in use, no more packets can be
    sent on that particular queue (until some are freed up). The tag pool
    size is 64K since the completion tag width is 16 bits.
    
    For each packet, the driver pulls a free tag from the refillq to get the
    next free buffer index. When cleaning is complete, the tag is posted
    back to the refillq. A multi-frag packet spans multiple buffers in the
    driver, therefore it uses multiple buffer indexes/tags from the pool.
    Each frag pulls from the refillq to get the next free buffer index.
    These are tracked in a next_buf field that replaces the completion tag
    field in the buffer struct. This chains the buffers together so that the
    packet can be cleaned from the starting completion tag taken from the
    completion descriptor, then from the next_buf field for each subsequent
    buffer.
    
    In case of a dma_mapping_error occurs or the refillq runs out of free
    buf_ids, the packet will execute the rollback error path. This unmaps
    any buffers previously mapped for the packet. Since several free
    buf_ids could have already been pulled from the refillq, we need to
    restore its original state as well. Otherwise, the buf_ids/tags
    will be leaked and not used again until the queue is reallocated.
    
    Descriptor completions only advance the descriptor ring index to "clean"
    the descriptors. The packet completions only clean the buffers
    associated with the given packet completion tag and do not update the
    descriptor ring index.
    
    When operating in queue based scheduling mode, the array still acts as a
    ring and will only have TxQ descriptor count entries. The tx_bufs are
    still associated 1:1 with the descriptor ring entries and we can use the
    conventional indexing mechanisms.
    
    Fixes: c2d548cad150 ("idpf: add TX splitq napi poll support")
    Signed-off-by: Luigi Rizzo <[email protected]>
    Signed-off-by: Brian Vazquez <[email protected]>
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Reviewed-by: Aleksandr Loktionov <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: simplify and fix splitq Tx packet rollback error path [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:44 2025 -0800

    idpf: simplify and fix splitq Tx packet rollback error path
    
    [ Upstream commit b61dfa9bc4430ad82b96d3a7c1c485350f91b467 ]
    
    Move (and rename) the existing rollback logic to singleq.c since that
    will be the only consumer. Create a simplified splitq specific rollback
    function to loop through and unmap tx_bufs based on the completion tag.
    This is critical before replacing the Tx buffer ring with the buffer
    pool since the previous rollback indexing will not work to unmap the
    chained buffers from the pool.
    
    Cache the next_to_use index before any portion of the packet is put on
    the descriptor ring. In case of an error, the rollback will bump tail to
    the correct next_to_use value. Because the splitq path now supports
    different types of context descriptors (and potentially multiple in the
    future), this will take care of rolling back any and all context
    descriptors encoded on the ring for the erroneous packet. The previous
    rollback logic was broken for PTP packets since it would not account for
    the PTP context descriptor.
    
    Fixes: 1a49cf814fe1 ("idpf: add Tx timestamp flows")
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: stop Tx if there are insufficient buffer resources [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:46 2025 -0800

    idpf: stop Tx if there are insufficient buffer resources
    
    [ Upstream commit 0c3f135e840d4a2ba4253e15d530ec61bc30718e ]
    
    The Tx refillq logic will cause packets to be silently dropped if there
    are not enough buffer resources available to send a packet in flow
    scheduling mode. Instead, determine how many buffers are needed along
    with number of descriptors. Make sure there are enough of both resources
    to send the packet, and stop the queue if not.
    
    Fixes: 7292af042bcf ("idpf: fix a race in txq wakeup")
    Signed-off-by: Joshua Hay <[email protected]>
    Reviewed-by: Madhu Chittim <[email protected]>
    Tested-by: Samuel Salin <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idpf: trigger SW interrupt when exiting wb_on_itr mode [+ + +]

Author: Joshua Hay <[email protected]>
Date:   Mon Dec 15 13:42:41 2025 -0800

    idpf: trigger SW interrupt when exiting wb_on_itr mode
    
    [ Upstream commit 0c1683c681681c14f4389e3bfa8de10baf242ba8 ]
    
    There is a race condition between exiting wb_on_itr and completion write
    backs. For example, we are in wb_on_itr mode and a Tx completion is
    generated by HW, ready to be written back, as we are re-enabling
    interrupts:
    
            HW                      SW
            |                       |
            |                       | idpf_tx_splitq_clean_all
            |                       | napi_complete_done
            |                       |
            | tx_completion_wb      | idpf_vport_intr_update_itr_ena_irq
    
    That tx_completion_wb happens before the vector is fully re-enabled.
    Continuing with this example, it is a UDP stream and the
    tx_completion_wb is the last one in the flow (there are no rx packets).
    Because the HW generated the completion before the interrupt is fully
    enabled, the HW will not fire the interrupt once the timer expires and
    the write back will not happen. NAPI poll won't be called.  We have
    indicated we're back in interrupt mode but nothing else will trigger the
    interrupt. Therefore, the completion goes unprocessed, triggering a Tx
    timeout.
    
    To mitigate this, fire a SW triggered interrupt upon exiting wb_on_itr.
    This interrupt will catch the rogue completion and avoid the timeout.
    Add logic to set the appropriate bits in the vector's dyn_ctl register.
    
    Fixes: 9c4a27da0ecc ("idpf: enable WB_ON_ITR")
    Reviewed-by: Madhu Chittim <[email protected]>
    Signed-off-by: Joshua Hay <[email protected]>
    Tested-by: Krishneil Singh <[email protected]>
    Signed-off-by: Tony Nguyen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

idr: fix idr_alloc() returning an ID out of range [+ + +]

Author: Matthew Wilcox (Oracle) <[email protected]>
Date:   Fri Nov 28 16:18:32 2025 +0000

    idr: fix idr_alloc() returning an ID out of range
    
    commit c6e8e595a0798ad67da0f7bebaf69c31ef70dfff upstream.
    
    If you use an IDR with a non-zero base, and specify a range that lies
    entirely below the base, 'max - base' becomes very large and
    idr_get_free() can return an ID that lies outside of the requested range.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 6ce711f27500 ("idr: Make 1-based IDRs more efficient")
    Signed-off-by: Matthew Wilcox (Oracle) <[email protected]>
    Reported-by: Jan Sokolowski <[email protected]>
    Reported-by: Koen Koning <[email protected]>
    Reported-by: Peter Senna Tschudin <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/xe/kernel/-/issues/6449
    Reviewed-by: Christian König <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iio: adc: ti_am335x_adc: Limit step_avg to valid range for gcc complains [+ + +]

Author: Pei Xiao <[email protected]>
Date:   Tue Oct 14 17:12:50 2025 +0800

    iio: adc: ti_am335x_adc: Limit step_avg to valid range for gcc complains
    
    [ Upstream commit c9fb952360d0c78bbe98239bd6b702f05c2dbb31 ]
    
    FIELD_PREP() checks that a value fits into the available bitfield, add a
    check for step_avg to fix gcc complains.
    
    which gcc complains about:
      drivers/iio/adc/ti_am335x_adc.c: In function 'tiadc_step_config':
      include/linux/compiler_types.h:572:38: error: call to
    '__compiletime_assert_491' declared with attribute error: FIELD_PREP: value
    too large for the field include/linux/mfd/ti_am335x_tscadc.h:58:29: note:
    in expansion of macro 'FIELD_PREP'
        #define STEPCONFIG_AVG(val) FIELD_PREP(GENMASK(4, 2), (val))
                                    ^~~~~~~~~~
    drivers/iio/adc/ti_am335x_adc.c:127:17: note: in expansion of macro 'STEPCONFIG_AVG'
            stepconfig = STEPCONFIG_AVG(ffs(adc_dev->step_avg[i]) - 1)
    
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Signed-off-by: Pei Xiao <[email protected]>
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Input: alps - fix use-after-free bugs caused by dev3_register_work [+ + +]

Author: Duoming Zhou <[email protected]>
Date:   Wed Dec 17 11:00:17 2025 +0800

    Input: alps - fix use-after-free bugs caused by dev3_register_work
    
    commit bf40644ef8c8a288742fa45580897ed0e0289474 upstream.
    
    The dev3_register_work delayed work item is initialized within
    alps_reconnect() and scheduled upon receipt of the first bare
    PS/2 packet from an external PS/2 device connected to the ALPS
    touchpad. During device detachment, the original implementation
    calls flush_workqueue() in psmouse_disconnect() to ensure
    completion of dev3_register_work. However, the flush_workqueue()
    in psmouse_disconnect() only blocks and waits for work items that
    were already queued to the workqueue prior to its invocation. Any
    work items submitted after flush_workqueue() is called are not
    included in the set of tasks that the flush operation awaits.
    This means that after flush_workqueue() has finished executing,
    the dev3_register_work could still be scheduled. Although the
    psmouse state is set to PSMOUSE_CMD_MODE in psmouse_disconnect(),
    the scheduling of dev3_register_work remains unaffected.
    
    The race condition can occur as follows:
    
    CPU 0 (cleanup path)     | CPU 1 (delayed work)
    psmouse_disconnect()     |
      psmouse_set_state()    |
      flush_workqueue()      | alps_report_bare_ps2_packet()
      alps_disconnect()      |   psmouse_queue_work()
        kfree(priv); // FREE | alps_register_bare_ps2_mouse()
                             |   priv = container_of(work...); // USE
                             |   priv->dev3 // USE
    
    Add disable_delayed_work_sync() in alps_disconnect() to ensure
    that dev3_register_work is properly canceled and prevented from
    executing after the alps_data structure has been deallocated.
    
    This bug is identified by static analysis.
    
    Fixes: 04aae283ba6a ("Input: ALPS - do not mix trackstick and external PS/2 mouse data")
    Cc: [email protected]
    Signed-off-by: Duoming Zhou <[email protected]>
    Link: https://patch.msgid.link/b57b0a9ccca51a3f06be141bfc02b9ffe69d1845.1765939397.git.duoming@zju.edu.cn
    Signed-off-by: Dmitry Torokhov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Input: i8042 - add TUXEDO InfinityBook Max Gen10 AMD to i8042 quirk table [+ + +]

Author: Christoffer Sandberg <[email protected]>
Date:   Mon Nov 24 21:31:34 2025 +0100

    Input: i8042 - add TUXEDO InfinityBook Max Gen10 AMD to i8042 quirk table
    
    commit aed3716db7fff74919cc5775ca3a80c8bb246489 upstream.
    
    The device occasionally wakes up from suspend with missing input on the
    internal keyboard and the following suspend attempt results in an instant
    wake-up. The quirks fix both issues for this device.
    
    Signed-off-by: Christoffer Sandberg <[email protected]>
    Signed-off-by: Werner Sembach <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Dmitry Torokhov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Input: lkkbd - disable pending work before freeing device [+ + +]

Author: Minseong Kim <[email protected]>
Date:   Fri Dec 12 00:29:23 2025 -0800

    Input: lkkbd - disable pending work before freeing device
    
    commit e58c88f0cb2d8ed89de78f6f17409d29cfab6c5c upstream.
    
    lkkbd_interrupt() schedules lk->tq via schedule_work(), and the work
    handler lkkbd_reinit() dereferences the lkkbd structure and its
    serio/input_dev fields.
    
    lkkbd_disconnect() and error paths in lkkbd_connect() free the lkkbd
    structure without preventing the reinit work from being queued again
    until serio_close() returns. This can allow the work handler to run
    after the structure has been freed, leading to a potential use-after-free.
    
    Use disable_work_sync() instead of cancel_work_sync() to ensure the
    reinit work cannot be re-queued, and call it both in lkkbd_disconnect()
    and in lkkbd_connect() error paths after serio_open().
    
    Signed-off-by: Minseong Kim <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Dmitry Torokhov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Input: ti_am335x_tsc - fix off-by-one error in wire_order validation [+ + +]

Author: Junjie Cao <[email protected]>
Date:   Thu Dec 18 21:56:59 2025 -0800

    Input: ti_am335x_tsc - fix off-by-one error in wire_order validation
    
    commit 248d3a73a0167dce15ba100477c3e778c4787178 upstream.
    
    The current validation 'wire_order[i] > ARRAY_SIZE(config_pins)' allows
    wire_order[i] to equal ARRAY_SIZE(config_pins), which causes out-of-bounds
    access when used as index in 'config_pins[wire_order[i]]'.
    
    Since config_pins has 4 elements (indices 0-3), the valid range for
    wire_order should be 0-3. Fix the off-by-one error by using >= instead
    of > in the validation check.
    
    Signed-off-by: Junjie Cao <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: bb76dc09ddfc ("input: ti_am33x_tsc: Order of TSC wires, made configurable")
    Cc: [email protected]
    Signed-off-by: Dmitry Torokhov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

intel_th: Fix error handling in intel_th_output_open [+ + +]

Author: Ma Ke <[email protected]>
Date:   Wed Nov 12 17:17:23 2025 +0800

    intel_th: Fix error handling in intel_th_output_open
    
    commit 6d5925b667e4ed9e77c8278cc215191d29454a3f upstream.
    
    intel_th_output_open() calls bus_find_device_by_devt() which
    internally increments the device reference count via get_device(), but
    this reference is not properly released in several error paths. When
    device driver is unavailable, file operations cannot be obtained, or
    the driver's open method fails, the function returns without calling
    put_device(), leading to a permanent device reference count leak. This
    prevents the device from being properly released and could cause
    resource exhaustion over time.
    
    Found by code review.
    
    Cc: stable <[email protected]>
    Fixes: 39f4034693b7 ("intel_th: Add driver infrastructure for Intel(R) Trace Hub devices")
    Signed-off-by: Ma Ke <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

interconnect: qcom: sdx75: Drop QPIC interconnect and BCM nodes [+ + +]

Author: Raviteja Laggyshetty <[email protected]>
Date:   Fri Sep 26 12:12:09 2025 +0530

    interconnect: qcom: sdx75: Drop QPIC interconnect and BCM nodes
    
    commit 295f58fdccd05b2d6da1f4a4f81952ccb565c4dc upstream.
    
    As like other SDX SoCs, SDX75 SoC's QPIC BCM resource was modeled as a
    RPMh clock in clk-rpmh driver. However, for SDX75, this resource was also
    described as an interconnect and BCM node mistakenly. It is incorrect to
    describe the same resource in two different providers, as it will lead to
    votes from clients overriding each other.
    
    Hence, drop the QPIC interconnect and BCM nodes and let the clients use
    clk-rpmh driver to vote for this resource.
    
    Without this change, the NAND driver fails to probe on SDX75, as the
    interconnect sync state disables the QPIC nodes as there were no clients
    voting for this ICC resource. However, the NAND driver had already voted
    for this BCM resource through the clk-rpmh driver. Since both votes come
    from Linux, RPMh was unable to distinguish between these two and ends up
    disabling the QPIC resource during sync state.
    
    Cc: [email protected]
    Fixes: 3642b4e5cbfe ("interconnect: qcom: Add SDX75 interconnect provider driver")
    Signed-off-by: Raviteja Laggyshetty <[email protected]>
    [mani: dropped the reference to bcm_qp0, reworded description]
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Konrad Dybcio <[email protected]>
    Tested-by: Lakshmi Sowjanya D <[email protected]>  # on SDX75
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Georgi Djakov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring/poll: correctly handle io_poll_add() return value on update [+ + +]

Author: Jens Axboe <[email protected]>
Date:   Mon Dec 1 13:25:22 2025 -0700

    io_uring/poll: correctly handle io_poll_add() return value on update
    
    Commit 84230ad2d2afbf0c44c32967e525c0ad92e26b4e upstream.
    
    When the core of io_uring was updated to handle completions
    consistently and with fixed return codes, the POLL_REMOVE opcode
    with updates got slightly broken. If a POLL_ADD is pending and
    then POLL_REMOVE is used to update the events of that request, if that
    update causes the POLL_ADD to now trigger, then that completion is lost
    and a CQE is never posted.
    
    Additionally, ensure that if an update does cause an existing POLL_ADD
    to complete, that the completion value isn't always overwritten with
    -ECANCELED. For that case, whatever io_poll_add() set the value to
    should just be retained.
    
    Cc: [email protected]
    Fixes: 97b388d70b53 ("io_uring: handle completions in the core")
    Reported-by: [email protected]
    Tested-by: [email protected]
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring: fix filename leak in __io_openat_prep() [+ + +]

Author: Prithvi Tambewagh <[email protected]>
Date:   Thu Dec 25 12:58:29 2025 +0530

    io_uring: fix filename leak in __io_openat_prep()
    
    commit b14fad555302a2104948feaff70503b64c80ac01 upstream.
    
     __io_openat_prep() allocates a struct filename using getname(). However,
    for the condition of the file being installed in the fixed file table as
    well as having O_CLOEXEC flag set, the function returns early. At that
    point, the request doesn't have REQ_F_NEED_CLEANUP flag set. Due to this,
    the memory for the newly allocated struct filename is not cleaned up,
    causing a memory leak.
    
    Fix this by setting the REQ_F_NEED_CLEANUP for the request just after the
    successful getname() call, so that when the request is torn down, the
    filename will be cleaned up, along with other resources needing cleanup.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=00e61c43eb5e4740438f
    Tested-by: [email protected]
    Cc: [email protected]
    Signed-off-by: Prithvi Tambewagh <[email protected]>
    Fixes: b9445598d8c6 ("io_uring: openat directly into fixed fd table")
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

io_uring: fix min_wait wakeups for SQPOLL [+ + +]

Author: Jens Axboe <[email protected]>
Date:   Tue Dec 9 13:25:23 2025 -0700

    io_uring: fix min_wait wakeups for SQPOLL
    
    Commit e15cb2200b934e507273510ba6bc747d5cde24a3 upstream.
    
    Using min_wait, two timeouts are given:
    
    1) The min_wait timeout, within which up to 'wait_nr' events are
       waited for.
    2) The overall long timeout, which is entered if no events are generated
       in the min_wait window.
    
    If the min_wait has expired, any event being posted must wake the task.
    For SQPOLL, that isn't the case, as it won't trigger the io_has_work()
    condition, as it will have already processed the task_work that happened
    when an event was posted. This causes any event to trigger post the
    min_wait to not always cause the waiting application to wakeup, and
    instead it will wait until the overall timeout has expired. This can be
    shown in a test case that has a 1 second min_wait, with a 5 second
    overall wait, even if an event triggers after 1.5 seconds:
    
    axboe@m2max-kvm /d/iouring-mre (master)> zig-out/bin/iouring
    info: MIN_TIMEOUT supported: true, features: 0x3ffff
    info: Testing: min_wait=1000ms, timeout=5s, wait_nr=4
    info: 1 cqes in 5000.2ms
    
    where the expected result should be:
    
    axboe@m2max-kvm /d/iouring-mre (master)> zig-out/bin/iouring
    info: MIN_TIMEOUT supported: true, features: 0x3ffff
    info: Testing: min_wait=1000ms, timeout=5s, wait_nr=4
    info: 1 cqes in 1500.3ms
    
    When the min_wait timeout triggers, reset the number of completions
    needed to wake the task. This should ensure that any future events will
    wake the task, regardless of how many events it originally wanted to
    wait for.
    
    Reported-by: Tip ten Brink <[email protected]>
    Cc: [email protected]
    Fixes: 1100c4a2656d ("io_uring: add support for batch wait timeout")
    Link: https://github.com/axboe/liburing/issues/1477
    Signed-off-by: Jens Axboe <[email protected]>
    (cherry picked from commit e15cb2200b934e507273510ba6bc747d5cde24a3)
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iomap: account for unaligned end offsets when truncating read range [+ + +]

Author: Joanne Koong <[email protected]>
Date:   Tue Nov 11 11:36:51 2025 -0800

    iomap: account for unaligned end offsets when truncating read range
    
    [ Upstream commit 9d875e0eef8ec15b6b1da0cb9a0f8ed13efee89e ]
    
    The end position to start truncating from may be at an offset into a
    block, which under the current logic would result in overtruncation.
    
    Adjust the calculation to account for unaligned end offsets.
    
    Signed-off-by: Joanne Koong <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Signed-off-by: Christian Brauner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iomap: adjust read range correctly for non-block-aligned positions [+ + +]

Author: Joanne Koong <[email protected]>
Date:   Mon Sep 22 11:00:42 2025 -0700

    iomap: adjust read range correctly for non-block-aligned positions
    
    [ Upstream commit 7aa6bc3e8766990824f66ca76c19596ce10daf3e ]
    
    iomap_adjust_read_range() assumes that the position and length passed in
    are block-aligned. This is not always the case however, as shown in the
    syzbot generated case for erofs. This causes too many bytes to be
    skipped for uptodate blocks, which results in returning the incorrect
    position and length to read in. If all the blocks are uptodate, this
    underflows length and returns a position beyond the folio.
    
    Fix the calculation to also take into account the block offset when
    calculating how many bytes can be skipped for uptodate blocks.
    
    Signed-off-by: Joanne Koong <[email protected]>
    Tested-by: [email protected]
    Reviewed-by: Brian Foster <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Christian Brauner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iomap: allocate s_dio_done_wq for async reads as well [+ + +]

Author: Christoph Hellwig <[email protected]>
Date:   Mon Nov 24 15:00:13 2025 +0100

    iomap: allocate s_dio_done_wq for async reads as well
    
    commit 7fd8720dff2d9c70cf5a1a13b7513af01952ec02 upstream.
    
    Since commit 222f2c7c6d14 ("iomap: always run error completions in user
    context"), read error completions are deferred to s_dio_done_wq.  This
    means the workqueue also needs to be allocated for async reads.
    
    Fixes: 222f2c7c6d14 ("iomap: always run error completions in user context")
    Reported-by: [email protected]
    Signed-off-by: Christoph Hellwig <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Tested-by: [email protected]
    Reviewed-by: Dave Chinner <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Signed-off-by: Christian Brauner <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/amd: Fix pci_segment memleak in alloc_pci_segment() [+ + +]

Author: Jinhui Guo <[email protected]>
Date:   Tue Oct 28 00:50:17 2025 +0800

    iommu/amd: Fix pci_segment memleak in alloc_pci_segment()
    
    commit 75ba146c2674ba49ed8a222c67f9abfb4a4f2a4f upstream.
    
    Fix a memory leak of struct amd_iommu_pci_segment in alloc_pci_segment()
    when system memory (or contiguous memory) is insufficient.
    
    Fixes: 04230c119930 ("iommu/amd: Introduce per PCI segment device table")
    Fixes: eda797a27795 ("iommu/amd: Introduce per PCI segment rlookup table")
    Fixes: 99fc4ac3d297 ("iommu/amd: Introduce per PCI segment alias_table")
    Cc: [email protected]
    Signed-off-by: Jinhui Guo <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/amd: Propagate the error code returned by __modify_irte_ga() in modify_irte_ga() [+ + +]

Author: Jinhui Guo <[email protected]>
Date:   Thu Nov 20 23:47:25 2025 +0800

    iommu/amd: Propagate the error code returned by __modify_irte_ga() in modify_irte_ga()
    
    commit 2381a1b40be4b286062fb3cf67dd7f005692aa2a upstream.
    
    The return type of __modify_irte_ga() is int, but modify_irte_ga()
    treats it as a bool. Casting the int to bool discards the error code.
    
    To fix the issue, change the type of ret to int in modify_irte_ga().
    
    Fixes: 57cdb720eaa5 ("iommu/amd: Do not flush IRTE when only updating isRun and destination fields")
    Cc: [email protected]
    Signed-off-by: Jinhui Guo <[email protected]>
    Reviewed-by: Vasant Hegde <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/apple-dart: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:05 2025 +0200

    iommu/apple-dart: fix device leak on of_xlate()
    
    commit a6eaa872c52a181ae9a290fd4e40c9df91166d7a upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Fixes: 46d1fb072e76 ("iommu/dart: Add DART iommu driver")
    Cc: [email protected]      # 5.15
    Cc: Sven Peter <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/exynos: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:07 2025 +0200

    iommu/exynos: fix device leak on of_xlate()
    
    commit 05913cc43cb122f9afecdbe775115c058b906e1b upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Note that commit 1a26044954a6 ("iommu/exynos: add missing put_device()
    call in exynos_iommu_of_xlate()") fixed the leak in a couple of error
    paths, but the reference is still leaking on success.
    
    Fixes: aa759fd376fb ("iommu/exynos: Add callback for initializing devices from device tree")
    Cc: [email protected]      # 4.2: 1a26044954a6
    Cc: Yu Kuai <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Acked-by: Marek Szyprowski <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/ipmmu-vmsa: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:08 2025 +0200

    iommu/ipmmu-vmsa: fix device leak on of_xlate()
    
    commit 80aa518452c4aceb9459f9a8e3184db657d1b441 upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Fixes: 7b2d59611fef ("iommu/ipmmu-vmsa: Replace local utlb code with fwspec ids")
    Cc: [email protected]      # 4.14
    Cc: Magnus Damm <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/mediatek-v1: fix device leak on probe_device() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:12 2025 +0200

    iommu/mediatek-v1: fix device leak on probe_device()
    
    commit c77ad28bfee0df9cbc719eb5adc9864462cfb65b upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during probe_device().
    
    Fixes: b17336c55d89 ("iommu/mediatek: add support for mtk iommu generation one HW")
    Cc: [email protected]      # 4.8
    Cc: Honghui Zhang <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Reviewed-by: Yong Wu <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/mediatek-v1: fix device leaks on probe() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:13 2025 +0200

    iommu/mediatek-v1: fix device leaks on probe()
    
    commit 46207625c9f33da0e43bb4ae1e91f0791b6ed633 upstream.
    
    Make sure to drop the references taken to the larb devices during
    probe on probe failure (e.g. probe deferral) and on driver unbind.
    
    Fixes: b17336c55d89 ("iommu/mediatek: add support for mtk iommu generation one HW")
    Cc: [email protected]      # 4.8
    Cc: Honghui Zhang <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/mediatek: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:09 2025 +0200

    iommu/mediatek: fix device leak on of_xlate()
    
    commit b3f1ee18280363ef17f82b564fc379ceba9ec86f upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Fixes: 0df4fabe208d ("iommu/mediatek: Add mt8173 IOMMU driver")
    Cc: [email protected]      # 4.6
    Acked-by: Robin Murphy <[email protected]>
    Reviewed-by: Yong Wu <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/mediatek: fix use-after-free on probe deferral [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:10 2025 +0200

    iommu/mediatek: fix use-after-free on probe deferral
    
    commit de83d4617f9fe059623e97acf7e1e10d209625b5 upstream.
    
    The driver is dropping the references taken to the larb devices during
    probe after successful lookup as well as on errors. This can
    potentially lead to a use-after-free in case a larb device has not yet
    been bound to its driver so that the iommu driver probe defers.
    
    Fix this by keeping the references as expected while the iommu driver is
    bound.
    
    Fixes: 26593928564c ("iommu/mediatek: Add error path for loop of mm_dts_parse")
    Cc: [email protected]
    Cc: Yong Wu <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Yong Wu <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/omap: fix device leaks on probe_device() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:15 2025 +0200

    iommu/omap: fix device leaks on probe_device()
    
    commit b5870691065e6bbe6ba0650c0412636c6a239c5a upstream.
    
    Make sure to drop the references taken to the iommu platform devices
    when looking up their driver data during probe_device().
    
    Note that the arch data device pointer added by commit 604629bcb505
    ("iommu/omap: add support for late attachment of iommu devices") has
    never been used. Remove it to underline that the references are not
    needed.
    
    Fixes: 9d5018deec86 ("iommu/omap: Add support to program multiple iommus")
    Fixes: 7d6827748d54 ("iommu/omap: Fix iommu archdata name for DT-based devices")
    Cc: [email protected]      # 3.18
    Cc: Suman Anna <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/qcom: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:06 2025 +0200

    iommu/qcom: fix device leak on of_xlate()
    
    commit 6a3908ce56e6879920b44ef136252b2f0c954194 upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Note that commit e2eae09939a8 ("iommu/qcom: add missing put_device()
    call in qcom_iommu_of_xlate()") fixed the leak in a couple of error
    paths, but the reference is still leaking on success and late failures.
    
    Fixes: 0ae349a0f33f ("iommu/qcom: Add qcom_iommu")
    Cc: [email protected]      # 4.14: e2eae09939a8
    Cc: Rob Clark <[email protected]>
    Cc: Yu Kuai <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/sun50i: fix device leak on of_xlate() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:17 2025 +0200

    iommu/sun50i: fix device leak on of_xlate()
    
    commit f916109bf53864605d10bf6f4215afa023a80406 upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during of_xlate().
    
    Fixes: 4100b8c229b3 ("iommu: Add Allwinner H6 IOMMU driver")
    Cc: [email protected]      # 5.8
    Cc: Maxime Ripard <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu/tegra: fix device leak on probe_device() [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Mon Oct 20 06:53:18 2025 +0200

    iommu/tegra: fix device leak on probe_device()
    
    commit c08934a61201db8f1d1c66fcc63fb2eb526b656d upstream.
    
    Make sure to drop the reference taken to the iommu platform device when
    looking up its driver data during probe_device().
    
    Note that commit 9826e393e4a8 ("iommu/tegra-smmu: Fix missing
    put_device() call in tegra_smmu_find") fixed the leak in an error path,
    but the reference is still leaking on success.
    
    Fixes: 891846516317 ("memory: Add NVIDIA Tegra memory controller support")
    Cc: [email protected]      # 3.19: 9826e393e4a8
    Cc: Miaoqian Lin <[email protected]>
    Acked-by: Robin Murphy <[email protected]>
    Acked-by: Thierry Reding <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Joerg Roedel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommu: disable SVA when CONFIG_X86 is set [+ + +]

Author: Lu Baolu <[email protected]>
Date:   Wed Oct 22 16:26:27 2025 +0800

    iommu: disable SVA when CONFIG_X86 is set
    
    commit 72f98ef9a4be30d2a60136dd6faee376f780d06c upstream.
    
    Patch series "Fix stale IOTLB entries for kernel address space", v7.
    
    This proposes a fix for a security vulnerability related to IOMMU Shared
    Virtual Addressing (SVA).  In an SVA context, an IOMMU can cache kernel
    page table entries.  When a kernel page table page is freed and
    reallocated for another purpose, the IOMMU might still hold stale,
    incorrect entries.  This can be exploited to cause a use-after-free or
    write-after-free condition, potentially leading to privilege escalation or
    data corruption.
    
    This solution introduces a deferred freeing mechanism for kernel page
    table pages, which provides a safe window to notify the IOMMU to
    invalidate its caches before the page is reused.
    
    
    This patch (of 8):
    
    In the IOMMU Shared Virtual Addressing (SVA) context, the IOMMU hardware
    shares and walks the CPU's page tables.  The x86 architecture maps the
    kernel's virtual address space into the upper portion of every process's
    page table.  Consequently, in an SVA context, the IOMMU hardware can walk
    and cache kernel page table entries.
    
    The Linux kernel currently lacks a notification mechanism for kernel page
    table changes, specifically when page table pages are freed and reused.
    The IOMMU driver is only notified of changes to user virtual address
    mappings.  This can cause the IOMMU's internal caches to retain stale
    entries for kernel VA.
    
    Use-After-Free (UAF) and Write-After-Free (WAF) conditions arise when
    kernel page table pages are freed and later reallocated.  The IOMMU could
    misinterpret the new data as valid page table entries.  The IOMMU might
    then walk into attacker-controlled memory, leading to arbitrary physical
    memory DMA access or privilege escalation.  This is also a
    Write-After-Free issue, as the IOMMU will potentially continue to write
    Accessed and Dirty bits to the freed memory while attempting to walk the
    stale page tables.
    
    Currently, SVA contexts are unprivileged and cannot access kernel
    mappings.  However, the IOMMU will still walk kernel-only page tables all
    the way down to the leaf entries, where it realizes the mapping is for the
    kernel and errors out.  This means the IOMMU still caches these
    intermediate page table entries, making the described vulnerability a real
    concern.
    
    Disable SVA on x86 architecture until the IOMMU can receive notification
    to flush the paging cache before freeing the CPU kernel page table pages.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 26b25a2b98e4 ("iommu: Bind process address spaces to devices")
    Signed-off-by: Lu Baolu <[email protected]>
    Suggested-by: Jason Gunthorpe <[email protected]>
    Reviewed-by: Jason Gunthorpe <[email protected]>
    Cc: Alistair Popple <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: Borislav Betkov <[email protected]>
    Cc: Dave Hansen <[email protected]>
    Cc: David Hildenbrand <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: Jann Horn <[email protected]>
    Cc: Jean-Philippe Brucker <[email protected]>
    Cc: Joerg Roedel <[email protected]>
    Cc: Kevin Tian <[email protected]>
    Cc: Liam Howlett <[email protected]>
    Cc: Lorenzo Stoakes <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Michal Hocko <[email protected]>
    Cc: Mike Rapoport <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Robin Murohy <[email protected]>
    Cc: Thomas Gleinxer <[email protected]>
    Cc: "Uladzislau Rezki (Sony)" <[email protected]>
    Cc: Vasant Hegde <[email protected]>
    Cc: Vinicius Costa Gomes <[email protected]>
    Cc: Vlastimil Babka <[email protected]>
    Cc: Will Deacon <[email protected]>
    Cc: Yi Lai <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

iommufd/selftest: Add coverage for reporting max_pasid_log2 via IOMMU_HW_INFO [+ + +]

Author: Yi Liu <[email protected]>
Date:   Fri Mar 21 11:01:43 2025 -0700

    iommufd/selftest: Add coverage for reporting max_pasid_log2 via IOMMU_HW_INFO
    
    [ Upstream commit 6d9500bb1ff8c7f9c3ce199521c41aa41e8fd994 ]
    
    IOMMU_HW_INFO is extended to report max_pasid_log2, hence add coverage
    for it.
    
    Link: https://patch.msgid.link/r/[email protected]
    Reviewed-by: Nicolin Chen <[email protected]>
    Tested-by: Nicolin Chen <[email protected]>
    Signed-off-by: Yi Liu <[email protected]>
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Stable-dep-of: 5b244b077c0b ("iommufd/selftest: Make it clearer to gcc that the access is not out of bounds")
    Signed-off-by: Sasha Levin <[email protected]>

iommufd/selftest: Check for overflow in IOMMU_TEST_OP_ADD_RESERVED [+ + +]

Author: Jason Gunthorpe <[email protected]>
Date:   Tue Dec 16 11:53:40 2025 -0400

    iommufd/selftest: Check for overflow in IOMMU_TEST_OP_ADD_RESERVED
    
    [ Upstream commit e6a973af11135439de32ece3b9cbe3bfc043bea8 ]
    
    syzkaller found it could overflow math in the test infrastructure and
    cause a WARN_ON by corrupting the reserved interval tree. This only
    effects test kernels with CONFIG_IOMMUFD_TEST.
    
    Validate the user input length in the test ioctl.
    
    Fixes: f4b20bb34c83 ("iommufd: Add kernel support for testing iommufd")
    Link: https://patch.msgid.link/r/[email protected]
    Reviewed-by: Samiullah Khawaja <[email protected]>
    Reviewed-by: Kevin Tian <[email protected]>
    Tested-by: Yi Liu <[email protected]>
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iommufd/selftest: Make it clearer to gcc that the access is not out of bounds [+ + +]

Author: Jason Gunthorpe <[email protected]>
Date:   Fri Dec 5 14:56:12 2025 -0400

    iommufd/selftest: Make it clearer to gcc that the access is not out of bounds
    
    [ Upstream commit 5b244b077c0b0e76573fbb9542cf038e42368901 ]
    
    GCC gets a bit confused and reports:
    
       In function '_test_cmd_get_hw_info',
           inlined from 'iommufd_ioas_get_hw_info' at iommufd.c:779:3,
           inlined from 'wrapper_iommufd_ioas_get_hw_info' at iommufd.c:752:1:
    >> iommufd_utils.h:804:37: warning: array subscript 'struct iommu_test_hw_info[0]' is partly outside array bounds of 'struct iommu_test_hw_info_buffer_smaller[1]' [-Warray-bounds=]
         804 |                         assert(!info->flags);
             |                                 ~~~~^~~~~~~
       iommufd.c: In function 'wrapper_iommufd_ioas_get_hw_info':
       iommufd.c:761:11: note: object 'buffer_smaller' of size 4
         761 |         } buffer_smaller;
             |           ^~~~~~~~~~~~~~
    
    While it is true that "struct iommu_test_hw_info[0]" is partly out of
    bounds of the input pointer, it is not true that info->flags is out of
    bounds. Unclear why it warns on this.
    
    Reuse an existing properly sized stack buffer and pass a truncated length
    instead to test the same thing.
    
    Fixes: af4fde93c319 ("iommufd/selftest: Add coverage for IOMMU_GET_HW_INFO ioctl")
    Link: https://patch.msgid.link/r/[email protected]
    Reviewed-by: Kevin Tian <[email protected]>
    Reviewed-by: Nicolin Chen <[email protected]>
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iommufd/selftest: Update hw_info coverage for an input data_type [+ + +]

Author: Nicolin Chen <[email protected]>
Date:   Wed Jul 9 22:59:14 2025 -0700

    iommufd/selftest: Update hw_info coverage for an input data_type
    
    [ Upstream commit 3a35f7d4a4673edf6f02422bb2d78b17c667e167 ]
    
    Test both IOMMU_HW_INFO_TYPE_DEFAULT and IOMMU_HW_INFO_TYPE_SELFTEST, and
    add a negative test for an unsupported type.
    
    Also drop the unused mask in test_cmd_get_hw_capabilities() as checkpatch
    is complaining.
    
    Link: https://patch.msgid.link/r/f01a1e50cd7366f217cbf192ad0b2b79e0eb89f0.1752126748.git.nicolinc@nvidia.com
    Signed-off-by: Nicolin Chen <[email protected]>
    Reviewed-by: Pranjal Shrivastava <[email protected]>
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Stable-dep-of: 5b244b077c0b ("iommufd/selftest: Make it clearer to gcc that the access is not out of bounds")
    Signed-off-by: Sasha Levin <[email protected]>

ip6_gre: make ip6gre_header() robust [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Thu Dec 11 17:35:50 2025 +0000

    ip6_gre: make ip6gre_header() robust
    
    [ Upstream commit db5b4e39c4e63700c68a7e65fc4e1f1375273476 ]
    
    Over the years, syzbot found many ways to crash the kernel
    in ip6gre_header() [1].
    
    This involves team or bonding drivers ability to dynamically
    change their dev->needed_headroom and/or dev->hard_header_len
    
    In this particular crash mld_newpack() allocated an skb
    with a too small reserve/headroom, and by the time mld_sendpack()
    was called, syzbot managed to attach an ip6gre device.
    
    [1]
    skbuff: skb_under_panic: text:ffffffff8a1d69a8 len:136 put:40 head:ffff888059bc7000 data:ffff888059bc6fe8 tail:0x70 end:0x6c0 dev:team0
    ------------[ cut here ]------------
     kernel BUG at net/core/skbuff.c:213 !
     <TASK>
      skb_under_panic net/core/skbuff.c:223 [inline]
      skb_push+0xc3/0xe0 net/core/skbuff.c:2641
      ip6gre_header+0xc8/0x790 net/ipv6/ip6_gre.c:1371
      dev_hard_header include/linux/netdevice.h:3436 [inline]
      neigh_connected_output+0x286/0x460 net/core/neighbour.c:1618
      neigh_output include/net/neighbour.h:556 [inline]
      ip6_finish_output2+0xfb3/0x1480 net/ipv6/ip6_output.c:136
     __ip6_finish_output net/ipv6/ip6_output.c:-1 [inline]
      ip6_finish_output+0x234/0x7d0 net/ipv6/ip6_output.c:220
      NF_HOOK_COND include/linux/netfilter.h:307 [inline]
      ip6_output+0x340/0x550 net/ipv6/ip6_output.c:247
      NF_HOOK+0x9e/0x380 include/linux/netfilter.h:318
      mld_sendpack+0x8d4/0xe60 net/ipv6/mcast.c:1855
      mld_send_cr net/ipv6/mcast.c:2154 [inline]
      mld_ifc_work+0x83e/0xd60 net/ipv6/mcast.c:2693
    
    Fixes: c12b395a4664 ("gre: Support GRE over IPv6")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/netdev/[email protected]/T/#u
    Signed-off-by: Eric Dumazet <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipmi: Fix __scan_channels() failing to rescan channels [+ + +]

Author: Jinhui Guo <[email protected]>
Date:   Tue Sep 30 15:42:38 2025 +0800

    ipmi: Fix __scan_channels() failing to rescan channels
    
    [ Upstream commit 6bd30d8fc523fb880b4be548e8501bc0fe8f42d4 ]
    
    channel_handler() sets intf->channels_ready to true but never
    clears it, so __scan_channels() skips any rescan. When the BMC
    firmware changes a rescan is required. Allow it by clearing
    the flag before starting a new scan.
    
    Signed-off-by: Jinhui Guo <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Corey Minyard <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipmi: Fix the race between __scan_channels() and deliver_response() [+ + +]

Author: Jinhui Guo <[email protected]>
Date:   Tue Sep 30 15:42:37 2025 +0800

    ipmi: Fix the race between __scan_channels() and deliver_response()
    
    [ Upstream commit 936750fdba4c45e13bbd17f261bb140dd55f5e93 ]
    
    The race window between __scan_channels() and deliver_response() causes
    the parameters of some channels to be set to 0.
    
    1.[CPUA] __scan_channels() issues an IPMI request and waits with
             wait_event() until all channels have been scanned.
             wait_event() internally calls might_sleep(), which might
             yield the CPU. (Moreover, an interrupt can preempt
             wait_event() and force the task to yield the CPU.)
    2.[CPUB] deliver_response() is invoked when the CPU receives the
             IPMI response. After processing a IPMI response,
             deliver_response() directly assigns intf->wchannels to
             intf->channel_list and sets intf->channels_ready to true.
             However, not all channels are actually ready for use.
    3.[CPUA] Since intf->channels_ready is already true, wait_event()
             never enters __wait_event(). __scan_channels() immediately
             clears intf->null_user_handler and exits.
    4.[CPUB] Once intf->null_user_handler is set to NULL, deliver_response()
             ignores further IPMI responses, leaving the remaining
             channels zero-initialized and unusable.
    
    CPUA                             CPUB
    -------------------------------  -----------------------------
    __scan_channels()
     intf->null_user_handler
           = channel_handler;
     send_channel_info_cmd(intf,
           0);
     wait_event(intf->waitq,
           intf->channels_ready);
      do {
       might_sleep();
                                     deliver_response()
                                      channel_handler()
                                       intf->channel_list =
                                             intf->wchannels + set;
                                       intf->channels_ready = true;
                                       send_channel_info_cmd(intf,
                                             intf->curr_channel);
       if (condition)
        break;
       __wait_event(wq_head,
              condition);
      } while(0)
     intf->null_user_handler
           = NULL;
                                     deliver_response()
                                      if (!msg->user)
                                       if (intf->null_user_handler)
                                        rv = -EINVAL;
                                      return rv;
    -------------------------------  -----------------------------
    
    Fix the race between __scan_channels() and deliver_response() by
    deferring both the assignment intf->channel_list = intf->wchannels
    and the flag intf->channels_ready = true until all channels have
    been successfully scanned or until the IPMI request has failed.
    
    Signed-off-by: Jinhui Guo <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Corey Minyard <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv4: Fix reference count leak when using error routes with nexthop objects [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Sun Dec 21 16:48:28 2025 +0200

    ipv4: Fix reference count leak when using error routes with nexthop objects
    
    [ Upstream commit ac782f4e3bfcde145b8a7f8af31d9422d94d172a ]
    
    When a nexthop object is deleted, it is marked as dead and then
    fib_table_flush() is called to flush all the routes that are using the
    dead nexthop.
    
    The current logic in fib_table_flush() is to only flush error routes
    (e.g., blackhole) when it is called as part of network namespace
    dismantle (i.e., with flush_all=true). Therefore, error routes are not
    flushed when their nexthop object is deleted:
    
     # ip link add name dummy1 up type dummy
     # ip nexthop add id 1 dev dummy1
     # ip route add 198.51.100.1/32 nhid 1
     # ip route add blackhole 198.51.100.2/32 nhid 1
     # ip nexthop del id 1
     # ip route show
     blackhole 198.51.100.2 nhid 1 dev dummy1
    
    As such, they keep holding a reference on the nexthop object which in
    turn holds a reference on the nexthop device, resulting in a reference
    count leak:
    
     # ip link del dev dummy1
     [   70.516258] unregister_netdevice: waiting for dummy1 to become free. Usage count = 2
    
    Fix by flushing error routes when their nexthop is marked as dead.
    
    IPv6 does not suffer from this problem.
    
    Fixes: 493ced1ac47c ("ipv4: Allow routes to use nexthop objects")
    Reported-by: Tetsuo Handa <[email protected]>
    Closes: https://lore.kernel.org/netdev/[email protected]/
    Reported-by: [email protected]
    Signed-off-by: Ido Schimmel <[email protected]>
    Reviewed-by: David Ahern <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv6: adopt dst_dev() helper [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Fri Jan 2 12:37:25 2026 -0800

    ipv6: adopt dst_dev() helper
    
    [ Upstream commit 1caf27297215a5241f9bfc9c07336349d9034ee3 ]
    
    Use the new helper as a step to deal with potential dst->dev races.
    
    Signed-off-by: Eric Dumazet <[email protected]>
    Reviewed-by: Kuniyuki Iwashima <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [Harshit: Backport to 6.12.y, pulled this is a prerequisite]
    Stable-dep-of: 99a2ace61b21 ("net: use dst_dev_rcu() in sk_setup_caps()")
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ipv6: BUG() in pskb_expand_head() as part of calipso_skbuff_setattr() [+ + +]

Author: Will Rosenberg <[email protected]>
Date:   Fri Dec 19 10:36:37 2025 -0700

    ipv6: BUG() in pskb_expand_head() as part of calipso_skbuff_setattr()
    
    [ Upstream commit 58fc7342b529803d3c221101102fe913df7adb83 ]
    
    There exists a kernel oops caused by a BUG_ON(nhead < 0) at
    net/core/skbuff.c:2232 in pskb_expand_head().
    This bug is triggered as part of the calipso_skbuff_setattr()
    routine when skb_cow() is passed headroom > INT_MAX
    (i.e. (int)(skb_headroom(skb) + len_delta) < 0).
    
    The root cause of the bug is due to an implicit integer cast in
    __skb_cow(). The check (headroom > skb_headroom(skb)) is meant to ensure
    that delta = headroom - skb_headroom(skb) is never negative, otherwise
    we will trigger a BUG_ON in pskb_expand_head(). However, if
    headroom > INT_MAX and delta <= -NET_SKB_PAD, the check passes, delta
    becomes negative, and pskb_expand_head() is passed a negative value for
    nhead.
    
    Fix the trigger condition in calipso_skbuff_setattr(). Avoid passing
    "negative" headroom sizes to skb_cow() within calipso_skbuff_setattr()
    by only using skb_cow() to grow headroom.
    
    PoC:
            Using `netlabelctl` tool:
    
            netlabelctl map del default
            netlabelctl calipso add pass doi:7
            netlabelctl map add default address:0::1/128 protocol:calipso,7
    
            Then run the following PoC:
    
            int fd = socket(AF_INET6, SOCK_DGRAM, IPPROTO_UDP);
    
            // setup msghdr
            int cmsg_size = 2;
            int cmsg_len = 0x60;
            struct msghdr msg;
            struct sockaddr_in6 dest_addr;
            struct cmsghdr * cmsg = (struct cmsghdr *) calloc(1,
                            sizeof(struct cmsghdr) + cmsg_len);
            msg.msg_name = &dest_addr;
            msg.msg_namelen = sizeof(dest_addr);
            msg.msg_iov = NULL;
            msg.msg_iovlen = 0;
            msg.msg_control = cmsg;
            msg.msg_controllen = cmsg_len;
            msg.msg_flags = 0;
    
            // setup sockaddr
            dest_addr.sin6_family = AF_INET6;
            dest_addr.sin6_port = htons(31337);
            dest_addr.sin6_flowinfo = htonl(31337);
            dest_addr.sin6_addr = in6addr_loopback;
            dest_addr.sin6_scope_id = 31337;
    
            // setup cmsghdr
            cmsg->cmsg_len = cmsg_len;
            cmsg->cmsg_level = IPPROTO_IPV6;
            cmsg->cmsg_type = IPV6_HOPOPTS;
            char * hop_hdr = (char *)cmsg + sizeof(struct cmsghdr);
            hop_hdr[1] = 0x9; //set hop size - (0x9 + 1) * 8 = 80
    
            sendmsg(fd, &msg, 0);
    
    Fixes: 2917f57b6bc1 ("calipso: Allow the lsm to label the skbuff directly.")
    Suggested-by: Paul Moore <[email protected]>
    Signed-off-by: Will Rosenberg <[email protected]>
    Acked-by: Paul Moore <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv6: fix a BUG in rt6_get_pcpu_route() under PREEMPT_RT [+ + +]

Author: Jiayuan Chen <[email protected]>
Date:   Tue Dec 23 13:14:12 2025 +0800

    ipv6: fix a BUG in rt6_get_pcpu_route() under PREEMPT_RT
    
    [ Upstream commit 1adaea51c61b52e24e7ab38f7d3eba023b2d050d ]
    
    On PREEMPT_RT kernels, after rt6_get_pcpu_route() returns NULL, the
    current task can be preempted. Another task running on the same CPU
    may then execute rt6_make_pcpu_route() and successfully install a
    pcpu_rt entry. When the first task resumes execution, its cmpxchg()
    in rt6_make_pcpu_route() will fail because rt6i_pcpu is no longer
    NULL, triggering the BUG_ON(prev). It's easy to reproduce it by adding
    mdelay() after rt6_get_pcpu_route().
    
    Using preempt_disable/enable is not appropriate here because
    ip6_rt_pcpu_alloc() may sleep.
    
    Fix this by handling the cmpxchg() failure gracefully on PREEMPT_RT:
    free our allocation and return the existing pcpu_rt installed by
    another task. The BUG_ON is replaced by WARN_ON_ONCE for non-PREEMPT_RT
    kernels where such races should not occur.
    
    Link: https://syzkaller.appspot.com/bug?extid=9b35e9bc0951140d13e6
    Fixes: d2d6422f8bd1 ("x86: Allow to enable PREEMPT_RT.")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/T/
    Signed-off-by: Jiayuan Chen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipvlan: Ignore PACKET_LOOPBACK in handle_mode_l2() [+ + +]

Author: Dmitry Skorodumov <[email protected]>
Date:   Tue Dec 2 13:39:03 2025 +0300

    ipvlan: Ignore PACKET_LOOPBACK in handle_mode_l2()
    
    [ Upstream commit 0c57ff008a11f24f7f05fa760222692a00465fec ]
    
    Packets with pkt_type == PACKET_LOOPBACK are captured by
    handle_frame() function, but they don't have L2 header.
    We should not process them in handle_mode_l2().
    
    This doesn't affect old L2 functionality, since handling
    was anyway incorrect.
    
    Handle them the same way as in br_handle_frame():
    just pass the skb.
    
    To observe invalid behaviour, just start "ping -b" on bcast address
    of port-interface.
    
    Fixes: 2ad7bf363841 ("ipvlan: Initial check-in of the IPVLAN driver.")
    Signed-off-by: Dmitry Skorodumov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipvs: fix ipv4 null-ptr-deref in route error path [+ + +]

Author: Slavin Liu <[email protected]>
Date:   Fri Nov 21 16:52:13 2025 +0800

    ipvs: fix ipv4 null-ptr-deref in route error path
    
    [ Upstream commit ad891bb3d079a46a821bf2b8867854645191bab0 ]
    
    The IPv4 code path in __ip_vs_get_out_rt() calls dst_link_failure()
    without ensuring skb->dev is set, leading to a NULL pointer dereference
    in fib_compute_spec_dst() when ipv4_link_failure() attempts to send
    ICMP destination unreachable messages.
    
    The issue emerged after commit ed0de45a1008 ("ipv4: recompile ip options
    in ipv4_link_failure") started calling __ip_options_compile() from
    ipv4_link_failure(). This code path eventually calls fib_compute_spec_dst()
    which dereferences skb->dev. An attempt was made to fix the NULL skb->dev
    dereference in commit 0113d9c9d1cc ("ipv4: fix null-deref in
    ipv4_link_failure"), but it only addressed the immediate dev_net(skb->dev)
    dereference by using a fallback device. The fix was incomplete because
    fib_compute_spec_dst() later in the call chain still accesses skb->dev
    directly, which remains NULL when IPVS calls dst_link_failure().
    
    The crash occurs when:
    1. IPVS processes a packet in NAT mode with a misconfigured destination
    2. Route lookup fails in __ip_vs_get_out_rt() before establishing a route
    3. The error path calls dst_link_failure(skb) with skb->dev == NULL
    4. ipv4_link_failure() → ipv4_send_dest_unreach() →
       __ip_options_compile() → fib_compute_spec_dst()
    5. fib_compute_spec_dst() dereferences NULL skb->dev
    
    Apply the same fix used for IPv6 in commit 326bf17ea5d4 ("ipvs: fix
    ipv6 route unreach panic"): set skb->dev from skb_dst(skb)->dev before
    calling dst_link_failure().
    
    KASAN: null-ptr-deref in range [0x0000000000000328-0x000000000000032f]
    CPU: 1 PID: 12732 Comm: syz.1.3469 Not tainted 6.6.114 #2
    RIP: 0010:__in_dev_get_rcu include/linux/inetdevice.h:233
    RIP: 0010:fib_compute_spec_dst+0x17a/0x9f0 net/ipv4/fib_frontend.c:285
    Call Trace:
      <TASK>
      spec_dst_fill net/ipv4/ip_options.c:232
      spec_dst_fill net/ipv4/ip_options.c:229
      __ip_options_compile+0x13a1/0x17d0 net/ipv4/ip_options.c:330
      ipv4_send_dest_unreach net/ipv4/route.c:1252
      ipv4_link_failure+0x702/0xb80 net/ipv4/route.c:1265
      dst_link_failure include/net/dst.h:437
      __ip_vs_get_out_rt+0x15fd/0x19e0 net/netfilter/ipvs/ip_vs_xmit.c:412
      ip_vs_nat_xmit+0x1d8/0xc80 net/netfilter/ipvs/ip_vs_xmit.c:764
    
    Fixes: ed0de45a1008 ("ipv4: recompile ip options in ipv4_link_failure")
    Signed-off-by: Slavin Liu <[email protected]>
    Acked-by: Julian Anastasov <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

jbd2: fix the inconsistency between checksum and data in memory for journal sb [+ + +]

Author: Ye Bin <[email protected]>
Date:   Mon Dec 29 18:34:27 2025 -0500

    jbd2: fix the inconsistency between checksum and data in memory for journal sb
    
    [ Upstream commit 6abfe107894af7e8ce3a2e120c619d81ee764ad5 ]
    
    Copying the file system while it is mounted as read-only results in
    a mount failure:
    [~]# mkfs.ext4 -F /dev/sdc
    [~]# mount /dev/sdc -o ro /mnt/test
    [~]# dd if=/dev/sdc of=/dev/sda bs=1M
    [~]# mount /dev/sda /mnt/test1
    [ 1094.849826] JBD2: journal checksum error
    [ 1094.850927] EXT4-fs (sda): Could not load journal inode
    mount: mount /dev/sda on /mnt/test1 failed: Bad message
    
    The process described above is just an abstracted way I came up with to
    reproduce the issue. In the actual scenario, the file system was mounted
    read-only and then copied while it was still mounted. It was found that
    the mount operation failed. The user intended to verify the data or use
    it as a backup, and this action was performed during a version upgrade.
    Above issue may happen as follows:
    ext4_fill_super
     set_journal_csum_feature_set(sb)
      if (ext4_has_metadata_csum(sb))
       incompat = JBD2_FEATURE_INCOMPAT_CSUM_V3;
      if (test_opt(sb, JOURNAL_CHECKSUM)
       jbd2_journal_set_features(sbi->s_journal, compat, 0, incompat);
        lock_buffer(journal->j_sb_buffer);
        sb->s_feature_incompat  |= cpu_to_be32(incompat);
        //The data in the journal sb was modified, but the checksum was not
          updated, so the data remaining in memory has a mismatch between the
          data and the checksum.
        unlock_buffer(journal->j_sb_buffer);
    
    In this case, the journal sb copied over is in a state where the checksum
    and data are inconsistent, so mounting fails.
    To solve the above issue, update the checksum in memory after modifying
    the journal sb.
    
    Fixes: 4fd5ea43bc11 ("jbd2: checksum journal superblock")
    Signed-off-by: Ye Bin <[email protected]>
    Reviewed-by: Baokun Li <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Cc: [email protected]
    [ jbd2_superblock_csum() also takes a journal param ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

jbd2: use a per-journal lock_class_key for jbd2_trans_commit_key [+ + +]

Author: Tetsuo Handa <[email protected]>
Date:   Wed Oct 22 20:11:37 2025 +0900

    jbd2: use a per-journal lock_class_key for jbd2_trans_commit_key
    
    commit 524c3853831cf4f7e1db579e487c757c3065165c upstream.
    
    syzbot is reporting possibility of deadlock due to sharing lock_class_key
    for jbd2_handle across ext4 and ocfs2. But this is a false positive, for
    one disk partition can't have two filesystems at the same time.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=6e493c165d26d6fcbf72
    Signed-off-by: Tetsuo Handa <[email protected]>
    Tested-by: [email protected]
    Reviewed-by: Jan Kara <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

jbd2: use a weaker annotation in journal handling [+ + +]

Author: Byungchul Park <[email protected]>
Date:   Fri Oct 24 16:39:40 2025 +0900

    jbd2: use a weaker annotation in journal handling
    
    commit 40a71b53d5a6d4ea17e4d54b99b2ac03a7f5e783 upstream.
    
    jbd2 journal handling code doesn't want jbd2_might_wait_for_commit()
    to be placed between start_this_handle() and stop_this_handle().  So it
    marks the region with rwsem_acquire_read() and rwsem_release().
    
    However, the annotation is too strong for that purpose.  We don't have
    to use more than try lock annotation for that.
    
    rwsem_acquire_read() implies:
    
       1. might be a waiter on contention of the lock.
       2. enter to the critical section of the lock.
    
    All we need in here is to act 2, not 1.  So trylock version of
    annotation is sufficient for that purpose.  Now that dept partially
    relies on lockdep annotaions, dept interpets rwsem_acquire_read() as a
    potential wait and might report a deadlock by the wait.
    
    Replace it with trylock version of annotation.
    
    Signed-off-by: Byungchul Park <[email protected]>
    Reviewed-by: Jan Kara <[email protected]>
    Cc: [email protected]
    Message-ID: <[email protected]>
    Signed-off-by: Theodore Ts'o <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kallsyms: Fix wrong "big" kernel symbol type read from procfs [+ + +]

Author: Zheng Yejian <[email protected]>
Date:   Fri Oct 11 22:38:53 2024 +0800

    kallsyms: Fix wrong "big" kernel symbol type read from procfs
    
    commit f3f9f42232dee596d15491ca3f611d02174db49c upstream.
    
    Currently when the length of a symbol is longer than 0x7f characters,
    its type shown in /proc/kallsyms can be incorrect.
    
    I found this issue when reading the code, but it can be reproduced by
    following steps:
    
      1. Define a function which symbol length is 130 characters:
    
        #define X13(x) x##x##x##x##x##x##x##x##x##x##x##x##x
        static noinline void X13(x123456789)(void)
        {
            printk("hello world\n");
        }
    
      2. The type in vmlinux is 't':
    
        $ nm vmlinux | grep x123456
        ffffffff816290f0 t x123456789x123456789x123456789x12[...]
    
      3. Then boot the kernel, the type shown in /proc/kallsyms becomes 'g'
         instead of the expected 't':
    
        # cat /proc/kallsyms | grep x123456
        ffffffff816290f0 g x123456789x123456789x123456789x12[...]
    
    The root cause is that, after commit 73bbb94466fd ("kallsyms: support
    "big" kernel symbols"), ULEB128 was used to encode symbol name length.
    That is, for "big" kernel symbols of which name length is longer than
    0x7f characters, the length info is encoded into 2 bytes.
    
    kallsyms_get_symbol_type() expects to read the first char of the
    symbol name which indicates the symbol type. However, due to the
    "big" symbol case not being handled, the symbol type read from
    /proc/kallsyms may be wrong, so handle it properly.
    
    Cc: [email protected]
    Fixes: 73bbb94466fd ("kallsyms: support "big" kernel symbols")
    Signed-off-by: Zheng Yejian <[email protected]>
    Acked-by: Gary Guo <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Miguel Ojeda <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kasan: refactor pcpu kasan vmalloc unpoison [+ + +]

Author: Maciej Wieczor-Retman <[email protected]>
Date:   Thu Dec 4 19:00:04 2025 +0000

    kasan: refactor pcpu kasan vmalloc unpoison
    
    commit 6f13db031e27e88213381039032a9cc061578ea6 upstream.
    
    A KASAN tag mismatch, possibly causing a kernel panic, can be observed
    on systems with a tag-based KASAN enabled and with multiple NUMA nodes.
    It was reported on arm64 and reproduced on x86. It can be explained in
    the following points:
    
    1. There can be more than one virtual memory chunk.
    2. Chunk's base address has a tag.
    3. The base address points at the first chunk and thus inherits
       the tag of the first chunk.
    4. The subsequent chunks will be accessed with the tag from the
       first chunk.
    5. Thus, the subsequent chunks need to have their tag set to
       match that of the first chunk.
    
    Refactor code by reusing __kasan_unpoison_vmalloc in a new helper in
    preparation for the actual fix.
    
    Link: https://lkml.kernel.org/r/eb61d93b907e262eefcaa130261a08bcb6c5ce51.1764874575.git.m.wieczorretman@pm.me
    Fixes: 1d96320f8d53 ("kasan, vmalloc: add vmalloc tagging for SW_TAGS")
    Signed-off-by: Maciej Wieczor-Retman <[email protected]>
    Reviewed-by: Andrey Konovalov <[email protected]>
    Cc: Alexander Potapenko <[email protected]>
    Cc: Andrey Ryabinin <[email protected]>
    Cc: Danilo Krummrich <[email protected]>
    Cc: Dmitriy Vyukov <[email protected]>
    Cc: Jiayuan Chen <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: "Uladzislau Rezki (Sony)" <[email protected]>
    Cc: Vincenzo Frascino <[email protected]>
    Cc: <[email protected]>    [6.1+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kasan: unpoison vms[area] addresses with a common tag [+ + +]

Author: Maciej Wieczor-Retman <[email protected]>
Date:   Thu Dec 4 19:00:11 2025 +0000

    kasan: unpoison vms[area] addresses with a common tag
    
    commit 6a0e5b333842cf65d6f4e4f0a2a4386504802515 upstream.
    
    A KASAN tag mismatch, possibly causing a kernel panic, can be observed on
    systems with a tag-based KASAN enabled and with multiple NUMA nodes.  It
    was reported on arm64 and reproduced on x86.  It can be explained in the
    following points:
    
    1. There can be more than one virtual memory chunk.
    2. Chunk's base address has a tag.
    3. The base address points at the first chunk and thus inherits
       the tag of the first chunk.
    4. The subsequent chunks will be accessed with the tag from the
       first chunk.
    5. Thus, the subsequent chunks need to have their tag set to
       match that of the first chunk.
    
    Use the new vmalloc flag that disables random tag assignment in
    __kasan_unpoison_vmalloc() - pass the same random tag to all the
    vm_structs by tagging the pointers before they go inside
    __kasan_unpoison_vmalloc().  Assigning a common tag resolves the pcpu
    chunk address mismatch.
    
    [[email protected]: use WARN_ON_ONCE(), per Andrey]
      Link: https://lkml.kernel.org/r/CA+fCnZeuGdKSEm11oGT6FS71_vGq1vjq-xY36kxVdFvwmag2ZQ@mail.gmail.com
    [[email protected]: remove unneeded pr_warn()]
      Link: https://lkml.kernel.org/r/919897daaaa3c982a27762a2ee038769ad033991.1764945396.git.m.wieczorretman@pm.me
    Link: https://lkml.kernel.org/r/873821114a9f722ffb5d6702b94782e902883fdf.1764874575.git.m.wieczorretman@pm.me
    Fixes: 1d96320f8d53 ("kasan, vmalloc: add vmalloc tagging for SW_TAGS")
    Signed-off-by: Maciej Wieczor-Retman <[email protected]>
    Reviewed-by: Andrey Konovalov <[email protected]>
    Cc: Alexander Potapenko <[email protected]>
    Cc: Andrey Ryabinin <[email protected]>
    Cc: Danilo Krummrich <[email protected]>
    Cc: Dmitriy Vyukov <[email protected]>
    Cc: Jiayuan Chen <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: "Uladzislau Rezki (Sony)" <[email protected]>
    Cc: Vincenzo Frascino <[email protected]>
    Cc: <[email protected]>    [6.1+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

kbuild: fix compilation of dtb specified on command-line without make rule [+ + +]

Author: Thomas De Schampheleire <[email protected]>
Date:   Wed Nov 26 11:00:16 2025 +0100

    kbuild: fix compilation of dtb specified on command-line without make rule
    
    [ Upstream commit b08fc4d0ec2466558f6d5511434efdfabbddf2a6 ]
    
    Since commit e7e2941300d2 ("kbuild: split device tree build rules into
    scripts/Makefile.dtbs"), it is no longer possible to compile a device tree
    blob that is not specified in a make rule
    like:
        dtb-$(CONFIG_FOO) += foo.dtb
    
    Before the mentioned commit, one could copy a dts file to e.g.
    arch/arm64/boot/dts/ (or a new subdirectory) and then convert it to a dtb
    file using:
        make ARCH=arm64 foo.dtb
    
    In this scenario, both 'dtb-y' and 'dtb-' are empty, and the inclusion of
    scripts/Makefile.dtbs relies on 'targets' to contain the MAKECMDGOALS. The
    value of 'targets', however, is only final later in the code.
    
    Move the conditional include of scripts/Makefile.dtbs down to where the
    value of 'targets' is final. Since Makefile.dtbs updates 'always-y' which is
    used as a prerequisite in the build rule, the build rule also needs to move
    down.
    
    Fixes: e7e2941300d2 ("kbuild: split device tree build rules into scripts/Makefile.dtbs")
    Signed-off-by: Thomas De Schampheleire <[email protected]>
    Reviewed-by: Nathan Chancellor <[email protected]>
    Tested-by: Nathan Chancellor <[email protected]>
    Acked-by: Rob Herring (Arm) <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Nicolas Schier <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

kbuild: Use objtree for module signing key path [+ + +]

Author: Mikhail Malyshev <[email protected]>
Date:   Wed Oct 15 16:34:52 2025 +0000

    kbuild: Use objtree for module signing key path
    
    [ Upstream commit af61da281f52aba0c5b090bafb3a31c5739850ff ]
    
    When building out-of-tree modules with CONFIG_MODULE_SIG_FORCE=y,
    module signing fails because the private key path uses $(srctree)
    while the public key path uses $(objtree). Since signing keys are
    generated in the build directory during kernel compilation, both
    paths should use $(objtree) for consistency.
    
    This causes SSL errors like:
      SSL error:02001002:system library:fopen:No such file or directory
      sign-file: /kernel-src/certs/signing_key.pem
    
    The issue occurs because:
    - sig-key uses: $(srctree)/certs/signing_key.pem (source tree)
    - cmd_sign uses: $(objtree)/certs/signing_key.x509 (build tree)
    
    But both keys are generated in $(objtree) during the build.
    
    This complements commit 25ff08aa43e37 ("kbuild: Fix signing issue for
    external modules") which fixed the scripts path and public key path,
    but missed the private key path inconsistency.
    
    Fixes out-of-tree module signing for configurations with separate
    source and build directories (e.g., O=/kernel-out).
    
    Signed-off-by: Mikhail Malyshev <[email protected]>
    Reviewed-by: Nathan Chancellor <[email protected]>
    Tested-by: Nicolas Schier <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Nicolas Schier <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

KEYS: trusted: Fix a memory leak in tpm2_load_cmd [+ + +]

Author: Jarkko Sakkinen <[email protected]>
Date:   Sat Oct 18 13:30:36 2025 +0300

    KEYS: trusted: Fix a memory leak in tpm2_load_cmd
    
    commit 62cd5d480b9762ce70d720a81fa5b373052ae05f upstream.
    
    'tpm2_load_cmd' allocates a tempoary blob indirectly via 'tpm2_key_decode'
    but it is not freed in the failure paths. Address this by wrapping the blob
    into with a cleanup helper.
    
    Cc: [email protected] # v5.13+
    Fixes: f2219745250f ("security: keys: trusted: use ASN.1 TPM2 key format for the blobs")
    Signed-off-by: Jarkko Sakkinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix buffer validation by including null terminator size in EA length [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Sun Dec 14 15:06:34 2025 +0900

    ksmbd: fix buffer validation by including null terminator size in EA length
    
    commit 95d7a890e4b03e198836d49d699408fd1867cb55 upstream.
    
    The smb2_set_ea function, which handles Extended Attributes (EA),
    was performing buffer validation checks that incorrectly omitted the size
    of the null terminating character (+1 byte) for EA Name.
    This patch fixes the issue by explicitly adding '+ 1' to EaNameLength where
    the null terminator is expected to be present in the buffer, ensuring
    the validation accurately reflects the total required buffer size.
    
    Cc: [email protected]
    Reported-by: Roger <[email protected]>
    Reported-by: Stanislas Polu <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: Fix memory leak in get_file_all_info() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Wed Dec 24 14:20:16 2025 +0000

    ksmbd: Fix memory leak in get_file_all_info()
    
    [ Upstream commit 0c56693b06a68476ba113db6347e7897475f9e4c ]
    
    In get_file_all_info(), if vfs_getattr() fails, the function returns
    immediately without freeing the allocated filename, leading to a memory
    leak.
    
    Fix this by freeing the filename before returning in this error case.
    
    Fixes: 5614c8c487f6a ("ksmbd: replace generic_fillattr with vfs_getattr")
    Signed-off-by: Zilin Guan <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: Fix refcount leak when invalid session is found on session lookup [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Sun Dec 14 15:05:56 2025 +0900

    ksmbd: Fix refcount leak when invalid session is found on session lookup
    
    commit cafb57f7bdd57abba87725eb4e82bbdca4959644 upstream.
    
    When a session is found but its state is not SMB2_SESSION_VALID, It
    indicates that no valid session was found, but it is missing to decrement
    the reference count acquired by the session lookup, which results in
    a reference count leak. This patch fixes the issue by explicitly calling
    ksmbd_user_session_put to release the reference to the session.
    
    Cc: [email protected]
    Reported-by: Alexandre <[email protected]>
    Reported-by: Stanislas Polu <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix use-after-free in ksmbd_tree_connect_put under concurrency [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Tue Nov 18 09:05:46 2025 +0900

    ksmbd: fix use-after-free in ksmbd_tree_connect_put under concurrency
    
    [ Upstream commit b39a1833cc4a2755b02603eec3a71a85e9dff926 ]
    
    Under high concurrency, A tree-connection object (tcon) is freed on
    a disconnect path while another path still holds a reference and later
    executes *_put()/write on it.
    
    Reported-by: Qianchang Zhao <[email protected]>
    Reported-by: Zhitong Liu <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: skip lock-range check on equal size to avoid size==0 underflow [+ + +]

Author: Qianchang Zhao <[email protected]>
Date:   Sun Nov 9 10:00:55 2025 +0900

    ksmbd: skip lock-range check on equal size to avoid size==0 underflow
    
    commit 5d510ac31626ed157d2182149559430350cf2104 upstream.
    
    When size equals the current i_size (including 0), the code used to call
    check_lock_range(filp, i_size, size - 1, WRITE), which computes `size - 1`
    and can underflow for size==0. Skip the equal case.
    
    Cc: [email protected]
    Reported-by: Qianchang Zhao <[email protected]>
    Reported-by: Zhitong Liu <[email protected]>
    Signed-off-by: Qianchang Zhao <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: vfs: fix race on m_flags in vfs_cache [+ + +]

Author: Qianchang Zhao <[email protected]>
Date:   Mon Nov 24 16:05:09 2025 +0900

    ksmbd: vfs: fix race on m_flags in vfs_cache
    
    [ Upstream commit 991f8a79db99b14c48d20d2052c82d65b9186cad ]
    
    ksmbd maintains delete-on-close and pending-delete state in
    ksmbd_inode->m_flags. In vfs_cache.c this field is accessed under
    inconsistent locking: some paths read and modify m_flags under
    ci->m_lock while others do so without taking the lock at all.
    
    Examples:
    
     - ksmbd_query_inode_status() and __ksmbd_inode_close() use
       ci->m_lock when checking or updating m_flags.
     - ksmbd_inode_pending_delete(), ksmbd_set_inode_pending_delete(),
       ksmbd_clear_inode_pending_delete() and ksmbd_fd_set_delete_on_close()
       used to read and modify m_flags without ci->m_lock.
    
    This creates a potential data race on m_flags when multiple threads
    open, close and delete the same file concurrently. In the worst case
    delete-on-close and pending-delete bits can be lost or observed in an
    inconsistent state, leading to confusing delete semantics (files that
    stay on disk after delete-on-close, or files that disappear while still
    in use).
    
    Fix it by:
    
     - Making ksmbd_query_inode_status() look at m_flags under ci->m_lock
       after dropping inode_hash_lock.
     - Adding ci->m_lock protection to all helpers that read or modify
       m_flags (ksmbd_inode_pending_delete(), ksmbd_set_inode_pending_delete(),
       ksmbd_clear_inode_pending_delete(), ksmbd_fd_set_delete_on_close()).
     - Keeping the existing ci->m_lock protection in __ksmbd_inode_close(),
       and moving the actual unlink/xattr removal outside the lock.
    
    This unifies the locking around m_flags and removes the data race while
    preserving the existing delete-on-close behaviour.
    
    Reported-by: Qianchang Zhao <[email protected]>
    Reported-by: Zhitong Liu <[email protected]>
    Signed-off-by: Qianchang Zhao <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ktest.pl: Fix uninitialized var in config-bisect.pl [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Wed Dec 3 18:09:24 2025 -0500

    ktest.pl: Fix uninitialized var in config-bisect.pl
    
    commit d3042cbe84a060b4df764eb6c5300bbe20d125ca upstream.
    
    The error path of copying the old config used the wrong variable in the
    error message:
    
     $ mkdir /tmp/build
     $ ./tools/testing/ktest/config-bisect.pl -b /tmp/build config-good /tmp/config-bad
     $ chmod 0 /tmp/build
     $ ./tools/testing/ktest/config-bisect.pl -b /tmp/build config-good /tmp/config-bad good
     cp /tmp/build//.config config-good.tmp ... [0 seconds] FAILED!
     Use of uninitialized value $config in concatenation (.) or string at ./tools/testing/ktest/config-bisect.pl line 744.
     failed to copy  to config-good.tmp
    
    When it should have shown:
    
     failed to copy /tmp/build//.config to config-good.tmp
    
    Cc: [email protected]
    Cc: John 'Warthog9' Hawley <[email protected]>
    Fixes: 0f0db065999cf ("ktest: Add standalone config-bisect.pl program")
    Link: https://patch.msgid.link/[email protected]
    Reported-by: "John W. Krahn" <[email protected]>
    Signed-off-by: Steven Rostedt <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: arm64: Initialize HCR_EL2.E2H early [+ + +]

Author: Mark Rutland <[email protected]>
Date:   Fri Dec 19 10:21:21 2025 +0000

    KVM: arm64: Initialize HCR_EL2.E2H early
    
    [ Upstream commit 7a68b55ff39b0a1638acb1694c185d49f6077a0d ]
    
    On CPUs without FEAT_E2H0, HCR_EL2.E2H is RES1, but may reset to an
    UNKNOWN value out of reset and consequently may not read as 1 unless it
    has been explicitly initialized.
    
    We handled this for the head.S boot code in commits:
    
      3944382fa6f22b54 ("arm64: Treat HCR_EL2.E2H as RES1 when ID_AA64MMFR4_EL1.E2H0 is negative")
      b3320142f3db9b3f ("arm64: Fix early handling of FEAT_E2H0 not being implemented")
    
    Unfortunately, we forgot to apply a similar fix to the KVM PSCI entry
    points used when relaying CPU_ON, CPU_SUSPEND, and SYSTEM SUSPEND. When
    KVM is entered via these entry points, the value of HCR_EL2.E2H may be
    consumed before it has been initialized (e.g. by the 'init_el2_state'
    macro).
    
    Initialize HCR_EL2.E2H early in these paths such that it can be consumed
    reliably. The existing code in head.S is factored out into a new
    'init_el2_hcr' macro, and this is used in the __kvm_hyp_init_cpu()
    function common to all the relevant PSCI entry points.
    
    For clarity, I've tweaked the assembly used to check whether
    ID_AA64MMFR4_EL1.E2H0 is negative. The bitfield is extracted as a signed
    value, and this is checked with a signed-greater-or-equal (GE) comparison.
    
    As the hyp code will reconfigure HCR_EL2 later in ___kvm_hyp_init(), all
    bits other than E2H are initialized to zero in __kvm_hyp_init_cpu().
    
    Fixes: 3944382fa6f22b54 ("arm64: Treat HCR_EL2.E2H as RES1 when ID_AA64MMFR4_EL1.E2H0 is negative")
    Fixes: b3320142f3db9b3f ("arm64: Fix early handling of FEAT_E2H0 not being implemented")
    Signed-off-by: Mark Rutland <[email protected]>
    Cc: Ahmed Genidi <[email protected]>
    Cc: Ben Horgan <[email protected]>
    Cc: Catalin Marinas <[email protected]>
    Cc: Leo Yan <[email protected]>
    Cc: Marc Zyngier <[email protected]>
    Cc: Oliver Upton <[email protected]>
    Cc: Will Deacon <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [maz: fixed LT->GE thinko]
    Signed-off-by: Marc Zyngier <[email protected]>
    Signed-off-by: Wei-Lin Chang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: arm64: Initialize SCTLR_EL1 in __kvm_hyp_init_cpu() [+ + +]

Author: Ahmed Genidi <[email protected]>
Date:   Fri Dec 19 10:21:22 2025 +0000

    KVM: arm64: Initialize SCTLR_EL1 in __kvm_hyp_init_cpu()
    
    [ Upstream commit 3855a7b91d42ebf3513b7ccffc44807274978b3d ]
    
    When KVM is in protected mode, host calls to PSCI are proxied via EL2,
    and cold entries from CPU_ON, CPU_SUSPEND, and SYSTEM_SUSPEND bounce
    through __kvm_hyp_init_cpu() at EL2 before entering the host kernel's
    entry point at EL1. While __kvm_hyp_init_cpu() initializes SPSR_EL2 for
    the exception return to EL1, it does not initialize SCTLR_EL1.
    
    Due to this, it's possible to enter EL1 with SCTLR_EL1 in an UNKNOWN
    state. In practice this has been seen to result in kernel crashes after
    CPU_ON as a result of SCTLR_EL1.M being 1 in violation of the initial
    core configuration specified by PSCI.
    
    Fix this by initializing SCTLR_EL1 for cold entry to the host kernel.
    As it's necessary to write to SCTLR_EL12 in VHE mode, this
    initialization is moved into __kvm_host_psci_cpu_entry() where we can
    use write_sysreg_el1().
    
    The remnants of the '__init_el2_nvhe_prepare_eret' macro are folded into
    its only caller, as this is clearer than having the macro.
    
    Fixes: cdf367192766ad11 ("KVM: arm64: Intercept host's CPU_ON SMCs")
    Reported-by: Leo Yan <[email protected]>
    Signed-off-by: Ahmed Genidi <[email protected]>
    [ Mark: clarify commit message, handle E2H, move to C, remove macro ]
    Signed-off-by: Mark Rutland <[email protected]>
    Cc: Ahmed Genidi <[email protected]>
    Cc: Ben Horgan <[email protected]>
    Cc: Catalin Marinas <[email protected]>
    Cc: Leo Yan <[email protected]>
    Cc: Marc Zyngier <[email protected]>
    Cc: Oliver Upton <[email protected]>
    Cc: Will Deacon <[email protected]>
    Reviewed-by: Leo Yan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Marc Zyngier <[email protected]>
    Signed-off-by: Wei-Lin Chang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: Disallow toggling KVM_MEM_GUEST_MEMFD on an existing memslot [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Mon Dec 1 18:03:33 2025 -0800

    KVM: Disallow toggling KVM_MEM_GUEST_MEMFD on an existing memslot
    
    commit 9935df5333aa503a18de5071f53762b65c783c4c upstream.
    
    Reject attempts to disable KVM_MEM_GUEST_MEMFD on a memslot that was
    initially created with a guest_memfd binding, as KVM doesn't support
    toggling KVM_MEM_GUEST_MEMFD on existing memslots.  KVM prevents enabling
    KVM_MEM_GUEST_MEMFD, but doesn't prevent clearing the flag.
    
    Failure to reject the new memslot results in a use-after-free due to KVM
    not unbinding from the guest_memfd instance.  Unbinding on a FLAGS_ONLY
    change is easy enough, and can/will be done as a hardening measure (in
    anticipation of KVM supporting dirty logging on guest_memfd at some point),
    but fixing the use-after-free would only address the immediate symptom.
    
      ==================================================================
      BUG: KASAN: slab-use-after-free in kvm_gmem_release+0x362/0x400 [kvm]
      Write of size 8 at addr ffff8881111ae908 by task repro/745
    
      CPU: 7 UID: 1000 PID: 745 Comm: repro Not tainted 6.18.0-rc6-115d5de2eef3-next-kasan #3 NONE
      Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 0.0.0 02/06/2015
      Call Trace:
       <TASK>
       dump_stack_lvl+0x51/0x60
       print_report+0xcb/0x5c0
       kasan_report+0xb4/0xe0
       kvm_gmem_release+0x362/0x400 [kvm]
       __fput+0x2fa/0x9d0
       task_work_run+0x12c/0x200
       do_exit+0x6ae/0x2100
       do_group_exit+0xa8/0x230
       __x64_sys_exit_group+0x3a/0x50
       x64_sys_call+0x737/0x740
       do_syscall_64+0x5b/0x900
       entry_SYSCALL_64_after_hwframe+0x4b/0x53
      RIP: 0033:0x7f581f2eac31
       </TASK>
    
      Allocated by task 745 on cpu 6 at 9.746971s:
       kasan_save_stack+0x20/0x40
       kasan_save_track+0x13/0x50
       __kasan_kmalloc+0x77/0x90
       kvm_set_memory_region.part.0+0x652/0x1110 [kvm]
       kvm_vm_ioctl+0x14b0/0x3290 [kvm]
       __x64_sys_ioctl+0x129/0x1a0
       do_syscall_64+0x5b/0x900
       entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
      Freed by task 745 on cpu 6 at 9.747467s:
       kasan_save_stack+0x20/0x40
       kasan_save_track+0x13/0x50
       __kasan_save_free_info+0x37/0x50
       __kasan_slab_free+0x3b/0x60
       kfree+0xf5/0x440
       kvm_set_memslot+0x3c2/0x1160 [kvm]
       kvm_set_memory_region.part.0+0x86a/0x1110 [kvm]
       kvm_vm_ioctl+0x14b0/0x3290 [kvm]
       __x64_sys_ioctl+0x129/0x1a0
       do_syscall_64+0x5b/0x900
       entry_SYSCALL_64_after_hwframe+0x4b/0x53
    
    Reported-by: Alexander Potapenko <[email protected]>
    Fixes: a7800aa80ea4 ("KVM: Add KVM_CREATE_GUEST_MEMFD ioctl() for guest-specific backing memory")
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nSVM: Avoid incorrect injection of SVM_EXIT_CR0_SEL_WRITE [+ + +]

Author: Yosry Ahmed <[email protected]>
Date:   Fri Oct 24 19:29:18 2025 +0000

    KVM: nSVM: Avoid incorrect injection of SVM_EXIT_CR0_SEL_WRITE
    
    commit 3d80f4c93d3d26d0f9a0dd2844961a632eeea634 upstream.
    
    When emulating L2 instructions, svm_check_intercept() checks whether a
    write to CR0 should trigger a synthesized #VMEXIT with
    SVM_EXIT_CR0_SEL_WRITE. However, it does not check whether L1 enabled
    the intercept for SVM_EXIT_WRITE_CR0, which has higher priority
    according to the APM (24593—Rev.  3.42—March 2024, Table 15-7):
    
      When both selective and non-selective CR0-write intercepts are active at
      the same time, the non-selective intercept takes priority. With respect
      to exceptions, the priority of this intercept is the same as the generic
      CR0-write intercept.
    
    Make sure L1 does NOT intercept SVM_EXIT_WRITE_CR0 before checking if
    SVM_EXIT_CR0_SEL_WRITE needs to be injected.
    
    Opportunistically tweak the "not CR0" logic to explicitly bail early so
    that it's more obvious that only CR0 has a selective intercept, and that
    modifying icpt_info.exit_code is functionally necessary so that the call
    to nested_svm_exit_handled() checks the correct exit code.
    
    Fixes: cfec82cb7d31 ("KVM: SVM: Add intercept check for emulated cr accesses")
    Cc: [email protected]
    Signed-off-by: Yosry Ahmed <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [sean: isolate non-CR0 write logic, tweak comments accordingly]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nSVM: Clear exit_code_hi in VMCB when synthesizing nested VM-Exits [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Thu Nov 13 14:56:13 2025 -0800

    KVM: nSVM: Clear exit_code_hi in VMCB when synthesizing nested VM-Exits
    
    commit da01f64e7470988f8607776aa7afa924208863fb upstream.
    
    Explicitly clear exit_code_hi in the VMCB when synthesizing "normal"
    nested VM-Exits, as the full exit code is a 64-bit value (spoiler alert),
    and all exit codes for non-failing VMRUN use only bits 31:0.
    
    Cc: Jim Mattson <[email protected]>
    Cc: Yosry Ahmed <[email protected]>
    Cc: [email protected]
    Reviewed-by: Yosry Ahmed <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nSVM: Propagate SVM_EXIT_CR0_SEL_WRITE correctly for LMSW emulation [+ + +]

Author: Yosry Ahmed <[email protected]>
Date:   Fri Oct 24 19:29:17 2025 +0000

    KVM: nSVM: Propagate SVM_EXIT_CR0_SEL_WRITE correctly for LMSW emulation
    
    commit 5674a76db0213f9db1e4d08e847ff649b46889c0 upstream.
    
    When emulating L2 instructions, svm_check_intercept() checks whether a
    write to CR0 should trigger a synthesized #VMEXIT with
    SVM_EXIT_CR0_SEL_WRITE. For MOV-to-CR0, SVM_EXIT_CR0_SEL_WRITE is only
    triggered if any bit other than CR0.MP and CR0.TS is updated. However,
    according to the APM (24593—Rev.  3.42—March 2024, Table 15-7):
    
      The LMSW instruction treats the selective CR0-write
      intercept as a non-selective intercept (i.e., it intercepts
      regardless of the value being written).
    
    Skip checking the changed bits for x86_intercept_lmsw and always inject
    SVM_EXIT_CR0_SEL_WRITE.
    
    Fixes: cfec82cb7d31 ("KVM: SVM: Add intercept check for emulated cr accesses")
    Cc: [email protected]
    Reported-by: Matteo Rizzo <[email protected]>
    Signed-off-by: Yosry Ahmed <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nSVM: Set exit_code_hi to -1 when synthesizing SVM_EXIT_ERR (failed VMRUN) [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Thu Nov 13 14:56:14 2025 -0800

    KVM: nSVM: Set exit_code_hi to -1 when synthesizing SVM_EXIT_ERR (failed VMRUN)
    
    commit f402ecd7a8b6446547076f4bd24bd5d4dcc94481 upstream.
    
    Set exit_code_hi to -1u as a temporary band-aid to fix a long-standing
    (effectively since KVM's inception) bug where KVM treats the exit code as
    a 32-bit value, when in reality it's a 64-bit value.  Per the APM, offset
    0x70 is a single 64-bit value:
    
      070h 63:0 EXITCODE
    
    And a sane reading of the error values defined in "Table C-1. SVM Intercept
    Codes" is that negative values use the full 64 bits:
    
      –1 VMEXIT_INVALID Invalid guest state in VMCB.
      –2 VMEXIT_BUSYBUSY bit was set in the VMSA
      –3 VMEXIT_IDLE_REQUIREDThe sibling thread is not in an idle state
      -4 VMEXIT_INVALID_PMC Invalid PMC state
    
    And that interpretation is confirmed by testing on Milan and Turin (by
    setting bits in CR0[63:32] to generate VMEXIT_INVALID on VMRUN).
    
    Furthermore, Xen has treated exitcode as a 64-bit value since HVM support
    was adding in 2006 (see Xen commit d1bd157fbc ("Big merge the HVM
    full-virtualisation abstractions.")).
    
    Cc: Jim Mattson <[email protected]>
    Cc: Yosry Ahmed <[email protected]>
    Cc: [email protected]
    Reviewed-by: Yosry Ahmed <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: nVMX: Immediately refresh APICv controls as needed on nested VM-Exit [+ + +]

Author: Dongli Zhang <[email protected]>
Date:   Fri Dec 5 15:19:05 2025 -0800

    KVM: nVMX: Immediately refresh APICv controls as needed on nested VM-Exit
    
    commit 29763138830916f46daaa50e83e7f4f907a3236b upstream.
    
    If an APICv status updated was pended while L2 was active, immediately
    refresh vmcs01's controls instead of pending KVM_REQ_APICV_UPDATE as
    kvm_vcpu_update_apicv() only calls into vendor code if a change is
    necessary.
    
    E.g. if APICv is inhibited, and then activated while L2 is running:
    
      kvm_vcpu_update_apicv()
      |
      -> __kvm_vcpu_update_apicv()
         |
         -> apic->apicv_active = true
          |
          -> vmx_refresh_apicv_exec_ctrl()
             |
             -> vmx->nested.update_vmcs01_apicv_status = true
              |
              -> return
    
    Then L2 exits to L1:
    
      __nested_vmx_vmexit()
      |
      -> kvm_make_request(KVM_REQ_APICV_UPDATE)
    
      vcpu_enter_guest(): KVM_REQ_APICV_UPDATE
      -> kvm_vcpu_update_apicv()
         |
         -> __kvm_vcpu_update_apicv()
            |
            -> return // because if (apic->apicv_active == activate)
    
    Reported-by: Chao Gao <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]
    Fixes: 7c69661e225c ("KVM: nVMX: Defer APICv updates while L2 is active until L1 is active")
    Cc: [email protected]
    Signed-off-by: Dongli Zhang <[email protected]>
    [sean: write changelog]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Mark VMCB_NPT as dirty on nested VMRUN [+ + +]

Author: Jim Mattson <[email protected]>
Date:   Mon Sep 22 09:29:23 2025 -0700

    KVM: SVM: Mark VMCB_NPT as dirty on nested VMRUN
    
    commit 7c8b465a1c91f674655ea9cec5083744ec5f796a upstream.
    
    Mark the VMCB_NPT bit as dirty in nested_vmcb02_prepare_save()
    on every nested VMRUN.
    
    If L1 changes the PAT MSR between two VMRUN instructions on the same
    L1 vCPU, the g_pat field in the associated vmcb02 will change, and the
    VMCB_NPT clean bit should be cleared.
    
    Fixes: 4bb170a5430b ("KVM: nSVM: do not mark all VMCB02 fields dirty on nested vmexit")
    Cc: [email protected]
    Signed-off-by: Jim Mattson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: SVM: Mark VMCB_PERM_MAP as dirty on nested VMRUN [+ + +]

Author: Jim Mattson <[email protected]>
Date:   Mon Sep 22 09:29:22 2025 -0700

    KVM: SVM: Mark VMCB_PERM_MAP as dirty on nested VMRUN
    
    commit 93c9e107386dbe1243287a5b14ceca894de372b9 upstream.
    
    Mark the VMCB_PERM_MAP bit as dirty in nested_vmcb02_prepare_control()
    on every nested VMRUN.
    
    If L1 changes MSR interception (INTERCEPT_MSR_PROT) between two VMRUN
    instructions on the same L1 vCPU, the msrpm_base_pa in the associated
    vmcb02 will change, and the VMCB_PERM_MAP clean bit should be cleared.
    
    Fixes: 4bb170a5430b ("KVM: nSVM: do not mark all VMCB02 fields dirty on nested vmexit")
    Reported-by: Matteo Rizzo <[email protected]>
    Cc: [email protected]
    Signed-off-by: Jim Mattson <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Don't clear async #PF queue when CR0.PG is disabled (e.g. on #SMI) [+ + +]

Author: Maxim Levitsky <[email protected]>
Date:   Tue Oct 14 23:32:58 2025 -0400

    KVM: x86: Don't clear async #PF queue when CR0.PG is disabled (e.g. on #SMI)
    
    commit ab4e41eb9fabd4607304fa7cfe8ec9c0bd8e1552 upstream.
    
    Fix an interaction between SMM and PV asynchronous #PFs where an #SMI can
    cause KVM to drop an async #PF ready event, and thus result in guest tasks
    becoming permanently stuck due to the task that encountered the #PF never
    being resumed.  Specifically, don't clear the completion queue when paging
    is disabled, and re-check for completed async #PFs if/when paging is
    enabled.
    
    Prior to commit 2635b5c4a0e4 ("KVM: x86: interrupt based APF 'page ready'
    event delivery"), flushing the APF queue without notifying the guest of
    completed APF requests when paging is disabled was "necessary", in that
    delivering a #PF to the guest when paging is disabled would likely confuse
    and/or crash the guest.  And presumably the original async #PF development
    assumed that a guest would only disable paging when there was no intent to
    ever re-enable paging.
    
    That assumption fails in several scenarios, most visibly on an emulated
    SMI, as entering SMM always disables CR0.PG (i.e. initially runs with
    paging disabled).  When the SMM handler eventually executes RSM, the
    interrupted paging-enabled is restored, and the async #PF event is lost.
    
    Similarly, invoking firmware, e.g. via EFI runtime calls, might require a
    transition through paging modes and thus also disable paging with valid
    entries in the competion queue.
    
    To avoid dropping completion events, drop the "clear" entirely, and handle
    paging-enable transitions in the same way KVM already handles APIC
    enable/disable events: if a vCPU's APIC is disabled, APF completion events
    are not kept pending and not injected while APIC is disabled.  Once a
    vCPU's APIC is re-enabled, KVM raises KVM_REQ_APF_READY so that the vCPU
    recognizes any pending pending #APF ready events.
    
    Signed-off-by: Maxim Levitsky <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    [sean: rework changelog to call out #PF injection, drop "real mode"
           references, expand the code comment]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Explicitly set new periodic hrtimer expiration in apic_timer_fn() [+ + +]

Author: fuqiang wang <[email protected]>
Date:   Thu Nov 13 12:51:12 2025 -0800

    KVM: x86: Explicitly set new periodic hrtimer expiration in apic_timer_fn()
    
    commit 9633f180ce994ab293ce4924a9b7aaf4673aa114 upstream.
    
    When restarting an hrtimer to emulate a the guest's APIC timer in periodic
    mode, explicitly set the expiration using the target expiration computed
    by advance_periodic_target_expiration() instead of adding the period to
    the existing timer.  This will allow making adjustments to the expiration,
    e.g. to deal with expirations far in the past, without having to implement
    the same logic in both advance_periodic_target_expiration() and
    apic_timer_fn().
    
    Cc: [email protected]
    Signed-off-by: fuqiang wang <[email protected]>
    [sean: split to separate patch, write changelog]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: Fix VM hard lockup after prolonged inactivity with periodic HV timer [+ + +]

Author: fuqiang wang <[email protected]>
Date:   Thu Nov 13 12:51:13 2025 -0800

    KVM: x86: Fix VM hard lockup after prolonged inactivity with periodic HV timer
    
    commit 18ab3fc8e880791aa9f7c000261320fc812b5465 upstream.
    
    When advancing the target expiration for the guest's APIC timer in periodic
    mode, set the expiration to "now" if the target expiration is in the past
    (similar to what is done in update_target_expiration()).  Blindly adding
    the period to the previous target expiration can result in KVM generating
    a practically unbounded number of hrtimer IRQs due to programming an
    expired timer over and over.  In extreme scenarios, e.g. if userspace
    pauses/suspends a VM for an extended duration, this can even cause hard
    lockups in the host.
    
    Currently, the bug only affects Intel CPUs when using the hypervisor timer
    (HV timer), a.k.a. the VMX preemption timer.  Unlike the software timer,
    a.k.a. hrtimer, which KVM keeps running even on exits to userspace, the
    HV timer only runs while the guest is active.  As a result, if the vCPU
    does not run for an extended duration, there will be a huge gap between
    the target expiration and the current time the vCPU resumes running.
    Because the target expiration is incremented by only one period on each
    timer expiration, this leads to a series of timer expirations occurring
    rapidly after the vCPU/VM resumes.
    
    More critically, when the vCPU first triggers a periodic HV timer
    expiration after resuming, advancing the expiration by only one period
    will result in a target expiration in the past.  As a result, the delta
    may be calculated as a negative value.  When the delta is converted into
    an absolute value (tscdeadline is an unsigned u64), the resulting value
    can overflow what the HV timer is capable of programming.  I.e. the large
    value will exceed the VMX Preemption Timer's maximum bit width of
    cpu_preemption_timer_multi + 32, and thus cause KVM to switch from the
    HV timer to the software timer (hrtimers).
    
    After switching to the software timer, periodic timer expiration callbacks
    may be executed consecutively within a single clock interrupt handler,
    because hrtimers honors KVM's request for an expiration in the past and
    immediately re-invokes KVM's callback after reprogramming.  And because
    the interrupt handler runs with IRQs disabled, restarting KVM's hrtimer
    over and over until the target expiration is advanced to "now" can result
    in a hard lockup.
    
    E.g. the following hard lockup was triggered in the host when running a
    Windows VM (only relevant because it used the APIC timer in periodic mode)
    after resuming the VM from a long suspend (in the host).
    
      NMI watchdog: Watchdog detected hard LOCKUP on cpu 45
      ...
      RIP: 0010:advance_periodic_target_expiration+0x4d/0x80 [kvm]
      ...
      RSP: 0018:ff4f88f5d98d8ef0 EFLAGS: 00000046
      RAX: fff0103f91be678e RBX: fff0103f91be678e RCX: 00843a7d9e127bcc
      RDX: 0000000000000002 RSI: 0052ca4003697505 RDI: ff440d5bfbdbd500
      RBP: ff440d5956f99200 R08: ff2ff2a42deb6a84 R09: 000000000002a6c0
      R10: 0122d794016332b3 R11: 0000000000000000 R12: ff440db1af39cfc0
      R13: ff440db1af39cfc0 R14: ffffffffc0d4a560 R15: ff440db1af39d0f8
      FS:  00007f04a6ffd700(0000) GS:ff440db1af380000(0000) knlGS:000000e38a3b8000
      CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      CR2: 000000d5651feff8 CR3: 000000684e038002 CR4: 0000000000773ee0
      PKRU: 55555554
      Call Trace:
       <IRQ>
       apic_timer_fn+0x31/0x50 [kvm]
       __hrtimer_run_queues+0x100/0x280
       hrtimer_interrupt+0x100/0x210
       ? ttwu_do_wakeup+0x19/0x160
       smp_apic_timer_interrupt+0x6a/0x130
       apic_timer_interrupt+0xf/0x20
       </IRQ>
    
    Moreover, if the suspend duration of the virtual machine is not long enough
    to trigger a hard lockup in this scenario, since commit 98c25ead5eda
    ("KVM: VMX: Move preemption timer <=> hrtimer dance to common x86"), KVM
    will continue using the software timer until the guest reprograms the APIC
    timer in some way.  Since the periodic timer does not require frequent APIC
    timer register programming, the guest may continue to use the software
    timer in perpetuity.
    
    Fixes: d8f2f498d9ed ("x86/kvm: fix LAPIC timer drift when guest uses periodic mode")
    Cc: [email protected]
    Signed-off-by: fuqiang wang <[email protected]>
    [sean: massage comments and changelog]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

KVM: x86: WARN if hrtimer callback for periodic APIC timer fires with period=0 [+ + +]

Author: Sean Christopherson <[email protected]>
Date:   Thu Nov 13 12:51:11 2025 -0800

    KVM: x86: WARN if hrtimer callback for periodic APIC timer fires with period=0
    
    commit 0ea9494be9c931ddbc084ad5e11fda91b554cf47 upstream.
    
    WARN and don't restart the hrtimer if KVM's callback runs with the guest's
    APIC timer in periodic mode but with a period of '0', as not advancing the
    hrtimer's deadline would put the CPU into an infinite loop of hrtimer
    events.  Observing a period of '0' should be impossible, even when the
    hrtimer is running on a different CPU than the vCPU, as KVM is supposed to
    cancel the hrtimer before changing (or zeroing) the period, e.g. when
    switching from periodic to one-shot.
    
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sean Christopherson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

leds: leds-cros_ec: Skip LEDs without color components [+ + +]

Author: Thomas Weißschuh <[email protected]>
Date:   Tue Oct 28 16:31:03 2025 +0100

    leds: leds-cros_ec: Skip LEDs without color components
    
    commit 4dbf066d965cd3299fb396f1375d10423c9c625c upstream.
    
    A user reports that on their Lenovo Corsola Magneton with EC firmware
    steelix-15194.270.0 the driver probe fails with EINVAL. It turns out
    that the power LED does not contain any color components as indicated
    by the following "ectool led power query" output:
    
    Brightness range for LED 1:
            red     : 0x0
            green   : 0x0
            blue    : 0x0
            yellow  : 0x0
            white   : 0x0
            amber   : 0x0
    
    The LED also does not react to commands sent manually through ectool and
    is generally non-functional.
    
    Instead of failing the probe for all LEDs managed by the EC when one
    without color components is encountered, silently skip those.
    
    Cc: [email protected]
    Fixes: 8d6ce6f3ec9d ("leds: Add ChromeOS EC driver")
    Signed-off-by: Thomas Weißschuh <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

leds: leds-lp50xx: Allow LED 0 to be added to module bank [+ + +]

Author: Christian Hitz <[email protected]>
Date:   Wed Oct 8 14:32:21 2025 +0200

    leds: leds-lp50xx: Allow LED 0 to be added to module bank
    
    commit 26fe74d598c32e7bc6f150edfc4aa43e1bee55db upstream.
    
    led_banks contains LED module number(s) that should be grouped into the
    module bank. led_banks is 0-initialized.
    By checking the led_banks entries for 0, un-set entries are detected.
    But a 0-entry also indicates that LED module 0 should be grouped into the
    module bank.
    
    By only iterating over the available entries no check for unused entries
    is required and LED module 0 can be added to bank.
    
    Cc: [email protected]
    Fixes: 242b81170fb8 ("leds: lp50xx: Add the LP50XX family of the RGB LED driver")
    Signed-off-by: Christian Hitz <[email protected]>
    Reviewed-by: Jacek Anaszewski <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

leds: leds-lp50xx: Enable chip before any communication [+ + +]

Author: Christian Hitz <[email protected]>
Date:   Tue Oct 28 16:51:40 2025 +0100

    leds: leds-lp50xx: Enable chip before any communication
    
    commit 434959618c47efe9e5f2e20f4a850caac4f6b823 upstream.
    
    If a GPIO is used to control the chip's enable pin, it needs to be pulled
    high before any i2c communication is attempted.
    
    Currently, the enable GPIO handling is not correct.
    
    Assume the enable GPIO is low when the probe function is entered. In this
    case the device is in SHUTDOWN mode and does not react to i2c commands.
    
    During probe the following sequence happens:
     1. The call to lp50xx_reset() on line 548 has no effect as i2c is not
        possible yet.
     2. Then - on line 552 - lp50xx_enable_disable() is called. As
        "priv->enable_gpio“ has not yet been initialized, setting the GPIO has
        no effect. Also the i2c enable command is not executed as the device
        is still in SHUTDOWN.
     3. On line 556 the call to lp50xx_probe_dt() finally parses the rest of
        the DT and the configured priv->enable_gpio is set up.
    
    As a result the device is still in SHUTDOWN mode and not ready for
    operation.
    
    Split lp50xx_enable_disable() into distinct enable and disable functions
    to enforce correct ordering between enable_gpio manipulations and i2c
    commands.
    Read enable_gpio configuration from DT before attempting to manipulate
    enable_gpio.
    Add delays to observe correct wait timing after manipulating enable_gpio
    and before any i2c communication.
    
    Cc: [email protected]
    Fixes: 242b81170fb8 ("leds: lp50xx: Add the LP50XX family of the RGB LED driver")
    Signed-off-by: Christian Hitz <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

leds: leds-lp50xx: LP5009 supports 3 modules for a total of 9 LEDs [+ + +]

Author: Christian Hitz <[email protected]>
Date:   Wed Oct 22 08:33:04 2025 +0200

    leds: leds-lp50xx: LP5009 supports 3 modules for a total of 9 LEDs
    
    commit 5246e3673eeeccb4f5bf4f42375dd495d465ac15 upstream.
    
    LP5009 supports 9 LED outputs that are grouped into 3 modules.
    
    Cc: [email protected]
    Fixes: 242b81170fb8 ("leds: lp50xx: Add the LP50XX family of the RGB LED driver")
    Signed-off-by: Christian Hitz <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib/crypto: riscv/chacha: Avoid s0/fp register [+ + +]

Author: Vivian Wang <[email protected]>
Date:   Mon Dec 29 14:37:29 2025 -0800

    lib/crypto: riscv/chacha: Avoid s0/fp register
    
    commit 43169328c7b4623b54b7713ec68479cebda5465f upstream.
    
    In chacha_zvkb, avoid using the s0 register, which is the frame pointer,
    by reallocating KEY0 to t5. This makes stack traces available if e.g. a
    crash happens in chacha_zvkb.
    
    No frame pointer maintenance is otherwise required since this is a leaf
    function.
    
    Signed-off-by: Vivian Wang <[email protected]>
    Fixes: bb54668837a0 ("crypto: riscv - add vector crypto accelerated ChaCha20")
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Eric Biggers <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib/crypto: x86/blake2s: Fix 32-bit arg treated as 64-bit [+ + +]

Author: Eric Biggers <[email protected]>
Date:   Sun Nov 2 15:42:04 2025 -0800

    lib/crypto: x86/blake2s: Fix 32-bit arg treated as 64-bit
    
    commit 2f22115709fc7ebcfa40af3367a508fbbd2f71e9 upstream.
    
    In the C code, the 'inc' argument to the assembly functions
    blake2s_compress_ssse3() and blake2s_compress_avx512() is declared with
    type u32, matching blake2s_compress().  The assembly code then reads it
    from the 64-bit %rcx.  However, the ABI doesn't guarantee zero-extension
    to 64 bits, nor do gcc or clang guarantee it.  Therefore, fix these
    functions to read this argument from the 32-bit %ecx.
    
    In theory, this bug could have caused the wrong 'inc' value to be used,
    causing incorrect BLAKE2s hashes.  In practice, probably not: I've fixed
    essentially this same bug in many other assembly files too, but there's
    never been a real report of it having caused a problem.  In x86_64, all
    writes to 32-bit registers are zero-extended to 64 bits.  That results
    in zero-extension in nearly all situations.  I've only been able to
    demonstrate a lack of zero-extension with a somewhat contrived example
    involving truncation, e.g. when the C code has a u64 variable holding
    0x1234567800000040 and passes it as a u32 expecting it to be truncated
    to 0x40 (64).  But that's not what the real code does, of course.
    
    Fixes: ed0356eda153 ("crypto: blake2s - x86_64 SIMD implementation")
    Cc: [email protected]
    Reviewed-by: Ard Biesheuvel <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Eric Biggers <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libceph: make decode_pool() more resilient against corrupted osdmaps [+ + +]

Author: Ilya Dryomov <[email protected]>
Date:   Tue Dec 2 10:32:31 2025 +0100

    libceph: make decode_pool() more resilient against corrupted osdmaps
    
    commit 8c738512714e8c0aa18f8a10c072d5b01c83db39 upstream.
    
    If the osdmap is (maliciously) corrupted such that the encoded length
    of ceph_pg_pool envelope is less than what is expected for a particular
    encoding version, out-of-bounds reads may ensue because the only bounds
    check that is there is based on that length value.
    
    This patch adds explicit bounds checks for each field that is decoded
    or skipped.
    
    Cc: [email protected]
    Reported-by: ziming zhang <[email protected]>
    Signed-off-by: Ilya Dryomov <[email protected]>
    Reviewed-by: Xiubo Li <[email protected]>
    Tested-by: ziming zhang <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

libperf cpumap: Fix perf_cpu_map__max for an empty/NULL map [+ + +]

Author: Ian Rogers <[email protected]>
Date:   Wed Dec 3 13:47:01 2025 -0800

    libperf cpumap: Fix perf_cpu_map__max for an empty/NULL map
    
    [ Upstream commit a0a4173631bfcfd3520192c0a61cf911d6a52c3a ]
    
    Passing an empty map to perf_cpu_map__max triggered a SEGV. Explicitly
    test for the empty map.
    
    Reported-by: Ingo Molnar <[email protected]>
    Closes: https://lore.kernel.org/linux-perf-users/[email protected]/
    Tested-by: Ingo Molnar <[email protected]>
    Signed-off-by: Ian Rogers <[email protected]>
    Tested-by: Thomas Richter <[email protected]>
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Linux: Linux 6.12.64 [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Thu Jan 8 10:15:06 2026 +0100

    Linux 6.12.64
    
    Link: https://lore.kernel.org/r/[email protected]
    Tested-by: Brett A C Sheffield <[email protected]>
    Tested-by: Pavel Machek (CIP) <[email protected]>
    Tested-by: Shuah Khan <[email protected]>
    Tested-by: Peter Schneider <[email protected]>
    Tested-by: Florian Fainelli <[email protected]>
    Tested-by: Ron Economos <[email protected]>
    Tested-by: Mark Brown <[email protected]>
    Tested-by: Francesco Dolcini <[email protected]>
    Tested-by: Salvatore Bonaccorso <[email protected]>
    Tested-by: Jeffrin Jose T <[email protected]>
    Tested-by: Harshit Mogalapalli <[email protected]>
    Tested-by: Miguel Ojeda <[email protected]>
    Tested-by: Brett Mastbergen <[email protected]>
    Tested-by: Jon Hunter <[email protected]>
    Tested-by: Hardik Garg <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

livepatch: Match old_sympos 0 and 1 in klp_find_func() [+ + +]

Author: Song Liu <[email protected]>
Date:   Mon Oct 13 10:30:19 2025 -0700

    livepatch: Match old_sympos 0 and 1 in klp_find_func()
    
    [ Upstream commit 139560e8b973402140cafeb68c656c1374bd4c20 ]
    
    When there is only one function of the same name, old_sympos of 0 and 1
    are logically identical. Match them in klp_find_func().
    
    This is to avoid a corner case with different toolchain behavior.
    
    In this specific issue, two versions of kpatch-build were used to
    build livepatch for the same kernel. One assigns old_sympos == 0 for
    unique local functions, the other assigns old_sympos == 1 for unique
    local functions. Both versions work fine by themselves. (PS: This
    behavior change was introduced in a downstream version of kpatch-build.
    This change does not exist in upstream kpatch-build.)
    
    However, during livepatch upgrade (with the replace flag set) from a
    patch built with one version of kpatch-build to the same fix built with
    the other version of kpatch-build, livepatching fails with errors like:
    
    [   14.218706] sysfs: cannot create duplicate filename 'xxx/somefunc,1'
    ...
    [   14.219466] Call Trace:
    [   14.219468]  <TASK>
    [   14.219469]  dump_stack_lvl+0x47/0x60
    [   14.219474]  sysfs_warn_dup.cold+0x17/0x27
    [   14.219476]  sysfs_create_dir_ns+0x95/0xb0
    [   14.219479]  kobject_add_internal+0x9e/0x260
    [   14.219483]  kobject_add+0x68/0x80
    [   14.219485]  ? kstrdup+0x3c/0xa0
    [   14.219486]  klp_enable_patch+0x320/0x830
    [   14.219488]  patch_init+0x443/0x1000 [ccc_0_6]
    [   14.219491]  ? 0xffffffffa05eb000
    [   14.219492]  do_one_initcall+0x2e/0x190
    [   14.219494]  do_init_module+0x67/0x270
    [   14.219496]  init_module_from_file+0x75/0xa0
    [   14.219499]  idempotent_init_module+0x15a/0x240
    [   14.219501]  __x64_sys_finit_module+0x61/0xc0
    [   14.219503]  do_syscall_64+0x5b/0x160
    [   14.219505]  entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [   14.219507] RIP: 0033:0x7f545a4bd96d
    ...
    [   14.219516] kobject: kobject_add_internal failed for somefunc,1 with
        -EEXIST, don't try to register things with the same name ...
    
    This happens because klp_find_func() thinks somefunc with old_sympos==0
    is not the same as somefunc with old_sympos==1, and klp_add_object_nops
    adds another xxx/func,1 to the list of functions to patch.
    
    Signed-off-by: Song Liu <[email protected]>
    Acked-by: Josh Poimboeuf <[email protected]>
    [[email protected]: Fixed some typos.]
    Reviewed-by: Petr Mladek <[email protected]>
    Tested-by: Petr Mladek <[email protected]>
    Signed-off-by: Petr Mladek <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

lockd: fix vfs_test_lock() calls [+ + +]

Author: NeilBrown <[email protected]>
Date:   Sat Nov 22 12:00:36 2025 +1100

    lockd: fix vfs_test_lock() calls
    
    commit a49a2a1baa0c553c3548a1c414b6a3c005a8deba upstream.
    
    Usage of vfs_test_lock() is somewhat confused.  Documentation suggests
    it is given a "lock" but this is not the case.  It is given a struct
    file_lock which contains some details of the sort of lock it should be
    looking for.
    
    In particular passing a "file_lock" containing fl_lmops or fl_ops is
    meaningless and possibly confusing.
    
    This is particularly problematic in lockd.  nlmsvc_testlock() receives
    an initialised "file_lock" from xdr-decode, including manager ops and an
    owner.  It then mistakenly passes this to vfs_test_lock() which might
    replace the owner and the ops.  This can lead to confusion when freeing
    the lock.
    
    The primary role of the 'struct file_lock' passed to vfs_test_lock() is
    to report a conflicting lock that was found, so it makes more sense for
    nlmsvc_testlock() to pass "conflock", which it uses for returning the
    conflicting lock.
    
    With this change, freeing of the lock is not confused and code in
    __nlm4svc_proc_test() and __nlmsvc_proc_test() can be simplified.
    
    Documentation for vfs_test_lock() is improved to reflect its real
    purpose, and a WARN_ON_ONCE() is added to avoid a similar problem in the
    future.
    
    Reported-by: Olga Kornievskaia <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]
    Signed-off-by: NeilBrown <[email protected]>
    Fixes: 20fa19027286 ("nfs: add export operations")
    Cc: [email protected]
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Add new PCI ID for pci_fixup_vgadev() [+ + +]

Author: Huacai Chen <[email protected]>
Date:   Sat Dec 6 10:39:49 2025 +0800

    LoongArch: Add new PCI ID for pci_fixup_vgadev()
    
    commit bf3fa8f232a1eec8d7b88dcd9e925e60f04f018d upstream.
    
    Loongson-2K3000 has a new PCI ID (0x7a46) for its display controller,
    Add it for pci_fixup_vgadev() since we prefer a discrete graphics card
    as default boot device if present.
    
    Cc: [email protected]
    Signed-off-by: Tianrui Zhao <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: BPF: Sign extend kfunc call arguments [+ + +]

Author: Hengqi Chen <[email protected]>
Date:   Wed Dec 31 15:19:20 2025 +0800

    LoongArch: BPF: Sign extend kfunc call arguments
    
    commit 3f5a238f24d7b75f9efe324d3539ad388f58536e upstream.
    
    The kfunc calls are native calls so they should follow LoongArch calling
    conventions. Sign extend its arguments properly to avoid kernel panic.
    This is done by adding a new emit_abi_ext() helper. The emit_abi_ext()
    helper performs extension in place meaning a value already store in the
    target register (Note: this is different from the existing sign_extend()
    helper and thus we can't reuse it).
    
    Cc: [email protected]
    Fixes: 5dc615520c4d ("LoongArch: Add BPF JIT support")
    Signed-off-by: Hengqi Chen <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: BPF: Zero-extend bpf_tail_call() index [+ + +]

Author: Hengqi Chen <[email protected]>
Date:   Wed Dec 31 15:19:20 2025 +0800

    LoongArch: BPF: Zero-extend bpf_tail_call() index
    
    commit eb71f5c433e1c6dff089b315881dec40a88a7baf upstream.
    
    The bpf_tail_call() index should be treated as a u32 value. Let's
    zero-extend it to avoid calling wrong BPF progs. See similar fixes
    for x86 [1]) and arm64 ([2]) for more details.
    
      [1]: https://github.com/torvalds/linux/commit/90caccdd8cc0215705f18b92771b449b01e2474a
      [2]: https://github.com/torvalds/linux/commit/16338a9b3ac30740d49f5dfed81bac0ffa53b9c7
    
    Cc: [email protected]
    Fixes: 5dc615520c4d ("LoongArch: Add BPF JIT support")
    Signed-off-by: Hengqi Chen <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Correct the calculation logic of thread_count [+ + +]

Author: Qiang Ma <[email protected]>
Date:   Sat Dec 6 10:39:49 2025 +0800

    LoongArch: Correct the calculation logic of thread_count
    
    commit 1de0ae21f136efa6c5d8a4d3e07b7d1ca39c750f upstream.
    
    For thread_count, the current calculation method has a maximum of 255,
    which may not be sufficient in the future. Therefore, we are correcting
    it now.
    
    Reference: SMBIOS Specification, 7.5 Processor Information (Type 4)[1]
    
    [1]: https://www.dmtf.org/sites/default/files/standards/documents/DSP0134_3.9.0.pdf
    
    Cc: [email protected]
    Signed-off-by: Qiang Ma <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Fix build errors for CONFIG_RANDSTRUCT [+ + +]

Author: Huacai Chen <[email protected]>
Date:   Sat Dec 6 10:39:40 2025 +0800

    LoongArch: Fix build errors for CONFIG_RANDSTRUCT
    
    commit 3c250aecef62da81deb38ac6738ac0a88d91f1fc upstream.
    
    When CONFIG_RANDSTRUCT enabled, members of task_struct are randomized.
    There is a chance that TASK_STACK_CANARY be out of 12bit immediate's
    range and causes build errors. TASK_STACK_CANARY is naturally aligned,
    so fix it by replacing ld.d/st.d with ldptr.d/stptr.d which have 14bit
    immediates.
    
    Cc: [email protected]
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Suggested-by: Rui Wang <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Refactor register restoration in ftrace_common_return [+ + +]

Author: Chenghao Duan <[email protected]>
Date:   Wed Dec 31 15:19:20 2025 +0800

    LoongArch: Refactor register restoration in ftrace_common_return
    
    commit 45cb47c628dfbd1994c619f3eac271a780602826 upstream.
    
    Refactor the register restoration sequence in the ftrace_common_return
    function to clearly distinguish between the logic of normal returns and
    direct call returns in function tracing scenarios. The logic is as
    follows:
    
    1. In the case of a normal return, the execution flow returns to the
    traced function, and ftrace must ensure that the register data is
    consistent with the state when the function was entered.
    
    ra = parent return address; t0 = traced function return address.
    
    2. In the case of a direct call return, the execution flow jumps to the
    custom trampoline function, and ftrace must ensure that the register
    data is consistent with the state when ftrace was entered.
    
    ra = traced function return address; t0 = parent return address.
    
    Cc: [email protected]
    Fixes: 9cdc3b6a299c ("LoongArch: ftrace: Add direct call support")
    Signed-off-by: Chenghao Duan <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Use __pmd()/__pte() for swap entry conversions [+ + +]

Author: WangYuli <[email protected]>
Date:   Sat Dec 6 10:39:48 2025 +0800

    LoongArch: Use __pmd()/__pte() for swap entry conversions
    
    commit 4a71df151e703b5e7e85b33369cee59ef2665e61 upstream.
    
    The __pmd() and __pte() helper macros provide the correct initialization
    syntax and abstraction for the pmd_t and pte_t types.
    
    Use __pmd() to fix follow warning about __swp_entry_to_pmd() with gcc-15
    under specific configs [1] :
    
      In file included from ./include/linux/pgtable.h:6,
                       from ./include/linux/mm.h:31,
                       from ./include/linux/pagemap.h:8,
                       from arch/loongarch/mm/init.c:14:
      ./include/linux/swapops.h: In function ‘swp_entry_to_pmd’:
      ./arch/loongarch/include/asm/pgtable.h:302:34: error: missing braces around initializer [-Werror=missing-braces]
        302 | #define __swp_entry_to_pmd(x)   ((pmd_t) { (x).val | _PAGE_HUGE })
            |                                  ^
      ./include/linux/swapops.h:559:16: note: in expansion of macro ‘__swp_entry_to_pmd’
        559 |         return __swp_entry_to_pmd(arch_entry);
            |                ^~~~~~~~~~~~~~~~~~
      cc1: all warnings being treated as errors
    
    Also update __swp_entry_to_pte() to use __pte() for consistency.
    
    [1]. https://download.01.org/0day-ci/archive/20251119/[email protected]/config
    
    Cc: [email protected]
    Signed-off-by: Yuli Wang <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Use unsigned long for _end and _text [+ + +]

Author: Tiezhu Yang <[email protected]>
Date:   Sat Dec 6 10:39:48 2025 +0800

    LoongArch: Use unsigned long for _end and _text
    
    commit a258a3cb1895e3acf5f2fe245d17426e894bc935 upstream.
    
    It is better to use unsigned long rather than long for _end and _text to
    calculate the kernel length.
    
    Cc: [email protected] # v6.3+
    Fixes: e5f02b51fa0c ("LoongArch: Add support for kernel address space layout randomization (KASLR)")
    Signed-off-by: Tiezhu Yang <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

md/raid10: wait barrier before returning discard request with REQ_NOWAIT [+ + +]

Author: Xiao Ni <[email protected]>
Date:   Fri Jan 2 12:37:22 2026 -0800

    md/raid10: wait barrier before returning discard request with REQ_NOWAIT
    
    [ Upstream commit 3db4404435397a345431b45f57876a3df133f3b4 ]
    
    raid10_handle_discard should wait barrier before returning a discard bio
    which has REQ_NOWAIT. And there is no need to print warning calltrace
    if a discard bio has REQ_NOWAIT flag. Quality engineer usually checks
    dmesg and reports error if dmesg has warning/error calltrace.
    
    Fixes: c9aa889b035f ("md: raid10 add nowait support")
    Signed-off-by: Xiao Ni <[email protected]>
    Acked-by: Coly Li <[email protected]>
    Link: https://lore.kernel.org/linux-raid/[email protected]
    Signed-off-by: Yu Kuai <[email protected]>
    [Harshit: Clean backport to 6.12.y]
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

md/raid5: fix possible null-pointer dereferences in raid5_store_group_thread_cnt() [+ + +]

Author: Tuo Li <[email protected]>
Date:   Thu Dec 25 21:03:26 2025 +0800

    md/raid5: fix possible null-pointer dereferences in raid5_store_group_thread_cnt()
    
    [ Upstream commit 7ad6ef91d8745d04aff9cce7bdbc6320d8e05fe9 ]
    
    The variable mddev->private is first assigned to conf and then checked:
    
      conf = mddev->private;
      if (!conf) ...
    
    If conf is NULL, then mddev->private is also NULL. In this case,
    null-pointer dereferences can occur when calling raid5_quiesce():
    
      raid5_quiesce(mddev, true);
      raid5_quiesce(mddev, false);
    
    since mddev->private is assigned to conf again in raid5_quiesce(), and conf
    is dereferenced in several places, for example:
    
      conf->quiesce = 0;
      wake_up(&conf->wait_for_quiescent);
    
    To fix this issue, the function should unlock mddev and return before
    invoking raid5_quiesce() when conf is NULL, following the existing pattern
    in raid5_change_consistency_policy().
    
    Fixes: fa1944bbe622 ("md/raid5: Wait sync io to finish before changing group cnt")
    Signed-off-by: Tuo Li <[email protected]>
    Reviewed-by: Xiao Ni <[email protected]>
    Reviewed-by: Paul Menzel <[email protected]>
    Link: https://lore.kernel.org/linux-raid/[email protected]
    Signed-off-by: Yu Kuai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

md: Fix static checker warning in analyze_sbs [+ + +]

Author: Li Nan <[email protected]>
Date:   Mon Dec 15 20:44:12 2025 +0800

    md: Fix static checker warning in analyze_sbs
    
    [ Upstream commit 00f6c1b4d15d35fadb7f34768a1831c81aaa8936 ]
    
    The following warn is reported:
    
     drivers/md/md.c:3912 analyze_sbs()
     warn: iterator 'i' not incremented
    
    Fixes: d8730f0cf4ef ("md: Remove deprecated CONFIG_MD_MULTIPATH")
    Reported-by: Dan Carpenter <[email protected]>
    Closes: https://lore.kernel.org/linux-raid/[email protected]/T/#t
    Signed-off-by: Li Nan <[email protected]>
    Link: https://lore.kernel.org/linux-raid/[email protected]
    Signed-off-by: Yu Kuai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

media: adv7842: Avoid possible out-of-bounds array accesses in adv7842_cp_log_status() [+ + +]

Author: Ivan Abramov <[email protected]>
Date:   Wed Sep 3 02:23:31 2025 +0300

    media: adv7842: Avoid possible out-of-bounds array accesses in adv7842_cp_log_status()
    
    commit 8163419e3e05d71dcfa8fb49c8fdf8d76908fe51 upstream.
    
    It's possible for cp_read() and hdmi_read() to return -EIO. Those
    values are further used as indexes for accessing arrays.
    
    Fix that by checking return values where it's needed.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: a89bcd4c6c20 ("[media] adv7842: add new video decoder driver")
    Cc: [email protected]
    Signed-off-by: Ivan Abramov <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: amphion: Add a frame flush mode for decoder [+ + +]

Author: Ming Qian <[email protected]>
Date:   Mon Jan 5 16:05:30 2026 -0500

    media: amphion: Add a frame flush mode for decoder
    
    [ Upstream commit 9ea16ba6eaf93f25f61855751f71e2e701709ddf ]
    
    By default the amphion decoder will pre-parse 3 frames before starting
    to decode the first frame. Alternatively, a block of flush padding data
    can be appended to the frame, which will ensure that the decoder can
    start decoding immediately after parsing the flush padding data, thus
    potentially reducing decoding latency.
    
    This mode was previously only enabled, when the display delay was set to
    0. Allow the user to manually toggle the use of that mode via a module
    parameter called low_latency, which enables the mode without
    changing the display order.
    
    Signed-off-by: Ming Qian <[email protected]>
    Reviewed-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Sebastian Fricke <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Stable-dep-of: 634c2cd17bd0 ("media: amphion: Remove vpu_vb_is_codecconfig")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: amphion: Cancel message work before releasing the VPU core [+ + +]

Author: Ming Qian <[email protected]>
Date:   Tue Sep 16 14:10:07 2025 +0800

    media: amphion: Cancel message work before releasing the VPU core
    
    commit ae246b0032146e352c4c06a7bf03cd3d5bcb2ecd upstream.
    
    To avoid accessing the VPU register after release of the VPU core,
    cancel the message work and destroy the workqueue that handles the
    VPU message before release of the VPU core.
    
    Fixes: 3cd084519c6f ("media: amphion: add vpu v4l2 m2m support")
    Cc: [email protected]
    Signed-off-by: Ming Qian <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: amphion: Make some vpu_v4l2 functions static [+ + +]

Author: Laurent Pinchart <[email protected]>
Date:   Mon Jan 5 16:05:31 2026 -0500

    media: amphion: Make some vpu_v4l2 functions static
    
    [ Upstream commit 5d1e54bb4dc6741284a3ed587e994308ddee2f16 ]
    
    Some functions defined in vpu_v4l2.c are never used outside of that
    compilation unit. Make them static.
    
    Signed-off-by: Laurent Pinchart <[email protected]>
    Reviewed-by: Ming Qian <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Stable-dep-of: 634c2cd17bd0 ("media: amphion: Remove vpu_vb_is_codecconfig")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: amphion: Remove vpu_vb_is_codecconfig [+ + +]

Author: Ming Qian <[email protected]>
Date:   Mon Jan 5 16:05:32 2026 -0500

    media: amphion: Remove vpu_vb_is_codecconfig
    
    [ Upstream commit 634c2cd17bd021487c57b95973bddb14be8002ff ]
    
    Currently the function vpu_vb_is_codecconfig() always returns 0.
    Delete it and its related code.
    
    Fixes: 3cd084519c6f ("media: amphion: add vpu v4l2 m2m support")
    Cc: [email protected]
    Signed-off-by: Ming Qian <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: cec: Fix debugfs leak on bus_register() failure [+ + +]

Author: Haotian Zhang <[email protected]>
Date:   Mon Sep 29 19:12:29 2025 +0800

    media: cec: Fix debugfs leak on bus_register() failure
    
    commit c43bcd2b2aa3c2ca9d2433c3990ecbc2c47d10eb upstream.
    
    In cec_devnode_init(), the debugfs directory created with
    debugfs_create_dir() is not removed if bus_register() fails.
    This leaves a stale "cec" entry in debugfs and prevents
    proper module reloading.
    
    Fix this by removing the debugfs directory in the error path.
    
    Fixes: a56960e8b406 ("[media] cec: add HDMI CEC framework (core)")
    Cc: [email protected]
    Signed-off-by: Haotian Zhang <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: dvb-usb: dtv5100: fix out-of-bounds in dtv5100_i2c_msg() [+ + +]

Author: Jeongjun Park <[email protected]>
Date:   Mon Apr 21 21:52:44 2025 +0900

    media: dvb-usb: dtv5100: fix out-of-bounds in dtv5100_i2c_msg()
    
    commit b91e6aafe8d356086cc621bc03e35ba2299e4788 upstream.
    
    rlen value is a user-controlled value, but dtv5100_i2c_msg() does not
    check the size of the rlen value. Therefore, if it is set to a value
    larger than sizeof(st->data), an out-of-bounds vuln occurs for st->data.
    
    Therefore, we need to add proper range checking to prevent this vuln.
    
    Fixes: 60688d5e6e6e ("V4L/DVB (8735): dtv5100: replace dummy frontend by zl10353")
    Cc: [email protected]
    Signed-off-by: Jeongjun Park <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: i2c: ADV7604: Remove redundant cancel_delayed_work in probe [+ + +]

Author: Duoming Zhou <[email protected]>
Date:   Tue Sep 2 09:53:37 2025 +0800

    media: i2c: ADV7604: Remove redundant cancel_delayed_work in probe
    
    commit 8f34f24355a607b98ecd9924837aab13c676eeca upstream.
    
    The delayed_work delayed_work_enable_hotplug is initialized with
    INIT_DELAYED_WORK() in adv76xx_probe(), but it is never scheduled
    anywhere in the probe function.
    
    Calling cancel_delayed_work() on a work that has never been
    scheduled is redundant and unnecessary, as there is no pending
    work to cancel.
    
    Remove the redundant cancel_delayed_work() from error handling
    path and adjust the goto label accordingly to simplify the code
    and avoid potential confusion.
    
    Fixes: 54450f591c99 ("[media] adv7604: driver for the Analog Devices ADV7604 video decoder")
    Cc: [email protected]
    Signed-off-by: Duoming Zhou <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: i2c: adv7842: Remove redundant cancel_delayed_work in probe [+ + +]

Author: Duoming Zhou <[email protected]>
Date:   Tue Sep 2 09:10:31 2025 +0800

    media: i2c: adv7842: Remove redundant cancel_delayed_work in probe
    
    commit e66a5cc606c58e72f18f9cdd868a3672e918f9f8 upstream.
    
    The delayed_work delayed_work_enable_hotplug is initialized with
    INIT_DELAYED_WORK() in adv7842_probe(), but it is never scheduled
    anywhere in the probe function.
    
    Calling cancel_delayed_work() on a work that has never been
    scheduled is redundant and unnecessary, as there is no pending
    work to cancel.
    
    Remove the redundant cancel_delayed_work() from error handling
    path and adjust the goto label accordingly to simplify the code
    and avoid potential confusion.
    
    Fixes: a89bcd4c6c20 ("[media] adv7842: add new video decoder driver")
    Cc: [email protected]
    Signed-off-by: Duoming Zhou <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: i2c: imx219: Fix 1920x1080 mode to use 1:1 pixel aspect ratio [+ + +]

Author: Dave Stevenson <[email protected]>
Date:   Mon Jan 5 17:10:32 2026 +0530

    media: i2c: imx219: Fix 1920x1080 mode to use 1:1 pixel aspect ratio
    
    commit 9ef6e4db152c34580cc52792f32485c193945395 upstream.
    
    Commit 0af46fbc333d ("media: i2c: imx219: Calculate crop rectangle
    dynamically") meant that the 1920x1080 mode switched from using no
    binning to using vertical binning but no horizontal binning, which
    resulted in stretched pixels.
    
    Until proper controls are available to independently select horizontal
    and vertical binning, restore the original 1:1 pixel aspect ratio by
    forcing binning to be uniform in both directions.
    
    Cc: [email protected]
    Fixes: 0af46fbc333d ("media: i2c: imx219: Calculate crop rectangle dynamically")
    Signed-off-by: Dave Stevenson <[email protected]>
    Reviewed-by: Jacopo Mondi <[email protected]>
    Signed-off-by: Sakari Ailus <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Jai Luthra <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: mediatek: vcodec: Fix a reference leak in mtk_vcodec_fw_vpu_init() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Mon Sep 15 20:09:38 2025 +0800

    media: mediatek: vcodec: Fix a reference leak in mtk_vcodec_fw_vpu_init()
    
    commit cdd0f118ef87db8a664fb5ea366fd1766d2df1cd upstream.
    
    vpu_get_plat_device() increases the reference count of the returned
    platform device. However, when devm_kzalloc() fails, the reference
    is not released, causing a reference leak.
    
    Fix this by calling put_device() on fw_pdev->dev before returning
    on the error path.
    
    Fixes: e25a89f743b1 ("media: mtk-vcodec: potential dereference of null pointer")
    Cc: [email protected]
    Signed-off-by: Haoxiang Li <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Reviewed-by: Tzung-Bi Shih <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: mediatek: vcodec: Use spinlock for context list protection lock [+ + +]

Author: Chen-Yu Tsai <[email protected]>
Date:   Mon Jan 5 18:31:58 2026 -0500

    media: mediatek: vcodec: Use spinlock for context list protection lock
    
    [ Upstream commit a5844227e0f030d2af2d85d4aed10c5eca6ca176 ]
    
    Previously a mutex was added to protect the encoder and decoder context
    lists from unexpected changes originating from the SCP IP block, causing
    the context pointer to go invalid, resulting in a NULL pointer
    dereference in the IPI handler.
    
    Turns out on the MT8173, the VPU IPI handler is called from hard IRQ
    context. This causes a big warning from the scheduler. This was first
    reported downstream on the ChromeOS kernels, but is also reproducible
    on mainline using Fluster with the FFmpeg v4l2m2m decoders. Even though
    the actual capture format is not supported, the affected code paths
    are triggered.
    
    Since this lock just protects the context list and operations on it are
    very fast, it should be OK to switch to a spinlock.
    
    Fixes: 6467cda18c9f ("media: mediatek: vcodec: adding lock to protect decoder context list")
    Fixes: afaaf3a0f647 ("media: mediatek: vcodec: adding lock to protect encoder context list")
    Cc: Yunfei Dong <[email protected]>
    Cc: [email protected]
    Signed-off-by: Chen-Yu Tsai <[email protected]>
    Reviewed-by: Fei Shao <[email protected]>
    Reviewed-by: Tomasz Figa <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    [ adapted file_to_dec_ctx() and file_to_enc_ctx() helper calls to equivalent fh_to_dec_ctx(file->private_data) and fh_to_enc_ctx(file->private_data) pattern ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: msp3400: Avoid possible out-of-bounds array accesses in msp3400c_thread() [+ + +]

Author: Ivan Abramov <[email protected]>
Date:   Wed Sep 3 02:28:14 2025 +0300

    media: msp3400: Avoid possible out-of-bounds array accesses in msp3400c_thread()
    
    commit d2bceb2e20e783d57e739c71e4e50b4b9f4a3953 upstream.
    
    It's possible for max1 to remain -1 if msp_read() always fail. This
    variable is further used as index for accessing arrays.
    
    Fix that by checking max1 prior to array accesses.
    
    It seems that restart is the preferable action in case of out-of-bounds
    value.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: 8a4b275f9c19 ("V4L/DVB (3427): audmode and rxsubchans fixes (VIDIOC_G/S_TUNER)")
    Cc: [email protected]
    Signed-off-by: Ivan Abramov <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: platform: mtk-mdp3: fix device leaks at probe [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Wed Sep 24 16:39:19 2025 +0200

    media: platform: mtk-mdp3: fix device leaks at probe
    
    commit 8f6f3aa21517ef34d50808af0c572e69580dca20 upstream.
    
    Make sure to drop the references taken when looking up the subsys
    devices during probe on probe failure (e.g. probe deferral) and on
    driver unbind.
    
    Similarly, drop the SCP device reference after retrieving its platform
    data during probe to avoid leaking it.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away.
    
    Fixes: 61890ccaefaf ("media: platform: mtk-mdp3: add MediaTek MDP3 driver")
    Cc: [email protected]      # 6.1
    Cc: Moudy Ho <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: pvrusb2: Fix incorrect variable used in trace message [+ + +]

Author: Colin Ian King <[email protected]>
Date:   Wed Sep 3 09:44:16 2025 +0100

    media: pvrusb2: Fix incorrect variable used in trace message
    
    commit be440980eace19c035a0745fd6b6e42707bc4f49 upstream.
    
    The pvr2_trace message is reporting an error about control read
    transfers, however it is using the incorrect variable write_len
    instead of read_lean. Fix this by using the correct variable
    read_len.
    
    Fixes: d855497edbfb ("V4L/DVB (4228a): pvrusb2 to kernel 2.6.18")
    Cc: [email protected]
    Signed-off-by: Colin Ian King <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: rc: st_rc: Fix reset control resource leak [+ + +]

Author: Haotian Zhang <[email protected]>
Date:   Fri Oct 31 14:03:32 2025 +0800

    media: rc: st_rc: Fix reset control resource leak
    
    commit 1240abf4b71f632f0117b056e22488e4d9808938 upstream.
    
    The driver calls reset_control_get_optional_exclusive() but never calls
    reset_control_put() in error paths or in the remove function. This causes
    a resource leak when probe fails after successfully acquiring the reset
    control, or when the driver is unloaded.
    
    Switch to devm_reset_control_get_optional_exclusive() to automatically
    manage the reset control resource.
    
    Fixes: a4b80242d046 ("media: st-rc: explicitly request exclusive reset control")
    Cc: [email protected]
    Signed-off-by: Haotian Zhang <[email protected]>
    Reviewed-by: Patrice Chotard <[email protected]>
    Signed-off-by: Sean Young <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: renesas: rcar_drif: fix device node reference leak in rcar_drif_bond_enabled [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Wed Sep 3 21:37:29 2025 +0800

    media: renesas: rcar_drif: fix device node reference leak in rcar_drif_bond_enabled
    
    commit 445e1658894fd74eab7e53071fa16233887574ed upstream.
    
    The function calls of_parse_phandle() which returns
    a device node with an incremented reference count. When the bonded device
    is not available, the function
    returns NULL without releasing the reference, causing a reference leak.
    
    Add of_node_put(np) to release the device node reference.
    The of_node_put function handles NULL pointers.
    
    Found through static analysis by reviewing the doc of of_parse_phandle()
    and cross-checking its usage patterns across the codebase.
    
    Fixes: 7625ee981af1 ("[media] media: platform: rcar_drif: Add DRIF support")
    Cc: [email protected]
    Signed-off-by: Miaoqian Lin <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Reviewed-by: Fabrizio Castro <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: samsung: exynos4-is: fix potential ABBA deadlock on init [+ + +]

Author: Marek Szyprowski <[email protected]>
Date:   Tue Oct 14 12:46:43 2025 +0200

    media: samsung: exynos4-is: fix potential ABBA deadlock on init
    
    commit 17dc8ccd6dd5ffe30aa9b0d36e2af1389344ce2b upstream.
    
    v4l2_device_register_subdev_nodes() must called without taking
    media_dev->graph_mutex to avoid potential AB-BA deadlock on further
    subdevice driver initialization.
    
    Fixes: fa91f1056f17 ("[media] exynos4-is: Add support for asynchronous subdevices registration")
    Cc: [email protected]
    Signed-off-by: Marek Szyprowski <[email protected]>
    Acked-by: Sylwester Nawrocki <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: TDA1997x: Remove redundant cancel_delayed_work in probe [+ + +]

Author: Duoming Zhou <[email protected]>
Date:   Mon Sep 1 21:26:17 2025 +0800

    media: TDA1997x: Remove redundant cancel_delayed_work in probe
    
    commit 29de195ca39fc2ac0af6fd45522994df9f431f80 upstream.
    
    The delayed_work delayed_work_enable_hpd is initialized with
    INIT_DELAYED_WORK(), but it is never scheduled in tda1997x_probe().
    
    Calling cancel_delayed_work() on a work that has never been
    scheduled is redundant and unnecessary, as there is no pending
    work to cancel.
    
    Remove the redundant cancel_delayed_work() from error handling
    path in tda1997x_probe() to avoid potential confusion.
    
    Fixes: 9ac0038db9a7 ("media: i2c: Add TDA1997x HDMI receiver driver")
    Cc: [email protected]
    Signed-off-by: Duoming Zhou <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: v4l2-mem2mem: Fix outdated documentation [+ + +]

Author: Laurent Pinchart <[email protected]>
Date:   Wed Oct 8 12:55:18 2025 +0300

    media: v4l2-mem2mem: Fix outdated documentation
    
    commit 082b86919b7a94de01d849021b4da820a6cb89dc upstream.
    
    Commit cbd9463da1b1 ("media: v4l2-mem2mem: Avoid calling .device_run in
    v4l2_m2m_job_finish") deferred calls to .device_run() to a work queue to
    avoid recursive calls when a job is finished right away from
    .device_run(). It failed to update the v4l2_m2m_job_finish()
    documentation that still states the function must not be called from
    .device_run(). Fix it.
    
    Fixes: cbd9463da1b1 ("media: v4l2-mem2mem: Avoid calling .device_run in v4l2_m2m_job_finish")
    Cc: [email protected]
    Signed-off-by: Laurent Pinchart <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: verisilicon: Fix CPU stalls on G2 bus error [+ + +]

Author: Nicolas Dufresne <[email protected]>
Date:   Mon Sep 22 14:43:38 2025 -0400

    media: verisilicon: Fix CPU stalls on G2 bus error
    
    commit 19c286b755072a22a063052f530a6b1fac8a1f63 upstream.
    
    In some seek stress tests, we are getting IRQ from the G2 decoder where
    the dec_bus_int and the dec_e bits are high, meaning the decoder is
    still running despite the error.
    
    Fix this by reworking the IRQ handler to only finish the job once we
    have reached completion and move the software reset to when our software
    watchdog triggers.
    
    This way, we let the hardware continue on errors when it did not self
    reset and in worse case scenario the hardware timeout will
    automatically stop it. The actual error will be fixed in a follow up
    patch.
    
    Fixes: 3385c514ecc5a ("media: hantro: Convert imx8m_vpu_g2_irq to helper")
    Cc: [email protected]
    Reviewed-by: Benjamin Gaignard <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: verisilicon: Protect G2 HEVC decoder against invalid DPB index [+ + +]

Author: Nicolas Dufresne <[email protected]>
Date:   Mon Sep 22 14:43:39 2025 -0400

    media: verisilicon: Protect G2 HEVC decoder against invalid DPB index
    
    commit 47825b1646a6a9eca0f90baa3d4f98947c2add96 upstream.
    
    Fix the Hantro G2 HEVC decoder so that we use DPB index 0 whenever a
    ninvalid index is received from user space. This protects the hardware
    from doing faulty memory access which then leads to bus errors.
    
    To be noted that when a reference is missing, userspace such as GStreamer
    passes an invalid DPB index of 255. This issue was found by seeking to a
    CRA picture using GStreamer. The framework is currently missing the code
    to skip over RASL pictures placed after the CRA. This situation can also
    occur while doing live streaming over lossy transport.
    
    Fixes: cb5dd5a0fa518 ("media: hantro: Introduce G2/HEVC decoder")
    Cc: [email protected]
    Reviewed-by: Benjamin Gaignard <[email protected]>
    Signed-off-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: videobuf2: Fix device reference leak in vb2_dc_alloc error path [+ + +]

Author: Haotian Zhang <[email protected]>
Date:   Tue Oct 28 14:44:43 2025 +0800

    media: videobuf2: Fix device reference leak in vb2_dc_alloc error path
    
    commit 94de23a9aa487d7c1372efb161721d7949a177ae upstream.
    
    In vb2_dc_alloc(), get_device() is called to increment the device
    reference count. However, if subsequent DMA allocation fails
    (vb2_dc_alloc_coherent or vb2_dc_alloc_non_coherent returns error),
    the function returns without calling put_device(), causing a device
    reference leak.
    
    Add put_device() call in the error path before kfree() to properly
    release the device reference acquired earlier.
    
    Fixes: de27891f675e ("media: videobuf2: handle non-contiguous DMA allocations")
    Cc: [email protected]
    Signed-off-by: Haotian Zhang <[email protected]>
    Reviewed-by: Marek Szyprowski <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: vidtv: initialize local pointers upon transfer of memory ownership [+ + +]

Author: Jeongjun Park <[email protected]>
Date:   Fri Sep 5 14:18:16 2025 +0900

    media: vidtv: initialize local pointers upon transfer of memory ownership
    
    commit 98aabfe2d79f74613abc2b0b1cef08f97eaf5322 upstream.
    
    vidtv_channel_si_init() creates a temporary list (program, service, event)
    and ownership of the memory itself is transferred to the PAT/SDT/EIT
    tables through vidtv_psi_pat_program_assign(),
    vidtv_psi_sdt_service_assign(), vidtv_psi_eit_event_assign().
    
    The problem here is that the local pointer where the memory ownership
    transfer was completed is not initialized to NULL. This causes the
    vidtv_psi_pmt_create_sec_for_each_pat_entry() function to fail, and
    in the flow that jumps to free_eit, the memory that was freed by
    vidtv_psi_*_table_destroy() can be accessed again by
    vidtv_psi_*_event_destroy() due to the uninitialized local pointer, so it
    is freed once again.
    
    Therefore, to prevent use-after-free and double-free vulnerability,
    local pointers must be initialized to NULL when transferring memory
    ownership.
    
    Cc: <[email protected]>
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=1d9c0edea5907af239e0
    Fixes: 3be8037960bc ("media: vidtv: add error checks")
    Signed-off-by: Jeongjun Park <[email protected]>
    Reviewed-by: Daniel Almeida <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: vpif_capture: fix section mismatch [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Oct 17 07:33:20 2025 +0200

    media: vpif_capture: fix section mismatch
    
    commit 0ef841113724166c3c484d0e9ae6db1eb5634fde upstream.
    
    Platform drivers can be probed after their init sections have been
    discarded (e.g. on probe deferral or manual rebind through sysfs) so the
    probe function must not live in init.
    
    Note that commit ffa1b391c61b ("V4L/DVB: vpif_cap/disp: Removed section
    mismatch warning") incorrectly suppressed the modpost warning.
    
    Fixes: ffa1b391c61b ("V4L/DVB: vpif_cap/disp: Removed section mismatch warning")
    Fixes: 6ffefff5a9e7 ("V4L/DVB (12906c): V4L : vpif capture driver for DM6467")
    Cc: [email protected]      # 2.6.32
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: vpif_display: fix section mismatch [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Oct 17 07:33:21 2025 +0200

    media: vpif_display: fix section mismatch
    
    commit 59ca64bf98e4209df8ace8057d31ae3c80f948cd upstream.
    
    Platform drivers can be probed after their init sections have been
    discarded (e.g. on probe deferral or manual rebind through sysfs) so the
    probe function must not live in init.
    
    Note that commit ffa1b391c61b ("V4L/DVB: vpif_cap/disp: Removed section
    mismatch warning") incorrectly suppressed the modpost warning.
    
    Fixes: ffa1b391c61b ("V4L/DVB: vpif_cap/disp: Removed section mismatch warning")
    Fixes: e7332e3a552f ("V4L/DVB (12176): davinci/vpif_display: Add VPIF display driver")
    Cc: [email protected]      # 2.6.32
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mei: gsc: add dependency on Xe driver [+ + +]

Author: Junxiao Chang <[email protected]>
Date:   Sun Nov 9 17:35:33 2025 +0200

    mei: gsc: add dependency on Xe driver
    
    commit 5d92c3b41f0bddfa416130c6e1b424414f3d2acf upstream.
    
    INTEL_MEI_GSC depends on either i915 or Xe
    and can be present when either of above is present.
    
    Cc: stable <[email protected]>
    Fixes: 87a4c85d3a3e ("drm/xe/gsc: add gsc device support")
    Tested-by: Baoli Zhang <[email protected]>
    Signed-off-by: Junxiao Chang <[email protected]>
    Signed-off-by: Alexander Usyskin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mfd: altera-sysmgr: Fix device leak on sysmgr regmap lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Sep 25 17:02:19 2025 +0200

    mfd: altera-sysmgr: Fix device leak on sysmgr regmap lookup
    
    commit ccb7cd3218e48665f3c7e19eede0da5f069c323d upstream.
    
    Make sure to drop the reference taken to the sysmgr platform device when
    retrieving its driver data.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away.
    
    Fixes: f36e789a1f8d ("mfd: altera-sysmgr: Add SOCFPGA System Manager")
    Cc: [email protected]      # 5.2
    Signed-off-by: Johan Hovold <[email protected]>
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mfd: max77620: Fix potential IRQ chip conflict when probing two devices [+ + +]

Author: Krzysztof Kozlowski <[email protected]>
Date:   Thu Oct 23 12:19:40 2025 +0200

    mfd: max77620: Fix potential IRQ chip conflict when probing two devices
    
    commit 2bac49bad1f3553cc3b3bfb22cc194e9bd9e8427 upstream.
    
    MAX77620 is most likely always a single device on the board, however
    nothing stops board designers to have two of them, thus same device
    driver could probe twice. Or user could manually try to probing second
    time.
    
    Device driver is not ready for that case, because it allocates
    statically 'struct regmap_irq_chip' as non-const and stores during
    probe in 'irq_drv_data' member a pointer to per-probe state
    container ('struct max77620_chip').  devm_regmap_add_irq_chip() does not
    make a copy of 'struct regmap_irq_chip' but store the pointer.
    
    Second probe - either successful or failure - would overwrite the
    'irq_drv_data' from previous device probe, so interrupts would be
    executed in a wrong context.
    
    Cc: [email protected]
    Fixes: 3df140d11c6d ("mfd: max77620: Mask/unmask interrupt before/after servicing it")
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

MIPS: Fix a reference leak bug in ip22_check_gio() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Thu Dec 4 18:36:18 2025 +0800

    MIPS: Fix a reference leak bug in ip22_check_gio()
    
    [ Upstream commit 680ad315caaa2860df411cb378bf3614d96c7648 ]
    
    If gio_device_register fails, gio_dev_put() is required to
    drop the gio_dev device reference.
    
    Fixes: e84de0c61905 ("MIPS: GIO bus support for SGI IP22/28")
    Signed-off-by: Haoxiang Li <[email protected]>
    Signed-off-by: Thomas Bogendoerfer <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

MIPS: ftrace: Fix memory corruption when kernel is located beyond 32 bits [+ + +]

Author: Gregory CLEMENT <[email protected]>
Date:   Fri Nov 28 09:30:06 2025 +0100

    MIPS: ftrace: Fix memory corruption when kernel is located beyond 32 bits
    
    [ Upstream commit 36dac9a3dda1f2bae343191bc16b910c603cac25 ]
    
    Since commit e424054000878 ("MIPS: Tracing: Reduce the overhead of
    dynamic Function Tracer"), the macro UASM_i_LA_mostly has been used,
    and this macro can generate more than 2 instructions. At the same
    time, the code in ftrace assumes that no more than 2 instructions can
    be generated, which is why it stores them in an int[2] array. However,
    as previously noted, the macro UASM_i_LA_mostly (and now UASM_i_LA)
    causes a buffer overflow when _mcount is beyond 32 bits. This leads to
    corruption of the variables located in the __read_mostly section.
    
    This corruption was observed because the variable
    __cpu_primary_thread_mask was corrupted, causing a hang very early
    during boot.
    
    This fix prevents the corruption by avoiding the generation of
    instructions if they could exceed 2 instructions in
    length. Fortunately, insn_la_mcount is only used if the instrumented
    code is located outside the kernel code section, so dynamic ftrace can
    still be used, albeit in a more limited scope. This is still
    preferable to corrupting memory and/or crashing the kernel.
    
    Signed-off-by: Gregory CLEMENT <[email protected]>
    Signed-off-by: Thomas Bogendoerfer <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mlxsw: spectrum_mr: Fix use-after-free when updating multicast route stats [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Tue Dec 2 18:44:13 2025 +0100

    mlxsw: spectrum_mr: Fix use-after-free when updating multicast route stats
    
    [ Upstream commit 8ac1dacec458f55f871f7153242ed6ab60373b90 ]
    
    Cited commit added a dedicated mutex (instead of RTNL) to protect the
    multicast route list, so that it will not change while the driver
    periodically traverses it in order to update the kernel about multicast
    route stats that were queried from the device.
    
    One instance of list entry deletion (during route replace) was missed
    and it can result in a use-after-free [1].
    
    Fix by acquiring the mutex before deleting the entry from the list and
    releasing it afterwards.
    
    [1]
    BUG: KASAN: slab-use-after-free in mlxsw_sp_mr_stats_update+0x4a5/0x540 drivers/net/ethernet/mellanox/mlxsw/spectrum_mr.c:1006 [mlxsw_spectrum]
    Read of size 8 at addr ffff8881523c2fa8 by task kworker/2:5/22043
    
    CPU: 2 UID: 0 PID: 22043 Comm: kworker/2:5 Not tainted 6.18.0-rc1-custom-g1a3d6d7cd014 #1 PREEMPT(full)
    Hardware name: Mellanox Technologies Ltd. MSN2010/SA002610, BIOS 5.6.5 08/24/2017
    Workqueue: mlxsw_core mlxsw_sp_mr_stats_update [mlxsw_spectrum]
    Call Trace:
     <TASK>
     dump_stack_lvl+0xba/0x110
     print_report+0x174/0x4f5
     kasan_report+0xdf/0x110
     mlxsw_sp_mr_stats_update+0x4a5/0x540 drivers/net/ethernet/mellanox/mlxsw/spectrum_mr.c:1006 [mlxsw_spectrum]
     process_one_work+0x9cc/0x18e0
     worker_thread+0x5df/0xe40
     kthread+0x3b8/0x730
     ret_from_fork+0x3e9/0x560
     ret_from_fork_asm+0x1a/0x30
     </TASK>
    
    Allocated by task 29933:
     kasan_save_stack+0x30/0x50
     kasan_save_track+0x14/0x30
     __kasan_kmalloc+0x8f/0xa0
     mlxsw_sp_mr_route_add+0xd8/0x4770 [mlxsw_spectrum]
     mlxsw_sp_router_fibmr_event_work+0x371/0xad0 drivers/net/ethernet/mellanox/mlxsw/spectrum_router.c:7965 [mlxsw_spectrum]
     process_one_work+0x9cc/0x18e0
     worker_thread+0x5df/0xe40
     kthread+0x3b8/0x730
     ret_from_fork+0x3e9/0x560
     ret_from_fork_asm+0x1a/0x30
    
    Freed by task 29933:
     kasan_save_stack+0x30/0x50
     kasan_save_track+0x14/0x30
     __kasan_save_free_info+0x3b/0x70
     __kasan_slab_free+0x43/0x70
     kfree+0x14e/0x700
     mlxsw_sp_mr_route_add+0x2dea/0x4770 drivers/net/ethernet/mellanox/mlxsw/spectrum_mr.c:444 [mlxsw_spectrum]
     mlxsw_sp_router_fibmr_event_work+0x371/0xad0 drivers/net/ethernet/mellanox/mlxsw/spectrum_router.c:7965 [mlxsw_spectrum]
     process_one_work+0x9cc/0x18e0
     worker_thread+0x5df/0xe40
     kthread+0x3b8/0x730
     ret_from_fork+0x3e9/0x560
     ret_from_fork_asm+0x1a/0x30
    
    Fixes: f38656d06725 ("mlxsw: spectrum_mr: Protect multicast route list with a lock")
    Signed-off-by: Ido Schimmel <[email protected]>
    Reviewed-by: Petr Machata <[email protected]>
    Signed-off-by: Petr Machata <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/f996feecfd59fde297964bfc85040b6d83ec6089.1764695650.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mlxsw: spectrum_router: Fix neighbour use-after-free [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Tue Dec 2 18:44:12 2025 +0100

    mlxsw: spectrum_router: Fix neighbour use-after-free
    
    [ Upstream commit 8b0e69763ef948fb872a7767df4be665d18f5fd4 ]
    
    We sometimes observe use-after-free when dereferencing a neighbour [1].
    The problem seems to be that the driver stores a pointer to the
    neighbour, but without holding a reference on it. A reference is only
    taken when the neighbour is used by a nexthop.
    
    Fix by simplifying the reference counting scheme. Always take a
    reference when storing a neighbour pointer in a neighbour entry. Avoid
    taking a referencing when the neighbour is used by a nexthop as the
    neighbour entry associated with the nexthop already holds a reference.
    
    Tested by running the test that uncovered the problem over 300 times.
    Without this patch the problem was reproduced after a handful of
    iterations.
    
    [1]
    BUG: KASAN: slab-use-after-free in mlxsw_sp_neigh_entry_update+0x2d4/0x310
    Read of size 8 at addr ffff88817f8e3420 by task ip/3929
    
    CPU: 3 UID: 0 PID: 3929 Comm: ip Not tainted 6.18.0-rc4-virtme-g36b21a067510 #3 PREEMPT(full)
    Hardware name: Nvidia SN5600/VMOD0013, BIOS 5.13 05/31/2023
    Call Trace:
     <TASK>
     dump_stack_lvl+0x6f/0xa0
     print_address_description.constprop.0+0x6e/0x300
     print_report+0xfc/0x1fb
     kasan_report+0xe4/0x110
     mlxsw_sp_neigh_entry_update+0x2d4/0x310
     mlxsw_sp_router_rif_gone_sync+0x35f/0x510
     mlxsw_sp_rif_destroy+0x1ea/0x730
     mlxsw_sp_inetaddr_port_vlan_event+0xa1/0x1b0
     __mlxsw_sp_inetaddr_lag_event+0xcc/0x130
     __mlxsw_sp_inetaddr_event+0xf5/0x3c0
     mlxsw_sp_router_netdevice_event+0x1015/0x1580
     notifier_call_chain+0xcc/0x150
     call_netdevice_notifiers_info+0x7e/0x100
     __netdev_upper_dev_unlink+0x10b/0x210
     netdev_upper_dev_unlink+0x79/0xa0
     vrf_del_slave+0x18/0x50
     do_set_master+0x146/0x7d0
     do_setlink.isra.0+0x9a0/0x2880
     rtnl_newlink+0x637/0xb20
     rtnetlink_rcv_msg+0x6fe/0xb90
     netlink_rcv_skb+0x123/0x380
     netlink_unicast+0x4a3/0x770
     netlink_sendmsg+0x75b/0xc90
     __sock_sendmsg+0xbe/0x160
     ____sys_sendmsg+0x5b2/0x7d0
     ___sys_sendmsg+0xfd/0x180
     __sys_sendmsg+0x124/0x1c0
     do_syscall_64+0xbb/0xfd0
     entry_SYSCALL_64_after_hwframe+0x4b/0x53
    [...]
    
    Allocated by task 109:
     kasan_save_stack+0x30/0x50
     kasan_save_track+0x14/0x30
     __kasan_kmalloc+0x7b/0x90
     __kmalloc_noprof+0x2c1/0x790
     neigh_alloc+0x6af/0x8f0
     ___neigh_create+0x63/0xe90
     mlxsw_sp_nexthop_neigh_init+0x430/0x7e0
     mlxsw_sp_nexthop_type_init+0x212/0x960
     mlxsw_sp_nexthop6_group_info_init.constprop.0+0x81f/0x1280
     mlxsw_sp_nexthop6_group_get+0x392/0x6a0
     mlxsw_sp_fib6_entry_create+0x46a/0xfd0
     mlxsw_sp_router_fib6_replace+0x1ed/0x5f0
     mlxsw_sp_router_fib6_event_work+0x10a/0x2a0
     process_one_work+0xd57/0x1390
     worker_thread+0x4d6/0xd40
     kthread+0x355/0x5b0
     ret_from_fork+0x1d4/0x270
     ret_from_fork_asm+0x11/0x20
    
    Freed by task 154:
     kasan_save_stack+0x30/0x50
     kasan_save_track+0x14/0x30
     __kasan_save_free_info+0x3b/0x60
     __kasan_slab_free+0x43/0x70
     kmem_cache_free_bulk.part.0+0x1eb/0x5e0
     kvfree_rcu_bulk+0x1f2/0x260
     kfree_rcu_work+0x130/0x1b0
     process_one_work+0xd57/0x1390
     worker_thread+0x4d6/0xd40
     kthread+0x355/0x5b0
     ret_from_fork+0x1d4/0x270
     ret_from_fork_asm+0x11/0x20
    
    Last potentially related work creation:
     kasan_save_stack+0x30/0x50
     kasan_record_aux_stack+0x8c/0xa0
     kvfree_call_rcu+0x93/0x5b0
     mlxsw_sp_router_neigh_event_work+0x67d/0x860
     process_one_work+0xd57/0x1390
     worker_thread+0x4d6/0xd40
     kthread+0x355/0x5b0
     ret_from_fork+0x1d4/0x270
     ret_from_fork_asm+0x11/0x20
    
    Fixes: 6cf3c971dc84 ("mlxsw: spectrum_router: Add private neigh table")
    Signed-off-by: Ido Schimmel <[email protected]>
    Reviewed-by: Petr Machata <[email protected]>
    Signed-off-by: Petr Machata <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/92d75e21d95d163a41b5cea67a15cd33f547cba6.1764695650.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mlxsw: spectrum_router: Fix possible neighbour reference count leak [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Tue Dec 2 18:44:11 2025 +0100

    mlxsw: spectrum_router: Fix possible neighbour reference count leak
    
    [ Upstream commit b6b638bda240395dff49a87403b2e32493e56d2a ]
    
    mlxsw_sp_router_schedule_work() takes a reference on a neighbour,
    expecting a work item to release it later on. However, we might fail to
    schedule the work item, in which case the neighbour reference count will
    be leaked.
    
    Fix by taking the reference just before scheduling the work item. Note
    that mlxsw_sp_router_schedule_work() can receive a NULL neighbour
    pointer, but neigh_clone() handles that correctly.
    
    Spotted during code review, did not actually observe the reference count
    leak.
    
    Fixes: 151b89f6025a ("mlxsw: spectrum_router: Reuse work neighbor initialization in work scheduler")
    Reviewed-by: Petr Machata <[email protected]>
    Signed-off-by: Ido Schimmel <[email protected]>
    Signed-off-by: Petr Machata <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/ec2934ae4aca187a8d8c9329a08ce93cca411378.1764695650.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mm/balloon_compaction: convert balloon_page_delete() to balloon_page_finalize() [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Mon Jan 5 12:42:04 2026 -0500

    mm/balloon_compaction: convert balloon_page_delete() to balloon_page_finalize()
    
    [ Upstream commit 15504b1163007bbfbd9a63460d5c14737c16e96d ]
    
    Let's move the removal of the page from the balloon list into the single
    caller, to remove the dependency on the PG_isolated flag and clarify
    locking requirements.
    
    Note that for now, balloon_page_delete() was used on two paths:
    
    (1) Removing a page from the balloon for deflation through
        balloon_page_list_dequeue()
    (2) Removing an isolated page from the balloon for migration in the
        per-driver migration handlers. Isolated pages were already removed from
        the balloon list during isolation.
    
    So instead of relying on the flag, we can just distinguish both cases
    directly and handle it accordingly in the caller.
    
    We'll shuffle the operations a bit such that they logically make more
    sense (e.g., remove from the list before clearing flags).
    
    In balloon migration functions we can now move the balloon_page_finalize()
    out of the balloon lock and perform the finalization just before dropping
    the balloon reference.
    
    Document that the page lock is currently required when modifying the
    movability aspects of a page; hopefully we can soon decouple this from the
    page lock.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: David Hildenbrand <[email protected]>
    Reviewed-by: Lorenzo Stoakes <[email protected]>
    Cc: Alistair Popple <[email protected]>
    Cc: Al Viro <[email protected]>
    Cc: Arnd Bergmann <[email protected]>
    Cc: Brendan Jackman <[email protected]>
    Cc: Byungchul Park <[email protected]>
    Cc: Chengming Zhou <[email protected]>
    Cc: Christian Brauner <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Eugenio Pé rez <[email protected]>
    Cc: Greg Kroah-Hartman <[email protected]>
    Cc: Gregory Price <[email protected]>
    Cc: Harry Yoo <[email protected]>
    Cc: "Huang, Ying" <[email protected]>
    Cc: Jan Kara <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: Jason Wang <[email protected]>
    Cc: Jerrin Shaji George <[email protected]>
    Cc: Johannes Weiner <[email protected]>
    Cc: John Hubbard <[email protected]>
    Cc: Jonathan Corbet <[email protected]>
    Cc: Joshua Hahn <[email protected]>
    Cc: Liam Howlett <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Mathew Brost <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Miaohe Lin <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: "Michael S. Tsirkin" <[email protected]>
    Cc: Michal Hocko <[email protected]>
    Cc: Mike Rapoport <[email protected]>
    Cc: Minchan Kim <[email protected]>
    Cc: Naoya Horiguchi <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Oscar Salvador <[email protected]>
    Cc: Peter Xu <[email protected]>
    Cc: Qi Zheng <[email protected]>
    Cc: Rakie Kim <[email protected]>
    Cc: Rik van Riel <[email protected]>
    Cc: Sergey Senozhatsky <[email protected]>
    Cc: Shakeel Butt <[email protected]>
    Cc: Suren Baghdasaryan <[email protected]>
    Cc: Vlastimil Babka <[email protected]>
    Cc: Xuan Zhuo <[email protected]>
    Cc: xu xin <[email protected]>
    Cc: Zi Yan <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Stable-dep-of: 0da2ba35c0d5 ("powerpc/pseries/cmm: adjust BALLOON_MIGRATE when migrating pages")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/balloon_compaction: we cannot have isolated pages in the balloon list [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Mon Jan 5 12:42:03 2026 -0500

    mm/balloon_compaction: we cannot have isolated pages in the balloon list
    
    [ Upstream commit fb05f992b6bbb4702307d96f00703ee637b24dbf ]
    
    Patch series "mm/migration: rework movable_ops page migration (part 1)",
    v2.
    
    In the future, as we decouple "struct page" from "struct folio", pages
    that support "non-lru page migration" -- movable_ops page migration such
    as memory balloons and zsmalloc -- will no longer be folios.  They will
    not have ->mapping, ->lru, and likely no refcount and no page lock.  But
    they will have a type and flags 🙂
    
    This is the first part (other parts not written yet) of decoupling
    movable_ops page migration from folio migration.
    
    In this series, we get rid of the ->mapping usage, and start cleaning up
    the code + separating it from folio migration.
    
    Migration core will have to be further reworked to not treat movable_ops
    pages like folios.  This is the first step into that direction.
    
    This patch (of 29):
    
    The core will set PG_isolated only after mops->isolate_page() was called.
    In case of the balloon, that is where we will remove it from the balloon
    list.  So we cannot have isolated pages in the balloon list.
    
    Let's drop this unnecessary check.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: David Hildenbrand <[email protected]>
    Acked-by: Zi Yan <[email protected]>
    Reviewed-by: Lorenzo Stoakes <[email protected]>
    Cc: Alistair Popple <[email protected]>
    Cc: Al Viro <[email protected]>
    Cc: Arnd Bergmann <[email protected]>
    Cc: Brendan Jackman <[email protected]>
    Cc: Byungchul Park <[email protected]>
    Cc: Chengming Zhou <[email protected]>
    Cc: Christian Brauner <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Eugenio Pé rez <[email protected]>
    Cc: Greg Kroah-Hartman <[email protected]>
    Cc: Gregory Price <[email protected]>
    Cc: "Huang, Ying" <[email protected]>
    Cc: Jan Kara <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: Jason Wang <[email protected]>
    Cc: Jerrin Shaji George <[email protected]>
    Cc: Johannes Weiner <[email protected]>
    Cc: John Hubbard <[email protected]>
    Cc: Jonathan Corbet <[email protected]>
    Cc: Joshua Hahn <[email protected]>
    Cc: Liam Howlett <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Mathew Brost <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Miaohe Lin <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: "Michael S. Tsirkin" <[email protected]>
    Cc: Michal Hocko <[email protected]>
    Cc: Mike Rapoport <[email protected]>
    Cc: Minchan Kim <[email protected]>
    Cc: Naoya Horiguchi <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Oscar Salvador <[email protected]>
    Cc: Peter Xu <[email protected]>
    Cc: Qi Zheng <[email protected]>
    Cc: Rakie Kim <[email protected]>
    Cc: Rik van Riel <[email protected]>
    Cc: Sergey Senozhatsky <[email protected]>
    Cc: Shakeel Butt <[email protected]>
    Cc: Suren Baghdasaryan <[email protected]>
    Cc: Vlastimil Babka <[email protected]>
    Cc: Xuan Zhuo <[email protected]>
    Cc: xu xin <[email protected]>
    Cc: Harry Yoo <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Stable-dep-of: 0da2ba35c0d5 ("powerpc/pseries/cmm: adjust BALLOON_MIGRATE when migrating pages")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failres in damon_test_new_filter() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:07 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failres in damon_test_new_filter()
    
    commit 28ab2265e9422ccd81e4beafc0ace90f78de04c4 upstream.
    
    damon_test_new_filter() is assuming all dynamic memory allocation in it
    will succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 2a158e956b98 ("mm/damon/core-test: add a test for damos_new_filter()")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [6.6+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: SeongJae Park <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failure on damon_test_set_attrs() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:06 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failure on damon_test_set_attrs()
    
    commit 915a2453d824a9b6bf724e3f970d86ae1d092a61 upstream.
    
    damon_test_set_attrs() is assuming all dynamic memory allocation in it
    will succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: aa13779be6b7 ("mm/damon/core-test: add a test for damon_set_attrs()")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [6.5+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures in damon_test_ops_registration() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:03 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures in damon_test_ops_registration()
    
    commit 4f835f4e8c863985f15abd69db033c2f66546094 upstream.
    
    damon_test_ops_registration() is assuming all dynamic memory allocation in
    it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 4f540f5ab4f2 ("mm/damon/core-test: add a kunit test case for ops registration")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.19+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures in damon_test_set_regions() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:04 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures in damon_test_set_regions()
    
    commit 74d5969995d129fd59dd93b9c7daa6669cb6810f upstream.
    
    damon_test_set_regions() is assuming all dynamic memory allocation in it
    will succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 62f409560eb2 ("mm/damon/core-test: test damon_set_regions")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [6.1+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures in damon_test_update_monitoring_result() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:05 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures in damon_test_update_monitoring_result()
    
    commit 8cf298c01b7fdb08eef5b6b26d0fe98d48134d72 upstream.
    
    damon_test_update_monitoring_result() is assuming all dynamic memory
    allocation in it will succeed.  Those are indeed likely in the real use
    cases since those allocations are too small to fail, but theoretically
    those could fail.  In the case, inappropriate memory access can happen.
    Fix it by appropriately cleanup pre-allocated memory and skip the
    execution of the remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: f4c978b6594b ("mm/damon/core-test: add a test for damon_update_monitoring_results()")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [6.3+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures on damon_test_merge_two() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:00 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures on damon_test_merge_two()
    
    commit 3d443dd29a1db7efa587a4bb0c06a497e13ca9e4 upstream.
    
    damon_test_merge_two() is assuming all dynamic memory allocation in it
    will succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures on damon_test_split_at() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:19:59 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures on damon_test_split_at()
    
    commit 5e80d73f22043c59c8ad36452a3253937ed77955 upstream.
    
    damon_test_split_at() is assuming all dynamic memory allocation in it will
    succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures on damon_test_split_regions_of() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:02 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures on damon_test_split_regions_of()
    
    commit eded254cb69044bd4abde87394ea44909708d7c0 upstream.
    
    damon_test_split_regions_of() is assuming all dynamic memory allocation in
    it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: SeongJae Park <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle alloc failures on dasmon_test_merge_regions_of() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:01 2025 -0700

    mm/damon/tests/core-kunit: handle alloc failures on dasmon_test_merge_regions_of()
    
    commit 0998d2757218771c59d5ca59ccf13d1542a38f17 upstream.
    
    damon_test_merge_regions_of() is assuming all dynamic memory allocation in
    it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle allocation failures in damon_test_regions() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:19:56 2025 -0700

    mm/damon/tests/core-kunit: handle allocation failures in damon_test_regions()
    
    commit e16fdd4f754048d6e23c56bd8d920b71e41e3777 upstream.
    
    damon_test_regions() is assuming all dynamic memory allocation in it will
    succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle memory alloc failure from damon_test_aggregate() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:19:58 2025 -0700

    mm/damon/tests/core-kunit: handle memory alloc failure from damon_test_aggregate()
    
    commit f79f2fc44ebd0ed655239046be3e80e8804b5545 upstream.
    
    damon_test_aggregate() is assuming all dynamic memory allocation in it
    will succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/core-kunit: handle memory failure from damon_test_target() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:19:57 2025 -0700

    mm/damon/tests/core-kunit: handle memory failure from damon_test_target()
    
    commit fafe953de2c661907c94055a2497c6b8dbfd26f3 upstream.
    
    damon_test_target() is assuming all dynamic memory allocation in it will
    succeed.  Those are indeed likely in the real use cases since those
    allocations are too small to fail, but theoretically those could fail.  In
    the case, inappropriate memory access can happen.  Fix it by appropriately
    cleanup pre-allocated memory and skip the execution of the remaining tests
    in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/sysfs-kunit: handle alloc failures on damon_sysfs_test_add_targets() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:14 2025 -0700

    mm/damon/tests/sysfs-kunit: handle alloc failures on damon_sysfs_test_add_targets()
    
    commit 7d808bf13943f4c6a6142400bffe14267f6dc997 upstream.
    
    damon_sysfs_test_add_targets() is assuming all dynamic memory allocation
    in it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: b8ee5575f763 ("mm/damon/sysfs-test: add a unit test for damon_sysfs_set_targets()")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [6.7+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/vaddr-kunit: handle alloc failures in damon_test_split_evenly_fail() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:12 2025 -0700

    mm/damon/tests/vaddr-kunit: handle alloc failures in damon_test_split_evenly_fail()
    
    commit 7890e5b5bb6e386155c6e755fe70e0cdcc77f18e upstream.
    
    damon_test_split_evenly_fail() is assuming all dynamic memory allocation
    in it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/vaddr-kunit: handle alloc failures on damon_do_test_apply_three_regions() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:11 2025 -0700

    mm/damon/tests/vaddr-kunit: handle alloc failures on damon_do_test_apply_three_regions()
    
    commit 2b22d0fcc6320ba29b2122434c1d2f0785fb0a25 upstream.
    
    damon_do_test_apply_three_regions() is assuming all dynamic memory
    allocation in it will succeed.  Those are indeed likely in the real use
    cases since those allocations are too small to fail, but theoretically
    those could fail.  In the case, inappropriate memory access can happen.
    Fix it by appropriately cleanup pre-allocated memory and skip the
    execution of the remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: SeongJae Park <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/damon/tests/vaddr-kunit: handle alloc failures on damon_test_split_evenly_succ() [+ + +]

Author: SeongJae Park <[email protected]>
Date:   Sat Nov 1 11:20:13 2025 -0700

    mm/damon/tests/vaddr-kunit: handle alloc failures on damon_test_split_evenly_succ()
    
    commit 0a63a0e7570b9b2631dfb8d836dc572709dce39e upstream.
    
    damon_test_split_evenly_succ() is assuming all dynamic memory allocation
    in it will succeed.  Those are indeed likely in the real use cases since
    those allocations are too small to fail, but theoretically those could
    fail.  In the case, inappropriate memory access can happen.  Fix it by
    appropriately cleanup pre-allocated memory and skip the execution of the
    remaining tests in the failure cases.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 17ccae8bb5c9 ("mm/damon: add kunit tests")
    Signed-off-by: SeongJae Park <[email protected]>
    Cc: Brendan Higgins <[email protected]>
    Cc: David Gow <[email protected]>
    Cc: Kefeng Wang <[email protected]>
    Cc: <[email protected]>    [5.15+]
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/kasan: fix incorrect unpoisoning in vrealloc for KASAN [+ + +]

Author: Jiayuan Chen <[email protected]>
Date:   Thu Dec 4 18:59:55 2025 +0000

    mm/kasan: fix incorrect unpoisoning in vrealloc for KASAN
    
    commit 007f5da43b3d0ecff972e2616062b8da1f862f5e upstream.
    
    Patch series "kasan: vmalloc: Fixes for the percpu allocator and
    vrealloc", v3.
    
    Patches fix two issues related to KASAN and vmalloc.
    
    The first one, a KASAN tag mismatch, possibly resulting in a kernel panic,
    can be observed on systems with a tag-based KASAN enabled and with
    multiple NUMA nodes.  Initially it was only noticed on x86 [1] but later a
    similar issue was also reported on arm64 [2].
    
    Specifically the problem is related to how vm_structs interact with
    pcpu_chunks - both when they are allocated, assigned and when pcpu_chunk
    addresses are derived.
    
    When vm_structs are allocated they are unpoisoned, each with a different
    random tag, if vmalloc support is enabled along the KASAN mode.  Later
    when first pcpu chunk is allocated it gets its 'base_addr' field set to
    the first allocated vm_struct.  With that it inherits that vm_struct's
    tag.
    
    When pcpu_chunk addresses are later derived (by pcpu_chunk_addr(), for
    example in pcpu_alloc_noprof()) the base_addr field is used and offsets
    are added to it.  If the initial conditions are satisfied then some of the
    offsets will point into memory allocated with a different vm_struct.  So
    while the lower bits will get accurately derived the tag bits in the top
    of the pointer won't match the shadow memory contents.
    
    The solution (proposed at v2 of the x86 KASAN series [3]) is to unpoison
    the vm_structs with the same tag when allocating them for the per cpu
    allocator (in pcpu_get_vm_areas()).
    
    The second one reported by syzkaller [4] is related to vrealloc and
    happens because of random tag generation when unpoisoning memory without
    allocating new pages.  This breaks shadow memory tracking and needs to
    reuse the existing tag instead of generating a new one.  At the same time
    an inconsistency in used flags is corrected.
    
    
    This patch (of 3):
    
    Syzkaller reported a memory out-of-bounds bug [4].  This patch fixes two
    issues:
    
    1. In vrealloc the KASAN_VMALLOC_VM_ALLOC flag is missing when
       unpoisoning the extended region. This flag is required to correctly
       associate the allocation with KASAN's vmalloc tracking.
    
       Note: In contrast, vzalloc (via __vmalloc_node_range_noprof)
       explicitly sets KASAN_VMALLOC_VM_ALLOC and calls
       kasan_unpoison_vmalloc() with it.  vrealloc must behave consistently --
       especially when reusing existing vmalloc regions -- to ensure KASAN can
       track allocations correctly.
    
    2. When vrealloc reuses an existing vmalloc region (without allocating
       new pages) KASAN generates a new tag, which breaks tag-based memory
       access tracking.
    
    Introduce KASAN_VMALLOC_KEEP_TAG, a new KASAN flag that allows reusing the
    tag already attached to the pointer, ensuring consistent tag behavior
    during reallocation.
    
    Pass KASAN_VMALLOC_KEEP_TAG and KASAN_VMALLOC_VM_ALLOC to the
    kasan_unpoison_vmalloc inside vrealloc_node_align_noprof().
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/38dece0a4074c43e48150d1e242f8242c73bf1a5.1764874575.git.m.wieczorretman@pm.me
    Link: https://lore.kernel.org/all/e7e04692866d02e6d3b32bb43b998e5d17092ba4.1738686764.git.maciej.wieczor-retman@intel.com/ [1]
    Link: https://lore.kernel.org/all/aMUrW1Znp1GEj7St@MiWiFi-R3L-srv/ [2]
    Link: https://lore.kernel.org/all/CAPAsAGxDRv_uFeMYu9TwhBVWHCCtkSxoWY4xmFB_vowMbi8raw@mail.gmail.com/ [3]
    Link: https://syzkaller.appspot.com/bug?extid=997752115a851cb0cf36 [4]
    Fixes: a0309faf1cb0 ("mm: vmalloc: support more granular vrealloc() sizing")
    Signed-off-by: Jiayuan Chen <[email protected]>
    Co-developed-by: Maciej Wieczor-Retman <[email protected]>
    Signed-off-by: Maciej Wieczor-Retman <[email protected]>
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/T/
    Reviewed-by: Andrey Konovalov <[email protected]>
    Cc: Alexander Potapenko <[email protected]>
    Cc: Andrey Ryabinin <[email protected]>
    Cc: Danilo Krummrich <[email protected]>
    Cc: Dmitriy Vyukov <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: "Uladzislau Rezki (Sony)" <[email protected]>
    Cc: Vincenzo Frascino <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/ksm: fix exec/fork inheritance support for prctl [+ + +]

Author: xu xin <[email protected]>
Date:   Mon Dec 29 21:49:37 2025 -0500

    mm/ksm: fix exec/fork inheritance support for prctl
    
    [ Upstream commit 590c03ca6a3fbb114396673314e2aa483839608b ]
    
    Patch series "ksm: fix exec/fork inheritance", v2.
    
    This series fixes exec/fork inheritance.  See the detailed description of
    the issue below.
    
    This patch (of 2):
    
    Background
    ==========
    
    commit d7597f59d1d33 ("mm: add new api to enable ksm per process")
    introduced MMF_VM_MERGE_ANY for mm->flags, and allowed user to set it by
    prctl() so that the process's VMAs are forcibly scanned by ksmd.
    
    Subsequently, the 3c6f33b7273a ("mm/ksm: support fork/exec for prctl")
    supported inheriting the MMF_VM_MERGE_ANY flag when a task calls execve().
    
    Finally, commit 3a9e567ca45fb ("mm/ksm: fix ksm exec support for prctl")
    fixed the issue that ksmd doesn't scan the mm_struct with MMF_VM_MERGE_ANY
    by adding the mm_slot to ksm_mm_head in __bprm_mm_init().
    
    Problem
    =======
    
    In some extreme scenarios, however, this inheritance of MMF_VM_MERGE_ANY
    during exec/fork can fail.  For example, when the scanning frequency of
    ksmd is tuned extremely high, a process carrying MMF_VM_MERGE_ANY may
    still fail to pass it to the newly exec'd process.  This happens because
    ksm_execve() is executed too early in the do_execve flow (prematurely
    adding the new mm_struct to the ksm_mm_slot list).
    
    As a result, before do_execve completes, ksmd may have already performed a
    scan and found that this new mm_struct has no VM_MERGEABLE VMAs, thus
    clearing its MMF_VM_MERGE_ANY flag.  Consequently, when the new program
    executes, the flag MMF_VM_MERGE_ANY inheritance missed.
    
    Root reason
    ===========
    
    commit d7597f59d1d33 ("mm: add new api to enable ksm per process") clear
    the flag MMF_VM_MERGE_ANY when ksmd found no VM_MERGEABLE VMAs.
    
    Solution
    ========
    
    Firstly, Don't clear MMF_VM_MERGE_ANY when ksmd found no VM_MERGEABLE
    VMAs, because perhaps their mm_struct has just been added to ksm_mm_slot
    list, and its process has not yet officially started running or has not
    yet performed mmap/brk to allocate anonymous VMAS.
    
    Secondly, recheck MMF_VM_MERGEABLE again if a process takes
    MMF_VM_MERGE_ANY, and create a mm_slot and join it into ksm_scan_list
    again.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 3c6f33b7273a ("mm/ksm: support fork/exec for prctl")
    Fixes: d7597f59d1d3 ("mm: add new api to enable ksm per process")
    Signed-off-by: xu xin <[email protected]>
    Cc: Stefan Roesch <[email protected]>
    Cc: David Hildenbrand <[email protected]>
    Cc: Jinjiang Tu <[email protected]>
    Cc: Wang Yaxin <[email protected]>
    Cc: Yang Yang <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    [ changed mm_flags_test() and mm_flags_clear() calls to test_bit() and clear_bit() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mm/page_owner: fix memory leak in page_owner_stack_fops->release() [+ + +]

Author: Ran Xiaokai <[email protected]>
Date:   Fri Dec 19 07:42:32 2025 +0000

    mm/page_owner: fix memory leak in page_owner_stack_fops->release()
    
    commit a76a5ae2c6c645005672c2caf2d49361c6f2500f upstream.
    
    The page_owner_stack_fops->open() callback invokes seq_open_private(),
    therefore its corresponding ->release() callback must call
    seq_release_private().  Otherwise it will cause a memory leak of struct
    stack_print_ctx.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 765973a09803 ("mm,page_owner: display all stacks and their count")
    Signed-off-by: Ran Xiaokai <[email protected]>
    Acked-by: Michal Hocko <[email protected]>
    Acked-by: Vlastimil Babka <[email protected]>
    Cc: Andrey Konovalov <[email protected]>
    Cc: Brendan Jackman <[email protected]>
    Cc: Johannes Weiner <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: Suren Baghdasaryan <[email protected]>
    Cc: Zi Yan <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci-esdhc-imx: add alternate ARCH_S32 dependency to Kconfig [+ + +]

Author: Jared Kangas <[email protected]>
Date:   Fri Dec 12 07:03:17 2025 -0800

    mmc: sdhci-esdhc-imx: add alternate ARCH_S32 dependency to Kconfig
    
    commit d3ecb12e2e04ce53c95f933c462f2d8b150b965b upstream.
    
    MMC_SDHCI_ESDHC_IMX requires ARCH_MXC despite also being used on
    ARCH_S32, which results in unmet dependencies when compiling strictly
    for ARCH_S32. Resolve this by adding ARCH_S32 as an alternative to
    ARCH_MXC in the driver's dependencies.
    
    Fixes: 5c4f00627c9a ("mmc: sdhci-esdhc-imx: add NXP S32G2 support")
    Cc: [email protected]
    Signed-off-by: Jared Kangas <[email protected]>
    Reviewed-by: Haibo Chen <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci-msm: Avoid early clock doubling during HS400 transition [+ + +]

Author: Sarthak Garg <[email protected]>
Date:   Fri Nov 14 13:58:24 2025 +0530

    mmc: sdhci-msm: Avoid early clock doubling during HS400 transition
    
    commit b1f856b1727c2eaa4be2c6d7cd7a8ed052bbeb87 upstream.
    
    According to the hardware programming guide, the clock frequency must
    remain below 52MHz during the transition to HS400 mode.
    
    However,in the current implementation, the timing is set to HS400 (a
    DDR mode) before adjusting the clock. This causes the clock to double
    prematurely to 104MHz during the transition phase, violating the
    specification and potentially resulting in CRC errors or CMD timeouts.
    
    This change ensures that clock doubling is avoided during intermediate
    transitions and is applied only when the card requires a 200MHz clock
    for HS400 operation.
    
    Signed-off-by: Sarthak Garg <[email protected]>
    Reviewed-by: Bjorn Andersson <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci-of-arasan: Increase CD stable timeout to 2 seconds [+ + +]

Author: Sai Krishna Potthuri <[email protected]>
Date:   Fri Dec 12 12:05:09 2025 +0530

    mmc: sdhci-of-arasan: Increase CD stable timeout to 2 seconds
    
    commit a9c4c9085ec8ce3ce01be21b75184789e74f5f19 upstream.
    
    On Xilinx/AMD platforms, the CD stable bit take slightly longer than
    one second(about an additional 100ms) to assert after a host
    controller reset. Although no functional failure observed with the
    existing one second delay but to ensure reliable initialization, increase
    the CD stable timeout to 2 seconds.
    
    Fixes: e251709aaddb ("mmc: sdhci-of-arasan: Ensure CD logic stabilization before power-up")
    Cc: [email protected]
    Signed-off-by: Sai Krishna Potthuri <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: avoid deadlock on fallback while reinjecting [+ + +]

Author: Paolo Abeni <[email protected]>
Date:   Fri Dec 5 19:55:17 2025 +0100

    mptcp: avoid deadlock on fallback while reinjecting
    
    commit ffb8c27b0539dd90262d1021488e7817fae57c42 upstream.
    
    Jakub reported an MPTCP deadlock at fallback time:
    
     WARNING: possible recursive locking detected
     6.18.0-rc7-virtme #1 Not tainted
     --------------------------------------------
     mptcp_connect/20858 is trying to acquire lock:
     ff1100001da18b60 (&msk->fallback_lock){+.-.}-{3:3}, at: __mptcp_try_fallback+0xd8/0x280
    
     but task is already holding lock:
     ff1100001da18b60 (&msk->fallback_lock){+.-.}-{3:3}, at: __mptcp_retrans+0x352/0xaa0
    
     other info that might help us debug this:
      Possible unsafe locking scenario:
    
            CPU0
            ----
       lock(&msk->fallback_lock);
       lock(&msk->fallback_lock);
    
      *** DEADLOCK ***
    
      May be due to missing lock nesting notation
    
     3 locks held by mptcp_connect/20858:
      #0: ff1100001da18290 (sk_lock-AF_INET){+.+.}-{0:0}, at: mptcp_sendmsg+0x114/0x1bc0
      #1: ff1100001db40fd0 (k-sk_lock-AF_INET#2){+.+.}-{0:0}, at: __mptcp_retrans+0x2cb/0xaa0
      #2: ff1100001da18b60 (&msk->fallback_lock){+.-.}-{3:3}, at: __mptcp_retrans+0x352/0xaa0
    
     stack backtrace:
     CPU: 0 UID: 0 PID: 20858 Comm: mptcp_connect Not tainted 6.18.0-rc7-virtme #1 PREEMPT(full)
     Hardware name: Bochs, BIOS Bochs 01/01/2011
     Call Trace:
      <TASK>
      dump_stack_lvl+0x6f/0xa0
      print_deadlock_bug.cold+0xc0/0xcd
      validate_chain+0x2ff/0x5f0
      __lock_acquire+0x34c/0x740
      lock_acquire.part.0+0xbc/0x260
      _raw_spin_lock_bh+0x38/0x50
      __mptcp_try_fallback+0xd8/0x280
      mptcp_sendmsg_frag+0x16c2/0x3050
      __mptcp_retrans+0x421/0xaa0
      mptcp_release_cb+0x5aa/0xa70
      release_sock+0xab/0x1d0
      mptcp_sendmsg+0xd5b/0x1bc0
      sock_write_iter+0x281/0x4d0
      new_sync_write+0x3c5/0x6f0
      vfs_write+0x65e/0xbb0
      ksys_write+0x17e/0x200
      do_syscall_64+0xbb/0xfd0
      entry_SYSCALL_64_after_hwframe+0x4b/0x53
     RIP: 0033:0x7fa5627cbc5e
     Code: 4d 89 d8 e8 14 bd 00 00 4c 8b 5d f8 41 8b 93 08 03 00 00 59 5e 48 83 f8 fc 74 11 c9 c3 0f 1f 80 00 00 00 00 48 8b 45 10 0f 05 <c9> c3 83 e2 39 83 fa 08 75 e7 e8 13 ff ff ff 0f 1f 00 f3 0f 1e fa
     RSP: 002b:00007fff1fe14700 EFLAGS: 00000202 ORIG_RAX: 0000000000000001
     RAX: ffffffffffffffda RBX: 0000000000000005 RCX: 00007fa5627cbc5e
     RDX: 0000000000001f9c RSI: 00007fff1fe16984 RDI: 0000000000000005
     RBP: 00007fff1fe14710 R08: 0000000000000000 R09: 0000000000000000
     R10: 0000000000000000 R11: 0000000000000202 R12: 00007fff1fe16920
     R13: 0000000000002000 R14: 0000000000001f9c R15: 0000000000001f9c
    
    The packet scheduler could attempt a reinjection after receiving an
    MP_FAIL and before the infinite map has been transmitted, causing a
    deadlock since MPTCP needs to do the reinjection atomically from WRT
    fallback.
    
    Address the issue explicitly avoiding the reinjection in the critical
    scenario. Note that this is the only fallback critical section that
    could potentially send packets and hit the double-lock.
    
    Reported-by: Jakub Kicinski <[email protected]>
    Closes: https://netdev-ctrl.bots.linux.dev/logs/vmksft/mptcp-dbg/results/412720/1-mptcp-join-sh/stderr
    Fixes: f8a1d9b18c5e ("mptcp: make fallback action and fallback decision atomic")
    Cc: [email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Reviewed-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20251205-net-mptcp-misc-fixes-6-19-rc1-v1-4-9e4781a6c1b8@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: pm: ignore unknown endpoint flags [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Tue Dec 30 08:21:11 2025 -0500

    mptcp: pm: ignore unknown endpoint flags
    
    [ Upstream commit 0ace3297a7301911e52d8195cb1006414897c859 ]
    
    Before this patch, the kernel was saving any flags set by the userspace,
    even unknown ones. This doesn't cause critical issues because the kernel
    is only looking at specific ones. But on the other hand, endpoints dumps
    could tell the userspace some recent flags seem to be supported on older
    kernel versions.
    
    Instead, ignore all unknown flags when parsing them. By doing that, the
    userspace can continue to set unsupported flags, but it has a way to
    verify what is supported by the kernel.
    
    Note that it sounds better to continue accepting unsupported flags not
    to change the behaviour, but also that eases things on the userspace
    side by adding "optional" endpoint types only supported by newer kernel
    versions without having to deal with the different kernel versions.
    
    A note for the backports: there will be conflicts in mptcp.h on older
    versions not having the mentioned flags, the new line should still be
    added last, and the '5' needs to be adapted to have the same value as
    the last entry.
    
    Fixes: 01cacb00b35c ("mptcp: add netlink-based PM")
    Cc: [email protected]
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20251205-net-mptcp-misc-fixes-6-19-rc1-v1-1-9e4781a6c1b8@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    [ GENMASK(5, 0) => GENMASK(4, 0) + context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mptcp: schedule rtx timer only after pushing data [+ + +]

Author: Paolo Abeni <[email protected]>
Date:   Fri Dec 5 19:55:16 2025 +0100

    mptcp: schedule rtx timer only after pushing data
    
    commit 2ea6190f42d0416a4310e60a7fcb0b49fcbbd4fb upstream.
    
    The MPTCP protocol usually schedule the retransmission timer only
    when there is some chances for such retransmissions to happen.
    
    With a notable exception: __mptcp_push_pending() currently schedule
    such timer unconditionally, potentially leading to unnecessary rtx
    timer expiration.
    
    The issue is present since the blamed commit below but become easily
    reproducible after commit 27b0e701d387 ("mptcp: drop bogus optimization
    in __mptcp_check_push()")
    
    Fixes: 33d41c9cd74c ("mptcp: more accurate timeout")
    Cc: [email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Reviewed-by: Matthieu Baerts (NGI0) <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20251205-net-mptcp-misc-fixes-6-19-rc1-v1-3-9e4781a6c1b8@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: mtdpart: ignore error -ENOENT from parsers on subpartitions [+ + +]

Author: Christian Marangi <[email protected]>
Date:   Sun Nov 9 12:52:44 2025 +0100

    mtd: mtdpart: ignore error -ENOENT from parsers on subpartitions
    
    commit 64ef5f454e167bb66cf70104f033c3d71e6ef9c0 upstream.
    
    Commit 5c2f7727d437 ("mtd: mtdpart: check for subpartitions parsing
    result") introduced some kind of regression with parser on subpartitions
    where if a parser emits an error then the entire parsing process from the
    upper parser fails and partitions are deleted.
    
    Not checking for error in subpartitions was originally intended as
    special parser can emit error also in the case of the partition not
    correctly init (for example a wiped partition) or special case where the
    partition should be skipped due to some ENV variables externally
    provided (from bootloader for example)
    
    One example case is the TRX partition where, in the context of a wiped
    partition, returns a -ENOENT as the trx_magic is not found in the
    expected TRX header (as the partition is wiped)
    
    To better handle this and still keep some kind of error tracking (for
    example to catch -ENOMEM errors or -EINVAL errors), permit parser on
    subpartition to emit -ENOENT error, print a debug log and skip them
    accordingly.
    
    This results in giving better tracking of the status of the parser
    (instead of returning just 0, dropping any kind of signal that there is
    something wrong with the parser) and to some degree restore the original
    logic of the subpartitions parse.
    
    (worth to notice that some special partition might have all the special
    header present for the parser and declare 0 partition in it, this is why
    it would be wrong to simply return 0 in the case of a special partition
    that is NOT init for the scanning parser)
    
    Cc: [email protected]
    Fixes: 5c2f7727d437 ("mtd: mtdpart: check for subpartitions parsing result")
    Signed-off-by: Christian Marangi <[email protected]>
    Signed-off-by: Miquel Raynal <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25H01NWxxAM chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:04 2025 +0100

    mtd: spi-nor: winbond: Add support for W25H01NWxxAM chips
    
    commit 1df1fdbc7e63350b2962dc7d87ded124ee26f3ad upstream.
    
    These chips must be described as none of the block protection
    information are discoverable. This chip supports 4 bits plus the
    top/bottom addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25H02NWxxAM chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:05 2025 +0100

    mtd: spi-nor: winbond: Add support for W25H02NWxxAM chips
    
    commit 604cf6a40157abba4677dea9834de8df9047d798 upstream.
    
    These chips must be described as none of the block protection
    information are discoverable. This chip supports 4 bits plus the
    top/bottom addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25H512NWxxAM chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:03 2025 +0100

    mtd: spi-nor: winbond: Add support for W25H512NWxxAM chips
    
    commit f21d2c7d37553b24825918f2f61df123e182b712 upstream.
    
    These chips must be described as none of the block protection
    information are discoverable. This chip supports 4 bits plus the
    top/bottom addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25Q01NWxxIM chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:01 2025 +0100

    mtd: spi-nor: winbond: Add support for W25Q01NWxxIM chips
    
    commit a607e676c8b9258eabc3fc88f45bcd70ea178b41 upstream.
    
    These chips must be described as none of the block protection
    information are discoverable. This chip supports 4 bits plus the
    top/bottom addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25Q01NWxxIQ chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:00 2025 +0100

    mtd: spi-nor: winbond: Add support for W25Q01NWxxIQ chips
    
    commit aee8c4d9d48d661624d72de670ebe5c6b5687842 upstream.
    
    This chip must be described as none of the block protection information
    are discoverable. This chip supports 4 bits plus the top/bottom
    addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mtd: spi-nor: winbond: Add support for W25Q02NWxxIM chips [+ + +]

Author: Miquel Raynal <[email protected]>
Date:   Wed Nov 5 18:27:02 2025 +0100

    mtd: spi-nor: winbond: Add support for W25Q02NWxxIM chips
    
    commit 71c239348d9fbdb1f0d6f36013f1697cc06c3e9c upstream.
    
    These chips must be described as none of the block protection
    information are discoverable. This chip supports 4 bits plus the
    top/bottom addressing capability to identify the protected blocks.
    
    Cc: [email protected]
    Signed-off-by: Miquel Raynal <[email protected]>
    Reviewed-by: Michael Walle <[email protected]>
    Signed-off-by: Pratyush Yadav <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/handshake: duplicate handshake cancellations leak socket [+ + +]

Author: Scott Mayhew <[email protected]>
Date:   Tue Dec 9 14:30:15 2025 -0500

    net/handshake: duplicate handshake cancellations leak socket
    
    [ Upstream commit 15564bd67e2975002f2a8e9defee33e321d3183f ]
    
    When a handshake request is cancelled it is removed from the
    handshake_net->hn_requests list, but it is still present in the
    handshake_rhashtbl until it is destroyed.
    
    If a second cancellation request arrives for the same handshake request,
    then remove_pending() will return false... and assuming
    HANDSHAKE_F_REQ_COMPLETED isn't set in req->hr_flags, we'll continue
    processing through the out_true label, where we put another reference on
    the sock and a refcount underflow occurs.
    
    This can happen for example if a handshake times out - particularly if
    the SUNRPC client sends the AUTH_TLS probe to the server but doesn't
    follow it up with the ClientHello due to a problem with tlshd.  When the
    timeout is hit on the server, the server will send a FIN, which triggers
    a cancellation request via xs_reset_transport().  When the timeout is
    hit on the client, another cancellation request happens via
    xs_tls_handshake_sync().
    
    Add a test_and_set_bit(HANDSHAKE_F_REQ_COMPLETED) in the pending cancel
    path so duplicate cancels can be detected.
    
    Fixes: 3b3009ea8abb ("net/handshake: Create a NETLINK service for handling handshake requests")
    Suggested-by: Chuck Lever <[email protected]>
    Signed-off-by: Scott Mayhew <[email protected]>
    Reviewed-by: Chuck Lever <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/handshake: restore destructor on submit failure [+ + +]

Author: caoping <[email protected]>
Date:   Thu Dec 4 01:10:58 2025 -0800

    net/handshake: restore destructor on submit failure
    
    commit 6af2a01d65f89e73c1cbb9267f8880d83a88cee4 upstream.
    
    handshake_req_submit() replaces sk->sk_destruct but never restores it when
    submission fails before the request is hashed. handshake_sk_destruct() then
    returns early and the original destructor never runs, leaking the socket.
    Restore sk_destruct on the error path.
    
    Fixes: 3b3009ea8abb ("net/handshake: Create a NETLINK service for handling handshake requests")
    Reviewed-by: Chuck Lever <[email protected]>
    Cc: [email protected]
    Signed-off-by: caoping <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/hsr: fix NULL pointer dereference in prp_get_untagged_frame() [+ + +]

Author: Shaurya Rane <[email protected]>
Date:   Sat Nov 29 15:07:18 2025 +0530

    net/hsr: fix NULL pointer dereference in prp_get_untagged_frame()
    
    commit 188e0fa5a679570ea35474575e724d8211423d17 upstream.
    
    prp_get_untagged_frame() calls __pskb_copy() to create frame->skb_std
    but doesn't check if the allocation failed. If __pskb_copy() returns
    NULL, skb_clone() is called with a NULL pointer, causing a crash:
    
    Oops: general protection fault, probably for non-canonical address 0xdffffc000000000f: 0000 [#1] SMP KASAN NOPTI
    KASAN: null-ptr-deref in range [0x0000000000000078-0x000000000000007f]
    CPU: 0 UID: 0 PID: 5625 Comm: syz.1.18 Not tainted syzkaller #0 PREEMPT(full)
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
    RIP: 0010:skb_clone+0xd7/0x3a0 net/core/skbuff.c:2041
    Code: 03 42 80 3c 20 00 74 08 4c 89 f7 e8 23 29 05 f9 49 83 3e 00 0f 85 a0 01 00 00 e8 94 dd 9d f8 48 8d 6b 7e 49 89 ee 49 c1 ee 03 <43> 0f b6 04 26 84 c0 0f 85 d1 01 00 00 44 0f b6 7d 00 41 83 e7 0c
    RSP: 0018:ffffc9000d00f200 EFLAGS: 00010207
    RAX: ffffffff892235a1 RBX: 0000000000000000 RCX: ffff88803372a480
    RDX: 0000000000000000 RSI: 0000000000000820 RDI: 0000000000000000
    RBP: 000000000000007e R08: ffffffff8f7d0f77 R09: 1ffffffff1efa1ee
    R10: dffffc0000000000 R11: fffffbfff1efa1ef R12: dffffc0000000000
    R13: 0000000000000820 R14: 000000000000000f R15: ffff88805144cc00
    FS:  0000555557f6d500(0000) GS:ffff88808d72f000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 0000555581d35808 CR3: 000000005040e000 CR4: 0000000000352ef0
    Call Trace:
     <TASK>
     hsr_forward_do net/hsr/hsr_forward.c:-1 [inline]
     hsr_forward_skb+0x1013/0x2860 net/hsr/hsr_forward.c:741
     hsr_handle_frame+0x6ce/0xa70 net/hsr/hsr_slave.c:84
     __netif_receive_skb_core+0x10b9/0x4380 net/core/dev.c:5966
     __netif_receive_skb_one_core net/core/dev.c:6077 [inline]
     __netif_receive_skb+0x72/0x380 net/core/dev.c:6192
     netif_receive_skb_internal net/core/dev.c:6278 [inline]
     netif_receive_skb+0x1cb/0x790 net/core/dev.c:6337
     tun_rx_batched+0x1b9/0x730 drivers/net/tun.c:1485
     tun_get_user+0x2b65/0x3e90 drivers/net/tun.c:1953
     tun_chr_write_iter+0x113/0x200 drivers/net/tun.c:1999
     new_sync_write fs/read_write.c:593 [inline]
     vfs_write+0x5c9/0xb30 fs/read_write.c:686
     ksys_write+0x145/0x250 fs/read_write.c:738
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xfa/0xfa0 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    RIP: 0033:0x7f0449f8e1ff
    Code: 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 f9 92 02 00 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 31 44 89 c7 48 89 44 24 08 e8 4c 93 02 00 48
    RSP: 002b:00007ffd7ad94c90 EFLAGS: 00000293 ORIG_RAX: 0000000000000001
    RAX: ffffffffffffffda RBX: 00007f044a1e5fa0 RCX: 00007f0449f8e1ff
    RDX: 000000000000003e RSI: 0000200000000500 RDI: 00000000000000c8
    RBP: 00007ffd7ad94d20 R08: 0000000000000000 R09: 0000000000000000
    R10: 000000000000003e R11: 0000000000000293 R12: 0000000000000001
    R13: 00007f044a1e5fa0 R14: 00007f044a1e5fa0 R15: 0000000000000003
     </TASK>
    
    Add a NULL check immediately after __pskb_copy() to handle allocation
    failures gracefully.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=2fa344348a579b779e05
    Fixes: f266a683a480 ("net/hsr: Better frame dispatch")
    Cc: [email protected]
    Signed-off-by: Shaurya Rane <[email protected]>
    Reviewed-by: Felix Maurer <[email protected]>
    Tested-by: Felix Maurer <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/mlx5: Drain firmware reset in shutdown callback [+ + +]

Author: Moshe Shemesh <[email protected]>
Date:   Tue Dec 9 14:56:10 2025 +0200

    net/mlx5: Drain firmware reset in shutdown callback
    
    [ Upstream commit 5846a365fc6476b02d6766963cf0985520f0385f ]
    
    Invoke drain_fw_reset() in the shutdown callback to ensure all
    firmware reset handling is completed before shutdown proceeds.
    
    Fixes: 16d42d313350 ("net/mlx5: Drain fw_reset when removing device")
    Signed-off-by: Moshe Shemesh <[email protected]>
    Reviewed-by: Shay Drori <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: fw reset, clear reset requested on drain_fw_reset [+ + +]

Author: Moshe Shemesh <[email protected]>
Date:   Tue Dec 9 14:56:09 2025 +0200

    net/mlx5: fw reset, clear reset requested on drain_fw_reset
    
    [ Upstream commit 89a898d63f6f588acf5c104c65c94a38b68c69a6 ]
    
    drain_fw_reset() waits for ongoing firmware reset events and blocks new
    event handling, but does not clear the reset requested flag, and may
    keep sync reset polling.
    
    To fix it, call mlx5_sync_reset_clear_reset_requested() to clear the
    flag, stop sync reset polling, and resume health polling, ensuring
    health issues are still detected after the firmware reset drain.
    
    Fixes: 16d42d313350 ("net/mlx5: Drain fw_reset when removing device")
    Signed-off-by: Moshe Shemesh <[email protected]>
    Reviewed-by: Shay Drori <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: fw_tracer, Handle escaped percent properly [+ + +]

Author: Shay Drory <[email protected]>
Date:   Tue Dec 9 14:56:12 2025 +0200

    net/mlx5: fw_tracer, Handle escaped percent properly
    
    [ Upstream commit c0289f67f7d6a0dfba0e92cfe661a5c70c8c6e92 ]
    
    The firmware tracer's format string validation and parameter counting
    did not properly handle escaped percent signs (%%). This caused
    fw_tracer to count more parameters when trace format strings contained
    literal percent characters.
    
    To fix it, allow %% to pass string validation and skip %% sequences when
    counting parameters since they represent literal percent signs rather
    than format specifiers.
    
    Fixes: 70dd6fdb8987 ("net/mlx5: FW tracer, parse traces and kernel tracing support")
    Signed-off-by: Shay Drory <[email protected]>
    Reported-by: Breno Leitao <[email protected]>
    Reviewed-by: Moshe Shemesh <[email protected]>
    Closes: https://lore.kernel.org/netdev/hanz6rzrb2bqbplryjrakvkbmv4y5jlmtthnvi3thg5slqvelp@t3s3erottr6s/
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: fw_tracer, Validate format string parameters [+ + +]

Author: Shay Drory <[email protected]>
Date:   Tue Dec 9 14:56:11 2025 +0200

    net/mlx5: fw_tracer, Validate format string parameters
    
    [ Upstream commit b35966042d20b14e2d83330049f77deec5229749 ]
    
    Add validation for format string parameters in the firmware tracer to
    prevent potential security vulnerabilities and crashes from malformed
    format strings received from firmware.
    
    The firmware tracer receives format strings from the device firmware and
    uses them to format trace messages. Without proper validation, bad
    firmware could provide format strings with invalid format specifiers
    (e.g., %s, %p, %n) that could lead to crashes, or other undefined
    behavior.
    
    Add mlx5_tracer_validate_params() to validate that all format specifiers
    in trace strings are limited to safe integer/hex formats (%x, %d, %i,
    %u, %llx, %lx, etc.). Reject strings containing other format types that
    could be used to access arbitrary memory or cause crashes.
    Invalid format strings are added to the trace output for visibility with
    "BAD_FORMAT: " prefix.
    
    Fixes: 70dd6fdb8987 ("net/mlx5: FW tracer, parse traces and kernel tracing support")
    Signed-off-by: Shay Drory <[email protected]>
    Reviewed-by: Moshe Shemesh <[email protected]>
    Reported-by: Breno Leitao <[email protected]>
    Closes: https://lore.kernel.org/netdev/hanz6rzrb2bqbplryjrakvkbmv4y5jlmtthnvi3thg5slqvelp@t3s3erottr6s/
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/mlx5: Serialize firmware reset with devlink [+ + +]

Author: Shay Drory <[email protected]>
Date:   Tue Dec 9 14:56:13 2025 +0200

    net/mlx5: Serialize firmware reset with devlink
    
    [ Upstream commit 367e501f8b095eca08d2eb0ba4ccea5b5e82c169 ]
    
    The firmware reset mechanism can be triggered by asynchronous events,
    which may race with other devlink operations like devlink reload or
    devlink dev eswitch set, potentially leading to inconsistent states.
    
    This patch addresses the race by using the devl_lock to serialize the
    firmware reset against other devlink operations. When a reset is
    requested, the driver attempts to acquire the lock. If successful, it
    sets a flag to block devlink reload or eswitch changes, ACKs the reset
    to firmware and then releases the lock. If the lock is already held by
    another operation, the driver NACKs the firmware reset request,
    indicating that the reset cannot proceed.
    
    Firmware reset does not keep the devl_lock and instead uses an internal
    firmware reset bit. This is because firmware resets can be triggered by
    asynchronous events, and processed in different threads. It is illegal
    and unsafe to acquire a lock in one thread and attempt to release it in
    another, as lock ownership is intrinsically thread-specific.
    
    This change ensures that firmware resets and other devlink operations
    are mutually exclusive during the critical reset request phase,
    preventing race conditions.
    
    Fixes: 38b9f903f22b ("net/mlx5: Handle sync reset request event")
    Signed-off-by: Shay Drory <[email protected]>
    Reviewed-by: Mateusz Berezecki <[email protected]>
    Reviewed-by: Moshe Shemesh <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: ets: Always remove class from active list before deleting in ets_qdisc_change [+ + +]

Author: Jamal Hadi Salim <[email protected]>
Date:   Fri Nov 28 10:19:19 2025 -0500

    net/sched: ets: Always remove class from active list before deleting in ets_qdisc_change
    
    [ Upstream commit ce052b9402e461a9aded599f5b47e76bc727f7de ]
    
    [email protected] says:
    
    The vulnerability is a race condition between `ets_qdisc_dequeue` and
    `ets_qdisc_change`.  It leads to UAF on `struct Qdisc` object.
    Attacker requires the capability to create new user and network namespace
    in order to trigger the bug.
    See my additional commentary at the end of the analysis.
    
    Analysis:
    
    static int ets_qdisc_change(struct Qdisc *sch, struct nlattr *opt,
                              struct netlink_ext_ack *extack)
    {
    ...
    
          // (1) this lock is preventing .change handler (`ets_qdisc_change`)
          //to race with .dequeue handler (`ets_qdisc_dequeue`)
          sch_tree_lock(sch);
    
          for (i = nbands; i < oldbands; i++) {
                  if (i >= q->nstrict && q->classes[i].qdisc->q.qlen)
                          list_del_init(&q->classes[i].alist);
                  qdisc_purge_queue(q->classes[i].qdisc);
          }
    
          WRITE_ONCE(q->nbands, nbands);
          for (i = nstrict; i < q->nstrict; i++) {
                  if (q->classes[i].qdisc->q.qlen) {
                          // (2) the class is added to the q->active
                          list_add_tail(&q->classes[i].alist, &q->active);
                          q->classes[i].deficit = quanta[i];
                  }
          }
          WRITE_ONCE(q->nstrict, nstrict);
          memcpy(q->prio2band, priomap, sizeof(priomap));
    
          for (i = 0; i < q->nbands; i++)
                  WRITE_ONCE(q->classes[i].quantum, quanta[i]);
    
          for (i = oldbands; i < q->nbands; i++) {
                  q->classes[i].qdisc = queues[i];
                  if (q->classes[i].qdisc != &noop_qdisc)
                          qdisc_hash_add(q->classes[i].qdisc, true);
          }
    
          // (3) the qdisc is unlocked, now dequeue can be called in parallel
          // to the rest of .change handler
          sch_tree_unlock(sch);
    
          ets_offload_change(sch);
          for (i = q->nbands; i < oldbands; i++) {
                  // (4) we're reducing the refcount for our class's qdisc and
                  //  freeing it
                  qdisc_put(q->classes[i].qdisc);
                  // (5) If we call .dequeue between (4) and (5), we will have
                  // a strong UAF and we can control RIP
                  q->classes[i].qdisc = NULL;
                  WRITE_ONCE(q->classes[i].quantum, 0);
                  q->classes[i].deficit = 0;
                  gnet_stats_basic_sync_init(&q->classes[i].bstats);
                  memset(&q->classes[i].qstats, 0, sizeof(q->classes[i].qstats));
          }
          return 0;
    }
    
    Comment:
    This happens because some of the classes have their qdiscs assigned to
    NULL, but remain in the active list. This commit fixes this issue by always
    removing the class from the active list before deleting and freeing its
    associated qdisc
    
    Reproducer Steps
    (trimmed version of what was sent by [email protected])
    
    ```
    DEV="${DEV:-lo}"
    ROOT_HANDLE="${ROOT_HANDLE:-1:}"
    BAND2_HANDLE="${BAND2_HANDLE:-20:}"   # child under 1:2
    PING_BYTES="${PING_BYTES:-48}"
    PING_COUNT="${PING_COUNT:-200000}"
    PING_DST="${PING_DST:-127.0.0.1}"
    
    SLOW_TBF_RATE="${SLOW_TBF_RATE:-8bit}"
    SLOW_TBF_BURST="${SLOW_TBF_BURST:-100b}"
    SLOW_TBF_LAT="${SLOW_TBF_LAT:-1s}"
    
    cleanup() {
      tc qdisc del dev "$DEV" root 2>/dev/null
    }
    trap cleanup EXIT
    
    ip link set "$DEV" up
    
    tc qdisc del dev "$DEV" root 2>/dev/null || true
    
    tc qdisc add dev "$DEV" root handle "$ROOT_HANDLE" ets bands 2 strict 2
    
    tc qdisc add dev "$DEV" parent 1:2 handle "$BAND2_HANDLE" \
      tbf rate "$SLOW_TBF_RATE" burst "$SLOW_TBF_BURST" latency "$SLOW_TBF_LAT"
    
    tc filter add dev "$DEV" parent 1: protocol all prio 1 u32 match u32 0 0 flowid 1:2
    tc -s qdisc ls dev $DEV
    
    ping -I "$DEV" -f -c "$PING_COUNT" -s "$PING_BYTES" -W 0.001 "$PING_DST" \
      >/dev/null 2>&1 &
    tc qdisc change dev "$DEV" root handle "$ROOT_HANDLE" ets bands 2 strict 0
    tc qdisc change dev "$DEV" root handle "$ROOT_HANDLE" ets bands 2 strict 2
    tc -s qdisc ls dev $DEV
    tc qdisc del dev "$DEV" parent 1:2 || true
    tc -s qdisc ls dev $DEV
    tc qdisc change dev "$DEV" root handle "$ROOT_HANDLE" ets bands 1 strict 1
    ```
    
    KASAN report
    ```
    ==================================================================
    BUG: KASAN: slab-use-after-free in ets_qdisc_dequeue+0x1071/0x11b0 kernel/net/sched/sch_ets.c:481
    Read of size 8 at addr ffff8880502fc018 by task ping/12308
    >
    CPU: 0 UID: 0 PID: 12308 Comm: ping Not tainted 6.18.0-rc4-dirty #1 PREEMPT(full)
    Hardware name: QEMU Ubuntu 25.04 PC (i440FX + PIIX, 1996), BIOS 1.16.3-debian-1.16.3-2 04/01/2014
    Call Trace:
     <IRQ>
     __dump_stack kernel/lib/dump_stack.c:94
     dump_stack_lvl+0x100/0x190 kernel/lib/dump_stack.c:120
     print_address_description kernel/mm/kasan/report.c:378
     print_report+0x156/0x4c9 kernel/mm/kasan/report.c:482
     kasan_report+0xdf/0x110 kernel/mm/kasan/report.c:595
     ets_qdisc_dequeue+0x1071/0x11b0 kernel/net/sched/sch_ets.c:481
     dequeue_skb kernel/net/sched/sch_generic.c:294
     qdisc_restart kernel/net/sched/sch_generic.c:399
     __qdisc_run+0x1c9/0x1b00 kernel/net/sched/sch_generic.c:417
     __dev_xmit_skb kernel/net/core/dev.c:4221
     __dev_queue_xmit+0x2848/0x4410 kernel/net/core/dev.c:4729
     dev_queue_xmit kernel/./include/linux/netdevice.h:3365
    [...]
    
    Allocated by task 17115:
     kasan_save_stack+0x30/0x50 kernel/mm/kasan/common.c:56
     kasan_save_track+0x14/0x30 kernel/mm/kasan/common.c:77
     poison_kmalloc_redzone kernel/mm/kasan/common.c:400
     __kasan_kmalloc+0xaa/0xb0 kernel/mm/kasan/common.c:417
     kasan_kmalloc kernel/./include/linux/kasan.h:262
     __do_kmalloc_node kernel/mm/slub.c:5642
     __kmalloc_node_noprof+0x34e/0x990 kernel/mm/slub.c:5648
     kmalloc_node_noprof kernel/./include/linux/slab.h:987
     qdisc_alloc+0xb8/0xc30 kernel/net/sched/sch_generic.c:950
     qdisc_create_dflt+0x93/0x490 kernel/net/sched/sch_generic.c:1012
     ets_class_graft+0x4fd/0x800 kernel/net/sched/sch_ets.c:261
     qdisc_graft+0x3e4/0x1780 kernel/net/sched/sch_api.c:1196
    [...]
    
    Freed by task 9905:
     kasan_save_stack+0x30/0x50 kernel/mm/kasan/common.c:56
     kasan_save_track+0x14/0x30 kernel/mm/kasan/common.c:77
     __kasan_save_free_info+0x3b/0x70 kernel/mm/kasan/generic.c:587
     kasan_save_free_info kernel/mm/kasan/kasan.h:406
     poison_slab_object kernel/mm/kasan/common.c:252
     __kasan_slab_free+0x5f/0x80 kernel/mm/kasan/common.c:284
     kasan_slab_free kernel/./include/linux/kasan.h:234
     slab_free_hook kernel/mm/slub.c:2539
     slab_free kernel/mm/slub.c:6630
     kfree+0x144/0x700 kernel/mm/slub.c:6837
     rcu_do_batch kernel/kernel/rcu/tree.c:2605
     rcu_core+0x7c0/0x1500 kernel/kernel/rcu/tree.c:2861
     handle_softirqs+0x1ea/0x8a0 kernel/kernel/softirq.c:622
     __do_softirq kernel/kernel/softirq.c:656
    [...]
    
    Commentary:
    
    1. Maher Azzouzi working with Trend Micro Zero Day Initiative was reported as
    the person who found the issue. I requested to get a proper email to add to the
    reported-by tag but got no response. For this reason i will credit the person
    i exchanged emails with i.e [email protected]
    
    2. Neither i nor Victor who did a much more thorough testing was able to
    reproduce a UAF with the PoC or other approaches we tried. We were both able to
    reproduce a null ptr deref. After exchange with [email protected]
    they sent a small change to be made to the code to add an extra delay which
    was able to simulate the UAF. i.e, this:
       qdisc_put(q->classes[i].qdisc);
       mdelay(90);
       q->classes[i].qdisc = NULL;
    
    I was informed by Thomas Gleixner([email protected]) that adding delays was
    acceptable approach for demonstrating the bug, quote:
    "Adding such delays is common exploit validation practice"
    The equivalent delay could happen "by virt scheduling the vCPU out, SMIs,
    NMIs, PREEMPT_RT enabled kernel"
    
    3. I asked the OP to test and report back but got no response and after a
    few days gave up and proceeded to submit this fix.
    
    Fixes: de6d25924c2a ("net/sched: sch_ets: don't peek at classes beyond 'nbands'")
    Reported-by: [email protected]
    Tested-by: Victor Nogueira <[email protected]>
    Signed-off-by: Jamal Hadi Salim <[email protected]>
    Reviewed-by: Davide Caratti <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net/sched: ets: Remove drr class from the active list if it changes to strict [+ + +]

Author: Victor Nogueira <[email protected]>
Date:   Mon Dec 8 16:01:24 2025 -0300

    net/sched: ets: Remove drr class from the active list if it changes to strict
    
    [ Upstream commit b1e125ae425aba9b45252e933ca8df52a843ec70 ]
    
    Whenever a user issues an ets qdisc change command, transforming a
    drr class into a strict one, the ets code isn't checking whether that
    class was in the active list and removing it. This means that, if a
    user changes a strict class (which was in the active list) back to a drr
    one, that class will be added twice to the active list [1].
    
    Doing so with the following commands:
    
    tc qdisc add dev lo root handle 1: ets bands 2 strict 1
    tc qdisc add dev lo parent 1:2 handle 20: \
        tbf rate 8bit burst 100b latency 1s
    tc filter add dev lo parent 1: basic classid 1:2
    ping -c1 -W0.01 -s 56 127.0.0.1
    tc qdisc change dev lo root handle 1: ets bands 2 strict 2
    tc qdisc change dev lo root handle 1: ets bands 2 strict 1
    ping -c1 -W0.01 -s 56 127.0.0.1
    
    Will trigger the following splat with list debug turned on:
    
    [   59.279014][  T365] ------------[ cut here ]------------
    [   59.279452][  T365] list_add double add: new=ffff88801d60e350, prev=ffff88801d60e350, next=ffff88801d60e2c0.
    [   59.280153][  T365] WARNING: CPU: 3 PID: 365 at lib/list_debug.c:35 __list_add_valid_or_report+0x17f/0x220
    [   59.280860][  T365] Modules linked in:
    [   59.281165][  T365] CPU: 3 UID: 0 PID: 365 Comm: tc Not tainted 6.18.0-rc7-00105-g7e9f13163c13-dirty #239 PREEMPT(voluntary)
    [   59.281977][  T365] Hardware name: Bochs Bochs, BIOS Bochs 01/01/2011
    [   59.282391][  T365] RIP: 0010:__list_add_valid_or_report+0x17f/0x220
    [   59.282842][  T365] Code: 89 c6 e8 d4 b7 0d ff 90 0f 0b 90 90 31 c0 e9 31 ff ff ff 90 48 c7 c7 e0 a0 22 9f 48 89 f2 48 89 c1 4c 89 c6 e8 b2 b7 0d ff 90 <0f> 0b 90 90 31 c0 e9 0f ff ff ff 48 89 f7 48 89 44 24 10 4c 89 44
    ...
    [   59.288812][  T365] Call Trace:
    [   59.289056][  T365]  <TASK>
    [   59.289224][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.289546][  T365]  ets_qdisc_change+0xd2b/0x1e80
    [   59.289891][  T365]  ? __lock_acquire+0x7e7/0x1be0
    [   59.290223][  T365]  ? __pfx_ets_qdisc_change+0x10/0x10
    [   59.290546][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.290898][  T365]  ? __mutex_trylock_common+0xda/0x240
    [   59.291228][  T365]  ? __pfx___mutex_trylock_common+0x10/0x10
    [   59.291655][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.291993][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.292313][  T365]  ? trace_contention_end+0xc8/0x110
    [   59.292656][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.293022][  T365]  ? srso_alias_return_thunk+0x5/0xfbef5
    [   59.293351][  T365]  tc_modify_qdisc+0x63a/0x1cf0
    
    Fix this by always checking and removing an ets class from the active list
    when changing it to strict.
    
    [1] https://git.kernel.org/pub/scm/linux/kernel/git/netdev/net.git/tree/net/sched/sch_ets.c?id=ce052b9402e461a9aded599f5b47e76bc727f7de#n663
    
    Fixes: cd9b50adc6bb9 ("net/sched: ets: fix crash when flipping from 'strict' to 'quantum'")
    Acked-by: Jamal Hadi Salim <[email protected]>
    Signed-off-by: Victor Nogueira <[email protected]>
    Reviewed-by: Petr Machata <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: bridge: Describe @tunnel_hash member in net_bridge_vlan_group struct [+ + +]

Author: Bagas Sanjaya <[email protected]>
Date:   Thu Dec 18 11:29:37 2025 +0700

    net: bridge: Describe @tunnel_hash member in net_bridge_vlan_group struct
    
    [ Upstream commit f79f9b7ace1713e4b83888c385f5f55519dfb687 ]
    
    Sphinx reports kernel-doc warning:
    
    WARNING: ./net/bridge/br_private.h:267 struct member 'tunnel_hash' not described in 'net_bridge_vlan_group'
    
    Fix it by describing @tunnel_hash member.
    
    Fixes: efa5356b0d9753 ("bridge: per vlan dst_metadata netlink support")
    Signed-off-by: Bagas Sanjaya <[email protected]>
    Acked-by: Nikolay Aleksandrov <[email protected]>
    Reviewed-by: Ido Schimmel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: b53: skip multicast entries for fdb_dump() [+ + +]

Author: Jonas Gorski <[email protected]>
Date:   Wed Dec 17 21:57:56 2025 +0100

    net: dsa: b53: skip multicast entries for fdb_dump()
    
    [ Upstream commit d42bce414d1c5c0b536758466a1f63ac358e613c ]
    
    port_fdb_dump() is supposed to only add fdb entries, but we iterate over
    the full ARL table, which also includes multicast entries.
    
    So check if the entry is a multicast entry before passing it on to the
    callback().
    
    Additionally, the port of those entries is a bitmask, not a port number,
    so any included entries would have even be for the wrong port.
    
    Fixes: 1da6df85c6fb ("net: dsa: b53: Implement ARL add/del/dump operations")
    Signed-off-by: Jonas Gorski <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: fix missing put_device() in dsa_tree_find_first_conduit() [+ + +]

Author: Vladimir Oltean <[email protected]>
Date:   Mon Dec 15 17:02:36 2025 +0200

    net: dsa: fix missing put_device() in dsa_tree_find_first_conduit()
    
    [ Upstream commit a9f96dc59b4a50ffbf86158f315e115969172d48 ]
    
    of_find_net_device_by_node() searches net devices by their /sys/class/net/,
    entry. It is documented in its kernel-doc that:
    
     * If successful, returns a pointer to the net_device with the embedded
     * struct device refcount incremented by one, or NULL on failure. The
     * refcount must be dropped when done with the net_device.
    
    We are missing a put_device(&conduit->dev) which we could place at the
    end of dsa_tree_find_first_conduit(). But to explain why calling
    put_device() right away is safe is the same as to explain why the chosen
    solution is different.
    
    The code is very poorly split: dsa_tree_find_first_conduit() was first
    introduced in commit 95f510d0b792 ("net: dsa: allow the DSA master to be
    seen and changed through rtnetlink") but was first used several commits
    later, in commit acc43b7bf52a ("net: dsa: allow masters to join a LAG").
    
    Assume there is a switch with 2 CPU ports and 2 conduits, eno2 and eno3.
    When we create a LAG (bonding or team device) and place eno2 and eno3
    beneath it, we create a 3rd conduit (the LAG device itself), but this is
    slightly different than the first two.
    
    Namely, the cpu_dp->conduit pointer of the CPU ports does not change,
    and remains pointing towards the physical Ethernet controllers which are
    now LAG ports. Only 2 things change:
    - the LAG device has a dev->dsa_ptr which marks it as a DSA conduit
    - dsa_port_to_conduit(user port) finds the LAG and not the physical
      conduit, because of the dp->cpu_port_in_lag bit being set.
    
    When the LAG device is destroyed, dsa_tree_migrate_ports_from_lag_conduit()
    is called and this is where dsa_tree_find_first_conduit() kicks in.
    
    This is the logical mistake and the reason why introducing code in one
    patch and using it from another is bad practice. I didn't realize that I
    don't have to call of_find_net_device_by_node() again; the cpu_dp->conduit
    association was never undone, and is still available for direct (re)use.
    There's only one concern - maybe the conduit disappeared in the
    meantime, but the netdev_hold() call we made during dsa_port_parse_cpu()
    (see previous change) ensures that this was not the case.
    
    Therefore, fixing the code means reimplementing it in the simplest way.
    
    I am blaming the time of use, since this is what "git blame" would show
    if we were to monitor for the conduit's kobject's refcount remaining
    elevated instead of being freed.
    
    Tested on the NXP LS1028A, using the steps from
    Documentation/networking/dsa/configuration.rst section "Affinity of user
    ports to CPU ports", followed by (extra prints added by me):
    
    $ ip link del bond0
    mscc_felix 0000:00:00.5 swp3: Link is Down
    bond0 (unregistering): (slave eno2): Releasing backup interface
    fsl_enetc 0000:00:00.2 eno2: Link is Down
    mscc_felix 0000:00:00.5 swp0: bond0 disappeared, migrating to eno2
    mscc_felix 0000:00:00.5 swp1: bond0 disappeared, migrating to eno2
    mscc_felix 0000:00:00.5 swp2: bond0 disappeared, migrating to eno2
    mscc_felix 0000:00:00.5 swp3: bond0 disappeared, migrating to eno2
    
    Fixes: acc43b7bf52a ("net: dsa: allow masters to join a LAG")
    Signed-off-by: Vladimir Oltean <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: enetc: do not transmit redirected XDP frames when the link is down [+ + +]

Author: Wei Fang <[email protected]>
Date:   Thu Dec 11 10:09:19 2025 +0800

    net: enetc: do not transmit redirected XDP frames when the link is down
    
    [ Upstream commit 2939203ffee818f1e5ebd60bbb85a174d63aab9c ]
    
    In the current implementation, the enetc_xdp_xmit() always transmits
    redirected XDP frames even if the link is down, but the frames cannot
    be transmitted from TX BD rings when the link is down, so the frames
    are still kept in the TX BD rings. If the XDP program is uninstalled,
    users will see the following warning logs.
    
    fsl_enetc 0000:00:00.0 eno0: timeout for tx ring #6 clear
    
    More worse, the TX BD ring cannot work properly anymore, because the
    HW PIR and CIR are not equal after the re-initialization of the TX
    BD ring. At this point, the BDs between CIR and PIR are invalid,
    which will cause a hardware malfunction.
    
    Another reason is that there is internal context in the ring prefetch
    logic that will retain the state from the first incarnation of the ring
    and continue prefetching from the stale location when we re-initialize
    the ring. The internal context is only reset by an FLR. That is to say,
    for LS1028A ENETC, software cannot set the HW CIR and PIR when
    initializing the TX BD ring.
    
    It does not make sense to transmit redirected XDP frames when the link is
    down. Add a link status check to prevent transmission in this condition.
    This fixes part of the issue, but more complex cases remain. For example,
    the TX BD ring may still contain unsent frames when the link goes down.
    Those situations require additional patches, which will build on this
    one.
    
    Fixes: 9d2b68cc108d ("net: enetc: add support for XDP_REDIRECT")
    Signed-off-by: Wei Fang <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Reviewed-by: Hariprasad Kelam <[email protected]>
    Reviewed-by: Vladimir Oltean <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: fec: ERR007885 Workaround for XDP TX path [+ + +]

Author: Wei Fang <[email protected]>
Date:   Fri Nov 28 10:59:15 2025 +0800

    net: fec: ERR007885 Workaround for XDP TX path
    
    [ Upstream commit e8e032cd24dda7cceaa27bc2eb627f82843f0466 ]
    
    The ERR007885 will lead to a TDAR race condition for mutliQ when the
    driver sets TDAR and the UDMA clears TDAR simultaneously or in a small
    window (2-4 cycles). And it will cause the udma_tx and udma_tx_arbiter
    state machines to hang. Therefore, the commit 53bb20d1faba ("net: fec:
    add variable reg_desc_active to speed things up") and the commit
    a179aad12bad ("net: fec: ERR007885 Workaround for conventional TX") have
    added the workaround to fix the potential issue for the conventional TX
    path. Similarly, the XDP TX path should also have the potential hang
    issue, so add the workaround for XDP TX path.
    
    Fixes: 6d6b39f180b8 ("net: fec: add initial XDP support")
    Signed-off-by: Wei Fang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: hns3: add VLAN id validation before using [+ + +]

Author: Jian Shen <[email protected]>
Date:   Thu Dec 11 10:37:37 2025 +0800

    net: hns3: add VLAN id validation before using
    
    [ Upstream commit 6ef935e65902bfed53980ad2754b06a284ea8ac1 ]
    
    Currently, the VLAN id may be used without validation when
    receive a VLAN configuration mailbox from VF. The length of
    vlan_del_fail_bmap is BITS_TO_LONGS(VLAN_N_VID). It may cause
    out-of-bounds memory access once the VLAN id is bigger than
    or equal to VLAN_N_VID.
    
    Therefore, VLAN id needs to be checked to ensure it is within
    the range of VLAN_N_VID.
    
    Fixes: fe4144d47eef ("net: hns3: sync VLAN filter entries when kill VLAN ID failed")
    Signed-off-by: Jian Shen <[email protected]>
    Signed-off-by: Jijie Shao <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: hns3: using the num_tqps in the vf driver to apply for resources [+ + +]

Author: Jian Shen <[email protected]>
Date:   Thu Dec 11 10:37:35 2025 +0800

    net: hns3: using the num_tqps in the vf driver to apply for resources
    
    [ Upstream commit c2a16269742e176fccdd0ef9c016a233491a49ad ]
    
    Currently, hdev->htqp is allocated using hdev->num_tqps, and kinfo->tqp
    is allocated using kinfo->num_tqps. However, kinfo->num_tqps is set to
    min(new_tqps, hdev->num_tqps);  Therefore, kinfo->num_tqps may be smaller
    than hdev->num_tqps, which causes some hdev->htqp[i] to remain
    uninitialized in hclgevf_knic_setup().
    
    Thus, this patch allocates hdev->htqp and kinfo->tqp using hdev->num_tqps,
    ensuring that the lengths of hdev->htqp and kinfo->tqp are consistent
    and that all elements are properly initialized.
    
    Fixes: e2cb1dec9779 ("net: hns3: Add HNS3 VF HCL(Hardware Compatibility Layer) Support")
    Signed-off-by: Jian Shen <[email protected]>
    Signed-off-by: Jijie Shao <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: hns3: using the num_tqps to check whether tqp_index is out of range when vf get ring info from mbx [+ + +]

Author: Jian Shen <[email protected]>
Date:   Thu Dec 11 10:37:36 2025 +0800

    net: hns3: using the num_tqps to check whether tqp_index is out of range when vf get ring info from mbx
    
    [ Upstream commit d180c11aa8a6fa735f9ac2c72c61364a9afc2ba7 ]
    
    Currently, rss_size = num_tqps / tc_num. If tc_num is 1, then num_tqps
    equals rss_size. However, if the tc_num is greater than 1, then rss_size
    will be less than num_tqps, causing the tqp_index check for subsequent TCs
    using rss_size to always fail.
    
    This patch uses the num_tqps to check whether tqp_index is out of range,
    instead of rss_size.
    
    Fixes: 326334aad024 ("net: hns3: add a check for tqp_index in hclge_get_ring_chain_from_mbx()")
    Signed-off-by: Jian Shen <[email protected]>
    Signed-off-by: Jijie Shao <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: ipv6: ioam6: use consistent dst names [+ + +]

Author: Justin Iurman <[email protected]>
Date:   Fri Jan 2 12:37:24 2026 -0800

    net: ipv6: ioam6: use consistent dst names
    
    [ Upstream commit d55acb9732d981c7a8e07dd63089a77d2938e382 ]
    
    Be consistent and use the same terminology as other lwt users: orig_dst
    is the dst_entry before the transformation, while dst is either the
    dst_entry in the cache or the dst_entry after the transformation
    
    Signed-off-by: Justin Iurman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    [Harshit: Backport to 6.12.y]
    Stable-dep-of: 99a2ace61b21 ("net: use dst_dev_rcu() in sk_setup_caps()")
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: macb: Relocate mog_init_rings() callback from macb_mac_link_up() to macb_open() [+ + +]

Author: Xiaolei Wang <[email protected]>
Date:   Mon Dec 22 09:56:24 2025 +0800

    net: macb: Relocate mog_init_rings() callback from macb_mac_link_up() to macb_open()
    
    commit 99537d5c476cada9cf75aef9fa75579a31faadb9 upstream.
    
    In the non-RT kernel, local_bh_disable() merely disables preemption,
    whereas it maps to an actual spin lock in the RT kernel. Consequently,
    when attempting to refill RX buffers via netdev_alloc_skb() in
    macb_mac_link_up(), a deadlock scenario arises as follows:
    
       WARNING: possible circular locking dependency detected
       6.18.0-08691-g2061f18ad76e #39 Not tainted
       ------------------------------------------------------
       kworker/0:0/8 is trying to acquire lock:
       ffff00080369bbe0 (&bp->lock){+.+.}-{3:3}, at: macb_start_xmit+0x808/0xb7c
    
       but task is already holding lock:
       ffff000803698e58 (&queue->tx_ptr_lock){+...}-{3:3}, at: macb_start_xmit
       +0x148/0xb7c
    
       which lock already depends on the new lock.
    
       the existing dependency chain (in reverse order) is:
    
       -> #3 (&queue->tx_ptr_lock){+...}-{3:3}:
              rt_spin_lock+0x50/0x1f0
              macb_start_xmit+0x148/0xb7c
              dev_hard_start_xmit+0x94/0x284
              sch_direct_xmit+0x8c/0x37c
              __dev_queue_xmit+0x708/0x1120
              neigh_resolve_output+0x148/0x28c
              ip6_finish_output2+0x2c0/0xb2c
              __ip6_finish_output+0x114/0x308
              ip6_output+0xc4/0x4a4
              mld_sendpack+0x220/0x68c
              mld_ifc_work+0x2a8/0x4f4
              process_one_work+0x20c/0x5f8
              worker_thread+0x1b0/0x35c
              kthread+0x144/0x200
              ret_from_fork+0x10/0x20
    
       -> #2 (_xmit_ETHER#2){+...}-{3:3}:
              rt_spin_lock+0x50/0x1f0
              sch_direct_xmit+0x11c/0x37c
              __dev_queue_xmit+0x708/0x1120
              neigh_resolve_output+0x148/0x28c
              ip6_finish_output2+0x2c0/0xb2c
              __ip6_finish_output+0x114/0x308
              ip6_output+0xc4/0x4a4
              mld_sendpack+0x220/0x68c
              mld_ifc_work+0x2a8/0x4f4
              process_one_work+0x20c/0x5f8
              worker_thread+0x1b0/0x35c
              kthread+0x144/0x200
              ret_from_fork+0x10/0x20
    
       -> #1 ((softirq_ctrl.lock)){+.+.}-{3:3}:
              lock_release+0x250/0x348
              __local_bh_enable_ip+0x7c/0x240
              __netdev_alloc_skb+0x1b4/0x1d8
              gem_rx_refill+0xdc/0x240
              gem_init_rings+0xb4/0x108
              macb_mac_link_up+0x9c/0x2b4
              phylink_resolve+0x170/0x614
              process_one_work+0x20c/0x5f8
              worker_thread+0x1b0/0x35c
              kthread+0x144/0x200
              ret_from_fork+0x10/0x20
    
       -> #0 (&bp->lock){+.+.}-{3:3}:
              __lock_acquire+0x15a8/0x2084
              lock_acquire+0x1cc/0x350
              rt_spin_lock+0x50/0x1f0
              macb_start_xmit+0x808/0xb7c
              dev_hard_start_xmit+0x94/0x284
              sch_direct_xmit+0x8c/0x37c
              __dev_queue_xmit+0x708/0x1120
              neigh_resolve_output+0x148/0x28c
              ip6_finish_output2+0x2c0/0xb2c
              __ip6_finish_output+0x114/0x308
              ip6_output+0xc4/0x4a4
              mld_sendpack+0x220/0x68c
              mld_ifc_work+0x2a8/0x4f4
              process_one_work+0x20c/0x5f8
              worker_thread+0x1b0/0x35c
              kthread+0x144/0x200
              ret_from_fork+0x10/0x20
    
       other info that might help us debug this:
    
       Chain exists of:
         &bp->lock --> _xmit_ETHER#2 --> &queue->tx_ptr_lock
    
        Possible unsafe locking scenario:
    
              CPU0                    CPU1
              ----                    ----
         lock(&queue->tx_ptr_lock);
                                      lock(_xmit_ETHER#2);
                                      lock(&queue->tx_ptr_lock);
         lock(&bp->lock);
    
        *** DEADLOCK ***
    
       Call trace:
        show_stack+0x18/0x24 (C)
        dump_stack_lvl+0xa0/0xf0
        dump_stack+0x18/0x24
        print_circular_bug+0x28c/0x370
        check_noncircular+0x198/0x1ac
        __lock_acquire+0x15a8/0x2084
        lock_acquire+0x1cc/0x350
        rt_spin_lock+0x50/0x1f0
        macb_start_xmit+0x808/0xb7c
        dev_hard_start_xmit+0x94/0x284
        sch_direct_xmit+0x8c/0x37c
        __dev_queue_xmit+0x708/0x1120
        neigh_resolve_output+0x148/0x28c
        ip6_finish_output2+0x2c0/0xb2c
        __ip6_finish_output+0x114/0x308
        ip6_output+0xc4/0x4a4
        mld_sendpack+0x220/0x68c
        mld_ifc_work+0x2a8/0x4f4
        process_one_work+0x20c/0x5f8
        worker_thread+0x1b0/0x35c
        kthread+0x144/0x200
        ret_from_fork+0x10/0x20
    
    Notably, invoking the mog_init_rings() callback upon link establishment
    is unnecessary. Instead, we can exclusively call mog_init_rings() within
    the ndo_open() callback. This adjustment resolves the deadlock issue.
    Furthermore, since MACB_CAPS_MACB_IS_EMAC cases do not use mog_init_rings()
    when opening the network interface via at91ether_open(), moving
    mog_init_rings() to macb_open() also eliminates the MACB_CAPS_MACB_IS_EMAC
    check.
    
    Fixes: 633e98a711ac ("net: macb: use resolved link config in mac_link_up()")
    Cc: [email protected]
    Suggested-by: Kevin Hao <[email protected]>
    Signed-off-by: Xiaolei Wang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: mdio: aspeed: add dummy read to avoid read-after-write issue [+ + +]

Author: Jacky Chou <[email protected]>
Date:   Thu Dec 11 14:24:58 2025 +0800

    net: mdio: aspeed: add dummy read to avoid read-after-write issue
    
    [ Upstream commit d1a1a4bade4b20c0858d0b2f81d2611de055f675 ]
    
    The Aspeed MDIO controller may return incorrect data when a read operation
    follows immediately after a write. Due to a controller bug, the subsequent
    read can latch stale data, causing the polling logic to terminate earlier
    than expected.
    
    To work around this hardware issue, insert a dummy read after each write
    operation. This ensures that the next actual read returns the correct
    data and prevents premature polling exit.
    
    This workaround has been verified to stabilize MDIO transactions on
    affected Aspeed platforms.
    
    Fixes: f160e99462c6 ("net: phy: Add mdio-aspeed")
    Signed-off-by: Jacky Chou <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/20251211-aspeed_mdio_add_dummy_read-v3-1-382868869004@aspeedtech.com
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: nfc: fix deadlock between nfc_unregister_device and rfkill_fop_write [+ + +]

Author: Deepanshu Kartikey <[email protected]>
Date:   Thu Dec 18 06:53:54 2025 +0530

    net: nfc: fix deadlock between nfc_unregister_device and rfkill_fop_write
    
    commit 1ab526d97a57e44d26fadcc0e9adeb9c0c0182f5 upstream.
    
    A deadlock can occur between nfc_unregister_device() and rfkill_fop_write()
    due to lock ordering inversion between device_lock and rfkill_global_mutex.
    
    The problematic lock order is:
    
    Thread A (rfkill_fop_write):
      rfkill_fop_write()
        mutex_lock(&rfkill_global_mutex)
          rfkill_set_block()
            nfc_rfkill_set_block()
              nfc_dev_down()
                device_lock(&dev->dev)    <- waits for device_lock
    
    Thread B (nfc_unregister_device):
      nfc_unregister_device()
        device_lock(&dev->dev)
          rfkill_unregister()
            mutex_lock(&rfkill_global_mutex)  <- waits for rfkill_global_mutex
    
    This creates a classic ABBA deadlock scenario.
    
    Fix this by moving rfkill_unregister() and rfkill_destroy() outside the
    device_lock critical section. Store the rfkill pointer in a local variable
    before releasing the lock, then call rfkill_unregister() after releasing
    device_lock.
    
    This change is safe because rfkill_fop_write() holds rfkill_global_mutex
    while calling the rfkill callbacks, and rfkill_unregister() also acquires
    rfkill_global_mutex before cleanup. Therefore, rfkill_unregister() will
    wait for any ongoing callback to complete before proceeding, and
    device_del() is only called after rfkill_unregister() returns, preventing
    any use-after-free.
    
    The similar lock ordering in nfc_register_device() (device_lock ->
    rfkill_global_mutex via rfkill_register) is safe because during
    registration the device is not yet in rfkill_list, so no concurrent
    rfkill operations can occur on this device.
    
    Fixes: 3e3b5dfcd16a ("NFC: reorder the logic in nfc_{un,}register_device")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=4ef89409a235d804c6c2
    Link: https://lore.kernel.org/all/[email protected]/T/ [v1]
    Signed-off-by: Deepanshu Kartikey <[email protected]>
    Reviewed-by: Krzysztof Kozlowski <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: openvswitch: Avoid needlessly taking the RTNL on vport destroy [+ + +]

Author: Toke Høiland-Jørgensen <[email protected]>
Date:   Thu Dec 11 12:50:05 2025 +0100

    net: openvswitch: Avoid needlessly taking the RTNL on vport destroy
    
    [ Upstream commit 5498227676303e3ffa9a3a46214af96bc3e81314 ]
    
    The openvswitch teardown code will immediately call
    ovs_netdev_detach_dev() in response to a NETDEV_UNREGISTER notification.
    It will then start the dp_notify_work workqueue, which will later end up
    calling the vport destroy() callback. This callback takes the RTNL to do
    another ovs_netdev_detach_port(), which in this case is unnecessary.
    This causes extra pressure on the RTNL, in some cases leading to
    "unregister_netdevice: waiting for XX to become free" warnings on
    teardown.
    
    We can straight-forwardly avoid the extra RTNL lock acquisition by
    checking the device flags before taking the lock, and skip the locking
    altogether if the IFF_OVS_DATAPATH flag has already been unset.
    
    Fixes: b07c26511e94 ("openvswitch: fix vport-netdev unregister")
    Tested-by: Adrian Moreno <[email protected]>
    Signed-off-by: Toke Høiland-Jørgensen <[email protected]>
    Acked-by: Eelco Chaudron <[email protected]>
    Acked-by: Aaron Conole <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: openvswitch: fix middle attribute validation in push_nsh() action [+ + +]

Author: Ilya Maximets <[email protected]>
Date:   Thu Dec 4 11:53:32 2025 +0100

    net: openvswitch: fix middle attribute validation in push_nsh() action
    
    [ Upstream commit 5ace7ef87f059d68b5f50837ef3e8a1a4870c36e ]
    
    The push_nsh() action structure looks like this:
    
     OVS_ACTION_ATTR_PUSH_NSH(OVS_KEY_ATTR_NSH(OVS_NSH_KEY_ATTR_BASE,...))
    
    The outermost OVS_ACTION_ATTR_PUSH_NSH attribute is OK'ed by the
    nla_for_each_nested() inside __ovs_nla_copy_actions().  The innermost
    OVS_NSH_KEY_ATTR_BASE/MD1/MD2 are OK'ed by the nla_for_each_nested()
    inside nsh_key_put_from_nlattr().  But nothing checks if the attribute
    in the middle is OK.  We don't even check that this attribute is the
    OVS_KEY_ATTR_NSH.  We just do a double unwrap with a pair of nla_data()
    calls - first time directly while calling validate_push_nsh() and the
    second time as part of the nla_for_each_nested() macro, which isn't
    safe, potentially causing invalid memory access if the size of this
    attribute is incorrect.  The failure may not be noticed during
    validation due to larger netlink buffer, but cause trouble later during
    action execution where the buffer is allocated exactly to the size:
    
     BUG: KASAN: slab-out-of-bounds in nsh_hdr_from_nlattr+0x1dd/0x6a0 [openvswitch]
     Read of size 184 at addr ffff88816459a634 by task a.out/22624
    
     CPU: 8 UID: 0 PID: 22624 6.18.0-rc7+ #115 PREEMPT(voluntary)
     Call Trace:
      <TASK>
      dump_stack_lvl+0x51/0x70
      print_address_description.constprop.0+0x2c/0x390
      kasan_report+0xdd/0x110
      kasan_check_range+0x35/0x1b0
      __asan_memcpy+0x20/0x60
      nsh_hdr_from_nlattr+0x1dd/0x6a0 [openvswitch]
      push_nsh+0x82/0x120 [openvswitch]
      do_execute_actions+0x1405/0x2840 [openvswitch]
      ovs_execute_actions+0xd5/0x3b0 [openvswitch]
      ovs_packet_cmd_execute+0x949/0xdb0 [openvswitch]
      genl_family_rcv_msg_doit+0x1d6/0x2b0
      genl_family_rcv_msg+0x336/0x580
      genl_rcv_msg+0x9f/0x130
      netlink_rcv_skb+0x11f/0x370
      genl_rcv+0x24/0x40
      netlink_unicast+0x73e/0xaa0
      netlink_sendmsg+0x744/0xbf0
      __sys_sendto+0x3d6/0x450
      do_syscall_64+0x79/0x2c0
      entry_SYSCALL_64_after_hwframe+0x76/0x7e
      </TASK>
    
    Let's add some checks that the attribute is properly sized and it's
    the only one attribute inside the action.  Technically, there is no
    real reason for OVS_KEY_ATTR_NSH to be there, as we know that we're
    pushing an NSH header already, it just creates extra nesting, but
    that's how uAPI works today.  So, keeping as it is.
    
    Fixes: b2d0f5d5dc53 ("openvswitch: enable NSH support")
    Reported-by: Junvy Yang <[email protected]>
    Signed-off-by: Ilya Maximets <[email protected]>
    Acked-by: Eelco Chaudron [email protected]
    Reviewed-by: Aaron Conole <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: phy: marvell-88q2xxx: Fix clamped value in mv88q2xxx_hwmon_write [+ + +]

Author: Thorsten Blum <[email protected]>
Date:   Tue Dec 2 18:27:44 2025 +0100

    net: phy: marvell-88q2xxx: Fix clamped value in mv88q2xxx_hwmon_write
    
    commit c4cdf7376271bce5714c06d79ec67759b18910eb upstream.
    
    The local variable 'val' was never clamped to -75000 or 180000 because
    the return value of clamp_val() was not used. Fix this by assigning the
    clamped value back to 'val', and use clamp() instead of clamp_val().
    
    Cc: [email protected]
    Fixes: a557a92e6881 ("net: phy: marvell-88q2xxx: add support for temperature sensor")
    Signed-off-by: Thorsten Blum <[email protected]>
    Reviewed-by: Dimitri Fedrau <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: rose: fix invalid array index in rose_kill_by_device() [+ + +]

Author: Pwnverse <[email protected]>
Date:   Mon Dec 22 21:22:27 2025 +0000

    net: rose: fix invalid array index in rose_kill_by_device()
    
    [ Upstream commit 6595beb40fb0ec47223d3f6058ee40354694c8e4 ]
    
    rose_kill_by_device() collects sockets into a local array[] and then
    iterates over them to disconnect sockets bound to a device being brought
    down.
    
    The loop mistakenly indexes array[cnt] instead of array[i]. For cnt <
    ARRAY_SIZE(array), this reads an uninitialized entry; for cnt ==
    ARRAY_SIZE(array), it is an out-of-bounds read. Either case can lead to
    an invalid socket pointer dereference and also leaks references taken
    via sock_hold().
    
    Fix the index to use i.
    
    Fixes: 64b8bc7d5f143 ("net/rose: fix races in rose_kill_by_device()")
    Co-developed-by: Fatma Alwasmi <[email protected]>
    Signed-off-by: Fatma Alwasmi <[email protected]>
    Signed-off-by: Pwnverse <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: stmmac: fix the crash issue for zero copy XDP_TX action [+ + +]

Author: Wei Fang <[email protected]>
Date:   Thu Dec 4 15:13:32 2025 +0800

    net: stmmac: fix the crash issue for zero copy XDP_TX action
    
    [ Upstream commit a48e232210009be50591fdea8ba7c07b0f566a13 ]
    
    There is a crash issue when running zero copy XDP_TX action, the crash
    log is shown below.
    
    [  216.122464] Unable to handle kernel paging request at virtual address fffeffff80000000
    [  216.187524] Internal error: Oops: 0000000096000144 [#1]  SMP
    [  216.301694] Call trace:
    [  216.304130]  dcache_clean_poc+0x20/0x38 (P)
    [  216.308308]  __dma_sync_single_for_device+0x1bc/0x1e0
    [  216.313351]  stmmac_xdp_xmit_xdpf+0x354/0x400
    [  216.317701]  __stmmac_xdp_run_prog+0x164/0x368
    [  216.322139]  stmmac_napi_poll_rxtx+0xba8/0xf00
    [  216.326576]  __napi_poll+0x40/0x218
    [  216.408054] Kernel panic - not syncing: Oops: Fatal exception in interrupt
    
    For XDP_TX action, the xdp_buff is converted to xdp_frame by
    xdp_convert_buff_to_frame(). The memory type of the resulting xdp_frame
    depends on the memory type of the xdp_buff. For page pool based xdp_buff
    it produces xdp_frame with memory type MEM_TYPE_PAGE_POOL. For zero copy
    XSK pool based xdp_buff it produces xdp_frame with memory type
    MEM_TYPE_PAGE_ORDER0. However, stmmac_xdp_xmit_back() does not check the
    memory type and always uses the page pool type, this leads to invalid
    mappings and causes the crash. Therefore, check the xdp_buff memory type
    in stmmac_xdp_xmit_back() to fix this issue.
    
    Fixes: bba2556efad6 ("net: stmmac: Enable RX via AF_XDP zero-copy")
    Signed-off-by: Wei Fang <[email protected]>
    Reviewed-by: Hariprasad Kelam <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: asix: validate PHY address before use [+ + +]

Author: Deepanshu Kartikey <[email protected]>
Date:   Thu Dec 18 06:41:56 2025 +0530

    net: usb: asix: validate PHY address before use
    
    [ Upstream commit a1e077a3f76eea0dc671ed6792e7d543946227e8 ]
    
    The ASIX driver reads the PHY address from the USB device via
    asix_read_phy_addr(). A malicious or faulty device can return an
    invalid address (>= PHY_MAX_ADDR), which causes a warning in
    mdiobus_get_phy():
    
      addr 207 out of range
      WARNING: drivers/net/phy/mdio_bus.c:76
    
    Validate the PHY address in asix_read_phy_addr() and remove the
    now-redundant check in ax88172a.c.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=3d43c9066a5b54902232
    Tested-by: [email protected]
    Fixes: 7e88b11a862a ("net: usb: asix: refactor asix_read_phy_addr() and handle errors on return")
    Link: https://lore.kernel.org/all/[email protected]/T/ [v1]
    Signed-off-by: Deepanshu Kartikey <[email protected]>
    Reviewed-by: Andrew Lunn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: rtl8150: fix memory leak on usb_submit_urb() failure [+ + +]

Author: Deepakkumar Karn <[email protected]>
Date:   Tue Dec 16 20:43:05 2025 +0530

    net: usb: rtl8150: fix memory leak on usb_submit_urb() failure
    
    [ Upstream commit 12cab1191d9890097171156d06bfa8d31f1e39c8 ]
    
    In async_set_registers(), when usb_submit_urb() fails, the allocated
      async_req structure and URB are not freed, causing a memory leak.
    
      The completion callback async_set_reg_cb() is responsible for freeing
      these allocations, but it is only called after the URB is successfully
      submitted and completes (successfully or with error). If submission
      fails, the callback never runs and the memory is leaked.
    
      Fix this by freeing both the URB and the request structure in the error
      path when usb_submit_urb() fails.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=8dd915c7cb0490fc8c52
    Fixes: 4d12997a9bb3 ("drivers: net: usb: rtl8150: concurrent URB bugfix")
    Signed-off-by: Deepakkumar Karn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: usb: sr9700: fix incorrect command used to write single register [+ + +]

Author: Ethan Nelson-Moore <[email protected]>
Date:   Sun Dec 21 00:24:00 2025 -0800

    net: usb: sr9700: fix incorrect command used to write single register
    
    commit fa0b198be1c6775bc7804731a43be5d899d19e7a upstream.
    
    This fixes the device failing to initialize with "error reading MAC
    address" for me, probably because the incorrect write of NCR_RST to
    SR_NCR is not actually resetting the device.
    
    Fixes: c9b37458e95629b1d1171457afdcc1bf1eb7881d ("USB2NET : SR9700 : One chip USB 1.1 USB2NET SR9700Device Driver Support")
    Cc: [email protected]
    Signed-off-by: Ethan Nelson-Moore <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net: use dst_dev_rcu() in sk_setup_caps() [+ + +]

Author: Eric Dumazet <[email protected]>
Date:   Fri Jan 2 12:37:26 2026 -0800

    net: use dst_dev_rcu() in sk_setup_caps()
    
    [ Upstream commit 99a2ace61b211b0be861b07fbaa062fca4b58879 ]
    
    Use RCU to protect accesses to dst->dev from sk_setup_caps()
    and sk_dst_gso_max_size().
    
    Also use dst_dev_rcu() in ip6_dst_mtu_maybe_forward(),
    and ip_dst_mtu_maybe_forward().
    
    ip4_dst_hoplimit() can use dst_dev_net_rcu().
    
    Fixes: 4a6ce2b6f2ec ("net: introduce a new function dst_dev_put()")
    Signed-off-by: Eric Dumazet <[email protected]>
    Reviewed-by: David Ahern <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    [Harshit: Backport to 6.12.y, resolve conflict due to missing commit:
    22d6c9eebf2e ("net: Unexport shared functions for DCCP.")  in 6.12.y]
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

netfilter: nf_conncount: fix leaked ct in error paths [+ + +]

Author: Fernando Fernandez Mancera <[email protected]>
Date:   Fri Dec 5 12:58:01 2025 +0100

    netfilter: nf_conncount: fix leaked ct in error paths
    
    [ Upstream commit 2e2a720766886190a6d35c116794693aabd332b6 ]
    
    There are some situations where ct might be leaked as error paths are
    skipping the refcounted check and return immediately. In order to solve
    it make sure that the check is always called.
    
    Fixes: be102eb6a0e7 ("netfilter: nf_conncount: rework API to use sk_buff directly")
    Signed-off-by: Fernando Fernandez Mancera <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_nat: remove bogus direction check [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Mon Dec 8 16:00:34 2025 +0100

    netfilter: nf_nat: remove bogus direction check
    
    [ Upstream commit 5ec8ca26fe93103577c904644b0957f069d0051a ]
    
    Jakub reports spurious failures of the 'conntrack_reverse_clash.sh'
    selftest.  A bogus test makes nat core resort to port rewrite even
    though there is no need for this.
    
    When the test is made, nf_nat_used_tuple() would already have caused us
    to return if no other CPU had added a colliding entry.
    Moreover, nf_nat_used_tuple() would have ignored the colliding entry if
    their origin tuples had been the same.
    
    All that is left to check is if the colliding entry in the hash table
    is subject to NAT, and, if its not, if our entry matches in the reverse
    direction, e.g. hash table has
    
    addr1:1234 -> addr2:80, and we want to commit
    addr2:80   -> addr1:1234.
    
    Because we already checked that neither the new nor the committed entry is
    subject to NAT we only have to check origin vs. reply tuple:
    for non-nat entries, the reply tuple is always the inverted original.
    
    Just in case there are more problems extend the error reporting
    in the selftest while at it and dump conntrack table/stats on error.
    
    Reported-by: Jakub Kicinski <[email protected]>
    Closes: https://lore.kernel.org/netdev/[email protected]/
    Fixes: d8f84a9bc7c4 ("netfilter: nf_nat: don't try nat source port reallocation for reverse dir clash")
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nf_tables: remove redundant chain validation on register store [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Wed Nov 19 13:42:05 2025 +0100

    netfilter: nf_tables: remove redundant chain validation on register store
    
    [ Upstream commit a67fd55f6a09f4119b7232c19e0f348fe31ab0db ]
    
    This validation predates the introduction of the state machine that
    determines when to enter slow path validation for error reporting.
    
    Currently, table validation is perform when:
    
    - new rule contains expressions that need validation.
    - new set element with jump/goto verdict.
    
    Validation on register store skips most checks with no basechains, still
    this walks the graph searching for loops and ensuring expressions are
    called from the right hook. Remove this.
    
    Fixes: a654de8fdc18 ("netfilter: nf_tables: fix chain dependency validation")
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nft_ct: add seqadj extension for natted connections [+ + +]

Author: Andrii Melnychenko <[email protected]>
Date:   Fri Jan 2 12:37:21 2026 -0800

    netfilter: nft_ct: add seqadj extension for natted connections
    
    [ Upstream commit 90918e3b6404c2a37837b8f11692471b4c512de2 ]
    
    Sequence adjustment may be required for FTP traffic with PASV/EPSV modes.
    due to need to re-write packet payload (IP, port) on the ftp control
    connection. This can require changes to the TCP length and expected
    seq / ack_seq.
    
    The easiest way to reproduce this issue is with PASV mode.
    Example ruleset:
    table inet ftp_nat {
            ct helper ftp_helper {
                    type "ftp" protocol tcp
                    l3proto inet
            }
    
            chain prerouting {
                    type filter hook prerouting priority 0; policy accept;
                    tcp dport 21 ct state new ct helper set "ftp_helper"
            }
    }
    table ip nat {
            chain prerouting {
                    type nat hook prerouting priority -100; policy accept;
                    tcp dport 21 dnat ip prefix to ip daddr map {
                            192.168.100.1 : 192.168.13.2/32 }
            }
    
            chain postrouting {
                    type nat hook postrouting priority 100 ; policy accept;
                    tcp sport 21 snat ip prefix to ip saddr map {
                            192.168.13.2 : 192.168.100.1/32 }
            }
    }
    
    Note that the ftp helper gets assigned *after* the dnat setup.
    
    The inverse (nat after helper assign) is handled by an existing
    check in nf_nat_setup_info() and will not show the problem.
    
    Topoloy:
    
     +-------------------+     +----------------------------------+
     | FTP: 192.168.13.2 | <-> | NAT: 192.168.13.3, 192.168.100.1 |
     +-------------------+     +----------------------------------+
                                          |
                             +-----------------------+
                             | Client: 192.168.100.2 |
                             +-----------------------+
    
    ftp nat changes do not work as expected in this case:
    Connected to 192.168.100.1.
    [..]
    ftp> epsv
    EPSV/EPRT on IPv4 off.
    ftp> ls
    227 Entering passive mode (192,168,100,1,209,129).
    421 Service not available, remote server has closed connection.
    
    Kernel logs:
    Missing nfct_seqadj_ext_add() setup call
    WARNING: CPU: 1 PID: 0 at net/netfilter/nf_conntrack_seqadj.c:41
    [..]
     __nf_nat_mangle_tcp_packet+0x100/0x160 [nf_nat]
     nf_nat_ftp+0x142/0x280 [nf_nat_ftp]
     help+0x4d1/0x880 [nf_conntrack_ftp]
     nf_confirm+0x122/0x2e0 [nf_conntrack]
     nf_hook_slow+0x3c/0xb0
     ..
    
    Fix this by adding the required extension when a conntrack helper is assigned
    to a connection that has a nat binding.
    
    Fixes: 1a64edf54f55 ("netfilter: nft_ct: add helper set support")
    Signed-off-by: Andrii Melnychenko <[email protected]>
    Signed-off-by: Florian Westphal <[email protected]>
    [Harshit: Clean cherry-pick, apply it to stable-6.12.y]
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

netrom: Fix memory leak in nr_sendmsg() [+ + +]

Author: Wang Liang <[email protected]>
Date:   Sat Nov 29 12:13:15 2025 +0800

    netrom: Fix memory leak in nr_sendmsg()
    
    [ Upstream commit 613d12dd794e078be8ff3cf6b62a6b9acf7f4619 ]
    
    syzbot reported a memory leak [1].
    
    When function sock_alloc_send_skb() return NULL in nr_output(), the
    original skb is not freed, which was allocated in nr_sendmsg(). Fix this
    by freeing it before return.
    
    [1]
    BUG: memory leak
    unreferenced object 0xffff888129f35500 (size 240):
      comm "syz.0.17", pid 6119, jiffies 4294944652
      hex dump (first 32 bytes):
        00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
        00 00 00 00 00 00 00 00 00 10 52 28 81 88 ff ff  ..........R(....
      backtrace (crc 1456a3e4):
        kmemleak_alloc_recursive include/linux/kmemleak.h:44 [inline]
        slab_post_alloc_hook mm/slub.c:4983 [inline]
        slab_alloc_node mm/slub.c:5288 [inline]
        kmem_cache_alloc_node_noprof+0x36f/0x5e0 mm/slub.c:5340
        __alloc_skb+0x203/0x240 net/core/skbuff.c:660
        alloc_skb include/linux/skbuff.h:1383 [inline]
        alloc_skb_with_frags+0x69/0x3f0 net/core/skbuff.c:6671
        sock_alloc_send_pskb+0x379/0x3e0 net/core/sock.c:2965
        sock_alloc_send_skb include/net/sock.h:1859 [inline]
        nr_sendmsg+0x287/0x450 net/netrom/af_netrom.c:1105
        sock_sendmsg_nosec net/socket.c:727 [inline]
        __sock_sendmsg net/socket.c:742 [inline]
        sock_write_iter+0x293/0x2a0 net/socket.c:1195
        new_sync_write fs/read_write.c:593 [inline]
        vfs_write+0x45d/0x710 fs/read_write.c:686
        ksys_write+0x143/0x170 fs/read_write.c:738
        do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
        do_syscall_64+0xa4/0xfa0 arch/x86/entry/syscall_64.c:94
        entry_SYSCALL_64_after_hwframe+0x77/0x7f
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=d7abc36bbbb6d7d40b58
    Tested-by: [email protected]
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Signed-off-by: Wang Liang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfc: pn533: Fix error code in pn533_acr122_poweron_rdr() [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Tue Dec 9 09:56:39 2025 +0300

    nfc: pn533: Fix error code in pn533_acr122_poweron_rdr()
    
    [ Upstream commit 885bebac9909994050bbbeed0829c727e42bd1b7 ]
    
    Set the error code if "transferred != sizeof(cmd)" instead of
    returning success.
    
    Fixes: dbafc28955fa ("NFC: pn533: don't send USB data off of the stack")
    Signed-off-by: Dan Carpenter <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

NFSD: Clear SECLABEL in the suppattr_exclcreat bitmap [+ + +]

Author: Chuck Lever <[email protected]>
Date:   Mon Nov 17 11:00:49 2025 -0500

    NFSD: Clear SECLABEL in the suppattr_exclcreat bitmap
    
    commit 27d17641cacfedd816789b75d342430f6b912bd2 upstream.
    
    >From RFC 8881:
    
    5.8.1.14. Attribute 75: suppattr_exclcreat
    
    > The bit vector that would set all REQUIRED and RECOMMENDED
    > attributes that are supported by the EXCLUSIVE4_1 method of file
    > creation via the OPEN operation. The scope of this attribute
    > applies to all objects with a matching fsid.
    
    There's nothing in RFC 8881 that states that suppattr_exclcreat is
    or is not allowed to contain bits for attributes that are clear in
    the reported supported_attrs bitmask. But it doesn't make sense for
    an NFS server to indicate that it /doesn't/ implement an attribute,
    but then also indicate that clients /are/ allowed to set that
    attribute using OPEN(create) with EXCLUSIVE4_1.
    
    Ensure that the SECURITY_LABEL and ACL bits are not set in the
    suppattr_exclcreat bitmask when they are also not set in the
    supported_attrs bitmask.
    
    Fixes: 8c18f2052e75 ("nfsd41: SUPPATTR_EXCLCREAT attribute")
    Cc: [email protected]
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfsd: Drop the client reference in client_states_open() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Sat Dec 6 15:38:42 2025 +0800

    nfsd: Drop the client reference in client_states_open()
    
    commit 1f941b2c23fd34c6f3b76d36f9d0a2528fa92b8f upstream.
    
    In error path, call drop_client() to drop the reference
    obtained by get_nfsdfs_clp().
    
    Fixes: 78599c42ae3c ("nfsd4: add file to display list of client's opens")
    Cc: [email protected]
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Haoxiang Li <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfsd: fix memory leak in nfsd_create_serv error paths [+ + +]

Author: Shardul Bankar <[email protected]>
Date:   Mon Nov 17 17:41:21 2025 +0530

    nfsd: fix memory leak in nfsd_create_serv error paths
    
    [ Upstream commit df8d829bba3adcf3cc744c01d933b6fd7cf06e91 ]
    
    When nfsd_create_serv() calls percpu_ref_init() to initialize
    nn->nfsd_net_ref, it allocates both a percpu reference counter
    and a percpu_ref_data structure (64 bytes). However, if the
    function fails later due to svc_create_pooled() returning NULL
    or svc_bind() returning an error, these allocations are not
    cleaned up, resulting in a memory leak.
    
    The leak manifests as:
    - Unreferenced percpu allocation (8 bytes per CPU)
    - Unreferenced percpu_ref_data structure (64 bytes)
    
    Fix this by adding percpu_ref_exit() calls in both error paths
    to properly clean up the percpu_ref_init() allocations.
    
    This patch fixes the percpu_ref leak in nfsd_create_serv() seen
    as an auxiliary leak in syzbot report 099461f8558eb0a1f4f3; the
    prepare_creds() and vsock-related leaks in the same report
    remain to be addressed separately.
    
    Reported-by: [email protected]
    Link: https://syzkaller.appspot.com/bug?extid=099461f8558eb0a1f4f3
    Fixes: 47e988147f40 ("nfsd: add nfsd_serv_try_get and nfsd_serv_put")
    Signed-off-by: Shardul Bankar <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfsd: Mark variable __maybe_unused to avoid W=1 build break [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Thu Nov 13 09:31:31 2025 +0100

    nfsd: Mark variable __maybe_unused to avoid W=1 build break
    
    commit ebae102897e760e9e6bc625f701dd666b2163bd1 upstream.
    
    Clang is not happy about set but (in some cases) unused variable:
    
    fs/nfsd/export.c:1027:17: error: variable 'inode' set but not used [-Werror,-Wunused-but-set-variable]
    
    since it's used as a parameter to dprintk() which might be configured
    a no-op. To avoid uglifying code with the specific ifdeffery just mark
    the variable __maybe_unused.
    
    The commit [1], which introduced this behaviour, is quite old and hence
    the Fixes tag points to the first of the Git era.
    
    Link: https://git.kernel.org/pub/scm/linux/kernel/git/history/history.git/commit/?id=0431923fb7a1 [1]
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Andy Shevchenko <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSD: NFSv4 file creation neglects setting ACL [+ + +]

Author: Chuck Lever <[email protected]>
Date:   Tue Nov 18 19:51:19 2025 -0500

    NFSD: NFSv4 file creation neglects setting ACL
    
    commit 913f7cf77bf14c13cfea70e89bcb6d0b22239562 upstream.
    
    An NFSv4 client that sets an ACL with a named principal during file
    creation retrieves the ACL afterwards, and finds that it is only a
    default ACL (based on the mode bits) and not the ACL that was
    requested during file creation. This violates RFC 8881 section
    6.4.1.3: "the ACL attribute is set as given".
    
    The issue occurs in nfsd_create_setattr(), which calls
    nfsd_attrs_valid() to determine whether to call nfsd_setattr().
    However, nfsd_attrs_valid() checks only for iattr changes and
    security labels, but not POSIX ACLs. When only an ACL is present,
    the function returns false, nfsd_setattr() is skipped, and the
    POSIX ACL is never applied to the inode.
    
    Subsequently, when the client retrieves the ACL, the server finds
    no POSIX ACL on the inode and returns one generated from the file's
    mode bits rather than returning the originally-specified ACL.
    
    Reported-by: Aurélien Couderc <[email protected]>
    Fixes: c0cbe70742f4 ("NFSD: add posix ACLs to struct nfsd_attrs")
    Cc: Roland Mainz <[email protected]>
    Cc: [email protected]
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nfsd: rename nfsd_serv_ prefixed methods and variables with nfsd_net_ [+ + +]

Author: Mike Snitzer <[email protected]>
Date:   Fri Nov 15 20:40:59 2024 -0500

    nfsd: rename nfsd_serv_ prefixed methods and variables with nfsd_net_
    
    [ Upstream commit b33f7dec3a67216123312c7bb752b8f6faa1c465 ]
    
    Also update Documentation/filesystems/nfs/localio.rst accordingly
    and reduce the technical documentation debt that was previously
    captured in that document.
    
    Signed-off-by: Mike Snitzer <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Acked-by: Chuck Lever <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Stable-dep-of: df8d829bba3a ("nfsd: fix memory leak in nfsd_create_serv error paths")
    Signed-off-by: Sasha Levin <[email protected]>

nfsd: update percpu_ref to manage references on nfsd_net [+ + +]

Author: Mike Snitzer <[email protected]>
Date:   Fri Nov 15 20:40:58 2024 -0500

    nfsd: update percpu_ref to manage references on nfsd_net
    
    [ Upstream commit 39972494e318a21b3059287909fc090186dbe60a ]
    
    Holding a reference on nfsd_net is what is required, it was never
    actually about ensuring nn->nfsd_serv available.
    
    Move waiting for outstanding percpu references from
    nfsd_destroy_serv() to nfsd_shutdown_net().
    
    By moving it later it will be possible to invalidate localio clients
    during nfsd_file_cache_shutdown_net() via __nfsd_file_cache_purge().
    
    Signed-off-by: Mike Snitzer <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Acked-by: Chuck Lever <[email protected]>
    Signed-off-by: Anna Schumaker <[email protected]>
    Stable-dep-of: df8d829bba3a ("nfsd: fix memory leak in nfsd_create_serv error paths")
    Signed-off-by: Sasha Levin <[email protected]>

NFSD: use correct reservation type in nfsd4_scsi_fence_client [+ + +]

Author: Dai Ngo <[email protected]>
Date:   Wed Nov 5 12:45:54 2025 -0800

    NFSD: use correct reservation type in nfsd4_scsi_fence_client
    
    commit 6f52063db9aabdaabea929b1e998af98c2e8d917 upstream.
    
    The reservation type argument for the pr_preempt call should match the
    one used in nfsd4_block_get_device_info_scsi.
    
    Fixes: f99d4fbdae67 ("nfsd: add SCSI layout support")
    Cc: [email protected]
    Signed-off-by: Dai Ngo <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ntfs: Do not overwrite uptodate pages [+ + +]

Author: Matthew Wilcox (Oracle) <[email protected]>
Date:   Fri Jul 18 20:53:58 2025 +0100

    ntfs: Do not overwrite uptodate pages
    
    commit 68f6bd128e75a032432eda9d16676ed2969a1096 upstream.
    
    When reading a compressed file, we may read several pages in addition to
    the one requested.  The current code will overwrite pages in the page
    cache with the data from disc which can definitely result in changes
    that have been made being lost.
    
    For example if we have four consecutie pages ABCD in the file compressed
    into a single extent, on first access, we'll bring in ABCD.  Then we
    write to page B.  Memory pressure results in the eviction of ACD.
    When we attempt to write to page C, we will overwrite the data in page
    B with the data currently on disk.
    
    I haven't investigated the decompression code to check whether it's
    OK to overwrite a clean page or whether it might be possible to see
    corrupt data.  Out of an abundance of caution, decline to overwrite
    uptodate pages, not just dirty pages.
    
    Fixes: 4342306f0f0d (fs/ntfs3: Add file operations and implementation)
    Signed-off-by: Matthew Wilcox (Oracle) <[email protected]>
    Cc: [email protected]
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ntfs: set dummy blocksize to read boot_block when mounting [+ + +]

Author: Pedro Demarchi Gomes <[email protected]>
Date:   Fri Oct 3 12:38:50 2025 -0300

    ntfs: set dummy blocksize to read boot_block when mounting
    
    [ Upstream commit d1693a7d5a38acf6424235a6070bcf5b186a360d ]
    
    When mounting, sb->s_blocksize is used to read the boot_block without
    being defined or validated. Set a dummy blocksize before attempting to
    read the boot_block.
    
    The issue can be triggered with the following syz reproducer:
    
      mkdirat(0xffffffffffffff9c, &(0x7f0000000080)='./file1\x00', 0x0)
      r4 = openat$nullb(0xffffffffffffff9c, &(0x7f0000000040), 0x121403, 0x0)
      ioctl$FS_IOC_SETFLAGS(r4, 0x40081271, &(0x7f0000000980)=0x4000)
      mount(&(0x7f0000000140)=@nullb, &(0x7f0000000040)='./cgroup\x00',
            &(0x7f0000000000)='ntfs3\x00', 0x2208004, 0x0)
      syz_clone(0x88200200, 0x0, 0x0, 0x0, 0x0, 0x0)
    
    Here, the ioctl sets the bdev block size to 16384. During mount,
    get_tree_bdev_flags() calls sb_set_blocksize(sb, block_size(bdev)),
    but since block_size(bdev) > PAGE_SIZE, sb_set_blocksize() leaves
    sb->s_blocksize at zero.
    
    Later, ntfs_init_from_boot() attempts to read the boot_block while
    sb->s_blocksize is still zero, which triggers the bug.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=f4f84b57a01d6b8364ad
    Signed-off-by: Pedro Demarchi Gomes <[email protected]>
    [[email protected]: changed comment style, added
    return value handling]
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nvme-fabrics: add ENOKEY to no retry criteria for authentication failures [+ + +]

Author: Justin Tee <[email protected]>
Date:   Mon Nov 17 10:43:43 2025 -0800

    nvme-fabrics: add ENOKEY to no retry criteria for authentication failures
    
    [ Upstream commit 13989207ee29c40501e719512e8dc90768325895 ]
    
    With authentication, in addition to EKEYREJECTED there is also no point in
    retrying reconnects when status is ENOKEY.  Thus, add -ENOKEY as another
    criteria to determine when to stop retries.
    
    Cc: Daniel Wagner <[email protected]>
    Cc: Hannes Reinecke <[email protected]>
    Closes: https://lore.kernel.org/linux-nvme/[email protected]/
    Signed-off-by: Justin Tee <[email protected]>
    Tested-by: Daniel Wagner <[email protected]>
    Reviewed-by: Daniel Wagner <[email protected]>
    Reviewed-by: Hannes Reinecke <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nvme-fc: don't hold rport lock when putting ctrl [+ + +]

Author: Daniel Wagner <[email protected]>
Date:   Thu Oct 30 11:05:45 2025 +0100

    nvme-fc: don't hold rport lock when putting ctrl
    
    [ Upstream commit b71cbcf7d170e51148d5467820ae8a72febcb651 ]
    
    nvme_fc_ctrl_put can acquire the rport lock when freeing the
    ctrl object:
    
    nvme_fc_ctrl_put
      nvme_fc_ctrl_free
        spin_lock_irqsave(rport->lock)
    
    Thus we can't hold the rport lock when calling nvme_fc_ctrl_put.
    
    Justin suggested use the safe list iterator variant because
    nvme_fc_ctrl_put will also modify the rport->list.
    
    Cc: Justin Tee <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Daniel Wagner <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ocfs2: fix kernel BUG in ocfs2_find_victim_chain [+ + +]

Author: Prithvi Tambewagh <[email protected]>
Date:   Mon Dec 1 18:37:11 2025 +0530

    ocfs2: fix kernel BUG in ocfs2_find_victim_chain
    
    commit 039bef30e320827bac8990c9f29d2a68cd8adb5f upstream.
    
    syzbot reported a kernel BUG in ocfs2_find_victim_chain() because the
    `cl_next_free_rec` field of the allocation chain list (next free slot in
    the chain list) is 0, triggring the BUG_ON(!cl->cl_next_free_rec)
    condition in ocfs2_find_victim_chain() and panicking the kernel.
    
    To fix this, an if condition is introduced in ocfs2_claim_suballoc_bits(),
    just before calling ocfs2_find_victim_chain(), the code block in it being
    executed when either of the following conditions is true:
    
    1. `cl_next_free_rec` is equal to 0, indicating that there are no free
    chains in the allocation chain list
    2. `cl_next_free_rec` is greater than `cl_count` (the total number of
    chains in the allocation chain list)
    
    Either of them being true is indicative of the fact that there are no
    chains left for usage.
    
    This is addressed using ocfs2_error(), which prints
    the error log for debugging purposes, rather than panicking the kernel.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Prithvi Tambewagh <[email protected]>
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=96d38c6e1655c1420a72
    Tested-by: [email protected]
    Reviewed-by: Joseph Qi <[email protected]>
    Cc: Mark Fasheh <[email protected]>
    Cc: Joel Becker <[email protected]>
    Cc: Junxiao Bi <[email protected]>
    Cc: Changwei Ge <[email protected]>
    Cc: Jun Piao <[email protected]>
    Cc: Heming Zhao <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

octeontx2-pf: fix "UBSAN: shift-out-of-bounds error" [+ + +]

Author: Anshumali Gaur <[email protected]>
Date:   Fri Dec 19 11:52:26 2025 +0530

    octeontx2-pf: fix "UBSAN: shift-out-of-bounds error"
    
    [ Upstream commit 85f4b0c650d9f9db10bda8d3acfa1af83bf78cf7 ]
    
    This patch ensures that the RX ring size (rx_pending) is not
    set below the permitted length. This avoids UBSAN
    shift-out-of-bounds errors when users passes small or zero
    ring sizes via ethtool -G.
    
    Fixes: d45d8979840d ("octeontx2-pf: Add basic ethtool support")
    Signed-off-by: Anshumali Gaur <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

parisc: Do not reprogram affinitiy on ASP chip [+ + +]

Author: Helge Deller <[email protected]>
Date:   Tue Nov 25 15:23:02 2025 +0100

    parisc: Do not reprogram affinitiy on ASP chip
    
    commit dca7da244349eef4d78527cafc0bf80816b261f5 upstream.
    
    The ASP chip is a very old variant of the GSP chip and is used e.g. in
    HP 730 workstations. When trying to reprogram the affinity it will crash
    with a HPMC as the relevant registers don't seem to be at the usual
    location.  Let's avoid the crash by checking the sversion. Also note,
    that reprogramming isn't necessary either, as the HP730 is a just a
    single-CPU machine.
    
    Signed-off-by: Helge Deller <[email protected]>
    Cc: [email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

parisc: entry.S: fix space adjustment on interruption for 64-bit userspace [+ + +]

Author: Sven Schnelle <[email protected]>
Date:   Thu Oct 30 08:56:05 2025 +0100

    parisc: entry.S: fix space adjustment on interruption for 64-bit userspace
    
    commit 1aa4524c0c1b54842c4c0a370171d11b12d0709b upstream.
    
    In wide mode, the IASQ contain the upper part of the GVA
    during interruption. This needs to be reversed before
    the space is used - otherwise it contains parts of IAOQ.
    See Page 2-13 "Processing Resources / Interruption Instruction
    Address Queues" in the Parisc 2.0 Architecture Manual page 2-13
    for an explanation.
    
    The IAOQ/IASQ space_adjust was skipped for other interruptions
    than itlb misses. However, the code in handle_interruption()
    checks whether iasq[0] contains a valid space. Due to the not
    masked out bits this match failed and the process was killed.
    
    Also add space_adjust for IAOQ1/IASQ1 so ptregs contains sane values.
    
    Signed-off-by: Sven Schnelle <[email protected]>
    Cc: [email protected] # v6.0+
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

parisc: entry: set W bit for !compat tasks in syscall_restore_rfi() [+ + +]

Author: Sven Schnelle <[email protected]>
Date:   Wed Oct 15 23:21:41 2025 +0200

    parisc: entry: set W bit for !compat tasks in syscall_restore_rfi()
    
    commit 5fb1d3ce3e74a4530042795e1e065422295f1371 upstream.
    
    When the kernel leaves to userspace via syscall_restore_rfi(), the
    W bit is not set in the new PSW. This doesn't cause any problems
    because there's no 64 bit userspace for parisc. Simple static binaries
    are usually loaded at addresses way below the 32 bit limit so the W bit
    doesn't matter.
    
    Fix this by setting the W bit when TIF_32BIT is not set.
    
    Signed-off-by: Sven Schnelle <[email protected]>
    Cc: [email protected]
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PCI/PM: Reinstate clearing state_saved in legacy and !PM codepaths [+ + +]

Author: Lukas Wunner <[email protected]>
Date:   Wed Nov 19 09:50:01 2025 +0100

    PCI/PM: Reinstate clearing state_saved in legacy and !PM codepaths
    
    commit 894f475f88e06c0f352c829849560790dbdedbe5 upstream.
    
    When a PCI device is suspended, it is normally the PCI core's job to save
    Config Space and put the device into a low power state.  However drivers
    are allowed to assume these responsibilities.  When they do, the PCI core
    can tell by looking at the state_saved flag in struct pci_dev:  The flag
    is cleared before commencing the suspend sequence and it is set when
    pci_save_state() is called.  If the PCI core finds the flag set late in
    the suspend sequence, it refrains from calling pci_save_state() itself.
    
    But there are two corner cases where the PCI core neglects to clear the
    flag before commencing the suspend sequence:
    
    * If a driver has legacy PCI PM callbacks, pci_legacy_suspend() neglects
      to clear the flag.  The (stale) flag is subsequently queried by
      pci_legacy_suspend() itself and pci_legacy_suspend_late().
    
    * If a device has no driver or its driver has no PCI PM callbacks,
      pci_pm_freeze() neglects to clear the flag.  The (stale) flag is
      subsequently queried by pci_pm_freeze_noirq().
    
    The flag may be set prior to suspend if the device went through error
    recovery:  Drivers commonly invoke pci_restore_state() + pci_save_state()
    to restore Config Space after reset.
    
    The flag may also be set if drivers call pci_save_state() on probe to
    allow for recovery from subsequent errors.
    
    The result is that pci_legacy_suspend_late() and pci_pm_freeze_noirq()
    don't call pci_save_state() and so the state that will be restored on
    resume is the one recorded on last error recovery or on probe, not the one
    that the device had on suspend.  If the two states happen to be identical,
    there's no problem.
    
    Reinstate clearing the flag in pci_legacy_suspend() and pci_pm_freeze().
    The two functions used to do that until commit 4b77b0a2ba27 ("PCI: Clear
    saved_state after the state has been restored") deemed it unnecessary
    because it assumed that it's sufficient to clear the flag on resume in
    pci_restore_state().  The commit seemingly did not take into account that
    pci_save_state() and pci_restore_state() are not only used by power
    management code, but also for error recovery.
    
    Devices without driver or whose driver has no PCI PM callbacks may be in
    runtime suspend when pci_pm_freeze() is called.  Their state has already
    been saved, so don't clear the flag to skip a pointless pci_save_state()
    in pci_pm_freeze_noirq().
    
    None of the drivers with legacy PCI PM callbacks seem to use runtime PM,
    so clear the flag unconditionally in their case.
    
    Fixes: 4b77b0a2ba27 ("PCI: Clear saved_state after the state has been restored")
    Signed-off-by: Lukas Wunner <[email protected]>
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Reviewed-by: Rafael J. Wysocki (Intel) <[email protected]>
    Cc: [email protected] # v2.6.32+
    Link: https://patch.msgid.link/094f2aad64418710daf0940112abe5a0afdc6bce.1763483367.git.lukas@wunner.de
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PCI: brcmstb: Fix disabling L0s capability [+ + +]

Author: Jim Quinlan <[email protected]>
Date:   Mon Jan 5 12:34:20 2026 -0500

    PCI: brcmstb: Fix disabling L0s capability
    
    [ Upstream commit 9583f9d22991d2cfb5cc59a2552040c4ae98d998 ]
    
    caab002d5069 ("PCI: brcmstb: Disable L0s component of ASPM if requested")
    set PCI_EXP_LNKCAP_ASPM_L1 and (optionally) PCI_EXP_LNKCAP_ASPM_L0S in
    PCI_EXP_LNKCAP (aka PCIE_RC_CFG_PRIV1_LINK_CAPABILITY in brcmstb).
    
    But instead of using PCI_EXP_LNKCAP_ASPM_L1 and PCI_EXP_LNKCAP_ASPM_L0S
    directly, it used PCIE_LINK_STATE_L1 and PCIE_LINK_STATE_L0S, which are
    Linux-created values that only coincidentally matched the PCIe spec.
    b478e162f227 ("PCI/ASPM: Consolidate link state defines") later changed
    them so they no longer matched the PCIe spec, so the bits ended up in the
    wrong place in PCI_EXP_LNKCAP.
    
    Use PCI_EXP_LNKCAP_ASPM_L0S to clear L0s support when there's an
    'aspm-no-l0s' property.  Rely on brcmstb hardware to advertise L0s and/or
    L1 support otherwise.
    
    Fixes: caab002d5069 ("PCI: brcmstb: Disable L0s component of ASPM if requested")
    Reported-by: Bjorn Helgaas <[email protected]>
    Closes: https://lore.kernel.org/linux-pci/20250925194424.GA2197200@bhelgaas
    Signed-off-by: Jim Quinlan <[email protected]>
    [mani: reworded subject and description, added closes tag and CCed stable]
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    [bhelgaas: commit log]
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PCI: brcmstb: Reuse pcie_cfg_data structure [+ + +]

Author: Stanimir Varbanov <[email protected]>
Date:   Mon Jan 5 12:34:18 2026 -0500

    PCI: brcmstb: Reuse pcie_cfg_data structure
    
    [ Upstream commit 10dbedad3c8188ce8b68559d43b7aaee7dafba25 ]
    
    Instead of copying fields from the pcie_cfg_data structure to
    brcm_pcie, reference it directly.
    
    Signed-off-by: Stanimir Varbanov <[email protected]>
    Reviewed-by: Florian Fainelil <[email protected]>
    Reviewed-by: Jim Quinlan <[email protected]>
    Tested-by: Ivan T. Ivanov <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [kwilczynski: commit log]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Stable-dep-of: 9583f9d22991 ("PCI: brcmstb: Fix disabling L0s capability")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PCI: brcmstb: Set MLW based on "num-lanes" DT property if present [+ + +]

Author: Jim Quinlan <[email protected]>
Date:   Mon Jan 5 12:34:19 2026 -0500

    PCI: brcmstb: Set MLW based on "num-lanes" DT property if present
    
    [ Upstream commit a364d10ffe361fb34c3838d33604da493045de1e ]
    
    By default, the driver relies on the default hardware defined value for the
    Max Link Width (MLW) capability. But if the "num-lanes" DT property is
    present, assume that the chip's default capability information is incorrect
    or undesired, and use the specified value instead.
    
    Signed-off-by: Jim Quinlan <[email protected]>
    [mani: reworded the description and comments]
    Signed-off-by: Manivannan Sadhasivam <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Stable-dep-of: 9583f9d22991 ("PCI: brcmstb: Fix disabling L0s capability")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf/x86/amd/uncore: Fix the return value of amd_uncore_df_event_init() on error [+ + +]

Author: Sandipan Das <[email protected]>
Date:   Tue Dec 9 13:56:38 2025 +0530

    perf/x86/amd/uncore: Fix the return value of amd_uncore_df_event_init() on error
    
    commit 01439286514ce9d13b8123f8ec3717d7135ff1d6 upstream.
    
    If amd_uncore_event_init() fails, return an error irrespective of the
    pmu_version. Setting hwc->config should be safe even if there is an
    error so use this opportunity to simplify the code.
    
    Closes: https://lore.kernel.org/all/[email protected]/
    
    Fixes: d6389d3ccc13 ("perf/x86/amd/uncore: Refactor uncore management")
    Reported-by: Dan Carpenter <[email protected]>
    Signed-off-by: Sandipan Das <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/076935e23a70335d33bd6e23308b75ae0ad35ba2.1765268667.git.sandipan.das@amd.com
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf/x86/amd: Check event before enable to avoid GPF [+ + +]

Author: George Kennedy <[email protected]>
Date:   Tue Oct 8 08:00:53 2024 -0500

    perf/x86/amd: Check event before enable to avoid GPF
    
    [ Upstream commit 866cf36bfee4fba6a492d2dcc5133f857e3446b0 ]
    
    On AMD machines cpuc->events[idx] can become NULL in a subtle race
    condition with NMI->throttle->x86_pmu_stop().
    
    Check event for NULL in amd_pmu_enable_all() before enable to avoid a GPF.
    This appears to be an AMD only issue.
    
    Syzkaller reported a GPF in amd_pmu_enable_all.
    
    INFO: NMI handler (perf_event_nmi_handler) took too long to run: 13.143
        msecs
    Oops: general protection fault, probably for non-canonical address
        0xdffffc0000000034: 0000  PREEMPT SMP KASAN NOPTI
    KASAN: null-ptr-deref in range [0x00000000000001a0-0x00000000000001a7]
    CPU: 0 UID: 0 PID: 328415 Comm: repro_36674776 Not tainted 6.12.0-rc1-syzk
    RIP: 0010:x86_pmu_enable_event (arch/x86/events/perf_event.h:1195
        arch/x86/events/core.c:1430)
    RSP: 0018:ffff888118009d60 EFLAGS: 00010012
    RAX: dffffc0000000000 RBX: 0000000000000000 RCX: 0000000000000000
    RDX: 0000000000000034 RSI: 0000000000000000 RDI: 00000000000001a0
    RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000000
    R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000002
    R13: ffff88811802a440 R14: ffff88811802a240 R15: ffff8881132d8601
    FS:  00007f097dfaa700(0000) GS:ffff888118000000(0000) GS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 00000000200001c0 CR3: 0000000103d56000 CR4: 00000000000006f0
    Call Trace:
     <IRQ>
    amd_pmu_enable_all (arch/x86/events/amd/core.c:760 (discriminator 2))
    x86_pmu_enable (arch/x86/events/core.c:1360)
    event_sched_out (kernel/events/core.c:1191 kernel/events/core.c:1186
        kernel/events/core.c:2346)
    __perf_remove_from_context (kernel/events/core.c:2435)
    event_function (kernel/events/core.c:259)
    remote_function (kernel/events/core.c:92 (discriminator 1)
        kernel/events/core.c:72 (discriminator 1))
    __flush_smp_call_function_queue (./arch/x86/include/asm/jump_label.h:27
        ./include/linux/jump_label.h:207 ./include/trace/events/csd.h:64
        kernel/smp.c:135 kernel/smp.c:540)
    __sysvec_call_function_single (./arch/x86/include/asm/jump_label.h:27
        ./include/linux/jump_label.h:207
        ./arch/x86/include/asm/trace/irq_vectors.h:99 arch/x86/kernel/smp.c:272)
    sysvec_call_function_single (arch/x86/kernel/smp.c:266 (discriminator 47)
        arch/x86/kernel/smp.c:266 (discriminator 47))
     </IRQ>
    
    Reported-by: syzkaller <[email protected]>
    Signed-off-by: George Kennedy <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf: arm_cspmu: fix error handling in arm_cspmu_impl_unregister() [+ + +]

Author: Ma Ke <[email protected]>
Date:   Wed Oct 22 19:53:25 2025 +0800

    perf: arm_cspmu: fix error handling in arm_cspmu_impl_unregister()
    
    commit 970e1e41805f0bd49dc234330a9390f4708d097d upstream.
    
    driver_find_device() calls get_device() to increment the reference
    count once a matching device is found. device_release_driver()
    releases the driver, but it does not decrease the reference count that
    was incremented by driver_find_device(). At the end of the loop, there
    is no put_device() to balance the reference count. To avoid reference
    count leakage, add put_device() to decrease the reference count.
    
    Found by code review.
    
    Cc: [email protected]
    Fixes: bfc653aa89cb ("perf: arm_cspmu: Separate Arm and vendor module")
    Signed-off-by: Ma Ke <[email protected]>
    Signed-off-by: Will Deacon <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

phy: broadcom: bcm63xx-usbh: fix section mismatches [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Oct 17 07:45:37 2025 +0200

    phy: broadcom: bcm63xx-usbh: fix section mismatches
    
    commit 356d1924b9a6bc2164ce2bf1fad147b0c37ae085 upstream.
    
    Platform drivers can be probed after their init sections have been
    discarded (e.g. on probe deferral or manual rebind through sysfs) so the
    probe function and match table must not live in init.
    
    Fixes: 783f6d3dcf35 ("phy: bcm63xx-usbh: Add BCM63xx USBH driver")
    Cc: [email protected]      # 5.9
    Cc: Álvaro Fernández Rojas <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Neil Armstrong <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Vinod Koul <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

pinctrl: renesas: rzg2l: Fix ISEL restore on resume [+ + +]

Author: Claudiu Beznea <[email protected]>
Date:   Wed Dec 17 20:00:00 2025 +0200

    pinctrl: renesas: rzg2l: Fix ISEL restore on resume
    
    commit 44bf66122c12ef6d3382a9b84b9be1802e5f0e95 upstream.
    
    Commit 1d2da79708cb ("pinctrl: renesas: rzg2l: Avoid configuring ISEL in
    gpio_irq_{en,dis}able*()") dropped the configuration of ISEL from
    struct irq_chip::{irq_enable, irq_disable} APIs and moved it to
    struct gpio_chip::irq::{child_to_parent_hwirq,
    child_irq_domain_ops::free} APIs to fix spurious IRQs.
    
    After commit 1d2da79708cb ("pinctrl: renesas: rzg2l: Avoid configuring ISEL
    in gpio_irq_{en,dis}able*()"), ISEL was no longer configured properly on
    resume. This is because the pinctrl resume code used
    struct irq_chip::irq_enable  (called from rzg2l_gpio_irq_restore()) to
    reconfigure the wakeup interrupts. Some drivers (e.g. Ethernet) may also
    reconfigure non-wakeup interrupts on resume through their own code,
    eventually calling struct irq_chip::irq_enable.
    
    Fix this by adding ISEL configuration back into the
    struct irq_chip::irq_enable API and on resume path for wakeup interrupts.
    
    As struct irq_chip::irq_enable needs now to lock to update the ISEL,
    convert the struct rzg2l_pinctrl::lock to a raw spinlock and replace the
    locking API calls with the raw variants. Otherwise the lockdep reports
    invalid wait context when probing the adv7511 module on RZ/G2L:
    
     [ BUG: Invalid wait context ]
     6.17.0-rc5-next-20250911-00001-gfcfac22533c9 #18 Not tainted
     -----------------------------
     (udev-worker)/165 is trying to lock:
     ffff00000e3664a8 (&pctrl->lock){....}-{3:3}, at: rzg2l_gpio_irq_enable+0x38/0x78
     other info that might help us debug this:
     context-{5:5}
     3 locks held by (udev-worker)/165:
     #0: ffff00000e890108 (&dev->mutex){....}-{4:4}, at: __driver_attach+0x90/0x1ac
     #1: ffff000011c07240 (request_class){+.+.}-{4:4}, at: __setup_irq+0xb4/0x6dc
     #2: ffff000011c070c8 (lock_class){....}-{2:2}, at: __setup_irq+0xdc/0x6dc
     stack backtrace:
     CPU: 1 UID: 0 PID: 165 Comm: (udev-worker) Not tainted 6.17.0-rc5-next-20250911-00001-gfcfac22533c9 #18 PREEMPT
     Hardware name: Renesas SMARC EVK based on r9a07g044l2 (DT)
     Call trace:
     show_stack+0x18/0x24 (C)
     dump_stack_lvl+0x90/0xd0
     dump_stack+0x18/0x24
     __lock_acquire+0xa14/0x20b4
     lock_acquire+0x1c8/0x354
     _raw_spin_lock_irqsave+0x60/0x88
     rzg2l_gpio_irq_enable+0x38/0x78
     irq_enable+0x40/0x8c
     __irq_startup+0x78/0xa4
     irq_startup+0x108/0x16c
     __setup_irq+0x3c0/0x6dc
     request_threaded_irq+0xec/0x1ac
     devm_request_threaded_irq+0x80/0x134
     adv7511_probe+0x928/0x9a4 [adv7511]
     i2c_device_probe+0x22c/0x3dc
     really_probe+0xbc/0x2a0
     __driver_probe_device+0x78/0x12c
     driver_probe_device+0x40/0x164
     __driver_attach+0x9c/0x1ac
     bus_for_each_dev+0x74/0xd0
     driver_attach+0x24/0x30
     bus_add_driver+0xe4/0x208
     driver_register+0x60/0x128
     i2c_register_driver+0x48/0xd0
     adv7511_init+0x5c/0x1000 [adv7511]
     do_one_initcall+0x64/0x30c
     do_init_module+0x58/0x23c
     load_module+0x1bcc/0x1d40
     init_module_from_file+0x88/0xc4
     idempotent_init_module+0x188/0x27c
     __arm64_sys_finit_module+0x68/0xac
     invoke_syscall+0x48/0x110
     el0_svc_common.constprop.0+0xc0/0xe0
     do_el0_svc+0x1c/0x28
     el0_svc+0x4c/0x160
     el0t_64_sync_handler+0xa0/0xe4
     el0t_64_sync+0x198/0x19c
    
    Having ISEL configuration back into the struct irq_chip::irq_enable API
    should be safe with respect to spurious IRQs, as in the probe case IRQs
    are enabled anyway in struct gpio_chip::irq::child_to_parent_hwirq. No
    spurious IRQs were detected on suspend/resume, boot, ethernet link
    insert/remove tests (executed on RZ/G3S). Boot, ethernet link
    insert/remove tests were also executed successfully on RZ/G2L.
    
    Fixes: 1d2da79708cb ("pinctrl: renesas: rzg2l: Avoid configuring ISEL in gpio_irq_{en,dis}able*(")
    Cc: [email protected]
    Signed-off-by: Claudiu Beznea <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    [claudiu.beznea:
     - in rzg2l_write_oen() kept v6.12 code and use
       raw_spin_lock_irqsave()/raw_spin_unlock_irqrestore()
     - in rzg2l_gpio_set() kept v6.12 code and use raw_spin_unlock_irqrestore()
     - in rzg2l_pinctrl_resume_noirq() kept v6.12 code
     - manually adjust rzg3s_oen_write(), rzv2h_oen_write() to use
       raw_spin_lock_irqsave()/raw_spin_unlock_irqrestore()]
    Signed-off-by: Claudiu Beznea <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

platform/chrome: cros_ec_ishtp: Fix UAF after unbinding driver [+ + +]

Author: Tzung-Bi Shih <[email protected]>
Date:   Fri Oct 31 03:39:00 2025 +0000

    platform/chrome: cros_ec_ishtp: Fix UAF after unbinding driver
    
    commit 944edca81e7aea15f83cf9a13a6ab67f711e8abd upstream.
    
    After unbinding the driver, another kthread `cros_ec_console_log_work`
    is still accessing the device, resulting an UAF and crash.
    
    The driver doesn't unregister the EC device in .remove() which should
    shutdown sub-devices synchronously.  Fix it.
    
    Fixes: 26a14267aff2 ("platform/chrome: Add ChromeOS EC ISHTP driver")
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Tzung-Bi Shih <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

platform/mellanox: mlxbf-pmc: Remove trailing whitespaces from event names [+ + +]

Author: Shravan Kumar Ramani <[email protected]>
Date:   Thu Dec 18 12:18:13 2025 +0000

    platform/mellanox: mlxbf-pmc: Remove trailing whitespaces from event names
    
    [ Upstream commit f13bce715d1600698310a4a7832f6a52499d5395 ]
    
    Some event names have trailing whitespaces at the end which causes programming
    of counters using the name for these specific events to fail and hence need to
    be removed.
    
    Fixes: 423c3361855c ("platform/mellanox: mlxbf-pmc: Add support for BlueField-3")
    Signed-off-by: Shravan Kumar Ramani <[email protected]>
    Reviewed-by: David Thompson <[email protected]>
    Link: https://patch.msgid.link/065cbae0717dcc1169681c4dbb1a6e050b8574b3.1766059953.git.shravankr@nvidia.com
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86/intel/hid: Add Dell Pro Rugged 10/12 tablet to VGBS DMI quirks [+ + +]

Author: Chia-Lin Kao (AceLan) <[email protected]>
Date:   Thu Nov 27 15:04:07 2025 +0800

    platform/x86/intel/hid: Add Dell Pro Rugged 10/12 tablet to VGBS DMI quirks
    
    [ Upstream commit b169e1733cadb614e87f69d7a5ae1b186c50d313 ]
    
    Dell Pro Rugged 10/12 tablets has a reliable VGBS method.
    If VGBS is not called on boot, the on-screen keyboard won't appear if the
    device is booted without a keyboard.
    
    Call VGBS on boot on thess devices to get the initial state of
    SW_TABLET_MODE in a reliable way.
    
    Signed-off-by: Chia-Lin Kao (AceLan) <[email protected]>
    Reviewed-by: Hans de Goede <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86: hp-bioscfg: Fix out-of-bounds array access in ACPI package parsing [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Fri Dec 26 19:42:05 2025 +0800

    platform/x86: hp-bioscfg: Fix out-of-bounds array access in ACPI package parsing
    
    [ Upstream commit e44c42c830b7ab36e3a3a86321c619f24def5206 ]
    
    The hp_populate_*_elements_from_package() functions in the hp-bioscfg
    driver contain out-of-bounds array access vulnerabilities.
    
    These functions parse ACPI packages into internal data structures using
    a for loop with index variable 'elem' that iterates through
    enum_obj/integer_obj/order_obj/password_obj/string_obj arrays.
    
    When processing multi-element fields like PREREQUISITES and
    ENUM_POSSIBLE_VALUES, these functions read multiple consecutive array
    elements using expressions like 'enum_obj[elem + reqs]' and
    'enum_obj[elem + pos_values]' within nested loops.
    
    The bug is that the bounds check only validated elem, but did not consider
    the additional offset when accessing elem + reqs or elem + pos_values.
    
    The fix changes the bounds check to validate the actual accessed index.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: e6c7b3e15559 ("platform/x86: hp-bioscfg: string-attributes")
    Signed-off-by: Junrui Luo <[email protected]>
    Link: https://patch.msgid.link/SYBPR01MB788173D7DD4EA2CB6383683DAFB0A@SYBPR01MB7881.ausprd01.prod.outlook.com
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86: ibm_rtl: fix EBDA signature search pointer arithmetic [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Fri Dec 19 16:30:29 2025 +0800

    platform/x86: ibm_rtl: fix EBDA signature search pointer arithmetic
    
    [ Upstream commit 15dd100349b8526cbdf2de0ce3e72e700eb6c208 ]
    
    The ibm_rtl_init() function searches for the signature but has a pointer
    arithmetic error. The loop counter suggests searching at 4-byte intervals
    but the implementation only advances by 1 byte per iteration.
    
    Fix by properly advancing the pointer by sizeof(unsigned int) bytes
    each iteration.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: 35f0ce032b0f ("IBM Real-Time "SMI Free" mode driver -v7")
    Signed-off-by: Junrui Luo <[email protected]>
    Link: https://patch.msgid.link/SYBPR01MB78812D887A92DE3802D0D06EAFA9A@SYBPR01MB7881.ausprd01.prod.outlook.com
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86: intel: chtwc_int33fe: don't dereference swnode args [+ + +]

Author: Bartosz Golaszewski <[email protected]>
Date:   Fri Nov 21 11:04:50 2025 +0100

    platform/x86: intel: chtwc_int33fe: don't dereference swnode args
    
    commit 527250cd9092461f1beac3e4180a4481bffa01b5 upstream.
    
    Members of struct software_node_ref_args should not be dereferenced
    directly but set using the provided macros. Commit d7cdbbc93c56
    ("software node: allow referencing firmware nodes") changed the name of
    the software node member and caused a build failure. Remove all direct
    dereferences of the ref struct as a fix.
    
    However, this driver also seems to abuse the software node interface by
    waiting for a node with an arbitrary name "intel-xhci-usb-sw" to appear
    in the system before setting up the reference for the I2C device, while
    the actual software node already exists in the intel-xhci-usb-role-switch
    module and should be used to set up a static reference. Add a FIXME for
    a future improvement.
    
    Fixes: d7cdbbc93c56 ("software node: allow referencing firmware nodes")
    Fixes: 53c24c2932e5 ("platform/x86: intel_cht_int33fe: use inline reference properties")
    Cc: [email protected]
    Reported-by: Stephen Rothwell <[email protected]>
    Closes: https://lore.kernel.org/all/[email protected]/
    Signed-off-by: Bartosz Golaszewski <[email protected]>
    Reviewed-by: Hans de Goede <[email protected]>
    Acked-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Philipp Zabel <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

platform/x86: msi-laptop: add missing sysfs_remove_group() [+ + +]

Author: Thomas Fourier <[email protected]>
Date:   Wed Dec 17 11:36:13 2025 +0100

    platform/x86: msi-laptop: add missing sysfs_remove_group()
    
    [ Upstream commit 1461209cf813b6ee6d40f29b96b544587df6d2b1 ]
    
    A sysfs group is created in msi_init() when old_ec_model is enabled, but
    never removed. Remove the msipf_old_attribute_group in that case.
    
    Fixes: 03696e51d75a ("msi-laptop: Disable brightness control for new EC")
    Signed-off-by: Thomas Fourier <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PM: runtime: Do not clear needs_force_resume with enabled runtime PM [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Mon Dec 15 15:21:34 2025 +0100

    PM: runtime: Do not clear needs_force_resume with enabled runtime PM
    
    commit 359afc8eb02a518fbdd0cbd462c8c2827c6cbec2 upstream.
    
    Commit 89d9cec3b1e9 ("PM: runtime: Clear power.needs_force_resume in
    pm_runtime_reinit()") added provisional clearing of power.needs_force_resume
    to pm_runtime_reinit(), but it is done unconditionally which is a
    mistake because pm_runtime_reinit() may race with driver probing
    and removal [1].
    
    To address this, notice that power.needs_force_resume should never
    be set when runtime PM is enabled and so it only needs to be cleared
    when runtime PM is disabled, and update pm_runtime_init() to only
    clear that flag when runtime PM is disabled.
    
    Fixes: 89d9cec3b1e9 ("PM: runtime: Clear power.needs_force_resume in pm_runtime_reinit()")
    Reported-by: Ed Tsai <[email protected]>
    Closes: https://lore.kernel.org/linux-pm/[email protected]/ [1]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Cc: 6.17+ <[email protected]> # 6.17+
    Reviewed-by: Ulf Hansson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

pmdomain: imx: Fix reference count leak in imx_gpc_probe() [+ + +]

Author: Wentao Liang <[email protected]>
Date:   Thu Dec 11 04:02:52 2025 +0000

    pmdomain: imx: Fix reference count leak in imx_gpc_probe()
    
    commit 73cb5f6eafb0ac7aea8cdeb8ff12981aa741d8fb upstream.
    
    of_get_child_by_name() returns a node pointer with refcount incremented.
    Use the __free() attribute to manage the pgc_node reference, ensuring
    automatic of_node_put() cleanup when pgc_node goes out of scope.
    
    This eliminates the need for explicit error handling paths and avoids
    reference count leaks.
    
    Fixes: 721cabf6c660 ("soc: imx: move PGC handling to a new GPC driver")
    Cc: [email protected]
    Signed-off-by: Wentao Liang <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc, mm: Fix mprotect on book3s 32-bit [+ + +]

Author: Dave Vasilevsky <[email protected]>
Date:   Sun Nov 16 01:40:46 2025 -0500

    powerpc, mm: Fix mprotect on book3s 32-bit
    
    commit 78fc63ffa7813e33681839bb33826c24195f0eb7 upstream.
    
    On 32-bit book3s with hash-MMUs, tlb_flush() was a no-op. This was
    unnoticed because all uses until recently were for unmaps, and thus
    handled by __tlb_remove_tlb_entry().
    
    After commit 4a18419f71cd ("mm/mprotect: use mmu_gather") in kernel 5.19,
    tlb_gather_mmu() started being used for mprotect as well. This caused
    mprotect to simply not work on these machines:
    
      int *ptr = mmap(NULL, 4096, PROT_READ|PROT_WRITE,
                      MAP_PRIVATE|MAP_ANONYMOUS, -1, 0);
      *ptr = 1; // force HPTE to be created
      mprotect(ptr, 4096, PROT_READ);
      *ptr = 2; // should segfault, but succeeds
    
    Fixed by making tlb_flush() actually flush TLB pages. This finally
    agrees with the behaviour of boot3s64's tlb_flush().
    
    Fixes: 4a18419f71cd ("mm/mprotect: use mmu_gather")
    Cc: [email protected]
    Reviewed-by: Christophe Leroy <[email protected]>
    Reviewed-by: Ritesh Harjani (IBM) <[email protected]>
    Signed-off-by: Dave Vasilevsky <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc/64s/slb: Fix SLB multihit issue during SLB preload [+ + +]

Author: Donet Tom <[email protected]>
Date:   Thu Oct 30 20:27:26 2025 +0530

    powerpc/64s/slb: Fix SLB multihit issue during SLB preload
    
    commit 00312419f0863964625d6dcda8183f96849412c6 upstream.
    
    On systems using the hash MMU, there is a software SLB preload cache that
    mirrors the entries loaded into the hardware SLB buffer. This preload
    cache is subject to periodic eviction — typically after every 256 context
    switches — to remove old entry.
    
    To optimize performance, the kernel skips switch_mmu_context() in
    switch_mm_irqs_off() when the prev and next mm_struct are the same.
    However, on hash MMU systems, this can lead to inconsistencies between
    the hardware SLB and the software preload cache.
    
    If an SLB entry for a process is evicted from the software cache on one
    CPU, and the same process later runs on another CPU without executing
    switch_mmu_context(), the hardware SLB may retain stale entries. If the
    kernel then attempts to reload that entry, it can trigger an SLB
    multi-hit error.
    
    The following timeline shows how stale SLB entries are created and can
    cause a multi-hit error when a process moves between CPUs without a
    MMU context switch.
    
    CPU 0                                   CPU 1
    -----                                    -----
    Process P
    exec                                    swapper/1
     load_elf_binary
      begin_new_exc
        activate_mm
         switch_mm_irqs_off
          switch_mmu_context
           switch_slb
           /*
            * This invalidates all
            * the entries in the HW
            * and setup the new HW
            * SLB entries as per the
            * preload cache.
            */
    context_switch
    sched_migrate_task migrates process P to cpu-1
    
    Process swapper/0                       context switch (to process P)
    (uses mm_struct of Process P)           switch_mm_irqs_off()
                                             switch_slb
                                               load_slb++
                                                /*
                                                * load_slb becomes 0 here
                                                * and we evict an entry from
                                                * the preload cache with
                                                * preload_age(). We still
                                                * keep HW SLB and preload
                                                * cache in sync, that is
                                                * because all HW SLB entries
                                                * anyways gets evicted in
                                                * switch_slb during SLBIA.
                                                * We then only add those
                                                * entries back in HW SLB,
                                                * which are currently
                                                * present in preload_cache
                                                * (after eviction).
                                                */
                                            load_elf_binary continues...
                                             setup_new_exec()
                                              slb_setup_new_exec()
    
                                            sched_switch event
                                            sched_migrate_task migrates
                                            process P to cpu-0
    
    context_switch from swapper/0 to Process P
     switch_mm_irqs_off()
      /*
       * Since both prev and next mm struct are same we don't call
       * switch_mmu_context(). This will cause the HW SLB and SW preload
       * cache to go out of sync in preload_new_slb_context. Because there
       * was an SLB entry which was evicted from both HW and preload cache
       * on cpu-1. Now later in preload_new_slb_context(), when we will try
       * to add the same preload entry again, we will add this to the SW
       * preload cache and then will add it to the HW SLB. Since on cpu-0
       * this entry was never invalidated, hence adding this entry to the HW
       * SLB will cause a SLB multi-hit error.
       */
    load_elf_binary continues...
     START_THREAD
      start_thread
       preload_new_slb_context
       /*
        * This tries to add a new EA to preload cache which was earlier
        * evicted from both cpu-1 HW SLB and preload cache. This caused the
        * HW SLB of cpu-0 to go out of sync with the SW preload cache. The
        * reason for this was, that when we context switched back on CPU-0,
        * we should have ideally called switch_mmu_context() which will
        * bring the HW SLB entries on CPU-0 in sync with SW preload cache
        * entries by setting up the mmu context properly. But we didn't do
        * that since the prev mm_struct running on cpu-0 was same as the
        * next mm_struct (which is true for swapper / kernel threads). So
        * now when we try to add this new entry into the HW SLB of cpu-0,
        * we hit a SLB multi-hit error.
        */
    
    WARNING: CPU: 0 PID: 1810970 at arch/powerpc/mm/book3s64/slb.c:62
    assert_slb_presence+0x2c/0x50(48 results) 02:47:29 [20157/42149]
    Modules linked in:
    CPU: 0 UID: 0 PID: 1810970 Comm: dd Not tainted 6.16.0-rc3-dirty #12
    VOLUNTARY
    Hardware name: IBM pSeries (emulated by qemu) POWER8 (architected)
    0x4d0200 0xf000004 of:SLOF,HEAD hv:linux,kvm pSeries
    NIP:  c00000000015426c LR: c0000000001543b4 CTR: 0000000000000000
    REGS: c0000000497c77e0 TRAP: 0700   Not tainted  (6.16.0-rc3-dirty)
    MSR:  8000000002823033 <SF,VEC,VSX,FP,ME,IR,DR,RI,LE>  CR: 28888482  XER: 00000000
    CFAR: c0000000001543b0 IRQMASK: 3
    <...>
    NIP [c00000000015426c] assert_slb_presence+0x2c/0x50
    LR [c0000000001543b4] slb_insert_entry+0x124/0x390
    Call Trace:
      0x7fffceb5ffff (unreliable)
      preload_new_slb_context+0x100/0x1a0
      start_thread+0x26c/0x420
      load_elf_binary+0x1b04/0x1c40
      bprm_execve+0x358/0x680
      do_execveat_common+0x1f8/0x240
      sys_execve+0x58/0x70
      system_call_exception+0x114/0x300
      system_call_common+0x160/0x2c4
    
    >From the above analysis, during early exec the hardware SLB is cleared,
    and entries from the software preload cache are reloaded into hardware
    by switch_slb. However, preload_new_slb_context and slb_setup_new_exec
    also attempt to load some of the same entries, which can trigger a
    multi-hit. In most cases, these additional preloads simply hit existing
    entries and add nothing new. Removing these functions avoids redundant
    preloads and eliminates the multi-hit issue. This patch removes these
    two functions.
    
    We tested process switching performance using the context_switch
    benchmark on POWER9/hash, and observed no regression.
    
    Without this patch: 129041 ops/sec
    With this patch:    129341 ops/sec
    
    We also measured SLB faults during boot, and the counts are essentially
    the same with and without this patch.
    
    SLB faults without this patch: 19727
    SLB faults with this patch:    19786
    
    Fixes: 5434ae74629a ("powerpc/64s/hash: Add a SLB preload cache")
    cc: [email protected]
    Suggested-by: Nicholas Piggin <[email protected]>
    Signed-off-by: Donet Tom <[email protected]>
    Signed-off-by: Ritesh Harjani (IBM) <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/0ac694ae683494fe8cadbd911a1a5018d5d3c541.1761834163.git.ritesh.list@gmail.com
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc/addnote: Fix overflow on 32-bit builds [+ + +]

Author: Ben Collins <[email protected]>
Date:   Mon Apr 21 22:31:13 2025 -0400

    powerpc/addnote: Fix overflow on 32-bit builds
    
    [ Upstream commit 825ce89a3ef17f84cf2c0eacfa6b8dc9fd11d13f ]
    
    The PUT_64[LB]E() macros need to cast the value to unsigned long long
    like the GET_64[LB]E() macros. Caused lots of warnings when compiled
    on 32-bit, and clobbered addresses (36-bit P4080).
    
    Signed-off-by: Ben Collins <[email protected]>
    Reviewed-by: Christophe Leroy <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/2025042122-mustard-wrasse-694572@boujee-and-buff
    Signed-off-by: Sasha Levin <[email protected]>

powerpc/kexec: Enable SMT before waking offline CPUs [+ + +]

Author: Nysal Jan K.A. <[email protected]>
Date:   Tue Oct 28 16:25:12 2025 +0530

    powerpc/kexec: Enable SMT before waking offline CPUs
    
    commit c2296a1e42418556efbeb5636c4fa6aa6106713a upstream.
    
    If SMT is disabled or a partial SMT state is enabled, when a new kernel
    image is loaded for kexec, on reboot the following warning is observed:
    
    kexec: Waking offline cpu 228.
    WARNING: CPU: 0 PID: 9062 at arch/powerpc/kexec/core_64.c:223 kexec_prepare_cpus+0x1b0/0x1bc
    [snip]
     NIP kexec_prepare_cpus+0x1b0/0x1bc
     LR  kexec_prepare_cpus+0x1a0/0x1bc
     Call Trace:
      kexec_prepare_cpus+0x1a0/0x1bc (unreliable)
      default_machine_kexec+0x160/0x19c
      machine_kexec+0x80/0x88
      kernel_kexec+0xd0/0x118
      __do_sys_reboot+0x210/0x2c4
      system_call_exception+0x124/0x320
      system_call_vectored_common+0x15c/0x2ec
    
    This occurs as add_cpu() fails due to cpu_bootable() returning false for
    CPUs that fail the cpu_smt_thread_allowed() check or non primary
    threads if SMT is disabled.
    
    Fix the issue by enabling SMT and resetting the number of SMT threads to
    the number of threads per core, before attempting to wake up all present
    CPUs.
    
    Fixes: 38253464bc82 ("cpu/SMT: Create topology_smt_thread_allowed()")
    Reported-by: Sachin P Bappalige <[email protected]>
    Cc: [email protected] # v6.6+
    Reviewed-by: Srikar Dronamraju <[email protected]>
    Signed-off-by: Nysal Jan K.A. <[email protected]>
    Tested-by: Samir M <[email protected]>
    Reviewed-by: Sourabh Jain <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc/pseries/cmm: adjust BALLOON_MIGRATE when migrating pages [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Mon Jan 5 12:42:05 2026 -0500

    powerpc/pseries/cmm: adjust BALLOON_MIGRATE when migrating pages
    
    [ Upstream commit 0da2ba35c0d532ca0fe7af698b17d74c4d084b9a ]
    
    Let's properly adjust BALLOON_MIGRATE like the other drivers.
    
    Note that the INFLATE/DEFLATE events are triggered from the core when
    enqueueing/dequeueing pages.
    
    This was found by code inspection.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: fe030c9b85e6 ("powerpc/pseries/cmm: Implement balloon compaction")
    Signed-off-by: David Hildenbrand <[email protected]>
    Reviewed-by: Ritesh Harjani (IBM) <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc/pseries/cmm: call balloon_devinfo_init() also without CONFIG_BALLOON_COMPACTION [+ + +]

Author: David Hildenbrand <[email protected]>
Date:   Tue Oct 21 12:06:05 2025 +0200

    powerpc/pseries/cmm: call balloon_devinfo_init() also without CONFIG_BALLOON_COMPACTION
    
    commit fc6bcf9ac4de76f5e7bcd020b3c0a86faff3f2d5 upstream.
    
    Patch series "powerpc/pseries/cmm: two smaller fixes".
    
    Two smaller fixes identified while doing a bigger rework.
    
    
    This patch (of 2):
    
    We always have to initialize the balloon_dev_info, even when compaction is
    not configured in: otherwise the containing list and the lock are left
    uninitialized.
    
    Likely not many such configs exist in practice, but let's CC stable to
    be sure.
    
    This was found by code inspection.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: fe030c9b85e6 ("powerpc/pseries/cmm: Implement balloon compaction")
    Signed-off-by: David Hildenbrand <[email protected]>
    Reviewed-by: Ritesh Harjani (IBM) <[email protected]>
    Cc: Christophe Leroy <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

powerpc: Add reloc_offset() to font bitmap pointer used for bootx_printf() [+ + +]

Author: Finn Thain <[email protected]>
Date:   Mon Nov 10 10:30:22 2025 +1100

    powerpc: Add reloc_offset() to font bitmap pointer used for bootx_printf()
    
    commit b94b73567561642323617155bf4ee24ef0d258fe upstream.
    
    Since Linux v6.7, booting using BootX on an Old World PowerMac produces
    an early crash. Stan Johnson writes, "the symptoms are that the screen
    goes blank and the backlight stays on, and the system freezes (Linux
    doesn't boot)."
    
    Further testing revealed that the failure can be avoided by disabling
    CONFIG_BOOTX_TEXT. Bisection revealed that the regression was caused by
    a change to the font bitmap pointer that's used when btext_init() begins
    painting characters on the display, early in the boot process.
    
    Christophe Leroy explains, "before kernel text is relocated to its final
    location ... data is addressed with an offset which is added to the
    Global Offset Table (GOT) entries at the start of bootx_init()
    by function reloc_got2(). But the pointers that are located inside a
    structure are not referenced in the GOT and are therefore not updated by
    reloc_got2(). It is therefore needed to apply the offset manually by using
    PTRRELOC() macro."
    
    Cc: [email protected]
    Link: https://lists.debian.org/debian-powerpc/2025/10/msg00111.html
    Link: https://lore.kernel.org/linuxppc-dev/[email protected]/
    Reported-by: Cedar Maxwell <[email protected]>
    Closes: https://lists.debian.org/debian-powerpc/2025/09/msg00031.html
    Bisected-by: Stan Johnson <[email protected]>
    Tested-by: Stan Johnson <[email protected]>
    Fixes: 0ebc7feae79a ("powerpc: Use shared font data")
    Suggested-by: Christophe Leroy <[email protected]>
    Signed-off-by: Finn Thain <[email protected]>
    Reviewed-by: Christophe Leroy <[email protected]>
    Signed-off-by: Madhavan Srinivasan <[email protected]>
    Link: https://patch.msgid.link/22b3b247425a052b079ab84da926706b3702c2c7.1762731022.git.fthain@linux-m68k.org
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

r8169: fix RTL8117 Wake-on-Lan in DASH mode [+ + +]

Author: René Rebe <[email protected]>
Date:   Tue Dec 2 19:41:37 2025 +0100

    r8169: fix RTL8117 Wake-on-Lan in DASH mode
    
    commit dd75c723ef566f7f009c047f47e0eee95fe348ab upstream.
    
    Wake-on-Lan does currently not work for r8169 in DASH mode, e.g. the
    ASUS Pro WS X570-ACE with RTL8168fp/RTL8117.
    
    Fix by not returning early in rtl_prepare_power_down when dash_enabled.
    While this fixes WoL, it still kills the OOB RTL8117 remote management
    BMC connection. Fix by not calling rtl8168_driver_stop if WoL is enabled.
    
    Fixes: 065c27c184d6 ("r8169: phy power ops")
    Signed-off-by: René Rebe <[email protected]>
    Cc: [email protected]
    Reviewed-by: Heiner Kallweit <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

RDMA/bnxt_re: fix dma_free_coherent() pointer [+ + +]

Author: Thomas Fourier <[email protected]>
Date:   Tue Dec 30 09:51:21 2025 +0100

    RDMA/bnxt_re: fix dma_free_coherent() pointer
    
    [ Upstream commit fcd431a9627f272b4c0bec445eba365fe2232a94 ]
    
    The dma_alloc_coherent() allocates a dma-mapped buffer, pbl->pg_arr[i].
    The dma_free_coherent() should pass the same buffer to
    dma_free_coherent() and not page-aligned.
    
    Fixes: 1ac5a4047975 ("RDMA/bnxt_re: Add bnxt_re RoCE driver")
    Signed-off-by: Thomas Fourier <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/bnxt_re: Fix IB_SEND_IP_CSUM handling in post_send [+ + +]

Author: Alok Tiwari <[email protected]>
Date:   Fri Dec 19 01:32:57 2025 -0800

    RDMA/bnxt_re: Fix IB_SEND_IP_CSUM handling in post_send
    
    [ Upstream commit f01765a2361323e78e3d91b1cb1d5527a83c5cf7 ]
    
    The bnxt_re SEND path checks wr->send_flags to enable features such as
    IP checksum offload. However, send_flags is a bitmask and may contain
    multiple flags (e.g. IB_SEND_SIGNALED | IB_SEND_IP_CSUM), while the
    existing code uses a switch() statement that only matches when
    send_flags is exactly IB_SEND_IP_CSUM.
    
    As a result, checksum offload is not enabled when additional SEND
    flags are present.
    
    Replace the switch() with a bitmask test:
    
        if (wr->send_flags & IB_SEND_IP_CSUM)
    
    This ensures IP checksum offload is enabled correctly when multiple
    SEND flags are used.
    
    Fixes: 1ac5a4047975 ("RDMA/bnxt_re: Add bnxt_re RoCE driver")
    Signed-off-by: Alok Tiwari <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Kalesh AP <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/bnxt_re: Fix incorrect BAR check in bnxt_qplib_map_creq_db() [+ + +]

Author: Alok Tiwari <[email protected]>
Date:   Wed Dec 17 02:01:41 2025 -0800

    RDMA/bnxt_re: Fix incorrect BAR check in bnxt_qplib_map_creq_db()
    
    [ Upstream commit 145a417a39d7efbc881f52e829817376972b278c ]
    
    RCFW_COMM_CONS_PCI_BAR_REGION is defined as BAR 2, so checking
    !creq_db->reg.bar_id is incorrect and always false.
    
    pci_resource_start() returns the BAR base address, and a value of 0
    indicates that the BAR is unassigned. Update the condition to test
    bar_base == 0 instead.
    
    This ensures the driver detects and logs an error for an unassigned
    RCFW communication BAR.
    
    Fixes: cee0c7bba486 ("RDMA/bnxt_re: Refactor command queue management code")
    Signed-off-by: Alok Tiwari <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Kalesh AP <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/bnxt_re: Fix to use correct page size for PDE table [+ + +]

Author: Kalesh AP <[email protected]>
Date:   Tue Dec 23 18:48:55 2025 +0530

    RDMA/bnxt_re: Fix to use correct page size for PDE table
    
    [ Upstream commit 3d70e0fb0f289b0c778041c5bb04d099e1aa7c1c ]
    
    In bnxt_qplib_alloc_init_hwq(), while allocating memory for PDE table
    driver incorrectly is using the "pg_size" value passed to the function.
    Fixed to use the right value 4K. Also, fixed the allocation size for
    PBL table.
    
    Fixes: 0c4dcd602817 ("RDMA/bnxt_re: Refactor hardware queue memory allocation")
    Signed-off-by: Damodharam Ammepalli <[email protected]>
    Signed-off-by: Kalesh AP <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Selvin Xavier <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/cm: Fix leaking the multicast GID table reference [+ + +]

Author: Jason Gunthorpe <[email protected]>
Date:   Fri Nov 28 20:53:21 2025 -0400

    RDMA/cm: Fix leaking the multicast GID table reference
    
    commit 57f3cb6c84159d12ba343574df2115fb18dd83ca upstream.
    
    If the CM ID is destroyed while the CM event for multicast creating is
    still queued the cancel_work_sync() will prevent the work from running
    which also prevents destroying the ah_attr. This leaks a refcount and
    triggers a WARN:
    
       GID entry ref leak for dev syz1 index 2 ref=573
       WARNING: CPU: 1 PID: 655 at drivers/infiniband/core/cache.c:809 release_gid_table drivers/infiniband/core/cache.c:806 [inline]
       WARNING: CPU: 1 PID: 655 at drivers/infiniband/core/cache.c:809 gid_table_release_one+0x284/0x3cc drivers/infiniband/core/cache.c:886
    
    Destroy the ah_attr after canceling the work, it is safe to call this
    twice.
    
    Link: https://patch.msgid.link/r/[email protected]
    Cc: [email protected]
    Fixes: fe454dc31e84 ("RDMA/ucma: Fix use-after-free bug in ucma_create_uevent")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

RDMA/core: always drop device refcount in ib_del_sub_device_and_put() [+ + +]

Author: Tetsuo Handa <[email protected]>
Date:   Sat Dec 20 11:11:33 2025 +0900

    RDMA/core: always drop device refcount in ib_del_sub_device_and_put()
    
    [ Upstream commit fa3c411d21ebc26ffd175c7256c37cefa35020aa ]
    
    Since nldev_deldev() (introduced by commit 060c642b2ab8 ("RDMA/nldev: Add
    support to add/delete a sub IB device through netlink") grabs a reference
    using ib_device_get_by_index() before calling ib_del_sub_device_and_put(),
    we need to drop that reference before returning -EOPNOTSUPP error.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=881d65229ca4f9ae8c84
    Fixes: bca51197620a ("RDMA/core: Support IB sub device with type "SMI"")
    Signed-off-by: Tetsuo Handa <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Parav Pandit <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/core: Check for the presence of LS_NLA_TYPE_DGID correctly [+ + +]

Author: Jason Gunthorpe <[email protected]>
Date:   Fri Nov 28 13:37:28 2025 -0400

    RDMA/core: Check for the presence of LS_NLA_TYPE_DGID correctly
    
    commit a7b8e876e0ef0232b8076972c57ce9a7286b47ca upstream.
    
    The netlink response for RDMA_NL_LS_OP_IP_RESOLVE should always have a
    LS_NLA_TYPE_DGID attribute, it is invalid if it does not.
    
    Use the nl parsing logic properly and call nla_parse_deprecated() to fill
    the nlattrs array and then directly index that array to get the data for
    the DGID. Just fail if it is NULL.
    
    Remove the for loop searching for the nla, and squash the validation and
    parsing into one function.
    
    Fixes an uninitialized read from the stack triggered by userspace if it
    does not provide the DGID to a kernel initiated RDMA_NL_LS_OP_IP_RESOLVE
    query.
    
        BUG: KMSAN: uninit-value in hex_byte_pack include/linux/hex.h:13 [inline]
        BUG: KMSAN: uninit-value in ip6_string+0xef4/0x13a0 lib/vsprintf.c:1490
         hex_byte_pack include/linux/hex.h:13 [inline]
         ip6_string+0xef4/0x13a0 lib/vsprintf.c:1490
         ip6_addr_string+0x18a/0x3e0 lib/vsprintf.c:1509
         ip_addr_string+0x245/0xee0 lib/vsprintf.c:1633
         pointer+0xc09/0x1bd0 lib/vsprintf.c:2542
         vsnprintf+0xf8a/0x1bd0 lib/vsprintf.c:2930
         vprintk_store+0x3ae/0x1530 kernel/printk/printk.c:2279
         vprintk_emit+0x307/0xcd0 kernel/printk/printk.c:2426
         vprintk_default+0x3f/0x50 kernel/printk/printk.c:2465
         vprintk+0x36/0x50 kernel/printk/printk_safe.c:82
         _printk+0x17e/0x1b0 kernel/printk/printk.c:2475
         ib_nl_process_good_ip_rsep drivers/infiniband/core/addr.c:128 [inline]
         ib_nl_handle_ip_res_resp+0x963/0x9d0 drivers/infiniband/core/addr.c:141
         rdma_nl_rcv_msg drivers/infiniband/core/netlink.c:-1 [inline]
         rdma_nl_rcv_skb drivers/infiniband/core/netlink.c:239 [inline]
         rdma_nl_rcv+0xefa/0x11c0 drivers/infiniband/core/netlink.c:259
         netlink_unicast_kernel net/netlink/af_netlink.c:1320 [inline]
         netlink_unicast+0xf04/0x12b0 net/netlink/af_netlink.c:1346
         netlink_sendmsg+0x10b3/0x1250 net/netlink/af_netlink.c:1896
         sock_sendmsg_nosec net/socket.c:714 [inline]
         __sock_sendmsg+0x333/0x3d0 net/socket.c:729
         ____sys_sendmsg+0x7e0/0xd80 net/socket.c:2617
         ___sys_sendmsg+0x271/0x3b0 net/socket.c:2671
         __sys_sendmsg+0x1aa/0x300 net/socket.c:2703
         __compat_sys_sendmsg net/compat.c:346 [inline]
         __do_compat_sys_sendmsg net/compat.c:353 [inline]
         __se_compat_sys_sendmsg net/compat.c:350 [inline]
         __ia32_compat_sys_sendmsg+0xa4/0x100 net/compat.c:350
         ia32_sys_call+0x3f6c/0x4310 arch/x86/include/generated/asm/syscalls_32.h:371
         do_syscall_32_irqs_on arch/x86/entry/syscall_32.c:83 [inline]
         __do_fast_syscall_32+0xb0/0x150 arch/x86/entry/syscall_32.c:306
         do_fast_syscall_32+0x38/0x80 arch/x86/entry/syscall_32.c:331
         do_SYSENTER_32+0x1f/0x30 arch/x86/entry/syscall_32.c:3
    
    Link: https://patch.msgid.link/r/[email protected]
    Cc: [email protected]
    Fixes: ae43f8286730 ("IB/core: Add IP to GID netlink offload")
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

RDMA/core: Fix logic error in ib_get_gids_from_rdma_hdr() [+ + +]

Author: Jang Ingyu <[email protected]>
Date:   Fri Dec 19 13:15:08 2025 +0900

    RDMA/core: Fix logic error in ib_get_gids_from_rdma_hdr()
    
    [ Upstream commit 8aaa848eaddd9ef8680fc6aafbd3a0646da5df40 ]
    
    Fix missing comparison operator for RDMA_NETWORK_ROCE_V1 in the
    conditional statement. The constant was used directly instead of
    being compared with net_type, causing the condition to always
    evaluate to true.
    
    Fixes: 1c15b4f2a42f ("RDMA/core: Modify enum ib_gid_type and enum rdma_network_type")
    Signed-off-by: Jang Ingyu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/efa: Remove possible negative shift [+ + +]

Author: Michael Margolin <[email protected]>
Date:   Wed Dec 10 17:36:56 2025 +0000

    RDMA/efa: Remove possible negative shift
    
    [ Upstream commit 85463eb6a46caf2f1e0e1a6d0731f2f3bab17780 ]
    
    The page size used for device might in some cases be smaller than
    PAGE_SIZE what results in a negative shift when calculating the number of
    host pages in PAGE_SIZE for a debug log. Remove the debug line together
    with the calculation.
    
    Fixes: 40909f664d27 ("RDMA/efa: Add EFA verbs implementation")
    Link: https://patch.msgid.link/r/[email protected]
    Reviewed-by: Tom Sela <[email protected]>
    Reviewed-by: Yonatan Nachum <[email protected]>
    Signed-off-by: Michael Margolin <[email protected]>
    Reviewed-by: Gal Pressman <[email protected]>
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/irdma: avoid invalid read in irdma_net_event [+ + +]

Author: Michal Schmidt <[email protected]>
Date:   Thu Nov 27 15:31:50 2025 +0100

    RDMA/irdma: avoid invalid read in irdma_net_event
    
    [ Upstream commit 6f05611728e9d0ab024832a4f1abb74a5f5d0bb0 ]
    
    irdma_net_event() should not dereference anything from "neigh" (alias
    "ptr") until it has checked that the event is NETEVENT_NEIGH_UPDATE.
    Other events come with different structures pointed to by "ptr" and they
    may be smaller than struct neighbour.
    
    Move the read of neigh->dev under the NETEVENT_NEIGH_UPDATE case.
    
    The bug is mostly harmless, but it triggers KASAN on debug kernels:
    
     BUG: KASAN: stack-out-of-bounds in irdma_net_event+0x32e/0x3b0 [irdma]
     Read of size 8 at addr ffffc900075e07f0 by task kworker/27:2/542554
    
     CPU: 27 PID: 542554 Comm: kworker/27:2 Kdump: loaded Not tainted 5.14.0-630.el9.x86_64+debug #1
     Hardware name: [...]
     Workqueue: events rt6_probe_deferred
     Call Trace:
      <IRQ>
      dump_stack_lvl+0x60/0xb0
      print_address_description.constprop.0+0x2c/0x3f0
      print_report+0xb4/0x270
      kasan_report+0x92/0xc0
      irdma_net_event+0x32e/0x3b0 [irdma]
      notifier_call_chain+0x9e/0x180
      atomic_notifier_call_chain+0x5c/0x110
      rt6_do_redirect+0xb91/0x1080
      tcp_v6_err+0xe9b/0x13e0
      icmpv6_notify+0x2b2/0x630
      ndisc_redirect_rcv+0x328/0x530
      icmpv6_rcv+0xc16/0x1360
      ip6_protocol_deliver_rcu+0xb84/0x12e0
      ip6_input_finish+0x117/0x240
      ip6_input+0xc4/0x370
      ipv6_rcv+0x420/0x7d0
      __netif_receive_skb_one_core+0x118/0x1b0
      process_backlog+0xd1/0x5d0
      __napi_poll.constprop.0+0xa3/0x440
      net_rx_action+0x78a/0xba0
      handle_softirqs+0x2d4/0x9c0
      do_softirq+0xad/0xe0
      </IRQ>
    
    Fixes: 915cc7ac0f8e ("RDMA/irdma: Add miscellaneous utility definitions")
    Link: https://patch.msgid.link/r/[email protected]
    Signed-off-by: Michal Schmidt <[email protected]>
    Signed-off-by: Jason Gunthorpe <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/rtrs: Fix clt_path::max_pages_per_mr calculation [+ + +]

Author: Honggang LI <[email protected]>
Date:   Mon Dec 29 10:56:17 2025 +0800

    RDMA/rtrs: Fix clt_path::max_pages_per_mr calculation
    
    [ Upstream commit 43bd09d5b750f700499ae8ec45fd41a4c48673e6 ]
    
    If device max_mr_size bits in the range [mr_page_shift+31:mr_page_shift]
    are zero, the `min3` function will set clt_path::max_pages_per_mr to
    zero.
    
    `alloc_path_reqs` will pass zero, which is invalid, as the third parameter
    to `ib_alloc_mr`.
    
    Fixes: 6a98d71daea1 ("RDMA/rtrs: client: main functionality")
    Signed-off-by: Honggang LI <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

reset: fix BIT macro reference [+ + +]

Author: Encrow Thorne <[email protected]>
Date:   Mon Nov 10 14:10:37 2025 +0800

    reset: fix BIT macro reference
    
    [ Upstream commit f3d8b64ee46c9b4b0b82b1a4642027728bac95b8 ]
    
    RESET_CONTROL_FLAGS_BIT_* macros use BIT(), but reset.h does not
    include bits.h. This causes compilation errors when including
    reset.h standalone.
    
    Include bits.h to make reset.h self-contained.
    
    Suggested-by: Troy Mitchell <[email protected]>
    Reviewed-by: Troy Mitchell <[email protected]>
    Reviewed-by: Philipp Zabel <[email protected]>
    Signed-off-by: Encrow Thorne <[email protected]>
    Signed-off-by: Philipp Zabel <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Revert "drm/amd/display: Fix pbn to kbps Conversion" [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Tue Dec 9 11:14:47 2025 -0600

    Revert "drm/amd/display: Fix pbn to kbps Conversion"
    
    commit 72e24456a54fe04710d89626cc5a88703e2f6202 upstream.
    
    Deeply daisy chained DP/MST displays are no longer able to light
    up. This reverts commit e0dec00f3d05 ("drm/amd/display: Fix pbn
    to kbps Conversion")
    
    Cc: Jerry Zuo <[email protected]>
    Reported-by: [email protected]
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/4756
    Signed-off-by: Mario Limonciello <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit e1c94109c76e8a77a21531bd53f6c63356c81158)
    Cc: [email protected] # 6.17+
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

Revert "drm/amd: Skip power ungate during suspend for VPE" [+ + +]

Author: Mario Limonciello (AMD) <[email protected]>
Date:   Sat Nov 29 19:46:31 2025 -0600

    Revert "drm/amd: Skip power ungate during suspend for VPE"
    
    commit 3925683515e93844be204381d2d5a1df5de34f31 upstream.
    
    Skipping power ungate exposed some scenarios that will fail
    like below:
    
    ```
    amdgpu: Register(0) [regVPEC_QUEUE_RESET_REQ] failed to reach value 0x00000000 != 0x00000001n
    amdgpu 0000:c1:00.0: amdgpu: VPE queue reset failed
    ...
    amdgpu: [drm] *ERROR* wait_for_completion_timeout timeout!
    ```
    
    The underlying s2idle issue that prompted this commit is going to
    be fixed in BIOS.
    This reverts commit 2a6c826cfeedd7714611ac115371a959ead55bda.
    
    Fixes: 2a6c826cfeed ("drm/amd: Skip power ungate during suspend for VPE")
    Cc: [email protected]
    Signed-off-by: Mario Limonciello (AMD) <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Reported-by: Konstantin <[email protected]>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=220812
    Reported-by: Matthew Schwartz <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

rpmsg: glink: fix rpmsg device leak [+ + +]

Author: Srinivas Kandagatla <[email protected]>
Date:   Fri Aug 22 11:00:42 2025 +0100

    rpmsg: glink: fix rpmsg device leak
    
    commit a53e356df548f6b0e82529ef3cc6070f42622189 upstream.
    
    While testing rpmsg-char interface it was noticed that duplicate sysfs
    entries are getting created and below warning is noticed.
    
    Reason for this is that we are leaking rpmsg device pointer, setting it
    null without actually unregistering device.
    Any further attempts to unregister fail because rpdev is NULL,
    resulting in a leak.
    
    Fix this by unregistering rpmsg device before removing its reference
    from rpmsg channel.
    
    sysfs: cannot create duplicate filename '/devices/platform/soc@0/3700000.remot
    eproc/remoteproc/remoteproc1/3700000.remoteproc:glink-edge/3700000.remoteproc:
    glink-edge.adsp_apps.-1.-1'
    [  114.115347] CPU: 0 UID: 0 PID: 9 Comm: kworker/0:0 Not
     tainted 6.16.0-rc4 #7 PREEMPT
    [  114.115355] Hardware name: Qualcomm Technologies, Inc. Robotics RB3gen2 (DT)
    [  114.115358] Workqueue: events qcom_glink_work
    [  114.115371] Call trace:8
    [  114.115374]  show_stack+0x18/0x24 (C)
    [  114.115382]  dump_stack_lvl+0x60/0x80
    [  114.115388]  dump_stack+0x18/0x24
    [  114.115393]  sysfs_warn_dup+0x64/0x80
    [  114.115402]  sysfs_create_dir_ns+0xf4/0x120
    [  114.115409]  kobject_add_internal+0x98/0x260
    [  114.115416]  kobject_add+0x9c/0x108
    [  114.115421]  device_add+0xc4/0x7a0
    [  114.115429]  rpmsg_register_device+0x5c/0xb0
    [  114.115434]  qcom_glink_work+0x4bc/0x820
    [  114.115438]  process_one_work+0x148/0x284
    [  114.115446]  worker_thread+0x2c4/0x3e0
    [  114.115452]  kthread+0x12c/0x204
    [  114.115457]  ret_from_fork+0x10/0x20
    [  114.115464] kobject: kobject_add_internal failed for 3700000.remoteproc:
    glink-edge.adsp_apps.-1.-1 with -EEXIST, don't try to register things with
    the same name in the same directory.
    [  114.250045] rpmsg 3700000.remoteproc:glink-edge.adsp_apps.-1.-1:
    device_add failed: -17
    
    Fixes: 835764ddd9af ("rpmsg: glink: Move the common glink protocol implementation to glink_native.c")
    Cc: [email protected]
    Signed-off-by: Srinivas Kandagatla <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/dasd: Fix gendisk parent after copy pair swap [+ + +]

Author: Stefan Haberland <[email protected]>
Date:   Wed Nov 26 17:06:31 2025 +0100

    s390/dasd: Fix gendisk parent after copy pair swap
    
    commit c943bfc6afb8d0e781b9b7406f36caa8bbf95cb9 upstream.
    
    After a copy pair swap the block device's "device" symlink points to
    the secondary CCW device, but the gendisk's parent remained the
    primary, leaving /sys/block/<dasdx> under the wrong parent.
    
    Move the gendisk to the secondary's device with device_move(), keeping
    the sysfs topology consistent after the swap.
    
    Fixes: 413862caad6f ("s390/dasd: add copy pair swap capability")
    Cc: [email protected] #6.1
    Reviewed-by: Jan Hoeppner <[email protected]>
    Signed-off-by: Stefan Haberland <[email protected]>
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

s390/ipl: Clear SBP flag when bootprog is set [+ + +]

Author: Sven Schnelle <[email protected]>
Date:   Fri Dec 5 10:58:57 2025 +0100

    s390/ipl: Clear SBP flag when bootprog is set
    
    commit b1aa01d31249bd116b18c7f512d3e46b4b4ad83b upstream.
    
    With z16 a new flag 'search boot program' was introduced for
    list-directed IPL (SCSI, NVMe, ECKD DASD). If this flag is set,
    e.g. via selecting the "Automatic" value for the "Boot program
    selector" control on an HMC load panel, it is copied to the reipl
    structure from the initial ipl structure. When a user now sets a
    boot prog via sysfs, the flag is not cleared and the bootloader
    will again automatically select the boot program, ignoring user
    configuration.
    
    To avoid that, clear the SBP flag when a bootprog sysfs file is
    written.
    
    Cc: [email protected]
    Reviewed-by: Peter Oberparleiter <[email protected]>
    Reviewed-by: Heiko Carstens <[email protected]>
    Signed-off-by: Sven Schnelle <[email protected]>
    Signed-off-by: Heiko Carstens <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

samples/ftrace: Adjust LoongArch register restore order in direct calls [+ + +]

Author: Chenghao Duan <[email protected]>
Date:   Wed Dec 31 15:19:25 2025 +0800

    samples/ftrace: Adjust LoongArch register restore order in direct calls
    
    commit bb85d206be208bbf834883e948125a35ac59993a upstream.
    
    Ensure that in the ftrace direct call logic, the CPU register state
    (with ra = parent return address) is restored to the correct state after
    the execution of the custom trampoline function and before returning to
    the traced function. Additionally, guarantee the correctness of the jump
    logic for jr t0 (traced function address).
    
    Cc: [email protected]
    Fixes: 9cdc3b6a299c ("LoongArch: ftrace: Add direct call support")
    Reported-by: Youling Tang <[email protected]>
    Acked-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Chenghao Duan <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched/deadline: only set free_cpus for online runqueues [+ + +]

Author: Doug Berger <[email protected]>
Date:   Thu Aug 14 18:22:36 2025 -0700

    sched/deadline: only set free_cpus for online runqueues
    
    [ Upstream commit 382748c05e58a9f1935f5a653c352422375566ea ]
    
    Commit 16b269436b72 ("sched/deadline: Modify cpudl::free_cpus
    to reflect rd->online") introduced the cpudl_set/clear_freecpu
    functions to allow the cpu_dl::free_cpus mask to be manipulated
    by the deadline scheduler class rq_on/offline callbacks so the
    mask would also reflect this state.
    
    Commit 9659e1eeee28 ("sched/deadline: Remove cpu_active_mask
    from cpudl_find()") removed the check of the cpu_active_mask to
    save some processing on the premise that the cpudl::free_cpus
    mask already reflected the runqueue online state.
    
    Unfortunately, there are cases where it is possible for the
    cpudl_clear function to set the free_cpus bit for a CPU when the
    deadline runqueue is offline. When this occurs while a CPU is
    connected to the default root domain the flag may retain the bad
    state after the CPU has been unplugged. Later, a different CPU
    that is transitioning through the default root domain may push a
    deadline task to the powered down CPU when cpudl_find sees its
    free_cpus bit is set. If this happens the task will not have the
    opportunity to run.
    
    One example is outlined here:
    https://lore.kernel.org/lkml/[email protected]
    
    Another occurs when the last deadline task is migrated from a
    CPU that has an offlined runqueue. The dequeue_task member of
    the deadline scheduler class will eventually call cpudl_clear
    and set the free_cpus bit for the CPU.
    
    This commit modifies the cpudl_clear function to be aware of the
    online state of the deadline runqueue so that the free_cpus mask
    can be updated appropriately.
    
    It is no longer necessary to manage the mask outside of the
    cpudl_set/clear functions so the cpudl_set/clear_freecpu
    functions are removed. In addition, since the free_cpus mask is
    now only updated under the cpudl lock the code was changed to
    use the non-atomic __cpumask functions.
    
    Signed-off-by: Doug Berger <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

sched/eevdf: Fix min_vruntime vs avg_vruntime [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Mon Dec 29 15:23:50 2025 -0500

    sched/eevdf: Fix min_vruntime vs avg_vruntime
    
    [ Upstream commit 79f3f9bedd149ea438aaeb0fb6a083637affe205 ]
    
    Basically, from the constraint that the sum of lag is zero, you can
    infer that the 0-lag point is the weighted average of the individual
    vruntime, which is what we're trying to compute:
    
            \Sum w_i * v_i
      avg = --------------
               \Sum w_i
    
    Now, since vruntime takes the whole u64 (worse, it wraps), this
    multiplication term in the numerator is not something we can compute;
    instead we do the min_vruntime (v0 henceforth) thing like:
    
      v_i = (v_i - v0) + v0
    
    This does two things:
     - it keeps the key: (v_i - v0) 'small';
     - it creates a relative 0-point in the modular space.
    
    If you do that subtitution and work it all out, you end up with:
    
            \Sum w_i * (v_i - v0)
      avg = --------------------- + v0
                  \Sum w_i
    
    Since you cannot very well track a ratio like that (and not suffer
    terrible numerical problems) we simpy track the numerator and
    denominator individually and only perform the division when strictly
    needed.
    
    Notably, the numerator lives in cfs_rq->avg_vruntime and the denominator
    lives in cfs_rq->avg_load.
    
    The one extra 'funny' is that these numbers track the entities in the
    tree, and current is typically outside of the tree, so avg_vruntime()
    adds current when needed before doing the division.
    
    (vruntime_eligible() elides the division by cross-wise multiplication)
    
    Anyway, as mentioned above, we currently use the CFS era min_vruntime
    for this purpose. However, this thing can only move forward, while the
    above avg can in fact move backward (when a non-eligible task leaves,
    the average becomes smaller), this can cause trouble when through
    happenstance (or construction) these values drift far enough apart to
    wreck the game.
    
    Replace cfs_rq::min_vruntime with cfs_rq::zero_vruntime which is kept
    near/at avg_vruntime, following its motion.
    
    The down-side is that this requires computing the avg more often.
    
    Fixes: 147f3efaa241 ("sched/fair: Implement an EEVDF-like scheduling policy")
    Reported-by: Zicheng Qu <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Cc: [email protected]
    [ Adjust context in comments + init_cfs_rq ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched/fair: Revert max_newidle_lb_cost bump [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Fri Nov 7 17:01:20 2025 +0100

    sched/fair: Revert max_newidle_lb_cost bump
    
    [ Upstream commit d206fbad9328ddb68ebabd7cf7413392acd38081 ]
    
    Many people reported regressions on their database workloads due to:
    
      155213a2aed4 ("sched/fair: Bump sd->max_newidle_lb_cost when newidle balance fails")
    
    For instance Adam Li reported a 6% regression on SpecJBB.
    
    Conversely this will regress schbench again; on my machine from 2.22
    Mrps/s down to 2.04 Mrps/s.
    
    Reported-by: Joseph Salisbury <[email protected]>
    Reported-by: Adam Li <[email protected]>
    Reported-by: Dietmar Eggemann <[email protected]>
    Reported-by: Hazem Mohamed Abuelfotoh <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Reviewed-by: Dietmar Eggemann <[email protected]>
    Tested-by: Dietmar Eggemann <[email protected]>
    Tested-by: Chris Mason <[email protected]>
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

sched/rt: Fix race in push_rt_task [+ + +]

Author: Harshit Agarwal <[email protected]>
Date:   Tue Feb 25 18:05:53 2025 +0000

    sched/rt: Fix race in push_rt_task
    
    commit 690e47d1403e90b7f2366f03b52ed3304194c793 upstream.
    
    Overview
    ========
    When a CPU chooses to call push_rt_task and picks a task to push to
    another CPU's runqueue then it will call find_lock_lowest_rq method
    which would take a double lock on both CPUs' runqueues. If one of the
    locks aren't readily available, it may lead to dropping the current
    runqueue lock and reacquiring both the locks at once. During this window
    it is possible that the task is already migrated and is running on some
    other CPU. These cases are already handled. However, if the task is
    migrated and has already been executed and another CPU is now trying to
    wake it up (ttwu) such that it is queued again on the runqeue
    (on_rq is 1) and also if the task was run by the same CPU, then the
    current checks will pass even though the task was migrated out and is no
    longer in the pushable tasks list.
    
    Crashes
    =======
    This bug resulted in quite a few flavors of crashes triggering kernel
    panics with various crash signatures such as assert failures, page
    faults, null pointer dereferences, and queue corruption errors all
    coming from scheduler itself.
    
    Some of the crashes:
    -> kernel BUG at kernel/sched/rt.c:1616! BUG_ON(idx >= MAX_RT_PRIO)
       Call Trace:
       ? __die_body+0x1a/0x60
       ? die+0x2a/0x50
       ? do_trap+0x85/0x100
       ? pick_next_task_rt+0x6e/0x1d0
       ? do_error_trap+0x64/0xa0
       ? pick_next_task_rt+0x6e/0x1d0
       ? exc_invalid_op+0x4c/0x60
       ? pick_next_task_rt+0x6e/0x1d0
       ? asm_exc_invalid_op+0x12/0x20
       ? pick_next_task_rt+0x6e/0x1d0
       __schedule+0x5cb/0x790
       ? update_ts_time_stats+0x55/0x70
       schedule_idle+0x1e/0x40
       do_idle+0x15e/0x200
       cpu_startup_entry+0x19/0x20
       start_secondary+0x117/0x160
       secondary_startup_64_no_verify+0xb0/0xbb
    
    -> BUG: kernel NULL pointer dereference, address: 00000000000000c0
       Call Trace:
       ? __die_body+0x1a/0x60
       ? no_context+0x183/0x350
       ? __warn+0x8a/0xe0
       ? exc_page_fault+0x3d6/0x520
       ? asm_exc_page_fault+0x1e/0x30
       ? pick_next_task_rt+0xb5/0x1d0
       ? pick_next_task_rt+0x8c/0x1d0
       __schedule+0x583/0x7e0
       ? update_ts_time_stats+0x55/0x70
       schedule_idle+0x1e/0x40
       do_idle+0x15e/0x200
       cpu_startup_entry+0x19/0x20
       start_secondary+0x117/0x160
       secondary_startup_64_no_verify+0xb0/0xbb
    
    -> BUG: unable to handle page fault for address: ffff9464daea5900
       kernel BUG at kernel/sched/rt.c:1861! BUG_ON(rq->cpu != task_cpu(p))
    
    -> kernel BUG at kernel/sched/rt.c:1055! BUG_ON(!rq->nr_running)
       Call Trace:
       ? __die_body+0x1a/0x60
       ? die+0x2a/0x50
       ? do_trap+0x85/0x100
       ? dequeue_top_rt_rq+0xa2/0xb0
       ? do_error_trap+0x64/0xa0
       ? dequeue_top_rt_rq+0xa2/0xb0
       ? exc_invalid_op+0x4c/0x60
       ? dequeue_top_rt_rq+0xa2/0xb0
       ? asm_exc_invalid_op+0x12/0x20
       ? dequeue_top_rt_rq+0xa2/0xb0
       dequeue_rt_entity+0x1f/0x70
       dequeue_task_rt+0x2d/0x70
       __schedule+0x1a8/0x7e0
       ? blk_finish_plug+0x25/0x40
       schedule+0x3c/0xb0
       futex_wait_queue_me+0xb6/0x120
       futex_wait+0xd9/0x240
       do_futex+0x344/0xa90
       ? get_mm_exe_file+0x30/0x60
       ? audit_exe_compare+0x58/0x70
       ? audit_filter_rules.constprop.26+0x65e/0x1220
       __x64_sys_futex+0x148/0x1f0
       do_syscall_64+0x30/0x80
       entry_SYSCALL_64_after_hwframe+0x62/0xc7
    
    -> BUG: unable to handle page fault for address: ffff8cf3608bc2c0
       Call Trace:
       ? __die_body+0x1a/0x60
       ? no_context+0x183/0x350
       ? spurious_kernel_fault+0x171/0x1c0
       ? exc_page_fault+0x3b6/0x520
       ? plist_check_list+0x15/0x40
       ? plist_check_list+0x2e/0x40
       ? asm_exc_page_fault+0x1e/0x30
       ? _cond_resched+0x15/0x30
       ? futex_wait_queue_me+0xc8/0x120
       ? futex_wait+0xd9/0x240
       ? try_to_wake_up+0x1b8/0x490
       ? futex_wake+0x78/0x160
       ? do_futex+0xcd/0xa90
       ? plist_check_list+0x15/0x40
       ? plist_check_list+0x2e/0x40
       ? plist_del+0x6a/0xd0
       ? plist_check_list+0x15/0x40
       ? plist_check_list+0x2e/0x40
       ? dequeue_pushable_task+0x20/0x70
       ? __schedule+0x382/0x7e0
       ? asm_sysvec_reschedule_ipi+0xa/0x20
       ? schedule+0x3c/0xb0
       ? exit_to_user_mode_prepare+0x9e/0x150
       ? irqentry_exit_to_user_mode+0x5/0x30
       ? asm_sysvec_reschedule_ipi+0x12/0x20
    
    Above are some of the common examples of the crashes that were observed
    due to this issue.
    
    Details
    =======
    Let's look at the following scenario to understand this race.
    
    1) CPU A enters push_rt_task
      a) CPU A has chosen next_task = task p.
      b) CPU A calls find_lock_lowest_rq(Task p, CPU Z’s rq).
      c) CPU A identifies CPU X as a destination CPU (X < Z).
      d) CPU A enters double_lock_balance(CPU Z’s rq, CPU X’s rq).
      e) Since X is lower than Z, CPU A unlocks CPU Z’s rq. Someone else has
         locked CPU X’s rq, and thus, CPU A must wait.
    
    2) At CPU Z
      a) Previous task has completed execution and thus, CPU Z enters
         schedule, locks its own rq after CPU A releases it.
      b) CPU Z dequeues previous task and begins executing task p.
      c) CPU Z unlocks its rq.
      d) Task p yields the CPU (ex. by doing IO or waiting to acquire a
         lock) which triggers the schedule function on CPU Z.
      e) CPU Z enters schedule again, locks its own rq, and dequeues task p.
      f) As part of dequeue, it sets p.on_rq = 0 and unlocks its rq.
    
    3) At CPU B
      a) CPU B enters try_to_wake_up with input task p.
      b) Since CPU Z dequeued task p, p.on_rq = 0, and CPU B updates
         B.state = WAKING.
      c) CPU B via select_task_rq determines CPU Y as the target CPU.
    
    4) The race
      a) CPU A acquires CPU X’s lock and relocks CPU Z.
      b) CPU A reads task p.cpu = Z and incorrectly concludes task p is
         still on CPU Z.
      c) CPU A failed to notice task p had been dequeued from CPU Z while
         CPU A was waiting for locks in double_lock_balance. If CPU A knew
         that task p had been dequeued, it would return NULL forcing
         push_rt_task to give up the task p's migration.
      d) CPU B updates task p.cpu = Y and calls ttwu_queue.
      e) CPU B locks Ys rq. CPU B enqueues task p onto Y and sets task
         p.on_rq = 1.
      f) CPU B unlocks CPU Y, triggering memory synchronization.
      g) CPU A reads task p.on_rq = 1, cementing its assumption that task p
         has not migrated.
      h) CPU A decides to migrate p to CPU X.
    
    This leads to A dequeuing p from Y's queue and various crashes down the
    line.
    
    Solution
    ========
    The solution here is fairly simple. After obtaining the lock (at 4a),
    the check is enhanced to make sure that the task is still at the head of
    the pushable tasks list. If not, then it is anyway not suitable for
    being pushed out.
    
    Testing
    =======
    The fix is tested on a cluster of 3 nodes, where the panics due to this
    are hit every couple of days. A fix similar to this was deployed on such
    cluster and was stable for more than 30 days.
    
    Co-developed-by: Jon Kohler <[email protected]>
    Signed-off-by: Jon Kohler <[email protected]>
    Co-developed-by: Gauri Patwardhan <[email protected]>
    Signed-off-by: Gauri Patwardhan <[email protected]>
    Co-developed-by: Rahul Chunduru <[email protected]>
    Signed-off-by: Rahul Chunduru <[email protected]>
    Signed-off-by: Harshit Agarwal <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Reviewed-by: "Steven Rostedt (Google)" <[email protected]>
    Reviewed-by: Phil Auld <[email protected]>
    Tested-by: Will Ton <[email protected]>
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Rajani Kantha <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched_ext: Factor out local_dsq_post_enq() from dispatch_enqueue() [+ + +]

Author: Tejun Heo <[email protected]>
Date:   Thu Jan 1 21:01:16 2026 -0500

    sched_ext: Factor out local_dsq_post_enq() from dispatch_enqueue()
    
    [ Upstream commit 530b6637c79e728d58f1d9b66bd4acf4b735b86d ]
    
    Factor out local_dsq_post_enq() which performs post-enqueue handling for
    local DSQs - triggering resched_curr() if SCX_ENQ_PREEMPT is specified or if
    the current CPU is idle. No functional change.
    
    This will be used by the next patch to fix move_local_task_to_local_dsq().
    
    Cc: [email protected] # v6.12+
    Reviewed-by: Andrea Righi <[email protected]>
    Reviewed-by: Emil Tsalapatis <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched_ext: Fix incorrect sched_class settings for per-cpu migration tasks [+ + +]

Author: Zqiang <[email protected]>
Date:   Mon Dec 29 14:39:20 2025 -0500

    sched_ext: Fix incorrect sched_class settings for per-cpu migration tasks
    
    [ Upstream commit 1dd6c84f1c544e552848a8968599220bd464e338 ]
    
    When loading the ebpf scheduler, the tasks in the scx_tasks list will
    be traversed and invoke __setscheduler_class() to get new sched_class.
    however, this would also incorrectly set the per-cpu migration
    task's->sched_class to rt_sched_class, even after unload, the per-cpu
    migration task's->sched_class remains sched_rt_class.
    
    The log for this issue is as follows:
    
    ./scx_rustland --stats 1
    [  199.245639][  T630] sched_ext: "rustland" does not implement cgroup cpu.weight
    [  199.269213][  T630] sched_ext: BPF scheduler "rustland" enabled
    04:25:09 [INFO] RustLand scheduler attached
    
    bpftrace -e 'iter:task /strcontains(ctx->task->comm, "migration")/
    { printf("%s:%d->%pS\n", ctx->task->comm, ctx->task->pid, ctx->task->sched_class); }'
    Attaching 1 probe...
    migration/0:24->rt_sched_class+0x0/0xe0
    migration/1:27->rt_sched_class+0x0/0xe0
    migration/2:33->rt_sched_class+0x0/0xe0
    migration/3:39->rt_sched_class+0x0/0xe0
    migration/4:45->rt_sched_class+0x0/0xe0
    migration/5:52->rt_sched_class+0x0/0xe0
    migration/6:58->rt_sched_class+0x0/0xe0
    migration/7:64->rt_sched_class+0x0/0xe0
    
    sched_ext: BPF scheduler "rustland" disabled (unregistered from user space)
    EXIT: unregistered from user space
    04:25:21 [INFO] Unregister RustLand scheduler
    
    bpftrace -e 'iter:task /strcontains(ctx->task->comm, "migration")/
    { printf("%s:%d->%pS\n", ctx->task->comm, ctx->task->pid, ctx->task->sched_class); }'
    Attaching 1 probe...
    migration/0:24->rt_sched_class+0x0/0xe0
    migration/1:27->rt_sched_class+0x0/0xe0
    migration/2:33->rt_sched_class+0x0/0xe0
    migration/3:39->rt_sched_class+0x0/0xe0
    migration/4:45->rt_sched_class+0x0/0xe0
    migration/5:52->rt_sched_class+0x0/0xe0
    migration/6:58->rt_sched_class+0x0/0xe0
    migration/7:64->rt_sched_class+0x0/0xe0
    
    This commit therefore generate a new scx_setscheduler_class() and
    add check for stop_sched_class to replace __setscheduler_class().
    
    Fixes: f0e1a0643a59 ("sched_ext: Implement BPF extensible scheduler class")
    Cc: [email protected] # v6.12+
    Signed-off-by: Zqiang <[email protected]>
    Reviewed-by: Andrea Righi <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    [ Adjust context ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

sched_ext: Fix missing post-enqueue handling in move_local_task_to_local_dsq() [+ + +]

Author: Tejun Heo <[email protected]>
Date:   Thu Jan 1 21:01:17 2026 -0500

    sched_ext: Fix missing post-enqueue handling in move_local_task_to_local_dsq()
    
    [ Upstream commit f5e1e5ec204da11fa87fdf006d451d80ce06e118 ]
    
    move_local_task_to_local_dsq() is used when moving a task from a non-local
    DSQ to a local DSQ on the same CPU. It directly manipulates the local DSQ
    without going through dispatch_enqueue() and was missing the post-enqueue
    handling that triggers preemption when SCX_ENQ_PREEMPT is set or the idle
    task is running.
    
    The function is used by move_task_between_dsqs() which backs
    scx_bpf_dsq_move() and may be called while the CPU is busy.
    
    Add local_dsq_post_enq() call to move_local_task_to_local_dsq(). As the
    dispatch path doesn't need post-enqueue handling, add SCX_RQ_IN_BALANCE
    early exit to keep consume_dispatch_q() behavior unchanged and avoid
    triggering unnecessary resched when scx_bpf_dsq_move() is used from the
    dispatch path.
    
    Fixes: 4c30f5ce4f7a ("sched_ext: Implement scx_bpf_dispatch[_vtime]_from_dsq()")
    Cc: [email protected] # v6.12+
    Reviewed-by: Andrea Righi <[email protected]>
    Reviewed-by: Emil Tsalapatis <[email protected]>
    Signed-off-by: Tejun Heo <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scripts/faddr2line: Fix "Argument list too long" error [+ + +]

Author: Pankaj Raghav <[email protected]>
Date:   Sun Sep 21 12:03:58 2025 +0200

    scripts/faddr2line: Fix "Argument list too long" error
    
    [ Upstream commit ff5c0466486ba8d07ab2700380e8fd6d5344b4e9 ]
    
    The run_readelf() function reads the entire output of readelf into a
    single shell variable. For large object files with extensive debug
    information, the size of this variable can exceed the system's
    command-line argument length limit.
    
    When this variable is subsequently passed to sed via `echo "${out}"`, it
    triggers an "Argument list too long" error, causing the script to fail.
    
    Fix this by redirecting the output of readelf to a temporary file
    instead of a variable. The sed commands are then modified to read from
    this file, avoiding the argument length limitation entirely.
    
    Signed-off-by: Pankaj Raghav <[email protected]>
    Signed-off-by: Josh Poimboeuf <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scs: fix a wrong parameter in __scs_magic [+ + +]

Author: Zhichi Lin <[email protected]>
Date:   Sat Oct 11 16:22:22 2025 +0800

    scs: fix a wrong parameter in __scs_magic
    
    commit 08bd4c46d5e63b78e77f2605283874bbe868ab19 upstream.
    
    __scs_magic() needs a 'void *' variable, but a 'struct task_struct *' is
    given.  'task_scs(tsk)' is the starting address of the task's shadow call
    stack, and '__scs_magic(task_scs(tsk))' is the end address of the task's
    shadow call stack.  Here should be '__scs_magic(task_scs(tsk))'.
    
    The user-visible effect of this bug is that when CONFIG_DEBUG_STACK_USAGE
    is enabled, the shadow call stack usage checking function
    (scs_check_usage) would scan an incorrect memory range.  This could lead
    to:
    
    1. **Inaccurate stack usage reporting**: The function would calculate
       wrong usage statistics for the shadow call stack, potentially showing
       incorrect value in kmsg.
    
    2. **Potential kernel crash**: If the value of __scs_magic(tsk)is
       greater than that of __scs_magic(task_scs(tsk)), the for loop may
       access unmapped memory, potentially causing a kernel panic.  However,
       this scenario is unlikely because task_struct is allocated via the slab
       allocator (which typically returns lower addresses), while the shadow
       call stack returned by task_scs(tsk) is allocated via vmalloc(which
       typically returns higher addresses).
    
    However, since this is purely a debugging feature
    (CONFIG_DEBUG_STACK_USAGE), normal production systems should be not
    unaffected.  The bug only impacts developers and testers who are actively
    debugging stack usage with this configuration enabled.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 5bbaf9d1fcb9 ("scs: Add support for stack usage debugging")
    Signed-off-by: Jiyuan Xie <[email protected]>
    Signed-off-by: Zhichi Lin <[email protected]>
    Reviewed-by: Sami Tolvanen <[email protected]>
    Acked-by: Will Deacon <[email protected]>
    Cc: Andrey Konovalov <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Marco Elver <[email protected]>
    Cc: Will Deacon <[email protected]>
    Cc: Yee Lee <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: aic94xx: fix use-after-free in device removal path [+ + +]

Author: Junrui Luo <[email protected]>
Date:   Wed Oct 29 00:29:04 2025 +0800

    scsi: aic94xx: fix use-after-free in device removal path
    
    commit f6ab594672d4cba08540919a4e6be2e202b60007 upstream.
    
    The asd_pci_remove() function fails to synchronize with pending tasklets
    before freeing the asd_ha structure, leading to a potential
    use-after-free vulnerability.
    
    When a device removal is triggered (via hot-unplug or module unload),
    race condition can occur.
    
    The fix adds tasklet_kill() before freeing the asd_ha structure,
    ensuring all scheduled tasklets complete before cleanup proceeds.
    
    Reported-by: Yuhao Jiang <[email protected]>
    Reported-by: Junrui Luo <[email protected]>
    Fixes: 2908d778ab3e ("[SCSI] aic94xx: new driver")
    Cc: [email protected]
    Signed-off-by: Junrui Luo <[email protected]>
    Link: https://patch.msgid.link/ME2PR01MB3156AB7DCACA206C845FC7E8AFFDA@ME2PR01MB3156.ausprd01.prod.outlook.com
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: mpi3mr: Read missing IOCFacts flag for reply queue full overflow [+ + +]

Author: Chandrakanth Patil <[email protected]>
Date:   Thu Dec 11 05:59:29 2025 +0530

    scsi: mpi3mr: Read missing IOCFacts flag for reply queue full overflow
    
    commit d373163194982f43b92c552c138c29d9f0b79553 upstream.
    
    The driver was not reading the MAX_REQ_PER_REPLY_QUEUE_LIMIT IOCFacts
    flag, so the reply-queue-full handling was never enabled, even on
    firmware that supports it. Reading this flag enables the feature and
    prevents reply queue overflow.
    
    Fixes: f08b24d82749 ("scsi: mpi3mr: Avoid reply queue full condition")
    Cc: [email protected]
    Signed-off-by: Chandrakanth Patil <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: qla2xxx: Fix initiator mode with qlini_mode=exclusive [+ + +]

Author: Tony Battersby <[email protected]>
Date:   Mon Nov 10 10:48:45 2025 -0500

    scsi: qla2xxx: Fix initiator mode with qlini_mode=exclusive
    
    [ Upstream commit 8f58fc64d559b5fda1b0a5e2a71422be61e79ab9 ]
    
    When given the module parameter qlini_mode=exclusive, qla2xxx in
    initiator mode is initially unable to successfully send SCSI commands to
    devices it finds while scanning, resulting in an escalating series of
    resets until an adapter reset clears the issue.  Fix by checking the
    active mode instead of the module parameter.
    
    Signed-off-by: Tony Battersby <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: qla2xxx: Fix lost interrupts with qlini_mode=disabled [+ + +]

Author: Tony Battersby <[email protected]>
Date:   Mon Nov 10 10:50:05 2025 -0500

    scsi: qla2xxx: Fix lost interrupts with qlini_mode=disabled
    
    [ Upstream commit 4f6aaade2a22ac428fa99ed716cf2b87e79c9837 ]
    
    When qla2xxx is loaded with qlini_mode=disabled,
    ha->flags.disable_msix_handshake is used before it is set, resulting in
    the wrong interrupt handler being used on certain HBAs
    (qla2xxx_msix_rsp_q_hs() is used when qla2xxx_msix_rsp_q() should be
    used).  The only difference between these two interrupt handlers is that
    the _hs() version writes to a register to clear the "RISC" interrupt,
    whereas the other version does not.  So this bug results in the RISC
    interrupt being cleared when it should not be.  This occasionally causes
    a different interrupt handler qla24xx_msix_default() for a different
    vector to see ((stat & HSRX_RISC_INT) == 0) and ignore its interrupt,
    which then causes problems like:
    
    qla2xxx [0000:02:00.0]-d04c:6: MBX Command timeout for cmd 20,
      iocontrol=8 jiffies=1090c0300 mb[0-3]=[0x4000 0x0 0x40 0xda] mb7 0x500
      host_status 0x40000010 hccr 0x3f00
    qla2xxx [0000:02:00.0]-101e:6: Mailbox cmd timeout occurred, cmd=0x20,
      mb[0]=0x20. Scheduling ISP abort
    (the cmd varies; sometimes it is 0x20, 0x22, 0x54, 0x5a, 0x5d, or 0x6a)
    
    This problem can be reproduced with a 16 or 32 Gbps HBA by loading
    qla2xxx with qlini_mode=disabled and running a high IOPS test while
    triggering frequent RSCN database change events.
    
    While analyzing the problem I discovered that even with
    disable_msix_handshake forced to 0, it is not necessary to clear the
    RISC interrupt from qla2xxx_msix_rsp_q_hs() (more below).  So just
    completely remove qla2xxx_msix_rsp_q_hs() and the logic for selecting
    it, which also fixes the bug with qlini_mode=disabled.
    
    The test below describes the justification for not needing
    qla2xxx_msix_rsp_q_hs():
    
    Force disable_msix_handshake to 0:
    qla24xx_config_rings():
    if (0 && (ha->fw_attributes & BIT_6) && (IS_MSIX_NACK_CAPABLE(ha)) &&
        (ha->flags.msix_enabled)) {
    
    In qla24xx_msix_rsp_q() and qla2xxx_msix_rsp_q_hs(), check:
      (rd_reg_dword(®->host_status) & HSRX_RISC_INT)
    
    Count the number of calls to each function with HSRX_RISC_INT set and
    the number with HSRX_RISC_INT not set while performing some I/O.
    
    If qla2xxx_msix_rsp_q_hs() clears the RISC interrupt (original code):
    qla24xx_msix_rsp_q:    50% of calls have HSRX_RISC_INT set
    qla2xxx_msix_rsp_q_hs:  5% of calls have HSRX_RISC_INT set
    (# of qla2xxx_msix_rsp_q_hs interrupts) =
        (# of qla24xx_msix_rsp_q interrupts) * 3
    
    If qla2xxx_msix_rsp_q_hs() does not clear the RISC interrupt (patched
    code):
    qla24xx_msix_rsp_q:    100% of calls have HSRX_RISC_INT set
    qla2xxx_msix_rsp_q_hs:   9% of calls have HSRX_RISC_INT set
    (# of qla2xxx_msix_rsp_q_hs interrupts) =
        (# of qla24xx_msix_rsp_q interrupts) * 3
    
    In the case of the original code, qla24xx_msix_rsp_q() was seeing
    HSRX_RISC_INT set only 50% of the time because qla2xxx_msix_rsp_q_hs()
    was clearing it when it shouldn't have been.  In the patched code,
    qla24xx_msix_rsp_q() sees HSRX_RISC_INT set 100% of the time, which
    makes sense if that interrupt handler needs to clear the RISC interrupt
    (which it does).  qla2xxx_msix_rsp_q_hs() sees HSRX_RISC_INT only 9% of
    the time, which is just overlap from the other interrupt during the
    high IOPS test.
    
    Tested with SCST on:
    QLE2742  FW:v9.08.02 (32 Gbps 2-port)
    QLE2694L FW:v9.10.11 (16 Gbps 4-port)
    QLE2694L FW:v9.08.02 (16 Gbps 4-port)
    QLE2672  FW:v8.07.12 (16 Gbps 2-port)
    both initiator and target mode
    
    Signed-off-by: Tony Battersby <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: qla2xxx: Use reinit_completion on mbx_intr_comp [+ + +]

Author: Tony Battersby <[email protected]>
Date:   Mon Nov 10 10:51:28 2025 -0500

    scsi: qla2xxx: Use reinit_completion on mbx_intr_comp
    
    [ Upstream commit 957aa5974989fba4ae4f807ebcb27f12796edd4d ]
    
    If a mailbox command completes immediately after
    wait_for_completion_timeout() times out, ha->mbx_intr_comp could be left
    in an inconsistent state, causing the next mailbox command not to wait
    for the hardware.  Fix by reinitializing the completion before use.
    
    Signed-off-by: Tony Battersby <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: Revert "scsi: qla2xxx: Perform lockless command completion in abort path" [+ + +]

Author: Tony Battersby <[email protected]>
Date:   Mon Nov 10 10:47:35 2025 -0500

    scsi: Revert "scsi: qla2xxx: Perform lockless command completion in abort path"
    
    commit b57fbc88715b6d18f379463f48a15b560b087ffe upstream.
    
    This reverts commit 0367076b0817d5c75dfb83001ce7ce5c64d803a9.
    
    The commit being reverted added code to __qla2x00_abort_all_cmds() to
    call sp->done() without holding a spinlock.  But unlike the older code
    below it, this new code failed to check sp->cmd_type and just assumed
    TYPE_SRB, which results in a jump to an invalid pointer in target-mode
    with TYPE_TGT_CMD:
    
    qla2xxx [0000:65:00.0]-d034:8: qla24xx_do_nack_work create sess success
      0000000009f7a79b
    qla2xxx [0000:65:00.0]-5003:8: ISP System Error - mbx1=1ff5h mbx2=10h
      mbx3=0h mbx4=0h mbx5=191h mbx6=0h mbx7=0h.
    qla2xxx [0000:65:00.0]-d01e:8: -> fwdump no buffer
    qla2xxx [0000:65:00.0]-f03a:8: qla_target(0): System error async event
      0x8002 occurred
    qla2xxx [0000:65:00.0]-00af:8: Performing ISP error recovery -
      ha=0000000058183fda.
    BUG: kernel NULL pointer dereference, address: 0000000000000000
    PF: supervisor instruction fetch in kernel mode
    PF: error_code(0x0010) - not-present page
    PGD 0 P4D 0
    Oops: 0010 [#1] SMP
    CPU: 2 PID: 9446 Comm: qla2xxx_8_dpc Tainted: G           O       6.1.133 #1
    Hardware name: Supermicro Super Server/X11SPL-F, BIOS 4.2 12/15/2023
    RIP: 0010:0x0
    Code: Unable to access opcode bytes at 0xffffffffffffffd6.
    RSP: 0018:ffffc90001f93dc8 EFLAGS: 00010206
    RAX: 0000000000000282 RBX: 0000000000000355 RCX: ffff88810d16a000
    RDX: ffff88810dbadaa8 RSI: 0000000000080000 RDI: ffff888169dc38c0
    RBP: ffff888169dc38c0 R08: 0000000000000001 R09: 0000000000000045
    R10: ffffffffa034bdf0 R11: 0000000000000000 R12: ffff88810800bb40
    R13: 0000000000001aa8 R14: ffff888100136610 R15: ffff8881070f7400
    FS:  0000000000000000(0000) GS:ffff88bf80080000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: ffffffffffffffd6 CR3: 000000010c8ff006 CR4: 00000000003706e0
    DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Call Trace:
     <TASK>
     ? __die+0x4d/0x8b
     ? page_fault_oops+0x91/0x180
     ? trace_buffer_unlock_commit_regs+0x38/0x1a0
     ? exc_page_fault+0x391/0x5e0
     ? asm_exc_page_fault+0x22/0x30
     __qla2x00_abort_all_cmds+0xcb/0x3e0 [qla2xxx_scst]
     qla2x00_abort_all_cmds+0x50/0x70 [qla2xxx_scst]
     qla2x00_abort_isp_cleanup+0x3b7/0x4b0 [qla2xxx_scst]
     qla2x00_abort_isp+0xfd/0x860 [qla2xxx_scst]
     qla2x00_do_dpc+0x581/0xa40 [qla2xxx_scst]
     kthread+0xa8/0xd0
     </TASK>
    
    Then commit 4475afa2646d ("scsi: qla2xxx: Complete command early within
    lock") added the spinlock back, because not having the lock caused a
    race and a crash.  But qla2x00_abort_srb() in the switch below already
    checks for qla2x00_chip_is_down() and handles it the same way, so the
    code above the switch is now redundant and still buggy in target-mode.
    Remove it.
    
    Cc: [email protected]
    Signed-off-by: Tony Battersby <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: scsi_debug: Fix atomic write enable module param description [+ + +]

Author: John Garry <[email protected]>
Date:   Thu Dec 11 10:06:51 2025 +0000

    scsi: scsi_debug: Fix atomic write enable module param description
    
    [ Upstream commit 1f7d6e2efeedd8f545d3e0e9bf338023bf4ea584 ]
    
    The atomic write enable module param is "atomic_wr", and not
    "atomic_write", so fix the module param description.
    
    Fixes: 84f3a3c01d70 ("scsi: scsi_debug: Atomic write support")
    Signed-off-by: John Garry <[email protected]>
    Reviewed-by: Bart Van Assche <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: smartpqi: Add support for Hurray Data new controller PCI device [+ + +]

Author: David Strahan <[email protected]>
Date:   Thu Nov 6 10:38:21 2025 -0600

    scsi: smartpqi: Add support for Hurray Data new controller PCI device
    
    [ Upstream commit 48e6b7e708029cea451e53a8c16fc8c16039ecdc ]
    
    Add support for new Hurray Data controller.
    
    All entries are in HEX.
    
    Add PCI IDs for Hurray Data controllers:
                                             VID  / DID  / SVID / SDID
                                             ----   ----   ----   ----
                                             9005   028f   207d   4840
    
    Reviewed-by: Scott Benesh <[email protected]>
    Reviewed-by: Scott Teel <[email protected]>
    Reviewed-by: Mike McGowen <[email protected]>
    Signed-off-by: David Strahan <[email protected]>
    Signed-off-by: Don Brace <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

scsi: target: Reset t_task_cdb pointer in error case [+ + +]

Author: Andrey Vatoropin <[email protected]>
Date:   Tue Nov 18 08:42:31 2025 +0000

    scsi: target: Reset t_task_cdb pointer in error case
    
    commit 5053eab38a4c4543522d0c320c639c56a8b59908 upstream.
    
    If allocation of cmd->t_task_cdb fails, it remains NULL but is later
    dereferenced in the 'err' path.
    
    In case of error, reset NULL t_task_cdb value to point at the default
    fixed-size buffer.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: 9e95fb805dc0 ("scsi: target: Fix NULL pointer dereference")
    Cc: [email protected]
    Signed-off-by: Andrey Vatoropin <[email protected]>
    Reviewed-by: Mike Christie <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: ufs: core: Add ufshcd_update_evt_hist() for UFS suspend error [+ + +]

Author: Seunghwan Baek <[email protected]>
Date:   Wed Dec 10 15:38:54 2025 +0900

    scsi: ufs: core: Add ufshcd_update_evt_hist() for UFS suspend error
    
    commit c9f36f04a8a2725172cdf2b5e32363e4addcb14c upstream.
    
    If UFS resume fails, the event history is updated in ufshcd_resume(), but
    there is no code anywhere to record UFS suspend. Therefore, add code to
    record UFS suspend error event history.
    
    Fixes: dd11376b9f1b ("scsi: ufs: Split the drivers/scsi/ufs directory")
    Cc: [email protected]
    Signed-off-by: Seunghwan Baek <[email protected]>
    Reviewed-by: Peter Wang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

scsi: ufs: host: mediatek: Fix shutdown/suspend race condition [+ + +]

Author: Peter Wang <[email protected]>
Date:   Wed Sep 24 17:43:27 2025 +0800

    scsi: ufs: host: mediatek: Fix shutdown/suspend race condition
    
    [ Upstream commit 014de20bb36ba03e0e0b0a7e0a1406ab900c9fda ]
    
    Address a race condition between shutdown and suspend operations in the
    UFS Mediatek driver. Before entering suspend, check if a shutdown is in
    progress to prevent conflicts and ensure system stability.
    
    Signed-off-by: Peter Wang <[email protected]>
    Acked-by: Chun-Hung Wu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Martin K. Petersen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests/ftrace: traceonoff_triggers: strip off names [+ + +]

Author: Yipeng Zou <[email protected]>
Date:   Fri Aug 18 09:32:26 2023 +0800

    selftests/ftrace: traceonoff_triggers: strip off names
    
    [ Upstream commit b889b4fb4cbea3ca7eb9814075d6a51936394bd9 ]
    
    The func_traceonoff_triggers.tc sometimes goes to fail
    on my board, Kunpeng-920.
    
    [root@localhost]# ./ftracetest ./test.d/ftrace/func_traceonoff_triggers.tc -l fail.log
    === Ftrace unit tests ===
    [1] ftrace - test for function traceon/off triggers     [FAIL]
    [2] (instance)  ftrace - test for function traceon/off triggers [UNSUPPORTED]
    
    I look up the log, and it shows that the md5sum is different between csum1 and csum2.
    
    ++ cnt=611
    ++ sleep .1
    +++ cnt_trace
    +++ grep -v '^#' trace
    +++ wc -l
    ++ cnt2=611
    ++ '[' 611 -ne 611 ']'
    +++ cat tracing_on
    ++ on=0
    ++ '[' 0 '!=' 0 ']'
    +++ md5sum trace
    ++ csum1='76896aa74362fff66a6a5f3cf8a8a500  trace'
    ++ sleep .1
    +++ md5sum trace
    ++ csum2='ee8625a21c058818fc26e45c1ed3f6de  trace'
    ++ '[' '76896aa74362fff66a6a5f3cf8a8a500  trace' '!=' 'ee8625a21c058818fc26e45c1ed3f6de  trace' ']'
    ++ fail 'Tracing file is still changing'
    ++ echo Tracing file is still changing
    Tracing file is still changing
    ++ exit_fail
    ++ exit 1
    
    So I directly dump the trace file before md5sum, the diff shows that:
    
    [root@localhost]# diff trace_1.log trace_2.log -y --suppress-common-lines
    dockerd-12285   [036] d.... 18385.510290: sched_stat | <...>-12285   [036] d.... 18385.510290: sched_stat
    dockerd-12285   [036] d.... 18385.510291: sched_swit | <...>-12285   [036] d.... 18385.510291: sched_swit
    <...>-740       [044] d.... 18385.602859: sched_stat | kworker/44:1-740 [044] d.... 18385.602859: sched_stat
    <...>-740       [044] d.... 18385.602860: sched_swit | kworker/44:1-740 [044] d.... 18385.602860: sched_swit
    
    And we can see that <...> filed be filled with names.
    
    We can strip off the names there to fix that.
    
    After strip off the names:
    
    kworker/u257:0-12 [019] d..2.  2528.758910: sched_stat | -12 [019] d..2.  2528.758910: sched_stat_runtime: comm=k
    kworker/u257:0-12 [019] d..2.  2528.758912: sched_swit | -12 [019] d..2.  2528.758912: sched_switch: prev_comm=kw
    <idle>-0          [000] d.s5.  2528.762318: sched_waki | -0  [000] d.s5.  2528.762318: sched_waking: comm=sshd pi
    <idle>-0          [037] dNh2.  2528.762326: sched_wake | -0  [037] dNh2.  2528.762326: sched_wakeup: comm=sshd pi
    <idle>-0          [037] d..2.  2528.762334: sched_swit | -0  [037] d..2.  2528.762334: sched_switch: prev_comm=sw
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: d87b29179aa0 ("selftests: ftrace: Use md5sum to take less time of checking logs")
    Suggested-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Yipeng Zou <[email protected]>
    Acked-by: Masami Hiramatsu (Google) <[email protected]>
    Reviewed-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Shuah Khan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests: mptcp: pm: ensure unknown flags are ignored [+ + +]

Author: Matthieu Baerts (NGI0) <[email protected]>
Date:   Fri Dec 5 19:55:15 2025 +0100

    selftests: mptcp: pm: ensure unknown flags are ignored
    
    commit 29f4801e9c8dfd12bdcb33b61a6ac479c7162bd7 upstream.
    
    This validates the previous commit: the userspace can set unknown flags
    -- the 7th bit is currently unused -- without errors, but only the
    supported ones are printed in the endpoints dumps.
    
    The 'Fixes' tag here below is the same as the one from the previous
    commit: this patch here is not fixing anything wrong in the selftests,
    but it validates the previous fix for an issue introduced by this commit
    ID.
    
    Fixes: 01cacb00b35c ("mptcp: add netlink-based PM")
    Cc: [email protected]
    Reviewed-by: Mat Martineau <[email protected]>
    Signed-off-by: Matthieu Baerts (NGI0) <[email protected]>
    Link: https://patch.msgid.link/20251205-net-mptcp-misc-fixes-6-19-rc1-v1-2-9e4781a6c1b8@kernel.org
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

selftests: net: fix "buffer overflow detected" for tap.c [+ + +]

Author: Alice C. Munduruca <[email protected]>
Date:   Tue Dec 16 12:06:41 2025 -0500

    selftests: net: fix "buffer overflow detected" for tap.c
    
    [ Upstream commit 472c5dd6b95c02b3e5d7395acf542150e91165e7 ]
    
    When the selftest 'tap.c' is compiled with '-D_FORTIFY_SOURCE=3',
    the strcpy() in rtattr_add_strsz() is replaced with a checked
    version which causes the test to consistently fail when compiled
    with toolchains for which this option is enabled by default.
    
     TAP version 13
     1..3
     # Starting 3 tests from 1 test cases.
     #  RUN           tap.test_packet_valid_udp_gso ...
     *** buffer overflow detected ***: terminated
     # test_packet_valid_udp_gso: Test terminated by assertion
     #          FAIL  tap.test_packet_valid_udp_gso
     not ok 1 tap.test_packet_valid_udp_gso
     #  RUN           tap.test_packet_valid_udp_csum ...
     *** buffer overflow detected ***: terminated
     # test_packet_valid_udp_csum: Test terminated by assertion
     #          FAIL  tap.test_packet_valid_udp_csum
     not ok 2 tap.test_packet_valid_udp_csum
     #  RUN           tap.test_packet_crash_tap_invalid_eth_proto ...
     *** buffer overflow detected ***: terminated
     # test_packet_crash_tap_invalid_eth_proto: Test terminated by assertion
     #          FAIL  tap.test_packet_crash_tap_invalid_eth_proto
     not ok 3 tap.test_packet_crash_tap_invalid_eth_proto
     # FAILED: 0 / 3 tests passed.
     # Totals: pass:0 fail:3 xfail:0 xpass:0 skip:0 error:0
    
    A buffer overflow is detected by the fortified glibc __strcpy_chk()
    since the __builtin_object_size() of `RTA_DATA(rta)` is incorrectly
    reported as 1, even though there is ample space in its bounding
    buffer `req`.
    
    Additionally, given that IFLA_IFNAME also expects a null-terminated
    string, callers of rtaddr_add_str{,sz}() could simply use the
    rtaddr_add_strsz() variant. (which has been renamed to remove the
    trailing `sz`) memset() has been used for this function since it
    is unchecked and thus circumvents the issue discussed in the
    previous paragraph.
    
    Fixes: 2e64fe4624d1 ("selftests: add few test cases for tap driver")
    Signed-off-by: Alice C. Munduruca <[email protected]>
    Reviewed-by: Cengiz Can <[email protected]>
    Reviewed-by: Willem de Bruijn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests: netfilter: packetdrill: avoid failure on HZ=100 kernel [+ + +]

Author: Florian Westphal <[email protected]>
Date:   Thu Dec 11 13:16:49 2025 +0100

    selftests: netfilter: packetdrill: avoid failure on HZ=100 kernel
    
    [ Upstream commit fec7b0795548b43e2c3c46e3143c34ef6070341c ]
    
    packetdrill --ip_version=ipv4 --mtu=1500 --tolerance_usecs=1000000 --non_fatal packet conntrack_syn_challenge_ack.pkt
    conntrack v1.4.8 (conntrack-tools): 1 flow entries have been shown.
    conntrack_syn_challenge_ack.pkt:32: error executing `conntrack -f $NFCT_IP_VERSION \
    -L -p tcp --dport 8080 | grep UNREPLIED | grep -q SYN_SENT` command: non-zero status 1
    
    Affected kernel had CONFIG_HZ=100; reset packet was still sitting in
    backlog.
    
    Reported-by: Yi Chen <[email protected]>
    Fixes: a8a388c2aae4 ("selftests: netfilter: add packetdrill based conntrack tests")
    Signed-off-by: Florian Westphal <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

serial: core: fix OF node leak [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Tue Dec 30 08:59:17 2025 -0500

    serial: core: fix OF node leak
    
    [ Upstream commit 273cc3406c8d4e830ed45967c70d08d20ca1380e ]
    
    Make sure to drop the OF node reference taken when initialising the
    control and port devices when the devices are later released.
    
    Fixes: d36f0e9a0002 ("serial: core: restore of_node information in sysfs")
    Cc: Aidan Stewart <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Stable-dep-of: 24ec03cc5512 ("serial: core: Restore sysfs fwnode information")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: core: Fix serial device initialization [+ + +]

Author: Alexander Stein <[email protected]>
Date:   Fri Dec 19 16:28:12 2025 +0100

    serial: core: Fix serial device initialization
    
    commit f54151148b969fb4b62bec8093d255306d20df30 upstream.
    
    During restoring sysfs fwnode information the information of_node_reused
    was dropped. This was previously set by device_set_of_node_from_dev().
    Add it back manually
    
    Fixes: 24ec03cc5512 ("serial: core: Restore sysfs fwnode information")
    Cc: stable <[email protected]>
    Suggested-by: Cosmin Tanislav <[email protected]>
    Signed-off-by: Alexander Stein <[email protected]>
    Tested-by: Michael Walle <[email protected]>
    Tested-by: Marek Szyprowski <[email protected]>
    Tested-by: Cosmin Tanislav <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: core: Restore sysfs fwnode information [+ + +]

Author: Andy Shevchenko <[email protected]>
Date:   Tue Dec 30 08:59:18 2025 -0500

    serial: core: Restore sysfs fwnode information
    
    [ Upstream commit 24ec03cc55126b7b3adf102f4b3d9f716532b329 ]
    
    The change that restores sysfs fwnode information does it only for OF cases.
    Update the fix to cover all possible types of fwnodes.
    
    Fixes: d36f0e9a0002 ("serial: core: restore of_node information in sysfs")
    Cc: stable <[email protected]>
    Signed-off-by: Andy Shevchenko <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: sh-sci: Check that the DMA cookie is valid [+ + +]

Author: Claudiu Beznea <[email protected]>
Date:   Wed Dec 17 15:57:59 2025 +0200

    serial: sh-sci: Check that the DMA cookie is valid
    
    commit c3ca8a0aac832fe8047608bb2ae2cca314c6d717 upstream.
    
    The driver updates struct sci_port::tx_cookie to zero right before the TX
    work is scheduled, or to -EINVAL when DMA is disabled.
    dma_async_is_complete(), called through dma_cookie_status() (and possibly
    through dmaengine_tx_status()), considers cookies valid only if they have
    values greater than or equal to 1.
    
    Passing zero or -EINVAL to dmaengine_tx_status() before any TX DMA
    transfer has started leads to an incorrect TX status being reported, as the
    cookie is invalid for the DMA subsystem. This may cause long wait times
    when the serial device is opened for configuration before any TX activity
    has occurred.
    
    Check that the TX cookie is valid before passing it to
    dmaengine_tx_status().
    
    Fixes: 7cc0e0a43a91 ("serial: sh-sci: Check if TX data was written to device in .tx_empty()")
    Cc: stable <[email protected]>
    Signed-off-by: Claudiu Beznea <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: sprd: Return -EPROBE_DEFER when uart clock is not ready [+ + +]

Author: Wenhua Lin <[email protected]>
Date:   Wed Oct 22 11:08:40 2025 +0800

    serial: sprd: Return -EPROBE_DEFER when uart clock is not ready
    
    [ Upstream commit 29e8a0c587e328ed458380a45d6028adf64d7487 ]
    
    In sprd_clk_init(), when devm_clk_get() returns -EPROBE_DEFER
    for either uart or source clock, we should propagate the
    error instead of just warning and continuing with NULL clocks.
    
    Currently the driver only emits a warning when clock acquisition
    fails and proceeds with NULL clock pointers. This can lead to
    issues later when the clocks are actually needed. More importantly,
    when the clock provider is not ready yet and returns -EPROBE_DEFER,
    we should return this error to allow deferred probing.
    
    This change adds explicit checks for -EPROBE_DEFER after both:
    1. devm_clk_get(uport->dev, uart)
    2. devm_clk_get(uport->dev, source)
    
    When -EPROBE_DEFER is encountered, the function now returns
    -EPROBE_DEFER to let the driver framework retry probing
    later when the clock dependencies are resolved.
    
    Signed-off-by: Wenhua Lin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Cixi Geng <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

serial: xilinx_uartps: fix rs485 delay_rts_after_send [+ + +]

Author: j.turek <[email protected]>
Date:   Tue Dec 30 10:55:56 2025 -0500

    serial: xilinx_uartps: fix rs485 delay_rts_after_send
    
    [ Upstream commit 267ee93c417e685d9f8e079e41c70ba6ee4df5a5 ]
    
    RTS line control with delay should be triggered when there is no more
    bytes in kfifo and hardware buffer is empty. Without this patch RTS
    control is scheduled right after feeding hardware buffer and this is too
    early.
    
    RTS line may change state before hardware buffer is empty.
    
    With this patch delayed RTS state change is triggered when function
    cdns_uart_handle_tx is called from cdns_uart_isr on
    CDNS_UART_IXR_TXEMPTY exactly when hardware completed transmission
    
    Fixes: fccc9d9233f9 ("tty: serial: uartps: Add rs485 support to uartps driver")
    Cc: stable <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Turek  <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

serial: xilinx_uartps: Use helper function hrtimer_update_function() [+ + +]

Author: Nam Cao <[email protected]>
Date:   Tue Dec 30 10:55:55 2025 -0500

    serial: xilinx_uartps: Use helper function hrtimer_update_function()
    
    [ Upstream commit eee00df8e1f1f5648ed8f9e40e2bb54c2877344a ]
    
    The field 'function' of struct hrtimer should not be changed directly, as
    the write is lockless and a concurrent timer expiry might end up using the
    wrong function pointer.
    
    Switch to use hrtimer_update_function() which also performs runtime checks
    that it is safe to modify the callback.
    
    Signed-off-by: Nam Cao <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Link: https://lore.kernel.org/all/af7823518fb060c6c97105a2513cfc61adbdf38f.1738746927.git.namcao@linutronix.de
    Stable-dep-of: 267ee93c417e ("serial: xilinx_uartps: fix rs485 delay_rts_after_send")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

shmem: fix recovery on rename failures [+ + +]

Author: Al Viro <[email protected]>
Date:   Sat Dec 13 17:50:23 2025 -0500

    shmem: fix recovery on rename failures
    
    [ Upstream commit e1b4c6a58304fd490124cc2b454d80edc786665c ]
    
    maple_tree insertions can fail if we are seriously short on memory;
    simple_offset_rename() does not recover well if it runs into that.
    The same goes for simple_offset_rename_exchange().
    
    Moreover, shmem_whiteout() expects that if it succeeds, the caller will
    progress to d_move(), i.e. that shmem_rename2() won't fail past the
    successful call of shmem_whiteout().
    
    Not hard to fix, fortunately - mtree_store() can't fail if the index we
    are trying to store into is already present in the tree as a singleton.
    
    For simple_offset_rename_exchange() that's enough - we just need to be
    careful about the order of operations.
    
    For simple_offset_rename() solution is to preinsert the target into the
    tree for new_dir; the rest can be done without any potentially failing
    operations.
    
    That preinsertion has to be done in shmem_rename2() rather than in
    simple_offset_rename() itself - otherwise we'd need to deal with the
    possibility of failure after successful shmem_whiteout().
    
    Fixes: a2e459555c5f ("shmem: stable directory offsets")
    Reviewed-by: Christian Brauner <[email protected]>
    Reviewed-by: Chuck Lever <[email protected]>
    Signed-off-by: Al Viro <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

smb/server: fix return value of smb2_ioctl() [+ + +]

Author: ChenXiaoSong <[email protected]>
Date:   Fri Oct 17 18:46:10 2025 +0800

    smb/server: fix return value of smb2_ioctl()
    
    [ Upstream commit 269df046c1e15ab34fa26fd90db9381f022a0963 ]
    
    __process_request() will not print error messages if smb2_ioctl()
    always returns 0.
    
    Fix this by returning the correct value at the end of function.
    
    Signed-off-by: ChenXiaoSong <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

smc91x: fix broken irq-context in PREEMPT_RT [+ + +]

Author: Yeoreum Yun <[email protected]>
Date:   Wed Dec 17 08:51:15 2025 +0000

    smc91x: fix broken irq-context in PREEMPT_RT
    
    [ Upstream commit 6402078bd9d1ed46e79465e1faaa42e3458f8a33 ]
    
    When smc91x.c is built with PREEMPT_RT, the following splat occurs
    in FVP_RevC:
    
    [   13.055000] smc91x LNRO0003:00 eth0: link up, 10Mbps, half-duplex, lpa 0x0000
    [   13.062137] BUG: workqueue leaked atomic, lock or RCU: kworker/2:1[106]
    [   13.062137]      preempt=0x00000000 lock=0->0 RCU=0->1 workfn=mld_ifc_work
    [   13.062266] C
    ** replaying previous printk message **
    [   13.062266] CPU: 2 UID: 0 PID: 106 Comm: kworker/2:1 Not tainted 6.18.0-dirty #179 PREEMPT_{RT,(full)}
    [   13.062353] Hardware name:  , BIOS
    [   13.062382] Workqueue: mld mld_ifc_work
    [   13.062469] Call trace:
    [   13.062494]  show_stack+0x24/0x40 (C)
    [   13.062602]  __dump_stack+0x28/0x48
    [   13.062710]  dump_stack_lvl+0x7c/0xb0
    [   13.062818]  dump_stack+0x18/0x34
    [   13.062926]  process_scheduled_works+0x294/0x450
    [   13.063043]  worker_thread+0x260/0x3d8
    [   13.063124]  kthread+0x1c4/0x228
    [   13.063235]  ret_from_fork+0x10/0x20
    
    This happens because smc_special_trylock() disables IRQs even on PREEMPT_RT,
    but smc_special_unlock() does not restore IRQs on PREEMPT_RT.
    The reason is that smc_special_unlock() calls spin_unlock_irqrestore(),
    and rcu_read_unlock_bh() in __dev_queue_xmit() cannot invoke
    rcu_read_unlock() through __local_bh_enable_ip() when current->softirq_disable_cnt becomes zero.
    
    To address this issue, replace smc_special_trylock() with spin_trylock_irqsave().
    
    Fixes: 342a93247e08 ("locking/spinlock: Provide RT variant header: <linux/spinlock_rt.h>")
    Signed-off-by: Yeoreum Yun <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

soc/tegra: fuse: Do not register SoC device on ACPI boot [+ + +]

Author: Kartik Rajput <[email protected]>
Date:   Wed Oct 8 16:46:18 2025 +0530

    soc/tegra: fuse: Do not register SoC device on ACPI boot
    
    commit c87f820bc4748fdd4d50969e8930cd88d1b61582 upstream.
    
    On Tegra platforms using ACPI, the SMCCC driver already registers the
    SoC device. This makes the registration performed by the Tegra fuse
    driver redundant.
    
    When booted via ACPI, skip registering the SoC device and suppress
    printing SKU information from the Tegra fuse driver, as this information
    is already provided by the SMCCC driver.
    
    Fixes: 972167c69080 ("soc/tegra: fuse: Add ACPI support for Tegra194 and Tegra234")
    Cc: [email protected]
    Signed-off-by: Kartik Rajput <[email protected]>
    Signed-off-by: Thierry Reding <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: amlogic: canvas: fix device leak on lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Sep 26 16:24:53 2025 +0200

    soc: amlogic: canvas: fix device leak on lookup
    
    commit 32200f4828de9d7e6db379909898e718747f4e18 upstream.
    
    Make sure to drop the reference taken to the canvas platform device when
    looking up its driver data.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away so there is no point in keeping the reference.
    
    Also note that commit 28f851e6afa8 ("soc: amlogic: canvas: add missing
    put_device() call in meson_canvas_get()") fixed the leak in a lookup
    error path, but the reference is still leaking on success.
    
    Fixes: d4983983d987 ("soc: amlogic: add meson-canvas driver")
    Cc: [email protected]      # 4.20: 28f851e6afa8
    Cc: Yu Kuai <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Martin Blumenstingl <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Neil Armstrong <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: apple: mailbox: fix device leak on lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Sep 26 16:31:31 2025 +0200

    soc: apple: mailbox: fix device leak on lookup
    
    commit f401671e90ccc26b3022f177c4156a429c024f6c upstream.
    
    Make sure to drop the reference taken to the mbox platform device when
    looking up its driver data.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away so there is no point in keeping the reference.
    
    Fixes: 6e1457fcad3f ("soc: apple: mailbox: Add ASC/M3 mailbox driver")
    Cc: [email protected]      # 6.8
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Neal Gompa <[email protected]>
    Signed-off-by: Sven Peter <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: qcom: ocmem: fix device leak on lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Sep 26 16:35:10 2025 +0200

    soc: qcom: ocmem: fix device leak on lookup
    
    commit b5c16ea57b030b8e9428ec726e26219dfe05c3d9 upstream.
    
    Make sure to drop the reference taken to the ocmem platform device when
    looking up its driver data.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away so there is no point in keeping the reference.
    
    Also note that commit 0ff027027e05 ("soc: qcom: ocmem: Fix missing
    put_device() call in of_get_ocmem") fixed the leak in a lookup error
    path, but the reference is still leaking on success.
    
    Fixes: 88c1e9404f1d ("soc: qcom: add OCMEM driver")
    Cc: [email protected]      # 5.5: 0ff027027e05
    Cc: Brian Masney <[email protected]>
    Cc: Miaoqian Lin <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Brian Masney <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: qcom: pbs: fix device leak on lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Sep 26 16:35:11 2025 +0200

    soc: qcom: pbs: fix device leak on lookup
    
    commit 94124bf253d24b13e89c45618a168d5a1d8a61e7 upstream.
    
    Make sure to drop the reference taken to the pbs platform device when
    looking up its driver data.
    
    Note that holding a reference to a device does not prevent its driver
    data from going away so there is no point in keeping the reference.
    
    Fixes: 5b2dd77be1d8 ("soc: qcom: add QCOM PBS driver")
    Cc: [email protected]      # 6.9
    Cc: Anjelique Melendez <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soc: samsung: exynos-pmu: fix device leak on regmap lookup [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Fri Nov 21 13:18:52 2025 +0100

    soc: samsung: exynos-pmu: fix device leak on regmap lookup
    
    commit 990eb9a8eb4540ab90c7b34bb07b87ff13881cad upstream.
    
    Make sure to drop the reference taken when looking up the PMU device and
    its regmap.
    
    Note that holding a reference to a device does not prevent its regmap
    from going away so there is no point in keeping the reference.
    
    Fixes: 0b7c6075022c ("soc: samsung: exynos-pmu: Add regmap support for SoCs that protect PMU regs")
    Cc: [email protected]      # 6.9
    Cc: Peter Griffin <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

soundwire: stream: extend sdw_alloc_stream() to take 'type' parameter [+ + +]

Author: Pierre-Louis Bossart <[email protected]>
Date:   Mon Jan 5 10:10:07 2026 -0500

    soundwire: stream: extend sdw_alloc_stream() to take 'type' parameter
    
    [ Upstream commit dc90bbefa792031d89fe2af9ad4a6febd6be96a9 ]
    
    In the existing definition of sdw_stream_runtime, the 'type' member is
    never set and defaults to PCM. To prepare for the BPT/BRA support, we
    need to special-case streams and make use of the 'type'.
    
    No functional change for now, the implicit PCM type is now explicit.
    
    Signed-off-by: Pierre-Louis Bossart <[email protected]>
    Signed-off-by: Bard Liao <[email protected]>
    Reviewed-by: Péter Ujfalusi <[email protected]>
    Reviewed-by: Liam Girdwood <[email protected]>
    Reviewed-by: Ranjani Sridharan <[email protected]>
    Tested-by: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Vinod Koul <[email protected]>
    Stable-dep-of: bcba17279327 ("ASoC: qcom: sdw: fix memory leak for sdw_stream_runtime")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

spi: cadence-quadspi: Fix clock disable on probe failure path [+ + +]

Author: Anurag Dutta <[email protected]>
Date:   Fri Dec 12 12:53:12 2025 +0530

    spi: cadence-quadspi: Fix clock disable on probe failure path
    
    [ Upstream commit 1889dd2081975ce1f6275b06cdebaa8d154847a9 ]
    
    When cqspi_request_mmap_dma() returns -EPROBE_DEFER after runtime PM
    is enabled, the error path calls clk_disable_unprepare() on an already
    disabled clock, causing an imbalance.
    
    Use pm_runtime_get_sync() to increment the usage counter and resume the
    device. This prevents runtime_suspend() from being invoked and causing
    a double clock disable.
    
    Fixes: 140623410536 ("mtd: spi-nor: Add driver for Cadence Quad SPI Flash Controller")
    Signed-off-by: Anurag Dutta <[email protected]>
    Tested-by: Nishanth Menon <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

spi: fsl-cpm: Check length parity before switching to 16 bit mode [+ + +]

Author: Christophe Leroy <[email protected]>
Date:   Thu Nov 20 09:34:49 2025 +0100

    spi: fsl-cpm: Check length parity before switching to 16 bit mode
    
    commit 1417927df8049a0194933861e9b098669a95c762 upstream.
    
    Commit fc96ec826bce ("spi: fsl-cpm: Use 16 bit mode for large transfers
    with even size") failed to make sure that the size is really even
    before switching to 16 bit mode. Until recently the problem went
    unnoticed because kernfs uses a pre-allocated bounce buffer of size
    PAGE_SIZE for reading EEPROM.
    
    But commit 8ad6249c51d0 ("eeprom: at25: convert to spi-mem API")
    introduced an additional dynamically allocated bounce buffer whose size
    is exactly the size of the transfer, leading to a buffer overrun in
    the fsl-cpm driver when that size is odd.
    
    Add the missing length parity verification and remain in 8 bit mode
    when the length is not even.
    
    Fixes: fc96ec826bce ("spi: fsl-cpm: Use 16 bit mode for large transfers with even size")
    Cc: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]/
    Signed-off-by: Christophe Leroy <[email protected]>
    Reviewed-by: Sverdlin Alexander <[email protected]>
    Link: https://patch.msgid.link/3c4d81c3923c93f95ec56702a454744a4bad3cfc.1763627618.git.christophe.leroy@csgroup.eu
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

SUNRPC: svcauth_gss: avoid NULL deref on zero length gss_token in gss_read_proxy_verf [+ + +]

Author: Joshua Rogers <[email protected]>
Date:   Fri Nov 7 10:05:33 2025 -0500

    SUNRPC: svcauth_gss: avoid NULL deref on zero length gss_token in gss_read_proxy_verf
    
    commit d4b69a6186b215d2dc1ebcab965ed88e8d41768d upstream.
    
    A zero length gss_token results in pages == 0 and in_token->pages[0]
    is NULL. The code unconditionally evaluates
    page_address(in_token->pages[0]) for the initial memcpy, which can
    dereference NULL even when the copy length is 0. Guard the first
    memcpy so it only runs when length > 0.
    
    Fixes: 5866efa8cbfb ("SUNRPC: Fix svcauth_gss_proxy_init()")
    Cc: [email protected]
    Signed-off-by: Joshua Rogers <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

svcrdma: bound check rq_pages index in inline path [+ + +]

Author: Joshua Rogers <[email protected]>
Date:   Wed Dec 31 10:25:11 2025 -0500

    svcrdma: bound check rq_pages index in inline path
    
    [ Upstream commit d1bea0ce35b6095544ee82bb54156fc62c067e58 ]
    
    svc_rdma_copy_inline_range indexed rqstp->rq_pages[rc_curpage] without
    verifying rc_curpage stays within the allocated page array. Add guards
    before the first use and after advancing to a new page.
    
    Fixes: d7cc73972661 ("svcrdma: support multiple Read chunks per RPC")
    Cc: [email protected]
    Signed-off-by: Joshua Rogers <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    [ replaced rqstp->rq_maxpages with ARRAY_SIZE(rqstp->rq_pages) ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

svcrdma: return 0 on success from svc_rdma_copy_inline_range [+ + +]

Author: Joshua Rogers <[email protected]>
Date:   Fri Nov 7 10:09:48 2025 -0500

    svcrdma: return 0 on success from svc_rdma_copy_inline_range
    
    commit 94972027ab55b200e031059fd6c7a649f8248020 upstream.
    
    The function comment specifies 0 on success and -EINVAL on invalid
    parameters. Make the tail return 0 after a successful copy loop.
    
    Fixes: d7cc73972661 ("svcrdma: support multiple Read chunks per RPC")
    Cc: [email protected]
    Signed-off-by: Joshua Rogers <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

svcrdma: use rc_pageoff for memcpy byte offset [+ + +]

Author: Joshua Rogers <[email protected]>
Date:   Fri Nov 7 10:09:47 2025 -0500

    svcrdma: use rc_pageoff for memcpy byte offset
    
    commit a8ee9099f30654917aa68f55d707b5627e1dbf77 upstream.
    
    svc_rdma_copy_inline_range added rc_curpage (page index) to the page
    base instead of the byte offset rc_pageoff. Use rc_pageoff so copies
    land within the current page.
    
    Found by ZeroPath (https://zeropath.com)
    
    Fixes: 8e122582680c ("svcrdma: Move svc_rdma_read_info::ri_pageno to struct svc_rdma_recv_ctxt")
    Cc: [email protected]
    Signed-off-by: Joshua Rogers <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

team: fix check for port enabled in team_queue_override_port_prio_changed() [+ + +]

Author: Jiri Pirko <[email protected]>
Date:   Fri Dec 12 11:29:53 2025 +0100

    team: fix check for port enabled in team_queue_override_port_prio_changed()
    
    [ Upstream commit 932ac51d9953eaf77a1252f79b656d4ca86163c6 ]
    
    There has been a syzkaller bug reported recently with the following
    trace:
    
    list_del corruption, ffff888058bea080->prev is LIST_POISON2 (dead000000000122)
    ------------[ cut here ]------------
    kernel BUG at lib/list_debug.c:59!
    Oops: invalid opcode: 0000 [#1] SMP KASAN NOPTI
    CPU: 3 UID: 0 PID: 21246 Comm: syz.0.2928 Not tainted syzkaller #0 PREEMPT(full)
    Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-debian-1.16.3-2~bpo12+1 04/01/2014
    RIP: 0010:__list_del_entry_valid_or_report+0x13e/0x200 lib/list_debug.c:59
    Code: 48 c7 c7 e0 71 f0 8b e8 30 08 ef fc 90 0f 0b 48 89 ef e8 a5 02 55 fd 48 89 ea 48 89 de 48 c7 c7 40 72 f0 8b e8 13 08 ef fc 90 <0f> 0b 48 89 ef e8 88 02 55 fd 48 89 ea 48 b8 00 00 00 00 00 fc ff
    RSP: 0018:ffffc9000d49f370 EFLAGS: 00010286
    RAX: 000000000000004e RBX: ffff888058bea080 RCX: ffffc9002817d000
    RDX: 0000000000000000 RSI: ffffffff819becc6 RDI: 0000000000000005
    RBP: dead000000000122 R08: 0000000000000005 R09: 0000000000000000
    R10: 0000000080000000 R11: 0000000000000001 R12: ffff888039e9c230
    R13: ffff888058bea088 R14: ffff888058bea080 R15: ffff888055461480
    FS:  00007fbbcfe6f6c0(0000) GS:ffff8880d6d0a000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 000000110c3afcb0 CR3: 00000000382c7000 CR4: 0000000000352ef0
    Call Trace:
     <TASK>
     __list_del_entry_valid include/linux/list.h:132 [inline]
     __list_del_entry include/linux/list.h:223 [inline]
     list_del_rcu include/linux/rculist.h:178 [inline]
     __team_queue_override_port_del drivers/net/team/team_core.c:826 [inline]
     __team_queue_override_port_del drivers/net/team/team_core.c:821 [inline]
     team_queue_override_port_prio_changed drivers/net/team/team_core.c:883 [inline]
     team_priority_option_set+0x171/0x2f0 drivers/net/team/team_core.c:1534
     team_option_set drivers/net/team/team_core.c:376 [inline]
     team_nl_options_set_doit+0x8ae/0xe60 drivers/net/team/team_core.c:2653
     genl_family_rcv_msg_doit+0x209/0x2f0 net/netlink/genetlink.c:1115
     genl_family_rcv_msg net/netlink/genetlink.c:1195 [inline]
     genl_rcv_msg+0x55c/0x800 net/netlink/genetlink.c:1210
     netlink_rcv_skb+0x158/0x420 net/netlink/af_netlink.c:2552
     genl_rcv+0x28/0x40 net/netlink/genetlink.c:1219
     netlink_unicast_kernel net/netlink/af_netlink.c:1320 [inline]
     netlink_unicast+0x5aa/0x870 net/netlink/af_netlink.c:1346
     netlink_sendmsg+0x8c8/0xdd0 net/netlink/af_netlink.c:1896
     sock_sendmsg_nosec net/socket.c:727 [inline]
     __sock_sendmsg net/socket.c:742 [inline]
     ____sys_sendmsg+0xa98/0xc70 net/socket.c:2630
     ___sys_sendmsg+0x134/0x1d0 net/socket.c:2684
     __sys_sendmsg+0x16d/0x220 net/socket.c:2716
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xcd/0xfa0 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    
    The problem is in this flow:
    1) Port is enabled, queue_id != 0, in qom_list
    2) Port gets disabled
            -> team_port_disable()
            -> team_queue_override_port_del()
            -> del (removed from list)
    3) Port is disabled, queue_id != 0, not in any list
    4) Priority changes
            -> team_queue_override_port_prio_changed()
            -> checks: port disabled && queue_id != 0
            -> calls del - hits the BUG as it is removed already
    
    To fix this, change the check in team_queue_override_port_prio_changed()
    so it returns early if port is not enabled.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=422806e5f4cce722a71f
    Fixes: 6c31ff366c11 ("team: remove synchronize_rcu() called during queue override change")
    Signed-off-by: Jiri Pirko <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ti-sysc: allow OMAP2 and OMAP4 timers to be reserved on AM33xx [+ + +]

Author: Matthias Schiffer <[email protected]>
Date:   Mon Aug 25 15:11:13 2025 +0200

    ti-sysc: allow OMAP2 and OMAP4 timers to be reserved on AM33xx
    
    [ Upstream commit 3f61783920504b2cf99330b372d82914bb004d8e ]
    
    am33xx.dtsi has the same clock setup as am35xx.dtsi, setting
    ti,no-reset-on-init and ti,no-idle on timer1_target and timer2_target,
    so AM33 needs the same workaround as AM35 to avoid ti-sysc probe
    failing on certain target modules.
    
    Signed-off-by: Matthias Schiffer <[email protected]>
    Signed-off-by: Alexander Stein <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Kevin Hilman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

tools/mm/page_owner_sort: fix timestamp comparison for stable sorting [+ + +]

Author: Kaushlendra Kumar <[email protected]>
Date:   Tue Dec 9 10:15:52 2025 +0530

    tools/mm/page_owner_sort: fix timestamp comparison for stable sorting
    
    commit 7013803444dd3bbbe28fd3360c084cec3057c554 upstream.
    
    The ternary operator in compare_ts() returns 1 when timestamps are equal,
    causing unstable sorting behavior. Replace with explicit three-way
    comparison that returns 0 for equal timestamps, ensuring stable qsort
    ordering and consistent output.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: 8f9c447e2e2b ("tools/vm/page_owner_sort.c: support sorting pid and time")
    Signed-off-by: Kaushlendra Kumar <[email protected]>
    Cc: Chongxi Zhao <[email protected]>
    Cc: <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tools/testing/nvdimm: Use per-DIMM device handle [+ + +]

Author: Alison Schofield <[email protected]>
Date:   Fri Oct 31 16:42:20 2025 -0700

    tools/testing/nvdimm: Use per-DIMM device handle
    
    commit f59b701b4674f7955170b54c4167c5590f4714eb upstream.
    
    KASAN reports a global-out-of-bounds access when running these nfit
    tests: clear.sh, pmem-errors.sh, pfn-meta-errors.sh, btt-errors.sh,
    daxdev-errors.sh, and inject-error.sh.
    
    [] BUG: KASAN: global-out-of-bounds in nfit_test_ctl+0x769f/0x7840 [nfit_test]
    [] Read of size 4 at addr ffffffffc03ea01c by task ndctl/1215
    [] The buggy address belongs to the variable:
    [] handle+0x1c/0x1df4 [nfit_test]
    
    nfit_test_search_spa() uses handle[nvdimm->id] to retrieve a device
    handle and triggers a KASAN error when it reads past the end of the
    handle array. It should not be indexing the handle array at all.
    
    The correct device handle is stored in per-DIMM test data. Each DIMM
    has a struct nfit_mem that embeds a struct acpi_nfit_memdev that
    describes the NFIT device handle. Use that device handle here.
    
    Fixes: 10246dc84dfc ("acpi nfit: nfit_test supports translate SPA")
    Cc: [email protected]
    Signed-off-by: Alison Schofield <[email protected]>
    Reviewed-by: Dave Jiang <[email protected]>> ---
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Ira Weiny <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tpm2-sessions: Fix tpm2_read_public range checks [+ + +]

Author: Jarkko Sakkinen <[email protected]>
Date:   Thu Jan 1 21:45:19 2026 -0500

    tpm2-sessions: Fix tpm2_read_public range checks
    
    [ Upstream commit bda1cbf73c6e241267c286427f2ed52b5735d872 ]
    
    tpm2_read_public() has some rudimentary range checks but the function does
    not ensure that the response buffer has enough bytes for the full TPMT_HA
    payload.
    
    Re-implement the function with necessary checks and validation, and return
    name and name size for all handle types back to the caller.
    
    Cc: [email protected] # v6.10+
    Fixes: d0a25bb961e6 ("tpm: Add HMAC session name/handle append")
    Signed-off-by: Jarkko Sakkinen <[email protected]>
    Reviewed-by: Jonathan McDowell <[email protected]>
    [ different semantics around u8 name_size() ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tpm: Cap the number of PCR banks [+ + +]

Author: Jarkko Sakkinen <[email protected]>
Date:   Tue Sep 30 15:58:02 2025 +0300

    tpm: Cap the number of PCR banks
    
    commit faf07e611dfa464b201223a7253e9dc5ee0f3c9e upstream.
    
    tpm2_get_pcr_allocation() does not cap any upper limit for the number of
    banks. Cap the limit to eight banks so that out of bounds values coming
    from external I/O cause on only limited harm.
    
    Cc: [email protected] # v5.10+
    Fixes: bcfff8384f6c ("tpm: dynamically allocate the allocated_banks array")
    Tested-by: Lai Yi <[email protected]>
    Reviewed-by: Jonathan McDowell <[email protected]>
    Reviewed-by: Roberto Sassu <[email protected]>
    Signed-off-by: Jarkko Sakkinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Do not register unsupported perf events [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Tue Dec 16 18:24:40 2025 -0500

    tracing: Do not register unsupported perf events
    
    commit ef7f38df890f5dcd2ae62f8dbde191d72f3bebae upstream.
    
    Synthetic events currently do not have a function to register perf events.
    This leads to calling the tracepoint register functions with a NULL
    function pointer which triggers:
    
     ------------[ cut here ]------------
     WARNING: kernel/tracepoint.c:175 at tracepoint_add_func+0x357/0x370, CPU#2: perf/2272
     Modules linked in: kvm_intel kvm irqbypass
     CPU: 2 UID: 0 PID: 2272 Comm: perf Not tainted 6.18.0-ftest-11964-ge022764176fc-dirty #323 PREEMPTLAZY
     Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.17.0-debian-1.17.0-1 04/01/2014
     RIP: 0010:tracepoint_add_func+0x357/0x370
     Code: 28 9c e8 4c 0b f5 ff eb 0f 4c 89 f7 48 c7 c6 80 4d 28 9c e8 ab 89 f4 ff 31 c0 5b 41 5c 41 5d 41 5e 41 5f 5d c3 cc cc cc cc cc <0f> 0b 49 c7 c6 ea ff ff ff e9 ee fe ff ff 0f 0b e9 f9 fe ff ff 0f
     RSP: 0018:ffffabc0c44d3c40 EFLAGS: 00010246
     RAX: 0000000000000001 RBX: ffff9380aa9e4060 RCX: 0000000000000000
     RDX: 000000000000000a RSI: ffffffff9e1d4a98 RDI: ffff937fcf5fd6c8
     RBP: 0000000000000001 R08: 0000000000000007 R09: ffff937fcf5fc780
     R10: 0000000000000003 R11: ffffffff9c193910 R12: 000000000000000a
     R13: ffffffff9e1e5888 R14: 0000000000000000 R15: ffffabc0c44d3c78
     FS:  00007f6202f5f340(0000) GS:ffff93819f00f000(0000) knlGS:0000000000000000
     CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
     CR2: 000055d3162281a8 CR3: 0000000106a56003 CR4: 0000000000172ef0
     Call Trace:
      <TASK>
      tracepoint_probe_register+0x5d/0x90
      synth_event_reg+0x3c/0x60
      perf_trace_event_init+0x204/0x340
      perf_trace_init+0x85/0xd0
      perf_tp_event_init+0x2e/0x50
      perf_try_init_event+0x6f/0x230
      ? perf_event_alloc+0x4bb/0xdc0
      perf_event_alloc+0x65a/0xdc0
      __se_sys_perf_event_open+0x290/0x9f0
      do_syscall_64+0x93/0x7b0
      ? entry_SYSCALL_64_after_hwframe+0x76/0x7e
      ? trace_hardirqs_off+0x53/0xc0
      entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    Instead, have the code return -ENODEV, which doesn't warn and has perf
    error out with:
    
     # perf record -e synthetic:futex_wait
    Error:
    The sys_perf_event_open() syscall returned with 19 (No such device) for event (synthetic:futex_wait).
    "dmesg | grep -i perf" may provide additional information.
    
    Ideally perf should support synthetic events, but for now just fix the
    warning. The support can come later.
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Arnaldo Carvalho de Melo <[email protected]>
    Cc: Jiri Olsa <[email protected]>
    Cc: Namhyung Kim <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: 4b147936fa509 ("tracing: Add support for 'synthetic' events")
    Reported-by: Ian Rogers <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix fixed array of synthetic event [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Thu Dec 4 15:19:35 2025 -0500

    tracing: Fix fixed array of synthetic event
    
    commit 47ef834209e5981f443240d8a8b45bf680df22aa upstream.
    
    The commit 4d38328eb442d ("tracing: Fix synth event printk format for str
    fields") replaced "%.*s" with "%s" but missed removing the number size of
    the dynamic and static strings. The commit e1a453a57bc7 ("tracing: Do not
    add length to print format in synthetic events") fixed the dynamic part
    but did not fix the static part. That is, with the commands:
    
      # echo 's:wake_lat char[] wakee; u64 delta;' >> /sys/kernel/tracing/dynamic_events
      # echo 'hist:keys=pid:ts=common_timestamp.usecs if !(common_flags & 0x18)' > /sys/kernel/tracing/events/sched/sched_waking/trigger
      # echo 'hist:keys=next_pid:delta=common_timestamp.usecs-$ts:onmatch(sched.sched_waking).trace(wake_lat,next_comm,$delta)' > /sys/kernel/tracing/events/sched/sched_switch/trigger
    
    That caused the output of:
    
              <idle>-0       [001] d..5.   193.428167: wake_lat: wakee=(efault)sshd-sessiondelta=155
        sshd-session-879     [001] d..5.   193.811080: wake_lat: wakee=(efault)kworker/u34:5delta=58
              <idle>-0       [002] d..5.   193.811198: wake_lat: wakee=(efault)bashdelta=91
    
    The commit e1a453a57bc7 fixed the part where the synthetic event had
    "char[] wakee". But if one were to replace that with a static size string:
    
      # echo 's:wake_lat char[16] wakee; u64 delta;' >> /sys/kernel/tracing/dynamic_events
    
    Where "wakee" is defined as "char[16]" and not "char[]" making it a static
    size, the code triggered the "(efaul)" again.
    
    Remove the added STR_VAR_LEN_MAX size as the string is still going to be
    nul terminated.
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Douglas Raillard <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Fixes: e1a453a57bc7 ("tracing: Do not add length to print format in synthetic events")
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tty: fix tty_port_tty_*hangup() kernel-doc [+ + +]

Author: Jiri Slaby (SUSE) <[email protected]>
Date:   Tue Jun 24 10:06:41 2025 +0200

    tty: fix tty_port_tty_*hangup() kernel-doc
    
    commit 6241b49540a65a6d5274fa938fd3eb4cbfe2e076 upstream.
    
    The commit below added a new helper, but omitted to move (and add) the
    corressponding kernel-doc. Do it now.
    
    Signed-off-by: "Jiri Slaby (SUSE)" <[email protected]>
    Fixes: 2b5eac0f8c6e ("tty: introduce and use tty_port_tty_vhangup() helper")
    Link: https://lore.kernel.org/all/[email protected]/
    Reported-by: Ilpo Järvinen <[email protected]>
    Cc: Jonathan Corbet <[email protected]>
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tty: introduce and use tty_port_tty_vhangup() helper [+ + +]

Author: Jiri Slaby (SUSE) <[email protected]>
Date:   Tue Dec 30 13:32:36 2025 -0500

    tty: introduce and use tty_port_tty_vhangup() helper
    
    [ Upstream commit 2b5eac0f8c6e79bc152c8804f9f88d16717013ab ]
    
    This code (tty_get -> vhangup -> tty_put) is repeated on few places.
    Introduce a helper similar to tty_port_tty_hangup() (asynchronous) to
    handle even vhangup (synchronous).
    
    And use it on those places.
    
    In fact, reuse the tty_port_tty_hangup()'s code and call tty_vhangup()
    depending on a new bool parameter.
    
    Signed-off-by: "Jiri Slaby (SUSE)" <[email protected]>
    Cc: Karsten Keil <[email protected]>
    Cc: David Lin <[email protected]>
    Cc: Johan Hovold <[email protected]>
    Cc: Alex Elder <[email protected]>
    Cc: Oliver Neukum <[email protected]>
    Cc: Marcel Holtmann <[email protected]>
    Cc: Johan Hedberg <[email protected]>
    Cc: Luiz Augusto von Dentz <[email protected]>
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Stable-dep-of: 74098cc06e75 ("xhci: dbgtty: fix device unregister: fixup")
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: dwc3: keep susphy enabled during exit to avoid controller faults [+ + +]

Author: Udipto Goswami <[email protected]>
Date:   Wed Nov 26 11:12:21 2025 +0530

    usb: dwc3: keep susphy enabled during exit to avoid controller faults
    
    commit e1003aa7ec9eccdde4c926bd64ef42816ad55f25 upstream.
    
    On some platforms, switching USB roles from host to device can trigger
    controller faults due to premature PHY power-down. This occurs when the
    PHY is disabled too early during teardown, causing synchronization
    issues between the PHY and controller.
    
    Keep susphy enabled during dwc3_host_exit() and dwc3_gadget_exit()
    ensures the PHY remains in a low-power state capable of handling
    required commands during role switch.
    
    Cc: stable <[email protected]>
    Fixes: 6d735722063a ("usb: dwc3: core: Prevent phy suspend during init")
    Suggested-by: Thinh Nguyen <[email protected]>
    Signed-off-by: Udipto Goswami <[email protected]>
    Acked-by: Thinh Nguyen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: dwc3: of-simple: fix clock resource leak in dwc3_of_simple_probe [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Thu Dec 11 10:49:36 2025 +0400

    usb: dwc3: of-simple: fix clock resource leak in dwc3_of_simple_probe
    
    commit 3b4961313d31e200c9e974bb1536cdea217f78b5 upstream.
    
    When clk_bulk_prepare_enable() fails, the error path jumps to
    err_resetc_assert, skipping clk_bulk_put_all() and leaking the
    clock references acquired by clk_bulk_get_all().
    
    Add err_clk_put_all label to properly release clock resources
    in all error paths.
    
    Found via static analysis and code review.
    
    Fixes: c0c61471ef86 ("usb: dwc3: of-simple: Convert to bulk clk API")
    Cc: stable <[email protected]>
    Signed-off-by: Miaoqian Lin <[email protected]>
    Acked-by: Thinh Nguyen <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: gadget: lpc32xx_udc: fix clock imbalance in error path [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Dec 18 16:35:15 2025 +0100

    usb: gadget: lpc32xx_udc: fix clock imbalance in error path
    
    commit 782be79e4551550d7a82b1957fc0f7347e6d461f upstream.
    
    A recent change fixing a device reference leak introduced a clock
    imbalance by reusing an error path so that the clock may be disabled
    before having been enabled.
    
    Note that the clock framework allows for passing in NULL clocks so there
    is no risk for a NULL pointer dereference.
    
    Also drop the bogus I2C client NULL check added by the offending commit
    as the pointer has already been verified to be non-NULL.
    
    Fixes: c84117912bdd ("USB: lpc32xx_udc: Fix error handling in probe")
    Cc: [email protected]
    Cc: Ma Ke <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Vladimir Zapolskiy <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

USB: lpc32xx_udc: Fix error handling in probe [+ + +]

Author: Ma Ke <[email protected]>
Date:   Mon Dec 15 10:09:31 2025 +0800

    USB: lpc32xx_udc: Fix error handling in probe
    
    commit c84117912bddd9e5d87e68daf182410c98181407 upstream.
    
    lpc32xx_udc_probe() acquires an i2c_client reference through
    isp1301_get_client() but fails to release it in both error handling
    paths and the normal removal path. This could result in a reference
    count leak for the I2C device, preventing proper cleanup and potentially
    leading to resource exhaustion. Add put_device() to release the
    reference in the probe failure path and in the remove function.
    
    Calling path: isp1301_get_client() -> of_find_i2c_device_by_node() ->
    i2c_find_device_by_fwnode(). As comments of i2c_find_device_by_fwnode()
    says, 'The user must call put_device(&client->dev) once done with the
    i2c client.'
    
    Found by code review.
    
    Cc: stable <[email protected]>
    Fixes: 24a28e428351 ("USB: gadget driver for LPC32xx")
    Signed-off-by: Ma Ke <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: ohci-nxp: fix device leak on probe failure [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Dec 18 16:35:17 2025 +0100

    usb: ohci-nxp: fix device leak on probe failure
    
    commit b4c61e542faf8c9131d69ecfc3ad6de96d1b2ab8 upstream.
    
    Make sure to drop the reference taken when looking up the PHY I2C device
    during probe on probe failure (e.g. probe deferral) and on driver
    unbind.
    
    Fixes: 73108aa90cbf ("USB: ohci-nxp: Use isp1301 driver")
    Cc: [email protected]      # 3.5
    Reported-by: Ma Ke <[email protected]>
    Link: https://lore.kernel.org/lkml/[email protected]/
    Signed-off-by: Johan Hovold <[email protected]>
    Acked-by: Alan Stern <[email protected]>
    Reviewed-by: Vladimir Zapolskiy <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: phy: fsl-usb: Fix use-after-free in delayed work during device removal [+ + +]

Author: Duoming Zhou <[email protected]>
Date:   Fri Dec 5 11:48:31 2025 +0800

    usb: phy: fsl-usb: Fix use-after-free in delayed work during device removal
    
    commit 41ca62e3e21e48c2903b3b45e232cf4f2ff7434f upstream.
    
    The delayed work item otg_event is initialized in fsl_otg_conf() and
    scheduled under two conditions:
    1. When a host controller binds to the OTG controller.
    2. When the USB ID pin state changes (cable insertion/removal).
    
    A race condition occurs when the device is removed via fsl_otg_remove():
    the fsl_otg instance may be freed while the delayed work is still pending
    or executing. This leads to use-after-free when the work function
    fsl_otg_event() accesses the already freed memory.
    
    The problematic scenario:
    
    (detach thread)            | (delayed work)
    fsl_otg_remove()           |
      kfree(fsl_otg_dev) //FREE| fsl_otg_event()
                               |   og = container_of(...) //USE
                               |   og-> //USE
    
    Fix this by calling disable_delayed_work_sync() in fsl_otg_remove()
    before deallocating the fsl_otg structure. This ensures the delayed work
    is properly canceled and completes execution prior to memory deallocation.
    
    This bug was identified through static analysis.
    
    Fixes: 0807c500a1a6 ("USB: add Freescale USB OTG Transceiver driver")
    Cc: stable <[email protected]>
    Signed-off-by: Duoming Zhou <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: phy: isp1301: fix non-OF device reference imbalance [+ + +]

Author: Johan Hovold <[email protected]>
Date:   Thu Dec 18 16:35:16 2025 +0100

    usb: phy: isp1301: fix non-OF device reference imbalance
    
    commit b4b64fda4d30a83a7f00e92a0c8a1d47699609f3 upstream.
    
    A recent change fixing a device reference leak in a UDC driver
    introduced a potential use-after-free in the non-OF case as the
    isp1301_get_client() helper only increases the reference count for the
    returned I2C device in the OF case.
    
    Increment the reference count also for non-OF so that the caller can
    decrement it unconditionally.
    
    Note that this is inherently racy just as using the returned I2C device
    is since nothing is preventing the PHY driver from being unbound while
    in use.
    
    Fixes: c84117912bdd ("USB: lpc32xx_udc: Fix error handling in probe")
    Cc: [email protected]
    Cc: Ma Ke <[email protected]>
    Signed-off-by: Johan Hovold <[email protected]>
    Reviewed-by: Vladimir Zapolskiy <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: renesas_usbhs: Fix a resource leak in usbhs_pipe_malloc() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Thu Dec 4 21:21:29 2025 +0800

    usb: renesas_usbhs: Fix a resource leak in usbhs_pipe_malloc()
    
    commit 36cc7e09df9e43db21b46519b740145410dd9f4a upstream.
    
    usbhsp_get_pipe() set pipe's flags to IS_USED. In error paths,
    usbhsp_put_pipe() is required to clear pipe's flags to prevent
    pipe exhaustion.
    
    Fixes: f1407d5c6624 ("usb: renesas_usbhs: Add Renesas USBHS common code")
    Cc: stable <[email protected]>
    Signed-off-by: Haoxiang Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: typec: altmodes/displayport: Drop the device reference in dp_altmode_probe() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Sat Dec 6 15:04:45 2025 +0800

    usb: typec: altmodes/displayport: Drop the device reference in dp_altmode_probe()
    
    commit 128bb7fab342546352603bde8b49ff54e3af0529 upstream.
    
    In error paths, call typec_altmode_put_plug() to drop the device reference
    obtained by typec_altmode_get_plug().
    
    Fixes: 71ba4fe56656 ("usb: typec: altmodes/displayport: add SOP' support")
    Cc: stable <[email protected]>
    Signed-off-by: Haoxiang Li <[email protected]>
    Reviewed-by: Heikki Krogerus <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: typec: ucsi: Handle incorrect num_connectors capability [+ + +]

Author: Mark Pearson <[email protected]>
Date:   Thu Aug 21 14:53:07 2025 -0400

    usb: typec: ucsi: Handle incorrect num_connectors capability
    
    [ Upstream commit 30cd2cb1abf4c4acdb1ddb468c946f68939819fb ]
    
    The UCSI spec states that the num_connectors field is 7 bits, and the
    8th bit is reserved and should be set to zero.
    Some buggy FW has been known to set this bit, and it can lead to a
    system not booting.
    Flag that the FW is not behaving correctly, and auto-fix the value
    so that the system boots correctly.
    
    Found on Lenovo P1 G8 during Linux enablement program. The FW will
    be fixed, but seemed worth addressing in case it hit platforms that
    aren't officially Linux supported.
    
    Signed-off-by: Mark Pearson <[email protected]>
    Reviewed-by: Heikki Krogerus <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usb: usb-storage: Maintain minimal modifications to the bcdDevice range. [+ + +]

Author: Chen Changcheng <[email protected]>
Date:   Thu Dec 18 09:23:18 2025 +0800

    usb: usb-storage: Maintain minimal modifications to the bcdDevice range.
    
    commit 0831269b5f71594882accfceb02638124f88955d upstream.
    
    We cannot determine which models require the NO_ATA_1X and
    IGNORE_RESIDUE quirks aside from the EL-R12 optical drive device.
    
    Fixes: 955a48a5353f ("usb: usb-storage: No additional quirks need to be added to the EL-R12 optical drive.")
    Signed-off-by: Chen Changcheng <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

usb: usb-storage: No additional quirks need to be added to the EL-R12 optical drive. [+ + +]

Author: Chen Changcheng <[email protected]>
Date:   Fri Nov 21 14:40:20 2025 +0800

    usb: usb-storage: No additional quirks need to be added to the EL-R12 optical drive.
    
    [ Upstream commit 955a48a5353f4fe009704a9a4272a3adf627cd35 ]
    
    The optical drive of EL-R12 has the same vid and pid as INIC-3069,
    as follows:
    T:  Bus=02 Lev=02 Prnt=02 Port=01 Cnt=01 Dev#=  3 Spd=5000 MxCh= 0
    D:  Ver= 3.00 Cls=00(>ifc ) Sub=00 Prot=00 MxPS= 9 #Cfgs=  1
    P:  Vendor=13fd ProdID=3940 Rev= 3.10
    S:  Manufacturer=HL-DT-ST
    S:  Product= DVD+-RW GT80N
    S:  SerialNumber=423349524E4E38303338323439202020
    C:* #Ifs= 1 Cfg#= 1 Atr=80 MxPwr=144mA
    I:* If#= 0 Alt= 0 #EPs= 2 Cls=08(stor.) Sub=02 Prot=50 Driver=usb-storage
    E:  Ad=83(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms
    E:  Ad=0a(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms
    
    This will result in the optical drive device also adding
    the quirks of US_FL_NO_ATA_1X. When performing an erase operation,
    it will fail, and the reason for the failure is as follows:
    [  388.967742] sr 5:0:0:0: [sr0] tag#0 Send: scmd 0x00000000d20c33a7
    [  388.967742] sr 5:0:0:0: [sr0] tag#0 CDB: ATA command pass through(12)/Blank a1 11 00 00 00 00 00 00 00 00 00 00
    [  388.967773] sr 5:0:0:0: [sr0] tag#0 Done: SUCCESS Result: hostbyte=DID_TARGET_FAILURE driverbyte=DRIVER_OK cmd_age=0s
    [  388.967773] sr 5:0:0:0: [sr0] tag#0 CDB: ATA command pass through(12)/Blank a1 11 00 00 00 00 00 00 00 00 00 00
    [  388.967803] sr 5:0:0:0: [sr0] tag#0 Sense Key : Illegal Request [current]
    [  388.967803] sr 5:0:0:0: [sr0] tag#0 Add. Sense: Invalid field in cdb
    [  388.967803] sr 5:0:0:0: [sr0] tag#0 scsi host busy 1 failed 0
    [  388.967803] sr 5:0:0:0: Notifying upper driver of completion (result 8100002)
    [  388.967834] sr 5:0:0:0: [sr0] tag#0 0 sectors total, 0 bytes done.
    
    For the EL-R12 standard optical drive, all operational commands
    and usage scenarios were tested without adding the IGNORE_RESIDUE quirks,
    and no issues were encountered. It can be reasonably concluded
    that removing the IGNORE_RESIDUE quirks has no impact.
    
    Signed-off-by: Chen Changcheng <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usb: xhci: limit run_graceperiod for only usb 3.0 devices [+ + +]

Author: Hongyu Xie <[email protected]>
Date:   Wed Nov 19 16:23:55 2025 +0200

    usb: xhci: limit run_graceperiod for only usb 3.0 devices
    
    [ Upstream commit 8d34983720155b8f05de765f0183d9b0e1345cc0 ]
    
    run_graceperiod blocks usb 2.0 devices from auto suspending after
    xhci_start for 500ms.
    
    Log shows:
    [   13.387170] xhci_hub_control:1271: xhci-hcd PNP0D10:03: Get port status 7-1 read: 0x2a0, return 0x100
    [   13.387177] hub_event:5779: hub 7-0:1.0: state 7 ports 1 chg 0000 evt 0000
    [   13.387182] hub_suspend:3903: hub 7-0:1.0: hub_suspend
    [   13.387188] hcd_bus_suspend:2250: usb usb7: bus auto-suspend, wakeup 1
    [   13.387191] hcd_bus_suspend:2279: usb usb7: suspend raced with wakeup event
    [   13.387193] hcd_bus_resume:2303: usb usb7: usb auto-resume
    [   13.387296] hub_event:5779: hub 3-0:1.0: state 7 ports 1 chg 0000 evt 0000
    [   13.393343] handle_port_status:2034: xhci-hcd PNP0D10:02: handle_port_status: starting usb5 port polling.
    [   13.393353] xhci_hub_control:1271: xhci-hcd PNP0D10:02: Get port status 5-1 read: 0x206e1, return 0x10101
    [   13.400047] hub_suspend:3903: hub 3-0:1.0: hub_suspend
    [   13.403077] hub_resume:3948: hub 7-0:1.0: hub_resume
    [   13.403080] xhci_hub_control:1271: xhci-hcd PNP0D10:03: Get port status 7-1 read: 0x2a0, return 0x100
    [   13.403085] hub_event:5779: hub 7-0:1.0: state 7 ports 1 chg 0000 evt 0000
    [   13.403087] hub_suspend:3903: hub 7-0:1.0: hub_suspend
    [   13.403090] hcd_bus_suspend:2250: usb usb7: bus auto-suspend, wakeup 1
    [   13.403093] hcd_bus_suspend:2279: usb usb7: suspend raced with wakeup event
    [   13.403095] hcd_bus_resume:2303: usb usb7: usb auto-resume
    [   13.405002] handle_port_status:1913: xhci-hcd PNP0D10:04: Port change event, 9-1, id 1, portsc: 0x6e1
    [   13.405016] hub_activate:1169: usb usb5-port1: status 0101 change 0001
    [   13.405026] xhci_clear_port_change_bit:658: xhci-hcd PNP0D10:02: clear port1 connect change, portsc: 0x6e1
    [   13.413275] hcd_bus_suspend:2250: usb usb3: bus auto-suspend, wakeup 1
    [   13.419081] hub_resume:3948: hub 7-0:1.0: hub_resume
    [   13.419086] xhci_hub_control:1271: xhci-hcd PNP0D10:03: Get port status 7-1 read: 0x2a0, return 0x100
    [   13.419095] hub_event:5779: hub 7-0:1.0: state 7 ports 1 chg 0000 evt 0000
    [   13.419100] hub_suspend:3903: hub 7-0:1.0: hub_suspend
    [   13.419106] hcd_bus_suspend:2250: usb usb7: bus auto-suspend, wakeup 1
    [   13.419110] hcd_bus_suspend:2279: usb usb7: suspend raced with wakeup event
    [   13.419112] hcd_bus_resume:2303: usb usb7: usb auto-resume
    [   13.420455] handle_port_status:2034: xhci-hcd PNP0D10:04: handle_port_status: starting usb9 port polling.
    [   13.420493] handle_port_status:1913: xhci-hcd PNP0D10:05: Port change event, 10-1, id 1, portsc: 0x6e1
    [   13.425332] hcd_bus_suspend:2279: usb usb3: suspend raced with wakeup event
    [   13.431931] handle_port_status:2034: xhci-hcd PNP0D10:05: handle_port_status: starting usb10 port polling.
    [   13.435080] hub_resume:3948: hub 7-0:1.0: hub_resume
    [   13.435084] xhci_hub_control:1271: xhci-hcd PNP0D10:03: Get port status 7-1 read: 0x2a0, return 0x100
    [   13.435092] hub_event:5779: hub 7-0:1.0: state 7 ports 1 chg 0000 evt 0000
    [   13.435096] hub_suspend:3903: hub 7-0:1.0: hub_suspend
    [   13.435102] hcd_bus_suspend:2250: usb usb7: bus auto-suspend, wakeup 1
    [   13.435106] hcd_bus_suspend:2279: usb usb7: suspend raced with wakeup event
    
    usb7 and other usb 2.0 root hub were rapidly toggling between suspend
    and resume states. More, "suspend raced with wakeup event" confuses people.
    
    So, limit run_graceperiod for only usb 3.0 devices
    
    Signed-off-by: Hongyu Xie <[email protected]>
    Signed-off-by: Mathias Nyman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usbip: Fix locking bug in RT-enabled kernels [+ + +]

Author: Lizhi Xu <[email protected]>
Date:   Tue Sep 16 09:41:43 2025 +0800

    usbip: Fix locking bug in RT-enabled kernels
    
    [ Upstream commit 09bf21bf5249880f62fe759b53b14b4b52900c6c ]
    
    Interrupts are disabled before entering usb_hcd_giveback_urb().
    A spinlock_t becomes a sleeping lock on PREEMPT_RT, so it cannot be
    acquired with disabled interrupts.
    
    Save the interrupt status and restore it after usb_hcd_giveback_urb().
    
    syz reported:
    BUG: sleeping function called from invalid context at kernel/locking/spinlock_rt.c:48
    Call Trace:
     dump_stack_lvl+0x189/0x250 lib/dump_stack.c:120
     rt_spin_lock+0xc7/0x2c0 kernel/locking/spinlock_rt.c:57
     spin_lock include/linux/spinlock_rt.h:44 [inline]
     mon_bus_complete drivers/usb/mon/mon_main.c:134 [inline]
     mon_complete+0x5c/0x200 drivers/usb/mon/mon_main.c:147
     usbmon_urb_complete include/linux/usb/hcd.h:738 [inline]
     __usb_hcd_giveback_urb+0x254/0x5e0 drivers/usb/core/hcd.c:1647
     vhci_urb_enqueue+0xb4f/0xe70 drivers/usb/usbip/vhci_hcd.c:818
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=205ef33a3b636b4181fb
    Signed-off-by: Lizhi Xu <[email protected]>
    Acked-by: Shuah Khan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usbnet: Fix using smp_processor_id() in preemptible code warnings [+ + +]

Author: Zqiang <[email protected]>
Date:   Fri Jan 2 12:37:27 2026 -0800

    usbnet: Fix using smp_processor_id() in preemptible code warnings
    
    [ Upstream commit 327cd4b68b4398b6c24f10eb2b2533ffbfc10185 ]
    
    Syzbot reported the following warning:
    
    BUG: using smp_processor_id() in preemptible [00000000] code: dhcpcd/2879
    caller is usbnet_skb_return+0x74/0x490 drivers/net/usb/usbnet.c:331
    CPU: 1 UID: 0 PID: 2879 Comm: dhcpcd Not tainted 6.15.0-rc4-syzkaller-00098-g615dca38c2ea #0 PREEMPT(voluntary)
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:94 [inline]
     dump_stack_lvl+0x16c/0x1f0 lib/dump_stack.c:120
     check_preemption_disabled+0xd0/0xe0 lib/smp_processor_id.c:49
     usbnet_skb_return+0x74/0x490 drivers/net/usb/usbnet.c:331
     usbnet_resume_rx+0x4b/0x170 drivers/net/usb/usbnet.c:708
     usbnet_change_mtu+0x1be/0x220 drivers/net/usb/usbnet.c:417
     __dev_set_mtu net/core/dev.c:9443 [inline]
     netif_set_mtu_ext+0x369/0x5c0 net/core/dev.c:9496
     netif_set_mtu+0xb0/0x160 net/core/dev.c:9520
     dev_set_mtu+0xae/0x170 net/core/dev_api.c:247
     dev_ifsioc+0xa31/0x18d0 net/core/dev_ioctl.c:572
     dev_ioctl+0x223/0x10e0 net/core/dev_ioctl.c:821
     sock_do_ioctl+0x19d/0x280 net/socket.c:1204
     sock_ioctl+0x42f/0x6a0 net/socket.c:1311
     vfs_ioctl fs/ioctl.c:51 [inline]
     __do_sys_ioctl fs/ioctl.c:906 [inline]
     __se_sys_ioctl fs/ioctl.c:892 [inline]
     __x64_sys_ioctl+0x190/0x200 fs/ioctl.c:892
     do_syscall_x64 arch/x86/entry/syscall_64.c:63 [inline]
     do_syscall_64+0xcd/0x260 arch/x86/entry/syscall_64.c:94
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
    
    For historical and portability reasons, the netif_rx() is usually
    run in the softirq or interrupt context, this commit therefore add
    local_bh_disable/enable() protection in the usbnet_resume_rx().
    
    Fixes: 43daa96b166c ("usbnet: Stop RX Q on MTU change")
    Link: https://syzkaller.appspot.com/bug?id=81f55dfa587ee544baaaa5a359a060512228c1e1
    Suggested-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Zqiang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Paolo Abeni <[email protected]>
    [Harshit: Resolved conflicts due to missing commit: 2c04d279e857 ("net:
    usb: Convert tasklet API to new bottom half workqueue mechanism") in
    6.12.y]
    Signed-off-by: Harshit Mogalapalli <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

vfio/pci: Disable qword access to the PCI ROM bar [+ + +]

Author: Kevin Tian <[email protected]>
Date:   Tue Jan 6 02:44:28 2026 +0000

    vfio/pci: Disable qword access to the PCI ROM bar
    
    [ Upstream commit dc85a46928c41423ad89869baf05a589e2975575 ]
    
    Commit 2b938e3db335 ("vfio/pci: Enable iowrite64 and ioread64 for vfio
    pci") enables qword access to the PCI bar resources. However certain
    devices (e.g. Intel X710) are observed with problem upon qword accesses
    to the rom bar, e.g. triggering PCI aer errors.
    
    This is triggered by Qemu which caches the rom content by simply does a
    pread() of the remaining size until it gets the full contents. The other
    bars would only perform operations at the same access width as their
    guest drivers.
    
    Instead of trying to identify all broken devices, universally disable
    qword access to the rom bar i.e. going back to the old way which worked
    reliably for years.
    
    Reported-by: Farrah Chen <[email protected]>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=220740
    Fixes: 2b938e3db335 ("vfio/pci: Enable iowrite64 and ioread64 for vfio pci")
    Cc: [email protected]
    Signed-off-by: Kevin Tian <[email protected]>
    Tested-by: Farrah Chen <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alex Williamson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

vfio/pds: Fix memory leak in pds_vfio_dirty_enable() [+ + +]

Author: Zilin Guan <[email protected]>
Date:   Thu Dec 25 14:31:50 2025 +0000

    vfio/pds: Fix memory leak in pds_vfio_dirty_enable()
    
    [ Upstream commit 665077d78dc7941ce6a330c02023a2b469cc8cc7 ]
    
    pds_vfio_dirty_enable() allocates memory for region_info. If
    interval_tree_iter_first() returns NULL, the function returns -EINVAL
    immediately without freeing the allocated memory, causing a memory leak.
    
    Fix this by jumping to the out_free_region_info label to ensure
    region_info is freed.
    
    Fixes: 2e7c6feb4ef52 ("vfio/pds: Add multi-region support")
    Signed-off-by: Zilin Guan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alex Williamson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

vhost/vsock: improve RCU read sections around vhost_vsock_get() [+ + +]

Author: Stefano Garzarella <[email protected]>
Date:   Wed Nov 26 14:38:26 2025 +0100

    vhost/vsock: improve RCU read sections around vhost_vsock_get()
    
    [ Upstream commit d8ee3cfdc89b75dc059dc21c27bef2c1440f67eb ]
    
    vhost_vsock_get() uses hash_for_each_possible_rcu() to find the
    `vhost_vsock` associated with the `guest_cid`. hash_for_each_possible_rcu()
    should only be called within an RCU read section, as mentioned in the
    following comment in include/linux/rculist.h:
    
    /**
     * hlist_for_each_entry_rcu - iterate over rcu list of given type
     * @pos:        the type * to use as a loop cursor.
     * @head:       the head for your list.
     * @member:     the name of the hlist_node within the struct.
     * @cond:       optional lockdep expression if called from non-RCU protection.
     *
     * This list-traversal primitive may safely run concurrently with
     * the _rcu list-mutation primitives such as hlist_add_head_rcu()
     * as long as the traversal is guarded by rcu_read_lock().
     */
    
    Currently, all calls to vhost_vsock_get() are between rcu_read_lock()
    and rcu_read_unlock() except for calls in vhost_vsock_set_cid() and
    vhost_vsock_reset_orphans(). In both cases, the current code is safe,
    but we can make improvements to make it more robust.
    
    About vhost_vsock_set_cid(), when building the kernel with
    CONFIG_PROVE_RCU_LIST enabled, we get the following RCU warning when the
    user space issues `ioctl(dev, VHOST_VSOCK_SET_GUEST_CID, ...)` :
    
      WARNING: suspicious RCU usage
      6.18.0-rc7 #62 Not tainted
      -----------------------------
      drivers/vhost/vsock.c:74 RCU-list traversed in non-reader section!!
    
      other info that might help us debug this:
    
      rcu_scheduler_active = 2, debug_locks = 1
      1 lock held by rpc-libvirtd/3443:
       #0: ffffffffc05032a8 (vhost_vsock_mutex){+.+.}-{4:4}, at: vhost_vsock_dev_ioctl+0x2ff/0x530 [vhost_vsock]
    
      stack backtrace:
      CPU: 2 UID: 0 PID: 3443 Comm: rpc-libvirtd Not tainted 6.18.0-rc7 #62 PREEMPT(none)
      Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.17.0-7.fc42 06/10/2025
      Call Trace:
       <TASK>
       dump_stack_lvl+0x75/0xb0
       dump_stack+0x14/0x1a
       lockdep_rcu_suspicious.cold+0x4e/0x97
       vhost_vsock_get+0x8f/0xa0 [vhost_vsock]
       vhost_vsock_dev_ioctl+0x307/0x530 [vhost_vsock]
       __x64_sys_ioctl+0x4f2/0xa00
       x64_sys_call+0xed0/0x1da0
       do_syscall_64+0x73/0xfa0
       entry_SYSCALL_64_after_hwframe+0x76/0x7e
       ...
       </TASK>
    
    This is not a real problem, because the vhost_vsock_get() caller, i.e.
    vhost_vsock_set_cid(), holds the `vhost_vsock_mutex` used by the hash
    table writers. Anyway, to prevent that warning, add lockdep_is_held()
    condition to hash_for_each_possible_rcu() to verify that either the
    caller is in an RCU read section or `vhost_vsock_mutex` is held when
    CONFIG_PROVE_RCU_LIST is enabled; and also clarify the comment for
    vhost_vsock_get() to better describe the locking requirements and the
    scope of the returned pointer validity.
    
    About vhost_vsock_reset_orphans(), currently this function is only
    called via vsock_for_each_connected_socket(), which holds the
    `vsock_table_lock` spinlock (which is also an RCU read-side critical
    section). However, add an explicit RCU read lock there to make the code
    more robust and explicit about the RCU requirements, and to prevent
    issues if the calling context changes in the future or if
    vhost_vsock_reset_orphans() is called from other contexts.
    
    Fixes: 834e772c8db0 ("vhost/vsock: fix use-after-free in network stack callers")
    Cc: [email protected]
    Signed-off-by: Stefano Garzarella <[email protected]>
    Reviewed-by: Stefan Hajnoczi <[email protected]>
    Message-Id: <[email protected]>
    Message-ID: <20251126210313.GA499503@fedora>
    Acked-by: Jason Wang <[email protected]>
    Signed-off-by: Michael S. Tsirkin <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

via_wdt: fix critical boot hang due to unnamed resource allocation [+ + +]

Author: Li Qiang <[email protected]>
Date:   Sun Sep 28 16:33:32 2025 +0800

    via_wdt: fix critical boot hang due to unnamed resource allocation
    
    [ Upstream commit 7aa31ee9ec92915926e74731378c009c9cc04928 ]
    
    The VIA watchdog driver uses allocate_resource() to reserve a MMIO
    region for the watchdog control register. However, the allocated
    resource was not given a name, which causes the kernel resource tree
    to contain an entry marked as "<BAD>" under /proc/iomem on x86
    platforms.
    
    During boot, this unnamed resource can lead to a critical hang because
    subsequent resource lookups and conflict checks fail to handle the
    invalid entry properly.
    
    Signed-off-by: Li Qiang <[email protected]>
    Reviewed-by: Guenter Roeck <[email protected]>
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Wim Van Sebroeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

virtio: vdpa: Fix reference count leak in octep_sriov_enable() [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Mon Oct 27 14:07:35 2025 +0800

    virtio: vdpa: Fix reference count leak in octep_sriov_enable()
    
    commit b41ca62c0019de1321d75f2b2f274a28784a41ed upstream.
    
    pci_get_device() will increase the reference count for the returned
    pci_dev, and also decrease the reference count for the input parameter
    from if it is not NULL.
    
    If we break the loop in  with 'vf_pdev' not NULL. We
    need to call pci_dev_put() to decrease the reference count.
    
    Found via static anlaysis and this is similar to commit c508eb042d97
    ("perf/x86/intel/uncore: Fix reference count leak in sad_cfg_iio_topology()")
    
    Fixes: 8b6c724cdab8 ("virtio: vdpa: vDPA driver for Marvell OCTEON DPU devices")
    Cc: [email protected]
    Signed-off-by: Miaoqian Lin <[email protected]>
    Signed-off-by: Michael S. Tsirkin <[email protected]>
    Message-Id: <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: brcmfmac: Add DMI nvram filename quirk for Acer A1 840 tablet [+ + +]

Author: Hans de Goede <[email protected]>
Date:   Mon Nov 3 11:03:14 2025 +0100

    wifi: brcmfmac: Add DMI nvram filename quirk for Acer A1 840 tablet
    
    [ Upstream commit a8e5a110c0c38e08e5dd66356cd1156e91cf88e1 ]
    
    The Acer A1 840 tablet contains quite generic names in the sys_vendor and
    product_name DMI strings, without this patch brcmfmac will try to load:
    brcmfmac43340-sdio.Insyde-BayTrail.txt as nvram file which is a bit
    too generic.
    
    Add a DMI quirk so that a unique and clearly identifiable nvram file name
    is used on the Acer A1 840 tablet.
    
    Acked-by: Arend van Spriel <[email protected]>
    Signed-off-by: Hans de Goede <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: cfg80211: sme: store capped length in __cfg80211_connect_result() [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Wed Dec 3 14:14:47 2025 +0300

    wifi: cfg80211: sme: store capped length in __cfg80211_connect_result()
    
    [ Upstream commit 2b77b9551d1184cb5af8271ff350e6e2c1b3db0d ]
    
    The QGenie AI code review tool says we should store the capped length to
    wdev->u.client.ssid_len.  The AI is correct.
    
    Fixes: 62b635dcd69c ("wifi: cfg80211: sme: cap SSID length in __cfg80211_connect_result()")
    Signed-off-by: Dan Carpenter <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: cfg80211: stop radar detection in cfg80211_leave() [+ + +]

Author: Johannes Berg <[email protected]>
Date:   Fri Nov 21 17:40:21 2025 +0100

    wifi: cfg80211: stop radar detection in cfg80211_leave()
    
    [ Upstream commit 9f33477b9a31a1edfe2df9f1a0359cccb0e16b4c ]
    
    If an interface is set down or, per the previous patch, changes
    type, radar detection for it should be cancelled. This is done
    for AP mode in mac80211 (somewhat needlessly, since cfg80211 can
    do it, but didn't until now), but wasn't handled for mesh, so if
    radar detection was started and then the interface set down or
    its type switched (the latter sometimes happning in the hwsim
    test 'mesh_peer_connected_dfs'), radar detection would be around
    with the interface unknown to the driver, later leading to some
    warnings around chanctx usage.
    
    Link: https://patch.msgid.link/20251121174021.290120e419e3.I2a5650c9062e29c988992dd8ce0d8eb570d23267@changeid
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: cfg80211: use cfg80211_leave() in iftype change [+ + +]

Author: Johannes Berg <[email protected]>
Date:   Fri Nov 21 17:40:20 2025 +0100

    wifi: cfg80211: use cfg80211_leave() in iftype change
    
    [ Upstream commit 7a27b73943a70ee226fa125327101fb18e94701d ]
    
    When changing the interface type, all activity on the interface has
    to be stopped first. This was done independent of existing code in
    cfg80211_leave(), so didn't handle e.g. background radar detection.
    Use cfg80211_leave() to handle it the same way.
    
    Note that cfg80211_leave() behaves slightly differently for IBSS in
    wireless extensions, it won't send an event in that case. We could
    handle that, but since nl80211 was used to change the type, IBSS is
    rare, and wext is already a corner case, it doesn't seem worth it.
    
    Link: https://patch.msgid.link/20251121174021.922ef48ce007.I970c8514252ef8a864a7fbdab9591b71031dee03@changeid
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: mac80211: do not use old MBSSID elements [+ + +]

Author: Aloka Dixit <[email protected]>
Date:   Mon Dec 15 09:46:56 2025 -0800

    wifi: mac80211: do not use old MBSSID elements
    
    [ Upstream commit a519be2f5d958c5804f2cfd68f1f384291271fab ]
    
    When userspace brings down and deletes a non-transmitted profile,
    it is expected to send a new updated Beacon template for the
    transmitted profile of that multiple BSSID (MBSSID) group which
    does not include the removed profile in MBSSID element. This
    update comes via NL80211_CMD_SET_BEACON.
    
    Such updates work well as long as the group continues to have at
    least one non-transmitted profile as NL80211_ATTR_MBSSID_ELEMS
    is included in the new Beacon template.
    
    But when the last non-trasmitted profile is removed, it still
    gets included in Beacon templates sent to driver. This happens
    because when no MBSSID elements are sent by the userspace,
    ieee80211_assign_beacon() ends up using the element stored from
    earlier Beacon template.
    
    Do not copy old MBSSID elements, instead userspace should always
    include these when applicable.
    
    Fixes: 2b3171c6fe0a ("mac80211: MBSSID beacon handling in AP mode")
    Signed-off-by: Aloka Dixit <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: mt76: Fix DTS power-limits on little endian systems [+ + +]

Author: Sven Eckelmann (Plasma Cloud) <[email protected]>
Date:   Fri Sep 26 11:32:54 2025 +0200

    wifi: mt76: Fix DTS power-limits on little endian systems
    
    commit 38b845e1f9e810869b0a0b69f202b877b7b7fb12 upstream.
    
    The power-limits for ru and mcs and stored in the devicetree as bytewise
    array (often with sizes which are not a multiple of 4). These arrays have a
    prefix which defines for how many modes a line is applied. This prefix is
    also only a byte - but the code still tried to fix the endianness of this
    byte with a be32 operation. As result, loading was mostly failing or was
    sending completely unexpected values to the firmware.
    
    Since the other rates are also stored in the devicetree as bytewise arrays,
    just drop the u32 access + be32_to_cpu conversion and directly access them
    as bytes arrays.
    
    Cc: [email protected]
    Fixes: 22b980badc0f ("mt76: add functions for parsing rate power limits from DT")
    Fixes: a9627d992b5e ("mt76: extend DT rate power limits to support 11ax devices")
    Signed-off-by: Sven Eckelmann (Plasma Cloud) <[email protected]>
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mt76: mt7925: add handler to hif suspend/resume event [+ + +]

Author: Quan Zhou <[email protected]>
Date:   Mon Jan 5 12:16:28 2026 +0100

    wifi: mt76: mt7925: add handler to hif suspend/resume event
    
    [ Upstream commit 8f6571ad470feb242dcef36e53f7cf1bba03780f ]
    
    When the system suspend or resume, the WiFi driver sends
    an hif_ctrl command to the firmware and waits for an event.
    Due to changes in the event format reported by the chip, the
    current mt7925's driver does not account for these changes,
    resulting in command timeout. Add flow to handle hif_ctrl
    event to avoid command timeout. We also exented API
    mt76_connac_mcu_set_hif_suspend for connac3 this time.
    
    Signed-off-by: Quan Zhou <[email protected]>
    Link: https://patch.msgid.link/3a0844ff5162142c4a9f3cf7104f75076ddd3b87.1735910562.git.quan.zhou@mediatek.com
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Jan Kiszka <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mt76: mt7925: fix CLC command timeout when suspend/resume [+ + +]

Author: Quan Zhou <[email protected]>
Date:   Mon Jan 5 12:16:27 2026 +0100

    wifi: mt76: mt7925: fix CLC command timeout when suspend/resume
    
    [ Upstream commit a0f721b8d986b62b4de316444f2b2e356d17e3b5 ]
    
    When enter suspend/resume while in a connected state, the upper layer
    will trigger disconnection before entering suspend, and at the same time,
    it will trigger regd_notifier() and update CLC, causing the CLC event to
    not be received due to suspend, resulting in a command timeout.
    
    Therefore, the update of CLC is postponed until resume, to ensure data
    consistency and avoid the occurrence of command timeout.
    
    Signed-off-by: Quan Zhou <[email protected]>
    Link: https://patch.msgid.link/bab00a2805d0533fd8beaa059222659858a9dcb5.1735910455.git.quan.zhou@mediatek.com
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Jan Kiszka <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mt76: mt7925: fix the unfinished command of regd_notifier before suspend [+ + +]

Author: Quan Zhou <[email protected]>
Date:   Mon Jan 5 12:16:26 2026 +0100

    wifi: mt76: mt7925: fix the unfinished command of regd_notifier before suspend
    
    [ Upstream commit 1b97fc8443aea01922560de9f24a6383e6eb6ae8 ]
    
    Before entering suspend, we need to ensure that all MCU command are
    completed. In some cases, such as with regd_notifier, there is a
    chance that CLC commands, will be executed before suspend.
    
    Signed-off-by: Quan Zhou <[email protected]>
    Link: https://patch.msgid.link/3af7b4e5bf7437832b016e32743657d1d55b1f9d.1735910288.git.quan.zhou@mediatek.com
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Jan Kiszka <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

wifi: mt76: mt792x: fix wifi init fail by setting MCU_RUNNING after CLC load [+ + +]

Author: Quan Zhou <[email protected]>
Date:   Tue Nov 18 19:54:54 2025 +0800

    wifi: mt76: mt792x: fix wifi init fail by setting MCU_RUNNING after CLC load
    
    [ Upstream commit 066f417be5fd8c7fe581c5550206364735dad7a3 ]
    
    Set the MT76_STATE_MCU_RUNNING bit only after mt7921_load_clc()
    has successfully completed. Previously, the MCU_RUNNING state
    was set before loading CLC, which could cause conflict between
    chip mcu_init retry and mac_reset flow, result in chip init fail
    and chip abnormal status. By moving the state set after CLC load,
    firmware initialization becomes robust and resolves init fail issue.
    
    Signed-off-by: Quan Zhou <[email protected]>
    Reviewed-by: [email protected]
    Link: https://patch.msgid.link/19ec8e4465142e774f17801025accd0ae2214092.1763465933.git.quan.zhou@mediatek.com
    Signed-off-by: Felix Fietkau <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: rtl8xxxu: Fix HT40 channel config for RTL8192CU, RTL8723AU [+ + +]

Author: Bitterblue Smith <[email protected]>
Date:   Thu Nov 20 16:10:01 2025 +0200

    wifi: rtl8xxxu: Fix HT40 channel config for RTL8192CU, RTL8723AU
    
    [ Upstream commit 5511ba3de434892e5ef3594d6eabbd12b1629356 ]
    
    Flip the response rate subchannel. It was backwards, causing low
    speeds when using 40 MHz channel width. "iw dev ... station dump"
    showed a low RX rate, 11M or less.
    
    Also fix the channel width field of RF6052_REG_MODE_AG.
    
    Tested only with RTL8192CU, but these settings are identical for
    RTL8723AU.
    
    Signed-off-by: Bitterblue Smith <[email protected]>
    Reviewed-by: Ping-Ke Shih <[email protected]>
    Signed-off-by: Ping-Ke Shih <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

wifi: rtlwifi: 8192cu: fix tid out of range in rtl92cu_tx_fill_desc() [+ + +]

Author: Morning Star <[email protected]>
Date:   Thu Nov 27 16:37:08 2025 +0800

    wifi: rtlwifi: 8192cu: fix tid out of range in rtl92cu_tx_fill_desc()
    
    [ Upstream commit dd39edb445f07400e748da967a07d5dca5c5f96e ]
    
    TID getting from ieee80211_get_tid() might be out of range of array size
    of sta_entry->tids[], so check TID is less than MAX_TID_COUNT. Othwerwise,
    UBSAN warn:
    
     UBSAN: array-index-out-of-bounds in drivers/net/wireless/realtek/rtlwifi/rtl8192cu/trx.c:514:30
     index 10 is out of range for type 'rtl_tid_data [9]'
    
    Fixes: 8ca4cdef9329 ("wifi: rtlwifi: rtl8192cu: Fix TX aggregation")
    Signed-off-by: Morning Star <[email protected]>
    Signed-off-by: Ping-Ke Shih <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

wifi: rtw88: limit indirect IO under powered off for RTL8822CS [+ + +]

Author: Ping-Ke Shih <[email protected]>
Date:   Tue Nov 25 09:38:49 2025 +0800

    wifi: rtw88: limit indirect IO under powered off for RTL8822CS
    
    [ Upstream commit f3ccdfda345ca9a624ea425840a926b8338c1e25 ]
    
    The indirect IO is necessary for RTL8822CS, but not necessary for other
    chips. Otherwiese, it throws errors and becomes unusable.
    
     rtw88_8723cs mmc1:0001:1: WOW Firmware version 11.0.0, H2C version 0
     rtw88_8723cs mmc1:0001:1: Firmware version 11.0.0, H2C version 0
     rtw88_8723cs mmc1:0001:1: sdio read32 failed (0xf0): -110
     rtw88_8723cs mmc1:0001:1: sdio write8 failed (0x1c): -110
     rtw88_8723cs mmc1:0001:1: sdio read32 failed (0xf0): -110
    
    By vendor driver, only RTL8822CS and RTL8822ES need indirect IO, but
    RTL8822ES isn't supported yet. Therefore, limit it to RTL8822CS only.
    
    Reported-by: Andrey Skvortsov <[email protected]>
    Closes: https://lore.kernel.org/linux-wireless/[email protected]/T/#m997b4522f7209ba629561c776bfd1d13ab24c1d4
    Fixes: 58de1f91e033 ("wifi: rtw88: sdio: use indirect IO for device registers before power-on")
    Signed-off-by: Ping-Ke Shih <[email protected]>
    Tested-by: Andrey Skvortsov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/fpu: Fix FPU state core dump truncation on CPUs with no extended xfeatures [+ + +]

Author: Yongxin Liu <[email protected]>
Date:   Wed Dec 10 08:02:20 2025 +0800

    x86/fpu: Fix FPU state core dump truncation on CPUs with no extended xfeatures
    
    [ Upstream commit c8161e5304abb26e6c0bec6efc947992500fa6c5 ]
    
    Zero can be a valid value of num_records. For example, on Intel Atom x6425RE,
    only x87 and SSE are supported (features 0, 1), and fpu_user_cfg.max_features
    is 3. The for_each_extended_xfeature() loop only iterates feature 2, which is
    not enabled, so num_records = 0. This is valid and should not cause core dump
    failure.
    
    The issue is that dump_xsave_layout_desc() returns 0 for both genuine errors
    (dump_emit() failure) and valid cases (no extended features). Use negative
    return values for errors and only abort on genuine failures.
    
    Fixes: ba386777a30b ("x86/elf: Add a new FPU buffer layout info to x86 core files")
    Signed-off-by: Yongxin Liu <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/mce: Do not clear bank's poll bit in mce_poll_banks on AMD SMCA systems [+ + +]

Author: Avadhut Naik <[email protected]>
Date:   Fri Nov 21 19:04:04 2025 +0000

    x86/mce: Do not clear bank's poll bit in mce_poll_banks on AMD SMCA systems
    
    commit d7ac083f095d894a0b8ac0573516bfd035e6b25a upstream.
    
    Currently, when a CMCI storm detected on a Machine Check bank, subsides, the
    bank's corresponding bit in the mce_poll_banks per-CPU variable is cleared
    unconditionally by cmci_storm_end().
    
    On AMD SMCA systems, this essentially disables polling on that particular bank
    on that CPU. Consequently, any subsequent correctable errors or storms will not
    be logged.
    
    Since AMD SMCA systems allow banks to be managed by both polling and
    interrupts, the polling banks bitmap for a CPU, i.e., mce_poll_banks, should
    not be modified when a storm subsides.
    
    Fixes: 7eae17c4add5 ("x86/mce: Add per-bank CMCI storm mitigation")
    Signed-off-by: Avadhut Naik <[email protected]>
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/microcode/AMD: Fix Entrysign revision check for Zen5/Strix Halo [+ + +]

Author: Rong Zhang <[email protected]>
Date:   Tue Dec 30 02:22:21 2025 +0800

    x86/microcode/AMD: Fix Entrysign revision check for Zen5/Strix Halo
    
    commit 150b1b97e27513535dcd3795d5ecd28e61b6cb8c upstream.
    
    Zen5 also contains family 1Ah, models 70h-7Fh, which are mistakenly missing
    from cpu_has_entrysign(). Add the missing range.
    
    Fixes: 8a9fb5129e8e ("x86/microcode/AMD: Limit Entrysign signature checking to known generations")
    Signed-off-by: Rong Zhang <[email protected]>
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/microcode/AMD: Select which microcode patch to load [+ + +]

Author: Borislav Petkov (AMD) <[email protected]>
Date:   Thu Sep 25 13:46:00 2025 +0200

    x86/microcode/AMD: Select which microcode patch to load
    
    commit 8d171045069c804e5ffaa18be590c42c6af0cf3f upstream.
    
    All microcode patches up to the proper BIOS Entrysign fix are loaded
    only after the sha256 signature carried in the driver has been verified.
    
    Microcode patches after the Entrysign fix has been applied, do not need
    that signature verification anymore.
    
    In order to not abandon machines which haven't received the BIOS update
    yet, add the capability to select which microcode patch to load.
    
    The corresponding microcode container supplied through firmware-linux
    has been modified to carry two patches per CPU type
    (family/model/stepping) so that the proper one gets selected.
    
    Signed-off-by: Borislav Petkov (AMD) <[email protected]>
    Tested-by: Waiman Long <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/msi: Make irq_retrigger() functional for posted MSI [+ + +]

Author: Thomas Gleixner <[email protected]>
Date:   Tue Dec 23 13:19:53 2025 -0500

    x86/msi: Make irq_retrigger() functional for posted MSI
    
    [ Upstream commit 0edc78b82bea85e1b2165d8e870a5c3535919695 ]
    
    Luigi reported that retriggering a posted MSI interrupt does not work
    correctly.
    
    The reason is that the retrigger happens at the vector domain by sending an
    IPI to the actual vector on the target CPU. That works correctly exactly
    once because the posted MSI interrupt chip does not issue an EOI as that's
    only required for the posted MSI notification vector itself.
    
    As a consequence the vector becomes stale in the ISR, which not only
    affects this vector but also any lower priority vector in the affected
    APIC because the ISR bit is not cleared.
    
    Luigi proposed to set the vector in the remap PIR bitmap and raise the
    posted MSI notification vector. That works, but that still does not cure a
    related problem:
    
      If there is ever a stray interrupt on such a vector, then the related
      APIC ISR bit becomes stale due to the lack of EOI as described above.
      Unlikely to happen, but if it happens it's not debuggable at all.
    
    So instead of playing games with the PIR, this can be actually solved
    for both cases by:
    
     1) Keeping track of the posted interrupt vector handler state
    
     2) Implementing a posted MSI specific irq_ack() callback which checks that
        state. If the posted vector handler is inactive it issues an EOI,
        otherwise it delegates that to the posted handler.
    
    This is correct versus affinity changes and concurrent events on the posted
    vector as the actual handler invocation is serialized through the interrupt
    descriptor lock.
    
    Fixes: ed1e48ea4370 ("iommu/vt-d: Enable posted mode for device MSIs")
    Reported-by: Luigi Rizzo <[email protected]>
    Signed-off-by: Thomas Gleixner <[email protected]>
    Tested-by: Luigi Rizzo <[email protected]>
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Closes: https://lore.kernel.org/lkml/[email protected]
    [ DEFINE_PER_CPU_CACHE_HOT => DEFINE_PER_CPU ]
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/ptrace: Always inline trivial accessors [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Fri Oct 31 12:04:24 2025 +0100

    x86/ptrace: Always inline trivial accessors
    
    [ Upstream commit 1fe4002cf7f23d70c79bda429ca2a9423ebcfdfa ]
    
    A KASAN build bloats these single load/store helpers such that
    it fails to inline them:
    
      vmlinux.o: error: objtool: irqentry_exit+0x5e8: call to instruction_pointer_set() with UACCESS enabled
    
    Make sure the compiler isn't allowed to do stupid.
    
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/xen: Fix sparse warning in enlighten_pv.c [+ + +]

Author: Juergen Gross <[email protected]>
Date:   Mon Dec 15 12:51:12 2025 +0100

    x86/xen: Fix sparse warning in enlighten_pv.c
    
    [ Upstream commit e5aff444e3a7bdeef5ea796a2099fc3c60a070fa ]
    
    The sparse tool issues a warning for arch/x76/xen/enlighten_pv.c:
    
       arch/x86/xen/enlighten_pv.c:120:9: sparse: sparse: incorrect type
         in initializer (different address spaces)
         expected void const [noderef] __percpu *__vpp_verify
         got bool *
    
    This is due to the percpu variable xen_in_preemptible_hcall being
    exported via EXPORT_SYMBOL_GPL() instead of EXPORT_PER_CPU_SYMBOL_GPL().
    
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Fixes: fdfd811ddde3 ("x86/xen: allow privcmd hypercalls to be preempted")
    Reviewed-by: Boris Ostrovsky <[email protected]>
    Signed-off-by: Juergen Gross <[email protected]>
    Message-ID: <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

x86/xen: Move Xen upcall handler [+ + +]

Author: Brian Gerst <[email protected]>
Date:   Fri Mar 14 11:12:14 2025 -0400

    x86/xen: Move Xen upcall handler
    
    [ Upstream commit 1ab7b5ed44ba9bce581e225f40219b793bc779d6 ]
    
    Move the upcall handler to Xen-specific files.
    
    No functional changes.
    
    Signed-off-by: Brian Gerst <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Reviewed-by: Juergen Gross <[email protected]>
    Reviewed-by: Sohil Mehta <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: H. Peter Anvin <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Cc: Josh Poimboeuf <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Stable-dep-of: e5aff444e3a7 ("x86/xen: Fix sparse warning in enlighten_pv.c")
    Signed-off-by: Sasha Levin <[email protected]>

xfs: don't leak a locked dquot when xfs_dquot_attach_buf fails [+ + +]

Author: Christoph Hellwig <[email protected]>
Date:   Mon Nov 10 14:22:53 2025 +0100

    xfs: don't leak a locked dquot when xfs_dquot_attach_buf fails
    
    commit 204c8f77e8d4a3006f8abe40331f221a597ce608 upstream.
    
    xfs_qm_quotacheck_dqadjust acquired the dquot through xfs_qm_dqget,
    which means it owns a reference and holds q_qlock.  Both need to
    be dropped on an error exit.
    
    Cc: <[email protected]> # v6.13
    Fixes: ca378189fdfa ("xfs: convert quotacheck to attach dquot buffers")
    Reported-by: kernel test robot <[email protected]>
    Reported-by: Dan Carpenter <[email protected]>
    Signed-off-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Darrick J. Wong <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix a memory leak in xfs_buf_item_init() [+ + +]

Author: Haoxiang Li <[email protected]>
Date:   Wed Dec 10 17:06:01 2025 +0800

    xfs: fix a memory leak in xfs_buf_item_init()
    
    commit fc40459de82543b565ebc839dca8f7987f16f62e upstream.
    
    xfs_buf_item_get_format() may allocate memory for bip->bli_formats,
    free the memory in the error path.
    
    Fixes: c3d5f0c2fb85 ("xfs: complain if anyone tries to create a too-large buffer log item")
    Cc: [email protected]
    Signed-off-by: Haoxiang Li <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Reviewed-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix a UAF problem in xattr repair [+ + +]

Author: Darrick J. Wong <[email protected]>
Date:   Thu Dec 4 13:43:50 2025 -0800

    xfs: fix a UAF problem in xattr repair
    
    commit 5990fd756943836978ad184aac980e2b36ab7e01 upstream.
    
    The xchk_setup_xattr_buf function can allocate a new value buffer, which
    means that any reference to ab->value before the call could become a
    dangling pointer.  Fix this by moving an assignment to after the buffer
    setup.
    
    Cc: [email protected] # v6.10
    Fixes: e47dcf113ae348 ("xfs: repair extended attributes")
    Signed-off-by: Darrick J. Wong <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xfs: fix stupid compiler warning [+ + +]

Author: Darrick J. Wong <[email protected]>
Date:   Thu Dec 4 13:44:15 2025 -0800

    xfs: fix stupid compiler warning
    
    commit f06725052098d7b1133ac3846d693c383dc427a2 upstream.
    
    gcc 14.2 warns about:
    
    xfs_attr_item.c: In function ‘xfs_attr_recover_work’:
    xfs_attr_item.c:785:9: warning: ‘ip’ may be used uninitialized [-Wmaybe-uninitialized]
      785 |         xfs_trans_ijoin(tp, ip, 0);
          |         ^~~~~~~~~~~~~~~~~~~~~~~~~~
    xfs_attr_item.c:740:42: note: ‘ip’ was declared here
      740 |         struct xfs_inode                *ip;
          |                                          ^~
    
    I think this is bogus since xfs_attri_recover_work either returns a real
    pointer having initialized ip or an ERR_PTR having not touched it, but
    the tools are smarter than me so let's just null-init the variable
    anyway.
    
    Cc: [email protected] # v6.8
    Fixes: e70fb328d52772 ("xfs: recreate work items when recovering intent items")
    Signed-off-by: Darrick J. Wong <[email protected]>
    Reviewed-by: Carlos Maiolino <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Carlos Maiolino <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

xhci: dbgtty: fix device unregister: fixup [+ + +]

Author: Łukasz Bartosik <[email protected]>
Date:   Tue Dec 30 13:32:37 2025 -0500

    xhci: dbgtty: fix device unregister: fixup
    
    [ Upstream commit 74098cc06e753d3ffd8398b040a3a1dfb65260c0 ]
    
    This fixup replaces tty_vhangup() call with call to
    tty_port_tty_vhangup(). Both calls hangup tty device
    synchronously however tty_port_tty_vhangup() increases
    reference count during the hangup operation using
    scoped_guard(tty_port_tty).
    
    Cc: stable <[email protected]>
    Fixes: 1f73b8b56cf3 ("xhci: dbgtty: fix device unregister")
    Signed-off-by: Łukasz Bartosik <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>