Changelog in Linux kernel 6.1.134

acpi: nfit: fix narrowing conversion in acpi_nfit_ctl [+ + +]

Author: Murad Masimov <[email protected]>
Date:   Thu Jan 23 19:39:45 2025 +0300

    acpi: nfit: fix narrowing conversion in acpi_nfit_ctl
    
    commit 2ff0e408db36c21ed3fa5e3c1e0e687c82cf132f upstream.
    
    Syzkaller has reported a warning in to_nfit_bus_uuid(): "only secondary
    bus families can be translated". This warning is emitted if the argument
    is equal to NVDIMM_BUS_FAMILY_NFIT == 0. Function acpi_nfit_ctl() first
    verifies that a user-provided value call_pkg->nd_family of type u64 is
    not equal to 0. Then the value is converted to int, and only after that
    is compared to NVDIMM_BUS_FAMILY_MAX. This can lead to passing an invalid
    argument to acpi_nfit_ctl(), if call_pkg->nd_family is non-zero, while
    the lower 32 bits are zero.
    
    Furthermore, it is best to return EINVAL immediately upon seeing the
    invalid user input.  The WARNING is insufficient to prevent further
    undefined behavior based on other invalid user input.
    
    All checks of the input value should be applied to the original variable
    call_pkg->nd_family.
    
    [iweiny: update commit message]
    
    Fixes: 6450ddbd5d8e ("ACPI: NFIT: Define runtime firmware activation commands")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=c80d8dc0d9fa81a3cd8c
    Signed-off-by: Murad Masimov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Ira Weiny <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
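
Note: the check ordering described in the entry above can be modeled in a few lines. This is a minimal Python sketch, not the kernel code: to_c_int(), family_buggy() and family_fixed() are hypothetical names, and the value used for NVDIMM_BUS_FAMILY_MAX is an assumption made only for illustration.

```python
import ctypes

EINVAL = 22
NVDIMM_BUS_FAMILY_MAX = 1  # assumed value, for illustration only


def to_c_int(value: int) -> int:
    """Mimic a C conversion of a u64 to a 32-bit int: keep the low 32 bits."""
    return ctypes.c_int32(value & 0xFFFFFFFF).value


def family_buggy(nd_family: int) -> int:
    """Old order: narrow first, then range-check the narrowed value.

    The caller is assumed to have already verified nd_family != 0.
    """
    family = to_c_int(nd_family)
    if family > NVDIMM_BUS_FAMILY_MAX:
        return -EINVAL
    return family  # can be 0 although nd_family was non-zero


def family_fixed(nd_family: int) -> int:
    """Fixed order: range-check the original 64-bit value, then narrow."""
    if nd_family > NVDIMM_BUS_FAMILY_MAX:
        return -EINVAL
    return to_c_int(nd_family)
```

With nd_family = 1 << 32, family_buggy() yields 0 (the value the warning complains about), while family_fixed() rejects the input with -EINVAL.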

ACPI: processor: idle: Return an error if both P_LVL{2,3} idle states are invalid [+ + +]

Author: Giovanni Gherdovich <[email protected]>
Date:   Fri Mar 28 15:30:39 2025 +0100

    ACPI: processor: idle: Return an error if both P_LVL{2,3} idle states are invalid
    
    [ Upstream commit 9e9b893404d43894d69a18dd2fc8fcf1c36abb7e ]
    
    Prior to commit 496121c02127 ("ACPI: processor: idle: Allow probing on
    platforms with one ACPI C-state"), the acpi_idle driver wouldn't load on
    systems without a valid C-State at least as deep as C2.
    
    The behavior was desirable for guests on hypervisors such as VMWare
    ESXi, which by default don't have the _CST ACPI method, and set the C2
    and C3 latencies to 101 and 1001 microseconds respectively via the FADT,
    to signify they're unsupported.
    
    Since the above change though, these virtualized deployments end up
    loading acpi_idle, and thus entering the default C1 C-State set by
    acpi_processor_get_power_info_default(); this is undesirable for a
    system that's communicating to the OS it doesn't want C-States (missing
    _CST, and invalid C2/C3 in FADT).
    
    Make acpi_processor_get_power_info_fadt() return -ENODEV in that case,
    so that acpi_processor_get_cstate_info() exits early and doesn't set
    pr->flags.power = 1.
    
    Fixes: 496121c02127 ("ACPI: processor: idle: Allow probing on platforms with one ACPI C-state")
    Signed-off-by: Giovanni Gherdovich <[email protected]>
    Reviewed-by: Zhang Rui <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [ rjw: Changelog edits ]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
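
Note: the decision described in the entry above can be sketched as follows. This is a simplified Python model under stated assumptions: power_info_fadt() is a hypothetical stand-in for acpi_processor_get_power_info_fadt(), the two limits are assumed from the 101 us / 1001 us values quoted above, and the real function checks more than latencies.

```python
import errno

# Assumed limits: a FADT latency above these marks the state as unsupported.
MAX_C2_LATENCY_US = 100
MAX_C3_LATENCY_US = 1000


def power_info_fadt(c2_latency_us: int, c3_latency_us: int) -> int:
    """Return 0 if at least one of C2/C3 looks usable, else -ENODEV."""
    c2_valid = c2_latency_us <= MAX_C2_LATENCY_US
    c3_valid = c3_latency_us <= MAX_C3_LATENCY_US
    if not c2_valid and not c3_valid:
        # Both states invalid: report an error so the caller exits early
        # instead of falling back to the default C1 state.
        return -errno.ENODEV
    return 0
```

A guest that reports 101 and 1001 microseconds gets -ENODEV in this model, so the caller would not mark power management as available.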

ACPI: resource: Skip IRQ override on ASUS Vivobook 14 X1404VAP [+ + +]

Author: Paul Menzel <[email protected]>
Date:   Tue Mar 18 17:09:02 2025 +0100

    ACPI: resource: Skip IRQ override on ASUS Vivobook 14 X1404VAP
    
    commit 2da31ea2a085cd189857f2db0f7b78d0162db87a upstream.
    
    Like the ASUS Vivobook X1504VAP and Vivobook X1704VAP, the ASUS Vivobook 14
    X1404VAP has its keyboard IRQ (1) described as ActiveLow in the DSDT, which
    the kernel overrides to EdgeHigh, breaking the keyboard.
    
        $ sudo dmidecode
        […]
        System Information
                Manufacturer: ASUSTeK COMPUTER INC.
                Product Name: ASUS Vivobook 14 X1404VAP_X1404VA
        […]
        $ grep -A 30 PS2K dsdt.dsl | grep IRQ -A 1
                     IRQ (Level, ActiveLow, Exclusive, )
                         {1}
    
    Add the X1404VAP to the irq1_level_low_skip_override[] quirk table to fix
    this.
    
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=219224
    Cc: All applicable <[email protected]>
    Signed-off-by: Paul Menzel <[email protected]>
    Reviewed-by: Hans de Goede <[email protected]>
    Tested-by: Anton Shyndin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

affs: don't write overlarge OFS data block size fields [+ + +]

Author: Simon Tatham <[email protected]>
Date:   Thu Feb 20 08:14:44 2025 +0000

    affs: don't write overlarge OFS data block size fields
    
    [ Upstream commit 011ea742a25a77bac3d995f457886a67d178c6f0 ]
    
    If a data sector on an OFS floppy contains a value > 0x1e8 (the
    largest amount of data that fits in the sector after its header), then
    an Amiga reading the file can return corrupt data, by taking the
    overlarge size at its word and reading past the end of the buffer it
    read the disk sector into!
    
    The cause: when affs_write_end_ofs() writes data to an OFS filesystem,
    the new size field for a data block was computed by adding the amount
    of data currently being written (into the block) to the existing value
    of the size field. This is correct if you're extending the file at the
    end, but if you seek backwards in the file and overwrite _existing_
    data, it can lead to the size field being larger than the maximum
    legal value.
    
    This commit changes the calculation so that it sets the size field to
    the max of its previous size and the position within the block that we
    just wrote up to.
    
    Signed-off-by: Simon Tatham <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
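
Note: the two size calculations described in the entry above can be compared directly. This is a minimal Python sketch with hypothetical function names; only the 0x1e8 limit and the max() rule come from the entry itself.

```python
OFS_MAX_DATA = 0x1E8  # largest payload of an OFS data block, per the entry above


def size_field_old(prev_size: int, offset_in_block: int, nbytes: int) -> int:
    """Old rule: always add the bytes just written to the stored size."""
    return prev_size + nbytes


def size_field_fixed(prev_size: int, offset_in_block: int, nbytes: int) -> int:
    """Fixed rule: the size is the furthest position written so far."""
    return max(prev_size, offset_in_block + nbytes)
```

Appending gives the same result under both rules, but overwriting 100 bytes at the start of an already full block pushes the old result past OFS_MAX_DATA while the fixed result stays at 0x1e8.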

affs: generate OFS sequence numbers starting at 1 [+ + +]

Author: Simon Tatham <[email protected]>
Date:   Thu Feb 20 08:14:43 2025 +0000

    affs: generate OFS sequence numbers starting at 1
    
    [ Upstream commit e4cf8ec4de4e13f156c1d61977d282d90c221085 ]
    
    If I write a file to an OFS floppy image, and try to read it back on
    an emulated Amiga running Workbench 1.3, the Amiga reports a disk
    error trying to read the file. (That is, it's unable to read it _at
    all_, even to copy it to the NIL: device. It isn't a matter of getting
    the wrong data and being unable to parse the file format.)
    
    This is because the 'sequence number' field in the OFS data block
    header is supposed to be based at 1, but affs writes it based at 0.
    All three locations changed by this patch were setting the sequence
    number to a variable 'bidx' which was previously obtained by dividing
    a file position by bsize, so bidx will naturally use 0 for the first
    block. Therefore all three should add 1 to that value before writing
    it into the sequence number field.
    
    With this change, the Amiga successfully reads the file.
    
    For data block reference: https://wiki.osdev.org/FFS_(Amiga)
    
    Signed-off-by: Simon Tatham <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
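
Note: the off-by-one described in the entry above comes down to a single addition. A minimal Python sketch, where ofs_sequence_number() is a hypothetical helper and bsize stands for the per-block payload size:

```python
def ofs_sequence_number(file_pos: int, bsize: int) -> int:
    """Sequence number of the OFS data block that holds file_pos."""
    bidx = file_pos // bsize  # zero-based block index
    return bidx + 1           # the on-disk field is one-based
```

The first block of a file therefore gets sequence number 1 rather than 0.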

ALSA: hda/realtek: Add mute LED quirk for HP Pavilion x360 14-dy1xxx [+ + +]

Author: Navon John Lukose <[email protected]>
Date:   Sat Mar 8 03:03:19 2025 +0530

    ALSA: hda/realtek: Add mute LED quirk for HP Pavilion x360 14-dy1xxx
    
    [ Upstream commit b11a74ac4f545626d0dc95a8ca8c41df90532bf3 ]
    
    Add a fixup to enable the mute LED on HP Pavilion x360 Convertible
    14-dy1xxx with ALC295 codec. The appropriate coefficient index and bits
    were identified through a brute-force method, as detailed in
    https://bbs.archlinux.org/viewtopic.php?pid=2079504#p2079504.
    
    Signed-off-by: Navon John Lukose <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda/realtek: Add support for ASUS ROG Strix G614 Laptops using CS35L41 HDA [+ + +]

Author: Stefan Binding <[email protected]>
Date:   Wed Mar 5 17:06:47 2025 +0000

    ALSA: hda/realtek: Add support for ASUS ROG Strix G614 Laptops using CS35L41 HDA
    
    [ Upstream commit 9120b2b4ad0dad2f6bbb6bcacd0456f806fda62d ]
    
    Add support for ASUS G614PH/PM/PP and G614FH/FM/FP.
    
    Laptops use 2 CS35L41 Amps with HDA, using Internal boost, with I2C
    
    Signed-off-by: Stefan Binding <[email protected]>
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda/realtek: Add support for ASUS Zenbook UM3406KA Laptops using CS35L41 HDA [+ + +]

Author: Stefan Binding <[email protected]>
Date:   Wed Mar 5 17:06:51 2025 +0000

    ALSA: hda/realtek: Add support for ASUS Zenbook UM3406KA Laptops using CS35L41 HDA
    
    [ Upstream commit 8463d2adbe1901247937fcdfe4b525130f6db10b ]
    
    Laptop uses 2 CS35L41 Amps with HDA, using External boost with I2C
    
    Signed-off-by: Stefan Binding <[email protected]>
    Signed-off-by: Takashi Iwai <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda/realtek: Always honor no_shutup_pins [+ + +]

Author: Takashi Iwai <[email protected]>
Date:   Sat Mar 15 15:30:19 2025 +0100

    ALSA: hda/realtek: Always honor no_shutup_pins
    
    [ Upstream commit 5a0c72c1da3cbc0cd4940a95d1be2830104c6edf ]
    
    The workaround for Dell machines to skip the pin-shutup for mic pins
    introduced alc_headset_mic_no_shutup(), which replaces the generic
    snd_hda_shutup_pins() for certain codecs.  The problem is that the
    call is made unconditionally, even if spec->no_shutup_pins is set.
    This seems to cause problems on other platforms such as Lenovo.
    
    This patch corrects the behavior: the driver now always honors the
    spec->no_shutup_pins flag and skips alc_headset_mic_no_shutup() if
    it's set.
    
    Fixes: dad3197da7a3 ("ALSA: hda/realtek - Fixup headphone noise via runtime suspend")
    Reported-and-tested-by: Oleg Gorobets <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda/realtek: Fix Asus Z13 2025 audio [+ + +]

Author: Antheas Kapenekakis <[email protected]>
Date:   Thu Feb 27 18:51:07 2025 +0100

    ALSA: hda/realtek: Fix Asus Z13 2025 audio
    
    [ Upstream commit 12784ca33b62fd327631749e6a0cd2a10110a56c ]
    
    Use the basic quirk for this type of amplifier. Sound now works through
    the speakers, headphones, and microphone, whereas none of them worked
    before.
    
    Tested-by: Kyle Gospodnetich <[email protected]>
    Signed-off-by: Antheas Kapenekakis <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ALSA: hda: Fix speakers on ASUS EXPERTBOOK P5405CSA 1.0 [+ + +]

Author: Daniel Bárta <[email protected]>
Date:   Thu Feb 27 17:12:55 2025 +0100

    ALSA: hda: Fix speakers on ASUS EXPERTBOOK P5405CSA 1.0
    
    [ Upstream commit f479ecc5ef15ed8d774968c1a8726a49420f11a0 ]
    
    After some digging around I have found that this laptop has Cirrus's smart
    amplifiers connected to the SPI bus (spi1-CSC3551:00-cs35l41-hda).
    
    To get them correctly detected and working I had to modify patch_realtek.c
    with ASUS EXPERTBOOK P5405CSA 1.0 SystemID (0x1043, 0x1f63) and add
    corresponding hda_quirk (ALC245_FIXUP_CS35L41_SPI_2).
    
    Signed-off-by: Daniel Bárta <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Takashi Iwai <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arch/powerpc: drop GENERIC_PTDUMP from mpc885_ads_defconfig [+ + +]

Author: Anshuman Khandual <[email protected]>
Date:   Wed Feb 26 17:54:01 2025 +0530

    arch/powerpc: drop GENERIC_PTDUMP from mpc885_ads_defconfig
    
    [ Upstream commit 2c5e6ac2db64ace51f66a9f3b3b3ab9553d748e8 ]
    
    GENERIC_PTDUMP gets selected on powerpc explicitly and hence can be
    dropped from mpc885_ads_defconfig.  Replace it with CONFIG_PTDUMP_DEBUGFS
    instead.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: e084728393a5 ("powerpc/ptdump: Convert powerpc to GENERIC_PTDUMP")
    Signed-off-by: Anshuman Khandual <[email protected]>
    Suggested-by: Christophe Leroy <[email protected]>
    Reviewed-by: Christophe Leroy <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Catalin Marinas <[email protected]>
    Cc: Heiko Carstens <[email protected]>
    Cc: Ingo Molnar <[email protected]>
    Cc: Jonathan Corbet <[email protected]>
    Cc: Marc Zyngier <[email protected]>
    Cc: Mark Rutland <[email protected]>
    Cc: Palmer Dabbelt <[email protected]>
    Cc: Paul Walmsley <[email protected]>
    Cc: Steven Price <[email protected]>
    Cc: Thomas Gleixner <[email protected]>
    Cc: Vasily Gorbik <[email protected]>
    Cc: Will Deacon <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arcnet: Add NULL check in com20020pci_probe() [+ + +]

Author: Henry Martin <[email protected]>
Date:   Wed Apr 2 21:50:36 2025 +0800

    arcnet: Add NULL check in com20020pci_probe()
    
    [ Upstream commit fda8c491db2a90ff3e6fbbae58e495b4ddddeca3 ]
    
    devm_kasprintf() returns NULL when memory allocation fails. Currently,
    com20020pci_probe() does not check for this case, which results in a
    NULL pointer dereference.
    
    Add NULL check after devm_kasprintf() to prevent this issue and ensure
    no resources are left allocated.
    
    Fixes: 6b17a597fc2f ("arcnet: restoring support for multiple Sohard Arcnet cards")
    Signed-off-by: Henry Martin <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

arm64: Don't call NULL in do_compat_alignment_fixup() [+ + +]

Author: Angelos Oikonomopoulos <[email protected]>
Date:   Tue Apr 1 10:51:50 2025 +0200

    arm64: Don't call NULL in do_compat_alignment_fixup()
    
    commit c28f31deeacda307acfee2f18c0ad904e5123aac upstream.
    
    do_alignment_t32_to_handler() only fixes up alignment faults for
    specific instructions; it returns NULL otherwise (e.g. LDREX). When
    that's the case, signal to the caller that it needs to proceed with the
    regular alignment fault handling (i.e. SIGBUS). Without this patch, the
    kernel panics:
    
      Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000
      Mem abort info:
        ESR = 0x0000000086000006
        EC = 0x21: IABT (current EL), IL = 32 bits
        SET = 0, FnV = 0
        EA = 0, S1PTW = 0
        FSC = 0x06: level 2 translation fault
      user pgtable: 4k pages, 48-bit VAs, pgdp=00000800164aa000
      [0000000000000000] pgd=0800081fdbd22003, p4d=0800081fdbd22003, pud=08000815d51c6003, pmd=0000000000000000
      Internal error: Oops: 0000000086000006 [#1] SMP
      Modules linked in: cfg80211 rfkill xt_nat xt_tcpudp xt_conntrack nft_chain_nat xt_MASQUERADE nf_nat nf_conntrack_netlink nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 xfrm_user xfrm_algo xt_addrtype nft_compat br_netfilter veth nvme_fa>
       libcrc32c crc32c_generic raid0 multipath linear dm_mod dax raid1 md_mod xhci_pci nvme xhci_hcd nvme_core t10_pi usbcore igb crc64_rocksoft crc64 crc_t10dif crct10dif_generic crct10dif_ce crct10dif_common usb_common i2c_algo_bit i2c>
      CPU: 2 PID: 3932954 Comm: WPEWebProcess Not tainted 6.1.0-31-arm64 #1  Debian 6.1.128-1
      Hardware name: GIGABYTE MP32-AR1-00/MP32-AR1-00, BIOS F18v (SCP: 1.08.20211002) 12/01/2021
      pstate: 80400009 (Nzcv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
      pc : 0x0
      lr : do_compat_alignment_fixup+0xd8/0x3dc
      sp : ffff80000f973dd0
      x29: ffff80000f973dd0 x28: ffff081b42526180 x27: 0000000000000000
      x26: 0000000000000000 x25: 0000000000000000 x24: 0000000000000000
      x23: 0000000000000004 x22: 0000000000000000 x21: 0000000000000001
      x20: 00000000e8551f00 x19: ffff80000f973eb0 x18: 0000000000000000
      x17: 0000000000000000 x16: 0000000000000000 x15: 0000000000000000
      x14: 0000000000000000 x13: 0000000000000000 x12: 0000000000000000
      x11: 0000000000000000 x10: 0000000000000000 x9 : ffffaebc949bc488
      x8 : 0000000000000000 x7 : 0000000000000000 x6 : 0000000000000000
      x5 : 0000000000400000 x4 : 0000fffffffffffe x3 : 0000000000000000
      x2 : ffff80000f973eb0 x1 : 00000000e8551f00 x0 : 0000000000000001
      Call trace:
       0x0
       do_alignment_fault+0x40/0x50
       do_mem_abort+0x4c/0xa0
       el0_da+0x48/0xf0
       el0t_32_sync_handler+0x110/0x140
       el0t_32_sync+0x190/0x194
      Code: bad PC value
      ---[ end trace 0000000000000000 ]---
    
    Signed-off-by: Angelos Oikonomopoulos <[email protected]>
    Fixes: 3fc24ef32d3b ("arm64: compat: Implement misalignment fixups for multiword loads")
    Cc: <[email protected]> # 6.1.x
    Reviewed-by: Anshuman Khandual <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Catalin Marinas <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
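
Note: the missing check described in the entry above follows a common pattern: a lookup that may return nothing must be tested before the result is called. This is a minimal Python sketch; the handler table, the function names and the return values are hypothetical and are not taken from the arm64 code.

```python
from typing import Callable, Optional

FIXED = 0
NOT_HANDLED = 1  # assumed convention: caller proceeds with regular fault handling


def emulate_ldm(addr: int) -> int:
    """Stand-in for a real fixup routine for a multiword load."""
    return FIXED


# Hypothetical table: only some instructions have a fixup handler.
HANDLERS = {"LDM": emulate_ldm}


def lookup_handler(mnemonic: str) -> Optional[Callable[[int], int]]:
    """Return the fixup handler for an instruction, or None if there is none."""
    return HANDLERS.get(mnemonic)


def alignment_fixup(mnemonic: str, addr: int) -> int:
    handler = lookup_handler(mnemonic)
    if handler is None:
        # Without this check the call below would go through a null handler.
        return NOT_HANDLED
    return handler(addr)
```

In this model an LDREX at an unaligned address is reported as not handled, so the caller can fall back to its regular path instead of crashing.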

ASoC: cs35l41: check the return value from spi_setup() [+ + +]

Author: Vitaliy Shevtsov <[email protected]>
Date:   Tue Mar 4 16:56:37 2025 +0500

    ASoC: cs35l41: check the return value from spi_setup()
    
    [ Upstream commit ad5a0970f86d82e39ebd06d45a1f7aa48a1316f8 ]
    
    Currently the return value from spi_setup() is not checked for a failure.
    It is unlikely it will ever fail in this particular case but it is still
    better to add this check for the sake of completeness and correctness. This
    is cheap since it is performed once when the device is being probed.
    
    Handle spi_setup() return value.
    
    Found by Linux Verification Center (linuxtesting.org) with Svace.
    
    Fixes: 872fc0b6bde8 ("ASoC: cs35l41: Set the max SPI speed for the whole device")
    Signed-off-by: Vitaliy Shevtsov <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: imx-card: Add NULL check in imx_card_probe() [+ + +]

Author: Henry Martin <[email protected]>
Date:   Tue Apr 1 22:25:10 2025 +0800

    ASoC: imx-card: Add NULL check in imx_card_probe()
    
    [ Upstream commit 93d34608fd162f725172e780b1c60cc93a920719 ]
    
    devm_kasprintf() returns NULL when memory allocation fails. Currently,
    imx_card_probe() does not check for this case, which results in a NULL
    pointer dereference.
    
    Add NULL check after devm_kasprintf() to prevent this issue.
    
    Fixes: aa736700f42f ("ASoC: imx-card: Add imx-card machine driver")
    Signed-off-by: Henry Martin <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ASoC: ti: j721e-evm: Fix clock configuration for ti,j7200-cpb-audio compatible [+ + +]

Author: Jayesh Choudhary <[email protected]>
Date:   Tue Mar 18 17:05:24 2025 +0530

    ASoC: ti: j721e-evm: Fix clock configuration for ti,j7200-cpb-audio compatible
    
    [ Upstream commit 45ff65e30deb919604e68faed156ad96ce7474d9 ]
    
    For 'ti,j7200-cpb-audio' compatible, there is support for only one PLL for
    48k. For 11025, 22050, 44100 and 88200 sampling rates, due to absence of
    J721E_CLK_PARENT_44100, we get EINVAL while running any audio application.
    Add support for these rates by using the 48k parent clock and adjusting
    the clock for these rates later in j721e_configure_refclk.
    
    Fixes: 6748d0559059 ("ASoC: ti: Add custom machine driver for j721e EVM (CPB and IVI)")
    Signed-off-by: Jayesh Choudhary <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Mark Brown <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
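
Note: the parent selection described in the entry above can be sketched as a fallback rule. This is a simplified Python model under stated assumptions: pick_parent() and the PARENT_* labels are hypothetical, and the later rate adjustment done in j721e_configure_refclk is not modeled.

```python
EINVAL = 22
PARENT_48000 = "48k"    # hypothetical label for the 48 kHz family PLL
PARENT_44100 = "44k1"   # hypothetical label for the 44.1 kHz family PLL


def pick_parent(rate: int, available: set):
    """Choose a parent clock for a sampling rate, or return -EINVAL."""
    if rate % 8000 == 0 and PARENT_48000 in available:
        return PARENT_48000
    if rate % 11025 == 0:
        if PARENT_44100 in available:
            return PARENT_44100
        if PARENT_48000 in available:
            # Fallback for boards with a single PLL: reuse the 48k parent.
            return PARENT_48000
    return -EINVAL
```

On a board with only the 48k parent, a 44100 Hz stream selects that parent in this model instead of failing with -EINVAL.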

bpf: Use preempt_count() directly in bpf_send_signal_common() [+ + +]

Author: Hou Tao <[email protected]>
Date:   Thu Feb 20 12:22:59 2025 +0800

    bpf: Use preempt_count() directly in bpf_send_signal_common()
    
    [ Upstream commit b4a8b5bba712a711d8ca1f7d04646db63f9c88f5 ]
    
    bpf_send_signal_common() uses preemptible() to check whether or not the
    current context is preemptible. If it is not preemptible, it will use
    irq_work to send the signal asynchronously instead of trying to hold a
    spin-lock, because spin-lock is sleepable under PREEMPT_RT.
    
    However, preemptible() depends on CONFIG_PREEMPT_COUNT. When
    CONFIG_PREEMPT_COUNT is turned off (e.g., CONFIG_PREEMPT_VOLUNTARY=y),
    !preemptible() will be evaluated as 1 and bpf_send_signal_common() will
    use irq_work unconditionally.
    
    Fix it by unfolding "!preemptible()" and using "preempt_count() != 0 ||
    irqs_disabled()" instead.
    
    Fixes: 87c544108b61 ("bpf: Send signals asynchronously if !preemptible")
    Signed-off-by: Hou Tao <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
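
Note: the effect of the configuration option described in the entry above can be modeled as follows. This is a minimal Python sketch, not the kernel macros: the function names are hypothetical, and the config_preempt_count flag stands in for CONFIG_PREEMPT_COUNT.

```python
def preemptible(config_preempt_count: bool, preempt_count: int,
                irqs_disabled: bool) -> bool:
    """Model of preemptible(): hard-wired to false without the config option."""
    if not config_preempt_count:
        return False
    return preempt_count == 0 and not irqs_disabled


def use_irq_work_old(config_preempt_count: bool, preempt_count: int,
                     irqs_disabled: bool) -> bool:
    """Old check: defer to irq_work whenever !preemptible()."""
    return not preemptible(config_preempt_count, preempt_count, irqs_disabled)


def use_irq_work_fixed(preempt_count: int, irqs_disabled: bool) -> bool:
    """Fixed check: the unfolded condition, independent of the config option."""
    return preempt_count != 0 or irqs_disabled
```

With the option off, the old check defers to irq_work even when preempt_count is 0 and interrupts are enabled; the fixed check does not.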

btrfs: handle errors from btrfs_dec_ref() properly [+ + +]

Author: Josef Bacik <[email protected]>
Date:   Tue May 7 14:12:15 2024 -0400

    btrfs: handle errors from btrfs_dec_ref() properly
    
    commit 5eb178f373b4f16f3b42d55ff88fc94dd95b93b1 upstream.
    
    In walk_up_proc() we BUG_ON(ret) from btrfs_dec_ref().  This is
    incorrect: we have proper error handling here, so return the error.
    
    Signed-off-by: Josef Bacik <[email protected]>
    Reviewed-by: David Sterba <[email protected]>
    Signed-off-by: David Sterba <[email protected]>
    Signed-off-by: Jianqi Ren <[email protected]>
    Signed-off-by: He Zhe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

can: statistics: use atomic access in hot path [+ + +]

Author: Oliver Hartkopp <[email protected]>
Date:   Mon Mar 10 15:33:53 2025 +0100

    can: statistics: use atomic access in hot path
    
    [ Upstream commit 80b5f90158d1364cbd80ad82852a757fc0692bf2 ]
    
    In can_send() and can_receive() CAN messages and CAN filter matches are
    counted to be visible in the CAN procfs files.
    
    KCSAN detected a data race within can_send() when two CAN frames have
    been generated by a timer event writing to the same CAN netdevice at the
    same time. Use atomic operations to access the statistics in the hot path
    to fix the KCSAN complaint.
    
    Reported-by: [email protected]
    Closes: https://lore.kernel.org/all/[email protected]
    Signed-off-by: Oliver Hartkopp <[email protected]>
    Reviewed-by: Vincent Mailhol <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Marc Kleine-Budde <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: amlogic: g12a: fix mmc A peripheral clock [+ + +]

Author: Jerome Brunet <[email protected]>
Date:   Fri Dec 13 11:03:23 2024 +0100

    clk: amlogic: g12a: fix mmc A peripheral clock
    
    [ Upstream commit 0079e77c08de692cb20b38e408365c830a44b1ef ]
    
    The bit index of the peripheral clock for mmc A is wrong.
    This was probably not a problem for mmc A as the peripheral is likely left
    enabled by the bootloader.
    
    No issue has been reported so far but it could be a problem, most likely
    some form of conflict between the ethernet and mmc A clock, breaking
    ethernet on init.
    
    Use the value provided by the documentation for mmc A before this
    becomes an actual problem.
    
    Fixes: 085a4ea93d54 ("clk: meson: g12a: add peripheral clock controller")
    Reviewed-by: Neil Armstrong <[email protected]>
    Link: https://lore.kernel.org/r/20241213-amlogic-clk-g12a-mmca-fix-v1-1-5af421f58b64@baylibre.com
    Signed-off-by: Jerome Brunet <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: amlogic: g12b: fix cluster A parent data [+ + +]

Author: Jerome Brunet <[email protected]>
Date:   Fri Dec 13 15:30:17 2024 +0100

    clk: amlogic: g12b: fix cluster A parent data
    
    [ Upstream commit 8995f8f108c3ac5ad52b12a6cfbbc7b3b32e9a58 ]
    
    Several clocks used by both g12a and g12b use the g12a cpu A clock hw
    pointer as clock parent. This is incorrect on g12b since the parents of
    cluster A cpu clock are different. Also the hw clock provided as parent to
    these children is not even a registered clock on g12b.
    
    Fix the problem by reverting to the global namespace and letting CCF pick
    the appropriate parent, as is already done for other clocks, such as
    cpu_clk_trace_div.
    
    Fixes: 25e682a02d91 ("clk: meson: g12a: migrate to the new parent description method")
    Reviewed-by: Neil Armstrong <[email protected]>
    Link: https://lore.kernel.org/r/20241213-amlogic-clk-g12a-cpua-parent-fix-v1-1-d8c0f41865fe@baylibre.com
    Signed-off-by: Jerome Brunet <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: amlogic: gxbb: drop incorrect flag on 32k clock [+ + +]

Author: Jerome Brunet <[email protected]>
Date:   Fri Dec 20 11:25:36 2024 +0100

    clk: amlogic: gxbb: drop incorrect flag on 32k clock
    
    [ Upstream commit f38f7fe4830c5cb4eac138249225f119e7939965 ]
    
    gxbb_32k_clk_div sets CLK_DIVIDER_ROUND_CLOSEST in the init_data flag which
    is incorrect. This field is not where the divider flags belong.
    
    Thankfully, CLK_DIVIDER_ROUND_CLOSEST maps to bit 4 which is an unused
    clock flag, so there is no unintended consequence to this error.
    
    Effectively, the clock has been used without CLK_DIVIDER_ROUND_CLOSEST
    so far, so just drop it.
    
    Fixes: 14c735c8e308 ("clk: meson-gxbb: Add EE 32K Clock for CEC")
    Reviewed-by: Neil Armstrong <[email protected]>
    Link: https://lore.kernel.org/r/20241220-amlogic-clk-gxbb-32k-fixes-v1-1-baca56ecf2db@baylibre.com
    Signed-off-by: Jerome Brunet <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: amlogic: gxbb: drop non existing 32k clock parent [+ + +]

Author: Jerome Brunet <[email protected]>
Date:   Fri Dec 20 11:25:37 2024 +0100

    clk: amlogic: gxbb: drop non existing 32k clock parent
    
    [ Upstream commit 7915d7d5407c026fa9343befb4d3343f7a345f97 ]
    
    The 32k clock references a parent 'cts_slow_oscin' with a fixme note saying
    that this clock should be provided by the AO controller.
    
    The HW probably has this clock but it does not exist at the moment in
    any controller implementation. Furthermore, referencing a clock by its
    global name should be avoided whenever possible.
    
    There is no reason to keep this hack around, at least for now.
    
    Fixes: 14c735c8e308 ("clk: meson-gxbb: Add EE 32K Clock for CEC")
    Reviewed-by: Neil Armstrong <[email protected]>
    Link: https://lore.kernel.org/r/20241220-amlogic-clk-gxbb-32k-fixes-v1-2-baca56ecf2db@baylibre.com
    Signed-off-by: Jerome Brunet <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: qcom: gcc-msm8953: fix stuck venus0_core0 clock [+ + +]

Author: Vladimir Lypak <[email protected]>
Date:   Sat Mar 15 16:26:18 2025 +0100

    clk: qcom: gcc-msm8953: fix stuck venus0_core0 clock
    
    [ Upstream commit cdc59600bccf2cb4c483645438a97d4ec55f326b ]
    
    This clock can't be enabled with VENUS_CORE0 GDSC turned off. But that
    GDSC is under HW control so it can be turned off at any moment.
    Instead of checking the dependent clock we can just vote for it to
    enable later when GDSC gets turned on.
    
    Fixes: 9bb6cfc3c77e6 ("clk: qcom: Add Global Clock Controller driver for MSM8953")
    Signed-off-by: Vladimir Lypak <[email protected]>
    Signed-off-by: Barnabás Czémán <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: qcom: mmcc-sdm660: fix stuck video_subcore0 clock [+ + +]

Author: Barnabás Czémán <[email protected]>
Date:   Sat Mar 15 16:26:17 2025 +0100

    clk: qcom: mmcc-sdm660: fix stuck video_subcore0 clock
    
    [ Upstream commit 000cbe3896c56bf5c625e286ff096533a6b27657 ]
    
    This clock can't be enabled with VENUS_CORE0 GDSC turned off. But that
    GDSC is under HW control so it can be turned off at any moment.
    Instead of checking the dependent clock we can just vote for it to
    enable later when GDSC gets turned on.
    
    Fixes: 5db3ae8b33de6 ("clk: qcom: Add SDM660 Multimedia Clock Controller (MMCC) driver")
    Signed-off-by: Barnabás Czémán <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: rockchip: rk3328: fix wrong clk_ref_usb3otg parent [+ + +]

Author: Peter Geis <[email protected]>
Date:   Wed Jan 15 01:26:22 2025 +0000

    clk: rockchip: rk3328: fix wrong clk_ref_usb3otg parent
    
    [ Upstream commit a9e60f1ffe1ca57d6af6a2573e2f950e76efbf5b ]
    
    Correct the clk_ref_usb3otg parent to fix clock control for the usb3
    controller on rk3328. Verified against the rk3328 trm, the rk3228h trm,
    and the rk3328 usb3 phy clock map.
    
    Fixes: fe3511ad8a1c ("clk: rockchip: add clock controller for rk3328")
    Signed-off-by: Peter Geis <[email protected]>
    Reviewed-by: Dragan Simic <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Heiko Stuebner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

clk: samsung: Fix UBSAN panic in samsung_clk_init() [+ + +]

Author: Will McVicker <[email protected]>
Date:   Wed Feb 12 10:32:52 2025 -0800

    clk: samsung: Fix UBSAN panic in samsung_clk_init()
    
    [ Upstream commit d19d7345a7bcdb083b65568a11b11adffe0687af ]
    
    With UBSAN_ARRAY_BOUNDS=y, I'm hitting the below panic due to
    dereferencing `ctx->clk_data.hws` before setting
    `ctx->clk_data.num = nr_clks`. Move that up to fix the crash.
    
      UBSAN: array index out of bounds: 00000000f2005512 [#1] PREEMPT SMP
      <snip>
      Call trace:
       samsung_clk_init+0x110/0x124 (P)
       samsung_clk_init+0x48/0x124 (L)
       samsung_cmu_register_one+0x3c/0xa0
       exynos_arm64_register_cmu+0x54/0x64
       __gs101_cmu_top_of_clk_init_declare+0x28/0x60
       ...
    
    Fixes: e620a1e061c4 ("drivers/clk: convert VL struct to struct_size")
    Signed-off-by: Will McVicker <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

context_tracking: Always inline ct_{nmi,irq}_{enter,exit}() [+ + +]

Author: Josh Poimboeuf <[email protected]>
Date:   Mon Mar 31 21:26:45 2025 -0700

    context_tracking: Always inline ct_{nmi,irq}_{enter,exit}()
    
    [ Upstream commit 9ac50f7311dc8b39e355582f14c1e82da47a8196 ]
    
    Thanks to CONFIG_DEBUG_SECTION_MISMATCH, empty functions can be
    generated out of line.  These can be called from noinstr code, so make
    sure they're always inlined.
    
    Fixes the following warnings:
    
      vmlinux.o: warning: objtool: irqentry_nmi_enter+0xa2: call to ct_nmi_enter() leaves .noinstr.text section
      vmlinux.o: warning: objtool: irqentry_nmi_exit+0x16: call to ct_nmi_exit() leaves .noinstr.text section
      vmlinux.o: warning: objtool: irqentry_exit+0x78: call to ct_irq_exit() leaves .noinstr.text section
    
    Fixes: 6f0e6c1598b1 ("context_tracking: Take IRQ eqs entrypoints over RCU")
    Reported-by: Randy Dunlap <[email protected]>
    Signed-off-by: Josh Poimboeuf <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Frederic Weisbecker <[email protected]>
    Cc: Paul E. McKenney <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/8509bce3f536bcd4ae7af3a2cf6930d48c5e631a.1743481539.git.jpoimboe@kernel.org
    Closes: https://lore.kernel.org/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

coresight-etm4x: add isb() before reading the TRCSTATR [+ + +]

Author: Yuanfang Zhang <[email protected]>
Date:   Thu Jan 16 17:04:20 2025 +0800

    coresight-etm4x: add isb() before reading the TRCSTATR
    
    [ Upstream commit 4ff6039ffb79a4a8a44b63810a8a2f2b43264856 ]
    
    As recommended by section 4.3.7 ("Synchronization when using system
    instructions to program the trace unit") of ARM IHI 0064H.b, the
    self-hosted trace analyzer must perform a Context synchronization
    event between writing to the TRCPRGCTLR and reading the TRCSTATR.
    Additionally, add an ISB between each read of TRCSTATR in
    coresight_timeout() when using system instructions to program the
    trace unit.
    
    Fixes: 1ab3bb9df5e3 ("coresight: etm4x: Add necessary synchronization for sysreg access")
    Signed-off-by: Yuanfang Zhang <[email protected]>
    Signed-off-by: Suzuki K Poulose <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

coresight: catu: Fix number of pages while using 64k pages [+ + +]

Author: Ilkka Koskinen <[email protected]>
Date:   Thu Jan 9 21:53:48 2025 +0000

    coresight: catu: Fix number of pages while using 64k pages
    
    [ Upstream commit 0e14e062f5ff98aa15264dfa87c5f5e924028561 ]
    
    Trying to record a trace on kernel with 64k pages resulted in -ENOMEM.
    This happens due to a bug in calculating the number of table pages, which
    returns zero. Fix the issue by rounding up.
    
    $ perf record --kcore -e cs_etm/@tmc_etr55,cycacc,branch_broadcast/k --per-thread taskset --cpu-list 1 dd if=/dev/zero of=/dev/null
    failed to mmap with 12 (Cannot allocate memory)
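    
    The effect of rounding up can be shown with a small userspace sketch.
    The macro and function names are hypothetical, and the sizes used to
    exercise them are illustrative rather than taken from the driver.

```c
#include <stddef.h>

/* Round-up integer division, the same idea as the kernel's DIV_ROUND_UP(). */
#define DIV_ROUND_UP_SKETCH(n, d) (((n) + (d) - 1) / (d))

/* Truncating division: yields 0 whenever bytes < page_size. */
size_t pages_truncated(size_t bytes, size_t page_size)
{
	return bytes / page_size;
}

/* Rounding up: any non-zero byte count needs at least one page. */
size_t pages_rounded_up(size_t bytes, size_t page_size)
{
	return DIV_ROUND_UP_SKETCH(bytes, page_size);
}
```

    With a 64k page size, a 4096-byte table gives 0 pages when truncated
    and 1 page when rounded up.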
    
    Fixes: 8ed536b1e283 ("coresight: catu: Add support for scatter gather tables")
    Signed-off-by: Ilkka Koskinen <[email protected]>
    Signed-off-by: Suzuki K Poulose <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

cpufreq: governor: Fix negative 'idle_time' handling in dbs_update() [+ + +]

Author: Jie Zhan <[email protected]>
Date:   Thu Feb 13 11:55:10 2025 +0800

    cpufreq: governor: Fix negative 'idle_time' handling in dbs_update()
    
    [ Upstream commit 3698dd6b139dc37b35a9ad83d9330c1f99666c02 ]
    
    We observed that the CPU frequency can't rise under a 100% CPU load when
    NOHZ is off and the 'conservative' governor is selected.
    
    'idle_time' can be negative if it's obtained from get_cpu_idle_time_jiffy()
    when NOHZ is off.  This was found and explained in commit 9485e4ca0b48
    ("cpufreq: governor: Fix handling of special cases in dbs_update()").
    
    However, commit 7592019634f8 ("cpufreq: governors: Fix long idle detection
    logic in load calculation") introduced a comparison between 'idle_time' and
    'sampling_rate' to detect a long idle interval.  While 'idle_time' is
    converted to int before the comparison, it is promoted back to unsigned
    when compared with the unsigned 'sampling_rate'.  Hence, a long idle
    interval is wrongly detected when the CPU is in fact 100% busy, and
    policy_dbs->idle_periods is set to a very large value.  'conservative'
    adjusts the frequency to the minimum because of the large 'idle_periods',
    so the frequency can't rise.  'ondemand' doesn't use
    policy_dbs->idle_periods, so it avoids the issue.
    
    Correct negative 'idle_time' to 0 before any use of it in dbs_update().
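    
    The promotion problem can be reproduced in a minimal userspace sketch.
    The function names are hypothetical and the comparison is simplified;
    it assumes a 32-bit int and is not the governor code itself.

```c
#include <stdbool.h>

/*
 * The (int) cast is undone by the usual arithmetic conversions: comparing
 * an int with an unsigned int converts the int back to unsigned, so a
 * "negative" idle_time compares as a huge positive value.
 */
bool long_idle_buggy(unsigned int idle_time, unsigned int sampling_rate)
{
	return (int)idle_time > sampling_rate;
}

/* Clamp a "negative" idle_time to 0 before any use of it. */
bool long_idle_fixed(unsigned int idle_time, unsigned int sampling_rate)
{
	if ((int)idle_time < 0)
		idle_time = 0;

	return idle_time > sampling_rate;
}
```

    Passing (unsigned int)-5 as idle_time makes the first variant report a
    long idle interval, while the clamped variant does not.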
    
    Fixes: 7592019634f8 ("cpufreq: governors: Fix long idle detection logic in load calculation")
    Signed-off-by: Jie Zhan <[email protected]>
    Reviewed-by: Chen Yu <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

cpufreq: scpi: compare kHz instead of Hz [+ + +]

Author: zuoqian <[email protected]>
Date:   Sat Jan 25 08:49:49 2025 +0000

    cpufreq: scpi: compare kHz instead of Hz
    
    [ Upstream commit 4742da9774a416908ef8e3916164192c15c0e2d1 ]
    
    The CPU rate from clk_get_rate() may not be divisible by 1000
    (e.g., 133333333). But the rate calculated from frequency(kHz) is
    always divisible by 1000 (e.g., 133333000).
    Comparing the rate causes a warning during CPU scaling:
    "cpufreq: __target_index: Failed to change cpu frequency: -5".
    When we choose to compare kHz here, the issue does not occur.
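    
    The two comparisons can be contrasted in a short sketch using the
    example values above. The function names are hypothetical.

```c
#include <stdbool.h>

/* Compare in Hz: fails when the clock rate is not a multiple of 1000. */
bool rate_matches_hz(unsigned long clk_rate_hz, unsigned long freq_khz)
{
	return clk_rate_hz == freq_khz * 1000;
}

/* Compare in kHz: the sub-kHz remainder is discarded before comparing. */
bool rate_matches_khz(unsigned long clk_rate_hz, unsigned long freq_khz)
{
	return clk_rate_hz / 1000 == freq_khz;
}
```

    For a clock rate of 133333333 Hz and a table frequency of 133333 kHz,
    only the kHz comparison matches.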
    
    Fixes: 343a8d17fa8d ("cpufreq: scpi: remove arm_big_little dependency")
    Signed-off-by: zuoqian <[email protected]>
    Reviewed-by: Dan Carpenter <[email protected]>
    Signed-off-by: Viresh Kumar <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

crypto: hisilicon/sec2 - fix for aead auth key length [+ + +]

Author: Wenkai Lin <[email protected]>
Date:   Wed Feb 5 11:56:26 2025 +0800

    crypto: hisilicon/sec2 - fix for aead auth key length
    
    [ Upstream commit 1b284ffc30b02808a0de698667cbcf5ce5f9144e ]
    
    According to the HMAC RFC, the authentication key
    can be 0 bytes, and the hardware can handle this
    scenario. Therefore, remove the incorrect validation
    for this case.
    
    Fixes: 2f072d75d1ab ("crypto: hisilicon - Add aead support on SEC2")
    Signed-off-by: Wenkai Lin <[email protected]>
    Signed-off-by: Chenghai Huang <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

crypto: hisilicon/sec2 - fix for aead authsize alignment [+ + +]

Author: Wenkai Lin <[email protected]>
Date:   Wed Feb 5 11:56:27 2025 +0800

    crypto: hisilicon/sec2 - fix for aead authsize alignment
    
    [ Upstream commit a49cc71e219040d771a8c1254879984f98192811 ]
    
    The hardware only supports authentication sizes
    that are 4-byte aligned. Therefore, the driver
    switches to software computation for sizes that
    are not 4-byte aligned.
    
    Fixes: 2f072d75d1ab ("crypto: hisilicon - Add aead support on SEC2")
    Signed-off-by: Wenkai Lin <[email protected]>
    Signed-off-by: Chenghai Huang <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

crypto: nx - Fix uninitialised hv_nxc on error [+ + +]

Author: Herbert Xu <[email protected]>
Date:   Sat Mar 15 16:50:42 2025 +0800

    crypto: nx - Fix uninitialised hv_nxc on error
    
    [ Upstream commit 9b00eb923f3e60ca76cbc8b31123716f3a87ac6a ]
    
    The compiler correctly warns that hv_nxc may be used uninitialised
    as that will occur when NX-GZIP is unavailable.
    
    Fix it by rearranging the code and delay setting caps_feat until
    the final query succeeds.
    
    Fixes: b4ba22114c78 ("crypto/nx: Get NX capabilities for GZIP coprocessor type")
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd/display: avoid NPD when ASIC does not support DMUB [+ + +]

Author: Thadeu Lima de Souza Cascardo <[email protected]>
Date:   Wed Feb 5 10:06:38 2025 -0300

    drm/amd/display: avoid NPD when ASIC does not support DMUB
    
    [ Upstream commit 42d9d7bed270247f134190ba0cb05bbd072f58c2 ]
    
    ctx->dmub_srv will be NULL if the ASIC does not support DMUB, which is
    tested in dm_dmub_sw_init.
    
    However, it will be dereferenced in dmub_hw_lock_mgr_cmd if
    should_use_dmub_lock returns true.
    
    This has been the case since dmub support has been added for PSR1.
    
    Fix this by checking for dmub_srv in should_use_dmub_lock.
    
    [   37.440832] BUG: kernel NULL pointer dereference, address: 0000000000000058
    [   37.447808] #PF: supervisor read access in kernel mode
    [   37.452959] #PF: error_code(0x0000) - not-present page
    [   37.458112] PGD 0 P4D 0
    [   37.460662] Oops: Oops: 0000 [#1] PREEMPT SMP NOPTI
    [   37.465553] CPU: 2 UID: 1000 PID: 1745 Comm: DrmThread Not tainted 6.14.0-rc1-00003-gd62e938120f0 #23 99720e1cb1e0fc4773b8513150932a07de3c6e88
    [   37.478324] Hardware name: Google Morphius/Morphius, BIOS Google_Morphius.13434.858.0 10/26/2023
    [   37.487103] RIP: 0010:dmub_hw_lock_mgr_cmd+0x77/0xb0
    [   37.492074] Code: 44 24 0e 00 00 00 00 48 c7 04 24 45 00 00 0c 40 88 74 24 0d 0f b6 02 88 44 24 0c 8b 01 89 44 24 08 85 f6 75 05 c6 44 24 0e 01 <48> 8b 7f 58 48 89 e6 ba 01 00 00 00 e8 08 3c 2a 00 65 48 8b 04 5
    [   37.510822] RSP: 0018:ffff969442853300 EFLAGS: 00010202
    [   37.516052] RAX: 0000000000000000 RBX: ffff92db03000000 RCX: ffff969442853358
    [   37.523185] RDX: ffff969442853368 RSI: 0000000000000001 RDI: 0000000000000000
    [   37.530322] RBP: 0000000000000001 R08: 00000000000004a7 R09: 00000000000004a5
    [   37.537453] R10: 0000000000000476 R11: 0000000000000062 R12: ffff92db0ade8000
    [   37.544589] R13: ffff92da01180ae0 R14: ffff92da011802a8 R15: ffff92db03000000
    [   37.551725] FS:  0000784a9cdfc6c0(0000) GS:ffff92db2af00000(0000) knlGS:0000000000000000
    [   37.559814] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [   37.565562] CR2: 0000000000000058 CR3: 0000000112b1c000 CR4: 00000000003506f0
    [   37.572697] Call Trace:
    [   37.575152]  <TASK>
    [   37.577258]  ? __die_body+0x66/0xb0
    [   37.580756]  ? page_fault_oops+0x3e7/0x4a0
    [   37.584861]  ? exc_page_fault+0x3e/0xe0
    [   37.588706]  ? exc_page_fault+0x5c/0xe0
    [   37.592550]  ? asm_exc_page_fault+0x22/0x30
    [   37.596742]  ? dmub_hw_lock_mgr_cmd+0x77/0xb0
    [   37.601107]  dcn10_cursor_lock+0x1e1/0x240
    [   37.605211]  program_cursor_attributes+0x81/0x190
    [   37.609923]  commit_planes_for_stream+0x998/0x1ef0
    [   37.614722]  update_planes_and_stream_v2+0x41e/0x5c0
    [   37.619703]  dc_update_planes_and_stream+0x78/0x140
    [   37.624588]  amdgpu_dm_atomic_commit_tail+0x4362/0x49f0
    [   37.629832]  ? srso_return_thunk+0x5/0x5f
    [   37.633847]  ? mark_held_locks+0x6d/0xd0
    [   37.637774]  ? _raw_spin_unlock_irq+0x24/0x50
    [   37.642135]  ? srso_return_thunk+0x5/0x5f
    [   37.646148]  ? lockdep_hardirqs_on+0x95/0x150
    [   37.650510]  ? srso_return_thunk+0x5/0x5f
    [   37.654522]  ? _raw_spin_unlock_irq+0x2f/0x50
    [   37.658883]  ? srso_return_thunk+0x5/0x5f
    [   37.662897]  ? wait_for_common+0x186/0x1c0
    [   37.666998]  ? srso_return_thunk+0x5/0x5f
    [   37.671009]  ? drm_crtc_next_vblank_start+0xc3/0x170
    [   37.675983]  commit_tail+0xf5/0x1c0
    [   37.679478]  drm_atomic_helper_commit+0x2a2/0x2b0
    [   37.684186]  drm_atomic_commit+0xd6/0x100
    [   37.688199]  ? __cfi___drm_printfn_info+0x10/0x10
    [   37.692911]  drm_atomic_helper_update_plane+0xe5/0x130
    [   37.698054]  drm_mode_cursor_common+0x501/0x670
    [   37.702600]  ? __cfi_drm_mode_cursor_ioctl+0x10/0x10
    [   37.707572]  drm_mode_cursor_ioctl+0x48/0x70
    [   37.711851]  drm_ioctl_kernel+0xf2/0x150
    [   37.715781]  drm_ioctl+0x363/0x590
    [   37.719189]  ? __cfi_drm_mode_cursor_ioctl+0x10/0x10
    [   37.724165]  amdgpu_drm_ioctl+0x41/0x80
    [   37.728013]  __se_sys_ioctl+0x7f/0xd0
    [   37.731685]  do_syscall_64+0x87/0x100
    [   37.735355]  ? vma_end_read+0x12/0xe0
    [   37.739024]  ? srso_return_thunk+0x5/0x5f
    [   37.743041]  ? find_held_lock+0x47/0xf0
    [   37.746884]  ? vma_end_read+0x12/0xe0
    [   37.750552]  ? srso_return_thunk+0x5/0x5f
    [   37.754565]  ? lock_release+0x1c4/0x2e0
    [   37.758406]  ? vma_end_read+0x12/0xe0
    [   37.762079]  ? exc_page_fault+0x84/0xe0
    [   37.765921]  ? srso_return_thunk+0x5/0x5f
    [   37.769938]  ? lockdep_hardirqs_on+0x95/0x150
    [   37.774303]  ? srso_return_thunk+0x5/0x5f
    [   37.778317]  ? exc_page_fault+0x84/0xe0
    [   37.782163]  entry_SYSCALL_64_after_hwframe+0x55/0x5d
    [   37.787218] RIP: 0033:0x784aa5ec3059
    [   37.790803] Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <41> 89 c0 3d 00 f0 ff ff 77 1d 48 8b 45 c8 64 48 2b 04 25 28 00 0
    [   37.809553] RSP: 002b:0000784a9cdf90e0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
    [   37.817121] RAX: ffffffffffffffda RBX: 0000784a9cdf917c RCX: 0000784aa5ec3059
    [   37.824256] RDX: 0000784a9cdf917c RSI: 00000000c01c64a3 RDI: 0000000000000020
    [   37.831391] RBP: 0000784a9cdf9130 R08: 0000000000000100 R09: 0000000000ff0000
    [   37.838525] R10: 0000000000000000 R11: 0000000000000246 R12: 0000025c01606ed0
    [   37.845657] R13: 0000025c00030200 R14: 00000000c01c64a3 R15: 0000000000000020
    [   37.852799]  </TASK>
    [   37.854992] Modules linked in:
    [   37.864546] gsmi: Log Shutdown Reason 0x03
    [   37.868656] CR2: 0000000000000058
    [   37.871979] ---[ end trace 0000000000000000 ]---
    [   37.880976] RIP: 0010:dmub_hw_lock_mgr_cmd+0x77/0xb0
    [   37.885954] Code: 44 24 0e 00 00 00 00 48 c7 04 24 45 00 00 0c 40 88 74 24 0d 0f b6 02 88 44 24 0c 8b 01 89 44 24 08 85 f6 75 05 c6 44 24 0e 01 <48> 8b 7f 58 48 89 e6 ba 01 00 00 00 e8 08 3c 2a 00 65 48 8b 04 5
    [   37.904703] RSP: 0018:ffff969442853300 EFLAGS: 00010202
    [   37.909933] RAX: 0000000000000000 RBX: ffff92db03000000 RCX: ffff969442853358
    [   37.917068] RDX: ffff969442853368 RSI: 0000000000000001 RDI: 0000000000000000
    [   37.924201] RBP: 0000000000000001 R08: 00000000000004a7 R09: 00000000000004a5
    [   37.931336] R10: 0000000000000476 R11: 0000000000000062 R12: ffff92db0ade8000
    [   37.938469] R13: ffff92da01180ae0 R14: ffff92da011802a8 R15: ffff92db03000000
    [   37.945602] FS:  0000784a9cdfc6c0(0000) GS:ffff92db2af00000(0000) knlGS:0000000000000000
    [   37.953689] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [   37.959435] CR2: 0000000000000058 CR3: 0000000112b1c000 CR4: 00000000003506f0
    [   37.966570] Kernel panic - not syncing: Fatal exception
    [   37.971901] Kernel Offset: 0x30200000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
    [   37.982840] gsmi: Log Shutdown Reason 0x02
    
    Fixes: b5c764d6ed55 ("drm/amd/display: Use HW lock mgr for PSR1")
    Signed-off-by: Thadeu Lima de Souza Cascardo <[email protected]>
    Cc: Sun peng Li <[email protected]>
    Cc: Tom Chung <[email protected]>
    Cc: Daniel Wheeler <[email protected]>
    Cc: Alex Deucher <[email protected]>
    Reviewed-by: Rodrigo Siqueira <[email protected]>
    Reviewed-by: Leo Li <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd/display: fix type mismatch in CalculateDynamicMetadataParameters() [+ + +]

Author: Vitaliy Shevtsov <[email protected]>
Date:   Thu Feb 27 01:28:51 2025 +0500

    drm/amd/display: fix type mismatch in CalculateDynamicMetadataParameters()
    
    [ Upstream commit c3c584c18c90a024a54716229809ba36424f9660 ]
    
    There is a type mismatch between what CalculateDynamicMetadataParameters()
    takes and what is passed to it. Currently this function accepts several
    args as signed long, but it is called with unsigned integers and an
    integer. On systems where long is 32 bits, an unsigned int param greater
    than INT_MAX may be passed in as a negative value.
    
    Fix this by changing these argument types from long to unsigned int and to
    int respectively. Also this will align the function's definition with
    similar functions in other dcn* drivers.
    
    Found by Linux Verification Center (linuxtesting.org) with Svace.
    
    Fixes: 6725a88f88a7 ("drm/amd/display: Add DCN3 DML")
    Signed-off-by: Vitaliy Shevtsov <[email protected]>
    Reviewed-by: Alex Hung <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/amd: Keep display off while going into S4 [+ + +]

Author: Mario Limonciello <[email protected]>
Date:   Thu Mar 6 12:51:24 2025 -0600

    drm/amd: Keep display off while going into S4
    
    [ Upstream commit 4afacc9948e1f8fdbca401d259ae65ad93d298c0 ]
    
    When userspace invokes S4 the flow is:
    
    1) amdgpu_pmops_prepare()
    2) amdgpu_pmops_freeze()
    3) Create hibernation image
    4) amdgpu_pmops_thaw()
    5) Write out image to disk
    6) Turn off system
    
    Then on resume amdgpu_pmops_restore() is called.
    
    This flow has a problem that because amdgpu_pmops_thaw() is called
    it will call amdgpu_device_resume() which will resume all of the GPU.
    
    This includes turning the display hardware back on and discovering
    connectors again.
    
    It is unexpected for the display to turn back on during this sequence.
    Adjust the flow so that during the S4 sequence display hardware is
    not turned back on.
    
    Reported-by: Xaver Hugl <[email protected]>
    Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/2038
    Cc: Muhammad Usama Anjum <[email protected]>
    Tested-by: Muhammad Usama Anjum <[email protected]>
    Acked-by: Alex Deucher <[email protected]>
    Acked-by: Harry Wentland <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Mario Limonciello <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    (cherry picked from commit 68bfdc8dc0a1a7fdd9ab61e69907ae71a6fd3d91)
    Signed-off-by: Sasha Levin <[email protected]>

drm/amdgpu/gfx11: fix num_mec [+ + +]

Author: Alex Deucher <[email protected]>
Date:   Wed Mar 26 09:35:02 2025 -0400

    drm/amdgpu/gfx11: fix num_mec
    
    [ Upstream commit 4161050d47e1b083a7e1b0b875c9907e1a6f1f1f ]
    
    GC11 only has 1 mec.
    
    Fixes: 3d879e81f0f9 ("drm/amdgpu: add init support for GFX11 (v2)")
    Reviewed-by: Sunil Khatri <[email protected]>
    Signed-off-by: Alex Deucher <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/bridge: it6505: fix HDCP V match check is not performed correctly [+ + +]

Author: Hermes Wu <[email protected]>
Date:   Tue Jan 21 15:01:51 2025 +0800

    drm/bridge: it6505: fix HDCP V match check is not performed correctly
    
    [ Upstream commit a5072fc77fb9e38fa9fd883642c83c3720049159 ]
    
    Fix a typo where V compare incorrectly compares av[] with av[] itself,
    which can result in HDCP failure.
    
    The V compare loop is expected to iterate 5 times, comparing the V
    array from av[0][] to av[4][]. It should check that the loop counter
    has reached its final value, "i == 5", before returning true.
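    
    The loop-counter check can be sketched as follows. This is a
    simplified illustration, not the driver code: the name v_match(), the
    second array bv and the 4-byte word size are assumptions.

```c
#include <stdbool.h>
#include <string.h>

#define V_WORDS		5	/* av[0][] .. av[4][] */
#define V_WORD_BYTES	4	/* assumed width of each V word */

/* Return true only if all five V words match. */
bool v_match(unsigned char av[V_WORDS][V_WORD_BYTES],
	     unsigned char bv[V_WORDS][V_WORD_BYTES])
{
	int i;

	for (i = 0; i < V_WORDS; i++) {
		/* Compare av against the other array, never against itself. */
		if (memcmp(av[i], bv[i], V_WORD_BYTES) != 0)
			break;
	}

	/* The loop ran to completion only if no word mismatched. */
	return i == V_WORDS;
}
```

    Returning true without checking the counter would also accept a loop
    that stopped early on a mismatch.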
    
    Fixes: 0989c02c7a5c ("drm/bridge: it6505: fix HDCP CTS compare V matching")
    Signed-off-by: Hermes Wu <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Robert Foss <[email protected]>
    Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

drm/bridge: ti-sn65dsi86: Fix multiple instances [+ + +]

Author: Geert Uytterhoeven <[email protected]>
Date:   Tue Dec 10 15:18:46 2024 +0100

    drm/bridge: ti-sn65dsi86: Fix multiple instances
    
    [ Upstream commit 574f5ee2c85a00a579549d50e9fc9c6c072ee4c4 ]
    
    Each bridge instance creates up to four auxiliary devices with different
    names.  However, their IDs are always zero, causing duplicate filename
    errors when a system has multiple bridges:
    
        sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ti_sn65dsi86.gpio.0'
    
    Fix this by using a unique instance ID per bridge instance.  The
    instance ID is derived from the I2C adapter number and the bridge's I2C
    address, to support multiple instances on the same bus.
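    
    One possible encoding of such an ID is sketched below. The exact
    formula used by the driver is not stated here, so the shift width and
    the name bridge_instance_id() are assumptions. The sketch relies only
    on I2C addresses being at most 10 bits wide.

```c
#include <stdint.h>

#define I2C_ADDR_BITS 10	/* I2C addresses are at most 10 bits wide */

/* Combine adapter number and device address into one instance ID. */
uint32_t bridge_instance_id(unsigned int adapter_nr, unsigned int i2c_addr)
{
	uint32_t addr_mask = (1u << I2C_ADDR_BITS) - 1;

	return ((uint32_t)adapter_nr << I2C_ADDR_BITS) | (i2c_addr & addr_mask);
}
```

    Two bridges on the same adapter differ in the address bits, and two
    bridges with the same address on different adapters differ in the
    upper bits.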
    
    Fixes: bf73537f411b ("drm/bridge: ti-sn65dsi86: Break GPIO and MIPI-to-eDP bridge into sub-drivers")
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Reviewed-by: Douglas Anderson <[email protected]>
    Signed-off-by: Douglas Anderson <[email protected]>
    Link: https://patchwork.freedesktop.org/patch/msgid/7a68a0e3f927e26edca6040067fb653eb06efb79.1733840089.git.geert+renesas@glider.be
    Signed-off-by: Sasha Levin <[email protected]>

drm/dp_mst: Fix drm RAD print [+ + +]

Author: Wayne Lin <[email protected]>
Date:   Mon Jan 13 17:10:59 2025 +0800

    drm/dp_mst: Fix drm RAD print
    
    [ Upstream commit 6bbce873a9c97cb12f5455c497be279ac58e707f ]
    
    [Why]
    The RAD of sideband message printed today is incorrect.
    For RAD stored within MST branch
    - If MST branch LCT is 1, its RAD array is untouched and remains 0.
    - If MST branch LCT is larger than 1, use nibble to store the up facing
      port number in cascaded sequence as illustrated below:
    
      u8 RAD[0] = (LCT_2_UFP << 4) | LCT_3_UFP
         RAD[1] = (LCT_4_UFP << 4) | LCT_5_UFP
         ...
    
    drm_dp_mst_rad_to_str() wrongly uses BIT_MASK(4) to fetch the port
    number of one nibble.
    
    [How]
    Adjust the code by:
    - RAD array items are valuable only for LCT >= 1.
    - Use 0xF as the mask to replace BIT_MASK(4)
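    
    The difference between the two masks can be seen in a minimal sketch.
    The kernel's BIT_MASK(4) evaluates to 1UL << 4, which is a single bit
    (0x10) rather than a nibble mask. The function names and the 0-based
    hop index are hypothetical.

```c
#include <stdint.h>

#define BIT4_SINGLE	0x10u	/* value of the kernel's BIT_MASK(4): one bit */
#define NIBBLE_MASK	0xFu	/* all four bits of a nibble */

/*
 * Return the port number stored for hop idx (0-based) in a RAD array.
 * Even hops sit in the high nibble, odd hops in the low nibble.
 */
uint8_t rad_port(const uint8_t *rad, unsigned int idx)
{
	unsigned int shift = (idx % 2) ? 0 : 4;

	return (rad[idx / 2] >> shift) & NIBBLE_MASK;
}

/* Same extraction with the single-bit mask: only bit 4 survives. */
uint8_t rad_port_buggy(const uint8_t *rad, unsigned int idx)
{
	unsigned int shift = (idx % 2) ? 0 : 4;

	return (rad[idx / 2] >> shift) & BIT4_SINGLE;
}
```

    For RAD[0] = 0x12, the nibble mask yields ports 1 and 2, while the
    single-bit mask yields 0 for the first hop.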
    
    V2:
    - Document how RAD is constructed (Imre)
    
    V3:
    - Adjust the comment for rad[] so kdoc formats it properly (Lyude)
    
    Fixes: 2f015ec6eab6 ("drm/dp_mst: Add sideband down request tracing + selftests")
    Cc: Imre Deak <[email protected]>
    Cc: Ville Syrjälä <[email protected]>
    Cc: Harry Wentland <[email protected]>
    Cc: Lyude Paul <[email protected]>
    Reviewed-by: Lyude Paul <[email protected]>
    Signed-off-by: Wayne Lin <[email protected]>
    Signed-off-by: Lyude Paul <[email protected]>
    Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

drm/mediatek: dsi: fix error codes in mtk_dsi_host_transfer() [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Wed Jan 8 12:35:57 2025 +0300

    drm/mediatek: dsi: fix error codes in mtk_dsi_host_transfer()
    
    [ Upstream commit dcb166ee43c3d594e7b73a24f6e8cf5663eeff2c ]
    
    There is a type bug in the return statement:
    
            return ret < 0 ? ret : recv_cnt;
    
    The issue is that ret is an int, recv_cnt is a u32 and the function
    returns ssize_t, which is a signed long.  The way that the type promotion
    works is that the negative error codes are first cast to u32 and then
    to signed long.  The error codes end up being positive instead of
    negative and the callers treat them as success.
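    
    The promotion can be reproduced in a userspace sketch. Here int64_t
    stands in for ssize_t on a 64-bit kernel, int is assumed to be 32
    bits, and the function names are hypothetical. The second function
    shows one way to keep the sign and is not necessarily the exact form
    of the fix.

```c
#include <stdint.h>

/* int64_t stands in for ssize_t on a 64-bit kernel. */
int64_t transfer_result_buggy(int ret, uint32_t recv_cnt)
{
	/*
	 * Both arms of ?: are converted to a common type, uint32_t, so a
	 * negative ret becomes a large positive value before widening.
	 */
	return ret < 0 ? ret : recv_cnt;
}

int64_t transfer_result_fixed(int ret, uint32_t recv_cnt)
{
	if (ret < 0)
		return ret;	/* int -> int64_t keeps the sign */

	return recv_cnt;
}
```

    With ret == -22, the first function returns 4294967274 and the second
    returns -22.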
    
    Fixes: 81cc7e51c4f1 ("drm/mediatek: Allow commands to be sent during video mode")
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/r/[email protected]/
    Signed-off-by: Dan Carpenter <[email protected]>
    Reviewed-by: Mattijs Korpershoek <[email protected]>
    Reviewed-by: AngeloGioacchino Del Regno <[email protected]>
    Reviewed-by: CK Hu <[email protected]>
    Link: https://patchwork.kernel.org/project/dri-devel/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/mediatek: mtk_hdmi: Fix typo for aud_sampe_size member [+ + +]

Author: AngeloGioacchino Del Regno <[email protected]>
Date:   Mon Feb 17 16:48:12 2025 +0100

    drm/mediatek: mtk_hdmi: Fix typo for aud_sampe_size member
    
    [ Upstream commit 72fcb88e7bbc053ed4fc74cebb0315b98a0f20c3 ]
    
    Rename member aud_sampe_size of struct hdmi_audio_param to
    aud_sample_size to fix a typo and enhance readability.
    
    This commit brings no functional changes.
    
    Fixes: 8f83f26891e1 ("drm/mediatek: Add HDMI support")
    Reviewed-by: CK Hu <[email protected]>
    Signed-off-by: AngeloGioacchino Del Regno <[email protected]>
    Link: https://patchwork.kernel.org/project/linux-mediatek/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/mediatek: mtk_hdmi: Unregister audio platform device on failure [+ + +]

Author: AngeloGioacchino Del Regno <[email protected]>
Date:   Mon Feb 17 16:48:10 2025 +0100

    drm/mediatek: mtk_hdmi: Unregister audio platform device on failure
    
    [ Upstream commit 0be123cafc06eed0fd1227166a66e786434b0c50 ]
    
    The probe function of this driver may fail after registering the
    audio platform device: in that case, the state is not getting
    cleaned up, leaving this device registered.
    
    Adding to the mix, should the probe function of this driver
    return a probe deferral for N times, we're registering up to N
    audio platform devices and, again, never freeing them up.
    
    To fix this, add a pointer to the audio platform device in the
    mtk_hdmi structure, and add a devm action to unregister it upon
    driver removal or probe failure.
    
    Fixes: 8f83f26891e1 ("drm/mediatek: Add HDMI support")
    Reviewed-by: CK Hu <[email protected]>
    Signed-off-by: AngeloGioacchino Del Regno <[email protected]>
    Link: https://patchwork.kernel.org/project/linux-mediatek/patch/[email protected]/
    Signed-off-by: Chun-Kuang Hu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/msm/dsi: Set PHY usescase (and mode) before registering DSI host [+ + +]

Author: Marijn Suijten <[email protected]>
Date:   Mon Feb 17 12:17:42 2025 +0100

    drm/msm/dsi: Set PHY usescase (and mode) before registering DSI host
    
    [ Upstream commit 660c396c98c061f9696bebacc178b74072e80054 ]
    
    Ordering issues here cause an uninitialized (default STANDALONE)
    usecase to be programmed (which appears to be a MUX) in some cases
    when msm_dsi_host_register() is called, causing the slave PLL in
    bonded-DSI mode to source from a clock parent (dsi1vco) that is off.
    
    This should seemingly not be a problem, as the actual dispcc clocks from
    DSI1 that are muxed in the clock tree of DSI0 are way further down, but
    this bit still seems to have an effect on them somehow and causes the
    right side of the panel controlled by DSI1 to not function.
    
    In an ideal world this code is refactored to no longer have such
    error-prone calls "across subsystems", and instead model the "PLL src"
    register field as a regular mux so that changing the clock parents
    programmatically or in DTS via `assigned-clock-parents` has the
    desired effect.
    But for the avid reader, the clocks that we *are* muxing into DSI0's
    tree are way further down, so if this bit turns out to be a simple mux
    between dsiXvco and out_div, that shouldn't have any effect as this
    whole tree is off anyway.
    
    Fixes: 57bf43389337 ("drm/msm/dsi: Pass down use case to PHY")
    Reviewed-by: Abhinav Kumar <[email protected]>
    Reviewed-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Marijn Suijten <[email protected]>
    Patchwork: https://patchwork.freedesktop.org/patch/637650/
    Link: https://lore.kernel.org/r/20250217-drm-msm-initial-dualpipe-dsc-fixes-v3-2-913100d6103f@somainline.org
    Signed-off-by: Dmitry Baryshkov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm/vkms: Fix use after free and double free on init error [+ + +]

Author: José Expósito <[email protected]>
Date:   Wed Feb 12 09:49:12 2025 +0100

    drm/vkms: Fix use after free and double free on init error
    
    [ Upstream commit ed15511a773df86205bda66c37193569575ae828 ]
    
    If the driver initialization fails, the vkms_exit() function might
    access an uninitialized or freed default_config pointer and it might
    double free it.
    
    Fix both possible errors by initializing default_config only when the
    driver initialization succeeded.
    
    Reported-by: Louis Chauvet <[email protected]>
    Closes: https://lore.kernel.org/all/Z5uDHcCmAwiTsGte@louis-chauvet-laptop/
    Fixes: 2df7af93fdad ("drm/vkms: Add vkms_config type")
    Signed-off-by: José Expósito <[email protected]>
    Reviewed-by: Thomas Zimmermann <[email protected]>
    Reviewed-by: Louis Chauvet <[email protected]>
    Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
    Signed-off-by: Louis Chauvet <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

drm: xlnx: zynqmp: Fix max dma segment size [+ + +]

Author: Tomi Valkeinen <[email protected]>
Date:   Wed Jan 15 11:03:39 2025 +0200

    drm: xlnx: zynqmp: Fix max dma segment size
    
    [ Upstream commit 28b529a98525123acd37372a04d21e87ec2edcf7 ]
    
    Fix "mapping sg segment longer than device claims to support" warning by
    setting the max segment size.
    
    Fixes: d76271d22694 ("drm: xlnx: DRM/KMS driver for Xilinx ZynqMP DisplayPort Subsystem")
    Reviewed-by: Sean Anderson <[email protected]>
    Tested-by: Sean Anderson <[email protected]>
    Signed-off-by: Tomi Valkeinen <[email protected]>
    Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

EDAC/ie31200: Fix the DIMM size mask for several SoCs [+ + +]

Author: Qiuxu Zhuo <[email protected]>
Date:   Mon Mar 10 09:14:02 2025 +0800

    EDAC/ie31200: Fix the DIMM size mask for several SoCs
    
    [ Upstream commit 3427befbbca6b19fe0e37f91d66ce5221de70bf1 ]
    
    The DIMM size mask for {Sky, Kaby, Coffee} Lake is not bits{7:0},
    but bits{5:0}. Fix it.
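    
    The effect of the narrower mask can be shown in a small sketch. The
    macro is a userspace equivalent of the kernel's GENMASK() for 32-bit
    values, and the function names and register value are hypothetical.

```c
/* Userspace equivalent of GENMASK(h, l), assuming a 32-bit unsigned int. */
#define GENMASK_U32(h, l) ((~0u >> (31 - (h))) & (~0u << (l)))

/* Old extraction: bits 7:0. */
unsigned int dimm_size_field_old(unsigned int reg)
{
	return reg & GENMASK_U32(7, 0);
}

/* New extraction: bits 5:0 only. */
unsigned int dimm_size_field_new(unsigned int reg)
{
	return reg & GENMASK_U32(5, 0);
}
```

    For a register value of 0xC8, the old mask returns 0xC8 while the new
    mask returns 0x08, because bits 7:6 are no longer counted as size.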
    
    Fixes: 953dee9bbd24 ("EDAC, ie31200_edac: Add Skylake support")
    Signed-off-by: Qiuxu Zhuo <[email protected]>
    Signed-off-by: Tony Luck <[email protected]>
    Tested-by: Gary Wang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

EDAC/ie31200: Fix the error path order of ie31200_init() [+ + +]

Author: Qiuxu Zhuo <[email protected]>
Date:   Mon Mar 10 09:14:03 2025 +0800

    EDAC/ie31200: Fix the error path order of ie31200_init()
    
    [ Upstream commit 231e341036d9988447e3b3345cf741a98139199e ]
    
    The error path order of ie31200_init() is incorrect, fix it.
    
    Fixes: 709ed1bcef12 ("EDAC/ie31200: Fallback if host bridge device is already initialized")
    Signed-off-by: Qiuxu Zhuo <[email protected]>
    Signed-off-by: Tony Luck <[email protected]>
    Tested-by: Gary Wang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

EDAC/ie31200: Fix the size of EDAC_MC_LAYER_CHIP_SELECT layer [+ + +]

Author: Qiuxu Zhuo <[email protected]>
Date:   Mon Mar 10 09:14:01 2025 +0800

    EDAC/ie31200: Fix the size of EDAC_MC_LAYER_CHIP_SELECT layer
    
    [ Upstream commit d59d844e319d97682c8de29b88d2d60922a683b3 ]
    
    The EDAC_MC_LAYER_CHIP_SELECT layer pertains to the rank, not the DIMM.
    Fix its size to reflect the number of ranks instead of the number of DIMMs.
    Also delete the unused macros IE31200_{DIMMS,RANKS}.
    
    Fixes: 7ee40b897d18 ("ie31200_edac: Introduce the driver")
    Signed-off-by: Qiuxu Zhuo <[email protected]>
    Signed-off-by: Tony Luck <[email protected]>
    Tested-by: Gary Wang <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

exfat: fix the infinite loop in exfat_find_last_cluster() [+ + +]

Author: Yuezhang Mo <[email protected]>
Date:   Mon Mar 17 10:53:10 2025 +0800

    exfat: fix the infinite loop in exfat_find_last_cluster()
    
    [ Upstream commit b0522303f67255926b946aa66885a0104d1b2980 ]
    
    In exfat_find_last_cluster(), the cluster chain is traversed until
    the EOF cluster. If the cluster chain includes a loop due to file
    system corruption, the EOF cluster cannot be traversed, resulting
    in an infinite loop.
    
    If the number of clusters indicated by the file size is inconsistent
    with the cluster chain length, exfat_find_last_cluster() will return
    an error, so if this inconsistency is found, the traversal can be
    aborted without traversing to the EOF cluster.
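    
    The idea can be sketched in userspace as follows. This is only an
    illustration under stated assumptions: the in-memory table, the
    EOF_CLUSTER value and the function name are hypothetical and do not
    mirror the exfat data structures.

```c
#include <stddef.h>

#define EOF_CLUSTER 0xFFFFFFFFu

/*
 * Hypothetical in-memory FAT: fat[i] holds the cluster that follows
 * cluster i, or EOF_CLUSTER at the end of the chain.
 *
 * Returns 0 and stores the last cluster in *last on success, or -1 if
 * the chain does not end after exactly expected_clusters entries.
 */
static int find_last_cluster(const unsigned int *fat, size_t fat_len,
			     unsigned int start, size_t expected_clusters,
			     unsigned int *last)
{
	unsigned int clu = start;
	size_t count = 0;

	while (clu != EOF_CLUSTER) {
		if (clu >= fat_len)
			return -1;	/* entry points outside the table */
		if (++count > expected_clusters)
			return -1;	/* longer than the file size allows */
		*last = clu;
		clu = fat[clu];
	}

	return count == expected_clusters ? 0 : -1;
}
```

    Because the walk stops as soon as the count exceeds the number of
    clusters implied by the file size, a looping chain terminates with
    an error instead of spinning forever.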
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=f7d147e6db52b1e09dba
    Tested-by: [email protected]
    Fixes: 31023864e67a ("exfat: add fat entry operations")
    Signed-off-by: Yuezhang Mo <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ext4: don't over-report free space or inodes in statvfs [+ + +]

Author: Theodore Ts'o <[email protected]>
Date:   Fri Mar 14 00:38:42 2025 -0400

    ext4: don't over-report free space or inodes in statvfs
    
    commit f87d3af7419307ae26e705a2b2db36140db367a2 upstream.
    
    This fixes a bug analogous to the one fixed in xfs in commit
    4b8d867ca6e2 ("xfs: don't over-report free space or inodes in
    statvfs"), where statfs can report misleading or incorrect information
    when project quota is enabled and the free space is less than the
    remaining quota.
    
    This commit will resolve a test failure in generic/762 which tests for
    this bug.
    
    Cc: [email protected]
    Fixes: 689c958cbe6b ("ext4: add project quota support")
    Signed-off-by: Theodore Ts'o <[email protected]>
    Reviewed-by: "Darrick J. Wong" <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ext4: fix OOB read when checking dotdot dir [+ + +]

Author: Acs, Jakub <[email protected]>
Date:   Thu Mar 20 15:46:49 2025 +0000

    ext4: fix OOB read when checking dotdot dir
    
    commit d5e206778e96e8667d3bde695ad372c296dc9353 upstream.
    
    Mounting a corrupted filesystem with a directory that contains a '.' dir
    entry with rec_len == block size results in an out-of-bounds read (later
    on, when the corrupted directory is removed).
    
    ext4_empty_dir() assumes every ext4 directory contains at least '.'
    and '..' as directory entries in the first data block. It first loads
    the '.' dir entry, performs sanity checks by calling ext4_check_dir_entry()
    and then uses its rec_len member to compute the location of '..' dir
    entry (in ext4_next_entry). It assumes the '..' dir entry fits into the
    same data block.
    
    If the rec_len of '.' is precisely one block (4KB), it slips through the
    sanity checks (it is considered the last directory entry in the data
    block) and leaves "struct ext4_dir_entry_2 *de" pointing exactly past the
    memory slot allocated to the data block. The following call to
    ext4_check_dir_entry() on the new value of de then dereferences this
    pointer, which results in an out-of-bounds memory access.
    
    Fix this by extending __ext4_check_dir_entry() to check for '.' dir
    entries that reach the end of data block. Make sure to ignore the phony
    dir entries for checksum (by checking name_len for non-zero).
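    
    A simplified sketch of the added condition, assuming a cut-down
    stand-in for struct ext4_dir_entry_2; the struct layout and the
    helper name below are hypothetical:

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Simplified stand-in for struct ext4_dir_entry_2 (hypothetical). */
struct dir_entry {
	uint32_t inode;
	uint16_t rec_len;
	uint8_t  name_len;
	uint8_t  file_type;
	char     name[8];
};

/*
 * Returns true if the entry at byte offset `offset` in a block of
 * `block_size` bytes is a '.' entry whose rec_len reaches the end of
 * the block, which would leave no room for '..'.
 */
static bool dot_is_last_in_block(const struct dir_entry *de,
				 size_t offset, size_t block_size)
{
	size_t next_offset = offset + de->rec_len;

	/* name_len == 0 marks the phony checksum entry; it never matches. */
	return next_offset == block_size &&
	       de->name_len == 1 && de->name[0] == '.';
}
```

    A caller would treat a true result as corruption and refuse to step
    to the next entry, since that entry would start past the block.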
    
    Note: This is reported by KASAN as use-after-free in case another
    structure was recently freed from the slot past the bound, but it is
    really an OOB read.
    
    This issue was found by syzkaller tool.
    
    Call Trace:
    [   38.594108] BUG: KASAN: slab-use-after-free in __ext4_check_dir_entry+0x67e/0x710
    [   38.594649] Read of size 2 at addr ffff88802b41a004 by task syz-executor/5375
    [   38.595158]
    [   38.595288] CPU: 0 UID: 0 PID: 5375 Comm: syz-executor Not tainted 6.14.0-rc7 #1
    [   38.595298] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.3-0-ga6ed6b701f0a-prebuilt.qemu.org 04/01/2014
    [   38.595304] Call Trace:
    [   38.595308]  <TASK>
    [   38.595311]  dump_stack_lvl+0xa7/0xd0
    [   38.595325]  print_address_description.constprop.0+0x2c/0x3f0
    [   38.595339]  ? __ext4_check_dir_entry+0x67e/0x710
    [   38.595349]  print_report+0xaa/0x250
    [   38.595359]  ? __ext4_check_dir_entry+0x67e/0x710
    [   38.595368]  ? kasan_addr_to_slab+0x9/0x90
    [   38.595378]  kasan_report+0xab/0xe0
    [   38.595389]  ? __ext4_check_dir_entry+0x67e/0x710
    [   38.595400]  __ext4_check_dir_entry+0x67e/0x710
    [   38.595410]  ext4_empty_dir+0x465/0x990
    [   38.595421]  ? __pfx_ext4_empty_dir+0x10/0x10
    [   38.595432]  ext4_rmdir.part.0+0x29a/0xd10
    [   38.595441]  ? __dquot_initialize+0x2a7/0xbf0
    [   38.595455]  ? __pfx_ext4_rmdir.part.0+0x10/0x10
    [   38.595464]  ? __pfx___dquot_initialize+0x10/0x10
    [   38.595478]  ? down_write+0xdb/0x140
    [   38.595487]  ? __pfx_down_write+0x10/0x10
    [   38.595497]  ext4_rmdir+0xee/0x140
    [   38.595506]  vfs_rmdir+0x209/0x670
    [   38.595517]  ? lookup_one_qstr_excl+0x3b/0x190
    [   38.595529]  do_rmdir+0x363/0x3c0
    [   38.595537]  ? __pfx_do_rmdir+0x10/0x10
    [   38.595544]  ? strncpy_from_user+0x1ff/0x2e0
    [   38.595561]  __x64_sys_unlinkat+0xf0/0x130
    [   38.595570]  do_syscall_64+0x5b/0x180
    [   38.595583]  entry_SYSCALL_64_after_hwframe+0x76/0x7e
    
    Fixes: ac27a0ec112a0 ("[PATCH] ext4: initial copy of files from ext3")
    Signed-off-by: Jakub Acs <[email protected]>
    Cc: Theodore Ts'o <[email protected]>
    Cc: Andreas Dilger <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Cc: Mahmoud Adam <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Theodore Ts'o <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

fbdev: au1100fb: Move a variable assignment behind a null pointer check [+ + +]

Author: Markus Elfring <[email protected]>
Date:   Thu Apr 13 21:35:36 2023 +0200

    fbdev: au1100fb: Move a variable assignment behind a null pointer check
    
    [ Upstream commit 2df2c0caaecfd869b49e14f2b8df822397c5dd7f ]
    
    The address of a data structure member was determined before
    a corresponding null pointer check in the implementation of
    the function “au1100fb_setmode”.
    
    This issue was detected by using the Coccinelle software.
    
    Fixes: 3b495f2bb749 ("Au1100 FB driver uplift for 2.6.")
    Signed-off-by: Markus Elfring <[email protected]>
    Acked-by: Uwe Kleine-König <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fbdev: sm501fb: Add some geometry checks. [+ + +]

Author: Danila Chernetsov <[email protected]>
Date:   Wed Mar 19 01:30:11 2025 +0000

    fbdev: sm501fb: Add some geometry checks.
    
    [ Upstream commit aee50bd88ea5fde1ff4cc021385598f81a65830c ]
    
    Added checks for xoffset, yoffset settings.
    Incorrect settings of these parameters can lead to errors
    in sm501fb_pan_ functions.
    
    Found by Linux Verification Center (linuxtesting.org) with SVACE.
    
    Fixes: 5fc404e47bdf ("[PATCH] fb: SM501 framebuffer driver")
    Signed-off-by: Danila Chernetsov <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fs/ntfs3: Fix a couple integer overflows on 32bit systems [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Sun Feb 16 23:52:00 2025 +0300

    fs/ntfs3: Fix a couple integer overflows on 32bit systems
    
    [ Upstream commit 5ad414f4df2294b28836b5b7b69787659d6aa708 ]
    
    On 32bit systems the "off + sizeof(struct NTFS_DE)" addition can
    have an integer wrapping issue.  Fix it by using size_add().
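    
    For illustration, a userspace stand-in for the saturating behaviour
    of size_add(); size_add_sat() and entry_fits() are hypothetical
    names, and the bounds check is a simplified assumption rather than
    the ntfs3 code:

```c
#include <stddef.h>
#include <stdint.h>

/* Userspace stand-in for the kernel's size_add(): saturate at SIZE_MAX. */
static size_t size_add_sat(size_t a, size_t b)
{
	return (a > SIZE_MAX - b) ? SIZE_MAX : a + b;
}

/* Hypothetical check: does a header of hdr_size bytes at off fit in used? */
static int entry_fits(size_t off, size_t hdr_size, size_t used)
{
	return size_add_sat(off, hdr_size) <= used;
}
```

    With a plain "off + hdr_size", an offset close to SIZE_MAX wraps to
    a small value and passes the check; the saturating sum stays at
    SIZE_MAX and is rejected.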
    
    Fixes: 82cae269cfa9 ("fs/ntfs3: Add initialization of super block")
    Signed-off-by: Dan Carpenter <[email protected]>
    Signed-off-by: Konstantin Komarov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fs/procfs: fix the comment above proc_pid_wchan() [+ + +]

Author: Bart Van Assche <[email protected]>
Date:   Wed Mar 19 14:02:22 2025 -0700

    fs/procfs: fix the comment above proc_pid_wchan()
    
    [ Upstream commit 6287fbad1cd91f0c25cdc3a580499060828a8f30 ]
    
    proc_pid_wchan() used to report kernel addresses to user space but that is
    no longer the case today.  Bring the comment above proc_pid_wchan() in
    sync with the implementation.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: b2f73922d119 ("fs/proc, core/debug: Don't expose absolute kernel addresses via wchan")
    Signed-off-by: Bart Van Assche <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Eric W. Biederman <[email protected]>
    Cc: Alexey Dobriyan <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

fuse: fix dax truncate/punch_hole fault path [+ + +]

Author: Alistair Popple <[email protected]>
Date:   Fri Feb 28 14:30:56 2025 +1100

    fuse: fix dax truncate/punch_hole fault path
    
    [ Upstream commit 7851bf649d423edd7286b292739f2eefded3d35c ]
    
    Patch series "fs/dax: Fix ZONE_DEVICE page reference counts", v9.
    
    Device and FS DAX pages have always maintained their own page reference
    counts without following the normal rules for page reference counting.  In
    particular pages are considered free when the refcount hits one rather
    than zero and refcounts are not added when mapping the page.
    
    Tracking this requires special PTE bits (PTE_DEVMAP) and a secondary
    mechanism for allowing GUP to hold references on the page (see
    get_dev_pagemap).  However there doesn't seem to be any reason why FS DAX
    pages need their own reference counting scheme.
    
    By treating the refcounts on these pages the same way as normal pages we
    can remove a lot of special checks.  In particular pXd_trans_huge()
    becomes the same as pXd_leaf(), although I haven't made that change here.
    It also frees up a valuable SW define PTE bit on architectures that have
    devmap PTE bits defined.
    
    It also almost certainly allows further clean-up of the devmap managed
    functions, but I have left that as a future improvement.  It also enables
    support for compound ZONE_DEVICE pages which is one of my primary
    motivators for doing this work.
    
    This patch (of 20):
    
    FS DAX requires file systems to call into the DAX layout prior to
    unlinking inodes to ensure there is no ongoing DMA or other remote access
    to the direct mapped page.  The fuse file system implements
    fuse_dax_break_layouts() to do this which includes a comment indicating
    that passing dmap_end == 0 leads to unmapping of the whole file.
    
    However this is not true - passing dmap_end == 0 will not unmap anything
    before dmap_start, and furthermore dax_layout_busy_page_range() will not
    scan any of the range to see if there may be ongoing DMA access to the
    range.  Fix this by passing -1 for dmap_end to fuse_dax_break_layouts()
    which will invalidate the entire file range to
    dax_layout_busy_page_range().
    
    Link: https://lkml.kernel.org/r/cover.8068ad144a7eea4a813670301f4d2a86a8e68ec4.1740713401.git-series.apopple@nvidia.com
    Link: https://lkml.kernel.org/r/f09a34b6c40032022e4ddee6fadb7cc676f08867.1740713401.git-series.apopple@nvidia.com
    Fixes: 6ae330cad6ef ("virtiofs: serialize truncate/punch_hole and dax fault path")
    Signed-off-by: Alistair Popple <[email protected]>
    Co-developed-by: Dan Williams <[email protected]>
    Signed-off-by: Dan Williams <[email protected]>
    Reviewed-by: Balbir Singh <[email protected]>
    Tested-by: Alison Schofield <[email protected]>
    Cc: Vivek Goyal <[email protected]>
    Cc: Alexander Gordeev <[email protected]>
    Cc: Asahi Lina <[email protected]>
    Cc: Bjorn Helgaas <[email protected]>
    Cc: Catalin Marinas <[email protected]>
    Cc: Christian Borntraeger <[email protected]>
    Cc: Christoph Hellwig <[email protected]>
    Cc: Chunyan Zhang <[email protected]>
    Cc: "Darrick J. Wong" <[email protected]>
    Cc: Dave Chinner <[email protected]>
    Cc: Dave Hansen <[email protected]>
    Cc: Dave Jiang <[email protected]>
    Cc: David Hildenbrand <[email protected]>
    Cc: Gerald Schaefer <[email protected]>
    Cc: Heiko Carstens <[email protected]>
    Cc: Huacai Chen <[email protected]>
    Cc: Ira Weiny <[email protected]>
    Cc: Jan Kara <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: John Hubbard <[email protected]>
    Cc: linmiaohe <[email protected]>
    Cc: Logan Gunthorpe <[email protected]>
    Cc: Matthew Wilcox (Oracle) <[email protected]>
    Cc: Michael "Camp Drill Sergeant" Ellerman <[email protected]>
    Cc: Nicholas Piggin <[email protected]>
    Cc: Peter Xu <[email protected]>
    Cc: Sven Schnelle <[email protected]>
    Cc: Ted Ts'o <[email protected]>
    Cc: Vasily Gorbik <[email protected]>
    Cc: Vishal Verma <[email protected]>
    Cc: WANG Xuerui <[email protected]>
    Cc: Will Deacon <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

HID: i2c-hid: improve i2c_hid_get_report error message [+ + +]

Author: Wentao Guan <[email protected]>
Date:   Fri Feb 14 19:04:18 2025 +0800

    HID: i2c-hid: improve i2c_hid_get_report error message
    
    [ Upstream commit 723aa55c08c9d1e0734e39a815fd41272eac8269 ]
    
    There are two places that print "failed to set a report to ...".
    In i2c_hid_get_report(), use "get a report from" instead of "set a
    report to", so that people less familiar with the module can tell
    where the error happened.
    
    Before:
    i2c_hid_acpi i2c-FTSC1000:00: failed to set a report to device: -11
    
    After:
    i2c_hid_acpi i2c-FTSC1000:00: failed to get a report from device: -11
    
    Signed-off-by: Wentao Guan <[email protected]>
    Signed-off-by: Jiri Kosina <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

HID: remove superfluous (and wrong) Makefile entry for CONFIG_INTEL_ISH_FIRMWARE_DOWNLOADER [+ + +]

Author: Jiri Kosina <[email protected]>
Date:   Wed Mar 12 09:08:22 2025 +0100

    HID: remove superfluous (and wrong) Makefile entry for CONFIG_INTEL_ISH_FIRMWARE_DOWNLOADER
    
    [ Upstream commit fe0fb58325e519008e2606a5aa2cff7ad23e212d ]
    
    The line
    
            obj-$(INTEL_ISH_FIRMWARE_DOWNLOADER)   += intel-ish-hid/
    
    in top-level HID Makefile is both superfluous (as CONFIG_INTEL_ISH_FIRMWARE_DOWNLOADER
    depends on CONFIG_INTEL_ISH_HID, which contains intel-ish-hid/ already) and wrong (as it's
    missing the CONFIG_ prefix).
    
    Just remove it.
    
    Fixes: 91b228107da3e ("HID: intel-ish-hid: ISH firmware loader client driver")
    Reported-by: Jiri Slaby <[email protected]>
    Acked-by: Srinivas Pandruvada <[email protected]>
    Signed-off-by: Jiri Kosina <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

hwmon: (nct6775-core) Fix out of bounds access for NCT679{8,9} [+ + +]

Author: Tasos Sahanidis <[email protected]>
Date:   Wed Mar 12 05:08:32 2025 +0200

    hwmon: (nct6775-core) Fix out of bounds access for NCT679{8,9}
    
    [ Upstream commit 815f80ad20b63830949a77c816e35395d5d55144 ]
    
    pwm_num is set to 7 for these chips, but NCT6776_REG_PWM_MODE and
    NCT6776_PWM_MODE_MASK only contain 6 values.
    
    Fix this by adding another 0 to the end of each array.
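    
    A sketch of the sizing problem, with a compile-time guard added for
    illustration. The array names, the register values and the guard
    are assumptions, not the driver's actual tables:

```c
#include <assert.h>
#include <stddef.h>

#define ARRAY_LEN(a) (sizeof(a) / sizeof((a)[0]))
#define PWM_NUM 7	/* number of PWM channels on the affected chips */

/* Hypothetical tables, padded with a trailing 0 to cover 7 channels. */
static const unsigned short pwm_mode_reg[]  = { 0x04, 0, 0, 0, 0, 0, 0 };
static const unsigned char  pwm_mode_mask[] = { 0x01, 0, 0, 0, 0, 0, 0 };

/* Fail the build if a table is shorter than the channel count. */
static_assert(ARRAY_LEN(pwm_mode_reg) >= PWM_NUM, "pwm_mode_reg too short");
static_assert(ARRAY_LEN(pwm_mode_mask) >= PWM_NUM, "pwm_mode_mask too short");

/* Bounds-checked lookup; returns -1 for an out-of-range channel. */
static int pwm_mode_reg_for(size_t channel)
{
	return channel < ARRAY_LEN(pwm_mode_reg) ? pwm_mode_reg[channel] : -1;
}
```

    With only six entries per table, index 6 would read past the end of
    both arrays.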
    
    Signed-off-by: Tasos Sahanidis <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Guenter Roeck <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

i3c: master: svc: Fix missing the IBI rules [+ + +]

Author: Stanley Chu <[email protected]>
Date:   Tue Mar 18 13:36:04 2025 +0800

    i3c: master: svc: Fix missing the IBI rules
    
    [ Upstream commit 9cecad134d84d14dc72a0eea7a107691c3e5a837 ]
    
    The code does not add IBI rules for devices with controller capability.
    However, the secondary controller has the controller capability and works
    at target mode when the device is probed. Therefore, add IBI rules for
    such devices.
    
    Fixes: dd3c52846d59 ("i3c: master: svc: Add Silvaco I3C master driver")
    Signed-off-by: Stanley Chu <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexandre Belloni <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

IB/mad: Check available slots before posting receive WRs [+ + +]

Author: Maher Sanalla <[email protected]>
Date:   Thu Mar 13 16:20:17 2025 +0200

    IB/mad: Check available slots before posting receive WRs
    
    [ Upstream commit 37826f0a8c2f6b6add5179003b8597e32a445362 ]
    
    The ib_post_receive_mads() function handles posting receive work
    requests (WRs) to MAD QPs and is called in two cases:
    1) When a MAD port is opened.
    2) When a receive WQE is consumed upon receiving a new MAD.
    
    However, if MADs arrive during the port open phase, a race condition
    might cause an extra WR to be posted, exceeding the QP’s capacity.
    This leads to failures such as:
    infiniband mlx5_0: ib_post_recv failed: -12
    infiniband mlx5_0: Couldn't post receive WRs
    infiniband mlx5_0: Couldn't start port
    infiniband mlx5_0: Couldn't open port 1
    
    Fix this by checking the current receive count before posting a new WR.
    If the QP’s receive queue is full, do not post additional WRs.
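    
    The check can be sketched as below. The struct and function names
    are hypothetical, and locking is omitted for brevity, so this only
    shows the capacity test and not the race itself:

```c
/* Hypothetical receive queue: count posted WRs against a fixed capacity. */
struct recv_queue {
	int count;	/* receive WRs currently posted */
	int max_active;	/* capacity of the QP's receive queue */
};

/* Post WRs until the queue is full; returns how many were posted. */
static int post_receive_wrs(struct recv_queue *q)
{
	int posted = 0;

	while (q->count < q->max_active) {
		q->count++;	/* stands in for posting one WR */
		posted++;
	}

	return posted;
}
```

    A second caller that arrives after the queue has been filled posts
    nothing, so the capacity is never exceeded.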
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Signed-off-by: Maher Sanalla <[email protected]>
    Link: https://patch.msgid.link/c4984ba3c3a98a5711a558bccefcad789587ecf1.1741875592.git.leon@kernel.org
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iio: accel: mma8452: Ensure error return on failure to matching oversampling ratio [+ + +]

Author: Jonathan Cameron <[email protected]>
Date:   Mon Feb 17 14:01:28 2025 +0000

    iio: accel: mma8452: Ensure error return on failure to matching oversampling ratio
    
    [ Upstream commit df330c808182a8beab5d0f84a6cbc9cff76c61fc ]
    
    If a match was not found, then the write_raw() callback would return
    the odr index, not an error. Return -EINVAL if this occurs.
    To avoid similar issues in future, introduce j, a new indexing variable
    rather than using ret for this purpose.
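    
    A minimal sketch of the pattern, assuming a hypothetical table of
    oversampling ratios; the table contents and the function name are
    illustrative only:

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical table of supported oversampling ratios. */
static const int os_ratios[] = { 1, 2, 4, 8 };

/*
 * Look up val using a dedicated index variable j. When nothing
 * matches, return -EINVAL instead of a stale index.
 */
static int find_os_ratio_index(int val)
{
	size_t j;

	for (j = 0; j < sizeof(os_ratios) / sizeof(os_ratios[0]); j++) {
		if (os_ratios[j] == val)
			return (int)j;
	}

	return -EINVAL;
}
```

    Keeping the loop index separate from the return code means a failed
    search cannot be mistaken for a valid result.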
    
    Fixes: 79de2ee469aa ("iio: accel: mma8452: claim direct mode during write raw")
    Reviewed-by: David Lechner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iio: accel: msa311: Fix failure to release runtime pm if direct mode claim fails. [+ + +]

Author: Jonathan Cameron <[email protected]>
Date:   Mon Feb 17 14:01:33 2025 +0000

    iio: accel: msa311: Fix failure to release runtime pm if direct mode claim fails.
    
    [ Upstream commit 60a0cf2ebab92011055ab7db6553c0fc3c546938 ]
    
    Reorder the claiming of direct mode and runtime pm calls to simplify
    handling a little.  For correct error handling, after the reorder
    iio_device_release_direct_mode() must be called if an error occurs
    in pm_runtime_resume_and_get().
    
    Fixes: 1ca2cfbc0c33 ("iio: add MEMSensing MSA311 3-axis accelerometer driver")
    Reviewed-by: David Lechner <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

iio: adc: ad7124: Fix comparison of channel configs [+ + +]

Author: Uwe Kleine-König <[email protected]>
Date:   Mon Mar 3 12:47:01 2025 +0100

    iio: adc: ad7124: Fix comparison of channel configs
    
    [ Upstream commit 05a5d874f7327b75e9bc4359618017e047cc129c ]
    
    Checking the binary representation of two structs (of the same type)
    for equality doesn't have the same semantics as comparing all members for
    equality. The former might find a difference where the latter doesn't in
    the presence of padding or when ambiguous types like float or bool are
    involved. (Floats typically have different representations for single
    values, like -0.0 vs +0.0, or 0.5 * 2² vs 0.25 * 2³. The type bool has
    at least 8 bits and the raw values 1 and 2 (probably) both evaluate to
    true, but memcmp finds a difference.)
    
    When searching for a channel that already has the configuration we need,
    the comparison by member is the one that is needed.
    
    Convert the comparison accordingly to compare the members one after
    another. Also add a static_assert guard to (somewhat) ensure that when
    struct ad7124_channel_config::config_props is expanded, the comparison
    is adapted, too.
    
    This issue is somewhat theoretical, but using memcmp() on a struct is a
    bad pattern that is worth fixing.
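    
    The difference between the two comparisons can be sketched as
    follows. The struct is a hypothetical stand-in and not the driver's
    config_props; the static_assert mirrors the idea of a size guard:

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical config struct; padding usually follows `bipolar`. */
struct cfg_props {
	bool bipolar;
	unsigned int odr;
	unsigned int gain;
};

/* Guard: if a member is added, this fails until cfg_equal() is updated. */
static_assert(sizeof(struct cfg_props) ==
	      sizeof(struct { bool bipolar; unsigned int odr; unsigned int gain; }),
	      "cfg_props changed, update cfg_equal()");

/* Member-wise comparison: unaffected by padding bytes. */
static bool cfg_equal(const struct cfg_props *a, const struct cfg_props *b)
{
	return a->bipolar == b->bipolar &&
	       a->odr == b->odr &&
	       a->gain == b->gain;
}

/* Raw comparison: also compares padding, so it may report a difference. */
static bool cfg_raw_equal(const struct cfg_props *a, const struct cfg_props *b)
{
	return memcmp(a, b, sizeof(*a)) == 0;
}
```

    Two structs whose members all match can still differ in their
    padding bytes, in which case only cfg_equal() is guaranteed to
    report them as equal.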
    
    Fixes: 7b8d045e497a ("iio: adc: ad7124: allow more than 8 channels")
    Signed-off-by: Uwe Kleine-König <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jonathan Cameron <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

io_uring/filetable: ensure node switch is always done, if needed [+ + +]

Author: Jens Axboe <[email protected]>
Date:   Thu Apr 3 10:48:49 2025 -0600

    io_uring/filetable: ensure node switch is always done, if needed
    
    No upstream patch exists for this issue, as it was introduced by
    a stable backport.
    
    A previous backport relied on other code changes in the io_uring file
    table and resource node handling, which means that sometimes a resource
    node switch can get missed. For 6.1-stable, that code is still in
    io_install_fixed_file(), so ensure we fall-through to that case for the
    success path too.
    
    Fixes: a3812a47a320 ("io_uring: drop any code related to SCM_RIGHTS")
    Signed-off-by: Jens Axboe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ipv6: Do not consider link down nexthops in path selection [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Wed Apr 2 14:42:24 2025 +0300

    ipv6: Do not consider link down nexthops in path selection
    
    [ Upstream commit 8b8e0dd357165e0258d9f9cdab5366720ed2f619 ]
    
    Nexthops whose link is down are not supposed to be considered during
    path selection when the "ignore_routes_with_linkdown" sysctl is set.
    This is done by assigning them a negative region boundary.
    
    However, when comparing the computed hash (unsigned) with the region
    boundary (signed), the negative region boundary is treated as unsigned,
    resulting in incorrect nexthop selection.
    
    Fix by treating the computed hash as signed. Note that the computed hash
    is always in range of [0, 2^31 - 1].
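    
    The effect of the mixed comparison can be sketched like this; the
    function names are hypothetical, and the explicit cast in the first
    helper spells out what the implicit conversion does:

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * Old behaviour: comparing an unsigned hash with a signed bound
 * converts the bound to unsigned, so -1 becomes UINT32_MAX.
 */
static bool in_region_unsigned(uint32_t hash, int32_t upper_bound)
{
	return hash <= (uint32_t)upper_bound;
}

/* Fixed behaviour: treat the hash as signed (it is at most 2^31 - 1). */
static bool in_region_signed(uint32_t hash, int32_t upper_bound)
{
	return (int32_t)hash <= upper_bound;
}
```

    A link-down nexthop with a boundary of -1 matches every hash in the
    unsigned comparison and no hash in the signed one.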
    
    Fixes: 3d709f69a3e7 ("ipv6: Use hash-threshold instead of modulo-N")
    Signed-off-by: Ido Schimmel <[email protected]>
    Reviewed-by: Willem de Bruijn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv6: fix omitted netlink attributes when using RTEXT_FILTER_SKIP_STATS [+ + +]

Author: Fernando Fernandez Mancera <[email protected]>
Date:   Wed Apr 2 14:17:51 2025 +0200

    ipv6: fix omitted netlink attributes when using RTEXT_FILTER_SKIP_STATS
    
    [ Upstream commit 7ac6ea4a3e0898db76aecccd68fb2c403eb7d24e ]
    
    Using RTEXT_FILTER_SKIP_STATS incorrectly skips non-stats IPv6
    netlink attributes on link dump. This causes issues in userspace tools,
    e.g. iproute2 does not render the address generation mode as it should
    due to the missing netlink attribute.
    
    Move the filling of IFLA_INET6_STATS and IFLA_INET6_ICMP6STATS to a
    helper function guarded by a flag check to avoid hitting the same
    situation in the future.
    
    Fixes: d5566fd72ec1 ("rtnetlink: RTEXT_FILTER_SKIP_STATS support to avoid dumping inet/inet6 stats")
    Signed-off-by: Fernando Fernandez Mancera <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ipv6: Start path selection from the first nexthop [+ + +]

Author: Ido Schimmel <[email protected]>
Date:   Wed Apr 2 14:42:23 2025 +0300

    ipv6: Start path selection from the first nexthop
    
    [ Upstream commit 4d0ab3a6885e3e9040310a8d8f54503366083626 ]
    
    Cited commit transitioned IPv6 path selection to use hash-threshold
    instead of modulo-N. With hash-threshold, each nexthop is assigned a
    region boundary in the multipath hash function's output space and a
    nexthop is chosen if the calculated hash is smaller than the nexthop's
    region boundary.
    
    Hash-threshold does not work correctly if path selection does not start
    with the first nexthop. For example, if fib6_select_path() is always
    passed the last nexthop in the group, then it will always be chosen
    because its region boundary covers the entire hash function's output
    space.
    
    Fix this by starting the selection process from the first nexthop and do
    not consider nexthops for which rt6_score_route() provided a negative
    score.
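    
    A simplified sketch of hash-threshold selection that starts from
    the first nexthop. The struct, the function name and the -1 return
    for "no match" are assumptions made for illustration:

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical nexthop: region boundary plus a route score. */
struct nexthop {
	int32_t upper_bound;	/* upper edge of this nexthop's hash region */
	int score;		/* negative means "do not use" */
};

/*
 * Walk the group from the first nexthop and return the index of the
 * first usable one whose region covers the hash, or -1 if none does.
 */
static int select_path(const struct nexthop *nh, size_t n, uint32_t hash)
{
	for (size_t i = 0; i < n; i++) {
		if (nh[i].score < 0)
			continue;
		if ((int32_t)hash <= nh[i].upper_bound)
			return (int)i;
	}

	return -1;
}
```

    Starting from the last nexthop instead would always select it,
    since its boundary covers the whole output space of the hash.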
    
    Fixes: 3d709f69a3e7 ("ipv6: Use hash-threshold instead of modulo-N")
    Reported-by: Stanislav Fomichev <[email protected]>
    Closes: https://lore.kernel.org/netdev/Z9RIyKZDNoka53EO@mini-arch/
    Signed-off-by: Ido Schimmel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

isofs: fix KMSAN uninit-value bug in do_isofs_readdir() [+ + +]

Author: Qasim Ijaz <[email protected]>
Date:   Tue Feb 11 19:59:00 2025 +0000

    isofs: fix KMSAN uninit-value bug in do_isofs_readdir()
    
    [ Upstream commit 81a82e8f33880793029cd6f8a766fb13b737e6a7 ]
    
    In do_isofs_readdir(), when assigning the variable
    "struct iso_directory_record *de", the b_data field of the buffer_head
    is accessed and an offset is added to it. The size of b_data is 2048
    and the offset is 2047, meaning
    "de = (struct iso_directory_record *) (bh->b_data + offset);"
    points at the final byte of the 2048-byte b_data block.
    
    The first byte of the directory record (de_len) is then read and
    found to be 31, meaning the directory record size is 31 bytes long.
    The directory record is defined by the structure:
    
            struct iso_directory_record {
                    __u8 length;                     // 1 byte
                    __u8 ext_attr_length;            // 1 byte
                    __u8 extent[8];                  // 8 bytes
                    __u8 size[8];                    // 8 bytes
                    __u8 date[7];                    // 7 bytes
                    __u8 flags;                      // 1 byte
                    __u8 file_unit_size;             // 1 byte
                    __u8 interleave;                 // 1 byte
                    __u8 volume_sequence_number[4];  // 4 bytes
                    __u8 name_len;                   // 1 byte
                    char name[];                     // variable size
            } __attribute__((packed));
    
    The fixed portion of this structure occupies 33 bytes. Therefore, a
    valid directory record must be at least 33 bytes long
    (even without considering the variable-length name field).
    Since de_len is only 31, it is insufficient to contain
    the complete fixed header.
    
    The code later hits the following sanity check that
    compares de_len against the sum of de->name_len and
    sizeof(struct iso_directory_record):
    
            if (de_len < de->name_len[0] + sizeof(struct iso_directory_record)) {
                    ...
            }
    
    Since the fixed portion of the structure is
    33 bytes (up to and including name_len member),
    a valid record should have de_len of at least 33 bytes;
    here, however, de_len is too short, and the field de->name_len
    (located at offset 32) is accessed even though it lies beyond
    the available 31 bytes.
    
    This access on the corrupted isofs data triggers a KMSAN uninitialized
    memory warning. The fix would be to first verify that de_len is at least
    sizeof(struct iso_directory_record) before accessing any
    fields like de->name_len.
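    
    The ordering of the two checks can be sketched in userspace as
    below. The struct copies the layout quoted above under a different,
    hypothetical name, and record_is_valid() is not the kernel code:

```c
#include <stddef.h>
#include <stdint.h>

/* Same field layout as the record quoted above (33-byte fixed part). */
struct iso_dir_record {
	uint8_t length;
	uint8_t ext_attr_length;
	uint8_t extent[8];
	uint8_t size[8];
	uint8_t date[7];
	uint8_t flags;
	uint8_t file_unit_size;
	uint8_t interleave;
	uint8_t volume_sequence_number[4];
	uint8_t name_len;
	char name[];
} __attribute__((packed));

/* Returns 1 if a record of de_len bytes at buf is long enough to use. */
static int record_is_valid(const uint8_t *buf, size_t de_len)
{
	const struct iso_dir_record *de = (const void *)buf;

	/* First make sure the fixed header, including name_len, is there. */
	if (de_len < sizeof(*de))
		return 0;

	/* Only now is it safe to read name_len. */
	return de_len >= sizeof(*de) + de->name_len;
}
```

    With de_len == 31 the function returns before name_len at offset 32
    is ever read.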
    
    Reported-by: syzbot <[email protected]>
    Tested-by: syzbot <[email protected]>
    Closes: https://syzkaller.appspot.com/bug?extid=812641c6c3d7586a1613
    Fixes: 2deb1acc653c ("isofs: fix access to unallocated memory when reading corrupted filesystem")
    Signed-off-by: Qasim Ijaz <[email protected]>
    Signed-off-by: Jan Kara <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

jfs: add index corruption check to DT_GETPAGE() [+ + +]

Author: Roman Smirnov <[email protected]>
Date:   Wed Feb 26 11:25:22 2025 +0300

    jfs: add index corruption check to DT_GETPAGE()
    
    commit a8dfb2168906944ea61acfc87846b816eeab882d upstream.
    
    If the file system is corrupted, the header.stblindex variable
    may become greater than 127. Because of this, an array access out
    of bounds may occur:
    
    ------------[ cut here ]------------
    UBSAN: array-index-out-of-bounds in fs/jfs/jfs_dtree.c:3096:10
    index 237 is out of range for type 'struct dtslot[128]'
    CPU: 0 UID: 0 PID: 5822 Comm: syz-executor740 Not tainted 6.13.0-rc4-syzkaller-00110-g4099a71718b0 #0
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 09/13/2024
    Call Trace:
     <TASK>
     __dump_stack lib/dump_stack.c:94 [inline]
     dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
     ubsan_epilogue lib/ubsan.c:231 [inline]
     __ubsan_handle_out_of_bounds+0x121/0x150 lib/ubsan.c:429
     dtReadFirst+0x622/0xc50 fs/jfs/jfs_dtree.c:3096
     dtReadNext fs/jfs/jfs_dtree.c:3147 [inline]
     jfs_readdir+0x9aa/0x3c50 fs/jfs/jfs_dtree.c:2862
     wrap_directory_iterator+0x91/0xd0 fs/readdir.c:65
     iterate_dir+0x571/0x800 fs/readdir.c:108
     __do_sys_getdents64 fs/readdir.c:403 [inline]
     __se_sys_getdents64+0x1e2/0x4b0 fs/readdir.c:389
     do_syscall_x64 arch/x86/entry/common.c:52 [inline]
     do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
     entry_SYSCALL_64_after_hwframe+0x77/0x7f
     </TASK>
    ---[ end trace ]---
    
    Add a stblindex check for corruption.
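    
    A minimal sketch of such a bounds check, assuming a slot array of
    128 entries as in the UBSAN report; the macro and function names
    are hypothetical:

```c
/* Number of slots in the array, per "struct dtslot[128]" in the report. */
#define DT_PAGE_SLOTS 128

/* Returns 1 if stblindex can safely index an array of DT_PAGE_SLOTS. */
static int stblindex_valid(int stblindex)
{
	return stblindex >= 0 && stblindex < DT_PAGE_SLOTS;
}
```

    The value 237 from the report fails this check, so the page would be
    rejected as corrupted before the array is indexed.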
    
    Reported-by: syzbot <[email protected]>
    Closes: https://syzkaller.appspot.com/bug?extid=9120834fc227768625ba
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Cc: [email protected]
    Signed-off-by: Roman Smirnov <[email protected]>
    Signed-off-by: Dave Kleikamp <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

jfs: fix slab-out-of-bounds read in ea_get() [+ + +]

Author: Qasim Ijaz <[email protected]>
Date:   Thu Feb 13 21:05:53 2025 +0000

    jfs: fix slab-out-of-bounds read in ea_get()
    
    commit fdf480da5837c23b146c4743c18de97202fcab37 upstream.
    
    At the "size_check" label in ea_get(), the code checks whether the
    extended attribute list (xattr) size matches ea_size. If not, it logs
    "ea_get: invalid extended attribute" and calls print_hex_dump().
    
    Here, EALIST_SIZE(ea_buf->xattr) returns 4110417968, which exceeds
    INT_MAX (2,147,483,647). Then ea_size is clamped:
    
            int size = clamp_t(int, ea_size, 0, EALIST_SIZE(ea_buf->xattr));
    
    Although clamp_t aims to bound ea_size between 0 and 4110417968, the upper
    limit is treated as an int, causing an overflow above 2^31 - 1. This leads
    "size" to wrap around and become negative (-184549328).
    
    The "size" is then passed to print_hex_dump(), where it is called
    "len" and has type size_t (an unsigned type), so the negative value
    becomes 18446744073525002176. It is then stored in a variable called
    "int remaining", which is assigned to "int linelen", which in turn
    is passed to hex_dump_to_buffer(). In print_hex_dump(), the for loop
    iterates from 0 to len-1, calling hex_dump_to_buffer() on each
    iteration:
    
            for (i = 0; i < len; i += rowsize) {
                    linelen = min(remaining, rowsize);
                    remaining -= rowsize;
    
                    hex_dump_to_buffer(ptr + i, linelen, rowsize, groupsize,
                                       linebuf, sizeof(linebuf), ascii);
    
                    ...
            }
    
    The expected stopping condition (i < len) is effectively broken
    since len is corrupted and very large. As a result, the "ptr+i"
    passed to hex_dump_to_buffer() moves ever closer to the end of
    the actual bounds of "ptr", until an out-of-bounds access occurs
    in hex_dump_to_buffer() in the following for loop:
    
            for (j = 0; j < len; j++) {
                            if (linebuflen < lx + 2)
                                    goto overflow2;
                            ch = ptr[j];
                    ...
            }
    
    To fix this we should validate "EALIST_SIZE(ea_buf->xattr)"
    before it is utilised.
    
    Reported-by: syzbot <[email protected]>
    Tested-by: syzbot <[email protected]>
    Closes: https://syzkaller.appspot.com/bug?extid=4e6e7e4279d046613bc5
    Fixes: d9f9d96136cb ("jfs: xattr: check invalid xattr size more strictly")
    Cc: [email protected]
    Signed-off-by: Qasim Ijaz <[email protected]>
    Signed-off-by: Dave Kleikamp <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
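
Editor's sketch (not part of the upstream commit): the narrowing described above can be reproduced in userspace C. The helper names are hypothetical, and ealist_size_fits_int() is only one possible form of the validation the message calls for. Converting an out-of-range unsigned value to int is implementation-defined in C; the wrap-around shown here is what GCC and Clang produce on common targets.

```c
#include <limits.h>
#include <stddef.h>
#include <stdint.h>

/* Mimic treating a 32-bit unsigned size as an int upper bound. */
static int narrow_to_int(uint32_t v)
{
	return (int)v;	/* implementation-defined; wraps on GCC/Clang */
}

/* Mimic passing a (possibly negative) int where a size_t is expected. */
static size_t as_length(int size)
{
	return (size_t)size;	/* a negative value becomes a huge length */
}

/* Hypothetical guard: accept the list size only if it fits in an int. */
static int ealist_size_fits_int(uint32_t ealist_size)
{
	return ealist_size <= (uint32_t)INT_MAX;
}
```

With the value from the report, narrow_to_int(4110417968u) yields the negative size quoted in the message, and the guard rejects that input before any clamping takes place.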

kexec: initialize ELF lowest address to ULONG_MAX [+ + +]

Author: Sourabh Jain <[email protected]>
Date:   Fri Jan 31 17:08:24 2025 +0530

    kexec: initialize ELF lowest address to ULONG_MAX
    
    [ Upstream commit 9986fb5164c8b21f6439cfd45ba36d8cc80c9710 ]
    
    Patch series "powerpc/crash: use generic crashkernel reservation", v3.
    
    Commit 0ab97169aa05 ("crash_core: add generic function to do reservation")
    added a generic function to reserve crashkernel memory.  So let's use the
    same function on powerpc and remove the architecture-specific code that
    essentially does the same thing.
    
    The generic crashkernel reservation also provides a way to split the
    crashkernel reservation into high and low memory reservations, which can
    be enabled for powerpc in the future.
    
    Additionally move powerpc to use generic APIs to locate memory hole for
    kexec segments while loading kdump kernel.
    
    This patch (of 7):
    
    kexec_elf_load() loads an ELF executable and sets the address of the
    lowest PT_LOAD section to the address held by the lowest_load_addr
    function argument.
    
    To determine the lowest PT_LOAD address, a local variable lowest_addr
    (type unsigned long) is initialized to UINT_MAX.  After loading each
    PT_LOAD, its address is compared to lowest_addr.  If a loaded PT_LOAD
    address is lower, lowest_addr is updated.  However, setting lowest_addr to
    UINT_MAX won't work when the kernel image is loaded above 4G, as the
    returned lowest PT_LOAD address would be invalid.  This is resolved by
    initializing lowest_addr to ULONG_MAX instead.
    
    This issue was discovered while implementing crashkernel high/low
    reservation on the PowerPC architecture.
    
    Link: https://lkml.kernel.org/r/[email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: a0458284f062 ("powerpc: Add support code for kexec_file_load()")
    Signed-off-by: Sourabh Jain <[email protected]>
    Acked-by: Hari Bathini <[email protected]>
    Acked-by: Baoquan He <[email protected]>
    Cc: Madhavan Srinivasan <[email protected]>
    Cc: Mahesh Salgaonkar <[email protected]>
    Cc: Michael Ellerman <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
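
Editor's sketch (not part of the upstream commit): the effect of the initial value on a running minimum can be shown with a small helper. lowest_addr() is a hypothetical function; uint64_t stands in for unsigned long on a 64-bit kernel, UINT32_MAX for UINT_MAX, and UINT64_MAX for ULONG_MAX.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Track the lowest address in addrs[], starting from the sentinel 'init'.
 * If every address is above 'init', the sentinel itself is returned,
 * which is not a valid load address.
 */
static uint64_t lowest_addr(const uint64_t *addrs, size_t n, uint64_t init)
{
	uint64_t lowest = init;

	for (size_t i = 0; i < n; i++) {
		if (addrs[i] < lowest)
			lowest = addrs[i];
	}
	return lowest;
}
```

For segments loaded at 5 GiB and 6 GiB, a 32-bit sentinel is never replaced, while a full-width sentinel yields the 5 GiB address as expected.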

ksmbd: add bounds check for create lease context [+ + +]

Author: Norbert Szetei <[email protected]>
Date:   Sat Mar 15 12:19:28 2025 +0900

    ksmbd: add bounds check for create lease context
    
    commit bab703ed8472aa9d109c5f8c1863921533363dae upstream.
    
    Add missing bounds check for create lease context.
    
    Cc: [email protected]
    Reported-by: Norbert Szetei <[email protected]>
    Tested-by: Norbert Szetei <[email protected]>
    Signed-off-by: Norbert Szetei <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix multichannel connection failure [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Mon Mar 24 20:19:20 2025 +0900

    ksmbd: fix multichannel connection failure
    
    [ Upstream commit c1883049aa9b2b7dffd3a68c5fc67fa92c174bd9 ]
    
    ksmbd checks that the session of the second channel is in the session
    list of the first connection. If it is in the session list, the
    multichannel connection should not be allowed.
    
    Fixes: b95629435b84 ("ksmbd: fix racy issue from session lookup and expire")
    Reported-by: Sean Heelan <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: fix session use-after-free in multichannel connection [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Thu Mar 27 21:22:51 2025 +0900

    ksmbd: fix session use-after-free in multichannel connection
    
    commit fa4cdb8cbca7d6cb6aa13e4d8d83d1103f6345db upstream.
    
    There is a race condition between session setup and
    ksmbd_sessions_deregister. The session can be freed before the
    connection is added to the channel list of the session.
    This patch checks the reference count of the session before freeing it.
    
    Cc: [email protected]
    Reported-by: Sean Heelan <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: fix use-after-free in ksmbd_sessions_deregister() [+ + +]

Author: Namjae Jeon <[email protected]>
Date:   Sat Mar 22 09:20:19 2025 +0900

    ksmbd: fix use-after-free in ksmbd_sessions_deregister()
    
    commit 15a9605f8d69dc85005b1a00c31a050b8625e1aa upstream.
    
    In multichannel mode, a UAF issue can occur in session_deregister
    when the second channel sets up a session through the connection of
    the first channel. A session that is freed through the global session
    table can be accessed again through ->sessions of the connection.
    
    Cc: [email protected]
    Reported-by: Norbert Szetei <[email protected]>
    Tested-by: Norbert Szetei <[email protected]>
    Signed-off-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

ksmbd: use aead_request_free to match aead_request_alloc [+ + +]

Author: Miaoqian Lin <[email protected]>
Date:   Tue Mar 18 20:12:34 2025 +0800

    ksmbd: use aead_request_free to match aead_request_alloc
    
    [ Upstream commit 6171063e9d046ffa46f51579b2ca4a43caef581a ]
    
    Use aead_request_free() instead of kfree() to properly free memory
    allocated by aead_request_alloc(). This ensures sensitive crypto data
    is zeroed before being freed.
    
    Fixes: e2f34481b24d ("cifsd: add server-side procedures for SMB3")
    Signed-off-by: Miaoqian Lin <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ksmbd: validate zero num_subauth before sub_auth is accessed [+ + +]

Author: Norbert Szetei <[email protected]>
Date:   Sat Mar 29 16:06:01 2025 +0000

    ksmbd: validate zero num_subauth before sub_auth is accessed
    
    commit bf21e29d78cd2c2371023953d9c82dfef82ebb36 upstream.
    
    Accessing psid->sub_auth[psid->num_subauth - 1] without checking
    that num_subauth is non-zero leads to an out-of-bounds read.
    This patch adds a validation step to ensure num_subauth != 0
    before sub_auth is accessed.
    
    Cc: [email protected]
    Signed-off-by: Norbert Szetei <[email protected]>
    Acked-by: Namjae Jeon <[email protected]>
    Signed-off-by: Steve French <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
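
Editor's sketch (not part of the upstream commit): a guarded access to the last sub-authority might look like the following. struct sid_sketch, MAX_SUBAUTHS and last_sub_auth() are hypothetical stand-ins for illustration, not the ksmbd definitions.

```c
#include <stdbool.h>
#include <stdint.h>

#define MAX_SUBAUTHS 15	/* hypothetical upper bound for this sketch */

struct sid_sketch {
	uint8_t  num_subauth;
	uint32_t sub_auth[MAX_SUBAUTHS];
};

/*
 * Copy the last sub-authority into *out and return true, or return false
 * when num_subauth is zero (index would be -1) or exceeds the array size.
 */
static bool last_sub_auth(const struct sid_sketch *sid, uint32_t *out)
{
	if (sid->num_subauth == 0 || sid->num_subauth > MAX_SUBAUTHS)
		return false;

	*out = sid->sub_auth[sid->num_subauth - 1];
	return true;
}
```

The zero check matters because num_subauth - 1 evaluates to -1 after integer promotion, so the unguarded expression would read memory before the start of the array.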

kunit/overflow: Fix UB in overflow_allocation_test [+ + +]

Author: Ivan Orlov <[email protected]>
Date:   Thu Aug 15 01:04:31 2024 +0100

    kunit/overflow: Fix UB in overflow_allocation_test
    
    commit 92e9bac18124682c4b99ede9ee3bcdd68f121e92 upstream.
    
    The 'device_name' array doesn't exist outside of the
    'overflow_allocation_test' function scope. However, it is being used as
    a driver name when calling 'kunit_driver_create' from
    'kunit_device_register'. This produces a kernel panic with KASAN
    enabled.
    
    Since this variable is used in one place only, remove it and pass the
    device name into kunit_device_register directly as an ascii string.
    
    Signed-off-by: Ivan Orlov <[email protected]>
    Reviewed-by: David Gow <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Kees Cook <[email protected]>
    Signed-off-by: Jianqi Ren <[email protected]>
    Signed-off-by: He Zhe <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lib: 842: Improve error handling in sw842_compress() [+ + +]

Author: Tanya Agarwal <[email protected]>
Date:   Tue Jan 14 19:42:04 2025 +0530

    lib: 842: Improve error handling in sw842_compress()
    
    [ Upstream commit af324dc0e2b558678aec42260cce38be16cc77ca ]
    
    The static code analysis tool "Coverity Scan" pointed the following
    implementation details out for further development considerations:
    CID 1309755: Unused value
    In sw842_compress: A value assigned to a variable is never used. (CWE-563)
    returned_value: Assigning value from add_repeat_template(p, repeat_count)
    to ret here, but that stored value is overwritten before it can be used.
    
    Conclusion:
    Add error handling for the return value from an add_repeat_template()
    call.
    
    Fixes: 2da572c959dd ("lib: add software 842 compression/decompression")
    Signed-off-by: Tanya Agarwal <[email protected]>
    Signed-off-by: Herbert Xu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

libbpf: Fix hypothetical STT_SECTION extern NULL deref case [+ + +]

Author: Andrii Nakryiko <[email protected]>
Date:   Wed Feb 19 16:28:21 2025 -0800

    libbpf: Fix hypothetical STT_SECTION extern NULL deref case
    
    [ Upstream commit e0525cd72b5979d8089fe524a071ea93fd011dc9 ]
    
    Fix theoretical NULL dereference in linker when resolving *extern*
    STT_SECTION symbol against not-yet-existing ELF section. Not sure if
    it's possible in practice for valid ELF object files (this would require
    embedded assembly manipulations, at which point BTF will be missing),
    but fix the s/dst_sym/dst_sec/ typo guarding this condition anyways.
    
    Fixes: faf6ed321cf6 ("libbpf: Add BPF static linker APIs")
    Fixes: a46349227cd8 ("libbpf: Add linker extern resolution support for functions and global variables")
    Signed-off-by: Andrii Nakryiko <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

Linux: Linux 6.1.134 [+ + +]

Author: Greg Kroah-Hartman <[email protected]>
Date:   Thu Apr 10 14:33:44 2025 +0200

    Linux 6.1.134
    
    Link: https://lore.kernel.org/r/[email protected]
    Tested-by: Mark Brown <[email protected]>
    Tested-by: Pavel Machek (CIP) <[email protected]>
    Tested-by: Florian Fainelli <[email protected]>
    Tested-by: Peter Schneider <[email protected]>
    Tested-by: SeongJae Park <[email protected]>
    Tested-by: Ron Economos <[email protected]>
    Tested-by: Jon Hunter <[email protected]>
    Tested-by: Linux Kernel Functional Testing <[email protected]>
    Tested-by: Salvatore Bonaccorso <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Tested-by: Shuah Khan <[email protected]>
    Tested-by: Mark Brown <[email protected]>
    Tested-by: Florian Fainelli <[email protected]>
    Tested-by: Peter Schneider <[email protected]>
    Tested-by: Linux Kernel Functional Testing <[email protected]>
    Tested-by: Jon Hunter <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

lockdep/mm: Fix might_fault() lockdep check of current->mm->mmap_lock [+ + +]

Author: Peter Zijlstra <[email protected]>
Date:   Mon Nov 4 14:39:10 2024 +0100

    lockdep/mm: Fix might_fault() lockdep check of current->mm->mmap_lock
    
    [ Upstream commit a1b65f3f7c6f7f0a08a7dba8be458c6415236487 ]
    
    Turns out that this commit, about 10 years ago:
    
      9ec23531fd48 ("sched/preempt, mm/fault: Trigger might_sleep() in might_fault() with disabled pagefaults")
    
    ... accidentally (and unnecessarily) put the lockdep part of
    __might_fault() under CONFIG_DEBUG_ATOMIC_SLEEP=y.
    
    This is potentially notable because large distributions such as
    Ubuntu are running with !CONFIG_DEBUG_ATOMIC_SLEEP.
    
    Restore the debug check.
    
    [ mingo: Update changelog. ]
    
    Fixes: 9ec23531fd48 ("sched/preempt, mm/fault: Trigger might_sleep() in might_fault() with disabled pagefaults")
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Cc: Andrew Morton <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

lockdep: Don't disable interrupts on RT in disable_irq_nosync_lockdep.*() [+ + +]

Author: Sebastian Andrzej Siewior <[email protected]>
Date:   Wed Feb 12 11:36:18 2025 +0100

    lockdep: Don't disable interrupts on RT in disable_irq_nosync_lockdep.*()
    
    [ Upstream commit 87886b32d669abc11c7be95ef44099215e4f5788 ]
    
    disable_irq_nosync_lockdep() disables interrupts with lockdep enabled to
    avoid false positive reports by lockdep that a certain lock has not been
    acquired with disabled interrupts. The user of these macros expects that
    a lock can be acquired without disabling interrupts because the IRQ line
    triggering the interrupt is disabled.
    
    This triggers a warning on PREEMPT_RT because after
    disable_irq_nosync_lockdep.*() the following spinlock_t is now acquired
    with disabled interrupts.
    
    On PREEMPT_RT there is no difference between spin_lock() and
    spin_lock_irq() so avoiding disabling interrupts in this case works for
    the two remaining callers as of today.
    
    Don't disable interrupts on PREEMPT_RT in disable_irq_nosync_lockdep.*().
    
    Closes: https://lore.kernel.org/[email protected]
    Fixes: e8106b941ceab ("[PATCH] lockdep: core, add enable/disable_irq_irqsave/irqrestore() APIs")
    Reported-by: Guenter Roeck <[email protected]>
    Suggested-by: "Steven Rostedt (Google)" <[email protected]>
    Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Tested-by: Guenter Roeck <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

locking/semaphore: Use wake_q to wake up processes outside lock critical section [+ + +]

Author: Waiman Long <[email protected]>
Date:   Fri Mar 7 15:26:52 2025 -0800

    locking/semaphore: Use wake_q to wake up processes outside lock critical section
    
    [ Upstream commit 85b2b9c16d053364e2004883140538e73b333cdb ]
    
    A circular lock dependency splat has been seen involving down_trylock():
    
      ======================================================
      WARNING: possible circular locking dependency detected
      6.12.0-41.el10.s390x+debug
      ------------------------------------------------------
      dd/32479 is trying to acquire lock:
      0015a20accd0d4f8 ((console_sem).lock){-.-.}-{2:2}, at: down_trylock+0x26/0x90
    
      but task is already holding lock:
      000000017e461698 (&zone->lock){-.-.}-{2:2}, at: rmqueue_bulk+0xac/0x8f0
    
      the existing dependency chain (in reverse order) is:
      -> #4 (&zone->lock){-.-.}-{2:2}:
      -> #3 (hrtimer_bases.lock){-.-.}-{2:2}:
      -> #2 (&rq->__lock){-.-.}-{2:2}:
      -> #1 (&p->pi_lock){-.-.}-{2:2}:
      -> #0 ((console_sem).lock){-.-.}-{2:2}:
    
    The console_sem -> pi_lock dependency is due to calling try_to_wake_up()
    while holding the console_sem raw_spinlock. This dependency can be broken
    by using wake_q to do the wakeup instead of calling try_to_wake_up()
    under the console_sem lock. This will also make the semaphore's
    raw_spinlock become a terminal lock without taking any further locks
    underneath it.
    
    The hrtimer_bases.lock is a raw_spinlock while zone->lock is a
    spinlock. The hrtimer_bases.lock -> zone->lock dependency happens via
    the debug_objects_fill_pool() helper function in the debugobjects code.
    
      -> #4 (&zone->lock){-.-.}-{2:2}:
             __lock_acquire+0xe86/0x1cc0
             lock_acquire.part.0+0x258/0x630
             lock_acquire+0xb8/0xe0
             _raw_spin_lock_irqsave+0xb4/0x120
             rmqueue_bulk+0xac/0x8f0
             __rmqueue_pcplist+0x580/0x830
             rmqueue_pcplist+0xfc/0x470
             rmqueue.isra.0+0xdec/0x11b0
             get_page_from_freelist+0x2ee/0xeb0
             __alloc_pages_noprof+0x2c2/0x520
             alloc_pages_mpol_noprof+0x1fc/0x4d0
             alloc_pages_noprof+0x8c/0xe0
             allocate_slab+0x320/0x460
             ___slab_alloc+0xa58/0x12b0
             __slab_alloc.isra.0+0x42/0x60
             kmem_cache_alloc_noprof+0x304/0x350
             fill_pool+0xf6/0x450
             debug_object_activate+0xfe/0x360
             enqueue_hrtimer+0x34/0x190
             __run_hrtimer+0x3c8/0x4c0
             __hrtimer_run_queues+0x1b2/0x260
             hrtimer_interrupt+0x316/0x760
             do_IRQ+0x9a/0xe0
             do_irq_async+0xf6/0x160
    
    Normally a raw_spinlock to spinlock dependency is not legitimate
    and will trigger a warning if CONFIG_PROVE_RAW_LOCK_NESTING is enabled,
    but debug_objects_fill_pool() is an exception as it explicitly
    allows this dependency for non-PREEMPT_RT kernels without causing
    a PROVE_RAW_LOCK_NESTING lockdep splat. As a result, this dependency
    is legitimate and not a bug.
    
    Anyway, semaphore is the only locking primitive left that still
    uses try_to_wake_up() to do the wakeup inside the critical section;
    all the other locking primitives have been migrated to use wake_q to
    do the wakeup outside of the critical section. It is also possible
    that there are other circular locking dependencies involving
    printk/console_sem or other existing/new semaphores lurking somewhere
    which may show up in the future. Let's just do the migration to
    wake_q now to avoid headaches like this.
    
    Reported-by: [email protected]
    Signed-off-by: Waiman Long <[email protected]>
    Signed-off-by: Boqun Feng <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

LoongArch: BPF: Fix off-by-one error in build_prologue() [+ + +]

Author: Hengqi Chen <[email protected]>
Date:   Sun Mar 30 16:31:09 2025 +0800

    LoongArch: BPF: Fix off-by-one error in build_prologue()
    
    commit 7e2586991e36663c9bc48c828b83eab180ad30a9 upstream.
    
    Vincent reported that running BPF progs with tailcalls on LoongArch
    causes a kernel hard lockup. Debugging the issue shows that the JITed
    image is missing a jirl instruction at the end of the epilogue.
    
    There are two passes in JIT compiling: the first pass sets the flags
    and the second pass generates JIT code based on those flags. With BPF
    progs mixing bpf2bpf and tailcalls, build_prologue() generates N insns
    in the first pass and then generates N+1 insns in the second pass. This
    makes epilogue_offset off by one, so we jump to some unexpected insn
    and cause a lockup. Fix this by inserting a nop insn.
    
    Cc: [email protected]
    Fixes: 5dc615520c4d ("LoongArch: Add BPF JIT support")
    Fixes: bb035ef0cc91 ("LoongArch: BPF: Support mixing bpf2bpf and tailcalls")
    Reported-by: Vincent Li <[email protected]>
    Tested-by: Vincent Li <[email protected]>
    Closes: https://lore.kernel.org/loongarch/CAK3+h2w6WESdBN3UCr3WKHByD7D6Q_Ve1EDAjotVrnx6Or_c8g@mail.gmail.com/
    Closes: https://lore.kernel.org/bpf/CAK3+h2woEjG_N=-XzqEGaAeCmgu2eTCUc7p6bP4u8Q+DFHm-7g@mail.gmail.com/
    Signed-off-by: Hengqi Chen <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: BPF: Use move_addr() for BPF_PSEUDO_FUNC [+ + +]

Author: Hengqi Chen <[email protected]>
Date:   Sun Mar 30 16:31:09 2025 +0800

    LoongArch: BPF: Use move_addr() for BPF_PSEUDO_FUNC
    
    commit 52266f1015a8b5aabec7d127f83d105f702b388e upstream.
    
    Vincent reported that running the XDP synproxy program on LoongArch
    results in the following error:
    
        JIT doesn't support bpf-to-bpf calls
    
    With dmesg:
    
        multi-func JIT bug 1391 != 1390
    
    The root cause is that the verifier will refill the imm with the
    correct addresses of bpf_calls for BPF_PSEUDO_FUNC instructions and
    then run the last pass of JIT. So we generate different JIT code for
    the same instruction in two passes (one for the placeholder and the
    other for the real address). Let's use move_addr() instead.
    
    See commit 64f50f6575721ef0 ("LoongArch, bpf: Use 4 instructions for
    function address in JIT") for a similar fix.
    
    Cc: [email protected]
    Fixes: 69c087ba6225 ("bpf: Add bpf_for_each_map_elem() helper")
    Fixes: bb035ef0cc91 ("LoongArch: BPF: Support mixing bpf2bpf and tailcalls")
    Reported-by: Vincent Li <[email protected]>
    Tested-by: Vincent Li <[email protected]>
    Closes: https://lore.kernel.org/loongarch/CAK3+h2yfM9FTNiXvEQBkvtuoJrvzmN4c_NZsFXqEk4Cj1tsBNA@mail.gmail.com/T/#u
    Signed-off-by: Hengqi Chen <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

LoongArch: Fix help text of CMDLINE_EXTEND in Kconfig [+ + +]

Author: 谢致邦 (XIE Zhibang) <[email protected]>
Date:   Sun Mar 30 16:31:09 2025 +0800

    LoongArch: Fix help text of CMDLINE_EXTEND in Kconfig
    
    [ Upstream commit be216cbc1ddf99a51915414ce147311c0dfd50a2 ]
    
    It is the built-in command line appended to the bootloader command line,
    not the bootloader command line appended to the built-in command line.
    
    Fixes: fa96b57c1490 ("LoongArch: Add build infrastructure")
    Signed-off-by: 谢致邦 (XIE Zhibang) <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

LoongArch: Increase ARCH_DMA_MINALIGN up to 16 [+ + +]

Author: Huacai Chen <[email protected]>
Date:   Sun Mar 30 16:31:09 2025 +0800

    LoongArch: Increase ARCH_DMA_MINALIGN up to 16
    
    commit 4103cfe9dcb88010ae4911d3ff417457d1b6a720 upstream.
    
    ARCH_DMA_MINALIGN is 1 by default, but some LoongArch-specific devices
    (such as APBDMA) require 16-byte alignment. When the data buffer length
    is too small, the hardware may make an error when writing the cacheline.
    Thus, it is dangerous to allocate a small memory buffer for DMA. It is
    always safe to define ARCH_DMA_MINALIGN as L1_CACHE_BYTES, but that is
    unnecessary (kmalloc() needs small memory objects). Therefore, just
    increase it to 16.
    
    Cc: [email protected]
    Tested-by: Binbin Zhou <[email protected]>
    Signed-off-by: Huacai Chen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mdacon: rework dependency list [+ + +]

Author: Arnd Bergmann <[email protected]>
Date:   Tue Feb 25 17:44:23 2025 +0100

    mdacon: rework dependency list
    
    [ Upstream commit 5bbcc7645f4b244ffb5ac6563fbe9d3d42194447 ]
    
    mdacon has roughly the same dependencies as vgacon but expresses them
    as a negative list instead of a positive list, with the only practical
    difference being PowerPC/CHRP, which uses vga16fb instead of vgacon.
    
    The CONFIG_MDA_CONSOLE description advises to only turn it on when vgacon
    is also used because MDA/Hercules-only systems should be using vgacon
    instead, so just change the list to enforce that directly for simplicity.
    
    The probing was broken from 2002 to 2008; this improves on the fix
    that was added then: If vgacon is a loadable module, then mdacon
    cannot be built-in now, and the list of systems that support vgacon
    is carried over.
    
    Fixes: 0b9cf3aa6b1e ("mdacon messing up default vc's - set default to vc13-16 again")
    Signed-off-by: Arnd Bergmann <[email protected]>
    Reviewed-by: Thomas Zimmermann <[email protected]>
    Signed-off-by: Helge Deller <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

media: platform: allgro-dvt: unregister v4l2_device on the error path [+ + +]

Author: Joe Hattori <[email protected]>
Date:   Thu Dec 5 11:06:21 2024 +0900

    media: platform: allgro-dvt: unregister v4l2_device on the error path
    
    [ Upstream commit c2b96a6818159fba8a3bcc38262da9e77f9b3ec7 ]
    
    In allegro_probe(), the v4l2 device is not unregistered in the error
    path, which results in a memory leak. Fix it by calling
    v4l2_device_unregister() before returning error.
    
    Fixes: d74d4e2359ec ("media: allegro: move driver out of staging")
    Signed-off-by: Joe Hattori <[email protected]>
    Reviewed-by: Michael Tretter <[email protected]>
    Signed-off-by: Sebastian Fricke <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

media: streamzap: fix race between device disconnection and urb callback [+ + +]

Author: Murad Masimov <[email protected]>
Date:   Mon Jan 13 13:51:30 2025 +0300

    media: streamzap: fix race between device disconnection and urb callback
    
    commit f656cfbc7a293a039d6a0c7100e1c846845148c1 upstream.
    
    Syzkaller has reported a general protection fault at function
    ir_raw_event_store_with_filter(). This crash is caused by a NULL pointer
    dereference of dev->raw pointer, even though it is checked for NULL in
    the same function, which means there is a race condition. It occurs due
    to the incorrect order of actions in the streamzap_disconnect() function:
    rc_unregister_device() is called before usb_kill_urb(). The dev->raw
    pointer is freed and set to NULL in rc_unregister_device(), and only
    after that usb_kill_urb() waits for in-progress requests to finish.
    
    If rc_unregister_device() is called while streamzap_callback() handler is
    not finished, this can lead to accessing freed resources. Thus
    rc_unregister_device() should be called after usb_kill_urb().
    
    Found by Linux Verification Center (linuxtesting.org) with Syzkaller.
    
    Fixes: 8e9e60640067 ("V4L/DVB: staging/lirc: port lirc_streamzap to ir-core")
    Cc: [email protected]
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=34008406ee9a31b13c73
    Signed-off-by: Murad Masimov <[email protected]>
    Signed-off-by: Sean Young <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

media: verisilicon: HEVC: Initialize start_bit field [+ + +]

Author: Benjamin Gaignard <[email protected]>
Date:   Mon Jan 20 09:10:52 2025 +0100

    media: verisilicon: HEVC: Initialize start_bit field
    
    [ Upstream commit 7fcb42b3835e90ef18d68555934cf72adaf58402 ]
    
    The HEVC driver needs to set the start_bit field explicitly to avoid
    causing corrupted frames when the VP9 decoder is used in parallel. The
    reason for this problem is that the VP9 and the HEVC decoder share this
    register.
    
    Fixes: cb5dd5a0fa51 ("media: hantro: Introduce G2/HEVC decoder")
    Signed-off-by: Benjamin Gaignard <[email protected]>
    Tested-by: Nicolas Dufresne <[email protected]>
    Reviewed-by: Nicolas Dufresne <[email protected]>
    Signed-off-by: Sebastian Fricke <[email protected]>
    Signed-off-by: Hans Verkuil <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

memory: omap-gpmc: drop no compatible check [+ + +]

Author: Roger Quadros <[email protected]>
Date:   Mon Mar 10 15:15:14 2025 +0100

    memory: omap-gpmc: drop no compatible check
    
    [ Upstream commit edcccc6892f65eff5fd3027a13976131dc7fd733 ]
    
    We are no longer depending on legacy device trees so
    drop the no compatible check for NAND and OneNAND
    nodes.
    
    Suggested-by: Rob Herring (Arm) <[email protected]>
    Signed-off-by: Roger Quadros <[email protected]>
    Reviewed-by: Rob Herring (Arm) <[email protected]>
    Link: https://lore.kernel.org/r/20250114-omap-gpmc-drop-no-compatible-check-v1-1-262c8d549732@kernel.org
    Signed-off-by: Krzysztof Kozlowski <[email protected]>
    Signed-off-by: Arnd Bergmann <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

mfd: sm501: Switch to BIT() to mitigate integer overflows [+ + +]

Author: Nikita Zhandarovich <[email protected]>
Date:   Wed Jan 15 09:12:06 2025 -0800

    mfd: sm501: Switch to BIT() to mitigate integer overflows
    
    [ Upstream commit 2d8cb9ffe18c2f1e5bd07a19cbce85b26c1d0cf0 ]
    
    If offset ends up being high enough, the right-hand expression in
    functions like sm501_gpio_set(), shifted left by that number of bits,
    may not fit in the int type.
    
    Just in case, fix that by using BIT() both as an option safe from
    overflow issues and to make this step look similar to other gpio
    drivers.
    
    Found by Linux Verification Center (linuxtesting.org) with static
    analysis tool SVACE.
    
    Fixes: f61be273d369 ("sm501: add gpiolib support")
    Signed-off-by: Nikita Zhandarovich <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Lee Jones <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>
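
Editor's sketch (not part of the upstream commit): the difference between an int shift and BIT() can be seen in a few lines of C. The macro below mirrors the usual kernel definition of BIT() as an unsigned long shift; gpio_mask() is a hypothetical helper for illustration.

```c
/* Mirrors the usual kernel definition: the shift is done on unsigned long. */
#define BIT(nr) (1UL << (nr))

/*
 * Build a single-bit mask for a GPIO offset. With a plain "1 << offset",
 * offset 31 would overflow a 32-bit signed int, which is undefined
 * behaviour; the unsigned long shift is well defined for that offset.
 */
static unsigned long gpio_mask(unsigned int offset)
{
	return BIT(offset);
}
```

The sketch assumes offsets stay below the width of unsigned long; a larger shift count would still be undefined even with BIT().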

mmc: sdhci-omap: Disable MMC_CAP_AGGRESSIVE_PM for eMMC/SD [+ + +]

Author: Ulf Hansson <[email protected]>
Date:   Wed Mar 12 13:17:12 2025 +0100

    mmc: sdhci-omap: Disable MMC_CAP_AGGRESSIVE_PM for eMMC/SD
    
    commit 49d162635151d0dd04935070d7cf67137ab863aa upstream.
    
    We have received reports that cards can become corrupt in connection
    with the aggressive PM support. Let's make a partial revert of the
    change that enabled the feature.
    
    Reported-by: David Owens <[email protected]>
    Reported-by: Romain Naour <[email protected]>
    Reported-by: Robert Nelson <[email protected]>
    Tested-by: Robert Nelson <[email protected]>
    Fixes: 3edf588e7fe0 ("mmc: sdhci-omap: Allow SDIO card power off and enable aggressive PM")
    Cc: [email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Reviewed-by: Tony Lindgren <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

mmc: sdhci-pxav3: set NEED_RSP_BUSY capability [+ + +]

Author: Karel Balej <[email protected]>
Date:   Mon Mar 10 15:07:04 2025 +0100

    mmc: sdhci-pxav3: set NEED_RSP_BUSY capability
    
    commit a41fcca4b342811b473bbaa4b44f1d34d87fcce6 upstream.
    
    Set the MMC_CAP_NEED_RSP_BUSY capability for the sdhci-pxav3 host to
    prevent conversion of R1B responses to R1. Without this, the eMMC card
    in the samsung,coreprimevelte smartphone using the Marvell PXA1908 SoC
    with this mmc host fails to probe, with an ETIMEDOUT error originating
    in __mmc_poll_for_busy.
    
    Note that the other issues reported for this phone and host, namely
    floods of "Tuning failed, falling back to fixed sampling clock" dmesg
    messages for the eMMC and unstable SDIO are not mitigated by this
    change.
    
    Link: https://lore.kernel.org/r/[email protected]/
    Link: https://lore.kernel.org/r/[email protected]/
    Link: https://lore.kernel.org/r/[email protected]/
    Cc: [email protected]
    Signed-off-by: Karel Balej <[email protected]>
    Acked-by: Adrian Hunter <[email protected]>
    Tested-by: Duje Mihanović <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Ulf Hansson <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

net/mlx5e: SHAMPO, Make reserved size independent of page size [+ + +]

Author: Lama Kayal <[email protected]>
Date:   Sun Mar 23 14:28:26 2025 +0200

    net/mlx5e: SHAMPO, Make reserved size independent of page size
    
    [ Upstream commit fab05835688526f9de123d1e98e4d1f838da4e22 ]
    
    When hw-gro is enabled, the maximum number of header entries that are
    needed per wqe (hd_per_wqe) is calculated based on the size of the
    reservations among other parameters.
    
    Miscalculation of the size of reservations leads to hd_per_wqe being
    incorrectly calculated as 0, particularly with large page sizes such
    as on aarch64. This prevents the SHAMPO header from being correctly
    initialized in the device, ultimately causing the following CQE error
    that indicates a violation of PD.
    
     mlx5_core 0000:00:08.0 eth2: ERR CQE on RQ: 0x1180
     mlx5_core 0000:00:08.0 eth2: Error cqe on cqn 0x510, ci 0x0, qn 0x1180, opcode 0xe, syndrome  0x4, vendor syndrome 0x32
     00000000: 00 00 00 00 04 4a 00 00 00 00 00 00 20 00 93 32
     00000010: 55 00 00 00 fb cc 00 00 00 00 00 00 07 18 00 00
     00000020: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4a
     00000030: 00 00 00 9a 93 00 32 04 00 00 00 00 00 00 da e1
    
    Use the correct formula for calculating the size of reservations: it
    should not depend on page size, so use the correct multiple of
    MLX5E_SHAMPO_WQ_BASE_RESRV_SIZE instead.
    
    Fixes: e5ca8fb08ab2 ("net/mlx5e: Add control path for SHAMPO feature")
    Signed-off-by: Lama Kayal <[email protected]>
    Reviewed-by: Dragos Tatulea <[email protected]>
    Signed-off-by: Tariq Toukan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: dsa: mv88e6xxx: propperly shutdown PPU re-enable timer on destroy [+ + +]

Author: David Oberhollenzer <[email protected]>
Date:   Tue Apr 1 15:56:37 2025 +0200

    net: dsa: mv88e6xxx: propperly shutdown PPU re-enable timer on destroy
    
    [ Upstream commit a58d882841a0750da3c482cd3d82432b1c7edb77 ]
    
    The mv88e6xxx has an internal PPU that polls PHY state. If we want to
    access the internal PHYs, we need to disable the PPU first. Because
    that is a slow operation, a 10ms timer is used to re-enable it,
    canceled with every access, so bulk operations effectively only
    disable it once and re-enable it some 10ms after the last access.
    
    If a PHY is accessed and then the mv88e6xxx module is removed before
    the 10ms are up, the PPU re-enable ends up accessing a dangling pointer.
    
    This especially affects probing during bootup. The MDIO bus and PHY
    registration may succeed, but registration with the DSA framework
    may fail later on (e.g. because the CPU port depends on another,
    very slow device that isn't done probing yet, returning -EPROBE_DEFER).
    In this case, probe() fails, but the MDIO subsystem may already have
    accessed the MDIO bus or PHYs, arming the timer.
    
    This is fixed as follows:
     - If probe fails after mv88e6xxx_phy_init(), make sure we also call
       mv88e6xxx_phy_destroy() before returning
     - In mv88e6xxx_remove(), make sure we do the teardown in the correct
       order, calling mv88e6xxx_phy_destroy() after unregistering the
       switch device.
     - In mv88e6xxx_phy_destroy(), destroy both the timer and the work item
       that the timer might schedule, synchronously waiting in case one of
       the callbacks already fired and destroying the timer first, before
       waiting for the work item.
     - Access to the PPU is guarded by a mutex, the worker acquires it
       with a mutex_trylock(), not proceeding with the expensive shutdown
       if that fails. We grab the mutex in mv88e6xxx_phy_destroy() to make
       sure the slow PPU shutdown is already done or won't even enter, when
       we wait for the work item.
    
    Fixes: 2e5f032095ff ("dsa: add support for the Marvell 88E6131 switch chip")
    Signed-off-by: David Oberhollenzer <[email protected]>
    Reviewed-by: Vladimir Oltean <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: fix geneve_opt length integer overflow [+ + +]

Author: Lin Ma <[email protected]>
Date:   Thu Apr 3 00:56:32 2025 +0800

    net: fix geneve_opt length integer overflow
    
    [ Upstream commit b27055a08ad4b415dcf15b63034f9cb236f7fb40 ]
    
    struct geneve_opt uses a 5-bit length field for each option, which
    means every variable-size option must be smaller than 128 bytes.
    
    However, none of the current related Netlink policies enforce this
    length condition, so an attacker can use an option of exactly 128
    bytes to *fake* a zero-length option, confuse the parsing logic, and
    thereby achieve a heap out-of-bounds read.
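
    The wrap-around can be sketched in a few lines of Python. This is a
    model only: it relies on the Geneve option length being counted in
    4-byte words, and encode_length() and decoded_bytes() are hypothetical
    helper names, not kernel functions.

```python
LEN_BITS = 5
LEN_MASK = (1 << LEN_BITS) - 1   # 0x1f, the largest storable word count


def encode_length(nbytes: int) -> int:
    """Store an option data size (bytes) in a 5-bit field of 4-byte words."""
    if nbytes % 4:
        raise ValueError("option data must be a multiple of 4 bytes")
    return (nbytes // 4) & LEN_MASK   # higher bits are silently dropped


def decoded_bytes(nbytes: int) -> int:
    """Size a parser would later reconstruct from the stored field."""
    return encode_length(nbytes) * 4


for size in (4, 124, 128):
    print(size, "->", decoded_bytes(size))
```

    In this model 124 bytes is the largest size that survives the round
    trip, which matches the "Read of size 124" in the log below, while a
    128-byte option comes back as a zero-length one.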
    
    One example crash log is like below:
    
    [    3.905425] ==================================================================
    [    3.905925] BUG: KASAN: slab-out-of-bounds in nla_put+0xa9/0xe0
    [    3.906255] Read of size 124 at addr ffff888005f291cc by task poc/177
    [    3.906646]
    [    3.906775] CPU: 0 PID: 177 Comm: poc-oob-read Not tainted 6.1.132 #1
    [    3.907131] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.0-0-gd239552ce722-prebuilt.qemu.org 04/01/2014
    [    3.907784] Call Trace:
    [    3.907925]  <TASK>
    [    3.908048]  dump_stack_lvl+0x44/0x5c
    [    3.908258]  print_report+0x184/0x4be
    [    3.909151]  kasan_report+0xc5/0x100
    [    3.909539]  kasan_check_range+0xf3/0x1a0
    [    3.909794]  memcpy+0x1f/0x60
    [    3.909968]  nla_put+0xa9/0xe0
    [    3.910147]  tunnel_key_dump+0x945/0xba0
    [    3.911536]  tcf_action_dump_1+0x1c1/0x340
    [    3.912436]  tcf_action_dump+0x101/0x180
    [    3.912689]  tcf_exts_dump+0x164/0x1e0
    [    3.912905]  fw_dump+0x18b/0x2d0
    [    3.913483]  tcf_fill_node+0x2ee/0x460
    [    3.914778]  tfilter_notify+0xf4/0x180
    [    3.915208]  tc_new_tfilter+0xd51/0x10d0
    [    3.918615]  rtnetlink_rcv_msg+0x4a2/0x560
    [    3.919118]  netlink_rcv_skb+0xcd/0x200
    [    3.919787]  netlink_unicast+0x395/0x530
    [    3.921032]  netlink_sendmsg+0x3d0/0x6d0
    [    3.921987]  __sock_sendmsg+0x99/0xa0
    [    3.922220]  __sys_sendto+0x1b7/0x240
    [    3.922682]  __x64_sys_sendto+0x72/0x90
    [    3.922906]  do_syscall_64+0x5e/0x90
    [    3.923814]  entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    [    3.924122] RIP: 0033:0x7e83eab84407
    [    3.924331] Code: 48 89 fa 4c 89 df e8 38 aa 00 00 8b 93 08 03 00 00 59 5e 48 83 f8 fc 74 1a 5b c3 0f 1f 84 00 00 00 00 00 48 8b 44 24 10 0f 05 <5b> c3 0f 1f 80 00 00 00 00 83 e2 39 83 faf
    [    3.925330] RSP: 002b:00007ffff505e370 EFLAGS: 00000202 ORIG_RAX: 000000000000002c
    [    3.925752] RAX: ffffffffffffffda RBX: 00007e83eaafa740 RCX: 00007e83eab84407
    [    3.926173] RDX: 00000000000001a8 RSI: 00007ffff505e3c0 RDI: 0000000000000003
    [    3.926587] RBP: 00007ffff505f460 R08: 00007e83eace1000 R09: 000000000000000c
    [    3.926977] R10: 0000000000000000 R11: 0000000000000202 R12: 00007ffff505f3c0
    [    3.927367] R13: 00007ffff505f5c8 R14: 00007e83ead1b000 R15: 00005d4fbbe6dcb8
    
    Fix these issues by enforcing correct length condition in related
    policies.
    
    Fixes: 925d844696d9 ("netfilter: nft_tunnel: add support for geneve opts")
    Fixes: 4ece47787077 ("lwtunnel: add options setting and dumping for geneve")
    Fixes: 0ed5269f9e41 ("net/sched: add tunnel option support to act_tunnel_key")
    Fixes: 0a6e77784f49 ("net/sched: allow flower to match tunnel options")
    Signed-off-by: Lin Ma <[email protected]>
    Reviewed-by: Xin Long <[email protected]>
    Acked-by: Cong Wang <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net: mvpp2: Prevent parser TCAM memory corruption [+ + +]

Author: Tobias Waldekranz <[email protected]>
Date:   Tue Apr 1 08:58:04 2025 +0200

    net: mvpp2: Prevent parser TCAM memory corruption
    
    [ Upstream commit 96844075226b49af25a69a1d084b648ec2d9b08d ]
    
    Protect the parser TCAM/SRAM memory, and the cached (shadow) SRAM
    information, from concurrent modifications.
    
    Both the TCAM and SRAM tables are indirectly accessed by configuring
    an index register that selects the row to read or write to. This means
    that operations must be atomic in order to, e.g., avoid spreading
    writes across multiple rows. Since the shadow SRAM array is used to
    find free rows in the hardware table, it must also be protected in
    order to avoid TOCTOU errors where multiple cores allocate the same
    row.
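
    The hazard of an indirectly addressed table can be shown with a small
    deterministic model. Everything here is hypothetical (class and method
    names, the 4-row table, and the choice of lock type); it only
    illustrates why selecting a row and writing its data must be one
    atomic step.

```python
import threading


class IndirectTable:
    """Hypothetical table reached through a single shared index register."""

    def __init__(self, rows: int):
        self.rows = [None] * rows
        self.index = 0                 # models the index register
        self.lock = threading.Lock()   # assumption: any mutual exclusion works

    def select(self, row: int) -> None:
        self.index = row

    def write_data(self, value) -> None:
        self.rows[self.index] = value

    def write_row(self, row: int, value) -> None:
        """Select and write as one step, protected by the lock."""
        with self.lock:
            self.select(row)
            self.write_data(value)


def interleaved_unlocked():
    """Replay an unlucky interleaving of two CPUs without any locking."""
    table = IndirectTable(4)
    table.select(1)          # CPU A selects row 1
    table.select(2)          # CPU B selects row 2 before A has written
    table.write_data("A")    # A's data lands in row 2
    table.write_data("B")    # B overwrites it; row 1 is never written
    return table.rows


def serialized():
    """Same two writes, each done atomically via write_row()."""
    table = IndirectTable(4)
    table.write_row(1, "A")
    table.write_row(2, "B")
    return table.rows
```

    In the unlocked replay row 1 stays empty and A's data is lost, whereas
    the serialized version fills both rows as intended.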
    
    This issue was detected in a situation where `mvpp2_set_rx_mode()` ran
    concurrently on two CPUs. In this particular case the
    MVPP2_PE_MAC_UC_PROMISCUOUS entry was corrupted, causing the
    classifier unit to drop all incoming unicast - indicated by the
    `rx_classifier_drops` counter.
    
    Fixes: 3f518509dedc ("ethernet: Add new driver for Marvell Armada 375 network unit")
    Signed-off-by: Tobias Waldekranz <[email protected]>
    Reviewed-by: Maxime Chevallier <[email protected]>
    Tested-by: Maxime Chevallier <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

net_sched: skbprio: Remove overly strict queue assertions [+ + +]

Author: Cong Wang <[email protected]>
Date:   Sat Mar 29 15:25:35 2025 -0700

    net_sched: skbprio: Remove overly strict queue assertions
    
    [ Upstream commit ce8fe975fd99b49c29c42e50f2441ba53112b2e8 ]
    
    In the current implementation, skbprio enqueue/dequeue contains an assertion
    that fails under certain conditions when SKBPRIO is used as a child qdisc under
    TBF with specific parameters. The failure occurs because TBF sometimes peeks at
    packets in the child qdisc without actually dequeuing them when tokens are
    unavailable.
    
    This peek operation creates a discrepancy between the parent and child qdisc
    queue length counters. When TBF later receives a high-priority packet,
    SKBPRIO's queue length may show a different value than what's reflected in its
    internal priority queue tracking, triggering the assertion.
    
    The fix removes these overly strict assertions in SKBPRIO; they are
    not necessary at all.
    
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=a3422a19b05ea96bee18
    Fixes: aea5f654e6b7 ("net/sched: add skbprio scheduler")
    Cc: Nishanth Devarajan <[email protected]>
    Signed-off-by: Cong Wang <[email protected]>
    Acked-by: Paolo Abeni <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nft_set_hash: GC reaps elements with conncount for dynamic sets only [+ + +]

Author: Pablo Neira Ayuso <[email protected]>
Date:   Fri Mar 21 23:24:20 2025 +0100

    netfilter: nft_set_hash: GC reaps elements with conncount for dynamic sets only
    
    [ Upstream commit 9d74da1177c800eb3d51c13f9821b7b0683845a5 ]
    
    conncount has its own GC handler which determines when to reap stale
    elements; this is convenient for dynamic sets. However, this also reaps
    elements of non-dynamic sets with static configurations coming from
    the control plane. Always run the connlimit GC handler, but honor its
    feedback to reap an element only if the set is dynamic.
    
    Fixes: 290180e2448c ("netfilter: nf_tables: add connlimit support")
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netfilter: nft_tunnel: fix geneve_opt type confusion addition [+ + +]

Author: Lin Ma <[email protected]>
Date:   Thu Apr 3 01:00:26 2025 +0800

    netfilter: nft_tunnel: fix geneve_opt type confusion addition
    
    [ Upstream commit 1b755d8eb1ace3870789d48fbd94f386ad6e30be ]
    
    When handling multiple NFTA_TUNNEL_KEY_OPTS_GENEVE attributes, the
    parsing logic should place every geneve_opt structure one by one
    compactly. Hence, when deciding the next geneve_opt position, the
    pointer addition should be in units of char *.
    
    However, the current implementation erroneously does type conversion
    before the addition, which will lead to heap out-of-bounds write.
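
    The effect of casting before adding can be modelled with plain
    integers standing in for addresses. The sketch assumes that
    sizeof(struct geneve_opt) is 4 bytes (the fixed option header); the
    function names and the sample address are hypothetical.

```python
SIZEOF_GENEVE_OPT = 4   # assumption: size of the fixed option header


def next_opt_cast_first(base: int, used_bytes: int) -> int:
    """Model of `(struct geneve_opt *)base + used_bytes`.

    The addition happens on a struct pointer, so the byte count is
    scaled by the struct size.
    """
    return base + used_bytes * SIZEOF_GENEVE_OPT


def next_opt_add_first(base: int, used_bytes: int) -> int:
    """Model of `(struct geneve_opt *)((char *)base + used_bytes)`.

    The addition happens in bytes and only the result is converted.
    """
    return base + used_bytes


base = 0x1000   # hypothetical buffer address
print(hex(next_opt_cast_first(base, 8)), hex(next_opt_add_first(base, 8)))
```

    With 8 bytes of options already stored, the cast-first form lands 32
    bytes past the start of the buffer instead of 8, so later options are
    written beyond the space reserved for them.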
    
    [    6.989857] ==================================================================
    [    6.990293] BUG: KASAN: slab-out-of-bounds in nft_tunnel_obj_init+0x977/0xa70
    [    6.990725] Write of size 124 at addr ffff888005f18974 by task poc/178
    [    6.991162]
    [    6.991259] CPU: 0 PID: 178 Comm: poc-oob-write Not tainted 6.1.132 #1
    [    6.991655] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS rel-1.16.0-0-gd239552ce722-prebuilt.qemu.org 04/01/2014
    [    6.992281] Call Trace:
    [    6.992423]  <TASK>
    [    6.992586]  dump_stack_lvl+0x44/0x5c
    [    6.992801]  print_report+0x184/0x4be
    [    6.993790]  kasan_report+0xc5/0x100
    [    6.994252]  kasan_check_range+0xf3/0x1a0
    [    6.994486]  memcpy+0x38/0x60
    [    6.994692]  nft_tunnel_obj_init+0x977/0xa70
    [    6.995677]  nft_obj_init+0x10c/0x1b0
    [    6.995891]  nf_tables_newobj+0x585/0x950
    [    6.996922]  nfnetlink_rcv_batch+0xdf9/0x1020
    [    6.998997]  nfnetlink_rcv+0x1df/0x220
    [    6.999537]  netlink_unicast+0x395/0x530
    [    7.000771]  netlink_sendmsg+0x3d0/0x6d0
    [    7.001462]  __sock_sendmsg+0x99/0xa0
    [    7.001707]  ____sys_sendmsg+0x409/0x450
    [    7.002391]  ___sys_sendmsg+0xfd/0x170
    [    7.003145]  __sys_sendmsg+0xea/0x170
    [    7.004359]  do_syscall_64+0x5e/0x90
    [    7.005817]  entry_SYSCALL_64_after_hwframe+0x6e/0xd8
    [    7.006127] RIP: 0033:0x7ec756d4e407
    [    7.006339] Code: 48 89 fa 4c 89 df e8 38 aa 00 00 8b 93 08 03 00 00 59 5e 48 83 f8 fc 74 1a 5b c3 0f 1f 84 00 00 00 00 00 48 8b 44 24 10 0f 05 <5b> c3 0f 1f 80 00 00 00 00 83 e2 39 83 faf
    [    7.007364] RSP: 002b:00007ffed5d46760 EFLAGS: 00000202 ORIG_RAX: 000000000000002e
    [    7.007827] RAX: ffffffffffffffda RBX: 00007ec756cc4740 RCX: 00007ec756d4e407
    [    7.008223] RDX: 0000000000000000 RSI: 00007ffed5d467f0 RDI: 0000000000000003
    [    7.008620] RBP: 00007ffed5d468a0 R08: 0000000000000000 R09: 0000000000000000
    [    7.009039] R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000000
    [    7.009429] R13: 00007ffed5d478b0 R14: 00007ec756ee5000 R15: 00005cbd4e655cb8
    
    Fix this bug with correct pointer addition and conversion in parse
    and dump code.
    
    Fixes: 925d844696d9 ("netfilter: nft_tunnel: add support for geneve opts")
    Signed-off-by: Lin Ma <[email protected]>
    Signed-off-by: Pablo Neira Ayuso <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

netlabel: Fix NULL pointer exception caused by CALIPSO on IPv4 sockets [+ + +]

Author: Debin Zhu <[email protected]>
Date:   Tue Apr 1 20:40:18 2025 +0800

    netlabel: Fix NULL pointer exception caused by CALIPSO on IPv4 sockets
    
    [ Upstream commit 078aabd567de3d63d37d7673f714e309d369e6e2 ]
    
    When calling netlbl_conn_setattr(), addr->sa_family is used
    to determine the function behavior. If sk is an IPv4 socket,
    but the connect function is called with an IPv6 address,
    the function calipso_sock_setattr() is triggered.
    Inside this function, the following code is executed:
    
    sk_fullsock(__sk) ? inet_sk(__sk)->pinet6 : NULL;
    
    Since sk is an IPv4 socket, pinet6 is NULL, leading to a
    null pointer dereference.
    
    This patch fixes the issue by checking if inet6_sk(sk)
    returns a NULL pointer before accessing pinet6.
    
    Signed-off-by: Debin Zhu <[email protected]>
    Signed-off-by: Bitao Ouyang <[email protected]>
    Acked-by: Paul Moore <[email protected]>
    Fixes: ceba1832b1b2 ("calipso: Set the calipso socket label to match the secattr.")
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nfsd: put dl_stid if fail to queue dl_recall [+ + +]

Author: Li Lingfeng <[email protected]>
Date:   Thu Feb 13 22:42:20 2025 +0800

    nfsd: put dl_stid if fail to queue dl_recall
    
    commit 230ca758453c63bd38e4d9f4a21db698f7abada8 upstream.
    
    Before calling nfsd4_run_cb to queue dl_recall to the callback_wq, we
    increment the reference count of dl_stid.
    We expect that after the corresponding work_struct is processed, the
    reference count of dl_stid will be decremented through the callback
    function nfsd4_cb_recall_release.
    However, if the call to nfsd4_run_cb fails, the incremented reference
    count of dl_stid will not be decremented correspondingly, leading to the
    following nfs4_stid leak:
    unreferenced object 0xffff88812067b578 (size 344):
      comm "nfsd", pid 2761, jiffies 4295044002 (age 5541.241s)
      hex dump (first 32 bytes):
        01 00 00 00 6b 6b 6b 6b b8 02 c0 e2 81 88 ff ff  ....kkkk........
        00 6b 6b 6b 6b 6b 6b 6b 00 00 00 00 ad 4e ad de  .kkkkkkk.....N..
      backtrace:
        kmem_cache_alloc+0x4b9/0x700
        nfsd4_process_open1+0x34/0x300
        nfsd4_open+0x2d1/0x9d0
        nfsd4_proc_compound+0x7a2/0xe30
        nfsd_dispatch+0x241/0x3e0
        svc_process_common+0x5d3/0xcc0
        svc_process+0x2a3/0x320
        nfsd+0x180/0x2e0
        kthread+0x199/0x1d0
        ret_from_fork+0x30/0x50
        ret_from_fork_asm+0x1b/0x30
    unreferenced object 0xffff8881499f4d28 (size 368):
      comm "nfsd", pid 2761, jiffies 4295044005 (age 5541.239s)
      hex dump (first 32 bytes):
        01 00 00 00 00 00 00 00 30 4d 9f 49 81 88 ff ff  ........0M.I....
        30 4d 9f 49 81 88 ff ff 20 00 00 00 01 00 00 00  0M.I.... .......
      backtrace:
        kmem_cache_alloc+0x4b9/0x700
        nfs4_alloc_stid+0x29/0x210
        alloc_init_deleg+0x92/0x2e0
        nfs4_set_delegation+0x284/0xc00
        nfs4_open_delegation+0x216/0x3f0
        nfsd4_process_open2+0x2b3/0xee0
        nfsd4_open+0x770/0x9d0
        nfsd4_proc_compound+0x7a2/0xe30
        nfsd_dispatch+0x241/0x3e0
        svc_process_common+0x5d3/0xcc0
        svc_process+0x2a3/0x320
        nfsd+0x180/0x2e0
        kthread+0x199/0x1d0
        ret_from_fork+0x30/0x50
        ret_from_fork_asm+0x1b/0x30
    Fix it by checking the result of nfsd4_run_cb and calling
    nfs4_put_stid if queuing dl_recall fails.
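
    The reference-count imbalance can be sketched as follows. This is a
    simplified model, not the nfsd code: Stid, run_cb(), recall_leaky()
    and recall_fixed() are hypothetical stand-ins, and run_cb() simply
    returns whether queuing succeeded.

```python
class Stid:
    """Hypothetical reference-counted object standing in for dl_stid."""

    def __init__(self):
        self.refcount = 1

    def get(self) -> None:
        self.refcount += 1

    def put(self) -> None:
        self.refcount -= 1


def run_cb(queue_ok: bool) -> bool:
    """Stand-in for queuing the callback; True means it was queued."""
    return queue_ok


def recall_leaky(stid: Stid, queue_ok: bool) -> None:
    stid.get()            # reference meant for the queued callback
    run_cb(queue_ok)      # result ignored: a failure leaks the reference


def recall_fixed(stid: Stid, queue_ok: bool) -> None:
    stid.get()
    if not run_cb(queue_ok):
        stid.put()        # nothing was queued, so drop the reference here
```

    When queuing succeeds, both variants leave the extra reference in
    place for the release callback to drop; they differ only on failure.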
    
    Cc: [email protected]
    Signed-off-by: Li Lingfeng <[email protected]>
    Reviewed-by: Jeff Layton <[email protected]>
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSD: Skip sending CB_RECALL_ANY when the backchannel isn't up [+ + +]

Author: Chuck Lever <[email protected]>
Date:   Tue Jan 14 17:09:24 2025 -0500

    NFSD: Skip sending CB_RECALL_ANY when the backchannel isn't up
    
    commit 8a388c1fabeb6606e16467b23242416c0dbeffad upstream.
    
    NFSD sends CB_RECALL_ANY to clients when the server is low on
    memory or that client has a large number of delegations outstanding.
    
    We've seen cases where NFSD attempts to send CB_RECALL_ANY requests
    to disconnected clients, and gets confused. These calls never go
    anywhere if a backchannel transport to the target client isn't
    available. Before the server can send any backchannel operation, the
    client has to connect first and then do a BIND_CONN_TO_SESSION.
    
    This patch doesn't address the root cause of the confusion, but
    there's no need to queue up these optional operations if they can't
    go anywhere.
    
    Fixes: 44df6f439a17 ("NFSD: add delegation reaper to react to low memory condition")
    Reviewed-by: Jeff Layton <[email protected]>
    Cc: [email protected]
    Signed-off-by: Chuck Lever <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

NFSv4: Don't trigger uneccessary scans for return-on-close delegations [+ + +]

Author: Trond Myklebust <[email protected]>
Date:   Tue Feb 18 16:50:30 2025 -0500

    NFSv4: Don't trigger uneccessary scans for return-on-close delegations
    
    [ Upstream commit 47acca884f714f41d95dc654f802845544554784 ]
    
    The amount of looping through the list of delegations is occasionally
    leading to soft lockups. Avoid at least some loops by not requiring the
    NFSv4 state manager to scan for delegations that are marked for
    return-on-close. Instead, either mark them for immediate return (if
    possible) or else leave it up to nfs4_inode_return_delegation_on_close()
    to return them once the file is closed by the application.
    
    Fixes: b757144fd77c ("NFSv4: Be less aggressive about returning delegations for open files")
    Signed-off-by: Trond Myklebust <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ntb: intel: Fix using link status DB's [+ + +]

Author: Nikita Shubin <[email protected]>
Date:   Thu Jun 6 11:15:19 2024 +0300

    ntb: intel: Fix using link status DB's
    
    [ Upstream commit 8144e9c8f30fb23bb736a5d24d5c9d46965563c4 ]
    
    Make sure we are not using DB's which were remapped for link status.
    
    Fixes: f6e51c354b60 ("ntb: intel: split out the gen3 code")
    Signed-off-by: Nikita Shubin <[email protected]>
    Reviewed-by: Dave Jiang <[email protected]>
    Signed-off-by: Jon Mason <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ntb_hw_switchtec: Fix shift-out-of-bounds in switchtec_ntb_mw_set_trans [+ + +]

Author: Yajun Deng <[email protected]>
Date:   Wed Aug 16 16:33:05 2023 +0800

    ntb_hw_switchtec: Fix shift-out-of-bounds in switchtec_ntb_mw_set_trans
    
    [ Upstream commit de203da734fae00e75be50220ba5391e7beecdf9 ]
    
    The kernel API ntb_mw_clear_trans() passes 0 for both addr and size.
    This makes xlate_pos negative.
    
    [   23.734156] switchtec switchtec0: MW 0: part 0 addr 0x0000000000000000 size 0x0000000000000000
    [   23.734158] ================================================================================
    [   23.734172] UBSAN: shift-out-of-bounds in drivers/ntb/hw/mscc/ntb_hw_switchtec.c:293:7
    [   23.734418] shift exponent -1 is negative
    
    Ensure xlate_pos is zero or positive before using BIT().
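
    A small model of the guard, under the assumption that xlate_pos is
    derived from a base-2 logarithm of size. ilog2() here is a Python
    stand-in chosen to yield -1 for 0, matching the "shift exponent -1"
    in the log above; misaligned() is a hypothetical helper.

```python
def ilog2(value: int) -> int:
    """Index of the highest set bit; yields -1 for 0 in this model."""
    return value.bit_length() - 1


def misaligned(addr: int, size: int) -> bool:
    """Check addr against a power-of-two boundary derived from size."""
    pos = ilog2(size)
    if pos < 0:
        # size == 0 (the clear case): there is no boundary to check,
        # and shifting by a negative amount must be avoided.
        return False
    return addr % (1 << pos) != 0


print(ilog2(0), misaligned(0, 0), misaligned(0x1000, 0x2000))
```

    Python raises ValueError for a negative shift count, whereas in C the
    same shift is undefined behaviour, which is what UBSAN reports.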
    
    Fixes: 1e2fd202f859 ("ntb_hw_switchtec: Check for alignment of the buffer in mw_set_trans()")
    Signed-off-by: Yajun Deng <[email protected]>
    Reviewed-by: Logan Gunthorpe <[email protected]>
    Signed-off-by: Jon Mason <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ntb_perf: Delete duplicate dmaengine_unmap_put() call in perf_copy_chunk() [+ + +]

Author: Markus Elfring <[email protected]>
Date:   Mon Sep 23 10:38:11 2024 +0200

    ntb_perf: Delete duplicate dmaengine_unmap_put() call in perf_copy_chunk()
    
    commit 4279e72cab31dd3eb8c89591eb9d2affa90ab6aa upstream.
    
    The function call “dmaengine_unmap_put(unmap)” was used in an if branch.
    The same call was immediately triggered by a subsequent goto statement.
    Thus avoid such a call repetition.
    
    This issue was detected by using the Coccinelle software.
    
    Fixes: 5648e56d03fa ("NTB: ntb_perf: Add full multi-port NTB API support")
    Cc: [email protected]
    Signed-off-by: Markus Elfring <[email protected]>
    Signed-off-by: Jon Mason <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

nvme-pci: clean up CMBMSC when registering CMB fails [+ + +]

Author: Icenowy Zheng <[email protected]>
Date:   Thu Feb 13 01:04:43 2025 +0800

    nvme-pci: clean up CMBMSC when registering CMB fails
    
    [ Upstream commit 6a3572e10f740acd48e2713ef37e92186a3ce5e8 ]
    
    CMB decoding should get disabled when the CMB block isn't successfully
    registered with the P2P DMA subsystem.
    
    Clean up the CMBMSC register in this error handling codepath to disable
    CMB decoding (and CMBLOC/CMBSZ registers).
    
    Signed-off-by: Icenowy Zheng <[email protected]>
    Reviewed-by: Christoph Hellwig <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nvme-pci: skip CMB blocks incompatible with PCI P2P DMA [+ + +]

Author: Icenowy Zheng <[email protected]>
Date:   Thu Feb 13 01:04:44 2025 +0800

    nvme-pci: skip CMB blocks incompatible with PCI P2P DMA
    
    [ Upstream commit 56cf7ef0d490b28fad8f8629fc135c5ab7c9f54e ]
    
    The PCI P2PDMA code will register the CMB block to the memory
    hot-plugging subsystem, which has an alignment requirement. Memory
    blocks that do not satisfy this alignment requirement (usually 2MB) will
    lead to a WARNING from memory hotplugging.
    
    Verify the CMB block's address and size against the alignment and only
    try to send CMB blocks compatible with it to prevent this warning.
    
    Tested on Intel DC D4502 SSD, which has a 512K CMB block that is too
    small for memory hotplugging (thus PCI P2PDMA).
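
    The check amounts to testing both the address and the size against
    the alignment. A minimal sketch, assuming the 2MB figure mentioned
    above; cmb_usable() and the sample addresses are hypothetical.

```python
ALIGN = 2 * 1024 * 1024   # "usually 2MB", taken from the text above


def cmb_usable(addr: int, size: int, align: int = ALIGN) -> bool:
    """True if both the CMB address and its size are multiples of align."""
    return addr % align == 0 and size % align == 0


# A 512K block fails on size alone, even at an aligned address.
print(cmb_usable(0xF0000000, 512 * 1024))
print(cmb_usable(0xF0000000, 4 * 1024 * 1024))
```

    A block that fails the test is simply skipped rather than handed to
    the hotplug code, which is what avoids the warning.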
    
    Signed-off-by: Icenowy Zheng <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

nvme-tcp: fix possible UAF in nvme_tcp_poll [+ + +]

Author: Sagi Grimberg <[email protected]>
Date:   Thu Feb 20 13:18:30 2025 +0200

    nvme-tcp: fix possible UAF in nvme_tcp_poll
    
    [ Upstream commit 8c1624b63a7d24142a2bbc3a5ee7e95f004ea36e ]
    
    nvme_tcp_poll() may race with the send path error handler because
    it may complete the request while it is actively being polled for
    completion, resulting in a UAF panic [1]:
    
    We should make sure to stop polling when we see an error when
    trying to read from the socket. Hence make sure to propagate the
    error so that the block layer breaks the polling cycle.
    
    [1]:
    --
    [35665.692310] nvme nvme2: failed to send request -13
    [35665.702265] nvme nvme2: unsupported pdu type (3)
    [35665.702272] BUG: kernel NULL pointer dereference, address: 0000000000000000
    [35665.702542] nvme nvme2: queue 1 receive failed:  -22
    [35665.703209] #PF: supervisor write access in kernel mode
    [35665.703213] #PF: error_code(0x0002) - not-present page
    [35665.703214] PGD 8000003801cce067 P4D 8000003801cce067 PUD 37e6f79067 PMD 0
    [35665.703220] Oops: 0002 [#1] SMP PTI
    [35665.703658] nvme nvme2: starting error recovery
    [35665.705809] Hardware name: Inspur aaabbb/YZMB-00882-104, BIOS 4.1.26 09/22/2022
    [35665.705812] Workqueue: kblockd blk_mq_requeue_work
    [35665.709172] RIP: 0010:_raw_spin_lock+0xc/0x30
    [35665.715788] Call Trace:
    [35665.716201]  <TASK>
    [35665.716613]  ? show_trace_log_lvl+0x1c1/0x2d9
    [35665.717049]  ? show_trace_log_lvl+0x1c1/0x2d9
    [35665.717457]  ? blk_mq_request_bypass_insert+0x2c/0xb0
    [35665.717950]  ? __die_body.cold+0x8/0xd
    [35665.718361]  ? page_fault_oops+0xac/0x140
    [35665.718749]  ? blk_mq_start_request+0x30/0xf0
    [35665.719144]  ? nvme_tcp_queue_rq+0xc7/0x170 [nvme_tcp]
    [35665.719547]  ? exc_page_fault+0x62/0x130
    [35665.719938]  ? asm_exc_page_fault+0x22/0x30
    [35665.720333]  ? _raw_spin_lock+0xc/0x30
    [35665.720723]  blk_mq_request_bypass_insert+0x2c/0xb0
    [35665.721101]  blk_mq_requeue_work+0xa5/0x180
    [35665.721451]  process_one_work+0x1e8/0x390
    [35665.721809]  worker_thread+0x53/0x3d0
    [35665.722159]  ? process_one_work+0x390/0x390
    [35665.722501]  kthread+0x124/0x150
    [35665.722849]  ? set_kthread_struct+0x50/0x50
    [35665.723182]  ret_from_fork+0x1f/0x30
    
    Reported-by: Zhang Guanghui <[email protected]>
    Signed-off-by: Sagi Grimberg <[email protected]>
    Reviewed-by: Chaitanya Kulkarni <[email protected]>
    Signed-off-by: Keith Busch <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

objtool, media: dib8000: Prevent divide-by-zero in dib8000_set_dds() [+ + +]

Author: Josh Poimboeuf <[email protected]>
Date:   Mon Mar 24 14:56:06 2025 -0700

    objtool, media: dib8000: Prevent divide-by-zero in dib8000_set_dds()
    
    [ Upstream commit e63d465f59011dede0a0f1d21718b59a64c3ff5c ]
    
    If dib8000_set_dds()'s call to dib8000_read32() returns zero, the result
    is a divide-by-zero.  Prevent that from happening.
    
    Fixes the following warning with a UBSAN kernel:
    
      drivers/media/dvb-frontends/dib8000.o: warning: objtool: dib8000_tune() falls through to next function dib8096p_cfg_DibRx()
    
    Fixes: 173a64cb3fcf ("[media] dib8000: enhancement")
    Reported-by: kernel test robot <[email protected]>
    Signed-off-by: Josh Poimboeuf <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Mauro Carvalho Chehab <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/bd1d504d930ae3f073b1e071bcf62cae7708773c.1742852847.git.jpoimboe@kernel.org
    Closes: https://lore.kernel.org/r/[email protected]/
    Signed-off-by: Sasha Levin <[email protected]>

ocfs2: validate l_tree_depth to avoid out-of-bounds access [+ + +]

Author: Vasiliy Kovalev <[email protected]>
Date:   Fri Feb 14 11:49:08 2025 +0300

    ocfs2: validate l_tree_depth to avoid out-of-bounds access
    
    [ Upstream commit a406aff8c05115119127c962cbbbbd202e1973ef ]
    
    The l_tree_depth field is 16-bit (__le16), but the actual maximum depth is
    limited to OCFS2_MAX_PATH_DEPTH.
    
    Add a check to prevent out-of-bounds access if l_tree_depth has an invalid
    value, which may occur when reading from a corrupted mounted disk [1].
    
    Link: https://lkml.kernel.org/r/[email protected]
    Fixes: ccd979bdbce9 ("[PATCH] OCFS2: The Second Oracle Cluster Filesystem")
    Signed-off-by: Vasiliy Kovalev <[email protected]>
    Reported-by: [email protected]
    Closes: https://syzkaller.appspot.com/bug?extid=66c146268dc88f4341fd [1]
    Reviewed-by: Joseph Qi <[email protected]>
    Cc: Joel Becker <[email protected]>
    Cc: Junxiao Bi <[email protected]>
    Cc: Changwei Ge <[email protected]>
    Cc: Jun Piao <[email protected]>
    Cc: Kurt Hackel <[email protected]>
    Cc: Mark Fasheh <[email protected]>
    Cc: Vasiliy Kovalev <[email protected]>
    Signed-off-by: Andrew Morton <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

octeontx2-af: Fix mbox INTR handler when num VFs > 64 [+ + +]

Author: Geetha sowjanya <[email protected]>
Date:   Thu Mar 27 14:44:41 2025 +0530

    octeontx2-af: Fix mbox INTR handler when num VFs > 64
    
    [ Upstream commit 0fdba88a211508984eb5df62008c29688692b134 ]
    
    When the number of RVU VFs is greater than 64, the vfs value passed to the
    "rvu_queue_work" function is incorrect. As a result, mbox workqueue
    entries for VFs 0 to 63 never get added to the workqueue.
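    
    The commit message does not show the diff, so the following is only a
    sketch of how a bug of this shape can arise when per-VF interrupt bits
    are split across two 64-bit registers. All names are hypothetical. The
    point is that once the upper register is handled, the count for the
    lower register must be capped at 64 rather than reduced by 64.

```c
#include <assert.h>
#include <stdint.h>

#define BITS_PER_REG 64

/* Count set interrupt bits among the first 'count' bits of one register. */
static int count_pending(uint64_t intr, int count)
{
	int pending = 0;

	assert(count >= 0);
	for (int i = 0; i < count && i < BITS_PER_REG; i++)
		if (intr & (1ULL << i))
			pending++;
	return pending;
}

/* Hypothetical: pending VFs across a low and a high register. */
static int pending_total(uint64_t intr_lo, uint64_t intr_hi, int num_vfs)
{
	int total = 0;

	if (num_vfs > BITS_PER_REG) {
		/* VFs 64 and above live in the high register. */
		total += count_pending(intr_hi, num_vfs - BITS_PER_REG);
		/* The low register still covers a full 64 VFs. */
		num_vfs = BITS_PER_REG;
	}
	total += count_pending(intr_lo, num_vfs);
	return total;
}
```

    Writing "num_vfs -= BITS_PER_REG" in place of the cap would make the
    second call scan only the first few bits of the low register.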
    
    Fixes: 9bdc47a6e328 ("octeontx2-af: Mbox communication support btw AF and it's VFs")
    Signed-off-by: Geetha sowjanya <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

octeontx2-af: Free NIX_AF_INT_VEC_GEN irq [+ + +]

Author: Geetha sowjanya <[email protected]>
Date:   Thu Mar 27 15:10:54 2025 +0530

    octeontx2-af: Free NIX_AF_INT_VEC_GEN irq
    
    [ Upstream commit 323d6db6dc7decb06f2545efb9496259ddacd4f4 ]
    
    Due to the incorrect initial vector number in
    rvu_nix_unregister_interrupts(), NIX_AF_INT_VEC_GEN is not
    getting freed. Fix the vector number to include the
    NIX_AF_INT_VEC_GEN irq.
    
    Fixes: 5ed66306eab6 ("octeontx2-af: Add devlink health reporters for NIX")
    Signed-off-by: Geetha sowjanya <[email protected]>
    Reviewed-by: Simon Horman <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

of: property: Increase NR_FWNODE_REFERENCE_ARGS [+ + +]

Author: Zijun Hu <[email protected]>
Date:   Tue Feb 25 21:58:06 2025 +0800

    of: property: Increase NR_FWNODE_REFERENCE_ARGS
    
    [ Upstream commit eb50844d728f11e87491f7c7af15a4a737f1159d ]
    
    Currently, the following two macros have different values:
    
    // The maximal argument count for firmware node reference
     #define NR_FWNODE_REFERENCE_ARGS       8
    // The maximal argument count for DT node reference
     #define MAX_PHANDLE_ARGS 16
    
    This can push the firmware node reference's argument count out of range
    when the DT node reference's argument count is assigned to it directly.
    
    drivers/of/property.c:of_fwnode_get_reference_args() does this direct
    assignment, so the firmware argument count @args->nargs can end up out
    of range, namely in [9, 16].
    
    Fix by increasing NR_FWNODE_REFERENCE_ARGS to 16 to meet DT requirement.
    Will align both macros later to avoid such inconsistency.
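    
    A sketch of why the two limits have to agree. The structs below are
    simplified stand-ins, not the kernel definitions, and copy_args() is a
    hypothetical helper; only the two macro names come from the text above.

```c
#include <assert.h>
#include <stdint.h>

#define MAX_PHANDLE_ARGS		16
#define NR_FWNODE_REFERENCE_ARGS	16	/* was 8 before this change */

/* Simplified stand-ins for the DT and firmware node argument structs. */
struct dt_args {
	unsigned int nargs;
	uint32_t args[MAX_PHANDLE_ARGS];
};

struct fwnode_args {
	unsigned int nargs;
	uint64_t args[NR_FWNODE_REFERENCE_ARGS];
};

/* Fails the build if the destination could be smaller than the source. */
_Static_assert(NR_FWNODE_REFERENCE_ARGS >= MAX_PHANDLE_ARGS,
	       "fwnode args array must hold every DT argument");

/* Direct copy: safe only because the capacities are known to match. */
static void copy_args(const struct dt_args *src, struct fwnode_args *dst)
{
	assert(src->nargs <= MAX_PHANDLE_ARGS);
	dst->nargs = src->nargs;
	for (unsigned int i = 0; i < src->nargs; i++)
		dst->args[i] = src->args[i];
}
```

    With the old value of 8, the static assertion would fail at build time
    instead of the copy overrunning the array at run time.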
    
    Fixes: 3e3119d3088f ("device property: Introduce fwnode_property_get_reference_args")
    Signed-off-by: Zijun Hu <[email protected]>
    Acked-by: Sakari Ailus <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Rob Herring (Arm) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI/ASPM: Fix link state exit during switch upstream function removal [+ + +]

Author: Daniel Stodden <[email protected]>
Date:   Sun Dec 22 19:39:08 2024 -0800

    PCI/ASPM: Fix link state exit during switch upstream function removal
    
    [ Upstream commit cbf937dcadfd571a434f8074d057b32cd14fbea5 ]
    
    Before 456d8aa37d0f ("PCI/ASPM: Disable ASPM on MFD function removal to
    avoid use-after-free"), we would free the ASPM link only after the last
    function on the bus pertaining to the given link was removed.
    
    That was too late. If function 0 is removed before a sibling function,
    link->downstream would point to freed memory afterwards.
    
    After above change, we freed the ASPM parent link state upon any function
    removal on the bus pertaining to a given link.
    
    That is too early. If the link is to a PCIe switch with MFD on the upstream
    port, then removing functions other than 0 first would free a link which
    still remains parent_link to the remaining downstream ports.
    
    The resulting GPFs are especially frequent during hot-unplug, because
    pciehp removes devices on the link bus in reverse order.
    
    On that switch, function 0 is the virtual P2P bridge to the internal bus.
    Free exactly when function 0 is removed -- before the parent link is
    obsolete, but after all subordinate links are gone.
    
    Link: https://lore.kernel.org/r/e12898835f25234561c9d7de4435590d957b85d9.1734924854.git.dns@arista.com
    Fixes: 456d8aa37d0f ("PCI/ASPM: Disable ASPM on MFD function removal to avoid use-after-free")
    Signed-off-by: Daniel Stodden <[email protected]>
    Signed-off-by: Bjorn Helgaas <[email protected]>
    [kwilczynski: commit log]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI/portdrv: Only disable pciehp interrupts early when needed [+ + +]

Author: Feng Tang <[email protected]>
Date:   Mon Mar 3 10:36:30 2025 +0800

    PCI/portdrv: Only disable pciehp interrupts early when needed
    
    [ Upstream commit 9d7db4db19827380e225914618c0c1bf435ed2f5 ]
    
    Firmware developers reported that Linux issues two PCIe hotplug commands in
    very short intervals on an ARM server, which doesn't comply with the PCIe
    spec.  According to PCIe r6.1, sec 6.7.3.2, if the Command Completed event
    is supported, software must wait for a command to complete or wait at
    least 1 second before sending a new command.
    
    In the failure case, the first PCIe hotplug command is from
    get_port_device_capability(), which sends a command to disable PCIe hotplug
    interrupts without waiting for its completion, and the second command comes
    from pcie_enable_notification() of pciehp driver, which enables hotplug
    interrupts again.
    
    Fix this by only disabling the hotplug interrupts when the pciehp driver is
    not enabled.
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: 2bd50dd800b5 ("PCI: PCIe: Disable PCIe port services during port initialization")
    Suggested-by: Lukas Wunner <[email protected]>
    Signed-off-by: Feng Tang <[email protected]>
    [bhelgaas: commit log]
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Reviewed-by: Lukas Wunner <[email protected]>
    Reviewed-by: Kuppuswamy Sathyanarayanan <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: Avoid reset when disabled via sysfs [+ + +]

Author: Nishanth Aravamudan <[email protected]>
Date:   Fri Feb 7 14:56:00 2025 -0600

    PCI: Avoid reset when disabled via sysfs
    
    [ Upstream commit 479380efe1625e251008d24b2810283db60d6fcd ]
    
    After d88f521da3ef ("PCI: Allow userspace to query and set device reset
    mechanism"), userspace can disable reset of specific PCI devices by writing
    an empty string to the sysfs reset_method file.
    
    However, pci_slot_resettable() does not check pci_reset_supported(), which
    means that pci_reset_function() will still reset the device even if
    userspace has disabled all the reset methods.
    
    I was able to reproduce this issue with a vfio device passed to a qemu
    guest, where I had disabled PCI reset via sysfs.
    
    Add an explicit check of pci_reset_supported() in both
    pci_slot_resettable() and pci_bus_resettable() to ensure both the reset
    status and reset execution are bypassed if an administrator disables it for
    a device.
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: d88f521da3ef ("PCI: Allow userspace to query and set device reset mechanism")
    Signed-off-by: Nishanth Aravamudan <[email protected]>
    [bhelgaas: commit log]
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Cc: Alex Williamson <[email protected]>
    Cc: Raphael Norwitz <[email protected]>
    Cc: Amey Narkhede <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: Yishai Hadas <[email protected]>
    Cc: Shameer Kolothum <[email protected]>
    Cc: Kevin Tian <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: brcmstb: Fix error path after a call to regulator_bulk_get() [+ + +]

Author: Jim Quinlan <[email protected]>
Date:   Fri Feb 14 12:39:32 2025 -0500

    PCI: brcmstb: Fix error path after a call to regulator_bulk_get()
    
    [ Upstream commit 3651ad5249c51cf7eee078e12612557040a6bdb4 ]
    
    If regulator_bulk_get() returns an error and no regulators
    are created, we need to set their number to zero.
    
    If we don't do this and the PCIe link up fails, a call to
    regulator_bulk_free() will result in a kernel panic.
    
    While at it, print the error value, as we cannot return an error
    upwards as the kernel will WARN() on an error from add_bus().
    
    Fixes: 9e6be018b263 ("PCI: brcmstb: Enable child bus device regulators from DT")
    Signed-off-by: Jim Quinlan <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [kwilczynski: commit log, use comma in the message to match style with
    other similar messages]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: brcmstb: Fix potential premature regulator disabling [+ + +]

Author: Jim Quinlan <[email protected]>
Date:   Fri Feb 14 12:39:33 2025 -0500

    PCI: brcmstb: Fix potential premature regulator disabling
    
    [ Upstream commit b7de1b60ecab2f7b6f05d8116e93228a0bbb8563 ]
    
    The platform supports enabling and disabling regulators only on
    ports below the Root Complex.
    
    Thus, we need to verify this both when adding and removing the bus,
    otherwise regulators may be disabled prematurely when a bus further
    down the topology is removed.
    
    Fixes: 9e6be018b263 ("PCI: brcmstb: Enable child bus device regulators from DT")
    Signed-off-by: Jim Quinlan <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Reviewed-by: Manivannan Sadhasivam <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [kwilczynski: commit log]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: brcmstb: Use internal register to change link capability [+ + +]

Author: Jim Quinlan <[email protected]>
Date:   Fri Feb 14 12:39:30 2025 -0500

    PCI: brcmstb: Use internal register to change link capability
    
    [ Upstream commit 0c97321e11e0e9e18546f828492758f6aaecec59 ]
    
    The driver has been mistakenly writing to a read-only (RO)
    configuration space register (PCI_EXP_LNKCAP) to change the
    PCIe link capability.
    
    Although harmless in this case, the proper write destination
    is an internal register that is reflected by PCI_EXP_LNKCAP.
    
    Thus, fix the brcm_pcie_set_gen() function to correctly update
    the link capability.
    
    Fixes: c0452137034b ("PCI: brcmstb: Add Broadcom STB PCIe host controller driver")
    Signed-off-by: Jim Quinlan <[email protected]>
    Reviewed-by: Florian Fainelli <[email protected]>
    Reviewed-by: Manivannan Sadhasivam <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [kwilczynski: commit log]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: cadence-ep: Fix the driver to send MSG TLP for INTx without data payload [+ + +]

Author: Hans Zhang <[email protected]>
Date:   Sat Feb 15 00:57:24 2025 +0800

    PCI: cadence-ep: Fix the driver to send MSG TLP for INTx without data payload
    
    [ Upstream commit 3ac47fbf4f6e8c3a7c3855fac68cc3246f90f850 ]
    
    Per the Cadence's "PCIe Controller IP for AX14" user guide, Version
    1.04, Section 9.1.7.1, "AXI Subordinate to PCIe Address Translation
    Registers", Table 9.4, the bit 16 of the AXI Subordinate Address
    (axi_s_awaddr) when set corresponds to MSG with data, and when not set,
    to MSG without data.
    
    However, the driver is currently doing the opposite and due to this,
    the INTx is never received on the host.
    
    So, fix the driver to reflect the documentation and also make INTx work.
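    
    The bit semantics described above can be sketched as follows. The macro
    and function names are hypothetical, not the driver's; only the meaning
    of bit 16 is taken from the text.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Bit 16 of the AXI address: set means "MSG with data". */
#define MSG_WITH_DATA	(UINT32_C(1) << 16)

/* Build the address flags for a message TLP. */
static uint32_t msg_flags(uint32_t base, bool has_data)
{
	if (has_data)
		return base | MSG_WITH_DATA;
	return base & ~MSG_WITH_DATA;	/* INTx carries no payload */
}
```

    An INTx assert or deassert message has no data payload, so it would be
    built with has_data set to false and bit 16 left clear.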
    
    Fixes: 37dddf14f1ae ("PCI: cadence: Add EndPoint Controller driver for Cadence PCIe controller")
    Signed-off-by: Hans Zhang <[email protected]>
    Signed-off-by: Hans Zhang <[email protected]>
    Reviewed-by: Manivannan Sadhasivam <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    [kwilczynski: commit log]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: pciehp: Don't enable HPIE when resuming in poll mode [+ + +]

Author: Ilpo Järvinen <[email protected]>
Date:   Fri Mar 21 18:21:14 2025 +0200

    PCI: pciehp: Don't enable HPIE when resuming in poll mode
    
    [ Upstream commit 527664f738afb6f2c58022cd35e63801e5dc7aec ]
    
    PCIe hotplug can operate in poll mode without interrupt handlers using a
    polling kthread only.  eb34da60edee ("PCI: pciehp: Disable hotplug
    interrupt during suspend") failed to consider that and enables HPIE
    (Hot-Plug Interrupt Enable) unconditionally when resuming the Port.
    
    Only set HPIE if non-poll mode is in use. This makes
    pcie_enable_interrupt() match how pcie_enable_notification() already
    handles HPIE.
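    
    A minimal sketch of the conditional described above, not the pciehp
    code itself. resume_enable_mask() and base_mask are hypothetical; the
    HPIE value is bit 5 of the Slot Control register.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define SLTCTL_HPIE	0x0020	/* Hot-Plug Interrupt Enable, bit 5 */

/*
 * Compute the Slot Control bits to enable on resume. base_mask holds
 * whatever event-enable bits are wanted regardless of mode.
 */
static uint16_t resume_enable_mask(uint16_t base_mask, bool poll_mode)
{
	uint16_t mask = base_mask;

	if (!poll_mode)
		mask |= SLTCTL_HPIE;	/* interrupts only when not polling */
	return mask;
}
```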
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: eb34da60edee ("PCI: pciehp: Disable hotplug interrupt during suspend")
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Reviewed-by: Lukas Wunner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: Remove stray put_device() in pci_register_host_bridge() [+ + +]

Author: Dan Carpenter <[email protected]>
Date:   Fri Mar 7 11:46:34 2025 +0300

    PCI: Remove stray put_device() in pci_register_host_bridge()
    
    [ Upstream commit 6e8d06e5096c80cbf41313b4a204f43071ca42be ]
    
    This put_device() was accidentally left over from when we changed the code
    from using device_register() to calling device_add().  Delete it.
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: 9885440b16b8 ("PCI: Fix pci_host_bridge struct device release/free handling")
    Signed-off-by: Dan Carpenter <[email protected]>
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: Use downstream bridges for distributing resources [+ + +]

Author: Kai-Heng Feng <[email protected]>
Date:   Wed Dec 4 10:24:57 2024 +0800

    PCI: Use downstream bridges for distributing resources
    
    [ Upstream commit 1a596ad00ffe9b37fc60a93cbdd4daead3bf95f3 ]
    
    7180c1d08639 ("PCI: Distribute available resources for root buses, too")
    breaks BAR assignment on some devices:
    
      pci 0006:03:00.0: BAR 0 [mem 0x6300c0000000-0x6300c1ffffff 64bit pref]: assigned
      pci 0006:03:00.1: BAR 0 [mem 0x6300c2000000-0x6300c3ffffff 64bit pref]: assigned
      pci 0006:03:00.2: BAR 0 [mem size 0x00800000 64bit pref]: can't assign; no space
      pci 0006:03:00.0: VF BAR 0 [mem size 0x02000000 64bit pref]: can't assign; no space
      pci 0006:03:00.1: VF BAR 0 [mem size 0x02000000 64bit pref]: can't assign; no space
    
    The apertures of domain 0006 before 7180c1d08639:
    
      6300c0000000-63ffffffffff : PCI Bus 0006:00
        6300c0000000-6300c9ffffff : PCI Bus 0006:01
          6300c0000000-6300c9ffffff : PCI Bus 0006:02        # 160MB
            6300c0000000-6300c8ffffff : PCI Bus 0006:03      #   144MB
              6300c0000000-6300c1ffffff : 0006:03:00.0       #     32MB
              6300c2000000-6300c3ffffff : 0006:03:00.1       #     32MB
              6300c4000000-6300c47fffff : 0006:03:00.2       #      8MB
              6300c4800000-6300c67fffff : 0006:03:00.0       #     32MB
              6300c6800000-6300c87fffff : 0006:03:00.1       #     32MB
            6300c9000000-6300c9bfffff : PCI Bus 0006:04      #    12MB
              6300c9000000-6300c9bfffff : PCI Bus 0006:05    #    12MB
                6300c9000000-6300c91fffff : PCI Bus 0006:06  #      2MB
                6300c9200000-6300c93fffff : PCI Bus 0006:07  #      2MB
                6300c9400000-6300c95fffff : PCI Bus 0006:08  #      2MB
                6300c9600000-6300c97fffff : PCI Bus 0006:09  #      2MB
    
    After 7180c1d08639:
    
      6300c0000000-63ffffffffff : PCI Bus 0006:00
        6300c0000000-6300c9ffffff : PCI Bus 0006:01
          6300c0000000-6300c9ffffff : PCI Bus 0006:02        # 160MB
            6300c0000000-6300c43fffff : PCI Bus 0006:03      #    68MB
              6300c0000000-6300c1ffffff : 0006:03:00.0       #      32MB
              6300c2000000-6300c3ffffff : 0006:03:00.1       #      32MB
                  --- no space ---      : 0006:03:00.2       #       8MB
                  --- no space ---      : 0006:03:00.0       #      32MB
                  --- no space ---      : 0006:03:00.1       #      32MB
            6300c4400000-6300c4dfffff : PCI Bus 0006:04      #    10MB
              6300c4400000-6300c4dfffff : PCI Bus 0006:05    #      10MB
                6300c4400000-6300c45fffff : PCI Bus 0006:06  #        2MB
                6300c4600000-6300c47fffff : PCI Bus 0006:07  #        2MB
                6300c4800000-6300c49fffff : PCI Bus 0006:08  #        2MB
                6300c4a00000-6300c4bfffff : PCI Bus 0006:09  #        2MB
    
    We can see that the window to 0006:03 is shrunk too much and 0006:04
    eats away the window for 0006:03:00.2.
    
    The offending commit distributes the upstream bridge's resources multiple
    times to every downstream bridge, hence makes the aperture smaller than
    desired because calculation of io_per_b, mmio_per_b and mmio_pref_per_b
    becomes incorrect.
    
    Instead, distribute downstream bridges' own resources to resolve the issue.
    
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: 7180c1d08639 ("PCI: Distribute available resources for root buses, too")
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=219540
    Signed-off-by: Kai-Heng Feng <[email protected]>
    Signed-off-by: Bjorn Helgaas <[email protected]>
    Tested-by: Chia-Lin Kao (AceLan) <[email protected]>
    Reviewed-by: Mika Westerberg <[email protected]>
    Cc: Carol Soto <[email protected]>
    Cc: Jonathan Cameron <[email protected]>
    Cc: Chris Chiu <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

PCI: xilinx-cpm: Fix IRQ domain leak in error path of probe [+ + +]

Author: Thippeswamy Havalige <[email protected]>
Date:   Mon Feb 24 21:20:22 2025 +0530

    PCI: xilinx-cpm: Fix IRQ domain leak in error path of probe
    
    [ Upstream commit 57b0302240741e73fe51f88404b3866e0d2933ad ]
    
    The IRQ domain allocated for the PCIe controller is not freed if
    resource_list_first_type() returns NULL, leading to a resource leak.
    
    This fix ensures that the allocated IRQ domain is properly cleaned up
    in the error path.
    
    Fixes: 49e427e6bdd1 ("Merge branch 'pci/host-probe-refactor'")
    Signed-off-by: Thippeswamy Havalige <[email protected]>
    [kwilczynski: added missing Fixes: tag, refactored to use one of the goto labels]
    Signed-off-by: Krzysztof Wilczyński <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

perf evlist: Add success path to evlist__create_syswide_maps [+ + +]

Author: Ian Rogers <[email protected]>
Date:   Fri Feb 28 14:22:59 2025 -0800

    perf evlist: Add success path to evlist__create_syswide_maps
    
    [ Upstream commit fe0ce8a9d85a48642880c9b78944cb0d23e779c5 ]
    
    Over various refactorings evlist__create_syswide_maps has been made to
    only ever return with -ENOMEM. Fix this so that when
    perf_evlist__set_maps is successfully called, 0 is returned.
    
    Reviewed-by: Howard Chu <[email protected]>
    Signed-off-by: Ian Rogers <[email protected]>
    Reviewed-by: Arnaldo Carvalho de Melo <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Fixes: 8c0498b6891d7ca5 ("perf evlist: Fix create_syswide_maps() not propagating maps")
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf python: Check if there is space to copy all the event [+ + +]

Author: Arnaldo Carvalho de Melo <[email protected]>
Date:   Wed Mar 12 17:31:41 2025 -0300

    perf python: Check if there is space to copy all the event
    
    [ Upstream commit 89aaeaf84231157288035b366cb6300c1c6cac64 ]
    
    The pyrf_event__new() method copies the event obtained from the perf
    ring buffer to a structure that will then be turned into a python object
    for further consumption, so it copies perf_event.header.size bytes to
    its 'event' member:
    
      $ pahole -C pyrf_event /tmp/build/perf-tools-next/python/perf.cpython-312-x86_64-linux-gnu.so
      struct pyrf_event {
            PyObject                   ob_base;              /*     0    16 */
            struct evsel *             evsel;                /*    16     8 */
            struct perf_sample         sample;               /*    24   312 */
    
            /* XXX last struct has 7 bytes of padding, 2 holes */
    
            /* --- cacheline 5 boundary (320 bytes) was 16 bytes ago --- */
            union perf_event           event;                /*   336  4168 */
    
            /* size: 4504, cachelines: 71, members: 4 */
            /* member types with holes: 1, total: 2 */
            /* paddings: 1, sum paddings: 7 */
            /* last cacheline: 24 bytes */
      };
    
      $
    
    It was doing so without checking if the event just obtained has more
    than that space. Fix it.
    
    This isn't a proper, final solution, as we need to support larger
    events, but for the time being we at least bounds check and document it.
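    
    A sketch of the bounds check, assuming simplified stand-ins for the
    perf types: struct event_header, union event_buf and copy_event() are
    hypothetical, and 4168 is the union size from the pahole output above.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>
#include <string.h>

/* Simplified stand-in for the perf event header. */
struct event_header {
	uint32_t type;
	uint16_t misc;
	uint16_t size;	/* total record size in bytes, header included */
};

/* Fixed-size destination, like the 'event' member of pyrf_event. */
union event_buf {
	struct event_header header;
	unsigned char raw[4168];
};

/* Copy one record into dst only if the whole record fits. */
static bool copy_event(union event_buf *dst, const void *src)
{
	struct event_header hdr;

	memcpy(&hdr, src, sizeof(hdr));
	if (hdr.size < sizeof(hdr) || hdr.size > sizeof(*dst))
		return false;	/* malformed or too large for the buffer */
	memcpy(dst, src, hdr.size);
	return true;
}
```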
    
    Fixes: 877108e42b1b9ba6 ("perf tools: Initial python binding")
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Reviewed-by: Ian Rogers <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf python: Decrement the refcount of just created event on failure [+ + +]

Author: Arnaldo Carvalho de Melo <[email protected]>
Date:   Wed Mar 12 17:31:39 2025 -0300

    perf python: Decrement the refcount of just created event on failure
    
    [ Upstream commit 3de5a2bf5b4847f7a59a184568f969f8fe05d57f ]
    
    To avoid a leak if we have the python object but then something happens
    and we need to abort the operation, decrement the refcount of the newly
    created object.
    
    Fixes: 377f698db12150a1 ("perf python: Add struct evsel into struct pyrf_event")
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Reviewed-by: Ian Rogers <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf python: Don't keep a raw_data pointer to consumed ring buffer space [+ + +]

Author: Arnaldo Carvalho de Melo <[email protected]>
Date:   Wed Mar 12 17:31:40 2025 -0300

    perf python: Don't keep a raw_data pointer to consumed ring buffer space
    
    [ Upstream commit f3fed3ae34d606819d87a63d970cc3092a5be7ab ]
    
    When processing tracepoints the perf python binding was parsing the
    event before calling perf_mmap__consume(&md->core) in
    pyrf_evlist__read_on_cpu().
    
    But part of this event parsing was to set the perf_sample->raw_data
    pointer to the payload of the event, which then could be overwritten by
    another event before tracepoint fields were asked for via event.prev_comm
    in a python program, for instance.
    
    This also happened with other fields, but strings were where problems
    were surfacing, as there is UTF-8 validation for the potentially garbled
    data.
    
    This ended up showing up as (with some added debugging messages):
    
      ( field 'prev_comm' ret=0x7f7c31f65110, raw_size=68 )  ( field 'prev_pid' ret=0x7f7c23b1bed0, raw_size=68 )  ( field 'prev_prio' ret=0x7f7c239c0030, raw_size=68 )  ( field 'prev_state' ret=0x7f7c239c0250, raw_size=68 ) time 14771421785867 prev_comm= prev_pid=1919907691 prev_prio=796026219 prev_state=0x303a32313175 ==>
      ( XXX '��' len=16, raw_size=68)  ( field 'next_comm' ret=(nil), raw_size=68 ) Traceback (most recent call last):
       File "/home/acme/git/perf-tools-next/tools/perf/python/tracepoint.py", line 51, in <module>
         main()
       File "/home/acme/git/perf-tools-next/tools/perf/python/tracepoint.py", line 46, in main
         event.next_comm,
         ^^^^^^^^^^^^^^^
      AttributeError: 'perf.sample_event' object has no attribute 'next_comm'
    
    When event.next_comm was asked for, the PyUnicode_FromString() python
    API would fail and that tracepoint field wouldn't be available, stopping
    the tools/perf/python/tracepoint.py test tool.
    
    But, since we already do a copy of the whole event in pyrf_event__new,
    just use it and while at it remove what was done in e8968e654191390a
    ("perf python: Fix pyrf_evlist__read_on_cpu event consuming") because we
    don't really need to wait for parsing the sample before declaring the
    event as consumed.
    
    This copy is questionable as is now, as it limits the maximum event +
    sample_type and tracepoint payload to sizeof(union perf_event), this all
    has been "working" because 'struct perf_event_mmap2', the largest entry
    in 'union perf_event' is:
    
      $ pahole -C perf_event ~/bin/perf | grep mmap2
            struct perf_record_mmap2   mmap2;              /*     0  4168 */
      $
    
    Fixes: bae57e3825a3dded ("perf python: Add support to resolve tracepoint fields")
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Reviewed-by: Ian Rogers <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf python: Fixup description of sample.id event member [+ + +]

Author: Arnaldo Carvalho de Melo <[email protected]>
Date:   Wed Mar 12 17:31:36 2025 -0300

    perf python: Fixup description of sample.id event member
    
    [ Upstream commit 1376c195e8ad327bb9f2d32e0acc5ac39e7cb30a ]
    
    Some old cut'n'paste error, it's "ip", so the description should be
    "event ip", not "event type".
    
    Fixes: 877108e42b1b9ba6 ("perf tools: Initial python binding")
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Reviewed-by: Ian Rogers <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf tools: annotate asm_pure_loop.S [+ + +]

Author: Marcus Meissner <[email protected]>
Date:   Sun Mar 23 09:53:45 2025 +0100

    perf tools: annotate asm_pure_loop.S
    
    [ Upstream commit 9a352a90e88a041f4b26d359493e12a7f5ae1a6a ]
    
    Annotate so it is built with non-executable stack.
    
    Fixes: 8b97519711c3 ("perf test: Add asm pureloop test tool")
    Signed-off-by: Marcus Meissner <[email protected]>
    Reviewed-by: Leo Yan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf units: Fix insufficient array space [+ + +]

Author: Arnaldo Carvalho de Melo <[email protected]>
Date:   Mon Mar 10 16:45:32 2025 -0300

    perf units: Fix insufficient array space
    
    [ Upstream commit cf67629f7f637fb988228abdb3aae46d0c1748fe ]
    
    No need to specify the array size, let the compiler figure that out.
    
    This addresses this compiler warning that was noticed while build
    testing on fedora rawhide:
    
      31    15.81 fedora:rawhide                : FAIL gcc version 15.0.1 20250225 (Red Hat 15.0.1-0) (GCC)
        util/units.c: In function 'unit_number__scnprintf':
        util/units.c:67:24: error: initializer-string for array of 'char' is too long [-Werror=unterminated-string-initialization]
           67 |         char unit[4] = "BKMG";
              |                        ^~~~~~
        cc1: all warnings being treated as errors
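    
    A sketch of the unsized declaration in use. format_units() is a
    hypothetical formatter written for this illustration, not the perf
    function; only the "BKMG" initializer comes from the warning above.

```c
#include <assert.h>
#include <inttypes.h>
#include <stdint.h>
#include <stdio.h>

/*
 * char unit[4] = "BKMG" leaves no room for the terminating NUL.
 * Omitting the size lets the compiler allocate all 5 bytes.
 */
static const char unit[] = "BKMG";

/* Hypothetical formatter: scale n by 1024 and append the unit letter. */
static int format_units(char *buf, size_t size, uint64_t n)
{
	size_t i = 0;

	/* sizeof(unit) - 2 is the index of the last letter, 'G'. */
	while (n >= 1024 && i < sizeof(unit) - 2) {
		n /= 1024;
		i++;
	}
	return snprintf(buf, size, "%" PRIu64 "%c", n, unit[i]);
}
```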
    
    Fixes: 9808143ba2e54818 ("perf tools: Add unit_number__scnprintf function")
    Signed-off-by: Arnaldo Carvalho de Melo <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Namhyung Kim <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

perf/ring_buffer: Allow the EPOLLRDNORM flag for poll [+ + +]

Author: Tao Chen <[email protected]>
Date:   Fri Mar 14 11:00:36 2025 +0800

    perf/ring_buffer: Allow the EPOLLRDNORM flag for poll
    
    [ Upstream commit c96fff391c095c11dc87dab35be72dee7d217cde ]
    
    The poll man page says POLLRDNORM is equivalent to POLLIN. For poll(),
    it seems that if the user sets pollfd with POLLRDNORM in userspace,
    perf_poll will not return until timeout even if perf_output_wakeup() is
    called, whereas POLLIN returns.
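    
    For comparison, a small userspace sketch of the behaviour a caller
    would expect. It uses a pipe rather than a perf event fd, and
    pipe_reports() is a hypothetical helper. On Linux a readable pipe is
    expected to report both POLLIN and POLLRDNORM.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <poll.h>
#include <stdbool.h>
#include <unistd.h>

/*
 * Make a pipe readable, then poll its read end for the requested events.
 * Returns true if poll() reported any of them without blocking.
 */
static bool pipe_reports(short events)
{
	int fds[2];
	struct pollfd pfd;
	bool ready;

	if (pipe(fds) != 0)
		return false;
	if (write(fds[1], "x", 1) != 1) {
		close(fds[0]);
		close(fds[1]);
		return false;
	}
	pfd.fd = fds[0];
	pfd.events = events;
	pfd.revents = 0;
	/* Timeout 0: report the current state, never block. */
	ready = poll(&pfd, 1, 0) == 1 && (pfd.revents & events) != 0;
	close(fds[0]);
	close(fds[1]);
	return ready;
}
```

    A program polling a perf ring buffer with POLLRDNORM would expect the
    same outcome as it gets with POLLIN.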
    
    Fixes: 76369139ceb9 ("perf: Split up buffer handling from core code")
    Signed-off-by: Tao Chen <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Arnaldo Carvalho de Melo <[email protected]>
    Cc: "H. Peter Anvin" <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

perf/x86/intel: Apply static call for drain_pebs [+ + +]

Author: Peter Zijlstra (Intel) <[email protected]>
Date:   Tue Jan 21 07:23:00 2025 -0800

    perf/x86/intel: Apply static call for drain_pebs
    
    commit 314dfe10576912e1d786b13c5d4eee8c51b63caa upstream.
    
    The x86_pmu_drain_pebs static call was introduced in commit 7c9903c9bf71
    ("x86/perf, static_call: Optimize x86_pmu methods"), but it's not really
    used to replace the old method.
    
    Apply the static call for drain_pebs.
    
    Fixes: 7c9903c9bf71 ("x86/perf, static_call: Optimize x86_pmu methods")
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Kan Liang <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Cc: [email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

perf/x86/intel: Avoid disable PMU if !cpuc->enabled in sample read [+ + +]

Author: Kan Liang <[email protected]>
Date:   Tue Jan 21 07:23:01 2025 -0800

    perf/x86/intel: Avoid disable PMU if !cpuc->enabled in sample read
    
    commit f9bdf1f953392c9edd69a7f884f78c0390127029 upstream.
    
    The WARN_ON(this_cpu_read(cpu_hw_events.enabled)) in the
    intel_pmu_save_and_restart_reload() is triggered, when sampling read
    topdown events.
    
    In an NMI handler, the cpu_hw_events.enabled is set and used to indicate
    the status of core PMU. The generic pmu->pmu_disable_count, updated in
    the perf_pmu_disable/enable pair, is not touched.
    However, the perf_pmu_disable/enable pair is invoked when sampling read
    in an NMI handler. The cpuc->enabled is mistakenly set by the
    perf_pmu_enable().
    
    Avoid disabling PMU if the core PMU is already disabled.
    Merge the logic together.
    
    Fixes: 7b2c05a15d29 ("perf/x86/intel: Generic support for hardware TopDown metrics")
    Suggested-by: Peter Zijlstra (Intel) <[email protected]>
    Signed-off-by: Kan Liang <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Cc: [email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

pinctrl: renesas: rza2: Fix missing of_node_put() call [+ + +]

Author: Fabrizio Castro <[email protected]>
Date:   Wed Mar 5 16:37:53 2025 +0000

    pinctrl: renesas: rza2: Fix missing of_node_put() call
    
    [ Upstream commit abcdeb4e299a11ecb5a3ea0cce00e68e8f540375 ]
    
    of_parse_phandle_with_fixed_args() requires its caller to
    call into of_node_put() on the node pointer from the output
    structure, but such a call is currently missing.
    
    Call into of_node_put() to rectify that.
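    
    The reference-count balance described above can be sketched as a small
    userspace model. All names here (node_model, model_parse_phandle(),
    model_node_put(), model_lookup()) are hypothetical stand-ins, not the
    kernel API; in driver code the put would be an of_node_put() on the
    node pointer held in the output structure.

```c
/* Hypothetical model of a reference-counted device tree node. */
struct node_model {
	int refcount;
};

/* Hypothetical model of the output structure filled in by the parser. */
struct phandle_args_model {
	struct node_model *np;
};

/* Models the parser: on success it takes a reference on the target node. */
int model_parse_phandle(struct node_model *target, struct phandle_args_model *out)
{
	target->refcount++;
	out->np = target;
	return 0;
}

/* Models the put: drops the reference taken by the parser. */
void model_node_put(struct node_model *np)
{
	if (np)
		np->refcount--;
}

/* Caller pattern: every successful parse is paired with exactly one put. */
int model_lookup(struct node_model *target)
{
	struct phandle_args_model args;
	int ret = model_parse_phandle(target, &args);

	if (ret)
		return ret;

	/* ... use the parsed arguments here ... */

	model_node_put(args.np);
	return 0;
}
```

    In this model the node's count is unchanged after model_lookup()
    returns; without the put, each call would leak one reference.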
    
    Fixes: b59d0e782706 ("pinctrl: Add RZ/A2 pin and gpio controller")
    Signed-off-by: Fabrizio Castro <[email protected]>
    Reviewed-by: Lad Prabhakar <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

pinctrl: renesas: rzg2l: Fix missing of_node_put() call [+ + +]

Author: Fabrizio Castro <[email protected]>
Date:   Wed Mar 5 16:37:51 2025 +0000

    pinctrl: renesas: rzg2l: Fix missing of_node_put() call
    
    [ Upstream commit a5779e625e2b377f16a6675c432aaf299ce5028c ]
    
    of_parse_phandle_with_fixed_args() requires its caller to
    call into of_node_put() on the node pointer from the output
    structure, but such a call is currently missing.
    
    Call into of_node_put() to rectify that.
    
    Fixes: c4c4637eb57f ("pinctrl: renesas: Add RZ/G2L pin and gpio controller driver")
    Signed-off-by: Fabrizio Castro <[email protected]>
    Reviewed-by: Lad Prabhakar <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

pinctrl: renesas: rzv2m: Fix missing of_node_put() call [+ + +]

Author: Fabrizio Castro <[email protected]>
Date:   Wed Mar 5 16:37:52 2025 +0000

    pinctrl: renesas: rzv2m: Fix missing of_node_put() call
    
    [ Upstream commit 5a550b00704d3a2cd9d766a9427b0f8166da37df ]
    
    of_parse_phandle_with_fixed_args() requires its caller to
    call into of_node_put() on the node pointer from the output
    structure, but such a call is currently missing.
    
    Call into of_node_put() to rectify that.
    
    Fixes: 92a9b8252576 ("pinctrl: renesas: Add RZ/V2M pin and gpio controller driver")
    Signed-off-by: Fabrizio Castro <[email protected]>
    Reviewed-by: Lad Prabhakar <[email protected]>
    Reviewed-by: Geert Uytterhoeven <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Geert Uytterhoeven <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

pinctrl: tegra: Set SFIO mode to Mux Register [+ + +]

Author: Prathamesh Shete <[email protected]>
Date:   Thu Mar 6 10:35:42 2025 +0530

    pinctrl: tegra: Set SFIO mode to Mux Register
    
    [ Upstream commit 17013f0acb322e5052ff9b9d0fab0ab5a4bfd828 ]
    
    Tegra devices have an 'sfsel' bit field that determines whether a pin
    operates in SFIO (Special Function I/O) or GPIO mode. Currently,
    tegra_pinctrl_gpio_disable_free() sets this bit when releasing a GPIO.
    
    However, tegra_pinctrl_set_mux() can be called independently in certain
    code paths where gpio_disable_free() is not invoked. In such cases, failing
    to set the SFIO mode could lead to incorrect pin configurations, resulting
    in functional issues for peripherals relying on SFIO.
    
    This patch ensures that whenever set_mux() is called, the SFIO mode is
    correctly set in the Mux Register if the 'sfsel' bit is present. This
    prevents situations where the pin remains in GPIO mode despite being
    configured for SFIO use.
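    
    A minimal sketch of the register update described above, written as a
    pure function. The function name, the 2-bit width of the function
    field, and the convention that a negative sfsel_bit means "not present"
    are assumptions of this sketch, not taken from the driver.

```c
#include <stdint.h>

/*
 * Illustrative model only: computes the new mux register value.
 * The 2-bit function field and "sfsel_bit < 0 means absent" are assumptions.
 */
uint32_t mux_reg_value(uint32_t val, int mux_bit, uint32_t func, int sfsel_bit)
{
	/* Select the requested function in the mux field. */
	val &= ~((uint32_t)0x3 << mux_bit);
	val |= (func & 0x3) << mux_bit;

	/* Also switch the pin to SFIO mode when the register has an sfsel bit. */
	if (sfsel_bit >= 0)
		val |= (uint32_t)1 << sfsel_bit;

	return val;
}
```

    The point of the sketch is that the SFIO selection is written in the
    same update as the function selection, so it does not depend on a
    separate GPIO release path having run first.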
    
    Fixes: 971dac7123c7 ("pinctrl: add a driver for NVIDIA Tegra")
    Signed-off-by: Prathamesh Shete <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Linus Walleij <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86: intel-hid: fix volume buttons on Microsoft Surface Go 4 tablet [+ + +]

Author: Dmitry Panchenko <[email protected]>
Date:   Thu Feb 20 17:39:31 2025 +0200

    platform/x86: intel-hid: fix volume buttons on Microsoft Surface Go 4 tablet
    
    [ Upstream commit 2738d06fb4f01145b24c542fb06de538ffc56430 ]
    
    Volume buttons on Microsoft Surface Go 4 tablet didn't send any events.
    Add Surface Go 4 DMI match to button_array_table to fix this.
    
    Signed-off-by: Dmitry Panchenko <[email protected]>
    Reviewed-by: Hans de Goede <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

platform/x86: ISST: Correct command storage data length [+ + +]

Author: Srinivas Pandruvada <[email protected]>
Date:   Fri Mar 28 15:47:49 2025 -0700

    platform/x86: ISST: Correct command storage data length
    
    commit 9462e74c5c983cce34019bfb27f734552bebe59f upstream.
    
    After resume/online, the turbo limit ratio (TRL) is only partially
    restored if the admin explicitly changed TRL from user space.
    
    A hash table is used to store SST mailbox and MSR settings when modified,
    so that those settings can be restored after resume or online. It uses
    the struct isst_cmd field "data" to store these settings. This is a
    64-bit field, but isst_store_new_cmd() only takes the value as a u32,
    which truncates the upper 32 bits.
    
    Change the argument to u64 from u32.
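    
    The effect of the narrower parameter can be illustrated with a small
    standalone sketch. The struct and function names below are hypothetical
    stand-ins for the stored command and the store helper; only the
    parameter width matters.

```c
#include <stdint.h>

/* Hypothetical stand-in for the stored command; only "data" matters here. */
struct cmd_record {
	uint64_t data;
};

/* Pre-fix shape: the 32-bit parameter drops the upper half before storing. */
void store_cmd_u32(struct cmd_record *rec, uint32_t data)
{
	rec->data = data;
}

/* Post-fix shape: the parameter is as wide as the field it fills. */
void store_cmd_u64(struct cmd_record *rec, uint64_t data)
{
	rec->data = data;
}
```

    Passing a 64-bit value such as 0x1122334455667788 through the u32
    variant stores only 0x55667788, while the u64 variant stores it whole.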
    
    Fixes: f607874f35cb ("platform/x86: ISST: Restore state on resume")
    Signed-off-by: Srinivas Pandruvada <[email protected]>
    Cc: [email protected]
    Link: https://lore.kernel.org/r/[email protected]
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

PM: sleep: Adjust check before setting power.must_resume [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Thu Feb 27 11:53:50 2025 +0100

    PM: sleep: Adjust check before setting power.must_resume
    
    [ Upstream commit eeb87d17aceab7803a5a5bcb6cf2817b745157cf ]
    
    The check before setting power.must_resume in device_suspend_noirq()
    does not take power.child_count into account, but it should do that, so
    use pm_runtime_need_not_resume() in it for this purpose and adjust the
    comment next to it accordingly.
    
    Fixes: 107d47b2b95e ("PM: sleep: core: Simplify the SMART_SUSPEND flag handling")
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Reviewed-by: Ulf Hansson <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

PM: sleep: Fix handling devices with direct_complete set on errors [+ + +]

Author: Rafael J. Wysocki <[email protected]>
Date:   Thu Mar 13 17:00:00 2025 +0100

    PM: sleep: Fix handling devices with direct_complete set on errors
    
    [ Upstream commit 03f1444016b71feffa1dfb8a51f15ba592f94b13 ]
    
    When dpm_suspend() fails, some devices with power.direct_complete set
    may not have been handled by device_suspend() yet, so runtime PM has
    not been disabled for them yet even though power.direct_complete is set.
    
    Since device_resume() expects that runtime PM has been disabled for all
    devices with power.direct_complete set, it will attempt to reenable
    runtime PM for the devices that have not been processed by device_suspend()
    which does not make sense.  Had those devices had runtime PM disabled
    before device_suspend() had run, device_resume() would have inadvertently
    enabled runtime PM for them, but this is not expected to happen because
    it would require ->prepare() callbacks to return positive values for
    devices with runtime PM disabled, which would be invalid.
    
    In practice, this issue is most likely benign because pm_runtime_enable()
    will not allow the "disable depth" counter to underflow, but it causes a
    warning message to be printed for each affected device.
    
    To allow device_resume() to distinguish the "direct complete" devices
    that have been processed by device_suspend() from those which have not
    been handled by it, make device_suspend() set power.is_suspended for
    "direct complete" devices.
    
    Next, move the power.is_suspended check in device_resume() before the
    power.direct_complete check in it to make it skip the "direct complete"
    devices that have not been handled by device_suspend().
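    
    The ordering of the two checks can be sketched with a reduced userspace
    model. The struct and the model_device_*() functions are hypothetical
    and keep only the three pieces of state mentioned above; the model also
    assumes the device is already runtime-suspended when it is handled.

```c
#include <stdbool.h>

/* Hypothetical, reduced model of the per-device PM state involved. */
struct dev_pm_model {
	bool direct_complete;
	bool is_suspended;
	int disable_depth;	/* > 0 means runtime PM is disabled */
};

/* Suspend step for a "direct complete" device: disable runtime PM, mark it. */
void model_device_suspend(struct dev_pm_model *dev)
{
	if (dev->direct_complete) {
		dev->disable_depth++;
		dev->is_suspended = true;	/* the marker added by the fix */
	}
}

/* Resume step: the is_suspended check comes before the direct_complete one. */
void model_device_resume(struct dev_pm_model *dev)
{
	if (!dev->is_suspended)
		return;		/* never handled by suspend, nothing to undo */

	if (dev->direct_complete)
		dev->disable_depth--;	/* re-enable runtime PM */
}
```

    A device that was never passed to model_device_suspend() keeps its
    disable depth untouched on resume, which is the case the reordered
    check is meant to cover.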
    
    This change is based on a preliminary patch from Saravana Kannan.
    
    Fixes: aae4518b3124 ("PM / sleep: Mechanism to avoid resuming runtime-suspended devices unnecessarily")
    Link: https://lore.kernel.org/linux-pm/[email protected]/
    Reported-by: Saravana Kannan <[email protected]>
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Reviewed-by: Saravana Kannan <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

power: supply: max77693: Fix wrong conversion of charge input threshold value [+ + +]

Author: Artur Weber <[email protected]>
Date:   Sun Mar 16 21:11:49 2025 +0100

    power: supply: max77693: Fix wrong conversion of charge input threshold value
    
    [ Upstream commit 30cc7b0d0e9341d419eb7da15fb5c22406dbe499 ]
    
    The charge input threshold voltage register on the MAX77693 PMIC accepts
    four values: 0x0 for 4.3v, 0x1 for 4.7v, 0x2 for 4.8v and 0x3 for 4.9v.
    Due to an oversight, the driver calculated the values for 4.7v and above
    starting from 0x0, rather than from 0x1 ([(4700000 - 4700000) / 100000]
    gives 0).
    
    Add 1 to the calculation to ensure that 4.7v is converted to a register
    value of 0x1 and that the other two voltages are converted correctly as
    well.
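    
    As a sketch of the corrected mapping, assuming the threshold is given
    in microvolts: the function name and the -1 error value are
    illustrative and not taken from the driver.

```c
/*
 * Sketch of the threshold-to-register mapping described above.
 * Valid inputs are 4300000, 4700000, 4800000 and 4900000 microvolts;
 * the function name and error value are illustrative.
 */
int charge_input_threshold_to_reg(unsigned int uvolts)
{
	switch (uvolts) {
	case 4300000:
		return 0x0;
	case 4700000:
	case 4800000:
	case 4900000:
		/* 4.7 V must map to 0x1, hence the "+ 1". */
		return (int)((uvolts - 4700000) / 100000) + 1;
	default:
		return -1;	/* unsupported threshold */
	}
}
```

    Without the "+ 1", 4.7 V would yield 0x0 and collide with the register
    value reserved for 4.3 V.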
    
    Fixes: 87c2d9067893 ("power: max77693: Add charger driver for Maxim 77693")
    Signed-off-by: Artur Weber <[email protected]>
    Reviewed-by: Krzysztof Kozlowski <[email protected]>
    Link: https://lore.kernel.org/r/20250316-max77693-charger-input-threshold-fix-v1-1-2b037d0ac722@gmail.com
    Signed-off-by: Sebastian Reichel <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

rcu-tasks: Always inline rcu_irq_work_resched() [+ + +]

Author: Josh Poimboeuf <[email protected]>
Date:   Mon Mar 31 21:26:46 2025 -0700

    rcu-tasks: Always inline rcu_irq_work_resched()
    
    [ Upstream commit 6309a5c43b0dc629851f25b2e5ef8beff61d08e5 ]
    
    Thanks to CONFIG_DEBUG_SECTION_MISMATCH, empty functions can be
    generated out of line.  rcu_irq_work_resched() can be called from
    noinstr code, so make sure it's always inlined.
    
    Fixes: 564506495ca9 ("rcu/context-tracking: Move deferred nocb resched to context tracking")
    Reported-by: Randy Dunlap <[email protected]>
    Signed-off-by: Josh Poimboeuf <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Frederic Weisbecker <[email protected]>
    Cc: Paul E. McKenney <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/e84f15f013c07e4c410d972e75620c53b62c1b3e.1743481539.git.jpoimboe@kernel.org
    Closes: https://lore.kernel.org/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/core: Don't expose hw_counters outside of init net namespace [+ + +]

Author: Roman Gushchin <[email protected]>
Date:   Thu Feb 27 16:54:20 2025 +0000

    RDMA/core: Don't expose hw_counters outside of init net namespace
    
    [ Upstream commit a1ecb30f90856b0be4168ad51b8875148e285c1f ]
    
    Commit 467f432a521a ("RDMA/core: Split port and device counter sysfs
    attributes") accidentally almost exposed hw counters to non-init net
    namespaces. It didn't expose them fully, as an attempt to read any of
    those counters leads to a crash like this one:
    
    [42021.807566] BUG: kernel NULL pointer dereference, address: 0000000000000028
    [42021.814463] #PF: supervisor read access in kernel mode
    [42021.819549] #PF: error_code(0x0000) - not-present page
    [42021.824636] PGD 0 P4D 0
    [42021.827145] Oops: 0000 [#1] SMP PTI
    [42021.830598] CPU: 82 PID: 2843922 Comm: switchto-defaul Kdump: loaded Tainted: G S      W I        XXX
    [42021.841697] Hardware name: XXX
    [42021.849619] RIP: 0010:hw_stat_device_show+0x1e/0x40 [ib_core]
    [42021.855362] Code: 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 0f 1f 44 00 00 49 89 d0 4c 8b 5e 20 48 8b 8f b8 04 00 00 48 81 c7 f0 fa ff ff <48> 8b 41 28 48 29 ce 48 83 c6 d0 48 c1 ee 04 69 d6 ab aa aa aa 48
    [42021.873931] RSP: 0018:ffff97fe90f03da0 EFLAGS: 00010287
    [42021.879108] RAX: ffff9406988a8c60 RBX: ffff940e1072d438 RCX: 0000000000000000
    [42021.886169] RDX: ffff94085f1aa000 RSI: ffff93c6cbbdbcb0 RDI: ffff940c7517aef0
    [42021.893230] RBP: ffff97fe90f03e70 R08: ffff94085f1aa000 R09: 0000000000000000
    [42021.900294] R10: ffff94085f1aa000 R11: ffffffffc0775680 R12: ffffffff87ca2530
    [42021.907355] R13: ffff940651602840 R14: ffff93c6cbbdbcb0 R15: ffff94085f1aa000
    [42021.914418] FS:  00007fda1a3b9700(0000) GS:ffff94453fb80000(0000) knlGS:0000000000000000
    [42021.922423] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [42021.928130] CR2: 0000000000000028 CR3: 00000042dcfb8003 CR4: 00000000003726f0
    [42021.935194] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    [42021.942257] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    [42021.949324] Call Trace:
    [42021.951756]  <TASK>
    [42021.953842]  [<ffffffff86c58674>] ? show_regs+0x64/0x70
    [42021.959030]  [<ffffffff86c58468>] ? __die+0x78/0xc0
    [42021.963874]  [<ffffffff86c9ef75>] ? page_fault_oops+0x2b5/0x3b0
    [42021.969749]  [<ffffffff87674b92>] ? exc_page_fault+0x1a2/0x3c0
    [42021.975549]  [<ffffffff87801326>] ? asm_exc_page_fault+0x26/0x30
    [42021.981517]  [<ffffffffc0775680>] ? __pfx_show_hw_stats+0x10/0x10 [ib_core]
    [42021.988482]  [<ffffffffc077564e>] ? hw_stat_device_show+0x1e/0x40 [ib_core]
    [42021.995438]  [<ffffffff86ac7f8e>] dev_attr_show+0x1e/0x50
    [42022.000803]  [<ffffffff86a3eeb1>] sysfs_kf_seq_show+0x81/0xe0
    [42022.006508]  [<ffffffff86a11134>] seq_read_iter+0xf4/0x410
    [42022.011954]  [<ffffffff869f4b2e>] vfs_read+0x16e/0x2f0
    [42022.017058]  [<ffffffff869f50ee>] ksys_read+0x6e/0xe0
    [42022.022073]  [<ffffffff8766f1ca>] do_syscall_64+0x6a/0xa0
    [42022.027441]  [<ffffffff8780013b>] entry_SYSCALL_64_after_hwframe+0x78/0xe2
    
    The problem can be reproduced using the following steps:
      ip netns add foo
      ip netns exec foo bash
      cat /sys/class/infiniband/mlx4_0/hw_counters/*
    
    The panic occurs because casting the device pointer to an ib_device
    pointer using container_of() in hw_stat_device_show() is wrong and
    leads to memory corruption.
    
    However, the real problem is that hw counters should never have been
    exposed outside of the init net namespace.
    
    Fix this by saving the index of the corresponding attribute group
    (it might be 1 or 2 depending on the presence of driver-specific
    attributes) and zeroing the pointer to hw_counters group for compat
    devices during the initialization.
    
    With this fix applied hw_counters are not available in a non-init
    net namespace:
      find /sys/class/infiniband/mlx4_0/ -name hw_counters
        /sys/class/infiniband/mlx4_0/ports/1/hw_counters
        /sys/class/infiniband/mlx4_0/ports/2/hw_counters
        /sys/class/infiniband/mlx4_0/hw_counters
    
      ip netns add foo
      ip netns exec foo bash
      find /sys/class/infiniband/mlx4_0/ -name hw_counters
    
    Fixes: 467f432a521a ("RDMA/core: Split port and device counter sysfs attributes")
    Signed-off-by: Roman Gushchin <[email protected]>
    Cc: Jason Gunthorpe <[email protected]>
    Cc: Leon Romanovsky <[email protected]>
    Cc: Maher Sanalla <[email protected]>
    Cc: [email protected]
    Cc: [email protected]
    Link: https://patch.msgid.link/[email protected]
    Reviewed-by: Parav Pandit <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/erdma: Prevent use-after-free in erdma_accept_newconn() [+ + +]

Author: Cheng Xu <[email protected]>
Date:   Thu Mar 6 20:04:40 2025 +0800

    RDMA/erdma: Prevent use-after-free in erdma_accept_newconn()
    
    [ Upstream commit 83437689249e6a17b25e27712fbee292e42e7855 ]
    
    After erdma_cep_put(new_cep) is called, new_cep is freed, and the
    following dereference causes a use-after-free (UAF). Fix this issue.
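    
    The general pattern can be sketched with a reference-counted userspace
    model. The exact lines changed in the driver are not shown in this
    changelog, so the names below (cep_model, cep_model_put(),
    cep_model_release()) are hypothetical and only illustrate the ordering
    rule: touch the object first, drop the reference last.

```c
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for a reference-counted connection endpoint. */
struct cep_model {
	int refcount;
	void *sock;
};

/* Drop one reference; returns 1 if the object was freed. */
int cep_model_put(struct cep_model *cep)
{
	if (--cep->refcount == 0) {
		free(cep);
		return 1;
	}
	return 0;
}

/* Finish every access to the object first, then drop the reference last. */
int cep_model_release(struct cep_model *cep)
{
	cep->sock = NULL;
	return cep_model_put(cep);
}
```

    Swapping the two statements in cep_model_release() would write to
    memory that the put may already have freed.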
    
    Fixes: 920d93eac8b9 ("RDMA/erdma: Add connection management (CM) support")
    Signed-off-by: Markus Elfring <[email protected]>
    Signed-off-by: Cheng Xu <[email protected]>
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/mlx5: Fix calculation of total invalidated pages [+ + +]

Author: Chiara Meiohas <[email protected]>
Date:   Thu Mar 13 16:29:54 2025 +0200

    RDMA/mlx5: Fix calculation of total invalidated pages
    
    [ Upstream commit 79195147644653ebffadece31a42181e4c48c07d ]
    
    When invalidating an address range in mlx5, there is an optimization to
    do UMR operations in chunks.
    Previously, the invalidation counter was incorrectly updated for the
    same indexes within a chunk. Now, the invalidation counter is updated
    only when a chunk is complete and mlx5r_umr_update_xlt() is called.
    This ensures that the counter accurately represents the number of pages
    invalidated using UMR.
    
    Fixes: a3de94e3d61e ("IB/mlx5: Introduce ODP diagnostic counters")
    Signed-off-by: Chiara Meiohas <[email protected]>
    Reviewed-by: Michael Guralnik <[email protected]>
    Link: https://patch.msgid.link/560deb2433318e5947282b070c915f3c81fef77f.1741875692.git.leon@kernel.org
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

RDMA/mlx5: Fix mlx5_poll_one() cur_qp update flow [+ + +]

Author: Patrisious Haddad <[email protected]>
Date:   Thu Mar 13 16:29:53 2025 +0200

    RDMA/mlx5: Fix mlx5_poll_one() cur_qp update flow
    
    [ Upstream commit 5ed3b0cb3f827072e93b4c5b6e2b8106fd7cccbd ]
    
    When cur_qp isn't NULL, in order to avoid fetching the QP from
    the radix tree again we check if the next cqe QP is identical to
    the one we already have.
    
    The bug however is that we are checking if the QP is identical by
    checking the QP number inside the CQE against the QP number inside the
    mlx5_ib_qp, but that's wrong since the QP number from the CQE is from
    FW so it should be matched against mlx5_core_qp which is our FW QP
    number.
    
    Otherwise we could use the wrong QP when handling a CQE which could
    cause the kernel trace below.
    
    This issue is mainly noticeable over QPs 0 & 1, since for now they are
    the only QPs in our driver for which the QP number inside mlx5_ib_qp
    doesn't match the QP number inside mlx5_core_qp.
    
    BUG: kernel NULL pointer dereference, address: 0000000000000012
     #PF: supervisor read access in kernel mode
     #PF: error_code(0x0000) - not-present page
     PGD 0 P4D 0
     Oops: Oops: 0000 [#1] SMP
     CPU: 0 UID: 0 PID: 7927 Comm: kworker/u62:1 Not tainted 6.14.0-rc3+ #189
     Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.16.3-0-ga6ed6b701f0a-prebuilt.qemu.org 04/01/2014
     Workqueue: ib-comp-unb-wq ib_cq_poll_work [ib_core]
     RIP: 0010:mlx5_ib_poll_cq+0x4c7/0xd90 [mlx5_ib]
     Code: 03 00 00 8d 58 ff 21 cb 66 39 d3 74 39 48 c7 c7 3c 89 6e a0 0f b7 db e8 b7 d2 b3 e0 49 8b 86 60 03 00 00 48 c7 c7 4a 89 6e a0 <0f> b7 5c 98 02 e8 9f d2 b3 e0 41 0f b7 86 78 03 00 00 83 e8 01 21
     RSP: 0018:ffff88810511bd60 EFLAGS: 00010046
     RAX: 0000000000000010 RBX: 0000000000000000 RCX: 0000000000000000
     RDX: 0000000000000000 RSI: ffff88885fa1b3c0 RDI: ffffffffa06e894a
     RBP: 00000000000000b0 R08: 0000000000000000 R09: ffff88810511bc10
     R10: 0000000000000001 R11: 0000000000000001 R12: ffff88810d593000
     R13: ffff88810e579108 R14: ffff888105146000 R15: 00000000000000b0
     FS:  0000000000000000(0000) GS:ffff88885fa00000(0000) knlGS:0000000000000000
     CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
     CR2: 0000000000000012 CR3: 00000001077e6001 CR4: 0000000000370eb0
     Call Trace:
      <TASK>
      ? __die+0x20/0x60
      ? page_fault_oops+0x150/0x3e0
      ? exc_page_fault+0x74/0x130
      ? asm_exc_page_fault+0x22/0x30
      ? mlx5_ib_poll_cq+0x4c7/0xd90 [mlx5_ib]
      __ib_process_cq+0x5a/0x150 [ib_core]
      ib_cq_poll_work+0x31/0x90 [ib_core]
      process_one_work+0x169/0x320
      worker_thread+0x288/0x3a0
      ? work_busy+0xb0/0xb0
      kthread+0xd7/0x1f0
      ? kthreads_online_cpu+0x130/0x130
      ? kthreads_online_cpu+0x130/0x130
      ret_from_fork+0x2d/0x50
      ? kthreads_online_cpu+0x130/0x130
      ret_from_fork_asm+0x11/0x20
      </TASK>
    
    Fixes: e126ba97dba9 ("mlx5: Add driver for Mellanox Connect-IB adapters")
    Signed-off-by: Patrisious Haddad <[email protected]>
    Reviewed-by: Edward Srouji <[email protected]>
    Link: https://patch.msgid.link/4ada09d41f1e36db62c44a9b25c209ea5f054316.1741875692.git.leon@kernel.org
    Signed-off-by: Leon Romanovsky <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

remoteproc: core: Clear table_sz when rproc_shutdown [+ + +]

Author: Peng Fan <[email protected]>
Date:   Wed Mar 19 18:01:05 2025 +0800

    remoteproc: core: Clear table_sz when rproc_shutdown
    
    [ Upstream commit efdde3d73ab25cef4ff2d06783b0aad8b093c0e4 ]
    
    The following case can trigger a kernel dump:
    Use U-Boot to start the remote processor (rproc) with the resource
    table published to a fixed address by rproc. After the kernel boots up,
    stop the rproc, load a new firmware which doesn't have a resource table,
    and start rproc.
    
    When starting rproc with a firmware that does not have a resource table,
    `memcpy(loaded_table, rproc->cached_table, rproc->table_sz)` will
    trigger a dump, because rproc->cached_table is set to NULL during the
    last stop operation, but rproc->table_sz still holds the old size.
    
    This issue is found on i.MX8MP and i.MX9.
    
    Dump as below:
    Unable to handle kernel NULL pointer dereference at virtual address 0000000000000000
    Mem abort info:
      ESR = 0x0000000096000004
      EC = 0x25: DABT (current EL), IL = 32 bits
      SET = 0, FnV = 0
      EA = 0, S1PTW = 0
      FSC = 0x04: level 0 translation fault
    Data abort info:
      ISV = 0, ISS = 0x00000004, ISS2 = 0x00000000
      CM = 0, WnR = 0, TnD = 0, TagAccess = 0
      GCS = 0, Overlay = 0, DirtyBit = 0, Xs = 0
    user pgtable: 4k pages, 48-bit VAs, pgdp=000000010af63000
    [0000000000000000] pgd=0000000000000000, p4d=0000000000000000
    Internal error: Oops: 0000000096000004 [#1] PREEMPT SMP
    Modules linked in:
    CPU: 2 UID: 0 PID: 1060 Comm: sh Not tainted 6.14.0-rc7-next-20250317-dirty #38
    Hardware name: NXP i.MX8MPlus EVK board (DT)
    pstate: a0000005 (NzCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
    pc : __pi_memcpy_generic+0x110/0x22c
    lr : rproc_start+0x88/0x1e0
    Call trace:
     __pi_memcpy_generic+0x110/0x22c (P)
     rproc_boot+0x198/0x57c
     state_store+0x40/0x104
     dev_attr_store+0x18/0x2c
     sysfs_kf_write+0x7c/0x94
     kernfs_fop_write_iter+0x120/0x1cc
     vfs_write+0x240/0x378
     ksys_write+0x70/0x108
     __arm64_sys_write+0x1c/0x28
     invoke_syscall+0x48/0x10c
     el0_svc_common.constprop.0+0xc0/0xe0
     do_el0_svc+0x1c/0x28
     el0_svc+0x30/0xcc
     el0t_64_sync_handler+0x10c/0x138
     el0t_64_sync+0x198/0x19c
    
    Clear rproc->table_sz to address the issue.
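    
    A reduced userspace model of the two fields shows why the size has to
    be cleared together with the pointer. The struct and the model_*()
    functions are hypothetical; the explicit size check in model_start()
    is only there to keep this standalone sketch well-defined.

```c
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, trimmed-down model of the two fields involved. */
struct rproc_model {
	void *cached_table;
	size_t table_sz;
};

/* Shutdown path: free the cached table and, as in the fix, clear its size. */
void model_shutdown(struct rproc_model *r)
{
	free(r->cached_table);
	r->cached_table = NULL;
	r->table_sz = 0;
}

/* Start path: copy the cached table into the loaded one; returns bytes copied. */
size_t model_start(struct rproc_model *r, void *loaded_table)
{
	if (r->table_sz)
		memcpy(loaded_table, r->cached_table, r->table_sz);
	return r->table_sz;
}
```

    If model_shutdown() left table_sz at its old value, the next
    model_start() would copy that many bytes from a NULL source.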
    
    Fixes: 9dc9507f1880 ("remoteproc: Properly deal with the resource table when detaching")
    Signed-off-by: Peng Fan <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

remoteproc: qcom_q6v5_mss: Handle platforms with one power domain [+ + +]

Author: Luca Weiss <[email protected]>
Date:   Mon Feb 17 23:05:18 2025 +0100

    remoteproc: qcom_q6v5_mss: Handle platforms with one power domain
    
    [ Upstream commit 4641840341f37dc8231e0840ec1514b4061b4322 ]
    
    For example, MSM8974 has the mx voltage rail exposed as a regulator and
    only the cx voltage rail exposed as a power domain. This power domain
    (cx) is attached internally by the power domain core and cannot be
    attached in this driver.
    
    Fixes: 8750cf392394 ("remoteproc: qcom_q6v5_mss: Allow replacing regulators with power domains")
    Co-developed-by: Matti Lehtimäki <[email protected]>
    Signed-off-by: Matti Lehtimäki <[email protected]>
    Reviewed-by: Stephan Gerhold <[email protected]>
    Signed-off-by: Luca Weiss <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

remoteproc: qcom_q6v5_pas: Make single-PD handling more robust [+ + +]

Author: Luca Weiss <[email protected]>
Date:   Tue Jan 28 22:54:00 2025 +0100

    remoteproc: qcom_q6v5_pas: Make single-PD handling more robust
    
    [ Upstream commit e917b73234b02aa4966325e7380d2559bf127ba9 ]
    
    Only go into the if condition for single-PD handling when there's
    actually just one power domain specified there. Otherwise it'll be an
    issue in the dts and we should fail in the regular code path.
    
    This also mirrors the latest changes in the qcom_q6v5_mss driver.
    
    Suggested-by: Stephan Gerhold <[email protected]>
    Fixes: 17ee2fb4e856 ("remoteproc: qcom: pas: Vote for active/proxy power domains")
    Signed-off-by: Luca Weiss <[email protected]>
    Reviewed-by: Stephan Gerhold <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Bjorn Andersson <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

ring-buffer: Fix bytes_dropped calculation issue [+ + +]

Author: Feng Yang <[email protected]>
Date:   Sun Feb 23 15:01:06 2025 +0800

    ring-buffer: Fix bytes_dropped calculation issue
    
    [ Upstream commit c73f0b69648501978e8b3e8fa7eef7f4197d0481 ]
    
    The calculation of bytes_dropped and bytes_dropped_nested is reversed.
    Although it does not affect the final calculation of total_dropped,
    it should still be corrected.
    
    Link: https://lore.kernel.org/[email protected]
    Fixes: 6c43e554a2a5 ("ring-buffer: Add ring buffer startup selftest")
    Signed-off-by: Feng Yang <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

riscv: ftrace: Add parentheses in macro definitions of make_call_t0 and make_call_ra [+ + +]

Author: Juhan Jin <[email protected]>
Date:   Thu Feb 6 13:28:36 2025 -0600

    riscv: ftrace: Add parentheses in macro definitions of make_call_t0 and make_call_ra
    
    [ Upstream commit 5f1a58ed91a040d4625d854f9bb3dd4995919202 ]
    
    This patch adds parentheses to parameters caller and callee of macros
    make_call_t0 and make_call_ra. Every existing invocation of these two
    macros uses a single variable for each argument, so the absence of the
    parentheses seems okay. However, future invocations might use more
    complex expressions as arguments. For example, a future invocation might
    look like this: make_call_t0(a - b, c, call). Without parentheses in the
    macro definition, the macro invocation expands to:
    
    ...
    unsigned int offset = (unsigned long) c - (unsigned long) a - b;
    ...
    
    which is clearly wrong.
    
    The use of parentheses ensures arguments are correctly evaluated and
    potentially saves future users of make_call_t0 and make_call_ra debugging
    trouble.
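    
    The precedence problem can be reproduced in isolation. The two macros
    below are simplified stand-ins that model only the offset computation,
    not the full make_call_t0/make_call_ra definitions; their names and the
    wrapper functions are illustrative.

```c
/* Simplified stand-ins: only the offset computation is modelled. */
#define OFFSET_UNPARENTHESIZED(caller, callee) \
	((unsigned long) callee - (unsigned long) caller)

#define OFFSET_PARENTHESIZED(caller, callee) \
	((unsigned long) (callee) - (unsigned long) (caller))

unsigned long offset_unparenthesized(unsigned long a, unsigned long b,
				     unsigned long c)
{
	/* Expands to (unsigned long) c - (unsigned long) a - b, i.e. c - a - b. */
	return OFFSET_UNPARENTHESIZED(a - b, c);
}

unsigned long offset_parenthesized(unsigned long a, unsigned long b,
				   unsigned long c)
{
	/* Expands to (unsigned long) (c) - (unsigned long) (a - b), i.e. c - a + b. */
	return OFFSET_PARENTHESIZED(a - b, c);
}
```

    With a = 100, b = 10 and c = 200, the unparenthesized form yields 90
    while the parenthesized form yields the intended 110.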
    
    Fixes: 6724a76cff85 ("riscv: ftrace: Reduce the detour code size to half")
    Signed-off-by: Juhan Jin <[email protected]>
    Reviewed-by: Alexandre Ghiti <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Alexandre Ghiti <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

rndis_host: Flag RNDIS modems as WWAN devices [+ + +]

Author: Lubomir Rintel <[email protected]>
Date:   Tue Mar 25 10:58:41 2025 +0100

    rndis_host: Flag RNDIS modems as WWAN devices
    
    [ Upstream commit 67d1a8956d2d62fe6b4c13ebabb57806098511d8 ]
    
    Set FLAG_WWAN instead of FLAG_ETHERNET for RNDIS interfaces on Mobile
    Broadband Modems, as opposed to regular Ethernet adapters.
    
    Otherwise NetworkManager gets confused, misjudges the device type,
    and doesn't know it should connect a modem to get the device to work.
    The result depends on the ModemManager version: an older
    ModemManager would end up disconnecting a device after an unsuccessful
    probe attempt (if it connected without needing to unlock a SIM), while
    a newer one might spawn a separate PPP connection over a tty interface
    instead, resulting in general confusion and no end of chaos.
    
    The only way to get this to work reliably is to fix the device type
    and have a good enough version of ModemManager (or equivalent).
    
    Fixes: 63ba395cd7a5 ("rndis_host: support Novatel Verizon USB730L")
    Signed-off-by: Lubomir Rintel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

rtnetlink: Allocate vfinfo size for VF GUIDs when supported [+ + +]

Author: Mark Zhang <[email protected]>
Date:   Tue Mar 25 11:02:26 2025 +0200

    rtnetlink: Allocate vfinfo size for VF GUIDs when supported
    
    [ Upstream commit 23f00807619d15063d676218f36c5dfeda1eb420 ]
    
    Commit 30aad41721e0 ("net/core: Add support for getting VF GUIDs")
    added support for getting VF port and node GUIDs in netlink ifinfo
    messages, but their size was not taken into consideration in the
    function that allocates the netlink message, causing the following
    warning when a netlink message is filled with many VF port and node
    GUIDs:
     # echo 64 > /sys/bus/pci/devices/0000\:08\:00.0/sriov_numvfs
     # ip link show dev ib0
     RTNETLINK answers: Message too long
     Cannot send link get request: Message too long
    
    Kernel warning:
    
     ------------[ cut here ]------------
     WARNING: CPU: 2 PID: 1930 at net/core/rtnetlink.c:4151 rtnl_getlink+0x586/0x5a0
     Modules linked in: xt_conntrack xt_MASQUERADE nfnetlink xt_addrtype iptable_nat nf_nat br_netfilter overlay mlx5_ib macsec mlx5_core tls rpcrdma rdma_ucm ib_uverbs ib_iser libiscsi scsi_transport_iscsi ib_umad rdma_cm iw_cm ib_ipoib fuse ib_cm ib_core
     CPU: 2 UID: 0 PID: 1930 Comm: ip Not tainted 6.14.0-rc2+ #1
     Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS rel-1.13.0-0-gf21b5a4aeb02-prebuilt.qemu.org 04/01/2014
     RIP: 0010:rtnl_getlink+0x586/0x5a0
     Code: cb 82 e8 3d af 0a 00 4d 85 ff 0f 84 08 ff ff ff 4c 89 ff 41 be ea ff ff ff e8 66 63 5b ff 49 c7 07 80 4f cb 82 e9 36 fc ff ff <0f> 0b e9 16 fe ff ff e8 de a0 56 00 66 66 2e 0f 1f 84 00 00 00 00
     RSP: 0018:ffff888113557348 EFLAGS: 00010246
     RAX: 00000000ffffffa6 RBX: ffff88817e87aa34 RCX: dffffc0000000000
     RDX: 0000000000000003 RSI: 0000000000000000 RDI: ffff88817e87afb8
     RBP: 0000000000000009 R08: ffffffff821f44aa R09: 0000000000000000
     R10: ffff8881260f79a8 R11: ffff88817e87af00 R12: ffff88817e87aa00
     R13: ffffffff8563d300 R14: 00000000ffffffa6 R15: 00000000ffffffff
     FS:  00007f63a5dbf280(0000) GS:ffff88881ee00000(0000) knlGS:0000000000000000
     CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
     CR2: 00007f63a5ba4493 CR3: 00000001700fe002 CR4: 0000000000772eb0
     DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
     DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
     PKRU: 55555554
     Call Trace:
      <TASK>
      ? __warn+0xa5/0x230
      ? rtnl_getlink+0x586/0x5a0
      ? report_bug+0x22d/0x240
      ? handle_bug+0x53/0xa0
      ? exc_invalid_op+0x14/0x50
      ? asm_exc_invalid_op+0x16/0x20
      ? skb_trim+0x6a/0x80
      ? rtnl_getlink+0x586/0x5a0
      ? __pfx_rtnl_getlink+0x10/0x10
      ? rtnetlink_rcv_msg+0x1e5/0x860
      ? __pfx___mutex_lock+0x10/0x10
      ? rcu_is_watching+0x34/0x60
      ? __pfx_lock_acquire+0x10/0x10
      ? stack_trace_save+0x90/0xd0
      ? filter_irq_stacks+0x1d/0x70
      ? kasan_save_stack+0x30/0x40
      ? kasan_save_stack+0x20/0x40
      ? kasan_save_track+0x10/0x30
      rtnetlink_rcv_msg+0x21c/0x860
      ? entry_SYSCALL_64_after_hwframe+0x76/0x7e
      ? __pfx_rtnetlink_rcv_msg+0x10/0x10
      ? arch_stack_walk+0x9e/0xf0
      ? rcu_is_watching+0x34/0x60
      ? lock_acquire+0xd5/0x410
      ? rcu_is_watching+0x34/0x60
      netlink_rcv_skb+0xe0/0x210
      ? __pfx_rtnetlink_rcv_msg+0x10/0x10
      ? __pfx_netlink_rcv_skb+0x10/0x10
      ? rcu_is_watching+0x34/0x60
      ? __pfx___netlink_lookup+0x10/0x10
      ? lock_release+0x62/0x200
      ? netlink_deliver_tap+0xfd/0x290
      ? rcu_is_watching+0x34/0x60
      ? lock_release+0x62/0x200
      ? netlink_deliver_tap+0x95/0x290
      netlink_unicast+0x31f/0x480
      ? __pfx_netlink_unicast+0x10/0x10
      ? rcu_is_watching+0x34/0x60
      ? lock_acquire+0xd5/0x410
      netlink_sendmsg+0x369/0x660
      ? lock_release+0x62/0x200
      ? __pfx_netlink_sendmsg+0x10/0x10
      ? import_ubuf+0xb9/0xf0
      ? __import_iovec+0x254/0x2b0
      ? lock_release+0x62/0x200
      ? __pfx_netlink_sendmsg+0x10/0x10
      ____sys_sendmsg+0x559/0x5a0
      ? __pfx_____sys_sendmsg+0x10/0x10
      ? __pfx_copy_msghdr_from_user+0x10/0x10
      ? rcu_is_watching+0x34/0x60
      ? do_read_fault+0x213/0x4a0
      ? rcu_is_watching+0x34/0x60
      ___sys_sendmsg+0xe4/0x150
      ? __pfx____sys_sendmsg+0x10/0x10
      ? do_fault+0x2cc/0x6f0
      ? handle_pte_fault+0x2e3/0x3d0
      ? __pfx_handle_pte_fault+0x10/0x10
      ? preempt_count_sub+0x14/0xc0
      ? __down_read_trylock+0x150/0x270
      ? __handle_mm_fault+0x404/0x8e0
      ? __pfx___handle_mm_fault+0x10/0x10
      ? lock_release+0x62/0x200
      ? __rcu_read_unlock+0x65/0x90
      ? rcu_is_watching+0x34/0x60
      __sys_sendmsg+0xd5/0x150
      ? __pfx___sys_sendmsg+0x10/0x10
      ? __up_read+0x192/0x480
      ? lock_release+0x62/0x200
      ? __rcu_read_unlock+0x65/0x90
      ? rcu_is_watching+0x34/0x60
      do_syscall_64+0x6d/0x140
      entry_SYSCALL_64_after_hwframe+0x76/0x7e
     RIP: 0033:0x7f63a5b13367
     Code: 0e 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b9 0f 1f 00 f3 0f 1e fa 64 8b 04 25 18 00 00 00 85 c0 75 10 b8 2e 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 51 c3 48 83 ec 28 89 54 24 1c 48 89 74 24 10
     RSP: 002b:00007fff8c726bc8 EFLAGS: 00000246 ORIG_RAX: 000000000000002e
     RAX: ffffffffffffffda RBX: 0000000067b687c2 RCX: 00007f63a5b13367
     RDX: 0000000000000000 RSI: 00007fff8c726c30 RDI: 0000000000000004
     RBP: 00007fff8c726cb8 R08: 0000000000000000 R09: 0000000000000034
     R10: 00007fff8c726c7c R11: 0000000000000246 R12: 0000000000000001
     R13: 0000000000000000 R14: 00007fff8c726cd0 R15: 00007fff8c726cd0
      </TASK>
     irq event stamp: 0
     hardirqs last  enabled at (0): [<0000000000000000>] 0x0
     hardirqs last disabled at (0): [<ffffffff813f9e58>] copy_process+0xd08/0x2830
     softirqs last  enabled at (0): [<ffffffff813f9e58>] copy_process+0xd08/0x2830
     softirqs last disabled at (0): [<0000000000000000>] 0x0
     ---[ end trace 0000000000000000 ]---
    
    Thus, when calculating ifinfo message size, take VF GUIDs sizes into
    account when supported.
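
    As a rough illustration of the accounting described above, the sketch
    below adds room for two GUID attributes (port and node) per VF, and
    only when the device reports GUID support. The helper names, the
    supports_guid flag and the payload layout are hypothetical stand-ins
    chosen for this example; this is not the kernel's rtnetlink code.

```c
#include <stddef.h>
#include <stdint.h>

/* Netlink attributes are padded to 4 bytes and carry a 4-byte header. */
#define ATTR_ALIGN(len)  (((len) + 3u) & ~(size_t)3u)
#define ATTR_HDRLEN      4u

/* Hypothetical payload: a VF index plus a 64-bit GUID. */
struct vf_guid {
	uint32_t vf;
	uint64_t guid;
};

/* Space that one attribute with `payload` bytes takes in the message. */
static size_t attr_total_size(size_t payload)
{
	return ATTR_ALIGN(ATTR_HDRLEN + payload);
}

/*
 * Per-VF size estimate. `base` stands for everything that was already
 * accounted for; the GUID attributes are added only when supported.
 */
static size_t vfinfo_size(unsigned int num_vfs, size_t base, int supports_guid)
{
	size_t per_vf = base;

	if (supports_guid)
		per_vf += 2 * attr_total_size(sizeof(struct vf_guid));

	return (size_t)num_vfs * per_vf;
}
```

    With 64 VFs, leaving out the two GUID attributes under-sizes the
    buffer by 64 * 2 * attr_total_size(sizeof(struct vf_guid)) bytes in
    this model, which is the kind of shortfall that makes the fill step
    run out of room.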
    
    Fixes: 30aad41721e0 ("net/core: Add support for getting VF GUIDs")
    Signed-off-by: Mark Zhang <[email protected]>
    Reviewed-by: Maher Sanalla <[email protected]>
    Signed-off-by: Mark Bloch <[email protected]>
    Reviewed-by: Sabrina Dubroca <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

sched/deadline: Use online cpus for validating runtime [+ + +]

Author: Shrikanth Hegde <[email protected]>
Date:   Thu Mar 6 10:59:53 2025 +0530

    sched/deadline: Use online cpus for validating runtime
    
    [ Upstream commit 14672f059d83f591afb2ee1fff56858efe055e5a ]
    
    The ftrace selftest reported a failure because writing -1 to
    sched_rt_runtime_us returns -EBUSY. This happens when the possible
    CPUs are different from active CPUs.
    
    Active CPUs are part of one root domain, while the remaining CPUs are
    part of def_root_domain. Since the active cpumask is being used, this
    results in cpus=0 when a non-active CPU is used in the loop.
    
    Fix it by looping over the online CPUs instead for validating the
    bandwidth calculations.
    
    Signed-off-by: Shrikanth Hegde <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Reviewed-by: Juri Lelli <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

sched/smt: Always inline sched_smt_active() [+ + +]

Author: Josh Poimboeuf <[email protected]>
Date:   Mon Mar 31 21:26:44 2025 -0700

    sched/smt: Always inline sched_smt_active()
    
    [ Upstream commit 09f37f2d7b21ff35b8b533f9ab8cfad2fe8f72f6 ]
    
    sched_smt_active() can be called from noinstr code, so it should always
    be inlined.  The CONFIG_SCHED_SMT version already has __always_inline.
    Do the same for its !CONFIG_SCHED_SMT counterpart.
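
    The pattern can be sketched in userspace as follows. The
    __always_inline definition is a stand-in for the kernel macro, and
    smt_present is a hypothetical flag used here in place of the real
    SMT state; only the shape of the two variants is the point.

```c
#include <stdbool.h>

/* Userspace stand-in for the kernel's __always_inline annotation. */
#ifndef __always_inline
#define __always_inline inline __attribute__((__always_inline__))
#endif

#ifdef CONFIG_SCHED_SMT
/* Hypothetical flag standing in for the real SMT-present state. */
static bool smt_present;

static __always_inline bool sched_smt_active(void)
{
	return smt_present;
}
#else
/* Stub for !CONFIG_SCHED_SMT: forced inline so no out-of-line call exists. */
static __always_inline bool sched_smt_active(void)
{
	return false;
}
#endif
```

    A plain "static inline" stub only hints at inlining, so the compiler
    may still emit a real call; forcing it removes that possibility for
    callers that live in a restricted section.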
    
    Fixes the following warning:
    
      vmlinux.o: error: objtool: intel_idle_ibrs+0x13: call to sched_smt_active() leaves .noinstr.text section
    
    Fixes: 321a874a7ef8 ("sched/smt: Expose sched_smt_present static key")
    Reported-by: kernel test robot <[email protected]>
    Signed-off-by: Josh Poimboeuf <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/1d03907b0a247cf7fb5c1d518de378864f603060.1743481539.git.jpoimboe@kernel.org
    Closes: https://lore.kernel.org/r/[email protected]/
    Signed-off-by: Sasha Levin <[email protected]>

selftests/bpf: Fix string read in strncmp benchmark [+ + +]

Author: Viktor Malik <[email protected]>
Date:   Thu Mar 13 13:28:52 2025 +0100

    selftests/bpf: Fix string read in strncmp benchmark
    
    [ Upstream commit de07b182899227d5fd1ca7a1a7d495ecd453d49c ]
    
    The strncmp benchmark uses the bpf_strncmp helper and a hand-written
    loop to compare two strings. The values of the strings are filled from
    userspace. One of the strings is non-const (in .bss) while the other is
    const (in .rodata) since that is the requirement of bpf_strncmp.
    
    The problem is that in the hand-written loop, Clang optimizes the reads
    from the const string to always return 0 which breaks the benchmark.
    
    Use barrier_var to prevent the optimization.
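
    A minimal userspace sketch of the idea, assuming barrier_var is an
    empty asm statement that marks its operand as read and modified (the
    definition below is written out for this example). The function name
    cmp_no_helper is hypothetical and the code does not reproduce the
    selftest itself.

```c
#include <stddef.h>

/*
 * Empty asm statement that claims to read and modify `var`, so the
 * compiler cannot assume it still knows the value afterwards.
 */
#define barrier_var(var) __asm__ __volatile__("" : "+r"(var))

/* Hand-written bounded compare; returns <0, 0 or >0 like strncmp(). */
static int cmp_no_helper(const char *a, const char *b, size_t n)
{
	const char *p = b;

	/* Hide where p points so loads through it are not folded away. */
	barrier_var(p);

	for (size_t i = 0; i < n; i++) {
		unsigned char ca = (unsigned char)a[i];
		unsigned char cb = (unsigned char)p[i];

		if (ca != cb)
			return ca < cb ? -1 : 1;
		if (ca == '\0')
			break;
	}
	return 0;
}
```

    The barrier is applied to the pointer rather than to the loaded
    byte: making an already-folded value opaque would not bring the
    real load back.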
    
    The effect can be seen on the strncmp-no-helper variant.
    
    Before this change:
    
        # ./bench strncmp-no-helper
        Setting up benchmark 'strncmp-no-helper'...
        Benchmark 'strncmp-no-helper' started.
        Iter   0 (112.309us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   1 (-23.238us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   2 ( 58.994us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   3 (-30.466us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   4 ( 29.996us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   5 ( 16.949us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Iter   6 (-60.035us): hits    0.000M/s (  0.000M/prod), drops    0.000M/s, total operations    0.000M/s
        Summary: hits    0.000 ± 0.000M/s (  0.000M/prod), drops    0.000 ± 0.000M/s, total operations    0.000 ± 0.000M/s
    
    After this change:
    
        # ./bench strncmp-no-helper
        Setting up benchmark 'strncmp-no-helper'...
        Benchmark 'strncmp-no-helper' started.
        Iter   0 ( 77.711us): hits    5.534M/s (  5.534M/prod), drops    0.000M/s, total operations    5.534M/s
        Iter   1 ( 11.215us): hits    6.006M/s (  6.006M/prod), drops    0.000M/s, total operations    6.006M/s
        Iter   2 (-14.253us): hits    5.931M/s (  5.931M/prod), drops    0.000M/s, total operations    5.931M/s
        Iter   3 ( 59.087us): hits    6.005M/s (  6.005M/prod), drops    0.000M/s, total operations    6.005M/s
        Iter   4 (-21.379us): hits    6.010M/s (  6.010M/prod), drops    0.000M/s, total operations    6.010M/s
        Iter   5 (-20.310us): hits    5.861M/s (  5.861M/prod), drops    0.000M/s, total operations    5.861M/s
        Iter   6 ( 53.937us): hits    6.004M/s (  6.004M/prod), drops    0.000M/s, total operations    6.004M/s
        Summary: hits    5.969 ± 0.061M/s (  5.969M/prod), drops    0.000 ± 0.000M/s, total operations    5.969 ± 0.061M/s
    
    Fixes: 9c42652f8be3 ("selftests/bpf: Add benchmark for bpf_strncmp() helper")
    Suggested-by: Andrii Nakryiko <[email protected]>
    Signed-off-by: Viktor Malik <[email protected]>
    Signed-off-by: Andrii Nakryiko <[email protected]>
    Acked-by: Hou Tao <[email protected]>
    Link: https://lore.kernel.org/bpf/[email protected]
    Signed-off-by: Alexei Starovoitov <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

selftests/bpf: Select NUMA_NO_NODE to create map [+ + +]

Author: Saket Kumar Bhaskar <[email protected]>
Date:   Fri Jan 31 12:35:22 2025 +0530

    selftests/bpf: Select NUMA_NO_NODE to create map
    
    [ Upstream commit 4107a1aeb20ed4cdad6a0d49de92ea0f933c71b7 ]
    
    On powerpc, a CPU does not necessarily originate from NUMA node 0.
    This contrasts with architectures like x86, where CPU 0 is not
    hot-pluggable, making NUMA node 0 a consistently valid node.
    This discrepancy can lead to failures when creating a map on NUMA
    node 0, which is initialized by default, if no CPUs are allocated
    from NUMA node 0.
    
    This patch fixes the issue by setting NUMA_NO_NODE (-1) for map
    creation for this selftest.
    
    Fixes: 96eabe7a40aa ("bpf: Allow selecting numa node during map creation")
    Signed-off-by: Saket Kumar Bhaskar <[email protected]>
    Signed-off-by: Andrii Nakryiko <[email protected]>
    Acked-by: Yonghong Song <[email protected]>
    Link: https://lore.kernel.org/bpf/cf1f61468b47425ecf3728689bc9636ddd1d910e.1738302337.git.skb99@linux.ibm.com
    Signed-off-by: Sasha Levin <[email protected]>

selinux: Chain up tool resolving errors in install_policy.sh [+ + +]

Author: Tim Schumacher <[email protected]>
Date:   Fri Mar 7 10:56:43 2025 +0100

    selinux: Chain up tool resolving errors in install_policy.sh
    
    [ Upstream commit 6ae0042f4d3f331e841495eb0a3d51598e593ec2 ]
    
    Subshell evaluations are not exempt from errexit, so if a command is
    not available, `which` will fail and exit the script as a whole.
    This causes the helpful error messages to not be printed if they are
    tacked on using a `$?` comparison.
    
    Resolve the issue by using chains of logical operators, which are not
    subject to the effects of errexit.
    
    Fixes: e37c1877ba5b1 ("scripts/selinux: modernize mdp")
    Signed-off-by: Tim Schumacher <[email protected]>
    Signed-off-by: Paul Moore <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

smack: dont compile ipv6 code unless ipv6 is configured [+ + +]

Author: Konstantin Andreev <[email protected]>
Date:   Fri Jan 17 19:36:42 2025 +0300

    smack: dont compile ipv6 code unless ipv6 is configured
    
    [ Upstream commit bfcf4004bcbce2cb674b4e8dbd31ce0891766bac ]
    
    I want to be sure that ipv6-specific code
    is not compiled in kernel binaries
    if ipv6 is not configured.
    
    [1] got rid of an "unused variable" warning, but in doing so
    it also mandated compilation of a handful of ipv6-specific
    functions in ipv4-only kernel configurations:
    
    smk_ipv6_localhost, smack_ipv6host_label, smk_ipv6_check.
    
    Their compiled bodies are likely to be removed by compiler
    from the resulting binary, but, to be on the safe side,
    I remove them from the compiler view.
    
    [1]
    Fixes: 00720f0e7f28 ("smack: avoid unused 'sip' variable warning")
    
    Signed-off-by: Konstantin Andreev <[email protected]>
    Signed-off-by: Casey Schaufler <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

soundwire: slave: fix an OF node reference leak in soundwire slave device [+ + +]

Author: Joe Hattori <[email protected]>
Date:   Thu Dec 5 12:48:44 2024 +0900

    soundwire: slave: fix an OF node reference leak in soundwire slave device
    
    [ Upstream commit aac2f8363f773ae1f65aab140e06e2084ac6b787 ]
    
    When initializing a soundwire slave device, an OF node is stored in the
    device with its refcount incremented. However, the refcount is not
    decremented in .release(), so call of_node_put() in
    sdw_slave_release().
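
    The get/put pairing can be modelled with a toy refcount. All types
    and helpers below (struct node, node_get(), node_put(), slave_init(),
    slave_release()) are hypothetical stand-ins, not the OF or soundwire
    APIs.

```c
#include <stddef.h>

/* Hypothetical stand-ins for a refcounted OF node and a device. */
struct node { int refcount; };
struct device { struct node *of_node; };

static struct node *node_get(struct node *n)
{
	if (n)
		n->refcount++;
	return n;
}

static void node_put(struct node *n)
{
	if (n)
		n->refcount--;
}

/* Init stores the node and takes a reference on it. */
static void slave_init(struct device *dev, struct node *n)
{
	dev->of_node = node_get(n);
}

/* Release must drop the reference taken at init, or the node leaks. */
static void slave_release(struct device *dev)
{
	node_put(dev->of_node);
	dev->of_node = NULL;
}
```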
    
    Fixes: a2e484585ad3 ("soundwire: core: add device tree support for slave devices")
    Signed-off-by: Joe Hattori <[email protected]>
    Reviewed-by: Krzysztof Kozlowski <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Vinod Koul <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

spufs: fix a leak in spufs_create_context() [+ + +]

Author: Al Viro <[email protected]>
Date:   Wed Mar 12 19:38:28 2025 -0400

    spufs: fix a leak in spufs_create_context()
    
    [ Upstream commit 0f5cce3fc55b08ee4da3372baccf4bcd36a98396 ]
    
    Leak fixes back in 2008 missed one case - if we are trying to set affinity
    and spufs_mkdir() fails, we need to drop the reference to neighbor.
    
    Fixes: 58119068cb27 "[POWERPC] spufs: Fix memory leak on SPU affinity"
    Signed-off-by: Al Viro <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

spufs: fix a leak on spufs_new_file() failure [+ + +]

Author: Al Viro <[email protected]>
Date:   Sat Mar 8 19:26:31 2025 -0500

    spufs: fix a leak on spufs_new_file() failure
    
    [ Upstream commit d1ca8698ca1332625d83ea0d753747be66f9906d ]
    
    It's called from spufs_fill_dir(), and caller of that will do
    spufs_rmdir() in case of failure.  That does remove everything
    we'd managed to create, but... the problem dentry is still
    negative.  IOW, it needs to be explicitly dropped.
    
    Fixes: 3f51dd91c807 "[PATCH] spufs: fix spufs_fill_dir error path"
    Signed-off-by: Al Viro <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

spufs: fix gang directory lifetimes [+ + +]

Author: Al Viro <[email protected]>
Date:   Wed Mar 12 19:18:39 2025 -0400

    spufs: fix gang directory lifetimes
    
    [ Upstream commit c134deabf4784e155d360744d4a6a835b9de4dd4 ]
    
    Prior to "[POWERPC] spufs: Fix gang destroy leaks" we used to have
    a problem with gang lifetimes - creation of a gang returns opened
    gang directory, which normally gets removed when that gets closed,
    but if somebody has created a context belonging to that gang and
    kept it alive until the gang got closed, removal failed and we
    ended up with a leak.
    
    Unfortunately, it had been fixed the wrong way.  Dentry of gang
    directory was no longer pinned, and rmdir on close was gone.
    One problem was that failure of open kept calling simple_rmdir()
    as cleanup, which meant an unbalanced dput().  Another bug was
    in the success case - gang creation incremented link count on
    root directory, but that was no longer undone when gang got
    destroyed.
    
    Fix consists of
            * reverting the commit in question
            * adding a counter to gang, protected by ->i_rwsem
              of gang directory inode.
            * having it set to 1 at creation time, dropped
              in both spufs_dir_close() and spufs_gang_close() and bumped
              in spufs_create_context(), provided that it's not 0.
            * using simple_recursive_removal() to take the gang
              directory out when counter reaches zero.
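
    The counter rules listed above can be sketched as a small state
    machine. struct gang, its fields and the gang_*() helpers are
    hypothetical names for this example; locking is left out, whereas
    the real counter is protected by ->i_rwsem.

```c
#include <stdbool.h>

/* Hypothetical gang object; `alive` mirrors the counter described above. */
struct gang {
	int alive;	/* would be protected by a lock in real code */
	bool removed;	/* set once the directory has been taken out */
};

/* Creation starts the counter at 1. */
static void gang_create(struct gang *g)
{
	g->alive = 1;
	g->removed = false;
}

/* Bump only while the gang is still alive; refuse otherwise. */
static bool gang_get(struct gang *g)
{
	if (g->alive == 0)
		return false;
	g->alive++;
	return true;
}

/* Drop one user; the last one takes the directory out. */
static void gang_put(struct gang *g)
{
	if (--g->alive == 0)
		g->removed = true;	/* stands in for the recursive removal */
}
```

    Refusing to bump a counter that has already reached zero is what
    keeps a new context from attaching to a gang whose directory is
    already gone.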
    
    Fixes: 877907d37da9 "[POWERPC] spufs: Fix gang destroy leaks"
    Signed-off-by: Al Viro <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

staging: rtl8723bs: select CONFIG_CRYPTO_LIB_AES [+ + +]

Author: 谢致邦 (XIE Zhibang) <[email protected]>
Date:   Sat Feb 22 19:36:17 2025 +0000

    staging: rtl8723bs: select CONFIG_CRYPTO_LIB_AES
    
    [ Upstream commit b2a9a6a26b7e954297e51822e396572026480bad ]
    
    This fixes the following issue:
    ERROR: modpost: "aes_expandkey" [drivers/staging/rtl8723bs/r8723bs.ko]
    undefined!
    ERROR: modpost: "aes_encrypt" [drivers/staging/rtl8723bs/r8723bs.ko]
    undefined!
    
    Fixes: 7d40753d8820 ("staging: rtl8723bs: use in-kernel aes encryption in OMAC1 routines")
    Fixes: 3d3a170f6d80 ("staging: rtl8723bs: use in-kernel aes encryption")
    Signed-off-by: 谢致邦 (XIE Zhibang) <[email protected]>
    Reviewed-by: Hans de Goede <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

thermal: int340x: Add NULL check for adev [+ + +]

Author: Chenyuan Yang <[email protected]>
Date:   Wed Mar 12 23:36:11 2025 -0500

    thermal: int340x: Add NULL check for adev
    
    [ Upstream commit 2542a3f70e563a9e70e7ded314286535a3321bdb ]
    
    Not all devices have an ACPI companion fwnode, so adev might be NULL.
    This is similar to the commit cd2fd6eab480
    ("platform/x86: int3472: Check for adev == NULL").
    
    Add a check for adev not being set and return -ENODEV in that case to
    avoid a possible NULL pointer deref in int3402_thermal_probe().
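
    The shape of such a check, as a userspace sketch: struct acpi_dev,
    struct plat_dev and thermal_probe() are hypothetical stand-ins and
    do not mirror the driver's actual types or probe body.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical stand-ins for the ACPI companion and the platform device. */
struct acpi_dev { int handle; };
struct plat_dev { struct acpi_dev *companion; };

/* Probe bails out early when there is no ACPI companion. */
static int thermal_probe(struct plat_dev *pdev)
{
	struct acpi_dev *adev = pdev->companion;

	if (!adev)
		return -ENODEV;

	/* Safe to dereference adev from here on. */
	return adev->handle >= 0 ? 0 : -EINVAL;
}
```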
    
    Note, under the same directory, int3400_thermal_probe() has such a
    check.
    
    Fixes: 77e337c6e23e ("Thermal: introduce INT3402 thermal driver")
    Signed-off-by: Chenyuan Yang <[email protected]>
    Acked-by: Uwe Kleine-König <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    [ rjw: Subject edit, added Fixes: ]
    Signed-off-by: Rafael J. Wysocki <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

tracing/osnoise: Fix possible recursive locking for cpus_read_lock() [+ + +]

Author: Ran Xiaokai <[email protected]>
Date:   Fri Mar 21 09:52:49 2025 +0000

    tracing/osnoise: Fix possible recursive locking for cpus_read_lock()
    
    commit 7e6b3fcc9c5294aeafed0dbe1a09a1bc899bd0f2 upstream.
    
    Lockdep reports this deadlock log:
    
    osnoise: could not start sampling thread
    ============================================
    WARNING: possible recursive locking detected
    --------------------------------------------
           CPU0
           ----
      lock(cpu_hotplug_lock);
      lock(cpu_hotplug_lock);
    
     Call Trace:
      <TASK>
      print_deadlock_bug+0x282/0x3c0
      __lock_acquire+0x1610/0x29a0
      lock_acquire+0xcb/0x2d0
      cpus_read_lock+0x49/0x120
      stop_per_cpu_kthreads+0x7/0x60
      start_kthread+0x103/0x120
      osnoise_hotplug_workfn+0x5e/0x90
      process_one_work+0x44f/0xb30
      worker_thread+0x33e/0x5e0
      kthread+0x206/0x3b0
      ret_from_fork+0x31/0x50
      ret_from_fork_asm+0x11/0x20
      </TASK>
    
    This is the deadlock scenario:
    osnoise_hotplug_workfn()
      guard(cpus_read_lock)();      // first lock call
      start_kthread(cpu)
        if (IS_ERR(kthread)) {
          stop_per_cpu_kthreads(); {
            cpus_read_lock();      // second lock call, causes the AA deadlock
          }
        }
    
    It is not necessary to call stop_per_cpu_kthreads(), which stops the
    osnoise kthreads on every other CPU in the system, if a failure occurs
    during hotplug of a certain CPU.
    For start_per_cpu_kthreads(), if the start_kthread() call fails,
    this function calls stop_per_cpu_kthreads() to handle the error.
    Therefore, similarly, there is no need to call stop_per_cpu_kthreads()
    again within start_kthread().
    So just remove stop_per_cpu_kthreads() from start_kthread() to solve this issue.
    
    Cc: [email protected]
    Link: https://lore.kernel.org/[email protected]
    Fixes: c8895e271f79 ("trace/osnoise: Support hotplug operations")
    Signed-off-by: Ran Xiaokai <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Do not use PERF enums when perf is not defined [+ + +]

Author: Steven Rostedt <[email protected]>
Date:   Sun Mar 23 15:21:51 2025 -0400

    tracing: Do not use PERF enums when perf is not defined
    
    commit 8eb1518642738c6892bd629b46043513a3bf1a6a upstream.
    
    An update was made to up the module ref count when a synthetic event is
    registered for both trace and perf events. But if perf is not configured
    in, the perf enums used will cause the kernel to fail to build.
    
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Douglas Raillard <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Fixes: 21581dd4e7ff ("tracing: Ensure module defining synth event cannot be unloaded while tracing")
    Reported-by: kernel test robot <[email protected]>
    Closes: https://lore.kernel.org/oe-kbuild-all/[email protected]/
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Ensure module defining synth event cannot be unloaded while tracing [+ + +]

Author: Douglas Raillard <[email protected]>
Date:   Tue Mar 18 18:09:05 2025 +0000

    tracing: Ensure module defining synth event cannot be unloaded while tracing
    
    commit 21581dd4e7ff6c07d0ab577e3c32b13a74b31522 upstream.
    
    Currently, using synth_event_delete() will fail if the event is being
    used (tracing in progress), but that is normally done in the module exit
    function. At that stage, failing is problematic as returning a non-zero
    status means the module will become locked (impossible to unload or
    reload again).
    
    Instead, ensure the module exit function does not get called in the
    first place by increasing the module refcnt when the event is enabled.
    
    Cc: [email protected]
    Cc: Mathieu Desnoyers <[email protected]>
    Fixes: 35ca5207c2d11 ("tracing: Add synthetic event command generation functions")
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Douglas Raillard <[email protected]>
    Acked-by: Masami Hiramatsu (Google) <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix synth event printk format for str fields [+ + +]

Author: Douglas Raillard <[email protected]>
Date:   Tue Mar 25 16:52:02 2025 +0000

    tracing: Fix synth event printk format for str fields
    
    commit 4d38328eb442dc06aec4350fd9594ffa6488af02 upstream.
    
    The printk format for synth event uses "%.*s" to print string fields,
    but then only passes the pointer part as var arg.
    
    Replace %.*s with %s as the C string is guaranteed to be null-terminated.
    
    The output in print fmt should never have been updated as __get_str()
    handles the string limit because it can access the length of the string in
    the string meta data that is saved in the ring buffer.
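
    The argument mismatch is easy to see with the standard printf
    family, which follows the same convention. The two wrappers below
    are hypothetical helpers written for this example.

```c
#include <stdio.h>

/* "%.*s" consumes two arguments: an int precision, then the string. */
static int fmt_limited(char *out, size_t cap, int max, const char *s)
{
	return snprintf(out, cap, "%.*s", max, s);
}

/* "%s" consumes only the pointer and relies on the NUL terminator. */
static int fmt_plain(char *out, size_t cap, const char *s)
{
	return snprintf(out, cap, "%s", s);
}
```

    Passing only the pointer to a "%.*s" format leaves the precision
    argument unfilled, so the formatter reads the wrong values for both
    the length and the string.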
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Fixes: 8db4d6bfbbf92 ("tracing: Change synthetic event string format to limit printed length")
    Link: https://lore.kernel.org/[email protected]
    Signed-off-by: Douglas Raillard <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tracing: Fix use-after-free in print_graph_function_flags during tracer switching [+ + +]

Author: Tengda Wu <[email protected]>
Date:   Thu Mar 20 12:21:37 2025 +0000

    tracing: Fix use-after-free in print_graph_function_flags during tracer switching
    
    commit 7f81f27b1093e4895e87b74143c59c055c3b1906 upstream.
    
    Kairui reported a UAF issue in print_graph_function_flags() during
    ftrace stress testing [1]. This issue can be reproduced by putting a
    'mdelay(10)' after 'mutex_unlock(&trace_types_lock)' in s_start(),
    and executing the following script:
    
      $ echo function_graph > current_tracer
      $ cat trace > /dev/null &
      $ sleep 5  # Ensure the 'cat' reaches the 'mdelay(10)' point
      $ echo timerlat > current_tracer
    
    The root cause lies in the two calls to print_graph_function_flags
    within print_trace_line during each s_show():
    
      * One through 'iter->trace->print_line()';
      * Another through 'event->funcs->trace()', which is hidden in
        print_trace_fmt() before print_trace_line returns.
    
    Tracer switching only updates the former, while the latter continues
    to use the print_line function of the old tracer, which in the script
    above is print_graph_function_flags.
    
    Moreover, when switching from the 'function_graph' tracer to the
    'timerlat' tracer, s_start only calls graph_trace_close of the
    'function_graph' tracer to free 'iter->private', but does not set
    it to NULL. This provides an opportunity for 'event->funcs->trace()'
    to use an invalid 'iter->private'.
    
    To fix this issue, set 'iter->private' to NULL immediately after
    freeing it in graph_trace_close(), ensuring that an invalid pointer
    is not passed to other tracers. Additionally, clean up the unnecessary
    'iter->private = NULL' during each 'cat trace' when using wakeup and
    irqsoff tracers.
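
    The free-then-clear idiom in isolation, as a userspace sketch.
    struct iter and the three functions are hypothetical and much
    simpler than the tracing code; they only show why clearing the
    pointer lets a later user detect that the data is gone.

```c
#include <stdlib.h>

/* Hypothetical iterator carrying tracer-private data. */
struct iter { void *private; };

static int graph_open(struct iter *it)
{
	it->private = calloc(1, 64);
	return it->private ? 0 : -1;
}

/* Free the private data and clear the pointer so later users see NULL. */
static void graph_close(struct iter *it)
{
	free(it->private);
	it->private = NULL;
}

/* A later print path can then tell that no data is attached. */
static int print_line(const struct iter *it)
{
	if (!it->private)
		return -1;	/* nothing to print from, but no stale access */
	return 0;
}
```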
    
     [1] https://lore.kernel.org/all/[email protected]/
    
    Cc: [email protected]
    Cc: Masami Hiramatsu <[email protected]>
    Cc: Mathieu Desnoyers <[email protected]>
    Cc: Zheng Yejian <[email protected]>
    Link: https://lore.kernel.org/[email protected]
    Fixes: eecb91b9f98d ("tracing: Fix memleak due to race between current_tracer and trace")
    Closes: https://lore.kernel.org/all/CAMgjq7BW79KDSCyp+tZHjShSzHsScSiJxn5ffskp-QzVM06fxw@mail.gmail.com/
    Reported-by: Kairui Song <[email protected]>
    Signed-off-by: Tengda Wu <[email protected]>
    Signed-off-by: Steven Rostedt (Google) <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

tty: serial: fsl_lpuart: disable transmitter before changing RS485 related registers [+ + +]

Author: Sherry Sun <[email protected]>
Date:   Wed Mar 12 10:25:03 2025 +0800

    tty: serial: fsl_lpuart: disable transmitter before changing RS485 related registers
    
    [ Upstream commit f5cb528d6441eb860250a2f085773aac4f44085e ]
    
    According to the LPUART reference manual, the TXRTSE and TXRTSPOL bits
    of the MODIR register can only be changed when the transmitter is
    disabled. So disable the transmitter before changing RS485 related
    registers and re-enable it after the change is done.
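
    The ordering can be sketched against a mock register file. The
    struct, the bit positions and set_rs485() are assumptions made up
    for this example; they are not the LPUART register layout or the
    driver's code.

```c
#include <stdint.h>

/* Hypothetical register file and bit positions, for illustration only. */
struct uart_regs { uint32_t ctrl; uint32_t modir; };
#define CTRL_TE        (1u << 0)	/* transmitter enable (assumed bit) */
#define MODIR_TXRTSE   (1u << 1)	/* assumed bit */
#define MODIR_TXRTSPOL (1u << 2)	/* assumed bit */

static void set_rs485(struct uart_regs *r, int enable, int active_high)
{
	uint32_t te = r->ctrl & CTRL_TE;	/* remember transmitter state */
	uint32_t modir;

	r->ctrl &= ~CTRL_TE;			/* disable transmitter first */

	modir = r->modir & ~(MODIR_TXRTSE | MODIR_TXRTSPOL);
	if (enable)
		modir |= MODIR_TXRTSE;
	if (enable && active_high)
		modir |= MODIR_TXRTSPOL;
	r->modir = modir;

	r->ctrl |= te;				/* restore previous state */
}
```

    Saving and restoring the enable bit, rather than setting it
    unconditionally, leaves a transmitter that was off before the call
    still off afterwards.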
    
    Fixes: 67b01837861c ("tty: serial: lpuart: Add RS485 support for 32-bit uart flavour")
    Cc: stable <[email protected]>
    Signed-off-by: Sherry Sun <[email protected]>
    Reviewed-by: Frank Li <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

tty: serial: fsl_lpuart: use UARTMODIR register bits for lpuart32 platform [+ + +]

Author: Sherry Sun <[email protected]>
Date:   Fri Apr 14 10:21:11 2023 +0800

    tty: serial: fsl_lpuart: use UARTMODIR register bits for lpuart32 platform
    
    [ Upstream commit d57d56e4dddfb5c92cd81abf8922055bf0fb85a4 ]
    
    For lpuart32 platforms, the UARTMODIR register is used instead of
    UARTMODEM. So configure the corresponding UARTMODIR register bits here
    to avoid confusion.
    
    Reviewed-by: Ilpo Järvinen <[email protected]>
    Signed-off-by: Sherry Sun <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Stable-dep-of: f5cb528d6441 ("tty: serial: fsl_lpuart: disable transmitter before changing RS485 related registers")
    Signed-off-by: Sasha Levin <[email protected]>

tunnels: Accept PACKET_HOST in skb_tunnel_check_pmtu(). [+ + +]

Author: Guillaume Nault <[email protected]>
Date:   Sat Mar 29 01:33:44 2025 +0100

    tunnels: Accept PACKET_HOST in skb_tunnel_check_pmtu().
    
    [ Upstream commit 8930424777e43257f5bf6f0f0f53defd0d30415c ]
    
    Because skb_tunnel_check_pmtu() doesn't handle PACKET_HOST packets,
    commit 30a92c9e3d6b ("openvswitch: Set the skbuff pkt_type for proper
    pmtud support.") forced skb->pkt_type to PACKET_OUTGOING for
    openvswitch packets that are sent using the OVS_ACTION_ATTR_OUTPUT
    action. This allowed such packets to invoke the
    iptunnel_pmtud_check_icmp() or iptunnel_pmtud_check_icmpv6() helpers
    and thus trigger PMTU update on the input device.
    
    However, this also broke other parts of PMTU discovery. Since these
    packets don't have the PACKET_HOST type anymore, they won't trigger the
    sending of ICMP Fragmentation Needed or Packet Too Big messages to
    remote hosts when oversized (see the skb_in->pkt_type condition in
    __icmp_send() for example).
    
    These two skb->pkt_type checks are therefore incompatible as one
    requires skb->pkt_type to be PACKET_HOST, while the other requires it
    to be anything but PACKET_HOST.
    
    It makes sense to not trigger ICMP messages for non-PACKET_HOST packets
    as these messages should be generated only for incoming l2-unicast
    packets. However there doesn't seem to be any reason for
    skb_tunnel_check_pmtu() to ignore PACKET_HOST packets.
    
    Allow both cases to work by allowing skb_tunnel_check_pmtu() to work on
    PACKET_HOST packets and not overriding skb->pkt_type in openvswitch
    anymore.
    
    Fixes: 30a92c9e3d6b ("openvswitch: Set the skbuff pkt_type for proper pmtud support.")
    Fixes: 4cb47a8644cc ("tunnels: PMTU discovery support for directly bridged IP packets")
    Signed-off-by: Guillaume Nault <[email protected]>
    Reviewed-by: Stefano Brivio <[email protected]>
    Reviewed-by: Aaron Conole <[email protected]>
    Tested-by: Aaron Conole <[email protected]>
    Link: https://patch.msgid.link/eac941652b86fddf8909df9b3bf0d97bc9444793.1743208264.git.gnault@redhat.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

udp: Fix memory accounting leak. [+ + +]

Author: Kuniyuki Iwashima <[email protected]>
Date:   Tue Apr 1 11:44:43 2025 -0700

    udp: Fix memory accounting leak.
    
    [ Upstream commit df207de9d9e7a4d92f8567e2c539d9c8c12fd99d ]
    
    Matt Dowling reported a weird UDP memory usage issue.
    
    Under normal operation, the UDP memory usage reported in /proc/net/sockstat
    remains close to zero.  However, it occasionally spiked to 524,288 pages
    and never dropped.  Moreover, the value doubled when the application was
    terminated.  Finally, it caused intermittent packet drops.
    
    We can reproduce the issue with the script below [0]:
    
      1. /proc/net/sockstat reports 0 pages
    
        # cat /proc/net/sockstat | grep UDP:
        UDP: inuse 1 mem 0
    
      2. Run the script till the report reaches 524,288
    
        # python3 test.py & sleep 5
        # cat /proc/net/sockstat | grep UDP:
        UDP: inuse 3 mem 524288  <-- (INT_MAX + 1) >> PAGE_SHIFT
    
      3. Kill the socket and confirm the number never drops
    
        # pkill python3 && sleep 5
        # cat /proc/net/sockstat | grep UDP:
        UDP: inuse 1 mem 524288
    
      4. (necessary since v6.0) Trigger proto_memory_pcpu_drain()
    
        # python3 test.py & sleep 1 && pkill python3
    
      5. The number doubles
    
        # cat /proc/net/sockstat | grep UDP:
        UDP: inuse 1 mem 1048577
    
    The application set SO_RCVBUF to INT_MAX, which triggered an integer
    overflow in udp_rmem_release().
    
    When a socket is close()d, udp_destruct_common() purges its receive
    queue and sums up skb->truesize in the queue.  This total is calculated
    and stored in a local unsigned integer variable.
    
    The total size is then passed to udp_rmem_release() to adjust memory
    accounting.  However, because the function takes a signed integer
    argument, the total size can wrap around, causing an overflow.
    
    Then, the released amount is calculated as follows:
    
      1) Add size to sk->sk_forward_alloc.
      2) Round down sk->sk_forward_alloc to the nearest lower multiple of
          PAGE_SIZE and assign it to amount.
      3) Subtract amount from sk->sk_forward_alloc.
      4) Pass amount >> PAGE_SHIFT to __sk_mem_reduce_allocated().
    
    When the issue occurred, the total in udp_destruct_common() was 2147484480
    (INT_MAX + 833), which was cast to -2147482816 in udp_rmem_release().
    
    At 1) sk->sk_forward_alloc is changed from 3264 to -2147479552, and
    2) sets amount to -2147479552.  3) reverts the wraparound, so we don't
    see a warning in inet_sock_destruct().  However, udp_memory_allocated
    ends up doubling at 4).
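
Editorial sketch (not part of the upstream commit): the four steps can be replayed with the numbers quoted above. It assumes 4 KiB pages and a 32-bit two's-complement int; to_s32() is a hypothetical helper that mimics the narrowing to a signed C int, and the `partial` argument of the real function is ignored.

```python
PAGE_SHIFT = 12                  # assumption: 4 KiB pages
PAGE_SIZE = 1 << PAGE_SHIFT


def to_s32(value):
    """Interpret the low 32 bits of value as a signed C int."""
    value &= 0xFFFFFFFF
    return value - (1 << 32) if value & (1 << 31) else value


total = 2147484480               # unsigned total from udp_destruct_common()
size = to_s32(total)             # narrowed to the signed int parameter

after_add = to_s32(3264 + size)                 # step 1: add size
amount = after_add & ~(PAGE_SIZE - 1)           # step 2: round down to a page multiple
forward_alloc = to_s32(after_add - amount)      # step 3: subtract amount
pages = amount >> PAGE_SHIFT                    # step 4: pages handed to the accounting

print(size, after_add, amount, forward_alloc, pages)
```

In this model the page count in step 4 comes out negative, so subtracting it from the global counter raises the counter instead of lowering it, consistent with the doubling described above.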
    
    Since commit 3cd3399dd7a8 ("net: implement per-cpu reserves for
    memory_allocated"), memory usage no longer doubles immediately after
    a socket is close()d because __sk_mem_reduce_allocated() caches the
    amount in udp_memory_per_cpu_fw_alloc.  However, the next time a UDP
    socket receives a packet, the subtraction takes effect, causing UDP
    memory usage to double.
    
    This issue makes further memory allocation fail once the socket's
    sk->sk_rmem_alloc exceeds net.ipv4.udp_rmem_min, resulting in packet
    drops.
    
    To prevent this issue, let's use unsigned int for the calculation and
    call sk_forward_alloc_add() only once for the small delta.
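
Editorial sketch (not part of the upstream commit): one way to read "unsigned int for the calculation" and "only once for the small delta", using the same example values. This is an assumption about the shape of the fix, not a copy of the kernel code; it assumes 4 KiB pages, ignores the `partial` argument, and to_s32() is a hypothetical helper.

```python
PAGE_SHIFT = 12                  # assumption: 4 KiB pages
PAGE_SIZE = 1 << PAGE_SHIFT
U32_MASK = 0xFFFFFFFF


def to_s32(value):
    """Interpret the low 32 bits of value as a signed C int."""
    value &= U32_MASK
    return value - (1 << 32) if value & (1 << 31) else value


size = 2147484480                # kept unsigned, no narrowing to int
forward_alloc = 3264

# Page-aligned amount computed entirely in unsigned 32-bit arithmetic.
amt = ((size + forward_alloc) & U32_MASK) & ~(PAGE_SIZE - 1)

# Only the small remainder is applied to sk_forward_alloc, in one step.
delta = to_s32(size - amt)
forward_alloc += delta

pages = amt >> PAGE_SHIFT        # positive page count to release
print(amt, delta, forward_alloc, pages)
```

Here the released page count stays positive and sk_forward_alloc only ever sees a delta smaller than a page in magnitude.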
    
    Note that first_packet_length() also potentially has the same problem.
    
    [0]:
    from socket import *
    
    SO_RCVBUFFORCE = 33
    INT_MAX = (2 ** 31) - 1
    
    s = socket(AF_INET, SOCK_DGRAM)
    s.bind(('', 0))
    s.setsockopt(SOL_SOCKET, SO_RCVBUFFORCE, INT_MAX)
    
    c = socket(AF_INET, SOCK_DGRAM)
    c.connect(s.getsockname())
    
    data = b'a' * 100
    
    while True:
        c.send(data)
    
    Fixes: f970bd9e3a06 ("udp: implement memory accounting helpers")
    Reported-by: Matt Dowling <[email protected]>
    Signed-off-by: Kuniyuki Iwashima <[email protected]>
    Reviewed-by: Willem de Bruijn <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

um: remove copy_from_kernel_nofault_allowed [+ + +]

Author: Benjamin Berg <[email protected]>
Date:   Mon Feb 10 17:09:26 2025 +0100

    um: remove copy_from_kernel_nofault_allowed
    
    [ Upstream commit 84a6fc378471fbeaf48f8604566a5a33a3d63c18 ]
    
    There is no need to override the default version of this function
    anymore as UML now has proper _nofault memory access functions.
    
    Doing this also fixes the fact that the implementation was incorrect as
    using mincore() will incorrectly flag pages as inaccessible if they were
    swapped out by the host.
    
    Fixes: f75b1b1bedfb ("um: Implement probe_kernel_read()")
    Signed-off-by: Benjamin Berg <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usb: xhci: correct debug message page size calculation [+ + +]

Author: Niklas Neronin <[email protected]>
Date:   Thu Mar 6 16:49:47 2025 +0200

    usb: xhci: correct debug message page size calculation
    
    [ Upstream commit 55741c723318905e6d5161bf1e12749020b161e3 ]
    
    The ffs() function returns the index of the first set bit, starting from 1.
    If no bits are set, it returns zero. This behavior causes an off-by-one
    page size in the debug message, as the page size calculation [1]
    is zero-based, while ffs() is one-based.
    
    Fix this by subtracting one from the result of ffs(). Note that since
    variable 'val' is unsigned, subtracting one from zero will result in the
    maximum unsigned integer value. Consequently, the condition 'if (val < 16)'
    will still function correctly.
    
    [1], Page size: (2^(n+12)), where 'n' is the set page size bit.
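
Editorial sketch (not part of the upstream commit): the off-by-one can be reproduced with a small model of ffs() and the page size formula from [1]. The register value 0x1 is a hypothetical example, and to_u32() mimics the unsigned 'val' variable.

```python
def ffs(x):
    # One-based index of the least significant set bit, 0 if no bit is set.
    return (x & -x).bit_length()


def to_u32(x):
    # Mimic storing the result in a 32-bit unsigned variable.
    return x & 0xFFFFFFFF


def page_size_kib(n):
    # Page size is 2^(n+12) bytes, where n is the zero-based set bit.
    return (1 << (n + 12)) // 1024


reg = 0x1                                   # hypothetical value: bit 0 set
before_fix = page_size_kib(ffs(reg))        # one-based index used directly
after_fix = page_size_kib(to_u32(ffs(reg) - 1))

no_bits = to_u32(ffs(0) - 1)                # wraps to the maximum u32 value
print(before_fix, after_fix, no_bits, no_bits < 16)
```

With bit 0 set, the one-based index reports 8K while the corrected index reports 4K. When no bit is set, the wrapped value is far above 16, so the 'if (val < 16)' test still rejects it.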
    
    Fixes: 81720ec5320c ("usb: host: xhci: use ffs() in xhci_mem_init()")
    Signed-off-by: Niklas Neronin <[email protected]>
    Signed-off-by: Mathias Nyman <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

usbnet:fix NPE during rx_complete [+ + +]

Author: Ying Lu <[email protected]>
Date:   Wed Apr 2 16:58:59 2025 +0800

    usbnet:fix NPE during rx_complete
    
    commit 51de3600093429e3b712e5f091d767babc5dd6df upstream.
    
    The critical path is missing a usbnet_going_away check: the
    usb_submit_urb() call is not guarded by usbnet_going_away,
    whereas __usbnet_queue_skb() includes this check.
    
    This inconsistency creates a race condition in which a URB
    request may succeed while the corresponding SKB fails to be
    queued.
    
    Subsequent processing (e.g., rx_complete → defer_bh →
    __skb_unlink(skb, list)) then accesses skb->next, triggering
    a NULL pointer dereference (kernel panic).
    
    Fixes: 04e906839a05 ("usbnet: fix cyclical race on disconnect with work queue")
    Cc: [email protected]
    Signed-off-by: Ying Lu <[email protected]>
    Link: https://patch.msgid.link/4c9ef2efaa07eb7f9a5042b74348a67e5a3a7aea.1743584159.git.luying1@xiaomi.com
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

vsock: avoid timeout during connect() if the socket is closing [+ + +]

Author: Stefano Garzarella <[email protected]>
Date:   Fri Mar 28 15:15:28 2025 +0100

    vsock: avoid timeout during connect() if the socket is closing
    
    [ Upstream commit fccd2b711d9628c7ce0111d5e4938652101ee30a ]
    
    When a peer attempts to establish a connection, vsock_connect() contains
    a loop that waits for the state to be TCP_ESTABLISHED. However, the
    other peer can be fast enough to accept the connection and close it
    immediately, thus moving the state to TCP_CLOSING.
    
    When this happens, the peer in the vsock_connect() is properly woken up,
    but since the state is not TCP_ESTABLISHED, it goes back to sleep
    until the timeout expires, returning -ETIMEDOUT.
    
    If the socket state is TCP_CLOSING, waiting for the timeout is pointless.
    vsock_connect() can return immediately without errors or delay since the
    connection actually happened. The socket will be in a closing state,
    but this is not an issue, and subsequent calls will fail as expected.
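
Editorial sketch (not part of the upstream commit): a simplified model of the wait loop. The state and errno values are the Linux ones; wait_for_connect() and its accept_closing flag are hypothetical, and each list element stands for the socket state seen at one wakeup before the timeout expires.

```python
TCP_ESTABLISHED = 1
TCP_SYN_SENT = 2
TCP_CLOSING = 11
ETIMEDOUT = 110


def wait_for_connect(states_at_wakeup, accept_closing):
    """Return 0 on success or -ETIMEDOUT if no wakeup ends the wait.

    accept_closing=False models the old loop, which only stops on
    TCP_ESTABLISHED; accept_closing=True also stops on TCP_CLOSING.
    """
    for state in states_at_wakeup:
        if state == TCP_ESTABLISHED:
            return 0
        if accept_closing and state == TCP_CLOSING:
            return 0
        # Otherwise go back to sleep until the next wakeup.
    return -ETIMEDOUT


# The peer accepted and closed before the connecting side was woken up.
print(wait_for_connect([TCP_CLOSING], accept_closing=False))
print(wait_for_connect([TCP_CLOSING], accept_closing=True))
```

In the model, the old loop sleeps through the only wakeup and times out, while the new condition returns as soon as the closing state is observed.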
    
    We discovered this issue while developing a test that accepts and
    immediately closes connections to stress the transport switch between
    two connect() calls, where the first one was interrupted by a signal
    (see Closes link).
    
    Reported-by: Luigi Leonardi <[email protected]>
    Closes: https://lore.kernel.org/virtualization/bq6hxrolno2vmtqwcvb5bljfpb7mvwb3kohrvaed6auz5vxrfv@ijmd2f3grobn/
    Fixes: d021c344051a ("VSOCK: Introduce VM Sockets")
    Signed-off-by: Stefano Garzarella <[email protected]>
    Acked-by: Paolo Abeni <[email protected]>
    Tested-by: Luigi Leonardi <[email protected]>
    Reviewed-by: Luigi Leonardi <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Jakub Kicinski <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

watch_queue: fix pipe accounting mismatch [+ + +]

Author: Eric Sandeen <[email protected]>
Date:   Thu Feb 27 11:41:08 2025 -0600

    watch_queue: fix pipe accounting mismatch
    
    [ Upstream commit f13abc1e8e1a3b7455511c4e122750127f6bc9b0 ]
    
    Currently, watch_queue_set_size() modifies the pipe buffers charged to
    user->pipe_bufs without updating the pipe->nr_accounted on the pipe
    itself, due to the if (!pipe_has_watch_queue()) test in
    pipe_resize_ring(). This means that when the pipe is ultimately freed,
    we decrement user->pipe_bufs by something other than what we had
    charged to it, potentially leading to an underflow. This in turn can
    cause subsequent too_many_pipe_buffers_soft() tests to fail with -EPERM.
    
    To remedy this, explicitly account for the pipe usage in
    watch_queue_set_size() to match the number set via account_pipe_buffers().
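
Editorial sketch (not part of the upstream commit): a toy model of the mismatch. It assumes a pipe that starts with 16 accounted buffers and a watch queue resized to 1 page; the classes, lifecycle() and the 64-bit wrap are illustrative assumptions, and only the add-the-difference behavior of account_pipe_buffers() is modeled.

```python
ULONG_MASK = (1 << 64) - 1       # model user->pipe_bufs as a 64-bit unsigned long


class User:
    def __init__(self):
        self.pipe_bufs = 0


class Pipe:
    def __init__(self):
        self.nr_accounted = 0


def account_pipe_buffers(user, old, new):
    # Adjust the per-user charge by the difference between new and old.
    user.pipe_bufs = (user.pipe_bufs + new - old) & ULONG_MASK
    return user.pipe_bufs


def lifecycle(fixed, initial_bufs=16, watch_pages=1):
    user, pipe = User(), Pipe()

    # Pipe creation: charge the user and remember what was charged.
    account_pipe_buffers(user, 0, initial_bufs)
    pipe.nr_accounted = initial_bufs

    # watch_queue_set_size(): the user charge changes ...
    account_pipe_buffers(user, pipe.nr_accounted, watch_pages)
    if fixed:
        # ... and with the fix the pipe records the new charge too.
        pipe.nr_accounted = watch_pages

    # Pipe free: uncharge whatever the pipe believes it was charged.
    return account_pipe_buffers(user, pipe.nr_accounted, 0)


print(lifecycle(fixed=False), lifecycle(fixed=True))
```

Without the update to nr_accounted the final uncharge removes 16 buffers from a balance of 1 and wraps around; with it the balance returns to zero.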
    
    (It's unclear why watch_queue_set_size() does not update nr_accounted;
    it may be due to intentional overprovisioning in watch_queue_set_size()?)
    
    Fixes: e95aada4cb93d ("pipe: wakeup wr_wait after setting max_usage")
    Signed-off-by: Eric Sandeen <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Christian Brauner <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: brcmfmac: keep power during suspend if board requires it [+ + +]

Author: Matthias Proske <[email protected]>
Date:   Wed Feb 12 19:59:35 2025 +0100

    wifi: brcmfmac: keep power during suspend if board requires it
    
    [ Upstream commit 8c3170628a9ce24a59647bd24f897e666af919b8 ]
    
    After commit 92cadedd9d5f ("brcmfmac: Avoid keeping power to SDIO card
    unless WOWL is used"), the wifi adapter by default is turned off on
    suspend and then re-probed on resume.
    
    This conflicts with some embedded boards that need to remain powered.
    They will fail on resume with:
    
    brcmfmac: brcmf_sdio_bus_rxctl: resumed on timeout
    ieee80211 phy1: brcmf_bus_started: failed: -110
    ieee80211 phy1: brcmf_attach: dongle is not responding: err=-110
    brcmfmac: brcmf_sdio_firmware_callback: brcmf_attach failed
    
    This commit checks for the Device Tree property 'cap-power-off-cards'.
    If this property is not set, it means that we do not have the capability
    to power off and should therefore remain powered.
    
    Signed-off-by: Matthias Proske <[email protected]>
    Acked-by: Arend van Spriel <[email protected]>
    Link: https://patch.msgid.link/[email protected]
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: iwlwifi: fw: allocate chained SG tables for dump [+ + +]

Author: Johannes Berg <[email protected]>
Date:   Sun Feb 9 14:34:45 2025 +0200

    wifi: iwlwifi: fw: allocate chained SG tables for dump
    
    [ Upstream commit 7774e3920029398ad49dc848b23840593f14d515 ]
    
    The firmware dumps can be pretty big, and since we use single
    pages for each SG table entry, even the table itself may end
    up being an order-5 allocation. Build chained tables so that
    we need not allocate a higher-order table here.
    
    This could be improved and cleaned up, e.g. by using the SG
    pool code or simply kvmalloc(), but all of that would require
    also updating the devcoredump first since that frees it all,
    so we need to be more careful. SG pool might also run against
    the CONFIG_ARCH_NO_SG_CHAIN limitation, which is irrelevant
    here.
    
    Also use _devcd_free_sgtable() for the error paths now, much
    simpler especially since it's in two places now.
    
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Miri Korenblit <[email protected]>
    Link: https://patch.msgid.link/20250209143303.697c7a465ac9.Iea982df46b5c075bfb77ade36f187d99a70c63db@changeid
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

wifi: iwlwifi: mvm: use the right version of the rate API [+ + +]

Author: Emmanuel Grumbach <[email protected]>
Date:   Sun Feb 9 14:34:50 2025 +0200

    wifi: iwlwifi: mvm: use the right version of the rate API
    
    [ Upstream commit a03e2082e678ea10d0d8bdf3ed933eb05a8ddbb0 ]
    
    The firmware uses the newer version of the API in recent devices. For
    older devices, we translate the rate to the new format.
    Don't parse the rate with old parsing macros.
    
    Signed-off-by: Emmanuel Grumbach <[email protected]>
    Reviewed-by: Johannes Berg <[email protected]>
    Signed-off-by: Miri Korenblit <[email protected]>
    Link: https://patch.msgid.link/20250209143303.13d70cdcbb4e.Ic92193bce4013b70a823cfef250ee79c16cf7c17@changeid
    Signed-off-by: Johannes Berg <[email protected]>
    Signed-off-by: Sasha Levin <[email protected]>

x86/dumpstack: Fix inaccurate unwinding from exception stacks due to misplaced assignment [+ + +]

Author: Jann Horn <[email protected]>
Date:   Tue Mar 25 03:01:23 2025 +0100

    x86/dumpstack: Fix inaccurate unwinding from exception stacks due to misplaced assignment
    
    [ Upstream commit 2c118f50d7fd4d9aefc4533a26f83338b2906b7a ]
    
    Commit:
    
      2e4be0d011f2 ("x86/show_trace_log_lvl: Ensure stack pointer is aligned, again")
    
    was intended to ensure alignment of the stack pointer; but it also moved
    the initialization of the "stack" variable down into the loop header.
    
    This was likely intended as a no-op cleanup, since the commit
    message does not mention it; however, this caused a behavioral change
    because the value of "regs" is different between the two places.
    
    Originally, get_stack_pointer() used the regs provided by the caller; after
    that commit, get_stack_pointer() instead uses the regs at the top of the
    stack frame the unwinder is looking at. Often, there are no such regs at
    all, and "regs" is NULL, causing get_stack_pointer() to fall back to the
    task's current stack pointer, which is not what we want here, but probably
    happens to mostly work. Other times, the original regs will point to
    another regs frame - in that case, the linear guess unwind logic in
    show_trace_log_lvl() will start unwinding too far up the stack, causing the
    first frame found by the proper unwinder to never be visited, resulting in
    a stack trace consisting purely of guess lines.
    
    Fix it by moving the "stack = " assignment back where it belongs.
    
    Fixes: 2e4be0d011f2 ("x86/show_trace_log_lvl: Ensure stack pointer is aligned, again")
    Signed-off-by: Jann Horn <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/entry: Fix ORC unwinder for PUSH_REGS with save_ret=1 [+ + +]

Author: Jann Horn <[email protected]>
Date:   Tue Mar 25 03:01:22 2025 +0100

    x86/entry: Fix ORC unwinder for PUSH_REGS with save_ret=1
    
    [ Upstream commit 57e2428f8df8263275344566e02c277648a4b7f1 ]
    
    PUSH_REGS with save_ret=1 is used by interrupt entry helper functions that
    initially start with a UNWIND_HINT_FUNC ORC state.
    
    However, save_ret=1 means that we clobber the helper function's return
    address (and then later restore the return address further down on the
    stack); after that point, the only thing on the stack we can unwind through
    is the IRET frame, so use UNWIND_HINT_IRET_REGS until we have a full
    pt_regs frame.
    
    ( An alternate approach would be to move the pt_regs->di overwrite down
      such that it is the final step of pt_regs setup; but I don't want to
      rearrange entry code just to make unwinding a tiny bit more elegant. )
    
    Fixes: 9e809d15d6b6 ("x86/entry: Reduce the code footprint of the 'idtentry' macro")
    Signed-off-by: Jann Horn <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: Brian Gerst <[email protected]>
    Cc: Juergen Gross <[email protected]>
    Cc: H. Peter Anvin <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Cc: Kees Cook <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Josh Poimboeuf <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/fpu/xstate: Fix inconsistencies in guest FPU xfeatures [+ + +]

Author: Chao Gao <[email protected]>
Date:   Mon Mar 17 22:06:11 2025 +0800

    x86/fpu/xstate: Fix inconsistencies in guest FPU xfeatures
    
    [ Upstream commit dda366083e5ff307a4a728757db874bbfe7550be ]
    
    Guest FPUs manage vCPU FPU states. They are allocated via
    fpu_alloc_guest_fpstate() and are resized in fpstate_realloc() when XFD
    features are enabled.
    
    Since the introduction of guest FPUs, there have been inconsistencies in
    the kernel buffer size and xfeatures:
    
     1. fpu_alloc_guest_fpstate() has used fpu_user_cfg since its introduction. See:
    
        69f6ed1d14c6 ("x86/fpu: Provide infrastructure for KVM FPU cleanup")
        36487e6228c4 ("x86/fpu: Prepare guest FPU for dynamically enabled FPU features")
    
     2. __fpstate_reset() references fpu_kernel_cfg to set storage attributes.
    
     3. fpu->guest_perm uses fpu_kernel_cfg, affecting fpstate_realloc().
    
    A recent commit in the tip:x86/fpu tree partially addressed the inconsistency
    between (1) and (3) by using fpu_kernel_cfg for size calculation in (1),
    but left fpu_guest->xfeatures and fpu_guest->perm still referencing
    fpu_user_cfg:
    
      https://lore.kernel.org/all/[email protected]/
    
      1937e18cc3cf ("x86/fpu: Fix guest FPU state buffer allocation size")
    
    The inconsistencies within fpu_alloc_guest_fpstate() and across the
    mentioned functions cause confusion.
    
    Fix them by using fpu_kernel_cfg consistently in fpu_alloc_guest_fpstate(),
    except for fields related to the UABI buffer. Referencing fpu_kernel_cfg
    won't impact functionalities, as:
    
     1. fpu_guest->perm is overwritten shortly in fpu_init_guest_permissions()
        with fpstate->guest_perm, which already uses fpu_kernel_cfg.
    
     2. fpu_guest->xfeatures is solely used to check if XFD features are enabled.
        Including supervisor xfeatures doesn't affect the check.
    
    Fixes: 36487e6228c4 ("x86/fpu: Prepare guest FPU for dynamically enabled FPU features")
    Suggested-by: Chang S. Bae <[email protected]>
    Signed-off-by: Chao Gao <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: H. Peter Anvin <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Cc: Oleg Nesterov <[email protected]>
    Cc: Dave Hansen <[email protected]>
    Cc: Juergen Gross <[email protected]>
    Cc: Stefano Stabellini <[email protected]>
    Cc: Paolo Bonzini <[email protected]>
    Cc: Vitaly Kuznetsov <[email protected]>
    Cc: Sean Christopherson <[email protected]>
    Cc: David Woodhouse <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/fpu: Avoid copying dynamic FP state from init_task in arch_dup_task_struct() [+ + +]

Author: Benjamin Berg <[email protected]>
Date:   Wed Feb 26 14:31:36 2025 +0100

    x86/fpu: Avoid copying dynamic FP state from init_task in arch_dup_task_struct()
    
    [ Upstream commit 5d3b81d4d8520efe888536b6906dc10fd1a228a8 ]
    
    The init_task instance of struct task_struct is statically allocated and
    may not contain the full FP state for userspace. As such, limit the copy
    to the valid area of both init_task and 'dst' and ensure all memory is
    initialized.
    
    Note that the FP state is only needed for userspace, and as such it is
    entirely reasonable for init_task to not contain parts of it.
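
Editorial sketch (not part of the upstream commit): the copy-then-pad behavior this relies on can be modeled on byte strings. This is a Python stand-in for the semantics of the kernel's memcpy_and_pad() helper mentioned in the v2 note; the buffer contents and sizes are made-up examples, not real task_struct sizes.

```python
def memcpy_and_pad(dest_len, src, count, pad=0):
    """Return a dest_len-byte buffer built from the first count bytes of src.

    If the destination is larger than count, the tail is filled with pad;
    otherwise the copy is truncated to dest_len bytes.
    """
    if dest_len > count:
        return bytes(src[:count]) + bytes([pad]) * (dest_len - count)
    return bytes(src[:dest_len])


static_init_task = b"INIT"       # stand-in for the statically allocated init_task

# Destination larger than the source: copy the valid part, zero the rest.
bigger = memcpy_and_pad(6, static_init_task, len(static_init_task))

# Destination smaller than the source: copy only what fits.
smaller = memcpy_and_pad(2, static_init_task, len(static_init_task))

print(bigger, smaller)
```

Either way the copy never reads past the valid source area and every destination byte ends up initialized.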
    
    Fixes: 5aaeb5c01c5b ("x86/fpu, sched: Introduce CONFIG_ARCH_WANTS_DYNAMIC_TASK_STRUCT and use it on x86")
    Signed-off-by: Benjamin Berg <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: H. Peter Anvin <[email protected]>
    Cc: Oleg Nesterov <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    ----
    
    v2:
    - Fix code if arch_task_struct_size < sizeof(init_task) by using
      memcpy_and_pad.
    
    Signed-off-by: Sasha Levin <[email protected]>

x86/fpu: Fix guest FPU state buffer allocation size [+ + +]

Author: Stanislav Spassov <[email protected]>
Date:   Tue Feb 18 14:10:45 2025 +0000

    x86/fpu: Fix guest FPU state buffer allocation size
    
    [ Upstream commit 1937e18cc3cf27e2b3ef70e8c161437051ab7608 ]
    
    Ongoing work on an optimization to batch-preallocate vCPU state buffers
    for KVM revealed a mismatch between the allocation sizes used in
    fpu_alloc_guest_fpstate() and fpstate_realloc(). While the former
    allocates a buffer sized to fit the default set of XSAVE features
    in UABI form (as per fpu_user_cfg), the latter uses its ksize argument
    derived (for the requested set of features) in the same way as the sizes
    found in fpu_kernel_cfg, i.e. using the compacted in-kernel
    representation.
    
    The correct size to use for guest FPU state should indeed be the
    kernel one as seen in fpstate_realloc(). The original issue likely
    went unnoticed through a combination of UABI size typically being
    larger than or equal to kernel size, and/or both amounting to the
    same number of allocated 4K pages.
    
    Fixes: 69f6ed1d14c6 ("x86/fpu: Provide infrastructure for KVM FPU cleanup")
    Signed-off-by: Stanislav Spassov <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/mm/pat: cpa-test: fix length for CPA_ARRAY test [+ + +]

Author: Mike Rapoport (Microsoft) <[email protected]>
Date:   Sun Jan 26 09:47:25 2025 +0200

    x86/mm/pat: cpa-test: fix length for CPA_ARRAY test
    
    [ Upstream commit 33ea120582a638b2f2e380a50686c2b1d7cce795 ]
    
    The CPA_ARRAY test always uses len[1] as numpages argument to
    change_page_attr_set() although the addresses array is different each
    iteration of the test loop.
    
    Replace len[1] with len[i] to have numpages matching the addresses array.
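
Editorial sketch (not part of the upstream commit): the indexing mistake in miniature. The lengths are made-up example values; numpages_per_iteration() is a hypothetical helper that lists the numpages value passed on each loop iteration.

```python
def numpages_per_iteration(lens, fixed):
    # Before the fix every iteration used lens[1]; after it, lens[i].
    return [lens[i] if fixed else lens[1] for i in range(len(lens))]


lens = [3, 7, 2]                 # hypothetical per-iteration array lengths
print(numpages_per_iteration(lens, fixed=False))
print(numpages_per_iteration(lens, fixed=True))
```

Only the second iteration gets the right length in the old version; every other iteration passes a numpages value that does not match its addresses array.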
    
    Fixes: ecc729f1f471 ("x86/mm/cpa: Add ARRAY and PAGES_ARRAY selftests")
    Signed-off-by: "Mike Rapoport (Microsoft)" <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/mm: Fix flush_tlb_range() when used for zapping normal PMDs [+ + +]

Author: Jann Horn <[email protected]>
Date:   Fri Jan 3 19:39:38 2025 +0100

    x86/mm: Fix flush_tlb_range() when used for zapping normal PMDs
    
    commit 3ef938c3503563bfc2ac15083557f880d29c2e64 upstream.
    
    On the following path, flush_tlb_range() can be used for zapping normal
    PMD entries (PMD entries that point to page tables) together with the PTE
    entries in the pointed-to page table:
    
        collapse_pte_mapped_thp
          pmdp_collapse_flush
            flush_tlb_range
    
    The arm64 version of flush_tlb_range() has a comment describing that it can
    be used for page table removal, and does not use any last-level
    invalidation optimizations. Fix the X86 version by making it behave the
    same way.
    
    Currently, X86 only uses this information for the following two purposes,
    which I think means the issue doesn't have much impact:
    
     - In native_flush_tlb_multi() for checking if lazy TLB CPUs need to be
       IPI'd to avoid issues with speculative page table walks.
     - In Hyper-V TLB paravirtualization, again for lazy TLB stuff.
    
    The patch "x86/mm: only invalidate final translations with INVLPGB" which
    is currently under review (see
    <https://lore.kernel.org/all/[email protected]/>)
    would probably be making the impact of this a lot worse.
    
    Fixes: 016c4d92cd16 ("x86/mm/tlb: Add freed_tables argument to flush_tlb_mm_range")
    Signed-off-by: Jann Horn <[email protected]>
    Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
    Cc: [email protected]
    Link: https://lkml.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>

x86/platform: Only allow CONFIG_EISA for 32-bit [+ + +]

Author: Arnd Bergmann <[email protected]>
Date:   Wed Feb 26 22:37:14 2025 +0100

    x86/platform: Only allow CONFIG_EISA for 32-bit
    
    [ Upstream commit 976ba8da2f3c2f1e997f4f620da83ae65c0e3728 ]
    
    The CONFIG_EISA menu was cleaned up in 2018, but this inadvertently
    brought the option back on 64-bit machines: ISA remains guarded by
    a CONFIG_X86_32 check, but EISA no longer depends on ISA.
    
    The last Intel machines with EISA support used a 82375EB PCI/EISA bridge
    from 1993 that could be paired with the 440FX chipset on early Pentium-II
    CPUs, long before the first x86-64 products.
    
    Fixes: 6630a8e50105 ("eisa: consolidate EISA Kconfig entry in drivers/eisa")
    Signed-off-by: Arnd Bergmann <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/sev: Add missing RIP_REL_REF() invocations during sme_enable() [+ + +]

Author: Kevin Loughlin <[email protected]>
Date:   Fri Nov 22 20:23:22 2024 +0000

    x86/sev: Add missing RIP_REL_REF() invocations during sme_enable()
    
    [ Upstream commit 72dafb567760320f2de7447cd6e979bf9d4e5d17 ]
    
    The following commit:
    
      1c811d403afd ("x86/sev: Fix position dependent variable references in startup code")
    
    introduced RIP_REL_REF() to force RIP-relative accesses to global variables,
    as needed to prevent crashes during early SEV/SME startup code.
    
    For completeness, RIP_REL_REF() should be used with additional variables during
    sme_enable():
    
      https://lore.kernel.org/all/CAMj1kXHnA0fJu6zh634=fbJswp59kSRAbhW+ubDGj1+NYwZJ-Q@mail.gmail.com/
    
    Access these vars with RIP_REL_REF() to prevent problem recurrence.
    
    Fixes: 1c811d403afd ("x86/sev: Fix position dependent variable references in startup code")
    Signed-off-by: Kevin Loughlin <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Reviewed-by: Ard Biesheuvel <[email protected]>
    Reviewed-by: Tom Lendacky <[email protected]>
    Cc: Dave Hansen <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/sgx: Warn explicitly if X86_FEATURE_SGX_LC is not enabled [+ + +]

Author: Vladis Dronov <[email protected]>
Date:   Sun Mar 9 18:22:16 2025 +0100

    x86/sgx: Warn explicitly if X86_FEATURE_SGX_LC is not enabled
    
    [ Upstream commit 65be5c95d08eedda570a6c888a12384c77fe7614 ]
    
    The kernel requires X86_FEATURE_SGX_LC to be able to create SGX enclaves,
    not just X86_FEATURE_SGX.
    
    Quite a lot of hardware has X86_FEATURE_SGX but not
    X86_FEATURE_SGX_LC. A kernel running on such hardware silently fails
    to create the /dev/sgx_enclave file.
    
    Explicitly warn if X86_FEATURE_SGX_LC is not enabled to properly notify
    users that the kernel disabled the SGX driver.
    
    The X86_FEATURE_SGX_LC, a.k.a. SGX Launch Control, is a CPU feature
    that enables LE (Launch Enclave) hash MSRs to be writable (with
    additional opt-in required in the 'feature control' MSR) when running
    enclaves, i.e. using a custom root key rather than the Intel proprietary
    key for enclave signing.
    
    I've hit this issue myself and have spent some time researching where
    my /dev/sgx_enclave file went on SGX-enabled hardware.
    
    Related links:
    
      https://github.com/intel/linux-sgx/issues/837
      https://patchwork.kernel.org/project/platform-driver-x86/patch/[email protected]/
    
    [ mingo: Made the error message a bit more verbose, and added other cases
             where the kernel fails to create the /dev/sgx_enclave device node. ]
    
    Signed-off-by: Vladis Dronov <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Acked-by: Kai Huang <[email protected]>
    Cc: Jarkko Sakkinen <[email protected]>
    Cc: Andy Lutomirski <[email protected]>
    Cc: Sean Christopherson <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Sasha Levin <[email protected]>

x86/tsc: Always save/restore TSC sched_clock() on suspend/resume [+ + +]

Author: Guilherme G. Piccoli <[email protected]>
Date:   Sat Feb 15 17:58:16 2025 -0300

    x86/tsc: Always save/restore TSC sched_clock() on suspend/resume
    
    commit d90c9de9de2f1712df56de6e4f7d6982d358cabe upstream.
    
    TSC could be reset in deep ACPI sleep states, even with invariant TSC.
    
    That's the reason we have sched_clock() save/restore functions, to deal
    with this situation. But what happens is that such functions are guarded
    with a check for the stability of sched_clock - if not considered stable,
    the save/restore routines aren't executed.
    
    On top of that, we have a clear comment in native_sched_clock() saying
    that *even* with TSC unstable, we continue using TSC for sched_clock due
    to its speed.
    
    In other words, if we have a situation of TSC getting detected as unstable,
    it marks the sched_clock as unstable as well, so subsequent S3 sleep cycles
    could bring bogus sched_clock values due to the lack of the save/restore
    mechanism, causing warnings like this:
    
      [22.954918] ------------[ cut here ]------------
      [22.954923] Delta way too big! 18446743750843854390 ts=18446744072977390405 before=322133536015 after=322133536015 write stamp=18446744072977390405
      [22.954923] If you just came from a suspend/resume,
      [22.954923] please switch to the trace global clock:
      [22.954923]   echo global > /sys/kernel/tracing/trace_clock
      [22.954923] or add trace_clock=global to the kernel command line
      [22.954937] WARNING: CPU: 2 PID: 5728 at kernel/trace/ring_buffer.c:2890 rb_add_timestamp+0x193/0x1c0
    
    Notice that the above was reproduced even with "trace_clock=global".
    
    The fix for that is to _always_ save/restore the sched_clock on suspend
    cycle _if TSC is used_ as sched_clock - only if we fall back to jiffies
    the sched_clock_stable() check becomes relevant to save/restore the
    sched_clock.
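
Editorial sketch (not part of the upstream commit): the change in the guard condition expressed as two predicates. The function names and the boolean inputs are hypothetical; they model "is TSC used as sched_clock" and the result of sched_clock_stable(), and the second predicate is an assumption about the shape of the fix based on the description above.

```python
def skip_save_restore_old(use_tsc, clock_stable):
    # Old guard: an unstable sched_clock always skipped save/restore.
    return not clock_stable


def skip_save_restore_new(use_tsc, clock_stable):
    # New guard: skip only when falling back to jiffies AND unstable.
    return (not use_tsc) and (not clock_stable)


for use_tsc in (True, False):
    for clock_stable in (True, False):
        print(use_tsc, clock_stable,
              skip_save_restore_old(use_tsc, clock_stable),
              skip_save_restore_new(use_tsc, clock_stable))
```

The only row that changes is TSC in use with an unstable clock: the old guard skipped save/restore there, the new one does not.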
    
    Debugged-by: Thadeu Lima de Souza Cascardo <[email protected]>
    Signed-off-by: Guilherme G. Piccoli <[email protected]>
    Signed-off-by: Ingo Molnar <[email protected]>
    Cc: [email protected]
    Cc: Thomas Gleixner <[email protected]>
    Cc: Peter Zijlstra <[email protected]>
    Cc: Linus Torvalds <[email protected]>
    Link: https://lore.kernel.org/r/[email protected]
    Signed-off-by: Greg Kroah-Hartman <[email protected]>