Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Linux 6.4.0 #263

Closed
wants to merge 10,000 commits into from
Closed

Linux 6.4.0 #263

wants to merge 10,000 commits into from

Conversation

mmstick
Copy link
Member

@mmstick mmstick commented Jun 29, 2023

Replaces #262

kuba-moo and others added 30 commits June 15, 2023 15:09
Kuniyuki Iwashima says:

====================
udplite/dccp: Print deprecation notice.

UDP-Lite is assumed to have no users for 7 years, and DCCP is
orphaned for 7 years too.

Let's add deprecation notice and see if anyone responds to it.
====================

Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Jakub Kicinski <[email protected]>
…cifs-2.6

Pull smb client fixes from Steve French:
 "Eight, mostly small, smb3 client fixes:

   - important fix for deferred close oops (race with unmount) found
     with xfstest generic/098 to some servers

   - important reconnect fix

   - fix problem with max_credits mount option

   - two multichannel (interface related) fixes

   - one trivial removal of confusing comment

   - two small debugging improvements (to better spot crediting
     problems)"

* tag '6.4-rc6-smb3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6:
  cifs: add a warning when the in-flight count goes negative
  cifs: fix lease break oops in xfstest generic/098
  cifs: fix max_credits implementation
  cifs: fix sockaddr comparison in iface_cmp
  smb/client: print "Unknown" instead of bogus link speed value
  cifs: print all credit counters in DebugData
  cifs: fix status checks in cifs_tree_connect
  smb: remove obsolete comment
…ux/kernel/git/tytso/ext4

Pull ext4 fixes from Ted Ts'o:
 "Fix two regressions in ext4, one report by syzkaller[1], and reported
  by multiple users (and tracked by regzbot[2])"

[1] https://syzkaller.appspot.com/bug?extid=4acc7d910e617b360859
[2] https://linux-regtracking.leemhuis.info/regzbot/regression/ZIauBR7YiV3rVAHL@glitch/

* tag 'ext4_for_linus_stable' of git://git.kernel.org/pub/scm/linux/kernel/git/tytso/ext4:
  ext4: drop the call to ext4_error() from ext4_get_group_info()
  Revert "ext4: remove unnecessary check in ext4_bg_num_gdb_nometa"
…p.org/agd5f/linux into drm-fixes

amd-drm-fixes-6.4-2023-06-14:

amdgpu:
- GFX9 preemption fixes
- Add missing radeon secondary PCI ID
- vblflash fixes
- SMU 13 fix
- VCN 4.0 fix
- Re-enable TOPDOWN flag for large BAR systems to fix regression
- eDP fix
- PSR hang fix
- DPIA fix

radeon:
- fbdev client warning fix

Signed-off-by: Dave Airlie <[email protected]>
From: Alex Deucher <[email protected]>
Link: https://patchwork.freedesktop.org/patch/msgid/[email protected]
This seems to have existed for ever but is now more apparant after
commit 9bff18d ("drm/ttm: use per BO cleanup workers")

My analysis: two threads are running, one in the irq signalling the
fence, in dma_fence_signal_timestamp_locked, it has done the
DMA_FENCE_FLAG_SIGNALLED_BIT setting, but hasn't yet reached the
callbacks.

The second thread in nouveau_cli_work_ready, where it sees the fence is
signalled, so then puts the fence, cleanups the object and frees the
work item, which contains the callback.

Thread one goes again and tries to call the callback and causes the
use-after-free.

Proposed fix: lock the fence signalled check in nouveau_cli_work_ready,
so either the callbacks are done or the memory is freed.

Reviewed-by: Karol Herbst <[email protected]>
Fixes: 11e451e ("drm/nouveau: remove fence wait code from deferred client work handler")
Cc: [email protected]
Signed-off-by: Dave Airlie <[email protected]>
Link: https://lore.kernel.org/dri-devel/[email protected]/
…/git/mchehab/linux-media

Pull media fixes from Mauro Carvalho Chehab:
 "A fix for dvb-core to avoid a race condition during DVB board
  registration"

* tag 'media/v6.4-6' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media:
  Revert "media: dvb-core: Fix use-after-free on race condition at dvb_frontend"
…/kernel/git/broonie/regmap

Pull regmap fix from Mark Brown:
 "Another fix for the maple tree cache, Takashi noticed that unlike
  other caches the maple tree cache didn't check for read only registers
  before trying to sync which would result in spurious syncs for read
  only registers where we don't have a default.

  This was due to the check being open coded in the caches, we now check
  in the shared 'does this register need sync' function so that is fixed
  for this and future caches"

* tag 'regmap-fix-v6.4-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regmap:
  regmap: regcache: Don't sync read-only registers
…nux/kernel/git/broonie/regulator

Pull regulator fix from Mark Brown:
 "The set of regulators described for the Qualcomm PM8550 just seems to
  have been completely wrong and would likely not have worked at all if
  anything tried to actually configure anything except for enabling and
  disabling at runtime"

* tag 'regulator-fix-v6.4-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator:
  regulator: qcom-rpmh: Fix regulators for PM8550
…rnel/git/broonie/spi

Pull spi fixes from Mark Brown:
 "A few more driver specific fixes.

  The DesignWare fix is for an issue introduced by conversion to the
  chip select accessor functions and is pretty important but the other
  two are less severe"

* tag 'spi-fix-v6.4-rc6' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi:
  spi: dw: Replace incorrect spi_get_chipselect with set
  spi: fsl-dspi: avoid SCK glitches with continuous transfers
  spi: cadence-quadspi: Add missing check for dma_set_mask
…t/rdma/rdma

Pull rdma fixes from Jason Gunthorpe:
 "This is an unusually large bunch of bug fixes for the later rc cycle,
  rxe and mlx5 both dumped a lot of things at once. rxe continues to fix
  itself, and mlx5 is fixing a bunch of "queue counters" related bugs.

  There is one highly notable bug fix regarding the qkey. This small
  security check was missed in the original 2005 implementation and it
  allows some significant issues.

  Summary:

   - Two rtrs bug fixes for error unwind bugs

   - Several rxe bug fixes:
      * Incorrect Rx packet validation
      * Using memory without a refcount
      * Syzkaller found use before initialization
      * Regression fix for missing locking with the tasklet conversion
        from this merge window

   - Have bnxt report the correct link properties to userspace, this was
     a regression in v6.3

   - Several mlx5 bug fixes:
      * Kernel crash triggerable by userspace for the RAW ethernet
        profile
      * Defend against steering refcounting issues created by userspace
      * Incorrect change of QP port affinity parameters in some LAG
        configurations

   - Fix mlx5 Q counters:
      * Do not over allocate Q counters to allow userspace to use the
        full port capacity
      * Kernel crash triggered by eswitch due to mis-use of Q counters
      * Incorrect mlx5_device for Q counters in some LAG configurations

   - Properly implement the IBA spec restricting privileged qkeys to
     root

   - Always an error when reading from a disassociated device's event
     queue

   - isert bug fixes:
      * Avoid a deadlock with the CM handler and CM ID destruction
      * Correct list corruption due to incorrect locking
      * Fix a use after free around connection tear down"

* tag 'for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/rdma/rdma:
  RDMA/rxe: Fix rxe_cq_post
  IB/isert: Fix incorrect release of isert connection
  IB/isert: Fix possible list corruption in CMA handler
  IB/isert: Fix dead lock in ib_isert
  RDMA/mlx5: Fix affinity assignment
  IB/uverbs: Fix to consider event queue closing also upon non-blocking mode
  RDMA/uverbs: Restrict usage of privileged QKEYs
  RDMA/cma: Always set static rate to 0 for RoCE
  RDMA/mlx5: Fix Q-counters query in LAG mode
  RDMA/mlx5: Remove vport Q-counters dependency on normal Q-counters
  RDMA/mlx5: Fix Q-counters per vport allocation
  RDMA/mlx5: Create an indirect flow table for steering anchor
  RDMA/mlx5: Initiate dropless RQ for RAW Ethernet functions
  RDMA/rxe: Fix the use-before-initialization error of resp_pkts
  RDMA/bnxt_re: Fix reporting active_{speed,width} attributes
  RDMA/rxe: Fix ref count error in check_rkey()
  RDMA/rxe: Fix packet length checks
  RDMA/rtrs: Fix rxe_dealloc_pd warning
  RDMA/rtrs: Fix the last iu->buf leak in err path
…rnel/git/device-mapper/linux-dm

Pull device mapper fixes from Mike Snitzer:

 - Fix DM thinp discard performance regression introduced during this
   merge window where DM core was splitting large discards every 128K
   (max_sectors_kb) rather than every 64M (discard_max_bytes).

 - Extend DM core LOCKFS fix, made during 6.4 merge, to also fix race
   between do_mount and dm's do_suspend (in addition to the earlier
   fix's do_mount race with dm's do_resume).

 - Fix DM thin metadata operations to first check if the thin-pool is in
   "fail_io" mode; otherwise UAF can occur.

 - Fix DM thinp's call to __blkdev_issue_discard to use GFP_NOIO rather
   than GFP_NOWAIT (__blkdev_issue_discard cannot handle NULL return
   from bio_alloc).

* tag 'for-6.4/dm-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/device-mapper/linux-dm:
  dm: use op specific max_sectors when splitting abnormal io
  dm thin: fix issue_discard to pass GFP_NOIO to __blkdev_issue_discard
  dm thin metadata: check fail_io before using data_sm
  dm: don't lock fs when the map is NULL during suspend or resume
…ux/kernel/git/chenhuacai/linux-loongson

Pull LoongArch fixes from Huacai Chen:
 "Some trivial bug fixes for v6.4-rc7"

* tag 'loongarch-fixes-6.4-1' of git://git.kernel.org/pub/scm/linux/kernel/git/chenhuacai/linux-loongson:
  LoongArch: Fix debugfs_create_dir() error checking
  LoongArch: Avoid uninitialized alignment_mask
  LoongArch: Fix perf event id calculation
  LoongArch: Fix the write_fcsr() macro
  LoongArch: Let pmd_present() return true when splitting pmd
…git/netdev/net

Pull networking fixes from Jakub Kicinski:
 "Including fixes from wireless, and netfilter.

  Selftests excluded - we have 58 patches and diff of +442/-199, which
  isn't really small but perhaps with the exception of the WiFi locking
  change it's old(ish) bugs.

  We have no known problems with v6.4.

  The selftest changes are rather large as MPTCP folks try to apply
  Greg's guidance that selftest from torvalds/linux should be able to
  run against stable kernels.

  Last thing I should call out is the DCCP/UDP-lite deprecation notices.
  We are fairly sure those are dead, but if we're wrong reverting them
  back in won't be fun.

  Current release - regressions:

   - wifi:
      - cfg80211: fix double lock bug in reg_wdev_chan_valid()
      - iwlwifi: mvm: spin_lock_bh() to fix lockdep regression

  Current release - new code bugs:

   - handshake: remove fput() that causes use-after-free

  Previous releases - regressions:

   - sched: cls_u32: fix reference counter leak leading to overflow

   - sched: cls_api: fix lockup on flushing explicitly created chain

  Previous releases - always broken:

   - nf_tables: integrate pipapo into commit protocol

   - nf_tables: incorrect error path handling with NFT_MSG_NEWRULE, fix
     dangling pointer on failure

   - ping6: fix send to link-local addresses with VRF

   - sched: act_pedit: parse L3 header for L4 offset, the skb may not
     have the offset saved

   - sched: act_ct: fix promotion of offloaded unreplied tuple

   - sched: refuse to destroy an ingress and clsact Qdiscs if there are
     lockless change operations in flight

   - wifi: mac80211: fix handful of bugs in multi-link operation

   - ipvlan: fix bound dev checking for IPv6 l3s mode

   - eth: enetc: correct the indexes of highest and 2nd highest TCs

   - eth: ice: fix XDP memory leak when NIC is brought up and down

  Misc:

   - add deprecation notices for UDP-lite and DCCP

   - selftests: mptcp: skip tests not supported by old kernels

   - sctp: handle invalid error codes without calling BUG()"

* tag 'net-6.4-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net: (91 commits)
  dccp: Print deprecation notice.
  udplite: Print deprecation notice.
  octeon_ep: Add missing check for ioremap
  selftests/ptp: Fix timestamp printf format for PTP_SYS_OFFSET
  net: ethernet: stmicro: stmmac: fix possible memory leak in __stmmac_open
  net: tipc: resize nlattr array to correct size
  sfc: fix XDP queues mode with legacy IRQ
  net: macsec: fix double free of percpu stats
  net: lapbether: only support ethernet devices
  MAINTAINERS: add reviewers for SMC Sockets
  s390/ism: Fix trying to free already-freed IRQ by repeated ism_dev_exit()
  net: dsa: felix: fix taprio guard band overflow at 10Mbps with jumbo frames
  net/sched: cls_api: Fix lockup on flushing explicitly created chain
  ice: Fix ice module unload
  net/handshake: remove fput() that causes use-after-free
  selftests: forwarding: hw_stats_l3: Set addrgenmode in a separate step
  net/sched: qdisc_destroy() old ingress and clsact Qdiscs before grafting
  net/sched: Refactor qdisc_graft() for ingress and clsact Qdiscs
  net/sched: act_ct: Fix promotion of offloaded unreplied tuple
  wifi: iwlwifi: mvm: spin_lock_bh() to fix lockdep regression
  ...
Before executing each test from a fixture, FIXTURE_SETUP is run once.
When SKIP is used in FIXTURE_SETUP, the setup function returns early
but the test still proceeds to run, unless another SKIP macro is used
within the test definition, leading to some code repetition. Therefore,
allow tests to be skipped directly from the setup function.

Suggested-by: Jakub Kicinski <[email protected]>
Signed-off-by: Magali Lemes <[email protected]>
Signed-off-by: Jakub Kicinski <[email protected]>
TLS selftests use the ChaCha20-Poly1305 and SM4 algorithms, which are not
FIPS compliant. When fips=1, this set of tests fails. Add a check and only
run these tests if not in FIPS mode.

Fixes: 4f336e8 ("selftests/tls: add CHACHA20-POLY1305 to tls selftests")
Fixes: e506342 ("selftests/tls: add SM4 GCM/CCM to tls selftests")
Reviewed-by: Jakub Kicinski <[email protected]>
Signed-off-by: Magali Lemes <[email protected]>
Signed-off-by: Jakub Kicinski <[email protected]>
…algos

The vrf-xfrm-tests tests use the hmac(md5) and cbc(des3_ede)
algorithms for performing authentication and encryption, respectively.
This causes the tests to fail when fips=1 is set, since these algorithms
are not allowed in FIPS mode. Therefore, switch from hmac(md5) and
cbc(des3_ede) to hmac(sha1) and cbc(aes), which are FIPS compliant.

Fixes: 3f251d7 ("selftests: Add tests for vrf and xfrms")
Reviewed-by: David Ahern <[email protected]>
Signed-off-by: Magali Lemes <[email protected]>
Signed-off-by: Jakub Kicinski <[email protected]>
There are some MD5 tests which fail when the kernel is in FIPS mode,
since MD5 is not FIPS compliant. Add a check and only run those tests
if FIPS mode is not enabled.

Fixes: f0bee1e ("fcnal-test: Add TCP MD5 tests")
Fixes: 5cad8bc ("fcnal-test: Add TCP MD5 tests for VRF")
Reviewed-by: David Ahern <[email protected]>
Signed-off-by: Magali Lemes <[email protected]>
Signed-off-by: Jakub Kicinski <[email protected]>
Magali Lemes says:

====================
Check if FIPS mode is enabled when running selftests

Some test cases from net/tls, net/fcnal-test and net/vrf-xfrm-tests
that rely on cryptographic functions to work and use non-compliant FIPS
algorithms fail in FIPS mode.

In order to allow these tests to pass in a wider set of kernels,
 - for net/tls, skip the test variants that use the ChaCha20-Poly1305
and SM4 algorithms, when FIPS mode is enabled;
 - for net/fcnal-test, skip the MD5 tests, when FIPS mode is enabled;
 - for net/vrf-xfrm-tests, replace the algorithms that are not
FIPS-compliant with compliant ones.

v1: https://lore.kernel.org/netdev/[email protected]/
v2: https://lore.kernel.org/netdev/[email protected]/
v3: https://lore.kernel.org/netdev/[email protected]/
====================

Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Jakub Kicinski <[email protected]>
Although snd_seq_oss_midi_open() and snd_seq_oss_midi_close() can be
called concurrently from different code paths, we have no proper data
protection against races.  Introduce open_mutex to each seq_oss_midi
object for avoiding the races.

Reported-by: "Gong, Sishuai" <[email protected]>
Closes: https://lore.kernel.org/r/[email protected]
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Takashi Iwai <[email protected]>
As reported in the bugzilla below, the PM resume of a UAC3 device may
fail due to the incomplete power state change, stuck at D1.  The
reason is that the driver expects the full D0 power state change only
at hw_params, while the normal PCM resume procedure doesn't call
hw_params.

For fixing the bug, we add the same power state update to D0 at the
prepare callback, which is certainly called by the resume procedure.

Note that, with this change, the power state change in the hw_params
becomes almost redundant, since snd_usb_hw_params() doesn't touch the
parameters (at least it tires so).  But dropping it is still a bit
risky (e.g. we have the media-driver binding), so I leave the D0 power
state change in snd_usb_hw_params() as is for now.

Fixes: a0a4959 ("ALSA: usb-audio: Operate UAC3 Power Domains in PCM callbacks")
Cc: <[email protected]>
Link: https://bugzilla.kernel.org/show_bug.cgi?id=217539
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Takashi Iwai <[email protected]>
…playback

This commit adds new DEVICE_FLG with QUIRK_FLAG_DSD_RAW and Vendor Id for
HEM devices which supports native DSD. Prior to this change Linux kernel
was not enabling native DSD playback for HEM devices, and as a result,
DSD audio was being converted to PCM "on the fly". HEM devices,
when connected to the system, would only play audio in PCM format,
even if the source material was in DSD format. With the addition of new
VENDOR_FLG in the quircks.c file, the devices are now correctly
recognized, and raw DSD data is transmitted to the device,
allowing for native DSD playback.

Signed-off-by: Lukasz Tyl <[email protected]>
Cc: <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Takashi Iwai <[email protected]>
…ux/kernel/git/broonie/sound into for-linus

ASoC: Fixes for v6.4

A couple more fixes for v6.4, one fixing a misleading error log and
another stopping us seeing spurious failures setting the master volume
on some Tegra systems introduced by a change to how we calculate delay
times.
With offloading enabled, esp_xmit() gets invoked very late, from within
validate_xmit_xfrm() which is after validate_xmit_skb() validates and
linearizes the skb if the underlying device does not support fragments.

esp_output_tail() may add a fragment to the skb while adding the auth
tag/ IV. Devices without the proper support will then send skb->data
points to with the correct length so the packet will have garbage at the
end. A pcap sniffer will claim that the proper data has been sent since
it parses the skb properly.

It is not affected with INET_ESP_OFFLOAD disabled.

Linearize the skb after offloading if the sending hardware requires it.
It was tested on v4, v6 has been adopted.

Fixes: 7785bba ("esp: Add a software GRO codepath")
Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
Signed-off-by: Steffen Klassert <[email protected]>
Move allocation code down to avoid memory leak.

Fixes: 29f5474 ("iommu/amd: Add missing domain type checks")
Signed-off-by: Su Hui <[email protected]>
Reviewed-by: Jason Gunthorpe <[email protected]>
Reviewed-by: Jerry Snitselaar <[email protected]>
Reviewed-by: Vasant Hegde <[email protected]>
Link: https://lore.kernel.org/r/[email protected]
Signed-off-by: Joerg Roedel <[email protected]>
…totype

Reiji reports that the arm64 implementation of arch_perf_update_userpage()
is now ignored and replaced by the dummy stub in core code.
This seems to happen since the PMUv3 driver was moved to driver/perf.

As it turns out, dropping the __weak attribute from the *prototype*
of the function solves the problem. You're right, this doesn't seem
to make much sense. And yet... It appears that both symbols get
flagged as weak, and that the first one to appear in the link order
wins:

$ nm drivers/perf/arm_pmuv3.o|grep arch_perf_update_userpage
0000000000001db0 W arch_perf_update_userpage

Dropping the attribute from the prototype restores the expected
behaviour, and arm64 is able to enjoy arch_perf_update_userpage()
again.

Fixes: 7755cec ("arm64: perf: Move PMUv3 driver to drivers/perf")
Fixes: f1ec3a5 ("kernel/events: Add a missing prototype for arch_perf_update_userpage()")
Reported-by: Reiji Watanabe <[email protected]>
Signed-off-by: Marc Zyngier <[email protected]>
Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
Acked-by: Mark Rutland <[email protected]>
Tested-by: Reiji Watanabe <[email protected]>
Link: https://lkml.kernel.org/r/[email protected]
When counting a FRONTEND event, the MSR_PEBS_FRONTEND is not correctly
set on GNR and MTL p-core.

The umask value for the FRONTEND events is changed on GNR and MTL. The
new umask is missing in the extra_regs[] table.

Add a dedicated intel_gnr_extra_regs[] for GNR and MTL p-core.

Fixes: bc4000f ("perf/x86/intel: Add Granite Rapids")
Signed-off-by: Kan Liang <[email protected]>
Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
Cc: [email protected]
Link: https://lkml.kernel.org/r/[email protected]
Commits ffb1b4a ("x86/unwind/orc: Add 'signal' field to ORC
metadata") and fb79944 ("x86,objtool: Split UNWIND_HINT_EMPTY in
two") changed the ORC format. Although ORC is internal to the kernel,
it's the only way for external tools to get reliable kernel stack traces
on x86-64. In particular, the drgn debugger [1] uses ORC for stack
unwinding, and these format changes broke it [2]. As the drgn
maintainer, I don't care how often or how much the kernel changes the
ORC format as long as I have a way to detect the change.

It suffices to store a version identifier in the vmlinux and kernel
module ELF files (to use when parsing ORC sections from ELF), and in
kernel memory (to use when parsing ORC from a core dump+symbol table).
Rather than hard-coding a version number that needs to be manually
bumped, Peterz suggested hashing the definitions from orc_types.h. If
there is a format change that isn't caught by this, the hashing script
can be updated.

This patch adds an .orc_header allocated ELF section containing the
20-byte hash to vmlinux and kernel modules, along with the corresponding
__start_orc_header and __stop_orc_header symbols in vmlinux.

1: https://github.com/osandov/drgn
2: osandov/drgn#303

Fixes: ffb1b4a ("x86/unwind/orc: Add 'signal' field to ORC metadata")
Fixes: fb79944 ("x86,objtool: Split UNWIND_HINT_EMPTY in two")
Signed-off-by: Omar Sandoval <[email protected]>
Signed-off-by: Peter Zijlstra (Intel) <[email protected]>
Acked-by: Josh Poimboeuf <[email protected]>
Link: https://lkml.kernel.org/r/aef9c8dc43915b886a8c48509a12ec1b006ca1ca.1686690801.git.osandov@osandov.com
…inux/kernel/git/riscv/linux

Pull RISC-V fix from Palmer Dabbelt:

 - A documentation patch describing how we use patchwork

* tag 'riscv-for-linus-6.4-rc7' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux:
  Documentation: RISC-V: patch-acceptance: mention patchwork's role
…nux/kernel/git/paulmck/linux-rcu

Pull RCU fix from Paul McKenney:
 "This fixes a spinlock-initialization regression in SRCU that causes
  the SRCU notifier to fail.

  The fix simply adds the initialization, but introduces a #ifdef
  because there is no spinlock to initialize for the Tiny SRCU used in
  !SMP builds.

  Yes, it would be nice to abstract this somehow in order to hide it in
  SRCU, but I still don't see a good way of doing this"

* tag 'urgent-rcu.2023.06.11a' of git://git.kernel.org/pub/scm/linux/kernel/git/paulmck/linux-rcu:
  notifier: Initialize new struct srcu_usage field
The tick period is aligned very early while the first clock_event_device is
registered. At that point the system runs in periodic mode and switches
later to one-shot mode if possible.

The next wake-up event is programmed based on the aligned value
(tick_next_period) but the delta value, that is used to program the
clock_event_device, is computed based on ktime_get().

With the subtracted offset, the device fires earlier than the exact time
frame. With a large enough offset the system programs the timer for the
next wake-up and the remaining time left is too small to make any boot
progress. The system hangs.

Move the alignment later to the setup of tick_sched timer. At this point
the system switches to oneshot mode and a high resolution clocksource is
available. At this point it is safe to align tick_next_period because
ktime_get() will now return accurate (not jiffies based) time.

[bigeasy: Patch description + testing].

Fixes: e9523a0 ("tick/common: Align tick period with the HZ tick.")
Reported-by: Mathias Krause <[email protected]>
Reported-by: "Bhatnagar, Rishabh" <[email protected]>
Suggested-by: Mathias Krause <[email protected]>
Signed-off-by: Thomas Gleixner <[email protected]>
Signed-off-by: Sebastian Andrzej Siewior <[email protected]>
Signed-off-by: Thomas Gleixner <[email protected]>
Tested-by: Richard W.M. Jones <[email protected]>
Tested-by: Mathias Krause <[email protected]>
Acked-by: SeongJae Park <[email protected]>
Cc: [email protected]
Link: https://lore.kernel.org/[email protected]
Link: https://lore.kernel.org/[email protected]
Link: https://lore.kernel.org/r/[email protected]
This was referenced Jun 30, 2023
@mmstick
Copy link
Member Author

mmstick commented Jun 30, 2023

Haven't yet found a solution for this compiler error:

In file included from ./arch/x86/include/asm/linkage.h:6,
                 from ./include/linux/linkage.h:8,
                 from ./include/linux/printk.h:8,
                 from ./include/asm-generic/bug.h:22,
                 from ./arch/x86/include/asm/bug.h:87,
                 from ./include/linux/bug.h:5,
                 from ./include/linux/page-flags.h:10,
                 from kernel/bounds.c:10:
./arch/x86/include/asm/ibt.h:77:1: error: ‘nocf_check’ attribute ignored. Use ‘-fcf-protection’ option to enable it [-Werror=attributes]
   77 | extern __noendbr u64 ibt_save(bool disable);
      | ^~~~~~
./arch/x86/include/asm/ibt.h:78:1: error: ‘nocf_check’ attribute ignored. Use ‘-fcf-protection’ option to enable it [-Werror=attributes]
   78 | extern __noendbr void ibt_restore(u64 save);
      | ^~~~~~
cc1: all warnings being treated as errors
make[2]: *** [scripts/Makefile.build:114: kernel/bounds.s] Error 1

I've been looking through the diff between our 6.3.7 packaging and the Ubuntu mainline kernel packaging for 6.4.0 in Meld, with little luck so far. Little has changed from a packaging perspective between these releases, outside of a retpoline cflags patch and some new kernel options in the annotations script.

@jglathe
Copy link

jglathe commented Jun 30, 2023

I've had this since all of 6.3. I found a way to hack it out of existence (undefine it). Seems to work fine here. And yes, I had to apply it on this branch, too.

From 2edfc99b478339d5b1593d3b5924d0cb950861ee Mon Sep 17 00:00:00 2001
From: Jens Glathe <[email protected]>
Date: Sat, 18 Mar 2023 08:11:37 +0000
Subject: [PATCH] hotfix remove nocf_check attribute for ibt

Signed-off-by: Jens Glathe <[email protected]>
---
 arch/x86/include/asm/ibt.h | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/arch/x86/include/asm/ibt.h b/arch/x86/include/asm/ibt.h
index baae6b4fea23..f242c1fedd68 100644
--- a/arch/x86/include/asm/ibt.h
+++ b/arch/x86/include/asm/ibt.h
@@ -29,7 +29,7 @@
 #define ASM_ENDBR	"endbr32\n\t"
 #endif
 
-#define __noendbr	__attribute__((nocf_check))
+#define __noendbr
 
 /*
  * Create a dummy function pointer reference to prevent objtool from marking
-- 
2.37.2


0001-hotfix-remove-nocf_check-attribute-for-ibt.patch

@mmstick mmstick requested review from a team July 3, 2023 12:34
@XV-02
Copy link

XV-02 commented Jul 3, 2023

Looks like Virtualbox and our ZFS packaging will need updating to work with 6.4.0.
Nvidia (both 525 and 535), System76, and HP related DKMS seems fine.

@mmstick
Copy link
Member Author

mmstick commented Jul 5, 2023

Virtualbox and ZFS packages may be fixed by these PRs

Which can be tested with the zfs and virtualbox staging branches.

@XV-02
Copy link

XV-02 commented Jul 6, 2023

Given that 6.4.1 contains fixes for a now known security vulnerability, and that this PR is undergoing testing still, should we push to 6.4.1?

@mmstick
Copy link
Member Author

mmstick commented Jul 6, 2023

Yes, we should

@mmstick
Copy link
Member Author

mmstick commented Jul 6, 2023

Seems even our 6.2.6 kernel is affected.

@XV-02
Copy link

XV-02 commented Jul 6, 2023

There isn't, as far as I can tell, a 6.2 kernel we can move to in the interim. With 6.3's instability and its nouveau driver regressions (which is a problem with our large Nvidia user base) I think our best move is going to be trying to get 6.4.x to play nice.

@leviport
Copy link
Member

leviport commented Jul 6, 2023

I'm seeing the same freezing bug on my oryp9 that I saw on the 6.3.x kernels we've tested: #251 (comment)

I've narrowed it down to either my USB-C docking monitor that I use at the office (at home I use only the built-in display), or my bluetooth headphones that I use at the office.

I've been able to recover from this state by disconnecting my bluetooth headphones and/or switching to TTY3 and back to TTY2 (not sure which solved it, but I did both and it came back). This time I had dmesg -w running. Before it froze, I was seeing some of this:

[  479.610045] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=66554 end=66555) time 214 us, min 1415, max 1439, scanline start 1410, end 1461
[  650.738328] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=94764 end=94765) time 209 us, min 1415, max 1439, scanline start 1403, end 1453
[  838.949095] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=125790 end=125791) time 188 us, min 1415, max 1439, scanline start 1400, end 1445
[ 1000.583743] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=152435 end=152436) time 149 us, min 1415, max 1439, scanline start 1410, end 1445
[ 1694.881533] input: ATH-M50xBT2 (AVRCP) as /devices/virtual/input/input43
[ 1880.848113] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=297544 end=297545) time 210 us, min 1415, max 1439, scanline start 1401, end 1452
[ 1902.819988] i915 0000:00:02.0: [drm] *ERROR* Atomic update failure on pipe B (start=301166 end=301167) time 212 us, min 1415, max 1439, scanline start 1400, end 1450

(you can see where I connected my bluetooth headset there as well)

and after the freeze, I saw all of this:

[ 2100.735563] i915 0000:00:02.0: [drm] *ERROR* [CRTC:131:pipe B] flip_done timed out
[ 2110.975701] i915 0000:00:02.0: [drm] *ERROR* flip_done timed out
[ 2110.975728] i915 0000:00:02.0: [drm] *ERROR* [CRTC:131:pipe B] commit wait timed out
[ 2121.215683] i915 0000:00:02.0: [drm] *ERROR* flip_done timed out
[ 2121.215705] i915 0000:00:02.0: [drm] *ERROR* [PLANE:82:plane 1B] commit wait timed out
[ 2131.455804] i915 0000:00:02.0: [drm] *ERROR* [CRTC:131:pipe B] flip_done timed out
[ 2141.695813] i915 0000:00:02.0: [drm] *ERROR* flip_done timed out
[ 2141.695825] i915 0000:00:02.0: [drm] *ERROR* [CRTC:131:pipe B] commit wait timed out
[ 2141.777020] rfkill: input handler enabled
[ 2151.936045] i915 0000:00:02.0: [drm] *ERROR* flip_done timed out
[ 2151.936060] i915 0000:00:02.0: [drm] *ERROR* [CONNECTOR:245:DP-1] commit wait timed out
[ 2152.039982] ------------[ cut here ]------------
[ 2152.039989] i915 0000:00:02.0: vblank wait timed out on crtc 1
[ 2152.040161] WARNING: CPU: 4 PID: 4203 at drivers/gpu/drm/drm_vblank.c:1310 drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.040379] Modules linked in: tls nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter ccm rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack snd_seq_dummy snd_hrtimer nf_defrag_ipv6 nf_defrag_ipv4 cmac algif_hash algif_skcipher af_alg nf_tables nfnetlink zstd nvidia_uvm(POE) bridge stp llc overlay bnep zram snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda_mlink snd_sof_intel_hda snd_sof_pci intel_rapl_msr snd_sof_xtensa_dsp intel_rapl_common snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi intel_uncore_frequency soundwire_bus intel_uncore_frequency_common intel_tcc_cooling iwlmvm snd_soc_core snd_hda_codec_realtek snd_compress snd_hda_codec_generic nvidia_drm(POE) x86_pkg_temp_thermal intel_powerclamp nvidia_modeset(POE) ledtrig_audio snd_hda_codec_hdmi ac97_bus snd_pcm_dmaengine mac80211 snd_hda_intel
[ 2152.040520]  snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec libarc4 uvcvideo snd_hda_core snd_hwdep videobuf2_vmalloc uvc videobuf2_memops videobuf2_v4l2 snd_pcm btusb coretemp btrtl videodev btbcm snd_seq_midi snd_seq_midi_event btintel videobuf2_common snd_rawmidi btmtk nls_iso8859_1 iwlwifi kvm_intel nvidia(POE) joydev mc snd_seq cmdlinepart bluetooth kvm pmt_telemetry ecdh_generic ecc pmt_class input_leds snd_seq_device spi_nor rapl cfg80211 snd_timer hid_multitouch ee1004 mtd serio_raw snd bfq intel_cstate intel_vsec soundcore igen6_edac intel_pmc_mux typec intel_scu_pltdrv intel_hid mac_hid sparse_keymap sch_fq_codel kyber_iosched msr parport_pc ppdev lp parport ramoops pstore_blk reed_solomon pstore_zone efi_pstore ip_tables x_tables autofs4 dm_crypt hid_logitech_hidpp hid_logitech_dj usbhid raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear system76_io(OE) i915 drm_buddy i2c_algo_bit ttm drm_display_helper cec rc_core
[ 2152.040691]  crct10dif_pclmul drm_kms_helper crc32_pclmul polyval_clmulni polyval_generic syscopyarea ghash_clmulni_intel sysfillrect sha512_ssse3 aesni_intel hid_generic sysimgblt nvme crypto_simd cryptd thunderbolt drm sdhci_pci nvme_core r8169 spi_intel_pci i2c_i801 cqhci spi_intel i2c_smbus intel_lpss_pci realtek xhci_pci sdhci intel_lpss i2c_hid_acpi xhci_pci_renesas idma64 nvme_common i2c_hid hid video wmi pinctrl_tigerlake system76_acpi(OE)
[ 2152.040766] CPU: 4 PID: 4203 Comm: Xorg Tainted: P        W  OE      6.4.0-76060400-generic #202306271339~1688382054~22.04~b23d286
[ 2152.040778] Hardware name: System76 Oryx Pro/Oryx Pro, BIOS 2023-06-08_36c78ea 06/08/2023
[ 2152.040782] RIP: 0010:drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.040955] Code: fe ff ff 48 8b 7b 08 4c 8b 6f 50 4d 85 ed 74 26 e8 56 1b 15 fb 44 89 e1 4c 89 ea 48 c7 c7 40 08 5e c0 48 89 c6 e8 e1 0b 77 fa <0f> 0b e9 7b fe ff ff 48 8b 1f eb 94 4c 8b 2f eb d5 e8 6b 2f 76 fb
[ 2152.040964] RSP: 0018:ffffab7dc5ffb5b0 EFLAGS: 00010246
[ 2152.040972] RAX: 0000000000000000 RBX: ffff91dbe6568000 RCX: 0000000000000000
[ 2152.040978] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 2152.040981] RBP: ffffab7dc5ffb608 R08: 0000000000000000 R09: 0000000000000000
[ 2152.040985] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
[ 2152.040988] R13: ffff91dbc2e33320 R14: 000000000004d106 R15: ffff91dbe6553960
[ 2152.040993] FS:  00007f4e55c2ba80(0000) GS:ffff91eb3d300000(0000) knlGS:0000000000000000
[ 2152.041000] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2152.041005] CR2: 00000562786a1000 CR3: 0000000182a4e000 CR4: 0000000000750ee0
[ 2152.041012] PKRU: 55555554
[ 2152.041015] Call Trace:
[ 2152.041020]  <TASK>
[ 2152.041028]  ? show_regs+0x6d/0x80
[ 2152.041046]  ? __warn+0x89/0x160
[ 2152.041060]  ? drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.041225]  ? report_bug+0x17e/0x1b0
[ 2152.041239]  ? handle_bug+0x46/0x90
[ 2152.041257]  ? exc_invalid_op+0x18/0x80
[ 2152.041271]  ? asm_exc_invalid_op+0x1b/0x20
[ 2152.041290]  ? drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.041448]  ? __pfx_autoremove_wake_function+0x10/0x10
[ 2152.041470]  drm_crtc_wait_one_vblank+0x17/0x30 [drm]
[ 2152.041623]  intel_crtc_wait_for_next_vblank+0xe/0x20 [i915]
[ 2152.042303]  hsw_audio_codec_disable+0xef/0x160 [i915]
[ 2152.042885]  intel_audio_codec_disable+0xb6/0x1d0 [i915]
[ 2152.043386]  intel_disable_ddi+0x101/0x1c0 [i915]
[ 2152.043875]  intel_encoders_disable+0x9b/0xd0 [i915]
[ 2152.044519]  hsw_crtc_disable+0x167/0x180 [i915]
[ 2152.045092]  intel_old_crtc_state_disables.constprop.0+0x38/0xa0 [i915]
[ 2152.045552]  intel_commit_modeset_disables+0x2a5/0x350 [i915]
[ 2152.045869]  ? intel_modeset_get_crtc_power_domains+0x8c/0xf0 [i915]
[ 2152.045943]  intel_atomic_commit_tail+0x190/0x920 [i915]
[ 2152.046015]  intel_atomic_commit+0x3bc/0x410 [i915]
[ 2152.046082]  drm_atomic_commit+0x96/0xd0 [drm]
[ 2152.046096]  ? __pfx___drm_printfn_info+0x10/0x10 [drm]
[ 2152.046115]  drm_client_modeset_commit_atomic+0x203/0x240 [drm]
[ 2152.046132]  drm_client_modeset_commit_locked+0x5b/0x170 [drm]
[ 2152.046147]  ? ttwu_queue_wakelist+0x135/0x1c0
[ 2152.046149]  __drm_fb_helper_restore_fbdev_mode_unlocked+0x86/0x100 [drm_kms_helper]
[ 2152.046160]  drm_fb_helper_set_par+0x2f/0x50 [drm_kms_helper]
[ 2152.046168]  intel_fbdev_set_par+0x1a/0x70 [i915]
[ 2152.046233]  fb_set_var+0x20a/0x460
[ 2152.046236]  ? set_next_entity+0xef/0x180
[ 2152.046237]  ? psi_group_change+0x219/0x530
[ 2152.046238]  ? pick_next_task_fair+0x395/0x5f0
[ 2152.046240]  fbcon_blank+0x24d/0x380
[ 2152.046242]  do_unblank_screen+0xc0/0x1c0
[ 2152.046244]  complete_change_console+0x67/0x150
[ 2152.046246]  vt_ioctl+0x484/0x9e0
[ 2152.046247]  tty_ioctl+0x2b8/0x8e0
[ 2152.046249]  ? do_syscall_64+0x67/0x90
[ 2152.046250]  ? syscall_exit_to_user_mode+0x29/0x50
[ 2152.046251]  ? do_syscall_64+0x67/0x90
[ 2152.046253]  ? __fget_light+0xa5/0x120
[ 2152.046254]  __x64_sys_ioctl+0x9d/0xe0
[ 2152.046256]  do_syscall_64+0x58/0x90
[ 2152.046258]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
[ 2152.046259] RIP: 0033:0x7f4e55b1aaff
[ 2152.046262] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <41> 89 c0 3d 00 f0 ff ff 77 1f 48 8b 44 24 18 64 48 2b 04 25 28 00
[ 2152.046262] RSP: 002b:00007ffe93fb5390 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 2152.046263] RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 00007f4e55b1aaff
[ 2152.046264] RDX: 0000000000000001 RSI: 0000000000005605 RDI: 000000000000000e
[ 2152.046264] RBP: 000056540ba2c898 R08: 0000000000000002 R09: 00007ffe93fb51e0
[ 2152.046265] R10: 0000000000000000 R11: 0000000000000246 R12: 000056540ba2c940
[ 2152.046265] R13: 000056540ba2c894 R14: 000056540ba2c950 R15: 000056540ba2c948
[ 2152.046266]  </TASK>
[ 2152.046266] ---[ end trace 0000000000000000 ]---
[ 2152.147935] ------------[ cut here ]------------
[ 2152.147939] i915 0000:00:02.0: vblank wait timed out on crtc 1
[ 2152.148043] WARNING: CPU: 4 PID: 4203 at drivers/gpu/drm/drm_vblank.c:1310 drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.148157] Modules linked in: tls nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter ccm rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack snd_seq_dummy snd_hrtimer nf_defrag_ipv6 nf_defrag_ipv4 cmac algif_hash algif_skcipher af_alg nf_tables nfnetlink zstd nvidia_uvm(POE) bridge stp llc overlay bnep zram snd_sof_pci_intel_tgl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda_mlink snd_sof_intel_hda snd_sof_pci intel_rapl_msr snd_sof_xtensa_dsp intel_rapl_common snd_sof snd_sof_utils snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi intel_uncore_frequency soundwire_bus intel_uncore_frequency_common intel_tcc_cooling iwlmvm snd_soc_core snd_hda_codec_realtek snd_compress snd_hda_codec_generic nvidia_drm(POE) x86_pkg_temp_thermal intel_powerclamp nvidia_modeset(POE) ledtrig_audio snd_hda_codec_hdmi ac97_bus snd_pcm_dmaengine mac80211 snd_hda_intel
[ 2152.148233]  snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec libarc4 uvcvideo snd_hda_core snd_hwdep videobuf2_vmalloc uvc videobuf2_memops videobuf2_v4l2 snd_pcm btusb coretemp btrtl videodev btbcm snd_seq_midi snd_seq_midi_event btintel videobuf2_common snd_rawmidi btmtk nls_iso8859_1 iwlwifi kvm_intel nvidia(POE) joydev mc snd_seq cmdlinepart bluetooth kvm pmt_telemetry ecdh_generic ecc pmt_class input_leds snd_seq_device spi_nor rapl cfg80211 snd_timer hid_multitouch ee1004 mtd serio_raw snd bfq intel_cstate intel_vsec soundcore igen6_edac intel_pmc_mux typec intel_scu_pltdrv intel_hid mac_hid sparse_keymap sch_fq_codel kyber_iosched msr parport_pc ppdev lp parport ramoops pstore_blk reed_solomon pstore_zone efi_pstore ip_tables x_tables autofs4 dm_crypt hid_logitech_hidpp hid_logitech_dj usbhid raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear system76_io(OE) i915 drm_buddy i2c_algo_bit ttm drm_display_helper cec rc_core
[ 2152.148323]  crct10dif_pclmul drm_kms_helper crc32_pclmul polyval_clmulni polyval_generic syscopyarea ghash_clmulni_intel sysfillrect sha512_ssse3 aesni_intel hid_generic sysimgblt nvme crypto_simd cryptd thunderbolt drm sdhci_pci nvme_core r8169 spi_intel_pci i2c_i801 cqhci spi_intel i2c_smbus intel_lpss_pci realtek xhci_pci sdhci intel_lpss i2c_hid_acpi xhci_pci_renesas idma64 nvme_common i2c_hid hid video wmi pinctrl_tigerlake system76_acpi(OE)
[ 2152.148363] CPU: 4 PID: 4203 Comm: Xorg Tainted: P        W  OE      6.4.0-76060400-generic #202306271339~1688382054~22.04~b23d286
[ 2152.148369] Hardware name: System76 Oryx Pro/Oryx Pro, BIOS 2023-06-08_36c78ea 06/08/2023
[ 2152.148371] RIP: 0010:drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.148456] Code: fe ff ff 48 8b 7b 08 4c 8b 6f 50 4d 85 ed 74 26 e8 56 1b 15 fb 44 89 e1 4c 89 ea 48 c7 c7 40 08 5e c0 48 89 c6 e8 e1 0b 77 fa <0f> 0b e9 7b fe ff ff 48 8b 1f eb 94 4c 8b 2f eb d5 e8 6b 2f 76 fb
[ 2152.148460] RSP: 0018:ffffab7dc5ffb5b0 EFLAGS: 00010246
[ 2152.148465] RAX: 0000000000000000 RBX: ffff91dbe6568000 RCX: 0000000000000000
[ 2152.148467] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 2152.148469] RBP: ffffab7dc5ffb608 R08: 0000000000000000 R09: 0000000000000000
[ 2152.148471] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
[ 2152.148472] R13: ffff91dbc2e33320 R14: 000000000004d106 R15: ffff91dbe6553960
[ 2152.148475] FS:  00007f4e55c2ba80(0000) GS:ffff91eb3d300000(0000) knlGS:0000000000000000
[ 2152.148479] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 2152.148481] CR2: 00000562786c7000 CR3: 0000000182a4e000 CR4: 0000000000750ee0
[ 2152.148484] PKRU: 55555554
[ 2152.148486] Call Trace:
[ 2152.148489]  <TASK>
[ 2152.148493]  ? show_regs+0x6d/0x80
[ 2152.148503]  ? __warn+0x89/0x160
[ 2152.148510]  ? drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.148592]  ? report_bug+0x17e/0x1b0
[ 2152.148600]  ? handle_bug+0x46/0x90
[ 2152.148609]  ? exc_invalid_op+0x18/0x80
[ 2152.148616]  ? asm_exc_invalid_op+0x1b/0x20
[ 2152.148626]  ? drm_wait_one_vblank+0x1ef/0x210 [drm]
[ 2152.148703]  ? __pfx_autoremove_wake_function+0x10/0x10
[ 2152.148714]  drm_crtc_wait_one_vblank+0x17/0x30 [drm]
[ 2152.148792]  intel_crtc_wait_for_next_vblank+0xe/0x20 [i915]
[ 2152.149173]  hsw_audio_codec_disable+0xf7/0x160 [i915]
[ 2152.149504]  intel_audio_codec_disable+0xb6/0x1d0 [i915]
[ 2152.149811]  intel_disable_ddi+0x101/0x1c0 [i915]
[ 2152.150097]  intel_encoders_disable+0x9b/0xd0 [i915]
[ 2152.150377]  hsw_crtc_disable+0x167/0x180 [i915]
[ 2152.150648]  intel_old_crtc_state_disables.constprop.0+0x38/0xa0 [i915]
[ 2152.150903]  intel_commit_modeset_disables+0x2a5/0x350 [i915]
[ 2152.151149]  ? intel_modeset_get_crtc_power_domains+0x8c/0xf0 [i915]
[ 2152.151397]  intel_atomic_commit_tail+0x190/0x920 [i915]
[ 2152.151631]  intel_atomic_commit+0x3bc/0x410 [i915]
[ 2152.151881]  drm_atomic_commit+0x96/0xd0 [drm]
[ 2152.151937]  ? __pfx___drm_printfn_info+0x10/0x10 [drm]
[ 2152.152041]  drm_client_modeset_commit_atomic+0x203/0x240 [drm]
[ 2152.152124]  drm_client_modeset_commit_locked+0x5b/0x170 [drm]
[ 2152.152189]  ? ttwu_queue_wakelist+0x135/0x1c0
[ 2152.152195]  __drm_fb_helper_restore_fbdev_mode_unlocked+0x86/0x100 [drm_kms_helper]
[ 2152.152232]  drm_fb_helper_set_par+0x2f/0x50 [drm_kms_helper]
[ 2152.152263]  intel_fbdev_set_par+0x1a/0x70 [i915]
[ 2152.152538]  fb_set_var+0x20a/0x460
[ 2152.152545]  ? set_next_entity+0xef/0x180
[ 2152.152550]  ? psi_group_change+0x219/0x530
[ 2152.152553]  ? pick_next_task_fair+0x395/0x5f0
[ 2152.152561]  fbcon_blank+0x24d/0x380
[ 2152.152569]  do_unblank_screen+0xc0/0x1c0
[ 2152.152575]  complete_change_console+0x67/0x150
[ 2152.152580]  vt_ioctl+0x484/0x9e0
[ 2152.152585]  tty_ioctl+0x2b8/0x8e0
[ 2152.152591]  ? do_syscall_64+0x67/0x90
[ 2152.152596]  ? syscall_exit_to_user_mode+0x29/0x50
[ 2152.152601]  ? do_syscall_64+0x67/0x90
[ 2152.152605]  ? __fget_light+0xa5/0x120
[ 2152.152612]  __x64_sys_ioctl+0x9d/0xe0
[ 2152.152617]  do_syscall_64+0x58/0x90
[ 2152.152622]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
[ 2152.152627] RIP: 0033:0x7f4e55b1aaff
[ 2152.152634] Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <41> 89 c0 3d 00 f0 ff ff 77 1f 48 8b 44 24 18 64 48 2b 04 25 28 00
[ 2152.152637] RSP: 002b:00007ffe93fb5390 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 2152.152641] RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 00007f4e55b1aaff
[ 2152.152644] RDX: 0000000000000001 RSI: 0000000000005605 RDI: 000000000000000e
[ 2152.152646] RBP: 000056540ba2c898 R08: 0000000000000002 R09: 00007ffe93fb51e0
[ 2152.152648] R10: 0000000000000000 R11: 0000000000000246 R12: 000056540ba2c940
[ 2152.152650] R13: 000056540ba2c894 R14: 000056540ba2c950 R15: 000056540ba2c948
[ 2152.152653]  </TASK>
[ 2152.152655] ---[ end trace 0000000000000000 ]---
[ 2177.698265] rfkill: input handler disabled

@leviport
Copy link
Member

leviport commented Jul 6, 2023

I've gotten two more freezes with bluetooth turned off completely, so I think we can rule that out. My oryp9 is currently in Hybrid mode, but I will try other graphics modes as well.

@mmstick mmstick mentioned this pull request Jul 6, 2023
@mmstick mmstick closed this Jul 6, 2023
@jackpot51 jackpot51 deleted the linux-6.4.0_jammy branch July 7, 2023 13:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.