simple-framebuffer: swiotlb buffer is full

Bug #2062426 reported by Rob Somerville
28
This bug affects 4 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Confirmed
Undecided
Unassigned
nvidia-graphics-drivers-550 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

[ 29.548909] ------------[ cut here ]------------
[ 29.548912] simple-framebuffer simple-framebuffer.0: drm_WARN_ON(map->is_iomem)
[ 29.548924] WARNING: CPU: 7 PID: 2700 at drivers/gpu/drm/drm_gem_shmem_helper.c:319 drm_gem_shmem_vmap+0x1a5/0x1e0
[ 29.548932] Modules linked in: nf_tables libcrc32c snd_seq_dummy snd_hrtimer ccm qrtr binfmt_misc nvidia_uvm(POE) nvidia_drm(POE) nvidia_modeset(POE) nls_iso8859_1 nvidia(POE) intel_rapl_msr intel_rapl_common snd_hda_codec_hdmi snd_hda_intel rtl8192ee snd_intel_dspcfg snd_intel_sdw_acpi btcoexist snd_hda_codec snd_usb_audio rtl_pci snd_hda_core snd_usbmidi_lib rtlwifi snd_hwdep snd_ump snd_seq_midi edac_mce_amd snd_seq_midi_event snd_rawmidi mac80211 mc kvm_amd snd_seq snd_pcm snd_seq_device kvm joydev snd_timer input_leds cfg80211 snd irqbypass soundcore libarc4 rapl video wmi_bmof ccp k10temp i2c_piix4 gpio_amdpt mac_hid msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 dm_crypt hid_generic usbhid hid crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic r8169 ghash_clmulni_intel sha256_ssse3 sha1_ssse3 ahci xhci_pci realtek libahci xhci_pci_renesas wmi aesni_intel crypto_simd cryptd
[ 29.549044] CPU: 7 PID: 2700 Comm: Xorg Tainted: P OE 6.8.0-22-generic #22-Ubuntu
[ 29.549047] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./A320M-DVS R4.0, BIOS P4.00 07/16/2020
[ 29.549049] RIP: 0010:drm_gem_shmem_vmap+0x1a5/0x1e0
[ 29.549054] Code: 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 44 a6 ec ff 48 c7 c1 73 62 82 9f 4c 89 ea 48 c7 c7 b2 3d 82 9f 48 89 c6 e8 4b 3f 44 ff <0f> 0b 48 8b 83 f0 00 00 00 4c 89 e6 48 8b 38 e8 f7 48 f5 ff b8 fb
[ 29.549056] RSP: 0018:ffffb6da8304ba08 EFLAGS: 00010246
[ 29.549060] RAX: 0000000000000000 RBX: ffffa0778c558a00 RCX: 0000000000000000
[ 29.549062] RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
[ 29.549063] RBP: ffffb6da8304ba28 R08: 0000000000000000 R09: 0000000000000000
[ 29.549065] R10: 0000000000000000 R11: 0000000000000000 R12: ffffa0778cdaa6c8
[ 29.549067] R13: ffffa07781d2efc0 R14: ffffa0778cdaa6c8 R15: ffffa0778cdaa6c8
[ 29.549069] FS: 00007e8343530ac0(0000) GS:ffffa07a8e580000(0000) knlGS:0000000000000000
[ 29.549071] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 29.549073] CR2: 0000567a82464778 CR3: 000000014044e000 CR4: 00000000003506f0
[ 29.549075] Call Trace:
[ 29.549078] <TASK>
[ 29.549082] ? show_regs+0x6d/0x80
[ 29.549087] ? __warn+0x89/0x160
[ 29.549092] ? drm_gem_shmem_vmap+0x1a5/0x1e0
[ 29.549096] ? report_bug+0x17e/0x1b0
[ 29.549103] ? handle_bug+0x51/0xa0
[ 29.549108] ? exc_invalid_op+0x18/0x80
[ 29.549112] ? asm_exc_invalid_op+0x1b/0x20
[ 29.549121] ? drm_gem_shmem_vmap+0x1a5/0x1e0
[ 29.549125] ? drm_gem_shmem_vmap+0x1a5/0x1e0
[ 29.549130] drm_gem_shmem_object_vmap+0x9/0x20
[ 29.549133] drm_gem_vmap+0x26/0x80
[ 29.549138] drm_gem_vmap_unlocked+0x2b/0x50
[ 29.549142] drm_gem_fb_vmap+0x40/0x150
[ 29.549147] drm_gem_begin_shadow_fb_access+0x25/0x40
[ 29.549150] drm_atomic_helper_prepare_planes.part.0+0x142/0x1e0
[ 29.549155] drm_atomic_helper_prepare_planes+0x5d/0x70
[ 29.549160] drm_atomic_helper_commit+0x84/0x160
[ 29.549164] drm_atomic_commit+0x99/0xd0
[ 29.549167] ? __pfx___drm_printfn_info+0x10/0x10
[ 29.549173] drm_atomic_helper_set_config+0x82/0xd0
[ 29.549176] drm_mode_setcrtc+0x535/0x8b0
[ 29.549187] ? __pfx_drm_mode_setcrtc+0x10/0x10
[ 29.549191] drm_ioctl_kernel+0xbc/0x120
[ 29.549194] ? srso_return_thunk+0x5/0x5f
[ 29.549200] drm_ioctl+0x2d4/0x550
[ 29.549204] ? __pfx_drm_mode_setcrtc+0x10/0x10
[ 29.549214] __x64_sys_ioctl+0xa3/0xf0
[ 29.549220] do_syscall_64+0x85/0x180
[ 29.549223] ? srso_return_thunk+0x5/0x5f
[ 29.549226] ? count_memcg_events.constprop.0+0x2a/0x50
[ 29.549231] ? srso_return_thunk+0x5/0x5f
[ 29.549234] ? handle_mm_fault+0xad/0x380
[ 29.549238] ? srso_return_thunk+0x5/0x5f
[ 29.549241] ? do_user_addr_fault+0x338/0x6b0
[ 29.549245] ? srso_return_thunk+0x5/0x5f
[ 29.549248] ? irqentry_exit_to_user_mode+0x7b/0x260
[ 29.549252] ? srso_return_thunk+0x5/0x5f
[ 29.549255] ? irqentry_exit+0x43/0x50
[ 29.549258] ? srso_return_thunk+0x5/0x5f
[ 29.549261] ? exc_page_fault+0x94/0x1b0
[ 29.549265] entry_SYSCALL_64_after_hwframe+0x6e/0x76
[ 29.549269] RIP: 0033:0x7e8343924ded
[ 29.549283] Code: 04 25 28 00 00 00 48 89 45 c8 31 c0 48 8d 45 10 c7 45 b0 10 00 00 00 48 89 45 b8 48 8d 45 d0 48 89 45 c0 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 1a 48 8b 45 c8 64 48 2b 04 25 28 00 00 00
[ 29.549285] RSP: 002b:00007ffeea092ae0 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 29.549288] RAX: ffffffffffffffda RBX: 0000567a826da2f0 RCX: 00007e8343924ded
[ 29.549290] RDX: 00007ffeea092b70 RSI: 00000000c06864a2 RDI: 0000000000000013
[ 29.549292] RBP: 00007ffeea092b30 R08: 0000000000000000 R09: 0000567a82f52a40
[ 29.549294] R10: 0000000000000000 R11: 0000000000000246 R12: 00007ffeea092b70
[ 29.549295] R13: 00000000c06864a2 R14: 0000000000000013 R15: 0000567a82465ba8
[ 29.549303] </TASK>
[ 29.549304] ---[ end trace 0000000000000000 ]---

[How to recreate]

Unknown, was reported in logfile yet PC was operating OK without any application crashes.

[Other info]

OS: Ubuntu Noble Numbat (development branch) x86_64
Kernel: 6.8.0-22-generic
Uptime: 5 hours, 9 mins
Packages: 2301 (dpkg), 8 (flatpak), 10 (snap)
Shell: bash 5.2.21
Resolution: 1360x768, 1360x768
DE: GNOME 46.0
WM: Mutter
WM Theme: Adwaita
Theme: Yaru [GTK2/3]
Icons: Yaru [GTK2/3]
Terminal: guake
CPU: AMD Ryzen 5 2600 (12) @ 3.400GHz
GPU: NVIDIA GeForce GTX 960
Memory: 3619MiB / 15914MiB

[NVIDIA-SMI]
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.67 Driver Version: 550.67 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA GeForce GTX 960 Off | 00000000:06:00.0 On | N/A |
| 35% 27C P8 15W / 208W | 79MiB / 4096MiB | 1% Default |
| | | N/A |
+-----------------------------------------+------------------------+----------------------+

+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| 0 N/A N/A 3638 G /usr/bin/gnome-shell 61MiB |
| 0 N/A N/A 4456 C+G /usr/libexec/xdg-desktop-portal-gnome 11MiB |
+-----------------------------------------------------------------------------------------+

Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Libera.chat.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/2062426/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Revision history for this message
Rob Somerville (linuxgreybeard) wrote :

Can't ascertain if this is a kernel fault or a driver issue, have tagged it as NVIDIA driver issue as drm_gem_shmem_helper.c is part of GPU drivers.

affects: ubuntu → nvidia-graphics-drivers-550 (Ubuntu)
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in nvidia-graphics-drivers-550 (Ubuntu):
status: New → Confirmed
Revision history for this message
garyx (garyx) wrote :

I seem to be able to get this crash consistently after locking the desktop and after a little while I come back and the session has crashed.

Need to reboot usually after this. Same nvidia driver version 550. Up to date 24.04 install.

Revision history for this message
garyx (garyx) wrote :
Download full text (13.0 KiB)

simple-framebuffer simple-framebuffer.0: swiotlb buffer is full (sz: 524288 bytes), total 32768 (slots), used 7966 (slots)
------------[ cut here ]------------
simple-framebuffer simple-framebuffer.0: drm_WARN_ON(map->is_iomem)
WARNING: CPU: 7 PID: 5181 at drivers/gpu/drm/drm_gem_shmem_helper.c:319 drm_gem_shmem_vmap+0x1a5/0x1e0
Modules linked in: tls vhost_net tap veth nft_masq btrfs blake2b_generic nvme_tcp nvme_keyring nvme_fabrics nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter rfcomm vhost_vsock vmw_vsock_virtio_transport_common vhost vhost_iotlb vsock snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables bridge stp llc overlay qrtr cmac algif_hash algif_skcipher af_alg bnep binfmt_misc nls_iso8859_1 snd_hda_codec_realtek intel_rapl_msr intel_rapl_common snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_usb_audio snd_hda_codec snd_usbmidi_lib snd_ump snd_hda_core mc snd_hwdep edac_mce_amd snd_pcm snd_seq_midi kvm_amd btusb ucsi_ccg snd_seq_midi_event typec_ucsi btrtl snd_rawmidi typec btintel snd_seq kvm btbcm eeepc_wmi btmtk snd_seq_device asus_ec_sensors irqbypass asus_wmi snd_timer bluetooth ledtrig_audio platform_profile xpad snd
 rapl joydev i2c_nvidia_gpu ecdh_generic sparse_keymap mxm_wmi ff_memless wmi_bmof i2c_piix4 k10temp ecc soundcore i2c_ccgx_ucsi ccp input_leds nvidia_uvm(POE) mac_hid msr parport_pc ppdev lp parport efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 dm_mirror dm_region_hash dm_log hid_logitech_hidpp hid_logitech_dj hid_generic usbhid hid nvidia_drm(POE) nvidia_modeset(POE) crct10dif_pclmul crc32_pclmul polyval_clmulni polyval_generic ghash_clmulni_intel nvme sha256_ssse3 nvidia(POE) igb sha1_ssse3 ahci nvme_core xhci_pci i2c_algo_bit video libahci xhci_pci_renesas dca nvme_auth wmi aesni_intel crypto_simd cryptd
CPU: 7 PID: 5181 Comm: Xorg Tainted: P OE 6.8.7-060807-generic #202404170934
Hardware name: System manufacturer System Product Name/ROG STRIX X570-F GAMING, BIOS 3001 12/04/2020
RIP: 0010:drm_gem_shmem_vmap+0x1a5/0x1e0
Code: 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 04 15 02 00 48 c7 c1 3a f5 1c 99 4c 89 ea 48 c7 c7 5f d0 1c 99 48 89 c6 e8 6b 53 56 ff <0f> 0b 48 8b 83 f0 00 00 00 4c 89 e6 48 8b 38 e8 97 b7 0a 00 b8 fb
RSP: 0018:ffffc0321266b8b8 EFLAGS: 00010246
RAX: 0000000000000000 RBX: ffff992062220c00 RCX: 0000000000000000
RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
RBP: ffffc0321266b8d8 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000000 R12: ffff992057bb38c8
R13: ffff992041ba8b80 R14: ffff992057bb38c8 R15: ffff992057bb38c8
FS: 00007ef65c3bfac0(0000) GS:ffff992f2e580000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00005cdd41632058 CR3: 00000001ccfa8000 CR4: 0000000000f50ef0
PKRU: 55555554
Call Trace:
 <TASK>
 ? show_regs+0x6d/0x80
 ? __warn+0x89/0x160
 ? drm_gem_shmem_vmap+0x1a5/0x1...

Revision history for this message
Johannes (jonppe) wrote (last edit ):
Download full text (4.0 KiB)

On my Ubuntu 24.04, I seem to be getting the same issue when running 6.8.0 kernels (but not on 6.5.0 kernels).
I'm currently using Nvidia driver version 535.171.04 with mobile RTX 3080.

The issue happens always when going to suspend (by closing the lid).
When waking up, X or GDM3 seems to have crash and forces me to log in again. The system seems to be a bit slow/unstable still for a some tens of seconds after logging in.

/usr/libexec/gdm-x-session[90542]: (EE) modeset(G0): failed to set mode: Input/output error
kernel: ------------[ cut here ]------------
kernel: simple-framebuffer simple-framebuffer.0: drm_WARN_ON(map->is_iomem)
kernel: WARNING: CPU: 4 PID: 90542 at drivers/gpu/drm/drm_gem_shmem_helper.c:319 drm_gem_shmem_vmap+0x1a5/0x1e0
kernel: Modules linked in: ufs qnx4 hfsplus hfs minix ntfs msdos jfs nls_ucs2_utils xfs cpuid ccm rfcomm snd_seq_dummy snd_hrtimer xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables bridge stp llc qrtr overlay cmac algif_hash algif_skcipher af_alg bnep binfmt_misc nvidia_uvm(PO) nls_iso8859_1 nvidia_drm(PO) nvidia_modeset(PO) nvidia(PO) snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_intel intel_rapl_msr snd_intel_dspcfg intel_rapl_common snd_intel_sdw_acpi snd_hda_codec snd_hda_scodec_cs35l41_i2c snd_hda_core snd_hda_scodec_cs35l41 snd_hwdep edac_mce_amd uvcvideo btusb snd_hda_cs_dsp_ctls cs_dsp videobuf2_vmalloc btrtl snd_pcm iwlmvm snd_soc_cs35l41_lib uvc btintel kvm_amd snd_seq_midi videobuf2_memops btbcm mac80211 videobuf2_v4l2 btmtk snd_seq_midi_event libarc4 snd_rawmidi kvm bluetooth videodev snd_seq iwlwifi irqbypass videobuf2_common ecdh_generic snd_seq_device ecc rapl mc
kernel: wmi_bmof snd_timer cfg80211 ideapad_laptop sparse_keymap i2c_piix4 snd k10temp ccp platform_profile soundcore cm32181 industrialio serial_multi_instantiate input_leds joydev serio_raw mac_hid nfsd auth_rpcgss msr nfs_acl parport_pc lockd ppdev grace lp parport sunrpc efi_pstore nfnetlink dmi_sysfs ip_tables x_tables autofs4 btrfs blake2b_generic dm_crypt raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 usbhid crct10dif_pclmul crc32_pclmul polyval_clmulni hid_multitouch polyval_generic ghash_clmulni_intel hid_generic nvme sha256_ssse3 ucsi_acpi ahci nvme_core r8169 xhci_pci sha1_ssse3 typec_ucsi video i2c_hid_acpi libahci xhci_pci_renesas realtek nvme_auth typec i2c_hid wmi hid aesni_intel crypto_simd cryptd
kernel: CPU: 4 PID: 90542 Comm: Xorg Tainted: P W O 6.8.0-31-generic #31-Ubuntu
kernel: Hardware name: LENOVO 82N6/LNVNB161216, BIOS GKCN60WW 03/07/2023
kernel: RIP: 0010:drm_gem_shmem_vmap+0x1a5/0x1e0
kernel: Code: 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 44 a4 ec ff 48 c7 c1 ed 58 e2 a9 4c 89 ea 48 c7 c7 2c 34 e2 a9 48 89 c6 e8 1b 14 44 ff <0f> 0b 48 8b 83 f0 00 00 00 4c 89 e6 48 8b 38 e8 f7 48 f5 ff b8 fb
kernel: RSP: 0018:ffffb0400e44fa78 EFLAGS: 00010246
kernel: RAX: 0000000000000000 RBX: ffff9a88fe9e4c00 RCX: 0000000000000000
kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: 0000000000000000
kernel: RB...

Read more...

Revision history for this message
Daniel van Vugt (vanvugt) wrote :

Also confirmed in bug 2068574.

tags: added: noble simpledrm
Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Daniel van Vugt (vanvugt) wrote :

Please try the workarounds in bug 2060268.

tags: added: nvidia
Revision history for this message
Johannes (jonppe) wrote :

I can confirm that the extra kernel parameter initcall_blacklist=simpledrm_platform_driver_init fix the issue. Unfortunately it is a bit inconvenient with full disk encryption

Revision history for this message
Daniel van Vugt (vanvugt) wrote :

Yeah that's not what a proper fix would do, it's just a workaround.

You can work around that new issue by adding 'amdgpu' or 'i915' to '/etc/initramfs-tools/modules' and then run:

  sudo update-initramfs -k all -u

or get the system to automatically insert all the modules you're currently using by changing the MODULES line in '/etc/initramfs-tools/initramfs.conf' to MODULES=dep and again:

  sudo update-initramfs -k all -u

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.