cancel
Showing results for 
Search instead for 
Did you mean: 

Server Processors

mask710415
Journeyman III

AMD-Vi: Completion-Wait loop timed out

The Network adapter on my server suddenly failed with following massages, since the message "AMD-Vi: Completion-Wait loop timed out" always comes before others, it seems like a consequence caused by AMD?

O/S: Debian

 

kernel: [37015873.926153] AMD-Vi: Completion-Wait loop timed out
kernel: [37015881.576869] NETDEV WATCHDOG: eth0 (i40e): transmit queue 0 timed out
kernel: [37015881.576883] WARNING: CPU: 27 PID: 0 at net/sched/sch_generic.c:467 dev_watchdog+0x24d/0x260
kernel: [37015881.577044] i40e 0000:41:00.0 eth0: tx_timeout: VSI_seid: 390, Q 0, NTC: 0xc2, HWB: 0x47, NTU: 0x47, TAIL: 0x47, INT: 0x0
kernel: [37015881.577045] i40e 0000:41:00.0 eth0: tx_timeout recovery level 1, txqueue 0
kernel: [37015881.577080] i40e 0000:41:00.1 eth1: tx_timeout: VSI_seid: 391, Q 1, NTC: 0x3d, HWB: 0xfc, NTU: 0xfc, TAIL: 0xfc, INT: 0x0
kernel: [37015881.577081] i40e 0000:41:00.1 eth1: tx_timeout recovery level 1, txqueue 1
kernel: [37015881.580958] bond1: (slave eth0): link status down for interface, disabling it in 200 ms

 


kernel: [36508574.881125] AMD-Vi: Completion-Wait loop timed out
ernel: [36508585.629831] NMI watchdog: Watchdog detected hard LOCKUP on cpu 107
kernel: [36508585.629859] RIP: 0010:native_queued_spin_lock_slowpath+0x119/0x1d0
kernel: [36508585.629861] Code: 0d 18 5e 74 68 c3 41 83 c0 01 c1 e1 10 41 c1 e0 12 44 09 c1 89 c8 c1 e8 10 66 87 47 02 89 c6 c1 e6 10 75 62 31 f6 eb 02 f3 90 <8b> 07 66 85 c0 75 f7 41 89 c0 66 45 31 c0 44 39 c1 0f 84 8a 00 00
kernel: [36508585.629861] RSP: 0018:ffffa7874e058d88 EFLAGS: 00000002
kernel: [36508585.629862] RAX: 0000000000ec0001 RBX: ffff989150811808 RCX: 0000000001b00000
kernel: [36508585.629862] RDX: ffff98d00f6ec940 RSI: ffff98b00e52c940 RDI: ffff989169243258
kernel: [36508585.629863] RBP: ffff989169243258 R08: 0000000001b00000 R09: 00000000cc96a000
kernel: [36508585.629863] R10: 0000000000000001 R11: 000ffffffffff000 R12: 0000000000000006
kernel: [36508585.629864] R13: ffff989169243200 R14: 00000000000cc96a R15: ffff989150811808
kernel: [36508585.629865] FS: 00007fcc76fd4100(0000) GS:ffff98d00f6c0000(0000) knlGS:0000000000000000
kernel: [36508585.629865] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
kernel: [36508585.629865] CR2: 0000559a5eb3b498 CR3: 0000002098156000 CR4: 0000000000350ee0
kernel: [36508585.629866] Call Trace:
kernel: [36508585.629866] <IRQ>
kernel: [36508585.629867] _raw_spin_lock_irqsave+0x32/0x40
kernel: [36508585.629867] amd_iommu_flush_iotlb_all+0x1a/0x50
kernel: [36508585.629867] iova_domain_flush+0x1a/0x30
kernel: [36508585.629868] queue_iova+0xe3/0x130
kernel: [36508585.629868] __iommu_dma_unmap+0x97/0x100
kernel: [36508585.629868] i40e_napi_poll+0x1a8/0x1280 [i40e]
kernel: [36508585.629868] net_rx_action+0x145/0x3e0
kernel: [36508585.629869] __do_softirq+0xc5/0x275
kernel: [36508585.629869] asm_call_irq_on_stack+0x12/0x20
kernel: [36508585.629869] </IRQ>
kernel: [36508585.629869] do_softirq_own_stack+0x37/0x40
kernel: [36508585.629870] irq_exit_rcu+0x8e/0xc0
kernel: [36508585.629870] common_interrupt+0x74/0x130
kernel: [36508585.629870] asm_common_interrupt+0x1e/0x40
kernel: [36508585.629871] RIP: 0010:kmem_cache_free+0x5e/0x410
kernel: [36508585.629871] Code: 48 c7 c0 00 00 00 80 45 31 ed 48 2b 05 bb 0b ee 00 48 01 d8 48 8b 0d a1 0b ee 00 48 c1 e8 0c 48 c1 e0 06 48 01 c8 48 8b 50 08 <48> 8d 72 ff 83 e2 01 48 0f 45 c6 48 8b 70 08 48 8d 56 ff 83 e6 01
kernel: [36508585.629872] RSP: 0018:ffffa787524a7d50 EFLAGS: 00000282
kernel: [36508585.629872] RAX: ffffdf8f95062e80 RBX: ffff9896018ba990 RCX: ffffdf8f80000000
kernel: [36508585.629872] RDX: ffffdf8f95062e01 RSI: ffff9895818ba990 RDI: ffff98a6126d4900
kernel: [36508585.629873] RBP: 0000000000000001 R08: ffff989220359ac0 R09: 0000000000000000
kernel: [36508585.629873] R10: 0000000000000007 R11: 0000000000000000 R12: ffff9895818ba990
kernel: [36508585.629873] R13: 0000000000000000 R14: 0000000000000bc6 R15: ffff98bea205c780
kernel: [36508585.629874] ? mem_cgroup_charge_skmem+0x81/0x110
kernel: [36508585.629874] inet_csk_accept+0x2d2/0x3b0
kernel: [36508585.629874] ? alloc_file_pseudo+0xa3/0x110
kernel: [36508585.629875] ? _cond_resched+0x16/0x40
kernel: [36508585.629875] inet_accept+0x43/0x160
kernel: [36508585.629875] __sys_accept4_file+0x122/0x1e0Feb 5 04:22:18 music-proxy44 kernel: [36508585.629875] ? do_epoll_wait+0x246/0x650
kernel: [36508585.629876] __sys_accept4+0x54/0x90
kernel: [36508585.629876] __x64_sys_accept4+0x1a/0x20
kernel: [36508585.629876] do_syscall_64+0x33/0x80
kernel: [36508585.629876] entry_SYSCALL_64_after_hwframe+0x44/0xa9

0 Likes
0 Replies