Project

General

Profile

Actions

Bug #13913

open

aggr is broken once resource pools are enabled

Added by Yuri Pankov 11 months ago. Updated 8 months ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
kernel
Start date:
Due date:
% Done:

0%

Estimated time:
Difficulty:
Medium
Tags:
Gerrit CR:

Description

To reproduce:

dladm create-aggr -l e1000g0 -l e1000g1 aggr0
pooladm -e

> ::stack
mac_tx_cpu_init+0x53(fffffe16f131e020, 0, fffffffffbc94d80)
mac_flow_cpu_init+0x203(fffffe16f131e020, fffffffffbc94d80)
mac_fanout_setup+0x65(fffffe16edcb3a38, fffffe16f131e020,
fffffe16f0209000, fffffffffbbd5860, fffffe16edcb3a38, 0)
mac_pool_link_update+0x2ff(fffffe16ef422020, fffffe16ef422008,
fffffe16fdfa4280)
i_mod_hash_walk_nosync+0x53(fffffe16dfdcfc40, fffffffffbbbcaf0,
fffffe16fdfa4280)
mod_hash_walk+0x3f(fffffe16dfdcfc40, fffffffffbbbcaf0, fffffe16fdfa4280)
mac_pool_update+0x27(fffffe16fdfa4280)
mac_pool_event_cb+0x97(0, 0, 0)
pool_event_notify+0x55(fffffe16f0ccd7a8)
taskq_thread+0x2cd(fffffe16f0b37df0)
thread_start+0xb()

On debug build this fails the "link is not aggr" assertion
in mac_fanout_setup().

Actions #1

Updated by Tony Norlin 8 months ago

Yuri Pankov wrote:

To reproduce:
[...]

[...]

On debug build this fails the "link is not aggr" assertion
in mac_fanout_setup().

I stumbled on this issue from the other way around. I had aggr active and wanted to enable the resource pools, but this resulted into panic instead:

fmdump -Vp -u bf325a05-9672-4a63-9853-9b4411c5ddd1
TIME UUID SUNW-MSG-ID
Sep 17 2021 16:34:28.490940000 bf325a05-9672-4a63-9853-9b4411c5ddd1 SUNOS-8000-KL

TIME                 CLASS                                 ENA
Sep 17 16:34:28.4844 ireport.os.sunos.panic.dump_available 0x0000000000000000
Sep 17 16:34:16.6186 ireport.os.sunos.panic.dump_pending_on_device 0x0000000000000000

nvlist version: 0
version = 0x0
class = list.suspect
uuid = bf325a05-9672-4a63-9853-9b4411c5ddd1
code = SUNOS-8000-KL
diag-time = 1631889268 486222
de = fmd://module/software-diagnosis
fault-list-sz = 0x1
fault-list = (array of embedded nvlists)
(start fault-list0)
nvlist version: 0
version = 0x0
class = defect.sunos.kernel.panic
certainty = 0x64
asru = sw:///:path=/var/crash//.bf325a05-9672-4a63-9853-9b4411c5ddd1
resource = sw:///:path=/var/crash//.bf325a05-9672-4a63-9853-9b4411c5ddd1
savecore-succcess = 1
dump-dir = /var/crash/
dump-files = vmdump.0
os-instance-uuid = bf325a05-9672-4a63-9853-9b4411c5ddd1
panicstr = BAD TRAP: type=e (#pf Page fault) rp=fffffe00f67286d0 addr=90 occurred in module "mac" due to a NULL pointer dereference
panicstack = unix:die+c6 () | unix:trap+11c1 () | unix:cmntrap+e9 () | mac:mac_tx_cpu_init+53 () | mac:mac_flow_cpu_init+203 () | mac:mac_fanout_setup+65 () | mac:mac_pool_link_update+2ff () | genunix:i_mod_hash_walk_nosync+53 () | genunix:mod_hash_walk+3f () | mac:mac_pool_update+27 () | mac:mac_pool_event_cb+97 () | genunix:pool_event_notify+55 () | genunix:taskq_thread+2cd () | unix:thread_start+b () |
crashtime = 1631889111
panic-time = Fri Sep 17 16:31:51 2021 CEST
(end fault-list0)

fault-status = 0x1
severity = Major
__ttl = 0x1
__tod = 0x6144a774 0x1d432660
Actions

Also available in: Atom PDF