diff mbox series

[net] ice: check whether AUX devices/drivers are supported in ice_rebuild

Message ID 20210903012500.39407-1-yongxin.liu@windriver.com
State New
Headers show
Series [net] ice: check whether AUX devices/drivers are supported in ice_rebuild | expand

Commit Message

Yongxin Liu Sept. 3, 2021, 1:25 a.m. UTC
In ice_rebuild(), check whether AUX devices/drivers are supported or not
before calling ice_plug_aux_dev().

Fix the following call trace, if RDMA functionality is not available.

  auxiliary ice.roce.0: adding auxiliary device failed!: -17
  sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'
  Workqueue: ice ice_service_task [ice]
  Call Trace:
   dump_stack_lvl+0x38/0x49
   dump_stack+0x10/0x12
   sysfs_warn_dup+0x5b/0x70
   sysfs_do_create_link_sd.isra.2+0xc8/0xd0
   sysfs_create_link+0x25/0x40
   bus_add_device+0x6d/0x110
   device_add+0x49d/0x940
   ? _printk+0x52/0x6e
   ? _printk+0x52/0x6e
   __auxiliary_device_add+0x60/0xc0
   ice_plug_aux_dev+0xd3/0xf0 [ice]
   ice_rebuild+0x27d/0x510 [ice]
   ice_do_reset+0x51/0xe0 [ice]
   ice_service_task+0x108/0xe70 [ice]
   ? __switch_to+0x13b/0x510
   process_one_work+0x1de/0x420
   ? apply_wqattrs_cleanup+0xc0/0xc0
   worker_thread+0x34/0x400
   ? apply_wqattrs_cleanup+0xc0/0xc0
   kthread+0x14d/0x180
   ? set_kthread_struct+0x40/0x40
   ret_from_fork+0x1f/0x30

Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")
Signed-off-by: Yongxin Liu <yongxin.liu@windriver.com>
---
 drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)

Comments

Leon Romanovsky Sept. 5, 2021, 7:23 a.m. UTC | #1
On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:
> In ice_rebuild(), check whether AUX devices/drivers are supported or not

> before calling ice_plug_aux_dev().

> 

> Fix the following call trace, if RDMA functionality is not available.

> 

>   auxiliary ice.roce.0: adding auxiliary device failed!: -17

>   sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'

>   Workqueue: ice ice_service_task [ice]

>   Call Trace:

>    dump_stack_lvl+0x38/0x49

>    dump_stack+0x10/0x12

>    sysfs_warn_dup+0x5b/0x70

>    sysfs_do_create_link_sd.isra.2+0xc8/0xd0

>    sysfs_create_link+0x25/0x40

>    bus_add_device+0x6d/0x110

>    device_add+0x49d/0x940

>    ? _printk+0x52/0x6e

>    ? _printk+0x52/0x6e

>    __auxiliary_device_add+0x60/0xc0

>    ice_plug_aux_dev+0xd3/0xf0 [ice]

>    ice_rebuild+0x27d/0x510 [ice]

>    ice_do_reset+0x51/0xe0 [ice]

>    ice_service_task+0x108/0xe70 [ice]

>    ? __switch_to+0x13b/0x510

>    process_one_work+0x1de/0x420

>    ? apply_wqattrs_cleanup+0xc0/0xc0

>    worker_thread+0x34/0x400

>    ? apply_wqattrs_cleanup+0xc0/0xc0

>    kthread+0x14d/0x180

>    ? set_kthread_struct+0x40/0x40

>    ret_from_fork+0x1f/0x30

> 

> Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")

> Signed-off-by: Yongxin Liu <yongxin.liu@windriver.com>

> ---

>  drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-

>  1 file changed, 3 insertions(+), 1 deletion(-)

> 

> diff --git a/drivers/net/ethernet/intel/ice/ice_main.c b/drivers/net/ethernet/intel/ice/ice_main.c

> index 0d6c143f6653..98cc708e9517 100644

> --- a/drivers/net/ethernet/intel/ice/ice_main.c

> +++ b/drivers/net/ethernet/intel/ice/ice_main.c

> @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf, enum ice_reset_req reset_type)

>  	/* if we get here, reset flow is successful */

>  	clear_bit(ICE_RESET_FAILED, pf->state);

>  

> -	ice_plug_aux_dev(pf);

> +	if (ice_is_aux_ena(pf))

> +		ice_plug_aux_dev(pf);

> +


The change is ok, but it hints that auxiliary bus is used horribly wrong
in this driver. In proper implementation, which should rely on driver/core,
every subdriver like ice.eth, ice.roce e.t.c is supposed to be retriggered
by the code and shouldn't  ave "if (ice_is_aux_ena(pf))" checks.

Thanks

>  	return;

>  

>  err_vsi_rebuild:

> -- 

> 2.14.5

>
Ertman, David M Sept. 8, 2021, 5:30 p.m. UTC | #2
> -----Original Message-----

> From: Leon Romanovsky <leon@kernel.org>

> Sent: Sunday, September 5, 2021 12:24 AM

> To: Yongxin Liu <yongxin.liu@windriver.com>

> Cc: Ertman, David M <david.m.ertman@intel.com>; Saleem, Shiraz

> <shiraz.saleem@intel.com>; Nguyen, Anthony L

> <anthony.l.nguyen@intel.com>; netdev@vger.kernel.org; linux-

> kernel@vger.kernel.org; davem@davemloft.net; Brandeburg, Jesse

> <jesse.brandeburg@intel.com>; intel-wired-lan@lists.osuosl.org;

> kuba@kernel.org

> Subject: Re: [PATCH net] ice: check whether AUX devices/drivers are

> supported in ice_rebuild

> 

> On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:

> > In ice_rebuild(), check whether AUX devices/drivers are supported or not

> > before calling ice_plug_aux_dev().

> >

> > Fix the following call trace, if RDMA functionality is not available.

> >

> >   auxiliary ice.roce.0: adding auxiliary device failed!: -17

> >   sysfs: cannot create duplicate filename '/bus/auxiliary/devices/ice.roce.0'

> >   Workqueue: ice ice_service_task [ice]

> >   Call Trace:

> >    dump_stack_lvl+0x38/0x49

> >    dump_stack+0x10/0x12

> >    sysfs_warn_dup+0x5b/0x70

> >    sysfs_do_create_link_sd.isra.2+0xc8/0xd0

> >    sysfs_create_link+0x25/0x40

> >    bus_add_device+0x6d/0x110

> >    device_add+0x49d/0x940

> >    ? _printk+0x52/0x6e

> >    ? _printk+0x52/0x6e

> >    __auxiliary_device_add+0x60/0xc0

> >    ice_plug_aux_dev+0xd3/0xf0 [ice]

> >    ice_rebuild+0x27d/0x510 [ice]

> >    ice_do_reset+0x51/0xe0 [ice]

> >    ice_service_task+0x108/0xe70 [ice]

> >    ? __switch_to+0x13b/0x510

> >    process_one_work+0x1de/0x420

> >    ? apply_wqattrs_cleanup+0xc0/0xc0

> >    worker_thread+0x34/0x400

> >    ? apply_wqattrs_cleanup+0xc0/0xc0

> >    kthread+0x14d/0x180

> >    ? set_kthread_struct+0x40/0x40

> >    ret_from_fork+0x1f/0x30

> >

> > Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide RDMA")

> > Signed-off-by: Yongxin Liu <yongxin.liu@windriver.com>

> > ---

> >  drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-

> >  1 file changed, 3 insertions(+), 1 deletion(-)

> >

> > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c

> b/drivers/net/ethernet/intel/ice/ice_main.c

> > index 0d6c143f6653..98cc708e9517 100644

> > --- a/drivers/net/ethernet/intel/ice/ice_main.c

> > +++ b/drivers/net/ethernet/intel/ice/ice_main.c

> > @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf, enum

> ice_reset_req reset_type)

> >  	/* if we get here, reset flow is successful */

> >  	clear_bit(ICE_RESET_FAILED, pf->state);

> >

> > -	ice_plug_aux_dev(pf);

> > +	if (ice_is_aux_ena(pf))

> > +		ice_plug_aux_dev(pf);

> > +

> 

> The change is ok, but it hints that auxiliary bus is used horribly wrong

> in this driver. In proper implementation, which should rely on driver/core,

> every subdriver like ice.eth, ice.roce e.t.c is supposed to be retriggered

> by the code and shouldn't  ave "if (ice_is_aux_ena(pf))" checks.

> 

> Thanks


Hi Leon and Liu -

First of all, thanks Liu for tracking this down - it is an issue that needs to be fixed.  The ice_is_aux_ena() functions only
purpose is to determine if this PF supports RDMA functionality.  In probe(), the aux devices are not even initialized if
this test returns false.  If this check fixed the issue for you, the PF you are on does not currently support RDMA.
The bit this test is based on is only set one place in the driver currently - at probe time when we are checking the
capabilities (common_caps) of the device.

That being said, the call to check this should be in ice_plug_aux_dev function itself.  That way it is taken into account for all
attempts to create the auxiliary device.  There is another consideration about disabling RDMA that needs to also needs to be
taken into account to avoid a similar situation.

If it is acceptable, I will create a new patch today and get it out either this afternoon or tomorrow.

Thanks,
Dave E

> 

> >  	return;

> >

> >  err_vsi_rebuild:

> > --

> > 2.14.5

> >
Yongxin Liu Sept. 8, 2021, 7:14 p.m. UTC | #3
> -----Original Message-----

> From: Ertman, David M <david.m.ertman@intel.com>

> Sent: Thursday, September 9, 2021 1:31 AM

> To: Leon Romanovsky <leon@kernel.org>; Liu, Yongxin

> <Yongxin.Liu@windriver.com>

> Cc: Saleem, Shiraz <shiraz.saleem@intel.com>; Nguyen, Anthony L

> <anthony.l.nguyen@intel.com>; netdev@vger.kernel.org; linux-

> kernel@vger.kernel.org; davem@davemloft.net; Brandeburg, Jesse

> <jesse.brandeburg@intel.com>; intel-wired-lan@lists.osuosl.org;

> kuba@kernel.org

> Subject: RE: [PATCH net] ice: check whether AUX devices/drivers are supported

> in ice_rebuild

> 

> > -----Original Message-----

> > From: Leon Romanovsky <leon@kernel.org>

> > Sent: Sunday, September 5, 2021 12:24 AM

> > To: Yongxin Liu <yongxin.liu@windriver.com>

> > Cc: Ertman, David M <david.m.ertman@intel.com>; Saleem, Shiraz

> > <shiraz.saleem@intel.com>; Nguyen, Anthony L

> > <anthony.l.nguyen@intel.com>; netdev@vger.kernel.org; linux-

> > kernel@vger.kernel.org; davem@davemloft.net; Brandeburg, Jesse

> > <jesse.brandeburg@intel.com>; intel-wired-lan@lists.osuosl.org;

> > kuba@kernel.org

> > Subject: Re: [PATCH net] ice: check whether AUX devices/drivers are

> > supported in ice_rebuild

> >

> > On Fri, Sep 03, 2021 at 09:25:00AM +0800, Yongxin Liu wrote:

> > > In ice_rebuild(), check whether AUX devices/drivers are supported or

> > > not before calling ice_plug_aux_dev().

> > >

> > > Fix the following call trace, if RDMA functionality is not available.

> > >

> > >   auxiliary ice.roce.0: adding auxiliary device failed!: -17

> > >   sysfs: cannot create duplicate filename

> '/bus/auxiliary/devices/ice.roce.0'

> > >   Workqueue: ice ice_service_task [ice]

> > >   Call Trace:

> > >    dump_stack_lvl+0x38/0x49

> > >    dump_stack+0x10/0x12

> > >    sysfs_warn_dup+0x5b/0x70

> > >    sysfs_do_create_link_sd.isra.2+0xc8/0xd0

> > >    sysfs_create_link+0x25/0x40

> > >    bus_add_device+0x6d/0x110

> > >    device_add+0x49d/0x940

> > >    ? _printk+0x52/0x6e

> > >    ? _printk+0x52/0x6e

> > >    __auxiliary_device_add+0x60/0xc0

> > >    ice_plug_aux_dev+0xd3/0xf0 [ice]

> > >    ice_rebuild+0x27d/0x510 [ice]

> > >    ice_do_reset+0x51/0xe0 [ice]

> > >    ice_service_task+0x108/0xe70 [ice]

> > >    ? __switch_to+0x13b/0x510

> > >    process_one_work+0x1de/0x420

> > >    ? apply_wqattrs_cleanup+0xc0/0xc0

> > >    worker_thread+0x34/0x400

> > >    ? apply_wqattrs_cleanup+0xc0/0xc0

> > >    kthread+0x14d/0x180

> > >    ? set_kthread_struct+0x40/0x40

> > >    ret_from_fork+0x1f/0x30

> > >

> > > Fixes: f9f5301e7e2d ("ice: Register auxiliary device to provide

> > > RDMA")

> > > Signed-off-by: Yongxin Liu <yongxin.liu@windriver.com>

> > > ---

> > >  drivers/net/ethernet/intel/ice/ice_main.c | 4 +++-

> > >  1 file changed, 3 insertions(+), 1 deletion(-)

> > >

> > > diff --git a/drivers/net/ethernet/intel/ice/ice_main.c

> > b/drivers/net/ethernet/intel/ice/ice_main.c

> > > index 0d6c143f6653..98cc708e9517 100644

> > > --- a/drivers/net/ethernet/intel/ice/ice_main.c

> > > +++ b/drivers/net/ethernet/intel/ice/ice_main.c

> > > @@ -6466,7 +6466,9 @@ static void ice_rebuild(struct ice_pf *pf,

> > > enum

> > ice_reset_req reset_type)

> > >     /* if we get here, reset flow is successful */

> > >     clear_bit(ICE_RESET_FAILED, pf->state);

> > >

> > > -   ice_plug_aux_dev(pf);

> > > +   if (ice_is_aux_ena(pf))

> > > +           ice_plug_aux_dev(pf);

> > > +

> >

> > The change is ok, but it hints that auxiliary bus is used horribly

> > wrong in this driver. In proper implementation, which should rely on

> > driver/core, every subdriver like ice.eth, ice.roce e.t.c is supposed

> > to be retriggered by the code and shouldn't  ave "if (ice_is_aux_ena(pf))"

> checks.

> >

> > Thanks

> 

> Hi Leon and Liu -

> 

> First of all, thanks Liu for tracking this down - it is an issue that needs

> to be fixed.  The ice_is_aux_ena() functions only purpose is to determine if

> this PF supports RDMA functionality.  In probe(), the aux devices are not

> even initialized if this test returns false.  If this check fixed the issue

> for you, the PF you are on does not currently support RDMA.

> The bit this test is based on is only set one place in the driver currently -

> at probe time when we are checking the capabilities (common_caps) of the

> device.

> 

> That being said, the call to check this should be in ice_plug_aux_dev

> function itself.  That way it is taken into account for all attempts to

> create the auxiliary device.  There is another consideration about disabling

> RDMA that needs to also needs to be taken into account to avoid a similar

> situation.

> 

> If it is acceptable, I will create a new patch today and get it out either

> this afternoon or tomorrow.


Thanks Leon and Dave E for your quick response and input.
Please go ahead and I am expecting this issue to be fixed in a more reasonable way.

Thanks,
Yongxin

> 

> Thanks,

> Dave E

> 

> >

> > >     return;

> > >

> > >  err_vsi_rebuild:

> > > --

> > > 2.14.5

> > >
diff mbox series

Patch

diff --git a/drivers/net/ethernet/intel/ice/ice_main.c b/drivers/net/ethernet/intel/ice/ice_main.c
index 0d6c143f6653..98cc708e9517 100644
--- a/drivers/net/ethernet/intel/ice/ice_main.c
+++ b/drivers/net/ethernet/intel/ice/ice_main.c
@@ -6466,7 +6466,9 @@  static void ice_rebuild(struct ice_pf *pf, enum ice_reset_req reset_type)
 	/* if we get here, reset flow is successful */
 	clear_bit(ICE_RESET_FAILED, pf->state);
 
-	ice_plug_aux_dev(pf);
+	if (ice_is_aux_ena(pf))
+		ice_plug_aux_dev(pf);
+
 	return;
 
 err_vsi_rebuild: