tg3: driver sleeps indefinitely when EEH errors exceed eeh_max_freezes
authorDavid Christensen <drc@linux.vnet.ibm.com>
Wed, 17 Jun 2020 18:51:17 +0000 (11:51 -0700)
committerSasha Levin <sashal@kernel.org>
Tue, 30 Jun 2020 19:38:38 +0000 (15:38 -0400)
[ Upstream commit 3a2656a211caf35e56afc9425e6e518fa52f7fbc ]

The driver function tg3_io_error_detected() calls napi_disable twice,
without an intervening napi_enable, when the number of EEH errors exceeds
eeh_max_freezes, resulting in an indefinite sleep while holding rtnl_lock.

Add check for pcierr_recovery which skips code already executed for the
"Frozen" state.

Signed-off-by: David Christensen <drc@linux.vnet.ibm.com>
Reviewed-by: Michael Chan <michael.chan@broadcom.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
drivers/net/ethernet/broadcom/tg3.c

index c069a04a6e7e2c4d766e9a59291f94ad81973efe..5790b35064a8d8f561f4f607b78674a90a322171 100644 (file)
@@ -18174,8 +18174,8 @@ static pci_ers_result_t tg3_io_error_detected(struct pci_dev *pdev,
 
        rtnl_lock();
 
-       /* We probably don't have netdev yet */
-       if (!netdev || !netif_running(netdev))
+       /* Could be second call or maybe we don't have netdev yet */
+       if (!netdev || tp->pcierr_recovery || !netif_running(netdev))
                goto done;
 
        /* We needn't recover from permanent error */