Commit graph

1363 commits

Author SHA1 Message Date
John Baldwin
367fe30818 cxgbe: Remove most uses of sysctl_wire_old_buffer
Most of these sysctls don't call sbuf_* while holding any locks.  Of
the ones that do hold locks, all but one can be fixed to drop the lock
before calling sbuf_*.

Reviewed by:	np
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D45186

(cherry picked from commit 61089df8147eb7109696476c891514296d543bad)
2024-11-30 08:55:57 -05:00
John Baldwin
e42a182bb7 cxgbe: Add a 'show t4 memdump' DDB command
Reviewed by:	np
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D45187

(cherry picked from commit 1edf61f395d98a5abb7d6f38e5e64722b4ddf351)
2024-10-21 11:49:27 -07:00
Navdeep Parhar
42d8f6b759 cxgbe(4): Use correct synchronization when marking the adapter offline.
adapter->flags are guarded by a synch_op, as noted in the comment in
adapter.h where the flags are defined.

Fixes:	5241b210a4e cxgbe(4): Basic infrastructure for ULDs to participate in adapter reset.
Sponsored by:	Chelsio Communications

(cherry picked from commit 52e5a66eac22d24e29f204c5b65232378e71f4cf)
2024-10-21 10:15:33 -07:00
Navdeep Parhar
0eea8754ae cxgbe(4): Allow t4_tom to be unloaded safely.
* Disable IFCAP_TOE automatically on all ifnets on all adapters during
  unload.  This is user-friendly and avoids panics due to stale ifnet
  state after t4_tom is unloaded.
* Do not allow unload if tids are in use by the TOE on any adapter.

Reported by:	Bimal Abraham @ Chelsio
Sponsored by:	Chelsio Communications

(cherry picked from commit 9ba8670a8b175de79ea087688f51595b4f2db862)
2024-10-21 10:15:11 -07:00
Navdeep Parhar
48473a98a9 cxgbe/t4_tom: Remove duplicate unlock in t4_tom_deactivate.
Fixes:	c1c524852f62 cxgbe/t4_tom: Implement uld_stop and uld_restart for ULD_TOM.
Sponsored by:	Chelsio Communications

(cherry picked from commit cc110bbec6d23d8cff47733704c71de641d0c8e8)
2024-10-21 10:14:56 -07:00
Navdeep Parhar
b9f7a09c49 cxgbe(4): Clobber all tracer state on stop and redo only traceq on restart.
Tracers have to be recreated after a restart but that's okay given that
they are used for debugging only.

Sponsored by:	Chelsio Communications

(cherry picked from commit ee3da604dd016439850dae77366796313e60f0e0)
2024-10-21 10:14:39 -07:00
Navdeep Parhar
b56fe21c6d cxgbe/t4_tom: Do not set unresolved entries to STALE in t4_l2t_update.
An L2 entry in the driver's hash was marked STALE unconditionally if it
changed in the kernel when its driver refcount is 0.  Fix the driver to
do this for VALID entries only.

Sponsored by:	Chelsio Communications

(cherry picked from commit 3883300afe0bff5c5658274c4d8cfe773d08343e)
2024-10-21 10:14:23 -07:00
Navdeep Parhar
2c9c295f5e cxgbe/t4_tom: completely avoid L2T entries during stop/suspend.
1. Mark the L2T entry valid only if t4_write_l2e succeeds, which won't
   happen if the adapter is stopped.  This prevents L2T entries from
   sometimes getting (re)promoted to VALID on Tx activity during stop.
2. Discard a work request immediately instead of enqueueing it to the
   arp queue if the adapter is stopped.

Fixes:	c1c524852f62 cxgbe/t4_tom: Implement uld_stop and uld_restart for ULD_TOM.
Sponsored by:	Chelsio Communications

(cherry picked from commit 07f47e8850d0639d474026b203013072aeb32c81)
2024-10-21 10:14:04 -07:00
Navdeep Parhar
261c616077 cxgbe(4): Make sure that the tracing filters are usable after restart.
The destination queue for tracing filters is destroyed during stop or
suspend and the software state needs to reflect this.  A new destination
queue will be setup when the adapter resumes operation.

Sponsored by:	Chelsio Communications

(cherry picked from commit d82cb5c608ae9aaced4cb459ca511e62c64dbf74)
2024-10-21 10:13:47 -07:00
Navdeep Parhar
610ff48406 cxgbe/t4_tom: Demote STALE L2 table entries to RESOLVING in uld_stop.
The STALE state means the L2T entry is valid in hardware but needs to be
refreshed (ARP/NDP) in software.  But stop/suspend wipes the hardware
L2T and STALE entries need to be updated just like VALID entries to match
actual hardware state.

Fixes:	c1c524852f62 cxgbe/t4_tom: Implement uld_stop and uld_restart for ULD_TOM.
Sponsored by:	Chelsio Communications

(cherry picked from commit 171e57967b3e53f0fb48116df5003ce17163295c)
2024-10-21 10:09:52 -07:00
Navdeep Parhar
b5c0843322 cxgbe/t4_tom: Add synq entry to the list before calling send_synack.
This fixes a panic where the peer's ack to the synack arrives on a
different queue and do_pass_establish tries to remove the synqe from
synqe_list before it has been added by do_pass_accept_req.

Reported by:	Sony Arpita Das @ Chelsio
Fixes:	283333c0e329 cxgbe/t4_tom: Track all synq entries in a per-adapter list.
Sponsored by:	Chelsio Communications

(cherry picked from commit 674cbf38f6d0a0b307e52c4265da9f077606b035)
2024-10-21 10:09:37 -07:00
Navdeep Parhar
3301201fc9 cxgbe(4): Always report link-down on an abrupt stop.
This fixes a regression in 5241b210a4e1 where the driver stopped
reporting link down after a fatal error unless t4_reset_on_fatal_err was
also set.

Fixes:	5241b210a4e1 cxgbe(4): Basic infrastructure for ULDs to participate in adapter reset.
Sponsored by:	Chelsio Communications

(cherry picked from commit 7aeec281b8b2620d7d88c9abdfadd7e20a485b9c)
2024-10-21 10:09:14 -07:00
Navdeep Parhar
c7550138e5 cxgbe/iw_cxgbe: Fail early in some callbacks when the RNIC is stopped.
Stop allocating new resources when the RNIC is stopped but continue to
allow previously allocated resources to be freed.  Note that t4_tom's
uld_stop tears down all TOE connections, including those being used for
iWARP, and that triggers the cleanup of iWARP resources.

Fail post_send/post_recv early too to avoid the SQ doorbell.

Sponsored by:	Chelsio Communications

(cherry picked from commit 9fdb683d92b36cbd20bbd8d61f0c1138f8348dd4)
2024-10-21 10:08:56 -07:00
Navdeep Parhar
0366d0bd50 cxgbe/iw_cxgbe: c4iw_connect should return a negative errno.
Avoid a pointless assignment while here.

Sponsored by:	Chelsio Communications

(cherry picked from commit 3b4dcc0feb41120017381681d612d3cc21beb1d7)
2024-10-21 10:08:27 -07:00
Navdeep Parhar
11dc811bb8 cxgbe/iw_cxgbe: Tidy up a couple of CTRs in c4iw_create_listen.
backlog is an int and not a string.  While here, fix an adjacent CTR
that was spread over two lines even though it fits in one.

Sponsored by:	Chelsio Communications

(cherry picked from commit 3f250bb6f05b57890215398767bbb8aa00c888f3)
2024-10-21 10:08:09 -07:00
Navdeep Parhar
cd2ae69629 cxgbe/iw_cxgbe: Replace the fatal error flag with a stopped flag.
Now that suspend/resume is supported by the base driver, a fatal error
isn't the only reason that the RNIC can stop abruptly.  Also, this state
is no longer permanent as it's possible to resume operations after a
stop.  Rename the flag and associated routines to match the new state of
affairs.

Sponsored by:	Chelsio Communications

(cherry picked from commit 8254a276ad893ae2a1b35fcbbad255f06e29b8c6)
2024-10-21 10:07:49 -07:00
Navdeep Parhar
a7b24758d0 cxgbe(4): Export the core suspend/resume functionality to other modules.
Sponsored by:	Chelsio Communications

(cherry picked from commit 1e584ca30e3a97eedf37368e7ad75c31f08be039)
2024-10-21 10:07:20 -07:00
Navdeep Parhar
ccd65f200c cxgbe(4): reset routine for general use.
Add a reset_adapter wrapper that picks the most suitable reset routine
internally.  Use it in the fatal error handler as well as the sysctl
based reset.

Sponsored by:	Chelsio Communications

(cherry picked from commit d668a0b0abe4b079ac2a0d9cd0d7b71bd4a043c6)
2024-10-21 10:06:40 -07:00
Navdeep Parhar
888858a7bf cxgbe(4): Stop work request queues in a reliable manner.
Clear the EQ_HW_ALLOCATED flag with the wrq lock held and discard all
work requests, pending or new, when it's not set.

Sponsored by:	Chelsio Communications

(cherry picked from commit 0a9d1da6e6cede5e9c0ff63240d724049ad72b5b)
2024-10-21 10:06:19 -07:00
Navdeep Parhar
4016225138 cxgbe/iw_cxgbe: Fix typo in assertion.
eanbled -> enabled

(cherry picked from commit b5332809c633e7e37715f7823a8a8ee9799910a4)
2024-10-21 10:05:56 -07:00
Navdeep Parhar
3b600c38d8 cxgbe/t4_tom: bugfixes in stop/restart.
1. Remove toepcb from the toep_list on active open failure.
2. Purge the wr_list for an L2T entry on an adpater stop.

Fixes:	c1c524852f62 cxgbe/t4_tom: Implement uld_stop and uld_restart for ULD_TOM.
Sponsored by:	Chelsio Communications

(cherry picked from commit fef0e39f64a1db796ded8777dbee71fc287f6107)
2024-10-21 10:05:39 -07:00
Navdeep Parhar
788088340c cxgbe/iw_cxgbe: Add a placeholder uld_restart.
Sponsored by:	Chelsio Communications

(cherry picked from commit f1c4ed150334ae2844810a58b7384cb0e3abe664)
2024-10-21 10:03:34 -07:00
Navdeep Parhar
b836830270 cxgbe/t4_tom: Implement uld_stop and uld_restart for ULD_TOM.
This allows the adapter to be suspended or reset even when stateful TOE is
active, in some limited configurations.

The LLD has already stopped the adapter hardware and all its queues by the time
these ULD routines get called.  The general approach in t4_tom is to purge the
lookup tables immediately so that they are ready for operation by the time the
adapter resumes, and park all the resources left hanging by the stopped hardware
into separate "stranded" queues that can be dealt with at leisure.

Outstanding active opens, live connections, and synq entries (for connections in
the middle of the 3-way handshake) are all treated as if the hardware had
reported an abrupt error for the tid.  The servers/listeners are a bit different
in that no error is reported.  They're just noted as non-functional when the
hardware stops and are recreated by the driver during restart.

Sponsored by:	Chelsio Communications

(cherry picked from commit c1c524852f625cf5f420653f7850d1fe3ff6b4ca)
2024-10-21 10:03:14 -07:00
Navdeep Parhar
fcda529bf0 cxgbe/t4_tom: Track all synq entries in a per-adapter list.
Live tid entries in tid_tab are either full fledged connections or synq
entries.  toep_list tracks the connections already and this change adds
a synqe_list to track the synq entries.  These two lists can be used to
enumerate and iterate over all live tids.

Sponsored by:	Chelsio Communications

(cherry picked from commit 283333c0e329fd7aceff16fa3bf2b9892744d883)
2024-10-21 10:02:37 -07:00
Navdeep Parhar
da45eba6d1 cxgbe(4): Stop and restart the L2T allocator with the LLD.
L2T entries are used by both filters and TOE and the L2T is shared
between the base driver (LLD) and the TOM ULD.  Add a flag to indicate
that the L2T is stopped, which means:
* t4_alloc_l2e and t4_l2t_alloc_switching will not allocate new entries.
* t4_tom will ignore all ARP/NDP updates from the kernel.
* Previously allocated L2T entries can still be freed.

Sponsored by:   Chelsio Communications

(cherry picked from commit cd93fdee5c8bbdb00d10f8a1fa43f30f151a1ef7)
2024-10-21 10:02:17 -07:00
Navdeep Parhar
fb313ad44d cxgbe(4): Stop and restart the atid allocator with the LLD.
atids are used by both filters and TOE and the atid table is in the base
driver (LLD).  New atids cannot be allocated when the allocator is
stopped but existing ones can still be freed.  It is expected that the
owners of outstanding atids will release them in their own stop
processing, before the adapter is restarted.

Sponsored by:	Chelsio Communications

(cherry picked from commit fd3aca5b41968421f243b760ac5733f39f25cc56)
2024-10-21 10:01:50 -07:00
Navdeep Parhar
41e9881bf7 cxgbe(4): Basic infrastructure for ULDs to participate in adapter reset.
The suspend/resume/reset implementation in the base driver (LLD)
currently works when only stateless features are in use.  This commit
adds basic infrastructure for stateful upper layer drivers (ULDs) to
participate in suspend/resume/reset.

* Add a uld_restart to indicate that the adapter has been restarted
  after a stop and the ULD should resume operations.
* Move the existing functionality in t4_suspend/t4_resume to stop_lld and
  restart_lld.  Use these and the new uld restart routines everywhere the
  adapter has to be stopped abruptly and restarted, namely:
  1. PCIE bus suspend/resume/reset methods invoked by the kernel.
  2. Manual internal-reset using driver sysctl.
  3. Automatic internal-reset on a fatal error.
* Implement an alternate internal-reset for use in VMs and for testing.

Typical reset sequence is:
stop_adapter(sc);
stop_lld(sc);
stop_all_uld(sc);
set_adapter_hwstatus(sc, false);

/* hw reset takes place here. */

restart_adapter(sc);
restart_lld(sc);
  set_adapter_hwstatus(sc, true);
restart_all_uld(sc);

Sponsored by:	Chelsio Communications

(cherry picked from commit 5241b210a4e1029f3005faf82cd74965645c401b)
2024-10-21 10:01:06 -07:00
Navdeep Parhar
4f1b1077fb cxgbe(4): Changes to ULD list management.
* Convert t4_uld_list to an array.  There will be at most 3 items in the
  list and it's simpler to track them in an array with a fixed slot for
  each ULD.
* There is no need to refcount ULDs so stop doing that.
* Add uld_ prefix to all members of uld_info.
* Rename async_event to uld_stop to match its actual purpose.  Call it
  for all ULDs and not just ULD_IWARP.

Reviewed by:	jhb
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D46029

(cherry picked from commit cf5e6370f15cffabbbf508083ba7d48ec8abfa79)
2024-10-21 10:00:26 -07:00
Navdeep Parhar
85a5adf4ed cxgbe(4): Make no assumptions about the start and size of the L2T table.
Do not assume that the table starts at index 0 and is typically 4K in
size.  The only thing the driver needs to verify is that its use of
F_SYNC_WR doesn't collide with the L2T hwidx range.

Reviewed by:	jhb
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D46028

(cherry picked from commit cfcfd3c7bf5b60da42b13ac5d8085c762613c302)
2024-10-21 09:46:30 -07:00
Navdeep Parhar
e867a25031 cxgbe/t4_tom: Free up hardware resources when the final CPL is received.
Final CPL means the tid is done in the hardware and other resources
associated with it can be freed right away.  There is no need to wait
for the kernel to detach the toepcb.

Reviewed by:	jhb
MFC after:	1 week
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D45991

(cherry picked from commit 27479403a764cf3b97194887a1f819c1e35357aa)
2024-10-21 09:46:30 -07:00
Navdeep Parhar
c18966b3dc cxgbe/t4_tom: Detach the toep from the tcpcb when entering TIME_WAIT.
The kernel used to call tod_pcb_detach when entering TIME_WAIT but that
seems to have changed, likely with the TIME_WAIT overhaul in the kernel
some time ago.  Catch up by having the driver perform the detach.

The hardware does not handle TIME_WAIT so it's important to detach and
let the kernel arm the 2MSL timer to deal with it.

Reported by:	Sony Arpita Das @ Chelsio
Reviewed by:	jhb
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D45990

(cherry picked from commit bbc326241d91ab2cee2ec2c5c0aa8a906480132f)
2024-10-21 09:46:29 -07:00
Navdeep Parhar
032bd06d98 cxgbe(4): Tidy up t4_l2t.h
Align the comment for if_t with the rest and remove the declaration of
t4_l2t_set_switching, which was removed in 061bbaf7e7.

No functional change.

Sponsored by:	Chelsio Communications

(cherry picked from commit b3bbc6cc79a3b76d342b2a8fd68e5266eb50ff11)
2024-10-21 09:46:29 -07:00
Zhenlei Huang
09ed34449c cxgbe(4): Stop checking for failures from malloc/mb_alloc_ext_pgs(M_WAITOK)
MFC after:	1 week
Differential Revision:	https://reviews.freebsd.org/D45852

(cherry picked from commit 955b380365af174b3d35905b7b8afae97506a0bd)
2024-09-30 12:44:21 +08:00
Mark Johnston
aa937aa8c5 cxgbe: Flush transmitted packets more regularly in netmap mode
Previously, when transmitting short runs of packets via cxgbe_nm_tx(),
we would wait until a large number of packets were buffered before
scheduling a task to clean transmit buffers.

Obtained from:	np

(cherry picked from commit 6af3d59962449c66688361598163dd174cc5fb2a)
2024-08-08 15:55:48 +00:00
Navdeep Parhar
60bf442ee2 cxgbe(4): Do not report link state change during interface detach.
This fixes a panic when multiple VIs are configured on an interface and
only the non-primary VI is up at the time of driver detach.  The problem
was that the driver would queue a link state change notification for an
interface about to be freed.

To reproduce the panic, add "hw.cxgbe.num_vis=2" to loader.conf and

 # kldload if_cxgbe
 # ifconfig vcc0 up
 # devctl detach t6nex0

  trap 0x9, rip = 0xffffffff8107db70, rsp = 0xfffffe0055263d60, rbp = 0xfffffe0055263dd0
  taskqueue_run_locked() at taskqueue_run_locked+0x2a0/frame 0xfffffe0055263dd0
  taskqueue_run() at taskqueue_run+0x72/frame 0xfffffe0055263df0
  taskqueue_swi_run() at taskqueue_swi_run+0x18/frame 0xfffffe0055263e10
  intr_event_execute_handlers() at intr_event_execute_handlers+0x249/frame 0xfffffe0055263e50
  ithread_execute_handlers() at ithread_execute_handlers+0x9e/frame 0xfffffe0055263e70

Reviewed by:	jhb
Sponsored by:	Chelsio Communications
Differential Revision:	https://reviews.freebsd.org/D45864

(cherry picked from commit dc20d49aa939caea365cbdf0341b00de69253be4)
2024-07-16 23:40:13 -07:00
Navdeep Parhar
8d214aa2c1 cxgbe(4): Consolidate all mk_set_tcb_field_ulp in one place.
Sponsored by:   Chelsio Communications

(cherry picked from commit 64a00f877fc23d904d5f4ca00471e09954eb9381)
2024-07-16 23:39:43 -07:00
Navdeep Parhar
307639c1e9 cxgbe(4): sc->port is indexed by port_id and not tx_chan.
Sponsored by:	Chelsio Communications

(cherry picked from commit cca3506dc15464baf9b09b143879ed13fb5684c7)
2024-07-16 23:39:32 -07:00
Navdeep Parhar
29791662e9 cxgbe/tom: Fix the rx channel selection in options2.
This affects TOE operation when multiple rx c-channels are in use for
offload, which is an unusual configuration.

Sponsored by:	Chelsio Communications

(cherry picked from commit c6c6d4aff90da83a292b4c2bbbe1f4d6e01cd82e)
2024-07-16 23:39:13 -07:00
Navdeep Parhar
c0b69999b1 cxgbe(4): Query TPCHMAP once and not once per port.
Sponsored by:	Chelsio Communications

(cherry picked from commit 7f10048f983a31c09587b436e7fb073f7cbb76e4)
2024-07-16 23:38:57 -07:00
Navdeep Parhar
0ca32f02de cxgbe(4): Rename rx_c_chan to rx_chan.
It is the equivalent of tx_chan but for receive so rx_chan is a better
name.  Initialize both using helper functions and make sure both are
displayed in the sysctl MIB.

Sponsored by:	Chelsio Communications

(cherry picked from commit 480ff89c67b25113515018cdcd13179229b4a0d3)
2024-07-16 23:38:40 -07:00
Navdeep Parhar
5af25b6e4d cxgbe(4): Minor tweaks to comments.
No functional change intended.

Sponsored by:	Chelsio Communications

(cherry picked from commit 21aba396551e29808fa56a6d4fab17e871c3524f)
2024-07-16 23:38:18 -07:00
Navdeep Parhar
67faf45be5 cxgbe(4): Initialize mps_bg_map to an invalid value if it's not known.
Sponsored by:	Chelsio Communications

(cherry picked from commit 9de0036b55dcdabc54fa69ea6e583b22d739818c)
2024-07-16 23:38:00 -07:00
Navdeep Parhar
92383a2c84 cxgbe(4): Do not read hardware registers to determine the number of ports.
PORTVEC obtained from the firmware is the authoritative source of this
information, and nports (calculated from PORTVEC) is available by the
time t4_port_init runs.

Sponsored by:	Chelsio Communications

(cherry picked from commit 4d1362cdc7375984a48f5f0048b1fe909524d21d)
2024-07-16 23:37:39 -07:00
Navdeep Parhar
e5efc8cf5d cxgbe(4): Allocate a taskqueue per port instead of per channel.
All the channels are not used on all boards and there's no point
allocating taskqueues that will never be used.

Sponsored by:	Chelsio Communications

(cherry picked from commit 857d74b6340e418396d79a46b264ce0eedd760e4)
2024-07-16 23:37:25 -07:00
Navdeep Parhar
a238d1b045 cxgbe(4): Shared code update to deal with partial failure in query_params.
Obtained from:	Chelsio Communications
Sponsored by:	Chelsio Communications

(cherry picked from commit 2965ae59144177eb3570721295da9cbc447f1105)
2024-07-16 23:37:03 -07:00
Navdeep Parhar
b67afd194c cxgbe(4): Reword the comment explaining the atid/cookie split.
Avoid a magic constant while here.  No functional change intended.

Sponsored by:	Chelsio Communications

(cherry picked from commit 43f6f08488046788b0ad66e9a5119f36e5de71ab)
2024-07-16 23:36:39 -07:00
Navdeep Parhar
d80e7fca01 cxgbe(4): Add missing description for a port type.
Sponsored by:	Chelsio Communications

(cherry picked from commit f0ee6112eb61a5130acedd20d9827627c0975fba)
2024-07-16 23:36:16 -07:00
Navdeep Parhar
6bc34cdd96 cxgbe(4): Retire t4_intr_clear.
The firmware clears the interrupts already and it has a better idea of
exactly what to clear for which generation of the ASIC.  There is no
need for the driver to get involved.

Sponsored by:	Chelsio Communications

(cherry picked from commit 1c7f9c8b4673abf3723be09afed4443261e0d186)
2024-07-16 23:35:55 -07:00
Navdeep Parhar
41309c878d cxgbe(4): Add a helper function to locate MPS/MAC registers.
These register blocks are at different locations in different chips.

Sponsored by:   Chelsio Communications

(cherry picked from commit b59c5d97edf17525405d95b1f5746c4a79a9c7c4)
2024-07-16 23:35:30 -07:00
Navdeep Parhar
1089f5755e cxgbe(4): Remove tx_modq lookup table.
The driver always uses the same modulation queue as the channel and the
table is unnecessary.

Sponsored by:   Chelsio Communications

(cherry picked from commit f76effed14b25bfa0c47b10f6d8a076104c48d94)
2024-07-16 23:27:14 -07:00