Collected changes for Jan 2022 release: Octarina #1463

eugeneia · 2021-11-18T12:47:09Z

PR to collect changes to be included in the upcoming Octarina release.

Included:

Partial implementation of the initialization procedure.

Complete debug messages have become a little overwhelming now that we are allocating thousands of pages of memory for the adapter. Just for the moment disabling the hexdumps is the more sensible default. More fine-grained debug logging is likely needed.

Refactored the error checking to always be done when posting a command to the command queue. Previously this was a manual step for each command and that seems more error prone.

Now successfully: - Providing boot memory to the adapter (6 pages) - Querying adapter capabilities (current and maximum) - Setting adapter capabilities (keep current) - Providing init memory to the adapter (4232 pages !) The output from the init sequence looks like this: TRACE Read the initialization segment TRACE Write the physical location of the command queues to the init segment. TRACE Wait for the 'initializing' field to clear fw_rev 14 12 1220 cmd_interface_rev 5 cmdq_phy_addr cdata<void *>: 0x1f000000 log_cmdq_size 5 log_cmdq_stride 6 ready true nic_interface_supported true internal_timer 2.0108995831647e+14 health_syndrome 0 Command: ENABLE_HCA Command: QUERY_ISSI cur_issi = 0 sup_issi = 01 Command: QUERY_PAGES query_pages'boot' 6 Command: MANAGE_PAGES Command: QUERY_HCA_CAP Command: QUERY_HCA_CAP Capabilities - current and (maximum): eth_net_offloads = 0 (0) end_pad = 1 (1) cq_eq_remap = 1 (1) device_frequency_mhz = 275 (275) log_max_vlan_list = 12 (12) log_min_stride_sz_rq = 0 (0) log_max_klm_list_size = 16 (16) log_max_rqt = 0 (0) log_max_l2_table = 16 (16) log_max_current_uc_list = 10 (10) log_min_stride_sz_sq = 0 (0) log_uar_page_sz = 0 (8) log_max_wq_sz = 0 (0) log_max_current_mc_list = 14 (14) log_max_msg = 30 (30) log_max_stride_sz_rq = 0 (0) max_flow_counter = 0 (0) log_max_eq_sz = 22 (22) log_max_rqt_size = 0 (0) basic_cyclic_rcv_wqe = 0 (0) cache_line_128byte = 0 (0) max_tc = 0 (0) cmdif_checksum = 0 (3) driver_version = 0 (0) log_max_tis = 0 (0) port_type = 1 (1) wq_signature = 1 (1) log_max_tir = 0 (0) max_indirection = 4 (4) log_max_rq = 0 (0) cq_resize = 1 (1) cq_oi = 1 (1) cq_moderation = 1 (1) log_max_pd = 24 (24) log_max_mkey = 24 (24) log_max_transport_domain = 0 (0) rc = 1 (1) num_ports = 1 (1) bf = 1 (1) vport_counters = 1 (1) log_max_eq = 8 (8) pad_tx_eth_packet = 0 (0) log_pg_sz = 12 (12) uar_sz = 5 (5) cq_period_start_from_cqe = 1 (1) uc = 1 (1) log_max_mrw_sz = 64 (64) log_max_cq = 24 (24) vport_group_manager = 1 (1) log_max_tis_per_sq = 0 (0) start_pad = 0 (0) log_max_cq_sz = 22 (22) nic_flow_table = 0 (0) scqe_break_moderation = 1 (1) ud = 1 (1) log_max_sq = 0 (0) cqe_version = 0 (0) log_bf_reg_size = 9 (9) sctr_data_cqe = 1 (1) log_max_rmp = 0 (0) cqe_version = 0 (0) log_bf_reg_size = 9 (9) sctr_data_cqe = 1 (1) log_max_rmp = 0 (0) log_max_stride_sz_sq = 0 (0) imaicl = 0 (0) xrc = 1 (1) Command: SET_HCA_CAP Command: QUERY_PAGES query_pages'init' 4232 Command: MANAGE_PAGES Command: INIT_HCA

Pulls in more of the initialization procedure. See especially commit 4575dc7.

Added QUERY_VPORT_STATE, MODIFY_VPORT_STATE, QUERY_NIC_VPORT_CONTEXT. Note: I am not sure that these commands are actually needed since we are not using SR-IOV. The PRM mandates using some VPORT commands but I don't see them in the trace from the Linux mlx5 driver. So we may be able to remove this code.

Completed Mellanox initialization sequence

Required argument for new code merged from master in v2016.06. Request exclusive lock on the device.

This commit introduces a clean and working version of the device initialization.

The Mellanox driver is now split into two apps: 'ConnectX4' to setup the NIC and 'IO' to attach to a queue pair.

Previously a single IO app instances could be supported with multiple send and receive queues. The intention was to share traffic amongst the queues in order to overcome NIC hardware limitations on per-queue packet rate. The problem with that approach is that splitting traffic across queues potentially changes the relative ordering of packets and this potentially creates problems for the user. Better to avoid "DWIM" and have the user arrange multiple IO apps in an application-appropriate way if indeed they do need to overcome the ~16Mpps per-queue limit on the ConnectX-4 (100G).

Now also explicitly sets promiscuous mode on the ethernet port.

The selftest function, while still very simplistic, now presents its output more neatly and checks that counter values match expectations. The test now uses unicast traffic (need add additional tests to check multicast and broadcast and ...). Example output: selftest: waiting for both links up Links up. Sending 10,000,000 packets. hardware counter 02:00.0 82:00.0 ---------------- -------------------- -------------------- rx_bcast_octets 0 0 rx_bcast_packets 0 0 rx_error_octets 0 0 rx_error_packets 0 0 rx_mcast_octets 0 0 rx_mcast_packets 0 0 rx_ucast_octets 1,000,000,000 1,000,000,000 rx_ucast_packets 10,000,000 10,000,000 tx_bcast_octets 0 0 tx_bcast_packets 0 0 tx_error_octets 0 0 tx_error_packets 0 0 tx_mcast_octets 0 0 tx_mcast_packets 0 0 tx_ucast_octets 1,000,000,000 1,000,000,000 tx_ucast_packets 10,000,000 10,000,000 selftest: ok

Enabled "overflow ignore" on the receive completion queue so that we do not need to acknowledge completion entries. This should be safe because the completion queue and the receive queue are the same size i.e. each receive queue entry will have a separate completion entry with no collisions.

Fix the opcode for DISABLE_HCA (currently unused), add assert to alloc_pages() method.

Add the same set of counters supported by the Intel driver. Add a pull() method to the ConnextX4 object to periodically sync the stats so the "control" app can be run as a regular Snabb app.

Polling for command completion in sync_stats() creates too much latency and causes packet drops. This is mitigated by creating separtate HCAs for each command that reads a statistics register and perfom polling for completion only in sync_stats().

...and multiple switch() configs

# Conflicts: # src/apps/xdp/xdp.lua

(for new ICMP, ARP, and NDP counters)

Release 2022.01.13

…ctions Recent changes ([1], [2], [3]) exposed some previously internal variables and functions from core.packet. This patch restores local bindings for those within core.packet in order to maintain performance. [1] 6174563 (#1450) [2] ee4e42d (#1450) [3] 9f0694f (#1288)

lwAFTR introduced a way to run applications with vmprofile disabled, in a way that branched on engine.vmprofile_enabled on the fast path. This patch cleans this up to only disable the profiler timer, and run setvmprofile unconditionally in both cases. Also removes the duplicate/unused/ineffectual profile option from lib.ptree.worker (profiling can be disabled via lib.scheduling).

…ina-regressions

Fixes for regressions in max-next headed for Octarina release

lukego and others added 30 commits June 8, 2016 08:59

Merge branch 'master' into mellanox

87191fe

Merge branch 'mellanox-open-step' into mellanox

e77b579

Partial implementation of the initialization procedure.

connectx4: Set debug = false

13e6550

Complete debug messages have become a little overwhelming now that we are allocating thousands of pages of memory for the adapter. Just for the moment disabling the hexdumps is the more sensible default. More fine-grained debug logging is likely needed.

connectx4: Remove stray print statements

cd8e30e

connectx4: Move checkstatus() into post()

a285999

Refactored the error checking to always be done when posting a command to the command queue. Previously this was a manual step for each command and that seems more error prone.

Merge branch 'mellanox-open-step' into mellanox

f87817f

Pulls in more of the initialization procedure. See especially commit 4575dc7.

connectx4: Comment on ommitted SET_DRIVER_VERSION

8c0be9d

connectx4: Added CREATE_EQ (event queue) command

f196ac7

Merge pull request #943 from lukego/mellanox-open-step

21e2507

Completed Mellanox initialization sequence

Merge tag 'v2016.06' into mellanox

5812688

connectx4: Give 'lock' argument to map_pci_memory

51d4d45

Required argument for new code merged from master in v2016.06. Request exclusive lock on the device.

connectx4: Clean driver initialization

7659eb6

This commit introduces a clean and working version of the device initialization.

Merge branch 'snabbco/master' (v2016.10) into mellanox

ade9645

connectx4: Refactored for multiprocess operation

21d0dc3

The Mellanox driver is now split into two apps: 'ConnectX4' to setup the NIC and 'IO' to attach to a queue pair.

connectx4: Add 'mtu' paramter (default 9500)

ffe4a22

Now also explicitly sets promiscuous mode on the ethernet port.

Merge tag 'v2018.01.1' into mellanox-connectx5

c5a493e

Cherry-picked apps.melanox from lukego/debug100g

48f6080

Uncommited changes from lukego

4474000

connectx4.lua: minor changes

580cf9b

Fix the opcode for DISABLE_HCA (currently unused), add assert to alloc_pages() method.

connectx4.lua: fix alignment of HCA command queue

10ceea4

connectx4.lua: fix RSS

1b9cf2b

Register ConnectX5 device in lib.hardware.pci

8332671

connectx4.lua: identify driver for device_info

263919a

connectx4.lua: add statistics counters

d57c67a

Add the same set of counters supported by the Intel driver. Add a pull() method to the ConnextX4 object to periodically sync the stats so the "control" app can be run as a regular Snabb app.

eugeneia added 16 commits December 6, 2021 13:19

apps.mellanox.connectx_test: exercise stop()...

ad4c941

...and multiple switch() configs

apps.mellanox.connectx_test: add match test

d36174e

apps.mellanox: add README

bf1322e

Merge branch 'mellanox-2021' into lwaftr-dev-2021-Q4-softwire-v3

a0117e8

lib.hardware.pci: add mellanox MT2892 Family [ConnectX-6 Dx] model

22f950c

Merge branch 'mellanox-2021' into lwaftr-dev-2021-Q4-softwire-v3

91d9ee0

apps.xdp: only run rxtx_match selftest if SNABB_XDP_NQUEUES=1

ffb0ba9

apps.xdp: mark stop() NYI / not supported

1a5f7c4

Merge PR #1450 (driver for Linux AF_XDP sockets)

928c456

Merge PR #1454 (Add filter option to XDP app) into max-next

f6db445

Merge PR #1461 (Driver for Mellanox Connect-X 4/5/6) into max-next

bcf7d3c

Merge PR #1464 (Tracing JIT stability improvements) into max-next

660e4c8

Merge remote-tracking branch 'snabbco/max-next' into lwaftr-rc-2022.01

64b7c8d

# Conflicts: # src/apps/xdp/xdp.lua

lwAFTR 2022.01.13

7b0d35b

apps.ipv4.arp: fix selftest

1531f37

lwaftr: update end-to-end test data

3f3c40c

(for new ICMP, ARP, and NDP counters)

eugeneia mentioned this pull request Jan 13, 2022

Release 2022.01.13 Igalia/snabb#1243

Merged

eugeneia and others added 2 commits January 13, 2022 15:58

Merge pull request #1243 from eugeneia/lwaftr-rc-2022.01

90b07e3

Release 2022.01.13

Merge PR #1465 (upstream lwAFTR 2022.01.13) into max-next

2d7341e

eugeneia changed the title ~~Collected changes for 2021 release: Octarina~~ Collected changes for Jan 2022 release: Octarina Jan 13, 2022

eugeneia and others added 8 commits January 18, 2022 13:39

Fixup previous commit: remove unused profile option in lib.ptree.worker

52048a9

Fix regression in config-migrations

862af2c

lwaftr migrate-configuration: support snabb-softwire-v2, fix bugs

992a845

snabbnfv: do not load ingress_drop_monitor

6bd9e7a

Merge remote-tracking branch 'origin/lwaftr-rc-2022.01.19' into octar…

eafba5a

…ina-regressions

Merge pull request #1466 from eugeneia/octarina-regressions

14fa389

Fixes for regressions in max-next headed for Octarina release

eugeneia merged commit d270298 into next Jan 21, 2022

eugeneia mentioned this pull request Jan 21, 2022

Snabb 2022.01 “Octarina” #1467

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Collected changes for Jan 2022 release: Octarina #1463

Collected changes for Jan 2022 release: Octarina #1463

eugeneia commented Nov 18, 2021 •

edited

Loading

Collected changes for Jan 2022 release: Octarina #1463

Collected changes for Jan 2022 release: Octarina #1463

Conversation

eugeneia commented Nov 18, 2021 • edited Loading

eugeneia commented Nov 18, 2021 •

edited

Loading