summaryrefslogtreecommitdiff
path: root/Documentation/ABI/testing/debugfs-driver-qat
blob: bd6793760f29842b841217d4226f6d5538403a62 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
What:		/sys/kernel/debug/qat_<device>_<BDF>/fw_counters
Date:		November 2023
KernelVersion:	6.6
Contact:	qat-linux@intel.com
Description:	(RO) Read returns the number of requests sent to the FW and the number of responses
		received from the FW for each Acceleration Engine
		Reported firmware counters::

			<N>: Number of requests sent from Acceleration Engine N to FW and responses
			     Acceleration Engine N received from FW

What:		/sys/kernel/debug/qat_<device>_<BDF>/heartbeat/config
Date:		November 2023
KernelVersion:	6.6
Contact:	qat-linux@intel.com
Description:	(RW) Read returns value of the Heartbeat update period.
		Write to the file changes this period value.

		This period should reflect planned polling interval of device
		health status. High frequency Heartbeat monitoring wastes CPU cycles
		but minimizes the customer’s system downtime. Also, if there are
		large service requests that take some time to complete, high frequency
		Heartbeat monitoring could result in false reports of unresponsiveness
		and in those cases, period needs to be increased.

		This parameter is effective only for c3xxx, c62x, dh895xcc devices.
		4xxx has this value internally fixed to 200ms.

		Default value is set to 500. Minimal allowed value is 200.
		All values are expressed in milliseconds.

What:		/sys/kernel/debug/qat_<device>_<BDF>/heartbeat/queries_failed
Date:		November 2023
KernelVersion:	6.6
Contact:	qat-linux@intel.com
Description:	(RO) Read returns the number of times the device became unresponsive.

		Attribute returns value of the counter which is incremented when
		status query results negative.

What:		/sys/kernel/debug/qat_<device>_<BDF>/heartbeat/queries_sent
Date:		November 2023
KernelVersion:	6.6
Contact:	qat-linux@intel.com
Description:	(RO) Read returns the number of times the control process checked
		if the device is responsive.

		Attribute returns value of the counter which is incremented on
		every status query.

What:		/sys/kernel/debug/qat_<device>_<BDF>/heartbeat/status
Date:		November 2023
KernelVersion:	6.6
Contact:	qat-linux@intel.com
Description:	(RO) Read returns the device health status.

		Returns 0 when device is healthy or -1 when is unresponsive
		or the query failed to send.

		The driver does not monitor for Heartbeat. It is left for a user
		to poll the status periodically.

What:		/sys/kernel/debug/qat_<device>_<BDF>/pm_status
Date:		January 2024
KernelVersion:	6.7
Contact:	qat-linux@intel.com
Description:	(RO) Read returns power management information specific to the
		QAT device.

		This attribute is only available for qat_4xxx devices.

What:		/sys/kernel/debug/qat_<device>_<BDF>/cnv_errors
Date:		January 2024
KernelVersion:	6.7
Contact:	qat-linux@intel.com
Description:	(RO) Read returns, for each Acceleration Engine (AE), the number
		of errors and the type of the last error detected by the device
		when performing verified compression.
		Reported counters::

			<N>: Number of Compress and Verify (CnV) errors and type
			     of the last CnV error detected by Acceleration
			     Engine N.

What:		/sys/kernel/debug/qat_<device>_<BDF>/heartbeat/inject_error
Date:		March 2024
KernelVersion:	6.8
Contact:	qat-linux@intel.com
Description:	(WO) Write to inject an error that simulates an heartbeat
		failure. This is to be used for testing purposes.

		After writing this file, the driver stops arbitration on a
		random engine and disables the fetching of heartbeat counters.
		If a workload is running on the device, a job submitted to the
		accelerator might not get a response and a read of the
		`heartbeat/status` attribute might report -1, i.e. device
		unresponsive.
		The error is unrecoverable thus the device must be restarted to
		restore its functionality.

		This attribute is available only when the kernel is built with
		CONFIG_CRYPTO_DEV_QAT_ERROR_INJECTION=y.

		A write of 1 enables error injection.

		The following example shows how to enable error injection::

			# cd /sys/kernel/debug/qat_<device>_<BDF>
			# echo 1 > heartbeat/inject_error