diff options
| author | Martin KaFai Lau <martin.lau@kernel.org> | 2025-01-29 13:31:19 -0800 | 
|---|---|---|
| committer | Martin KaFai Lau <martin.lau@kernel.org> | 2025-01-29 13:33:10 -0800 | 
| commit | 9bf412d4d5b1f431e6cdd8111094be39c031036c (patch) | |
| tree | f01b165541b84bf465130e3e20902b031391fe37 /net/strparser/strparser.c | |
| parent | bc27c52eea189e8f7492d40739b7746d67b65beb (diff) | |
| parent | 6fcfe96e0f6e9bebe1b185f1548a9a8cb1b68dea (diff) | |
Merge branch 'bpf-fix-wrong-copied_seq-calculation-and-add-tests'
Jiayuan Chen says:
====================
A previous commit described in this topic
http://lore.kernel.org/bpf/20230523025618.113937-9-john.fastabend@gmail.com
directly updated 'sk->copied_seq' in the tcp_eat_skb() function when the
action of a BPF program was SK_REDIRECT. For other actions, like SK_PASS,
the update logic for 'sk->copied_seq' was moved to
tcp_bpf_recvmsg_parser() to ensure the accuracy of the 'fionread' feature.
That commit works for a single stream_verdict scenario, as it also
modified 'sk_data_ready->sk_psock_verdict_data_ready->tcp_read_skb'
to remove updating 'sk->copied_seq'.
However, for programs where both stream_parser and stream_verdict are
active (strparser purpose), tcp_read_sock() was used instead of
tcp_read_skb() (sk_data_ready->strp_data_ready->tcp_read_sock).
tcp_read_sock() now still updates 'sk->copied_seq', leading to duplicated
updates.
In summary, for strparser + SK_PASS, copied_seq is redundantly calculated
in both tcp_read_sock() and tcp_bpf_recvmsg_parser().
The issue causes incorrect copied_seq calculations, which prevent
correct data reads from the recv() interface in user-land.
Also we added test cases for bpf + strparser and separated them from
sockmap_basic, as strparser has more encapsulation and parsing
capabilities compared to sockmap.
====================
Link: https://patch.msgid.link/20250122100917.49845-1-mrpre@163.com
Signed-off-by: Martin KaFai Lau <martin.lau@kernel.org>
Diffstat (limited to 'net/strparser/strparser.c')
| -rw-r--r-- | net/strparser/strparser.c | 11 | 
1 files changed, 9 insertions, 2 deletions
| diff --git a/net/strparser/strparser.c b/net/strparser/strparser.c index 8299ceb3e373..95696f42647e 100644 --- a/net/strparser/strparser.c +++ b/net/strparser/strparser.c @@ -347,7 +347,10 @@ static int strp_read_sock(struct strparser *strp)  	struct socket *sock = strp->sk->sk_socket;  	read_descriptor_t desc; -	if (unlikely(!sock || !sock->ops || !sock->ops->read_sock)) +	if (unlikely(!sock || !sock->ops)) +		return -EBUSY; + +	if (unlikely(!strp->cb.read_sock && !sock->ops->read_sock))  		return -EBUSY;  	desc.arg.data = strp; @@ -355,7 +358,10 @@ static int strp_read_sock(struct strparser *strp)  	desc.count = 1; /* give more than one skb per call */  	/* sk should be locked here, so okay to do read_sock */ -	sock->ops->read_sock(strp->sk, &desc, strp_recv); +	if (strp->cb.read_sock) +		strp->cb.read_sock(strp, &desc, strp_recv); +	else +		sock->ops->read_sock(strp->sk, &desc, strp_recv);  	desc.error = strp->cb.read_sock_done(strp, desc.error); @@ -468,6 +474,7 @@ int strp_init(struct strparser *strp, struct sock *sk,  	strp->cb.unlock = cb->unlock ? : strp_sock_unlock;  	strp->cb.rcv_msg = cb->rcv_msg;  	strp->cb.parse_msg = cb->parse_msg; +	strp->cb.read_sock = cb->read_sock;  	strp->cb.read_sock_done = cb->read_sock_done ? : default_read_sock_done;  	strp->cb.abort_parser = cb->abort_parser ? : strp_abort_strp; | 
