-
Notifications
You must be signed in to change notification settings - Fork 97
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tcp long connections metrics #1249
base: main
Are you sure you want to change the base?
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
@nlgwcy @LemmyHuang @xiangxinyong can you review the pr, have i writen correct ebpf code, i have done the changes which has been told in previous comments in the proposal. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- Please present the results of the long connection metrics
- The original destination address problem is still unhandle. If you plan to optimize later, you can create a sub-issue under the existing lfx issue to track the task.
pkg/controller/telemetry/metric.go
Outdated
@@ -55,6 +55,7 @@ type MetricController struct { | |||
EnableAccesslog atomic.Bool | |||
EnableMonitoring atomic.Bool | |||
EnableWorkloadMetric atomic.Bool | |||
EnableLongTCPMetric atomic.Bool |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think it's necessary to distinguish between metrics for long connections and short connections.
As long as metrics is turned on, we should handle all types of connections.
bpf/kmesh/workload/include/config.h
Outdated
#define kmesh_perf_map km_perf_map | ||
#define kmesh_perf_info km_perf_info | ||
#define map_of_long_tcp_conns km_longtcpconns_map | ||
#define long_tcp_conns_events km_longtcpconns_events |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
BPF_OBJ_NAME_LEN = 16
So I think, this name of map is too long.
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
…userspace Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
…km_longconn_ev ringbuff Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
b79a168
to
a698cac
Compare
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
c274149
to
3df32ec
Compare
Signed-off-by: Yash Patel <yp969803@gmail.com> rfac: sendmsg.c Signed-off-by: Yash Patel <yp969803@gmail.com> chore: generated bpf2go Signed-off-by: Yash Patel <yp969803@gmail.com> rfac: added is_managed_by_kmesh to sendmsg.c Signed-off-by: Yash Patel <yp969803@gmail.com>
3df32ec
to
e26073e
Compare
@LiZhenCheng9527 @nlgwcy can you look at the above error during bpf_compaitliblity test, why is this happening, i have also shorten the program length so that number of instruction sets not exceeds |
so the problem arrises in the line of code, with the error
when i try to comment these lines and run unit tests, verifier accept it. these lines are written previously in the sockops_prog
in the program
|
Signed-off-by: Yash Patel <yp969803@gmail.com> chore: generated bpf2go Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
9a01a5a
to
1ebbcb3
Compare
@@ -8,6 +8,7 @@ | |||
|
|||
#include "bpf_common.h" | |||
#include "config.h" | |||
#include "workload.h" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think probe should not include workload to prevent cicular include
} | ||
|
||
construct_orig_dst_info(sk, info); | ||
construct_orig_dst_info(sk, &info_vals); | ||
__builtin_memcpy(info, &info_vals, sizeof(struct tcp_probe_info)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
i am not sure why memcpy here
bpf/kmesh/probes/tcp_probe.h
Outdated
__u32 *index = bpf_map_lookup_elem(&map_of_soc_id_counter, &zero); | ||
if (index && *index < MAP_SIZE_OF_TCP_CONNS) { | ||
bpf_map_update_elem(&map_of_soc_id, index, &storage->sock_cookie, BPF_ANY); | ||
*index += 1; | ||
bpf_map_update_elem(&map_of_soc_id_counter, &zero, index, BPF_ANY); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you add some comments about these two maps operation
} | ||
|
||
static inline void | ||
refresh_tcp_conn_info_on_state_change(struct bpf_tcp_sock *tcp_sock, struct sock_storage_data *storage, __u32 state) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can u make it use refresh_tcp_conn_info
bpf/kmesh/workload/sendmsg.c
Outdated
__uint(max_entries, 1); | ||
__type(key, int); | ||
__type(value, __u64); | ||
} tcp_conn_last_flush SEC(".maps"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please obey the kmesh mao naming convention
bpf/kmesh/workload/sendmsg.c
Outdated
// Initialize last flush time if not set | ||
bpf_map_update_elem(&tcp_conn_last_flush, &key, &init_time, BPF_ANY); | ||
} else if ((now - *last_time) >= TIMER_INTERVAL_NS) { | ||
flush_tcp_conns(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
IIRC, this is flushing all the connections probe info, i think it is a little bit heavy.
Can you do that for this particular socket?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You may get the bpf_tcp_sock from struct sk_msg_md
struct bpf_sock *sk = msg->sk;
struct bpf_tcp_sock *tcp_sk = bpf_skc_to_tcp_sock(sk);
Signed-off-by: Yash Patel <yp969803@gmail.com>
1ebbcb3
to
249fde9
Compare
@yp969803 Pleas rely if you have resolved that or have some diff idea |
@hzxuzhonghu i have seen your comments now, i will resolve it asap !! |
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
5883b74
to
c9b2790
Compare
@hzxuzhonghu done the changes, most of the ebpf code is complete, just adding another hook for recv_bytes is left i will do this in end, from now i will start writing test. |
Signed-off-by: Yash Patel <yp969803@gmail.com>
Codecov ReportAttention: Patch coverage is
❌ Your patch check has failed because the patch coverage (10.16%) is below the target coverage (80.00%). You can increase the patch coverage or adjust the target coverage.
... and 2 files with indirect coverage changes Continue to review full report in Codecov by Sentry.
🚀 New features to boost your workflow:
|
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
Signed-off-by: Yash Patel <yp969803@gmail.com>
What type of PR is this?
/kind feature
What this PR does / why we need it:
The pr introduces new feature of tcp_long_conn metrics
Which issue(s) this PR fixes:
Fixes #1211
Special notes for your reviewer:
Does this PR introduce a user-facing change?:
Yes