Support big strings #3228

danobi · 2024-06-10T01:55:03Z

Previously, all strings needed to be stored on the stack. Given that BPF
has a 512B stack limit, this put serious limits on strings. In practice,
it was limited to about 200B.

This PR raises the limit much higher to around 1024. The new limit
we hit is from the LLVM memset() builtin. It only supports up to 1K. To
go above this, we need to create our own memset() routine in BPF. This
is quite easy to do and will be attempted in a later commit.

The way big strings work is by assigning each str() call in the script a
unique ID. Then we create a percpu array map such that for each CPU,
there is a percpu entry that can accomodate each str() callsite. This
makes it so scripts can keep as many strings "on the stack" as they'd
like, with the tradeoff being the amount of memory we need to
preallocate.

As long as the kernel never allows BPF programs to nest (eg execute prog
A while prog A is already on the call stack), this approach is robust to
self-corruption.

This closes #305.

Checklist

Language changes are updated in man/adoc/bpftrace.adoc
User-visible and non-trivial changes updated in CHANGELOG.md
The new behaviour is covered by tests

danobi · 2024-06-10T01:57:12Z

In another window:

$ /usr/bin/asdf/asdf/dsf/sdfddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd
bash: /usr/bin/asdf/asdf/dsf/sdfddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd: No such file or directory

$ sudo BPFTRACE_MAX_STRLEN=1024 ./build/src/bpftrace -e 't:syscalls:sys_enter_execve { @[str(args.filename)] = count() }'
Attaching 1 probe...
^C

@[/usr/bin/asdf/asdf/dsf/sdfddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddddd]: 1
@[/home/dxu/.tmux/plugins/tmux-continuum/scripts/continuum_save.sh]: 2
@[/bin/sh]: 2
@[/usr/bin/cut]: 2

@tyroguru you'll be happy to know I woke up this morning with the approach fully formed in my head. But perhaps also sad to know it blew up my sunday :)

danobi · 2024-06-10T01:59:19Z

Related but not done: #2853

src/main.cpp

tests/runtime/call

jordalgo · 2024-06-11T22:21:40Z

This all looks great and surprisingly straightforward.

src/ast/passes/codegen_llvm.cpp

Order the ids in the order they were defined in codegen_llvm.h. This makes it easier to spot missing ids.

We were missing cgroup_path_id and watchpoint_id. Without this, there would be bugs for cgroupid() if it is used inside a loop or in an expanded probe. Similarly, if watchpoint probes are expanded, there would have been bugs.

There is no need to keep the strlen value explicitly on stack. We can leave it up to LLVM whether or not to spill onto stack or keep in a register.

Before, strncmp codegen assumed that caller was passing in two alloca'd strings. This assumption was fine but overly narrow now that we are implementing big strings. Big strings are not an array type (alloca), but rather a proper pointer. Relax assumption by using a more general pointer to i8.

Previously, all strings needed to be stored on the stack. Given that BPF has a 512B stack limit, this put serious limits on strings. In practice, it was limited to about 200B. This commit raises the limit much higher to around 1024. The new limit we hit is from the LLVM memset() builtin. It only supports up to 1K. To go above this, we need to create our own memset() routine in BPF. This is quite easy to do and will be attempted in a later commit. The way big strings work is by assigning each str() call in the script a unique ID. Then we create a percpu array map such that for each CPU, there is a percpu entry that can accomodate each str() callsite. This makes it so scripts can keep as many strings "on the stack" as they'd like, with the tradeoff being the amount of memory we need to preallocate. As long as the kernel never allows BPF programs to nest (eg execute prog A while prog A is already on the call stack), this approach is robust to self-corruption. Note we do not yet remove or change default value of BPFTRACE_MAX_STRLEN. That is b/c a few other helpers (buf(), path(), maybe more) also read the value and allocate space on stack accordingly. We need to decouple or convert those helpers to scratch maps as well. This closes bpftrace#305.

danobi mentioned this pull request Jun 10, 2024

LNL NOCODEC alsa-bat playback failure thesofproject/sof#9123

Closed

danobi mentioned this pull request Jun 10, 2024

Fully remove BPFTRACE_MAX_STRLEN config #3229

Closed

danobi force-pushed the big_str branch 2 times, most recently from bf0893f to c919a83 Compare June 10, 2024 03:02

marc-hb mentioned this pull request Jun 10, 2024

tools: Initial version of IPC4 tx message logger using eBPF thesofproject/sof-test#1205

Open

danobi force-pushed the big_str branch 2 times, most recently from 1fd35fa to d026d58 Compare June 10, 2024 17:16

danobi marked this pull request as ready for review June 10, 2024 17:32

danobi requested review from ajor, viktormalik and fbs as code owners June 10, 2024 17:32

jordalgo reviewed Jun 11, 2024

View reviewed changes

src/main.cpp Show resolved Hide resolved

jordalgo reviewed Jun 11, 2024

View reviewed changes

tests/runtime/call Outdated Show resolved Hide resolved

jordalgo reviewed Jun 11, 2024

View reviewed changes

src/ast/passes/codegen_llvm.cpp Show resolved Hide resolved

danobi added 6 commits June 12, 2024 11:57

codegen: NFC: Order saved IDs in create_reset_ids()

2f69d77

Order the ids in the order they were defined in codegen_llvm.h. This makes it easier to spot missing ids.

codegen: Add missing IDs in create_reset_ids()

4b80df8

We were missing cgroup_path_id and watchpoint_id. Without this, there would be bugs for cgroupid() if it is used inside a loop or in an expanded probe. Similarly, if watchpoint probes are expanded, there would have been bugs.

codegen: NFC: Avoid storing str() len param explicitly on stack

0ffb4de

There is no need to keep the strlen value explicitly on stack. We can leave it up to LLVM whether or not to spill onto stack or keep in a register.

Update CHANGELOG

e7b5f01

danobi force-pushed the big_str branch from d026d58 to e7b5f01 Compare June 12, 2024 18:11

jordalgo approved these changes Jun 12, 2024

View reviewed changes

danobi merged commit 81b6a51 into bpftrace:master Jun 12, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support big strings #3228

Support big strings #3228

danobi commented Jun 10, 2024 •

edited

Loading

danobi commented Jun 10, 2024 •

edited

Loading

danobi commented Jun 10, 2024

jordalgo commented Jun 11, 2024

Support big strings #3228

Support big strings #3228

Conversation

danobi commented Jun 10, 2024 • edited Loading

Checklist

danobi commented Jun 10, 2024 • edited Loading

danobi commented Jun 10, 2024

jordalgo commented Jun 11, 2024

danobi commented Jun 10, 2024 •

edited

Loading

danobi commented Jun 10, 2024 •

edited

Loading