.. | .. |
---|
20 | 20 | /sys/kernel/debug/tracing/kprobe_events, and enable it via |
---|
21 | 21 | /sys/kernel/debug/tracing/events/kprobes/<EVENT>/enable. |
---|
22 | 22 | |
---|
| 23 | +You can also use /sys/kernel/debug/tracing/dynamic_events instead of |
---|
| 24 | +kprobe_events. That interface will provide unified access to other |
---|
| 25 | +dynamic events too. |
---|
23 | 26 | |
---|
24 | 27 | Synopsis of kprobe_events |
---|
25 | 28 | ------------------------- |
---|
.. | .. |
---|
27 | 30 | |
---|
28 | 31 | p[:[GRP/]EVENT] [MOD:]SYM[+offs]|MEMADDR [FETCHARGS] : Set a probe |
---|
29 | 32 | r[MAXACTIVE][:[GRP/]EVENT] [MOD:]SYM[+0] [FETCHARGS] : Set a return probe |
---|
| 33 | + p:[GRP/]EVENT] [MOD:]SYM[+0]%return [FETCHARGS] : Set a return probe |
---|
30 | 34 | -:[GRP/]EVENT : Clear a probe |
---|
31 | 35 | |
---|
32 | 36 | GRP : Group name. If omitted, use "kprobes" for it. |
---|
.. | .. |
---|
34 | 38 | based on SYM+offs or MEMADDR. |
---|
35 | 39 | MOD : Module name which has given SYM. |
---|
36 | 40 | SYM[+offs] : Symbol+offset where the probe is inserted. |
---|
| 41 | + SYM%return : Return address of the symbol |
---|
37 | 42 | MEMADDR : Address where the probe is inserted. |
---|
38 | 43 | MAXACTIVE : Maximum number of instances of the specified function that |
---|
39 | 44 | can be probed simultaneously, or 0 for the default value |
---|
40 | | - as defined in Documentation/kprobes.txt section 1.3.1. |
---|
| 45 | + as defined in Documentation/trace/kprobes.rst section 1.3.1. |
---|
41 | 46 | |
---|
42 | 47 | FETCHARGS : Arguments. Each probe can have up to 128 args. |
---|
43 | 48 | %REG : Fetch register REG |
---|
.. | .. |
---|
45 | 50 | @SYM[+|-offs] : Fetch memory at SYM +|- offs (SYM should be a data symbol) |
---|
46 | 51 | $stackN : Fetch Nth entry of stack (N >= 0) |
---|
47 | 52 | $stack : Fetch stack address. |
---|
48 | | - $retval : Fetch return value.(*) |
---|
| 53 | + $argN : Fetch the Nth function argument. (N >= 1) (\*1) |
---|
| 54 | + $retval : Fetch return value.(\*2) |
---|
49 | 55 | $comm : Fetch current task comm. |
---|
50 | | - +|-offs(FETCHARG) : Fetch memory at FETCHARG +|- offs address.(**) |
---|
| 56 | + +|-[u]OFFS(FETCHARG) : Fetch memory at FETCHARG +|- OFFS address.(\*3)(\*4) |
---|
| 57 | + \IMM : Store an immediate value to the argument. |
---|
51 | 58 | NAME=FETCHARG : Set NAME as the argument name of FETCHARG. |
---|
52 | 59 | FETCHARG:TYPE : Set TYPE as the type of FETCHARG. Currently, basic types |
---|
53 | 60 | (u8/u16/u32/u64/s8/s16/s32/s64), hexadecimal types |
---|
54 | | - (x8/x16/x32/x64), "string" and bitfield are supported. |
---|
| 61 | + (x8/x16/x32/x64), "string", "ustring" and bitfield |
---|
| 62 | + are supported. |
---|
55 | 63 | |
---|
56 | | - (*) only for return probe. |
---|
57 | | - (**) this is useful for fetching a field of data structures. |
---|
| 64 | + (\*1) only for the probe on function entry (offs == 0). |
---|
| 65 | + (\*2) only for return probe. |
---|
| 66 | + (\*3) this is useful for fetching a field of data structures. |
---|
| 67 | + (\*4) "u" means user-space dereference. See :ref:`user_mem_access`. |
---|
58 | 68 | |
---|
59 | 69 | Types |
---|
60 | 70 | ----- |
---|
.. | .. |
---|
64 | 74 | in decimal ('s' and 'u') or hexadecimal ('x'). Without type casting, 'x32' |
---|
65 | 75 | or 'x64' is used depends on the architecture (e.g. x86-32 uses x32, and |
---|
66 | 76 | x86-64 uses x64). |
---|
| 77 | +These value types can be an array. To record array data, you can add '[N]' |
---|
| 78 | +(where N is a fixed number, less than 64) to the base type. |
---|
| 79 | +E.g. 'x16[4]' means an array of x16 (2bytes hex) with 4 elements. |
---|
| 80 | +Note that the array can be applied to memory type fetchargs, you can not |
---|
| 81 | +apply it to registers/stack-entries etc. (for example, '$stack1:x8[8]' is |
---|
| 82 | +wrong, but '+8($stack):x8[8]' is OK.) |
---|
67 | 83 | String type is a special type, which fetches a "null-terminated" string from |
---|
68 | 84 | kernel space. This means it will fail and store NULL if the string container |
---|
69 | | -has been paged out. |
---|
| 85 | +has been paged out. "ustring" type is an alternative of string for user-space. |
---|
| 86 | +See :ref:`user_mem_access` for more info.. |
---|
| 87 | +The string array type is a bit different from other types. For other base |
---|
| 88 | +types, <base-type>[1] is equal to <base-type> (e.g. +0(%di):x32[1] is same |
---|
| 89 | +as +0(%di):x32.) But string[1] is not equal to string. The string type itself |
---|
| 90 | +represents "char array", but string array type represents "char * array". |
---|
| 91 | +So, for example, +0(%di):string[1] is equal to +0(+0(%di)):string. |
---|
70 | 92 | Bitfield is another special type, which takes 3 parameters, bit-width, bit- |
---|
71 | 93 | offset, and container-size (usually 32). The syntax is:: |
---|
72 | 94 | |
---|
73 | 95 | b<bit-width>@<bit-offset>/<container-size> |
---|
74 | 96 | |
---|
| 97 | +Symbol type('symbol') is an alias of u32 or u64 type (depends on BITS_PER_LONG) |
---|
| 98 | +which shows given pointer in "symbol+offset" style. |
---|
75 | 99 | For $comm, the default type is "string"; any other type is invalid. |
---|
76 | 100 | |
---|
| 101 | +.. _user_mem_access: |
---|
| 102 | + |
---|
| 103 | +User Memory Access |
---|
| 104 | +------------------ |
---|
| 105 | +Kprobe events supports user-space memory access. For that purpose, you can use |
---|
| 106 | +either user-space dereference syntax or 'ustring' type. |
---|
| 107 | + |
---|
| 108 | +The user-space dereference syntax allows you to access a field of a data |
---|
| 109 | +structure in user-space. This is done by adding the "u" prefix to the |
---|
| 110 | +dereference syntax. For example, +u4(%si) means it will read memory from the |
---|
| 111 | +address in the register %si offset by 4, and the memory is expected to be in |
---|
| 112 | +user-space. You can use this for strings too, e.g. +u0(%si):string will read |
---|
| 113 | +a string from the address in the register %si that is expected to be in user- |
---|
| 114 | +space. 'ustring' is a shortcut way of performing the same task. That is, |
---|
| 115 | ++0(%si):ustring is equivalent to +u0(%si):string. |
---|
| 116 | + |
---|
| 117 | +Note that kprobe-event provides the user-memory access syntax but it doesn't |
---|
| 118 | +use it transparently. This means if you use normal dereference or string type |
---|
| 119 | +for user memory, it might fail, and may always fail on some archs. The user |
---|
| 120 | +has to carefully check if the target data is in kernel or user space. |
---|
77 | 121 | |
---|
78 | 122 | Per-Probe Event Filtering |
---|
79 | 123 | ------------------------- |
---|
.. | .. |
---|
105 | 149 | /sys/kernel/debug/tracing/kprobe_profile. |
---|
106 | 150 | The first column is event name, the second is the number of probe hits, |
---|
107 | 151 | the third is the number of probe miss-hits. |
---|
| 152 | + |
---|
| 153 | +Kernel Boot Parameter |
---|
| 154 | +--------------------- |
---|
| 155 | +You can add and enable new kprobe events when booting up the kernel by |
---|
| 156 | +"kprobe_event=" parameter. The parameter accepts a semicolon-delimited |
---|
| 157 | +kprobe events, which format is similar to the kprobe_events. |
---|
| 158 | +The difference is that the probe definition parameters are comma-delimited |
---|
| 159 | +instead of space. For example, adding myprobe event on do_sys_open like below |
---|
| 160 | + |
---|
| 161 | + p:myprobe do_sys_open dfd=%ax filename=%dx flags=%cx mode=+4($stack) |
---|
| 162 | + |
---|
| 163 | +should be below for kernel boot parameter (just replace spaces with comma) |
---|
| 164 | + |
---|
| 165 | + p:myprobe,do_sys_open,dfd=%ax,filename=%dx,flags=%cx,mode=+4($stack) |
---|
108 | 166 | |
---|
109 | 167 | |
---|
110 | 168 | Usage examples |
---|
.. | .. |
---|
171 | 229 | echo 1 > /sys/kernel/debug/tracing/events/kprobes/myprobe/enable |
---|
172 | 230 | echo 1 > /sys/kernel/debug/tracing/events/kprobes/myretprobe/enable |
---|
173 | 231 | |
---|
| 232 | +Use the following command to start tracing in an interval. |
---|
| 233 | +:: |
---|
| 234 | + |
---|
| 235 | + # echo 1 > tracing_on |
---|
| 236 | + Open something... |
---|
| 237 | + # echo 0 > tracing_on |
---|
| 238 | + |
---|
174 | 239 | And you can see the traced information via /sys/kernel/debug/tracing/trace. |
---|
175 | 240 | :: |
---|
176 | 241 | |
---|
.. | .. |
---|
190 | 255 | Each line shows when the kernel hits an event, and <- SYMBOL means kernel |
---|
191 | 256 | returns from SYMBOL(e.g. "sys_open+0x1b/0x1d <- do_sys_open" means kernel |
---|
192 | 257 | returns from do_sys_open to sys_open+0x1b). |
---|
193 | | - |
---|