| .. | .. |
|---|
| 20 | 20 | /sys/kernel/debug/tracing/kprobe_events, and enable it via |
|---|
| 21 | 21 | /sys/kernel/debug/tracing/events/kprobes/<EVENT>/enable. |
|---|
| 22 | 22 | |
|---|
| 23 | +You can also use /sys/kernel/debug/tracing/dynamic_events instead of |
|---|
| 24 | +kprobe_events. That interface will provide unified access to other |
|---|
| 25 | +dynamic events too. |
|---|
| 23 | 26 | |
|---|
| 24 | 27 | Synopsis of kprobe_events |
|---|
| 25 | 28 | ------------------------- |
|---|
| .. | .. |
|---|
| 27 | 30 | |
|---|
| 28 | 31 | p[:[GRP/]EVENT] [MOD:]SYM[+offs]|MEMADDR [FETCHARGS] : Set a probe |
|---|
| 29 | 32 | r[MAXACTIVE][:[GRP/]EVENT] [MOD:]SYM[+0] [FETCHARGS] : Set a return probe |
|---|
| 33 | + p:[GRP/]EVENT] [MOD:]SYM[+0]%return [FETCHARGS] : Set a return probe |
|---|
| 30 | 34 | -:[GRP/]EVENT : Clear a probe |
|---|
| 31 | 35 | |
|---|
| 32 | 36 | GRP : Group name. If omitted, use "kprobes" for it. |
|---|
| .. | .. |
|---|
| 34 | 38 | based on SYM+offs or MEMADDR. |
|---|
| 35 | 39 | MOD : Module name which has given SYM. |
|---|
| 36 | 40 | SYM[+offs] : Symbol+offset where the probe is inserted. |
|---|
| 41 | + SYM%return : Return address of the symbol |
|---|
| 37 | 42 | MEMADDR : Address where the probe is inserted. |
|---|
| 38 | 43 | MAXACTIVE : Maximum number of instances of the specified function that |
|---|
| 39 | 44 | can be probed simultaneously, or 0 for the default value |
|---|
| 40 | | - as defined in Documentation/kprobes.txt section 1.3.1. |
|---|
| 45 | + as defined in Documentation/trace/kprobes.rst section 1.3.1. |
|---|
| 41 | 46 | |
|---|
| 42 | 47 | FETCHARGS : Arguments. Each probe can have up to 128 args. |
|---|
| 43 | 48 | %REG : Fetch register REG |
|---|
| .. | .. |
|---|
| 45 | 50 | @SYM[+|-offs] : Fetch memory at SYM +|- offs (SYM should be a data symbol) |
|---|
| 46 | 51 | $stackN : Fetch Nth entry of stack (N >= 0) |
|---|
| 47 | 52 | $stack : Fetch stack address. |
|---|
| 48 | | - $retval : Fetch return value.(*) |
|---|
| 53 | + $argN : Fetch the Nth function argument. (N >= 1) (\*1) |
|---|
| 54 | + $retval : Fetch return value.(\*2) |
|---|
| 49 | 55 | $comm : Fetch current task comm. |
|---|
| 50 | | - +|-offs(FETCHARG) : Fetch memory at FETCHARG +|- offs address.(**) |
|---|
| 56 | + +|-[u]OFFS(FETCHARG) : Fetch memory at FETCHARG +|- OFFS address.(\*3)(\*4) |
|---|
| 57 | + \IMM : Store an immediate value to the argument. |
|---|
| 51 | 58 | NAME=FETCHARG : Set NAME as the argument name of FETCHARG. |
|---|
| 52 | 59 | FETCHARG:TYPE : Set TYPE as the type of FETCHARG. Currently, basic types |
|---|
| 53 | 60 | (u8/u16/u32/u64/s8/s16/s32/s64), hexadecimal types |
|---|
| 54 | | - (x8/x16/x32/x64), "string" and bitfield are supported. |
|---|
| 61 | + (x8/x16/x32/x64), "string", "ustring" and bitfield |
|---|
| 62 | + are supported. |
|---|
| 55 | 63 | |
|---|
| 56 | | - (*) only for return probe. |
|---|
| 57 | | - (**) this is useful for fetching a field of data structures. |
|---|
| 64 | + (\*1) only for the probe on function entry (offs == 0). |
|---|
| 65 | + (\*2) only for return probe. |
|---|
| 66 | + (\*3) this is useful for fetching a field of data structures. |
|---|
| 67 | + (\*4) "u" means user-space dereference. See :ref:`user_mem_access`. |
|---|
| 58 | 68 | |
|---|
| 59 | 69 | Types |
|---|
| 60 | 70 | ----- |
|---|
| .. | .. |
|---|
| 64 | 74 | in decimal ('s' and 'u') or hexadecimal ('x'). Without type casting, 'x32' |
|---|
| 65 | 75 | or 'x64' is used depends on the architecture (e.g. x86-32 uses x32, and |
|---|
| 66 | 76 | x86-64 uses x64). |
|---|
| 77 | +These value types can be an array. To record array data, you can add '[N]' |
|---|
| 78 | +(where N is a fixed number, less than 64) to the base type. |
|---|
| 79 | +E.g. 'x16[4]' means an array of x16 (2bytes hex) with 4 elements. |
|---|
| 80 | +Note that the array can be applied to memory type fetchargs, you can not |
|---|
| 81 | +apply it to registers/stack-entries etc. (for example, '$stack1:x8[8]' is |
|---|
| 82 | +wrong, but '+8($stack):x8[8]' is OK.) |
|---|
| 67 | 83 | String type is a special type, which fetches a "null-terminated" string from |
|---|
| 68 | 84 | kernel space. This means it will fail and store NULL if the string container |
|---|
| 69 | | -has been paged out. |
|---|
| 85 | +has been paged out. "ustring" type is an alternative of string for user-space. |
|---|
| 86 | +See :ref:`user_mem_access` for more info.. |
|---|
| 87 | +The string array type is a bit different from other types. For other base |
|---|
| 88 | +types, <base-type>[1] is equal to <base-type> (e.g. +0(%di):x32[1] is same |
|---|
| 89 | +as +0(%di):x32.) But string[1] is not equal to string. The string type itself |
|---|
| 90 | +represents "char array", but string array type represents "char * array". |
|---|
| 91 | +So, for example, +0(%di):string[1] is equal to +0(+0(%di)):string. |
|---|
| 70 | 92 | Bitfield is another special type, which takes 3 parameters, bit-width, bit- |
|---|
| 71 | 93 | offset, and container-size (usually 32). The syntax is:: |
|---|
| 72 | 94 | |
|---|
| 73 | 95 | b<bit-width>@<bit-offset>/<container-size> |
|---|
| 74 | 96 | |
|---|
| 97 | +Symbol type('symbol') is an alias of u32 or u64 type (depends on BITS_PER_LONG) |
|---|
| 98 | +which shows given pointer in "symbol+offset" style. |
|---|
| 75 | 99 | For $comm, the default type is "string"; any other type is invalid. |
|---|
| 76 | 100 | |
|---|
| 101 | +.. _user_mem_access: |
|---|
| 102 | + |
|---|
| 103 | +User Memory Access |
|---|
| 104 | +------------------ |
|---|
| 105 | +Kprobe events supports user-space memory access. For that purpose, you can use |
|---|
| 106 | +either user-space dereference syntax or 'ustring' type. |
|---|
| 107 | + |
|---|
| 108 | +The user-space dereference syntax allows you to access a field of a data |
|---|
| 109 | +structure in user-space. This is done by adding the "u" prefix to the |
|---|
| 110 | +dereference syntax. For example, +u4(%si) means it will read memory from the |
|---|
| 111 | +address in the register %si offset by 4, and the memory is expected to be in |
|---|
| 112 | +user-space. You can use this for strings too, e.g. +u0(%si):string will read |
|---|
| 113 | +a string from the address in the register %si that is expected to be in user- |
|---|
| 114 | +space. 'ustring' is a shortcut way of performing the same task. That is, |
|---|
| 115 | ++0(%si):ustring is equivalent to +u0(%si):string. |
|---|
| 116 | + |
|---|
| 117 | +Note that kprobe-event provides the user-memory access syntax but it doesn't |
|---|
| 118 | +use it transparently. This means if you use normal dereference or string type |
|---|
| 119 | +for user memory, it might fail, and may always fail on some archs. The user |
|---|
| 120 | +has to carefully check if the target data is in kernel or user space. |
|---|
| 77 | 121 | |
|---|
| 78 | 122 | Per-Probe Event Filtering |
|---|
| 79 | 123 | ------------------------- |
|---|
| .. | .. |
|---|
| 105 | 149 | /sys/kernel/debug/tracing/kprobe_profile. |
|---|
| 106 | 150 | The first column is event name, the second is the number of probe hits, |
|---|
| 107 | 151 | the third is the number of probe miss-hits. |
|---|
| 152 | + |
|---|
| 153 | +Kernel Boot Parameter |
|---|
| 154 | +--------------------- |
|---|
| 155 | +You can add and enable new kprobe events when booting up the kernel by |
|---|
| 156 | +"kprobe_event=" parameter. The parameter accepts a semicolon-delimited |
|---|
| 157 | +kprobe events, which format is similar to the kprobe_events. |
|---|
| 158 | +The difference is that the probe definition parameters are comma-delimited |
|---|
| 159 | +instead of space. For example, adding myprobe event on do_sys_open like below |
|---|
| 160 | + |
|---|
| 161 | + p:myprobe do_sys_open dfd=%ax filename=%dx flags=%cx mode=+4($stack) |
|---|
| 162 | + |
|---|
| 163 | +should be below for kernel boot parameter (just replace spaces with comma) |
|---|
| 164 | + |
|---|
| 165 | + p:myprobe,do_sys_open,dfd=%ax,filename=%dx,flags=%cx,mode=+4($stack) |
|---|
| 108 | 166 | |
|---|
| 109 | 167 | |
|---|
| 110 | 168 | Usage examples |
|---|
| .. | .. |
|---|
| 171 | 229 | echo 1 > /sys/kernel/debug/tracing/events/kprobes/myprobe/enable |
|---|
| 172 | 230 | echo 1 > /sys/kernel/debug/tracing/events/kprobes/myretprobe/enable |
|---|
| 173 | 231 | |
|---|
| 232 | +Use the following command to start tracing in an interval. |
|---|
| 233 | +:: |
|---|
| 234 | + |
|---|
| 235 | + # echo 1 > tracing_on |
|---|
| 236 | + Open something... |
|---|
| 237 | + # echo 0 > tracing_on |
|---|
| 238 | + |
|---|
| 174 | 239 | And you can see the traced information via /sys/kernel/debug/tracing/trace. |
|---|
| 175 | 240 | :: |
|---|
| 176 | 241 | |
|---|
| .. | .. |
|---|
| 190 | 255 | Each line shows when the kernel hits an event, and <- SYMBOL means kernel |
|---|
| 191 | 256 | returns from SYMBOL(e.g. "sys_open+0x1b/0x1d <- do_sys_open" means kernel |
|---|
| 192 | 257 | returns from do_sys_open to sys_open+0x1b). |
|---|
| 193 | | - |
|---|