tuned-amdgpu/README.md

76 lines
3.5 KiB
Markdown
Raw Normal View History

2021-10-13 01:14:30 +00:00
# tuned-amdgpu
2024-08-05 13:47:39 +00:00
Hacky solution to integrate AMDGPU power/clock control into `tuned` profiles
with Ansible.
2024-08-27 11:54:55 +00:00
Extends every `tuned` profile found in `/usr/lib/tuned`
using the [AMDGPU hwmon interfaces](https://docs.kernel.org/gpu/amdgpu/thermal.html):
2024-03-25 00:27:59 +00:00
2024-08-27 11:54:55 +00:00
- `default`: the out-of-the-box GPU clock/power configuration
- `overclock`: the _optimized_ card configuration. Includes the clock/voltage/power settings outlined below.
- `peak`: the same as `overclock`, but with clock gating removed. May help profiling.
2023-06-03 22:40:20 +00:00
2024-08-27 11:54:55 +00:00
Contrary to the name, the `overclock` profiles can be used to de-tune the card as well.
2024-03-25 00:27:59 +00:00
2024-08-27 12:32:08 +00:00
## Assumptions / Limitations
2024-08-27 12:37:50 +00:00
Only tested with `RX6000` series GPUs and the _mainline_ `amdgpu` driver.
Other permutations may _not_ work properly. Please use at your own risk!
Multiple GPUs in a single system are not yet managed,
assumes a single GPU with displays attached.
2024-08-27 12:32:08 +00:00
## Profiles
2023-07-08 04:54:20 +00:00
Two _'profiles'_ are in each name:
- before `amdgpu` is the source profile provided with `tuned`
- after `amdgpu` tells the GPU clock profile offered, outlined below
| Output profile | Description |
2022-08-03 07:18:22 +00:00
|:---|---|
| `balanced-amdgpu-default` | Includes the (assumed) existing `balanced` tuned profile.<br/><br/>Only adjusts the GPU power limit (typically lower). Clocks/voltage curve remain the default. |
| `desktop-amdgpu-overclock` | Includes the (assumed) existing `desktop` tuned profile.<br/><br/>Adjusts the GPU power limit, clocks, _and_ the voltage curve. |
2023-07-08 04:50:28 +00:00
| `desktop-amdgpu-peak` | Includes the (assumed) existing `desktop` tuned profile.<br/><br/>Same as the `overclock` profile, but locks clocks to their highest configured values |
2022-08-03 06:11:02 +00:00
## Config
2022-08-03 08:05:18 +00:00
2024-08-05 13:34:59 +00:00
The playbook will render/make effective this config file: `/etc/tuned/amdgpu-profile-vars.conf`
Here is a preview:
```ini
2024-08-27 11:54:55 +00:00
tuned_amdgpu_clock_min=500
tuned_amdgpu_clock_max=2715
tuned_amdgpu_memclock_static=1075
tuned_amdgpu_power_multi_def=0.869969040247678
tuned_amdgpu_power_multi_oc=1.0
tuned_amdgpu_mv_offset=+60
```
2024-08-27 11:54:55 +00:00
These are the result of [Variables](#Variables) below; changes outside of _Ansible_ are not immediately effective. Switching `tuned` profiles or restarting the service would be required.
2024-08-05 13:34:59 +00:00
One can use `gamemode` for dynamic switching. Sample `~/.config/gamemode.ini` below:
```ini
[custom]
start=tuned-adm profile latency-performance-amdgpu-overclock
end=tuned-adm profile latency-performance-amdgpu-default
```
See this [Arch Wiki](https://wiki.archlinux.org/title/Gamemode) link for more comprehensive information.
## Variables
These are the variables you'll want to change/consider.
2022-08-03 08:05:18 +00:00
2024-08-27 12:12:51 +00:00
| Variable | Description |
|------------------------|-------------|
2024-08-27 12:20:46 +00:00
| `tuned_amdgpu_clock_min` | Mininum **GPU** clock _(in `Mhz`)_ for `overclock` and `peak` profiles |
| `tuned_amdgpu_clock_max` | Maximum **GPU** clock _(in `MHz`)_ for `overclock` and `peak` profiles |
| `tuned_amdgpu_mv_offset` | GPU voltage _offset_. Takes `+/-` some integer in _millivolts_ to raise or lower. eg: `-25` for `0.025V` undervolt. |
| `tuned_amdgpu_power_multi_def` | Float between `0.0` _(none)_ and `1.0` _(full)_; effective power limit relative to _board capability_. For the `default` profiles |
| `tuned_amdgpu_power_multi_oc` | Instance of `tuned_amdgpu_power_multi_def` for `overclock` and `peak` profiles |
| `tuned_amdgpu_memclock_static` | _Static_ **memory** clock _(in `MHz`)_ for `overclock` and `peak` profiles.<br/><br/>Not the effective data rate _(multiplied by generation)_, but the actual clock. Static assignment avoids potential display flickering. |