2024-03-28 12:19:56 |
dann frazier |
description |
Initial support for multi-size THP landed upstream in v6.8. In the 6.9 merge window, 2 other series have landed that show significant performance improvements on arm64
mm/memory: optimize fork() with PTE-mapped THP
https://lkml.iu.edu/hypermail/linux/kernel/2401.3/02766.html
Transparent Contiguous PTEs for User Mappings:
https://lwn.net/Articles/962330/
On an Ampere AltraMax system w/ 4K page size, kernel builds in a tmpfs are reduced from 6m30s to 5m17s, a ~19% improvement. |
Initial support for multi-size THP landed upstream in v6.8. In the 6.9 merge window, 2 other series have landed that show significant performance improvements on arm64
mm/memory: optimize fork() with PTE-mapped THP
https://lkml.iu.edu/hypermail/linux/kernel/2401.3/02766.html
Transparent Contiguous PTEs for User Mappings:
https://lwn.net/Articles/962330/
On an Ampere AltraMax system w/ 4K page size, kernel builds in a tmpfs are reduced from 6m30s to 5m17s, a ~19% improvement.
It has been reported that this can have a *10x* improvement for certain GPU workloads on ARM:
https://lwn.net/Articles/954094/ |
|