kernel_samsung_sm7125

jenna

Author	SHA1	Message	Date
Jenna	919cc2ab49	defconfig: Enable Simple Low Memory Killer Change-Id: I6694b67c89c08e2521b054e3917a72fb6260f8df Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com> # Conflicts: # arch/arm64/configs/vendor/pixel_experience-a52q_defconfig # arch/arm64/configs/vendor/pixel_experience-a72q_defconfig	6 months ago
Sultan Alsawaf	ad0b65a783	defconfig: Enable CASS Change-Id: I6694b67c89c08e2521b054e3917a72fb6260f8df Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Alexander Winkowski	96dc6ec4a3	mm: Set default swappiness to 100	6 months ago
atndko	3c95d20818	drivers: android: Place a no-op PSI driver for SLMK Signed-off-by: atndko <z1281552865@gmail.com> Signed-off-by: Jebaitedneko <Jebaitedneko@gmail.com> Signed-off-by: Panchajanya1999 <rsk52959@gmail.com>	6 months ago
Sultan Alsawaf	7d96f7bacb	mm: Always indicate OOM kill progress when Simple LMK is enabled When Simple LMK is enabled, the page allocator slowpath always thinks that no OOM kill progress is made because out_of_memory() returns false. As a result, spurious page allocation failures are observed when memory is low and Simple LMK is killing tasks, simply because the page allocator slowpath doesn't think that any OOM killing is taking place. Fix this by simply making out_of_memory() always return true when Simple LMK is enabled. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	26ea3c9934	simple_lmk: Reap anonymous memory from victims The OOM reaper makes it possible to immediately release anonymous memory from a dying process in order to free up memory faster. This provides immediate relief under heavy memory pressure instead of waiting for victim processes to naturally release their memory. Utilize the OOM reaper by creating another kthread in Simple LMK to perform victim reaping. Similar to the OOM reaper kthread (which is unused with Simple LMK), this new kthread allows reaping to race with exit_mmap() in order to preclude the need to take a reference to an mm's address space and thus potentially mmput() an mm's last reference. Doing so would stall the reaper kthread, preventing it from being able to quickly reap new victims. Reaping is done on victims one at a time by descending order of anonymous pages, so that the most promising victims with the most anonymous pages are reaped first. Victims are also marked for reaping via MMF_OOM_VICTIM so that they reap themselves first in exit_mmap(). Even if a victim isn't reaped by the reaper thread, it'll free its anonymous memory first thing in exit_mmap() as a small win towards making memory available sooner. By relieving memory pressure faster via reaping, Simple LMK not only doesn't need to kill as many processes, but also improves system responsiveness when memory is low since memory pressure is relieved sooner. Although not strictly required, Simple LMK should be the only one utilizing the OOM reaper. Any other code that may utilize the OOM reaper, such as patches that invoke the OOM reaper for all SIGKILLs, should be disabled. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	0787646d32	simple_lmk: Reduce unnecessary wake ups We can check if the waitqueue is actually active before calling wake_up() in order to avoid an unnecessary wake_up() if the reclaim thread is already running. Furthermore, the release barrier when zeroing needs_reclaim is unnecessary, so remove it. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	6434ba1b52	simple_lmk: Ratelimit the 'no processes available to kill' message Under extreme simulated memory pressure, the 'no processes available to kill' message can be spammed hundreds of thousands of times, which is not productive. Ratelimit it. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	4736a011ba	simple_lmk: Fix victim scheduling priority elevation As it turns out, victim scheduling priority elevation has always been broken for two reasons: 1. The minimum valid RT priority is 1, not 0. As a result, sched_setscheduler_nocheck() always fails with -EINVAL. 2. The thread within a victim thread group which happens to hold the mm is not necessarily the only thread with references to the mm, and isn't necessarily the thread which will release the final mm reference. As a result, victim threads which hold mm references may take a while to release them, and the unlucky thread which puts the final mm reference may take a very long time to release all memory if it doesn't have RT scheduling priority. These issues cause victims to often take a very long time to release their memory, possibly up to several seconds depending on system load. This, in turn, causes Simple LMK to constantly hit the reclaim timeout and kill more processes, with Simple LMK being rather ineffective since victims may not release any memory for several seconds. Fix the broken scheduling priority elevation by changing the RT priority to the valid lowest priority of 1 and applying it to all threads in the thread group, instead of just the thread which holds the mm. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	2e24da16e2	simple_lmk: Thaw victims upon killing them With freezable cgroups and their recent utilization in Android, it's possible for some of Simple LMK's victims to be frozen at the time that they're selected for killing. The forced SIGKILL used for killing victims can only wake up processes containing TASK_WAKEKILL and/or TASK_INTERRUPTIBLE, not TASK_UNINTERRUPTIBLE, which is the state used on frozen tasks. In order to wake frozen tasks from their uninterruptible slumber so that they can die, we must thaw them. Leaving victims frozen can otherwise make them take an indefinite amount of time to process our SIGKILL and thus free memory. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	88133c3016	simple_lmk: Make the reclaim thread freezable There are two problems with the current uninterruptible wait used in the reclaim thread: the hung task detector is upset about an uninterruptible thread being asleep for so long, and killing processes can generate I/O. Since killing a process can generate I/O, the reclaim thread should participate in system-wide suspend operations. This neatly solves the hung task detector issue since wait_event_freezable() puts the current process into an interruptible sleep. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	3e49dbc48e	simple_lmk: Be extra paranoid if tasks can have no pages If it's possible for a task to have no pages, then there could be a case where `pages_found` is zero while `nr_found` isn't, which would cause the found tasks' locks to never be unlocked, and thus mayhem. We can change the `pages_found` check to use `nr_found` instead in order to naturally defend against this scenario, in case it is indeed possible. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	6033cfd36e	mm: Increment kswapd_waiters for throttled direct reclaimers Throttled direct reclaimers will wake up kswapd and wait for kswapd to satisfy their page allocation request, even when the failed allocation lacks the __GFP_KSWAPD_RECLAIM flag in its gfp mask. As a result, kswapd may think that there are no waiters and thus exit prematurely, causing throttled direct reclaimers lacking __GFP_KSWAPD_RECLAIM to stall on waiting for kswapd to wake them up. Incrementing the kswapd_waiters counter when such direct reclaimers become throttled fixes the problem. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Jens Axboe	0068768b2a	buffer: eliminate the need to call free_more_memory() in __getblk_slow() Since the previous commit removed any case where grow_buffers() would return failure due to memory allocations, we can safely remove the case where we have to call free_more_memory() in this function. Since this is also the last user of free_more_memory(), kill it off completely. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Jan Kara <jack@suse.cz> Signed-off-by: Jens Axboe <axboe@kernel.dk>	6 months ago
Jens Axboe	746c1fb825	buffer: grow_dev_page() should use __GFP_NOFAIL for all cases We currently use it for find_or_create_page(), which means that it cannot fail. Ensure we also pass in 'retry == true' to alloc_page_buffers(), which also ensure that it cannot fail. After this, there are no failure cases in grow_dev_page() that occur because of a failed memory allocation. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Jan Kara <jack@suse.cz> Signed-off-by: Jens Axboe <axboe@kernel.dk>	6 months ago
Jens Axboe	1b2b54982a	buffer: have alloc_page_buffers() use __GFP_NOFAIL Instead of adding weird retry logic in that function, utilize __GFP_NOFAIL to ensure that the vm takes care of handling any potential retries appropriately. This means we don't have to call free_more_memory() from here. Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Jan Kara <jack@suse.cz> Signed-off-by: Jens Axboe <axboe@kernel.dk>	6 months ago
Sultan Alsawaf	26276a19c5	mm: vmpressure: Fix rampant inaccuracies caused by stale data usage After a period of intense memory pressure is over, it's common for vmpressure to still have old reclaim efficiency data accumulated from this time. When memory pressure starts to rise again, this stale data will factor into vmpressure's calculations, and can cause vmpressure to report an erroneously high pressure. The reverse is possible, too: vmpressure may report pressures that are erroneously low due to stale data that's been stored. Furthermore, since kswapd can still be performing reclaim when there are no failed memory allocations stuck in the page allocator's slow path, vmpressure may still report pressures when there aren't any memory allocations to satisfy. This can cause last-resort memory reclaimers to kill processes to free memory when it's not needed. To fix the rampant stale data, keep track of when there are processes utilizing reclaim in the page allocator's slow path, and reset the accumulated data in vmpressure when a new period of elevated memory pressure begins. Extra measures are taken for the kswapd issue mentioned above by ignoring all reclaim efficiency data reported by kswapd when there aren't any failed memory allocations in the page allocator which utilize reclaim. Note that since sr_lock can now be used from IRQ context, IRQs must be disabled whenever sr_lock is used to prevent deadlocks. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	faf978cab1	mm: vmpressure: Fix a race that would erroneously clear accumulated data Since the code that determines whether data should be cleared and the code that actually clears the data are in separate spin-locked critical sections, new data could be generated on another CPU after it is determined that the existing data should be cleared, but before the current CPU clears the existing data. This would cause the new data reported by the other CPU to be lost. Fix the race by clearing accumulated data within the same spin-locked critical section that determines whether or not data should be cleared. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	b40c4e120b	mm: vmpressure: Ignore costly-order allocations for direct reclaim too The direct reclaim vmpressure path was erroneously excluded from the PAGE_ALLOC_COSTLY_ORDER check which was added in commit "mm: vmpressure: Ignore allocation orders above PAGE_ALLOC_COSTLY_ORDER". Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	db5c9632ef	simple_lmk: Optimize victim finder to eliminate hard-coded adj ranges Hard-coding adj ranges to search for victims results in a few problems. Firstly, the hard-coded adjs must be vigilantly updated to match what userspace uses, which makes long-term support a headache. Secondly, a full traversal of every running process must be done for each adj range, which can turn out to be quite expensive, especially if userspace assigns many different adj values and we want to enumerate them all. This leads us to the final problem, which is that processes with different adjs within the same hard-coded adj range will be treated the same, even though they're not: the process with a higher adj is less important, and the process with a lower adj is more important. This could be fixed by enumerating every possible adj, but again, that would necessitate several scans through the active process list, which is bad for performance, especially since latency is critical here. Since adjs are only 16 bits, and we only care about positive adjs, that leaves us with 15 bits of the adj that matter. This is a relatively small number of potential adjs (32,768), which makes it possible to allocate a static array that's indexed using the adj. Each entry in this array is a pointer to the first task_struct in a singly-linked list of task_structs sharing an adj. A `simple_lmk_next` member is added to task_struct to accommodate this linked list. The victim finder now iterates downward through the array searching for linked lists of tasks, starting from the highest adj found, so that the lowest-priority processes are always considered first for reclaim. This fixes all of the problems mentioned above, and now there is only one traversal through every running process. The array itself only takes up 256 KiB of memory on 64-bit, which is a very small price to pay for the advantages gained. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	9c24532fe5	simple_lmk: Cacheline-align the victims array and mm_free_lock on SMP The victims array and mm_free_lock data structures can be used very heavily in parallel on SMP, in which case they would benefit from being cacheline-aligned. Make it so for SMP. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	e5c2a8ab88	simple_lmk: Pass a custom swap function to sort() When sort() isn't provided with a custom swap function, it falls back onto its generic implementation of just swapping one byte at a time, which is quite slow. Since we know the type of the objects being sorted, we can provide our own swap function which simply uses the swap() macro. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	308878a60e	simple_lmk: Skip victim reduction when all victims need to be killed When there aren't enough pages found, it means all of the victims that were found need to be killed. The additional processing that attempts to reduce the number of victims can be skipped in this case. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	364ce3de5f	simple_lmk: Use MIN_FREE_PAGES wherever pages_needed is used There's no reason to pass this constant around in a parameter. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	eaa09aa65b	simple_lmk: Don't block in simple_lmk_mm_freed() on mm_free_lock When the mm_free_lock write lock is held, it means that reclaim is either starting or ending, in which case there's nothing that needs to be done in simple_lmk_mm_freed(). We can use a trylock here instead to avoid blocking. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	9bf77d7a45	mm: vmpressure: Don't export tunables to userspace Userspace could change these tunables and make Simple LMK function poorly. Don't export them. Reported-by: attack11 <fernandobouchet@gmail.com> Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	a704ba308e	simple_lmk: Update Kconfig description for VM pressure change Simple LMK uses VM pressure now, not a kswapd hook like before. Update the Kconfig description to reflect such. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	391413bab8	simple_lmk: Add !PSI dependency When PSI is enabled, lmkd in userspace will use PSI notifications to perform low memory kills. Therefore, to ensure that Simple LMK is the only active LMK implementation, add a !PSI dependency. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	ca5f1e0010	simple_lmk: Print a message when the timeout is reached This aids in selecting an adequate timeout. If the timeout is hit often and Simple LMK is killing too much, then the timeout should be lengthened. If the timeout is rarely hit and Simple LMK is not killing fast enough under pressure, then the timeout should be shortened. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
NeilBrown	dc3ed77c42	VFS: use synchronize_rcu_expedited() in namespace_unlock() The synchronize_rcu() in namespace_unlock() is called every time a filesystem is unmounted. If a great many filesystems are mounted, this can cause a noticable slow-down in, for example, system shutdown. The sequence: mkdir -p /tmp/Mtest/{0..5000} time for i in /tmp/Mtest/; do mount -t tmpfs tmpfs $i ; done time umount /tmp/Mtest/ on a 4-cpu VM can report 8 seconds to mount the tmpfs filesystems, and 100 seconds to unmount them. Boot the same VM with 1 CPU and it takes 18 seconds to mount the tmpfs filesystems, but only 36 to unmount. If we change the synchronize_rcu() to synchronize_rcu_expedited() the umount time on a 4-cpu VM drop to 0.6 seconds I think this 200-fold speed up is worth the slightly high system impact of using synchronize_rcu_expedited(). Acked-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com> (from general rcu perspective) Signed-off-by: NeilBrown <neilb@suse.com> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>	6 months ago
Sultan Alsawaf	40f4bdb561	simple_lmk: Remove unnecessary clean-up when timeout is reached Zeroing out the mm struct pointers when the timeout is hit isn't needed because mm_free_lock prevents any readers from accessing the mm struct pointers while clean-up occurs, and since the simple_lmk_mm_freed() loop bound is set to zero during clean-up, there is no possibility of dying processes ever reading stale mm struct pointers. Therefore, it is unnecessary to clear out the mm struct pointers when the timeout is reached. Now the only step to do when the timeout is reached is to re-init the completion, but since reinit_completion() just sets a struct member to zero, call reinit_completion() unconditionally as it is faster than encapsulating it within a conditional statement. Also take this opportunity to rename some variables and tidy up some code indentation. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	161ca84343	simple_lmk: Hold an RCU read lock instead of the tasklist read lock We already check to see if each eligible process isn't already dying, so an RCU read lock can be used to speed things up instead of holding the tasklist read lock. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	8375786e2b	mm: Don't stop kswapd on a per-node basis when there are no waiters The page allocator wakes all kswapds in an allocation context's allowed nodemask in the slow path, so it doesn't make sense to have the kswapd- waiter count per each NUMA node. Instead, it should be a global counter to stop all kswapds when there are no failed allocation requests. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	f8ea27c696	simple_lmk: Consider all positive adjs when finding victims We are allowed to kill any process with a positive adj, so we shouldn't exclude any processes with adjs greater than 999. This would present a problem with quirky applications that set their own adj score, such as stress-ng. In the case of stress-ng, it would set its adj score to 1000 and thus exempt itself from being killed by Simple LMK. This shouldn't be allowed; any process with a positive adj, up to the highest positive adj possible (32767) should be killable. Reported-by: Danny Lin <danny@kdrag0n.dev> Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	0bdf42f8f9	mm: vmpressure: Ignore allocation orders above PAGE_ALLOC_COSTLY_ORDER PAGE_ALLOC_COSTLY_ORDER allocations can cause vmpressure to incorrectly think that memory pressure is high, when it's really just that the allocation's high order is difficult to satisfy. When this rare scenario occurs, ignore the input to vmpressure to avoid sending out a spurious high-pressure signal. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	a806cc4251	mm: Don't warn on page allocation failures for OOM-killed processes It can be normal for a dying process to have its page allocation request fail when it has an OOM or LMK kill pending. In this case, it's actually detrimental to print out a massive allocation failure message because this means the running process needs to die quickly and release its memory, which is slowed down slightly by the massive kmsg splat. The allocation failure message is also a false positive in this case, since the failure is intentional rather than being the result of an inability to allocate memory. Suppress the allocation failure warning for processes that are killed to release memory in order to expedite their death and remedy the kmsg confusion from seeing spurious allocation failure messages. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	b4a1ee655f	mm: Adjust tsk_is_oom_victim() for Simple LMK The page allocator uses tsk_is_oom_victim() to determine when to fast-path memory allocations in order to get an allocating process out of the page allocator and into do_exit() quickly. Unfortunately, tsk_is_oom_victim()'s check to see if a process is killed for OOM purposes is to look for the presence of an OOM reaper artifact that only the OOM killer sets. This means that for processes killed by Simple LMK, there is no fast-pathing done in the page allocator to get them to die faster. Remedy this by changing tsk_is_oom_victim() to look for the existence of the TIF_MEMDIE flag, which Simple LMK sets for its victims. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	db3710af8b	mm: vmpressure: Don't cache the window size Caching the window size can result in delayed or inaccurate pressure reports. Since calculating a fresh window size is cheap, do so all the time instead of relying on a stale, cached value. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	0d4eb1e86e	mm: vmpressure: Interpret zero scanned pages as 100% pressure When no pages are scanned, it usually means no zones were reclaimable and nothing could be done. In this case, the reported pressure should be 100 to elicit help from any listeners. This fixes the vmpressure framework not working when memory pressure is very high. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	759c9e864c	mm: vmpressure: Don't exclude any allocation types Although userspace processes can't directly help with kernel memory pressure, killing userspace processes can relieve kernel memory if they are responsible for that pressure in the first place. It doesn't make sense to exclude any allocation types knowing that userspace can indeed affect all memory pressure, so don't exclude any allocation types from the pressure calculations. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	f41a44f0f9	simple_lmk: Update adj targeting for Android 10 Android 10 changed its adj assignments. Update Simple LMK to use the new adjs, which also requires looking at each pair of adjs as a range. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	3489929420	simple_lmk: Use vmpressure notifier to trigger kills Using kswapd's scan depth to trigger task kills is inconsistent and unreliable. When memory pressure quickly spikes, the kswapd scan depth trigger fails to kick off Simple LMK fast enough, causing severe lag. Additionally, kswapd could stop scanning prematurely before reaching the desired scan depth to trigger Simple LMK, which could also cause stalls. To remedy this, use the vmpressure framework instead, since it provides more consistent and accurate readings on memory pressure. This is not very tunable though, so remove CONFIG_ANDROID_SIMPLE_LMK_AGGRESSION. Triggering Simple LMK to kill when the reported memory pressure is 100 should yield good results on all setups. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Vinayak Menon	d760fac84a	mm: vmpressure: make vmpressure window variable Right now the vmpressure window is of constant size 2MB, which works well with the following exceptions. 1) False vmpressure triggers are seen when the RAM size is greater than 3GB. This results in lowmemorykiller, which uses vmpressure events, killing tasks unnecessarily. 2) Vmpressure events are received late under memory pressure. This behaviour is seen prominently in <=2GB RAM targets. This results in lowmemorykiller kicking in late to kill tasks resulting in avoidable page cache reclaim. The problem analysis shows that the issue is with the constant size of the vmpressure window which does not adapt to the varying memory conditions. This patch recalculates the vmpressure window size at the end of each window. The chosen window size is proportional to the total of free and cached memory at that point. Change-Id: I7e9ef4ddd82e2c2dd04ce09ec8d58a8829cfb64d Signed-off-by: Vinayak Menon <vinmenon@codeaurora.org>	6 months ago
Vinayak Menon	ab6833c737	mm: vmpressure: account allocstalls only on higher pressures At present any vmpressure value is scaled up if the pages are reclaimed through direct reclaim. This can result in false vmpressure values. Consider a case where a device is booted up and most of the memory is occuppied by file pages. kswapd will make sure that high watermark is maintained. Now when a sudden huge allocation request comes in, the system will definitely have to get into direct reclaims. The vmpressures can be very low, but because of allocstall accounting logic even these low values will be scaled to values nearing 100. This can result in unnecessary LMK kills for example. So define a tunable threshold for vmpressure above which the allocstalls will be accounted. Change-Id: Idd7c6724264ac89f1f68f2e9d70a32390ffca3e5 Signed-off-by: Vinayak Menon <vinmenon@codeaurora.org>	6 months ago
Vinayak Menon	5821b4a78e	mm: vmpressure: scale pressure based on reclaim context The existing calculation of vmpressure takes into account only the ratio of reclaimed to scanned pages, but not the time spent or the difficulty in reclaiming those pages. For e.g. when there are quite a number of file pages in the system, an allocation request can be satisfied by reclaiming the file pages alone. If such a reclaim is successful, the vmpressure value will remain low irrespective of the time spent by the reclaim code to free up the file pages. With a feature like lowmemorykiller, killing a task can be faster than reclaiming the file pages alone. So if the vmpressure values reflect the reclaim difficulty level, clients can make a decision based on that, for e.g. to kill a task early. This patch monitors the number of pages scanned in the direct reclaim path and scales the vmpressure level according to that. Signed-off-by: Vinayak Menon <vinmenon@codeaurora.org> Change-Id: I6e643d29a9a1aa0814309253a8b690ad86ec0b13	6 months ago
Vinayak Menon	c74c5745b3	mm: vmpressure: allow in-kernel clients to subscribe for events Currently, vmpressure is tied to memcg and its events are available only to userspace clients. This patch removes the dependency on CONFIG_MEMCG and adds a mechanism for in-kernel clients to subscribe for vmpressure events (in fact raw vmpressure values are delivered instead of vmpressure levels, to provide clients more flexibility to take actions on custom pressure levels which are not currently defined by vmpressure module). Change-Id: I38010f166546e8d7f12f5f355b5dbfd6ba04d587 Signed-off-by: Vinayak Menon <vinmenon@codeaurora.org>	6 months ago
David Ng	a5e9a54acc	mm, vmpressure: int cast vmpressure level/model for -1 comparison Resolve -Wenum-compare issue when comparing vmpressure level/model against -1 (invalid state). Change-Id: I1c76667ee8390e2d396c96e5ed73f30d0700ffa8 Signed-off-by: David Ng <dave@codeaurora.org>	6 months ago
Sultan Alsawaf	56ed6510d9	mm: Stop kswapd early when nothing's waiting for it to free pages Keeping kswapd running when all the failed allocations that invoked it are satisfied incurs a high overhead due to unnecessary page eviction and writeback, as well as spurious VM pressure events to various registered shrinkers. When kswapd doesn't need to work to make an allocation succeed anymore, stop it prematurely to save resources. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	c375dad93b	simple_lmk: Include swap memory usage in the size of victims Swap memory usage is important when determining what to kill, so include it in the victim size calculation. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago
Sultan Alsawaf	f52583d0f0	simple_lmk: Relax memory barriers and clean up some styling wake_up() executes a full memory barrier when waking a process up, so there's no need for the acquire in the wait event. Additionally, because of this, the atomic_cmpxchg() only needs a read barrier. The cmpxchg() in simple_lmk_mm_freed() is atomic when it doesn't need to be, so replace it with an extra line of code. The atomic_inc_return() in simple_lmk_mm_freed() lies within a lock, so it doesn't need explicit memory barriers. Signed-off-by: Sultan Alsawaf <sultan@kerneltoast.com>	6 months ago

1 2 3 4 5 ...

762081 Commits (919cc2ab499e7ea153a37171126cecb1487ed691) All Branches Search

762081 Commits (919cc2ab499e7ea153a37171126cecb1487ed691)

All Branches