Limit memory usage for a single Linux process

I’m running pdftoppm to convert a user-provided PDF into a 300DPI image. This works great, except if the user provides an PDF with a very large page size. pdftoppm will allocate enough memory to hold a 300DPI image of that size in memory, which for a 100 inch square page is 100*300 * 100*300 * 4 bytes per pixel = 3.5GB. A malicious user could just give me a silly-large PDF and cause all kinds of problems.

So what I’d like to do is put some kind of hard limit on memory usage for a child process I’m about to run–just have the process die if it tries to allocate more than, say, 500MB of memory. Is that possible?

I don’t think ulimit can be used for this, but is there a one-process equivalent?

Asked By: Ben Dilts

||

If your process doesn’t spawn more children that consume the most memory, you may use setrlimit function. More common user interface for that is using ulimit command of the shell:

$ ulimit -Sv 500000     # Set ~500 mb limit
$ pdftoppm ...

This will only limit “virtual” memory of your process, taking into account—and limiting—the memory the process being invoked shares with other processes, and the memory mapped but not reserved (for instance, Java’s large heap). Still, virtual memory is the closest approximation for processes that grow really large, making the said errors insignificant.

If your program spawns children, and it’s them which allocate memory, it becomes more complex, and you should write auxiliary scripts to run processes under your control. I wrote in my blog, why and how.

Answered By: P Shved

There’s some problems with ulimit. Here’s a useful read on the topic: Limiting time and memory consumption of a program in Linux, which lead to the timeout tool, which lets you cage a process (and its forks) by time or memory consumption.

The timeout tool requires Perl 5+ and the /proc filesystem mounted. After that you copy the tool to e.g. /usr/local/bin like so:

curl https://raw.githubusercontent.com/pshved/timeout/master/timeout | 
  sudo tee /usr/local/bin/timeout && sudo chmod 755 /usr/local/bin/timeout

After that, you can ‘cage’ your process by memory consumption as in your question like so:

timeout -m 500 pdftoppm Sample.pdf

Alternatively you could use -t <seconds> and -x <hertz> to respectively limit the process by time or CPU constraints.

The way this tool works is by checking multiple times per second if the spawned process has not oversubscribed its set boundaries. This means there actually is a small window where a process could potentially be oversubscribing before timeout notices and kills the process.

A more correct approach would hence likely involve cgroups, but that is much more involved to set up, even if you’d use Docker or runC, which among things, offer a more user-friendly abstraction around cgroups.

Answered By: kvz

Another way to limit this is to use Linux’s control groups. This is especially useful if you want to limit a process’s (or group of processes’) allocation of physical memory distinctly from virtual memory. For example:

cgcreate -g memory:myGroup
echo 500M > /sys/fs/cgroup/memory/myGroup/memory.limit_in_bytes
echo 5G > /sys/fs/cgroup/memory/myGroup/memory.memsw.limit_in_bytes

will create a control group named myGroup, cap the set of processes run under myGroup up to 500 MB of physical memory with memory.limit_in_bytes and up to 5000 MB of physical and swap memory together with memory.memsw.limit_in_bytes.
More info about these options can be found here: https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/6/html/resource_management_guide/sec-memory

To run a process under the control group:

cgexec -g memory:myGroup pdftoppm

Note that on a modern Ubuntu distribution this example requires installing the cgroup-tools package (previously cgroup-bin):

sudo apt install cgroup-tools

and editing /etc/default/grub to change GRUB_CMDLINE_LINUX_DEFAULT to:

GRUB_CMDLINE_LINUX_DEFAULT="cgroup_enable=memory swapaccount=1"

and then running sudo update-grub and rebooting to boot with the new kernel boot parameters.

Answered By: user65369

In addition to the tools from daemontools,
suggested by Mark Johnson,
you can also consider chpst, which is found in runit
Runit itself is bundled in busybox,
so you might already have it installed.

The man page of chpst shows the option:

-m bytes

    limit memory. 
    Limit the data segment, stack segment, locked physical pages,
    and total of all segment per process to bytes bytes each.

Answered By: oz123

As of 2022 / Ubuntu 22.04 the below script is obsolete. Ubuntu 22.04 no longer mounts cgroups v1 by default, and systemd-run now supports everything needed. The command to run a program with a hard memory limit is

systemd-run --user --scope -p MemoryMax=<memorylimit> 
  -p MemorySwapMax=<swaplimit> <command>
  • Note that memory and swap have separate limits, unlike the memory.memsw.* control files in cgroups v1 which controlled the total amount of memory + swap used. I have so far not found a way to set a limit on the combined memory + swap.

  • There is also a MemoryHigh parameter which is less strict than MemoryMax. It won’t kill the processes but starts to throttle them and agressively swap out memory.

The below script can be adapted to run on cgroups v2 as per Ciro Santilli’s answer, but with systemd-run now doing everything necessary there is no need to. I have a script similar to that from my original answer that works with systemd-run posted in a new answer.


Original answer:

I’m using the below script, which works great. It uses cgroups through cgmanager. Update: it now uses the commands from cgroup-tools. Name this script limitmem and put it in your $PATH and you can use it like limitmem 100M bash. This will limit both memory and swap usage. To limit just memory remove the line with memory.memsw.limit_in_bytes.

edit: On default Linux installations this only limits memory usage, not swap usage. To enable swap usage limiting, you need to enable swap accounting on your Linux system. Do that by setting/adding swapaccount=1 in /etc/default/grub so it looks something like

GRUB_CMDLINE_LINUX="swapaccount=1"

Then run sudo update-grub and reboot.

Disclaimer: I wouldn’t be surprised if cgroup-tools also breaks in the future. The correct solution would be to use the systemd api’s for cgroup management but there are no command line tools for that a.t.m.

edit (2021): Until now this script still works, but it goes against Linux’s recommendation to have a single program manage your cgroups. Nowadays that program is usually systemd. Unfortunately systemd has a number of limitations that make it difficult to replace this script with systemd invocations. The systemd-run --user command should allow a user to run a program with resource limitations, but that isn’t supported on cgroups v1. (Everyone uses cgroups v1 because docker doesn’t work on cgroupsv2 yet except for the very latest versions.) With root access (which this script also requires) it should be possible to use systemd-run to create the correct systemd-supported cgroups, and then manually set the memory and swap properties in the right cgroup, but that is still to be implemented. See also this bug comment for context, and here and here for relevant documentation.

According to @Mikko’s comment using a script like this with systemd runs the risk of systemd losing track of processes in a sessions. I haven’t noticed such problems, but I use this script mostly on a single-user machine.

#!/bin/sh

# This script uses commands from the cgroup-tools package. The cgroup-tools commands access the cgroup filesystem directly which is against the (new-ish) kernel's requirement that cgroups are managed by a single entity (which usually will be systemd). Additionally there is a v2 cgroup api in development which will probably replace the existing api at some point. So expect this script to break in the future. The correct way forward would be to use systemd's apis to create the cgroups, but afaik systemd currently (feb 2018) only exposes dbus apis for which there are no command line tools yet, and I didn't feel like writing those.

# strict mode: error if commands fail or if unset variables are used
set -eu

if [ "$#" -lt 2 ]
then
    echo Usage: `basename $0` "<limit> <command>..."
    echo or: `basename $0` "<memlimit> -s <swaplimit> <command>..."
    exit 1
fi

cgname="limitmem_$$"

# parse command line args and find limits

limit="$1"
swaplimit="$limit"
shift

if [ "$1" = "-s" ]
then
    shift
    swaplimit="$1"
    shift
fi

if [ "$1" = -- ]
then
    shift
fi

if [ "$limit" = "$swaplimit" ]
then
    memsw=0
    echo "limiting memory to $limit (cgroup $cgname) for command $@" >&2
else
    memsw=1
    echo "limiting memory to $limit and total virtual memory to $swaplimit (cgroup $cgname) for command $@" >&2
fi

# create cgroup
sudo cgcreate -g "memory:$cgname"
sudo cgset -r memory.limit_in_bytes="$limit" "$cgname"
bytes_limit=`cgget -g "memory:$cgname" | grep memory.limit_in_bytes | cut -d  -f2`

# try also limiting swap usage, but this fails if the system has no swap
if sudo cgset -r memory.memsw.limit_in_bytes="$swaplimit" "$cgname"
then
    bytes_swap_limit=`cgget -g "memory:$cgname" | grep memory.memsw.limit_in_bytes | cut -d  -f2`
else
    echo "failed to limit swap"
    memsw=0
fi

# create a waiting sudo'd process that will delete the cgroup once we're done. This prevents the user needing to enter their password to sudo again after the main command exists, which may take longer than sudo's timeout.
tmpdir=${XDG_RUNTIME_DIR:-$TMPDIR}
tmpdir=${tmpdir:-/tmp}
fifo="$tmpdir/limitmem_$$_cgroup_closer"
mkfifo --mode=u=rw,go= "$fifo"
sudo -b sh -c "head -c1 '$fifo' >/dev/null ; cgdelete -g 'memory:$cgname'"

# spawn subshell to run in the cgroup. If the command fails we still want to remove the cgroup so unset '-e'.
set +e
(
set -e
# move subshell into cgroup
sudo cgclassify -g "memory:$cgname" --sticky `sh -c 'echo $PPID'`  # $$ returns the main shell's pid, not this subshell's.
exec "$@"
)

# grab exit code 
exitcode=$?

set -e

# show memory usage summary

peak_mem=`cgget -g "memory:$cgname" | grep memory.max_usage_in_bytes | cut -d  -f2`
failcount=`cgget -g "memory:$cgname" | grep memory.failcnt | cut -d  -f2`
percent=`expr "$peak_mem" / ( "$bytes_limit" / 100 )`

echo "peak memory used: $peak_mem ($percent%); exceeded limit $failcount times" >&2

if [ "$memsw" = 1 ]
then
    peak_swap=`cgget -g "memory:$cgname" | grep memory.memsw.max_usage_in_bytes | cut -d  -f2`
    swap_failcount=`cgget -g "memory:$cgname" |grep memory.memsw.failcnt | cut -d  -f2`
    swap_percent=`expr "$peak_swap" / ( "$bytes_swap_limit" / 100 )`

    echo "peak virtual memory used: $peak_swap ($swap_percent%); exceeded limit $swap_failcount times" >&2
fi

# remove cgroup by sending a byte through the pipe
echo 1 > "$fifo"
rm "$fifo"

exit $exitcode
Answered By: JanKanis

I’m running Ubuntu 18.04.2 LTS and JanKanis script doesn’t work for me quite as he suggests. Running limitmem 100M script is limiting 100MB of RAM with unlimited swap.

Running limitmem 100M -s 100M script fails silently as cgget -g "memory:$cgname" has no parameter named memory.memsw.limit_in_bytes.

So I disabled swap:

# create cgroup
sudo cgcreate -g "memory:$cgname"
sudo cgset -r memory.limit_in_bytes="$limit" "$cgname"
sudo cgset -r memory.swappiness=0 "$cgname"
bytes_limit=`cgget -g "memory:$cgname" | grep memory.limit_in_bytes | cut -d  -f2`
Answered By: d9ngle

On any systemd-based distro you can also use cgroups indirectly through systemd-run. E.g. for your case of limiting pdftoppm to 500M of RAM, starting with cgroupsv2, you can simply do:

systemd-run --scope -p MemoryMax=500M --user pdftoppm

Previously, this required booting with the systemd.unified_cgroup_hierarchy kernel parameter, but tested as of Ubuntu 22.04 cgroup-tools 2.0-2, this does not seem to be the case anymore, the command just worked without any changes to the kernel parameters, and systemd.unified_cgroup_hierarchy is not set.

Before cgroupsv2 you could not use --user, and would instead run:

systemd-run --scope -p MemoryMax=500M pdftoppm

but without --user, this will ask you for a password every time, even though the app gets launched as your user. Do not allow this to delude you into thinking that the command needs sudo, because that would cause the command to run under root, which was hardly your intention.

Answered By: Hi-Angel

Not really an answer to the question as posed, but:

Could you check the file-size, to prevent issues BEFORE trying to process a pdf? That would remove the “ridiculously large” issue.

There are also programs that will process a pdf (there are python programs, for instance: http://theautomatic.net/2020/01/21/how-to-read-pdf-files-with-python/) whereby one could split the pdf into more manageable-sized chunks. Or do both: if the file-size is reasonable, process it; otherwise (else) split it into as many pieces as required, and process those. One could then re-combine the outputs. One might need to have some overlap between sections to prevent “border” issues.

Limiting the available memory might well force a failure to process larger files, or lead to massive memory swap issues.

Answered By: John Beck

cgroupsv2 update (Ubuntu 22.04)

Things have moved around a bit. Compared to https://unix.stackexchange.com/a/125024/32558 now you now need:

sudo cgcreate -a $USER:$USER -g memory:myGroup -t $USER:$USER
sudo cgset -r memory.max=500M myGroup
sudo cgset -r memory.swap.max=0 myGroup
sudo chmod o+w /sys/fs/cgroup/cgroup.procs
cgexec -g memory:myGroup mycmd arg0 arg1

The line:

sudo chmod o+w /sys/fs/cgroup/cgroup.procs

is needed for it to work without sudo: https://askubuntu.com/questions/1406329/how-to-run-cgexec-without-sudo-as-current-user-on-ubuntu-22-04-with-cgroups-v2/1450845#1450845 otherwise it fails with:

cgroup change of group failed

If you don’t run that command, you can also use:

sudo cgexec -g memory:myGroup mycmd arg0 arg1

but then that runs as root, which you usually don’t want it to do, this can be tested with sudo cgexec -g memory:myGroup id.

Compared to https://unix.stackexchange.com/a/125024/32558 which was originally for v1:

  • /sys/fs/cgroup/memory/myGroup/memory.limit_in_bytes is now /sys/fs/cgroup/myGroup/memory.max
  • /sys/fs/cgroup/memory/myGroup/memory.memsw.limit_in_bytes was split, now you just set the swap separatelly in /sys/fs/cgroup/myGroup/memory.swap.max rather than the sum

For the specific case of memory however, just use systemd-run as mentioned at: https://unix.stackexchange.com/a/536046/32558 that just worked and is by far the simplest approach.

Testing it out

malloc_touch.c

#include <assert.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    size_t nbytes, step;
    if (argc > 1) {
        nbytes = strtoull(argv[1], NULL, 0);
    } else {
        nbytes = 0x10;
    }
    if (argc > 2) {
        step = strtoull(argv[2], NULL, 0);
    } else {
        step = 1;
    }

    char *base = malloc(nbytes);
    assert(base);
    char *i = base;
    while (i < base + nbytes) {
        *i = 13;
        i += step;
    }
    return EXIT_SUCCESS;
}

GitHub upstream.

First we find that 1M is about the minimum at which a C hello world will run:

sudo cgset -r memory.max=1M myGroup
cgexec -g memory:myGroup ./malloc_touch.out

Then starting from there we can try to malloc 100k:

sudo cgset -r memory.max=1M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 100000

OK, there was still some left. Then if try 1M:

sudo cgset -r memory.max=1M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 1000000

Killed as expected. Increase limit to 10M:

sudo cgset -r memory.max=10M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 1000000

OK. Malloc 9M:

sudo cgset -r memory.max=10M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 9000000

OK. Malloc 10M:

sudo cgset -r memory.max=10M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 10000000

OK. Humm, not sure why, expected it to die. MiB vs MB? Try 11M

sudo cgset -r memory.max=10M myGroup
cgexec -g memory:myGroup ./malloc_touch.out 11000000

Killed as expected.

Tested on Ubuntu 22.10.

cgcreate is completely broken on Ubuntu 21.10

https://askubuntu.com/questions/1376093/is-cgroup-tools-using-cgroup-v1-or-v2

Fails with:

cgcreate: libcgroup initialization failed: Cgroup is not mounted

Apparently they moved part of the system to v2 but not the other.

Ubuntu 22.04
To resolve it you need to boot your host system into CGroupV1 mode by modifying your kernel’s boot arguments to include:
systemd.unified_cgroup_hierarchy=false.

sudo nano /etc/default/grub

(or other editor of your choice). 
Add/change

GRUB_CMDLINE_LINUX="cgroup_enable=memory swapaccount=1 systemd.unified_cgroup_hierarchy=false"

in the file.

sudo update-grub
sudo reboot
Answered By: renbuar

While this isn’t what the OP originally asked for, but for the sake of completeness, for processes that are already running, I use prlimit

E.G. $ prlimit -v1073741824 -pid <xx>

Sets the max limit for memory to 1 Gibibyte. One can set both soft limits (and carry out custom actions such as email) and hard limits.

Answered By: 0xc0de

I also like to limit the memory of some of my programs, so I have an updated script from my previous answer that works with systemd as the cgroup manager. In order not to clutter the other answer, I’ll add it here as a new answer.

Usage: limitmem 500M -s 200M <program>

That will hard-limit the program to 500M of ram and 200M swap, for a total of 700M memory usage. If swap usage is not specified, the below script will default to a limit of 1 GB. The new cgroups (v2) system unfortunately does not offer an integrated total virtual memory use control that would allow the kernel to partition between ram and swap use.

If the program is a Snap package (which starts its own cgroup, thereby leaving the cgroup limitmem puts it in), add the --snap flag.

The script (save as ~/bin/limitmem)

#!/bin/sh

# strict mode: error if commands fail or if unset variables are used
set -eu

limit=''
swaplimit="1G"
snap=0

while true
do
    # parse command line args and find limits

    if [ "$#" -lt 1 -o "$1" = "-h" -o "$1" = "--help" ]
    then
            echo Usage: `basename "$0"` "[--snap] <limit> <command>..."
            echo or: `basename "$0"` "[--snap] <memlimit> -s <swaplimit> <command>..."
            echo
            echo Pass "--snap" if the command is a Snap app, which will place itself into a systemd scope
            exit 1
    fi

    if [ "$1" = "-s" ]
    then
            shift
            swaplimit="$1"
            shift
            continue
    fi

    if [ "$1" = "--snap" ]
    then
        shift
        snap=1
        continue
    fi

    if [ "$1" = -- ]
    then
        shift
            break
    fi

    if [ "$limit" = "" ]
    then
        limit="$1"
        shift
        continue
    fi

    break  # Reached start of command
done

if [ "$snap" = 0 ]
then
    exec systemd-run --user --scope -p MemoryMax="$limit" -p MemorySwapMax="$swaplimit" "$@"
else
    scopename=snap.`basename "$1"`.*
    (
        sleep 2;
        systemctl set-property --user "$scopename" MemoryMax="$limit" MemorySwapMax="$swaplimit"
    )&
    exec "$@"
fi

I use a customized version of the above that records the processes and their cgroups, so another fish shell function I have can show a nice overview of the memory usage. This version depends on fish shell and being uid 1000:

#!/bin/sh

# strict mode: error if commands fail or if unset variables are used
set -eu

limit=''
swaplimit="1G"
snap=0

while true
do
    # parse command line args and find limits

    if [ "$#" -lt 1 -o "$1" = "-h" -o "$1" = "--help" ]
    then
            echo Usage: `basename "$0"` "[--snap] <limit> <command>..."
            echo or: `basename "$0"` "[--snap] <memlimit> -s <swaplimit> <command>..."
            echo
            echo Pass "--snap" if the command is a Snap app, which will place itself into a systemd scope
            exit 1
    fi

    if [ "$1" = "-s" ]
    then
            shift
            swaplimit="$1"
            shift
            continue
    fi

    if [ "$1" = "--snap" ]
    then
        shift
        snap=1
        continue
    fi

    if [ "$1" = -- ]
    then
        shift
            break
    fi

    if [ "$limit" = "" ]
    then
        limit="$1"
        shift
        continue
    fi

    break  # Reached start of command
done

if [ "$snap" = 0 ]
then
    exec systemd-run --user --scope -p MemoryMax="$limit" -p MemorySwapMax="$swaplimit" fish -c "set -U JCLIMITMEM $JCLIMITMEM (basename "$1"):(string split -f3 -m2 : (grep 0:: /proc/self/cgroup)); exec $argv" -- "$@"
else
    scopename=snap.`basename "$1"`.*
    fish -c "set -U JCLIMITMEM $JCLIMITMEM (basename "$1"):"/user.slice/user-1000.slice/user@1000.service/app.slice/$scopename"; echo $JCLIMITMEM"
    (
        sleep 2;
        systemctl set-property --user "$scopename" MemoryMax="$limit" MemorySwapMax="$swaplimit"
    )&
    exec "$@"
fi

With this fish function to show current memory usage:

function memoryusage
    set -l valid_cgroups
    for g in $JCLIMITMEM
        set -l name (string split -f1 : $g)
        set -l cgroup (string split -f2 -m1 : $g)

        if ! [ -d (fish -c "echo /sys/fs/cgroup/$cgroup" 2>/dev/null) ]
            continue
        end
        if ! contains $name:$cgroup $valid_cgroups
            set valid_cgroups $valid_cgroups $name:$cgroup
        end

        set -l c (fish -c "echo /sys/fs/cgroup/$cgroup" 2>/dev/null)

        set -l limit (cat $c/memory.max)
        #set -l max (cat "$c/memory.peak")
        set -l curr (cat "$c/memory.current")
        set -l failcnt (string split -f2   (grep oom  "$c/memory.events"))

        #set -l maxp (math "floor ($max * 100 / $limit)")
        set -l currp (math "floor ($curr * 100 / $limit)")
        set -l GB 1073741824
        set -l limitgb (math "floor ($limit / $GB)")
        set -l limitgbfrac (math "floor (($limit - $limitgb*$GB) * 100 / $GB)")
        #set -l maxgb (math "floor ($max / $GB)")
        #set -l maxgbfrac (math "floor (($max - $maxgb*$GB) * 100 / $GB)")
        set -l currgb (math "floor ($curr / $GB)")
        set -l currgbfrac (math "floor (($curr - $currgb*$GB) * 100 / $GB)")

        echo $name:
        printf "limit:     %12d %9s.%02d GB)n" $limit "($limitgb" $limitgbfrac
        #printf "max usage: %12d %5s, %2d.%02d GB)n" $max "($maxp%" $maxgb $maxgbfrac
        printf "usage:     %12d %5s, %2d.%02d GB)n" $curr "($currp%" $currgb $currgbfrac
        printf "failcount: %12dn" $failcnt
        echo
    end
    set JCLIMITMEM $valid_cgroups
end
Answered By: JanKanis
Categories: Answers Tags: , ,
Answers are sorted by their score. The answer accepted by the question owner as the best is marked with
at the top-right corner.