~lukeshu/systemd - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2016-09-25	core: Use @raw-io syscall group to filter I/O syscalls when PrivateDevices= ↵	Djalal Harouni
	is set Instead of having a local syscall list, use the @raw-io group which contains the same set of syscalls to filter.
2016-09-25	core:namespace: simplify ProtectHome= implementation	Djalal Harouni
	As with previous patch simplify ProtectHome and don't care about duplicates, they will be sorted by most restrictive mode and cleaned.
2016-09-25	core: simplify ProtectSystem= implementation	Djalal Harouni
	ProtectSystem= with all its different modes and other options like PrivateDevices= + ProtectKernelTunables= + ProtectHome= are orthogonal, however currently it's a bit hard to parse that from the implementation view. Simplify it by giving each mode its own table with all paths and references to other Protect options. With this change some entries are duplicated, but we do not care since duplicate mounts are first sorted by the most restrictive mode then cleaned.
2016-09-25	core:sandbox: add more /proc/* entries to ProtectKernelTunables=	Djalal Harouni
	Make ALSA entries, latency interface, mtrr, apm/acpi, suspend interface, filesystems configuration and IRQ tuning readonly. Most of these interfaces now days should be in /sys but they are still available through /proc, so just protect them. This patch does not touch /proc/net/...
2016-09-25	core:namespace: simplify mount calculation	Djalal Harouni
	Move out mount calculation on its own function. Actually the logic is smart enough to later drop nop and duplicates mounts, this change improves code readability. --- src/core/namespace.c \| 47 ++++++++++++++++++++++++++++++++++++----------- 1 file changed, 36 insertions(+), 11 deletions(-)
2016-09-25	core:namespace: put paths protected by ProtectKernelTunables= in	Djalal Harouni
	Instead of having all these paths everywhere, put the ones that are protected by ProtectKernelTunables= into their own table. This way it is easy to add paths and track which ones are protected.
2016-09-25	core:namespace: minor improvements to append_mounts()	Djalal Harouni

2016-09-25	execute: move SMACK setup code into its own function	Lennart Poettering
	While we are at it, move PAM code #ifdeffery into setup_pam() to simplify the main execution logic a bit.
2016-09-25	namespace: drop all mounts outside of the new root directory	Lennart Poettering
	There's no point in mounting these, if they are outside of the root directory we'll move to.
2016-09-25	main: minor simplification	Lennart Poettering

2016-09-25	execute: filter low-level I/O syscalls if PrivateDevices= is set	Lennart Poettering
	If device access is restricted via PrivateDevices=, let's also block the various low-level I/O syscalls at the same time, so that we know that the minimal set of devices in our virtualized /dev are really everything the unit can access.
2016-09-25	namespace: don't make the root directory of a namespace a mount if it ↵	Lennart Poettering
	already is one Let's not stack mounts needlessly.
2016-09-25	namespace: chase symlinks for mounts to set up in userspace	Lennart Poettering
	This adds logic to chase symlinks for all mount points that shall be created in a namespace environment in userspace, instead of leaving this to the kernel. This has the advantage that we can correctly handle absolute symlinks that shall be taken relative to a specific root directory. Moreover, we can properly handle mounts created on symlinked files or directories as we can merge their mounts as necessary. (This also drops the "done" flag in the namespace logic, which was never actually working, but was supposed to permit a partial rollback of the namespace logic, which however is only mildly useful as it wasn't clear in which case it would or would not be able to roll back.) Fixes: #3867
2016-09-25	namespace: invoke unshare() only after checking all parameters	Lennart Poettering
	Let's create the new namespace only after we validated and processed all parameters, right before we start with actually mounting things. This way, the window where we can roll back is larger (not that it matters IRL...)
2016-09-25	execute: drop group priviliges only after setting up namespace	Lennart Poettering
	If PrivateDevices=yes is set, the namespace code creates device nodes in /dev that should be owned by the host's root, hence let's make sure we set up the namespace before dropping group privileges.
2016-09-25	core: imply ProtectHome=read-only and ProtectSystem=strict if DynamicUser=1	Lennart Poettering
	Let's make sure that services that use DynamicUser=1 cannot leave files in the file system should the system accidentally have a world-writable directory somewhere. This effectively ensures that directories need to be whitelisted rather than blacklisted for access when DynamicUser=1 is set.
2016-09-25	core: introduce ProtectSystem=strict	Lennart Poettering
	Let's tighten our sandbox a bit more: with this change ProtectSystem= gains a new setting "strict". If set, the entire directory tree of the system is mounted read-only, but the API file systems /proc, /dev, /sys are excluded (they may be managed with PrivateDevices= and ProtectKernelTunables=). Also, /home and /root are excluded as those are left for ProtectHome= to manage. In this mode, all "real" file systems (i.e. non-API file systems) are mounted read-only, and specific directories may only be excluded via ReadWriteDirectories=, thus implementing an effective whitelist instead of blacklist of writable directories. While we are at, also add /efi to the list of paths always affected by ProtectSystem=. This is a follow-up for b52a109ad38cd37b660ccd5394ff5c171a5e5355 which added /efi as alternative for /boot. Our namespacing logic should respect that too.
2016-09-25	namespace: add some debug logging when enforcing InaccessiblePaths=	Lennart Poettering

2016-09-25	namespace: rework how ReadWritePaths= is applied	Lennart Poettering
	Previously, if ReadWritePaths= was nested inside a ReadOnlyPaths= specification, then we'd first recursively apply the ReadOnlyPaths= paths, and make everything below read-only, only in order to then flip the read-only bit again for the subdirs listed in ReadWritePaths= below it. This is not only ugly (as for the dirs in question we first turn on the RO bit, only to turn it off again immediately after), but also problematic in containers, where a container manager might have marked a set of dirs read-only and this code will undo this is ReadWritePaths= is set for any. With this patch behaviour in this regard is altered: ReadOnlyPaths= will not be applied to the children listed in ReadWritePaths= in the first place, so that we do not need to turn off the RO bit for those after all. This means that ReadWritePaths=/ReadOnlyPaths= may only be used to turn on the RO bit, but never to turn it off again. Or to say this differently: if some dirs are marked read-only via some external tool, then ReadWritePaths= will not undo it. This is not only the safer option, but also more in-line with what the man page currently claims: "Entries (files or directories) listed in ReadWritePaths= are accessible from within the namespace with the same access rights as from outside." To implement this change bind_remount_recursive() gained a new "blacklist" string list parameter, which when passed may contain subdirs that shall be excluded from the read-only mounting. A number of functions are updated to add more debug logging to make this more digestable.
2016-09-25	namespace: when enforcing fs namespace restrictions suppress redundant mounts	Lennart Poettering
	If /foo is marked to be read-only, and /foo/bar too, then the latter may be suppressed as it has no effect.
2016-09-25	namespace: simplify mount_path_compare() a bit	Lennart Poettering

2016-09-25	execute: if RuntimeDirectory= is set, it should be writable	Lennart Poettering
	Implicitly make all dirs set with RuntimeDirectory= writable, as the concept otherwise makes no sense.
2016-09-25	execute: move suppression of HOME=/ and SHELL=/bin/nologin into user-util.c	Lennart Poettering
	This adds a new call get_user_creds_clean(), which is just like get_user_creds() but returns NULL in the home/shell parameters if they contain no useful information. This code previously lived in execute.c, but by generalizing this we can reuse it in run.c.
2016-09-25	execute: split out creation of runtime dirs into its own functions	Lennart Poettering

2016-09-25	namespace: make sure InaccessibleDirectories= masks all mounts further down	Lennart Poettering
	If a dir is marked to be inaccessible then everything below it should be masked by it.
2016-09-25	core: add two new service settings ProtectKernelTunables= and ↵	Lennart Poettering
	ProtectControlGroups= If enabled, these will block write access to /sys, /proc/sys and /proc/sys/fs/cgroup.
2016-09-25	core: enforce seccomp for secondary archs too, for all rules	Lennart Poettering
	Let's make sure that all our rules apply to all archs the local kernel supports.
2016-09-16	tree-wide: rename config_parse_many to …_nulstr	Zbigniew Jędrzejewski-Szmek
	In preparation for adding a version which takes a strv.
2016-09-10	Merge pull request #4119 from keszybz/drop-more-kdbus	Evgeny Vereshchagin
	Drop more kdbus functionality
2016-09-10	service: fixup ExecStop for socket-activated shutdown (#4120)	Kyle Russell
	Previous fix didn't consider handling multiple ExecStop commands.
2016-09-09	unit: sent change signal before removing the unit if necessary (#4106)	Michael Olbrich
	If the unit is in the dbus queue when it is removed then the last change signal is never sent. Fix this by checking the dbus queue and explicitly send the change signal before sending the remove signal.
2016-09-09	pid1: drop kdbus_fd and all associated logic	Zbigniew Jędrzejewski-Szmek

2016-09-09	service: Continue shutdown on socket activated unit on termination (#4108)	Kyle Russell
	ENOTCONN may be a legitimate return code if the endpoint disappeared, but the service should still attempt to shutdown cleanly.
2016-09-06	seccomp: also detect if seccomp filtering is enabled	Felipe Sateler
	In https://github.com/systemd/systemd/pull/4004 , a runtime detection method for seccomp was added. However, it does not detect the case where CONFIG_SECCOMP=y but CONFIG_SECCOMP_FILTER=n. This is possible if the architecture does not support filtering yet. Add a check for that case too. While at it, change get_proc_field usage to use PR_GET_SECCOMP prctl, as that should save a few system calls and (unnecessary) allocations. Previously, reading of /proc/self/stat was done as recommended by prctl(2) as safer. However, given that we need to do the prctl call anyway, lets skip opening, reading and parsing the file. Code for checking inspired by https://outflux.net/teach-seccomp/autodetect.html
2016-08-31	core: introduce MemorySwapMax= (#3659)	Lennart Poettering
	Similar to MemoryMax=, MemorySwapMax= limits swap usage. This controls controls "memory.swap.max" attribute in unified cgroup.
2016-08-31	load-fragment: Resolve specifiers in OnCalendar and On*Sec (#4045)	Lennart Poettering
	Resolves #3534
2016-08-30	core: introduce MemorySwapMax=	WaLyong Cho
	Similar to MemoryMax=, MemorySwapMax= limits swap usage. This controls controls "memory.swap.max" attribute in unified cgroup.
2016-08-27	mount: add SloppyOptions= to mount_dump()	Barron Rulon

2016-08-27	mount: add new ForceUnmount= setting for mount units, mapping to umount(8)'s ↵	Barron Rulon
	"-f" switch
2016-08-26	load-fragment: Resolve specifiers in OnCalendar and On*Sec	Douglas Christman
	Resolves #3534
2016-08-26	mount: add new LazyUnmount= setting for mount units, mapping to umount(8)'s ↵	brulon
	"-l" switch (#3827)
2016-08-26	Merge pull request #3984 from poettering/refcnt	Evgeny Vereshchagin
	permit bus clients to pin units to avoid automatic GC
2016-08-23	core,network: Use const qualifiers for block-local variables in macro ↵	Felipe Sateler
	functions (#4019) Prevents discard-qualifiers warnings when the passed variable was const
2016-08-22	core: do not fail at step SECCOMP if there is no kernel support (#4004)	Felipe Sateler
	Fixes #3882
2016-08-22	core: let's use set_contains() where appropriate	Lennart Poettering

2016-08-22	core: cache last CPU usage counter, before destorying a cgroup	Lennart Poettering
	It is useful for clients to be able to read the last CPU usage counter value of a unit even if the unit is already terminated. Hence, before destroying a cgroup's cgroup cache the last CPU usage counter and return it if the cgroup is gone.
2016-08-22	core: add Ref()/Unref() bus calls for units	Lennart Poettering
	This adds two (privileged) bus calls Ref() and Unref() to the Unit interface. The two calls may be used by clients to pin a unit into memory, so that various runtime properties aren't flushed out by the automatic GC. This is necessary to permit clients to race-freely acquire runtime results (such as process exit status/code or accumulated CPU time) on successful service termination. Ref() and Unref() are fully recursive, hence act like the usual reference counting concept in C. Taking a reference is a privileged operation, as this allows pinning units into memory which consumes resources. Transient units may also gain a reference at the time of creation, via the new AddRef property (that is only defined for transient units at the time of creation).
2016-08-19	Merge pull request #3965 from htejun/systemd-controller-on-unified	Zbigniew Jędrzejewski-Szmek

2016-08-19	Merge pull request #3987 from keszybz/console-color-setup	Lennart Poettering
	Rework console color setup
2016-08-19	Merge pull request #3988 from keszybz/journald-dynamic-users	Lennart Poettering
	Journald dynamic users