~lukeshu/systemd - Unnamed repository; edit this file 'description' to name the repository.

Age	Commit message (Collapse)	Author
2015-01-08	journal: bump RLIMIT_NOFILE when journal files to 16K (if possible)	Lennart Poettering
	When there are a lot of split out journal files, we might run out of fds quicker then we want. Hence: bump RLIMIT_NOFILE to 16K if possible. Do these even for journalctl. On Fedora the soft RLIMIT_NOFILE is at 1K, the hard at 4K by default for normal user processes, this code hence bumps this up for users to 4K. https://bugzilla.redhat.com/show_bug.cgi?id=1179980
2015-01-08	util: make it easy to initialize the crtime from the current time in ↵	Lennart Poettering
	fd_setcrtime()
2015-01-08	journald: turn off COW for journal files on btrfs	Lennart Poettering
	btrfs' COW logic results in heavily fragment journal files, which is detrimental for perfomance. Hence, turn off COW for journal files as we create them. Turning off COW comes at the cost of data integrity guarantees, but this should be acceptable, given that we do our own checksumming, and generally have a pretty conservative write pattern. Also see discussion on linux-btrfs: http://www.spinics.net/lists/linux-btrfs/msg41001.html
2015-01-06	journal: consider file deletion errors a reason for rotation	Lennart Poettering

2015-01-06	journald: whenever we rotate a file, btrfs defrag it	Lennart Poettering
	Our write pattern is quite awful for CoW file systems (btrfs...), as we keep updating file parts in the beginning of the file. This results in fragmented journal files. Hence: when rotating files, defragment them, since at that point we know that no further write accesses will be made.
2015-01-06	tree-wide: remove unnecessary LOG_PRI	Zbigniew Jędrzejewski-Szmek
	LOG_DEBUG is already a log level, there is no need to use LOG_PRI which is for filtering out the facility.
2015-01-06	journald: allow restarting journald without losing stream connections	Lennart Poettering
	Making use of the fd storage capability of the previous commit, allow restarting journald by serilizing stream state to /run, and pushing open fds to PID 1.
2015-01-05	journald: reuse IOVEC_TOTAL_SIZE() macros where possible	Lennart Poettering

2015-01-05	journald: when we detect the journal file we are about to write to has been ↵	Lennart Poettering
	deleted, rotate https://bugzilla.redhat.com/show_bug.cgi?id=1171719
2015-01-05	journald: add some additional checks before we divide by values read from ↵	Lennart Poettering
	journal file headers Since the file headers might be replaced by zeroed pages now due to sigbus we should make sure we don't end up dividing by zero because we don't check values read from journal file headers for changes.
2015-01-05	journalctl: static variables immediately configured via command line ↵	Lennart Poettering
	arguments should be prefixed with "arg_"
2015-01-05	journal: install sigbus handler for journal tools too	Lennart Poettering
	This makes them robust regarding truncation. Ideally, we'd export this as an API, but given how messy SIGBUS handling is, and the uncertain ownership logic of signal handlers we should not do this (unless libc one day invents a scheme how to sanely install SIGBUS handlers for specific memory areas only). However, for now we can still make all our own tools robust. Note that external tools will only have read-access to the journal anyway, where SIGBUS is much more unlikely, given that only writes are subject to disk full problems.
2015-01-05	journald: constify all things	Lennart Poettering

2015-01-05	journald: prefix exported calls with "server_", unexport unnecessary calls	Lennart Poettering

2015-01-05	journald: process SIGBUS for the memory maps we set up	Lennart Poettering
	Even though we use fallocate() it appears that file systems like btrfs will trigger SIGBUS on certain low-disk-space situation. We should handle that, hence catch the signal, add it to a list of invalidated pages, and replace the page with an empty memory area. After each write check if SIGBUS was triggered, and consider the write invalid if it was. This should make journald a lot more robust with file systems where fallocate() is not reliable, for example all CoW file systems (btrfs...), where changing written data can fail with disk full errors. https://bugzilla.redhat.com/show_bug.cgi?id=1045810
2014-12-30	tree-wide: spelling fixes	Veres Lajos
	https://github.com/vlajos/misspell_fixer https://github.com/torstehu/systemd/commit/b6fdeb618cf2f3ce1645b3315f15f482710c7ffa Thanks to Torstein Husebo <torstein@huseboe.net>.
2014-12-26	journald: always allocate space for object fields	Zbigniew Jędrzejewski-Szmek
	If OBJECT_PID= came as the last field, we would not reallocate the iovec to bigger size, and fail the assertion later on in dispatch_message_real().
2014-12-26	journald: fix off by one in native transport	Zbigniew Jędrzejewski-Szmek
	https://bugzilla.redhat.com/show_bug.cgi?id=1177184
2014-12-24	util: make creation time xattr logic more generic	Lennart Poettering

2014-12-24	util: fix strict aliasing violations in use of struct inotify_event v5	Shawn Paul Landden
	There is alot of cleanup that will have to happen to turn on -fstrict-aliasing, but I think our code should be "correct" to the rule.
2014-12-19	journal: skipping of exhausted journal files is bad if direction changed	Michal Schmidt
	EOF is meaningless if the direction of iteration changes. Move the EOF optimization under the direction check. This fixes test-journal-interleaving for me. Thanks to Filipe Brandenburger for telling me about the failure.
2014-12-19	journal: make next_with_matches() always use f->current_offset	Michal Schmidt
	next_with_matches() is odd in that its "unit64_t offset" parameter is both input and output. In other it's purely for output. The function is called from two places in next_beyond_location(). In both of them "&cp" is used as the argument and in both cases cp is guaranteed to equal f->current_offset. Let's just have next_with_matches() ignore "offset" on input and operate with f->current_offset. I did not investigate why it is, but it makes my usual benchmark run reproducibly faster: $ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null real 0m4.032s user 0m3.896s sys 0m0.135s (Compare to preceding commit, where real was 4.4s.)
2014-12-19	journal: fix skipping of duplicate entries in iteration	Michal Schmidt
	I accidentally broke the detection of duplicate entries in 7943f42275 "journal: optimize iteration by returning previously found candidate entry". When we have a known location of a candidate entry, we must not return from next_beyond_location() immediately. We must go through the duplicates detection to make sure the candidate differs from the already iterated entry. This fix slows down iteration a bit, but it's still faster than it was before the rework. $ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null real 0m4.448s user 0m4.298s sys 0m0.149s (Compare with results from commit 7943f42275, where real was 5.3s before the rework.)
2014-12-18	journal: next_with_matches() now does not need a mapped object as input	Michal Schmidt
	Now that journal_file_next_entry() does not need a pointer to the current object, next_with_matches() does not need it either.
2014-12-18	journal: journal_file_next_entry() does not need pointer to current Object	Michal Schmidt
	The current offset is sufficient information.
2014-12-18	journal: optimize iteration by returning previously found candidate entry	Michal Schmidt
	In next_beyond_location() when the JournalFile's location type is LOCATION_SEEK, it means there's nothing to do, because we already have the location of the candidate entry. Do an early return. Note that now next_beyond_location() does not anymore guarantee on return that the entry is mapped, but previous patches made sure the caller does not care. This optimization is at least as good as "journal: optimize iteration: skip files that cannot improve current candidate entry" was. Timing results on my workstation, using: $ time ./journalctl -q --since=2014-06-01 --until=2014-07-01 > /dev/null Before "Revert "journal: optimize iteration: skip files that cannot improve current candidate entry": real 0m5.349s user 0m5.166s sys 0m0.181s Now: real 0m3.901s user 0m3.724s sys 0m0.176s
2014-12-18	journal: optimize iteration by skipping exhausted files	Michal Schmidt
	If from a previous iteration we know we are at the end of a journal file, don't bother looking into the file again. This is complicated by the fact that the EOF does not have to be permanent (think of "journalctl -f"). So we also check if the number of entries in the journal file changed. This optimization has a similar effect as "journal: optimize iteration: skip whole files behind current location" had.
2014-12-18	journal: drop unnecessary parameters of next_beyond_location()	Michal Schmidt
	offset is redundant, because the caller can rely on f->current_offset. The object pointer the function saves in *ret is thrown away by the caller.
2014-12-18	journal: remove redundant variable new_offset	Michal Schmidt
	The file's current_offset is already updated at this point, so let's use it.
2014-12-18	journal: compare candidate entries using JournalFiles' locations	Michal Schmidt
	When comparing the locations of candidate entries, we can rely on the location information stored in struct JournalFile.
2014-12-18	journal: simplify set_location()	Michal Schmidt
	set_location() is called from real_journal_next() when a winning entry has been picked from among the candidates in journal files. The location type is always set to LOCATION_DISCRETE. No need to pass it as a parameter. The per-JournalFile location information is already updated at this point. No need for having the direction and offset here.
2014-12-18	journal: keep per-JournalFile location info during iteration	Michal Schmidt
	In next_beyond_location() when we find a candidate entry in a journal file, save its location information in struct JournalFile. The purpose of remembering the locations of candidate entries is to be able to save work in the next iteration. This patch does only the remembering part. LOCATION_SEEK means the location identifies a candidate entry. When a winner is picked from among candidates, it becomes LOCATION_DISCRETE. LOCATION_TAIL here signifies we've iterated the file to the end (or the beginning in the case of reversed direction).
2014-12-18	journal: abstract the resetting of JournalFile's location	Michal Schmidt

2014-12-18	journal: move definition of LocationType to journal-file.h	Michal Schmidt
	In preparation for individual JournalFiles maintaining a location of their own.
2014-12-18	Revert "journal: optimize iteration: skip whole files behind current location"	Michal Schmidt
	This reverts commit b7c88ab8cc7d55a43450bf3dea750f95f2e910d6. This optimization will be made redundant by the following patches.
2014-12-18	Revert "journal: optimize iteration: skip files that cannot improve current ↵	Michal Schmidt
	candidate entry" This reverts commit f8b5a3b75fb55f0acb85c21424b3893c822742e9. This optimization will be made redundant by the following patches.
2014-12-18	journal: delete unused function journal_file_skip_entry()	Michal Schmidt
	Its only caller is a test.
2014-12-18	journal: delete unused function journal_file_move_to_entry_by_offset()	Michal Schmidt

2014-12-13	journal: replace contexts hashmap with a plain array	Michal Schmidt
	try_context() is such a hot path that the hashmap lookup is expensive. The number of contexts is small - it is the number of object types. Using a hashmap is overkill. A plain array will do. Before: $ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null real 0m9.445s user 0m9.228s sys 0m0.213s After: $ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null real 0m5.438s user 0m5.266s sys 0m0.170s
2014-12-13	journal: delete unused function mmap_cache_close_context	Michal Schmidt
	This never had any callers. Contexts are freed when the MMapCache is freed.
2014-12-13	journal: push type_to_context conversion down to journal_file_move_to()	Michal Schmidt

2014-12-13	journal: have a named enum ObjectType	Michal Schmidt

2014-12-13	journal: consistently use OBJECT_<type> names instead of numbers	Michal Schmidt
	Note that numbers 0 and -1 are both replaced with OBJECT_UNUSED, because they are treated the same everywhere (e.g. type_to_context() translates them both to 0).
2014-12-13	journal: consistently allow type==0 to mean "any type"	Michal Schmidt
	If type==0 and a non-NULL object were given as arguments to journal_file_hmac_put_object(), its object type check would fail and it would return -EBADMSG. All existing callers use either a positive type or -1. Still, for behavior consistency with journal_file_move_to_object() let's allow type 0 to pass.
2014-12-13	journal: move type_to_context() to journal-file.c	Michal Schmidt
	It has no other callers. It does not need to be in the header file.
2014-12-13	journal: remove journal_file_object_keep/release functions	Michal Schmidt
	The only user is sd_journal_enumerate_unique() and, as explained in the previous commit (fed67c38e3 "journal: map objects to context set by caller, not by actual object type"), the use of them there is now superfluous. Let's remove them. This reverts major parts of commits: ae97089d49 journal: fix access to munmapped memory in sd_journal_enumerate_unique 06cc69d44c sd-journal: fix sd_journal_enumerate_unique skipping values Tested with an "--enable-debug" build and "journalctl --list-boots". It gives the expected number of results. Additionally, if I then revert the previous commit ("journal: map objects to context set by caller, not to actual object type"), it crashes with SIGSEGV, as expected.
2014-12-13	journal: map objects to context set by caller, not by actual object type	Michal Schmidt
	When the caller of journal_file_move_to_object() specifies type==0, the object header is at first mapped in context 0. Then after the header is checked, the whole object is mapped in a context determined by the actual object type (which is not even range-checked using type_to_context()). This looks wrong. It should map in the caller-specified context. An old comment in sd_journal_enumerate_unique() supports this view: /* We do not use the type context here, but 0 instead, * so that we can look at this data object at the same * time as one on another file */ Clearly the expectation was that the data object will remain mapped in context 0 without being pushed away by mapping other objects in context OBJECT_DATA. I suspect that this was the real bug that got fixed by ae97089d49 "journal: fix access to munmapped memory in sd_journal_enumerate_unique". In other words, journal_file_object_keep/release are superfluous after applying this patch.
2014-12-13	journal: add debug mode for mmap-cache (--enable-debug=mmap-cache)	Michal Schmidt
	This is useful for exposing unsafe access to mmapped objects after the context that they were mapped in was already moved. For example: journal_file_move_to_object(f1, OBJECT_DATA, p1, &o1); journal_file_move_to_object(f2, OBJECT_DATA, p2, &o2); t = o1->object.type; /* this usually works, but is unsafe */
2014-12-12	copy: use btrfs reflinking only whe we know we copy full files	Lennart Poettering

2014-12-12	util: when using basename() for creating temporary files, verify the ↵	Lennart Poettering
	resulting name is actually valid Also, rename filename_is_safe() to filename_is_valid(), since it actually does a full validation for what the kernel will accept as file name, it's not just a heuristic.