linux.git - Linus' kernel tree

Age	Commit message (Collapse)	Author
2015-07-11	nsfs: Add a show_path method to fix mountinfo	Eric W. Biederman
	Today mountinfo displays a very unhelpful "/" for nsfs files. Add a show_path method returning the same string as ns_dname. This results in a bind mount of /proc/<pid>/ns/net showing up in /proc/<pid>/mountinfo as "net:[1234...]" instead of "/". Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
2015-07-11	pNFS: Don't throw out valid layout segments	Trond Myklebust
	It is OK for layout segments to remain hashed even if no-one holds any references to them, provided that the segments are still valid. Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-11	pNFS: pnfs_roc_drain() fix a race with open	Trond Myklebust
	If a process reopens the file before we can send off the CLOSE/DELEGRETURN, then pnfs_roc_drain() may end up waiting for a new set of layout segments that are marked as return-on-close, but haven't yet been returned. Fix this by only waiting for those layout segments that were invalidated in pnfs_roc(). Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-11	pNFS: Fix races between return-on-close and layoutreturn.	Trond Myklebust
	If one or more of the layout segments reports an error during I/O, then we may have to send a layoutreturn to report the error back to the NFS metadata server. This patch ensures that the return-on-close code can detect the outstanding layoutreturn, and not preempt it. Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-11	pNFS: pnfs_roc_drain should return 'true' when sleeping	Trond Myklebust
	Also clean up the case where we don't find a return-on-close layout segment. Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-11	pNFS: Layoutreturn must invalidate all existing layout segments.	Trond Myklebust
	Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-10	mnt: fs_fully_visible enforce noexec and nosuid if !SB_I_NOEXEC	Eric W. Biederman
	The filesystems proc and sysfs do not have executable files do not have exectuable files today and portions of userspace break if we do enforce nosuid and noexec consistency of nosuid and noexec flags between previous mounts and new mounts of proc and sysfs. Add the code to enforce consistency of the nosuid and noexec flags, and use the presence of SB_I_NOEXEC to signal that there is no need to bother. This results in a completely userspace invisible change that makes it clear fs_fully_visible can only skip the enforcement of noexec and nosuid because it is known the filesystems in question do not support executables. Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
2015-07-10	vfs: Commit to never having exectuables on proc and sysfs.	Eric W. Biederman
	Today proc and sysfs do not contain any executable files. Several applications today mount proc or sysfs without noexec and nosuid and then depend on there being no exectuables files on proc or sysfs. Having any executable files show on proc or sysfs would cause a user space visible regression, and most likely security problems. Therefore commit to never allowing executables on proc and sysfs by adding a new flag to mark them as filesystems without executables and enforce that flag. Test the flag where MNT_NOEXEC is tested today, so that the only user visible effect will be that exectuables will be treated as if the execute bit is cleared. The filesystems proc and sysfs do not currently incoporate any executable files so this does not result in any user visible effects. This makes it unnecessary to vet changes to proc and sysfs tightly for adding exectuable files or changes to chattr that would modify existing files, as no matter what the individual file say they will not be treated as exectuable files by the vfs. Not having to vet changes to closely is important as without this we are only one proc_create call (or another goof up in the implementation of notify_change) from having problematic executables on proc. Those mistakes are all too easy to make and would create a situation where there are security issues or the assumptions of some program having to be broken (and cause userspace regressions). Signed-off-by: "Eric W. Biederman" <ebiederm@xmission.com>
2015-07-09	hpfs: hpfs_error: Remove static buffer, use vsprintf extension %pV instead	Joe Perches
	Removing unnecessary static buffers is good. Use the vsprintf %pV extension instead. Signed-off-by: Joe Perches <joe@perches.com> Signed-off-by: Mikulas Patocka <mikulas@twibright.com> Cc: stable@vger.kernel.org # v2.6.36+ Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-07-09	hpfs: kstrdup() out of memory handling	Sanidhya Kashyap
	There is a possibility of nothing being allocated to the new_opts in case of memory pressure, therefore return ENOMEM for such case. Signed-off-by: Sanidhya Kashyap <sanidhya.gatech@gmail.com> Signed-off-by: Mikulas Patocka <mikulas@twibright.com> Cc: stable@vger.kernel.org Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-07-09	hpfs: Remove unessary cast	Firo Yang
	Avoid a pointless kmem_cache_alloc() return value cast in fs/hpfs/super.c::hpfs_alloc_inode() Signed-off-by: Firo Yang <firogm@gmail.com> Signed-off-by: Mikulas Patocka <mikulas@twibright.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-07-09	hpfs: add fstrim support	Mikulas Patocka
	This patch adds support for fstrim to the HPFS filesystem. Signed-off-by: Mikulas Patocka <mikulas@twibright.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-07-09	ioctl_compat: handle FITRIM	Mikulas Patocka
	The FITRIM ioctl has the same arguments on 32-bit and 64-bit architectures, so we can add it to the list of compatible ioctls and drop it from compat_ioctl method of various filesystems. Signed-off-by: Mikulas Patocka <mpatocka@redhat.com> Cc: Al Viro <viro@zeniv.linux.org.uk> Cc: Ted Ts'o <tytso@google.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
2015-07-09	udf: Don't corrupt unalloc spacetable when writing it	Steven J. Magnani
	For a UDF filesystem configured with an Unallocated Space Table, a filesystem operation that triggers an update to the table results in on-disk corruption that prevents remounting: udf_read_tagged: tag version 0x0000 != 0x0002 \|\| 0x0003, block 274 For example: 1. Create a filesystem $ mkudffs --media-type=hd --blocksize=512 --lvid=BUGTEST \ --vid=BUGTEST --fsid=BUGTEST --space=unalloctable \ /dev/mmcblk0 2. Mount it # mount /dev/mmcblk0 /mnt 3. Create a file $ echo "No corruption, please" > /mnt/new.file 4. Umount # umount /mnt 5. Attempt remount # mount /dev/mmcblk0 /mnt This appears to be a longstanding bug caused by zero-initialization of the Unallocated Space Entry block buffer and only partial repopulation of required fields before writing to disk. Commit 0adfb339fd64 ("udf: Fix unalloc space handling in udf_update_inode") addressed one such field, but several others are required. Signed-off-by: Steven J. Magnani <steve@digidescorp.com> Signed-off-by: Jan Kara <jack@suse.com>
2015-07-08	NFSv4.2/flexfiles: Fix a typo in the flexfiles layoutstats code	Trond Myklebust
	Signed-off-by: Trond Myklebust <trond.myklebust@primarydata.com>
2015-07-06	ufs_inode_get{frag,block}(): get rid of 'phys' argument	Al Viro
	Just pass NULL as locked_page in case of first block in the indirect chain. Old calling conventions aside, a reason for having 'phys' was that ufs_inode_getfrag() used to be able to do _two_ allocations - indirect block and extending/reallocating a tail. We needed locked_page for the latter (it's a data), but we also needed to figure out that indirect block is metadata. So we used to pass non-NULL locked_page in all cases and used NULL phys as indication of being asked to allocate an indirect. With tail unpacking taken into a separate function we don't need those convolutions anymore. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_getfrag_block(): tidy up a bit	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_getblock(): failure to read an indirect block is -EIO	Al Viro
	... and not "write to beginning of the disk", TYVM... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_getfrag_block(): turn following indirects into a loop	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_getfrag(): pass index instead of 'fragment'	Al Viro
	same story as with ufs_inode_getblock() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_getfrag(): split extending the partial blocks off	Al Viro
	ufs_extend_tail() is handling that now. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_getblock(): pass indirect block number and full index	Al Viro
	... instead of messing with buffer_head. We can bloody well do sb_bread() in there. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_getblock(): pass index instead of 'fragment'	Al Viro
	The value passed to ufs_inode_getblock() as the 3rd argument had lower bits ignored; the upper bits were shifted down and used and they actually make sense - those are _lower_ bits of index in indirect block (i.e. they form the index within a fragment within an indirect block). Pass those as argument. Upper bits of index (i.e. the number of fragment within indirect block) will join them shortly. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_get{frag,block}(): leave sb_getblk() to caller	Al Viro
	just return the damn block number Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_getfrag_block(): get rid of macro jungles	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_get{frag,block}(): consolidate success exits	Al Viro
	These calling conventions are rudiments of pre-2.3 times; they really need to be sanitized. This is the first step; next will be _always_ returning a block number, instead of this "return a pointer to buffer_head, except when we get to the actual data" crap. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs: use the branch depth in ufs_getfrag_block()	Al Viro
	we'd already calculated it... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs: move calculation of offsets into ufs_getfrag_block()	Al Viro
	... and massage ufs_frag_map() to take those instead of fragment number. As it is, we duplicate the damn thing on the write side, open-coded and bloody hard to follow. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_inode_get{frag,block}(): get rid of retries	Al Viro
	We are holding ->truncate_mutex, so nobody else can alter our block pointers. Rechecks/retries were needed back when we only held BKL there, and had to cope with write_begin/writepage and writepage/truncate races. Can't happen anymore... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	__ufs_truncate_blocks(): avoid excessive dirtying of indirect blocks	Al Viro
	There's a case when an indirect block gets dirtied for no good reason - when there's a hole starting in the middle of area covered by it and spanning past its end, and truncate() is done precisely to the beginning of the hole. The block is obviously not modified at all - all removals happen beyond it. However, existing code ends up dirtying it just in case. It's trivial to fix and while it's not a real bug by any stretch of imagination, it makes the damn thing harder to follow. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	free_full_branch(): don't bother modifying the block we are going to free	Al Viro
	Note that it's already made unreachable from the inode, so we don't have to worry about ufs_frag_map() walking into something already freed. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	move marking inode dirty to the end of __ufs_truncate_blocks()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	free_full_branch(): saner calling conventions	Al Viro
	Have caller fetch the block number and remove it from wherever it was. Pass the block number instead. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_branch(): kill recursion	Al Viro
	turn recursion into a pair of loops Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_branch(): massage towards killing recursion	Al Viro
	We always have 0 < depth2 <= depth in there, so if (--depth) { if (--depth2) A B } else { C // not using depth2 } D // not using depth2 is equivalent to if (--depth2) A with s/depth/depth - 1/ if (--depth) B else C D Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	split ufs_truncate_branch() into full- and partial-branch variants	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs: unify the logics for collecting adjacent data blocks to free	Al Viro
	open-coded in several places... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_branch(): separate the calls with non-NULL offsets	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_branch(): never call with offsets != NULL && depth2 == 0	Al Viro
	For calls in __ufs_truncate_blocks() it's just a matter of not incrementing offsets[0] and not making that call - immediately following loop will be executed one extra time and we'll be just fine. For recursive call in ufs_trunc_branch() itself, just assing NULL to offsets if we would be about to make such call. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	__ufs_trunc_blocks(): turn the part after switch into a loop	Al Viro
	... and turn the switch into if (), since all cases with depth != 1 have just become identical. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	__ufs_truncate_blocks(): unify freeing the full branches	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	unify ufs_trunc_..indirect()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_..indirect(): more massage towards unifying	Al Viro
	Instead of manually checking that the array contains only zeroes, find the position of the last non-zero (in __ufs_truncate(), where we can conveniently do that) and use that to tell if there's any non-zero in the array tail passed to ufs_trunc_...indirect(). The goal of all that clumsiness is to get fold these functions together. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_...indirect(): pass the array of indices instead of offsets	Al Viro
	rather than bitslicing the offset just formed as sum of shifted indices, pass the array of those indices itself. NULL is used as equivalent of "all zeroes" (== free the entire branch). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	__ufs_truncate(); find cutoff distances into branches by offsets[] array	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_dindirect(): pass the number of blocks to keep	Al Viro
	same as the previous two. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_indirect(): pass the index of the first pointer to free	Al Viro
	... instead of file offset. Same cleanups as in the tindirect conversion in previous commit. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs_trunc_tindirect(): pass the number of blocks to keep	Al Viro
	IOW, the distance of cutoff from the begining of the branch (in blocks). That (and the fact that block just prior to cutoff is guaranteed to be present) allows to tell whether to free triple indirect block just by looking at the offset. While we are at it, using u64 for index in the block is wrong - those should be unsigned int. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs: beginning of __ufs_truncate_block() massage	Al Viro
	Use ufs_block_to_path() to find the cutoff path in the block pointers' tree. For now just use the information about the depth (to bypass the fully preserved subtrees); subsequent commits will use the information about actual path. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2015-07-06	ufs: the offsets ufs_block_to_path() puts into array are not sector_t	Al Viro
	type makes no sense - those are indices in block number arrays, not block numbers. And no, UFS is not likely to grow indirect blocks with 4Gpointers in them... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>