linux.git - Linus' kernel tree

Age	Commit message (Collapse)	Author
2013-04-09	procfs: kill ->write_proc()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	new helper: single_open_size()	Al Viro
	Same as single_open(), but preallocates the buffer of given size. Doesn't make any sense for sizes up to PAGE_SIZE and doesn't make sense if output of show() exceeds PAGE_SIZE only rarely - seq_read() will take care of growing the buffer and redoing show(). If you _know_ that it will be large, it might make more sense to look into saner iterator, rather than go with single-shot one. If that's impossible, single_open_size() might be for you. Again, don't use that without a good reason; occasionally that's really the best way to go, but very often there are better solutions. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	procfs: don't allow to use proc_create, create_proc_entry, etc. for directories	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	reiserfs: use proc_remove_subtree()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	procfs: switch /proc/self away from proc_dir_entry	Al Viro
	Just have it pinned in dcache all along and let procfs ->kill_sb() drop it before kill_anon_super(). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	mode_t, whack-a-mole at 11...	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	get rid of the last free_pipe_info() callers	Al Viro
	and rename __free_pipe_info() to free_pipe_info() Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	get rid of alloc_pipe_info() argument	Al Viro
	not used anymore Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	get rid of pipe->inode	Al Viro
	it's used only as a flag to distinguish normal pipes/FIFOs from the internal per-task one used by file-to-file splice. And pipe->files would work just as well for that purpose... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	introduce variants of pipe_lock/pipe_unlock for real pipes/FIFOs	Al Viro
	fs/pipe.c file_operations methods know that pipe is not an internal one; no need to check pipe->inode for those callers. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: set file->private_data to ->i_pipe	Al Viro
	simplify get_pipe_info(), while we are at it Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: don't use ->i_mutex	Al Viro
	now it can be done - put mutex into pipe_inode_info, use it instead of ->i_mutex Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: take allocation and freeing of pipe_inode_info out of ->i_mutex	Al Viro
	* new field - pipe->files; number of struct file over that pipe (all sharing the same inode, of course); protected by inode->i_lock. * pipe_release() decrements pipe->files, clears inode->i_pipe when if the counter has reached 0 (all under ->i_lock) and, in that case, frees pipe after having done pipe_unlock() * fifo_open() starts with grabbing ->i_lock, and either bumps pipe->files if ->i_pipe was non-NULL or allocates a new pipe (dropping and regaining ->i_lock) and rechecks ->i_pipe; if it's still NULL, inserts new pipe there, otherwise bumps ->i_pipe->files and frees the one we'd allocated. At that point we know that ->i_pipe is non-NULL and won't go away, so we can do pipe_lock() on it and proceed as we used to. If we end up failing, decrement pipe->files and if it reaches 0 clear ->i_pipe and free the sucker after pipe_unlock(). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: preparation to new locking rules	Al Viro
	* use the fact that file_inode(file)->i_pipe doesn't change while the file is opened - no locks needed to access that. * switch to pipe_lock/pipe_unlock where it's easy to do Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: switch wait_for_partner() and wake_up_partner() to pipe_inode_info	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	pipe: fold file_operations instances in one	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	fold fifo.c into pipe.c	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	lift sb_start_write out of ->splice_write()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	lift sb_start_write into default_file_splice_write()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	lift sb_start_write() out of ->write()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	switch compat readv/writev variants to COMPAT_SYSCALL_DEFINE	Al Viro
	... and take to fs/read_write.c Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	f2fs: use mnt_want_write_file() in ioctl	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	lift sb_start_write/sb_end_write out of ->aio_write()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	hpfs: move setting hpfs-private i_dirty to ->write_end()	Al Viro
	... so that writev(2) doesn't miss it. Get rid of hpfs_file_write(). Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	reiserfs: don't wank with EFBIG before calling do_sync_write()	Al Viro
	look for file_capable() in there... Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	fold release_mounts() into namespace_unlock()	Al Viro
	... and provide namespace_lock() as a trivial wrapper; switch to those two consistently. Result is patterned after rtnl_lock/rtnl_unlock pair. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	switch unlock_mount() to namespace_unlock(), convert all umount_tree() callers	Al Viro
	which allows to kill the last argument of umount_tree() and make release_mounts() static. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	more conversions to namespace_unlock()	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	get rid of the second argument of shrink_submounts()	Al Viro
	... it's always &unmounted. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	saner umount_tree()/release_mounts(), part 1	Al Viro
	global list of release_mounts() fodder, protected by namespace_sem; eventually, all umount_tree() callers will use it as kill list. Helper picking the contents of that list, releasing namespace_sem and doing release_mounts() on what it got. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	get rid of full-hash scan on detaching vfsmounts	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	mnt: release locks on error path in do_loopback	Andrey Vagin
	do_loopback calls lock_mount(path) and forget to unlock_mount if clone_mnt or copy_mnt fails. [ 77.661566] ================================================ [ 77.662939] [ BUG: lock held when returning to user space! ] [ 77.664104] 3.9.0-rc5+ #17 Not tainted [ 77.664982] ------------------------------------------------ [ 77.666488] mount/514 is leaving the kernel with locks still held! [ 77.668027] 2 locks held by mount/514: [ 77.668817] #0: (&sb->s_type->i_mutex_key#7){+.+.+.}, at: [<ffffffff811cca22>] lock_mount+0x32/0xe0 [ 77.671755] #1: (&namespace_sem){+++++.}, at: [<ffffffff811cca3a>] lock_mount+0x4a/0xe0 Signed-off-by: Andrey Vagin <avagin@openvz.org> Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	procfs: add proc_remove_subtree()	Al Viro
	just what it sounds like; do that only to procfs subtrees you've created - doing that to something shared with another driver is not only antisocial, but might cause interesting races with proc_create() and its ilk. Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	ecryptfs: close rmmod race	Al Viro
	Signed-off-by: Al Viro <viro@zeniv.linux.org.uk>
2013-04-09	net: sctp: introduce uapi header for sctp	Daniel Borkmann
	This patch introduces an UAPI header for the SCTP protocol, so that we can facilitate the maintenance and development of user land applications or libraries, in particular in terms of header synchronization. To not break compatibility, some fragments from lksctp-tools' netinet/sctp.h have been carefully included, while taking care that neither kernel nor user land breaks, so both compile fine with this change (for lksctp-tools I tested with the old netinet/sctp.h header and with a newly adapted one that includes the uapi sctp header). lksctp-tools smoke test run through successfully as well in both cases. Suggested-by: Neil Horman <nhorman@tuxdriver.com> Cc: Neil Horman <nhorman@tuxdriver.com> Cc: Vlad Yasevich <vyasevich@gmail.com> Signed-off-by: Daniel Borkmann <dborkman@redhat.com> Signed-off-by: David S. Miller <davem@davemloft.net>
2013-04-09	NFSv4: Fix another potential state manager deadlock	Trond Myklebust
	Don't hold the NFSv4 sequence id while we check for open permission. The call to ACCESS may block due to reboot recovery. Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
2013-04-09	ext4: improve credit estimate for EXT4_SINGLEDATA_TRANS_BLOCKS	Jan Kara
	Estimate of 27 credits for allocation of a block in extent based inode is unnecessarily high. We can easily argue 20 is enough. Signed-off-by: Jan Kara <jack@suse.cz> Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
2013-04-09	ext4: speed-up releasing blocks on commit	Andrey Sidorov
	Improve mb_free_blocks speed by clearing entire range at once instead of iterating over each bit. Freeing block-by-block also makes buddy bitmap subtree flip twice making most of the work a no-op. Very few bits in buddy bitmap require change, e.g. freeing entire group is a 1 bit flip only. As a result, releasing blocks of 60G file now takes 5ms instead of 2.7s. This is especially good for non-preemptive kernels as there is no rescheduling during release. Signed-off-by: Andrey Sidorov <qrxd43@motorola.com> Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
2013-04-09	ext4: fix free space estimate in ext4_nonda_switch()	Eric Whitney
	Values stored in s_freeclusters_counter and s_dirtyclusters_counter are both in cluster units. Remove the cluster to block conversion applied to s_freeclusters_counter causing an inflated estimate of free space because s_dirtyclusters_counter is not similarly converted. Rename free_blocks and dirty_blocks to better reflect the units these variables contain to avoid future confusion. This fix corrects ENOSPC failures for xfstests 127 and 231 on bigalloc file systems. Signed-off-by: Eric Whitney <enwlinux@gmail.com> Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
2013-04-09	ext4: fix deadlock with quota feature	Jan Kara
	We didn't mark hidden quota files with S_NOQUOTA flag and thus quota was accounted even for quota files. Thus we could recurse back to quota code when adding new blocks to quota file which can easily deadlock. Mark hidden quota files properly. Signed-off-by: Jan Kara <jack@suse.cz> Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
2013-04-09	nfsd4: don't close read-write opens too soon	J. Bruce Fields
	Don't actually close any opens until we don't need them at all. This means being left with write access when it's not really necessary, but that's better than putting a file that might still have posix locks held on it, as we have been. Reported-by: Toralf Förster <toralf.foerster@gmx.de> Cc: stable@kernel.org Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	nfsd4: release lockowners on last unlock in 4.1 case	J. Bruce Fields
	In the 4.1 case we're supposed to release lockowners as soon as they're no longer used. It would probably be more efficient to reference count them, but that's slightly fiddly due to the need to have callbacks from locks.c to take into account lock merging and splitting. For most cases just scanning the inode's lock list on unlock for matching locks will be sufficient. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	nfsd4: more sessions/open-owner-replay cleanup	J. Bruce Fields
	More logic that's unnecessary in the 4.1 case. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	nfsd4: no need for replay_owner in sessions case	J. Bruce Fields
	The replay_owner will never be used in the sessions case. Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	nfsd4: remove some redundant comments	J. Bruce Fields
	Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	nfsd: use kmem_cache_free() instead of kfree()	Wei Yongjun
	memory allocated by kmem_cache_alloc() should be freed using kmem_cache_free(), not kfree(). Signed-off-by: Wei Yongjun <yongjun_wei@trendmicro.com.cn> Cc: stable@kernel.org Signed-off-by: J. Bruce Fields <bfields@redhat.com>
2013-04-09	f2fs: fix typo mistakes	Namjae Jeon
	Fix typo mistakes. 1. I think that it should be 'L' instead of 'V'. 2. and try to fix 'Front' instead of 'Frone' Signed-off-by: Namjae Jeon <namjae.jeon@samsung.com> Signed-off-by: Amit Sahrawat <a.sahrawat@samsung.com> Signed-off-by: Jaegeuk Kim <jaegeuk.kim@samsung.com>
2013-04-09	f2fs: write checkpoint before starting FG_GC	Jaegeuk Kim
	In order to be aware of prefree and free sections during FG_GC, let's start with write_checkpoint(). Signed-off-by: Jaegeuk Kim <jaegeuk.kim@samsung.com>
2013-04-09	f2fs: fix the logic of IS_DNODE()	Zhihui Zhang
	If (ofs % (NIDS_PER_BLOCK + 1) == 0), the node is an indirect node block. Signed-off-by: Zhihui Zhang <zzhsuny@gmail.com> Signed-off-by: Jaegeuk Kim <jaegeuk.kim@samsung.com>
2013-04-09	f2fs: introduce a new global lock scheme	Jaegeuk Kim
	In the previous version, f2fs uses global locks according to the usage types, such as directory operations, block allocation, block write, and so on. Reference the following lock types in f2fs.h. enum lock_type { RENAME, /* for renaming operations / DENTRY_OPS, / for directory operations / DATA_WRITE, / for data write / DATA_NEW, / for data allocation / DATA_TRUNC, / for data truncate / NODE_NEW, / for node allocation / NODE_TRUNC, / for node truncate / NODE_WRITE, / for node write */ NR_LOCK_TYPE, }; In that case, we lose the performance under the multi-threading environment, since every types of operations must be conducted one at a time. In order to address the problem, let's share the locks globally with a mutex array regardless of any types. So, let users grab a mutex and perform their jobs in parallel as much as possbile. For this, I propose a new global lock scheme as follows. 0. Data structure - f2fs_sb_info -> mutex_lock[NR_GLOBAL_LOCKS] - f2fs_sb_info -> node_write 1. mutex_lock_op(sbi) - try to get an avaiable lock from the array. - returns the index of the gottern lock variable. 2. mutex_unlock_op(sbi, index of the lock) - unlock the given index of the lock. 3. mutex_lock_all(sbi) - grab all the locks in the array before the checkpoint. 4. mutex_unlock_all(sbi) - release all the locks in the array after checkpoint. 5. block_operations() - call mutex_lock_all() - sync_dirty_dir_inodes() - grab node_write - sync_node_pages() Note that, the pairs of mutex_lock_op()/mutex_unlock_op() and mutex_lock_all()/mutex_unlock_all() should be used together. Signed-off-by: Jaegeuk Kim <jaegeuk.kim@samsung.com>