gvisor - Container Runtime Sandbox

Age	Commit message (Collapse)	Author
2020-09-16	Support multiple FUSE kernel versions of FUSE_INIT response struct	Craig Chi
	The fuse_init_out struct changes in different FUSE kernel versions. A FUSE server may implement older versions of fuse_init_out, but they share common attributes from the beginning. Implement variable-length marshallable interface to support older versions of ABI. Fixes #3707
2020-09-16	Implementing inode.Getlink	Craig Chi
	kernfs uses inode.Getlink to resolve symlink when look up paths. Updates #3452
2020-09-16	Fix FUSE_READDIR offset issue	Craig Chi
	According to readdir(3), the offset attribute in struct dirent is the offset to the next dirent instead of the offset of itself. Send the successive FUSE_READDIR requests with the offset retrieved from the last entry. Updates #3255
2020-09-16	fuse: use safe go_marshal API for FUSE	Ridwan Sharif
	Until #3698 is resolved, this change is needed to ensure we're not corrupting memory anywhere.
2020-09-16	fuse: Implement IterDirents for directory file description	Ridwan Sharif
	Fixes #3255. This change adds support for IterDirents. You can now use `ls` in the FUSE sandbox. Co-authored-by: Craig Chi <craigchi@google.com>
2020-09-16	Implement FUSE_RMDIR	Ridwan Sharif
	Fixes #3587 Co-authored-by: Craig Chi <craigchi@google.com>
2020-09-16	Implement FUSE_READ	Jinmou Li
	Fixes #3206
2020-09-16	Implement FUSE_MKDIR	Boyuan He
	Fixes #3392
2020-09-16	Implement FUSE_READLINK	Boyuan He
	Fixes #3316
2020-09-16	Implement FUSE_SYMLINK	Boyuan He
	Fixes #3452
2020-09-16	Implement FUSE_MKNOD	Boyuan He
	Fixes #3492
2020-09-16	Implement FUSE_RELEASE/RELEASEDIR	Boyuan He
	Fixes #3314
2020-09-16	Implement FUSE_OPEN/OPENDIR	Boyuan He
	Fixes #3174
2020-09-16	Implement FUSE_LOOKUP	Andrei Vagin
	Fixes #3231 Co-authored-by: Boyuan He <heboyuan@google.com>
2020-09-16	Extend integration test to test sequence of FUSE operation	Craig Chi
	Original FUSE integration test has limited capabilities. To test more situations, the new integration test framework introduces a protocol to communicate between testing thread and the FUSE server. In summary, this change includes: 1. Remove CompareResult() and break SetExpected() into SetServerResponse() and GetServerActualRequest(). We no longer set up an expected request because we want to retrieve the actual FUSE request made to the FUSE server and check in the testing thread. 2. Declare a serial buffer data structure to save the received requests and expected responses sequentially. The data structure contains a cursor to indicate the progress of accessing. This change makes sequential SetServerResponse() and GetServerActualRequest() possible. 3. Replace 2 single directional pipes with 1 bi-directional socketpair. A protocol which starts with FuseTestCmd is used between the testing thread and the FUSE server to provide various functionality. Fixes #3405
2020-09-16	Rename marshal.Task to marshal.CopyContext.	Rahat Mahmood
	CopyContext is a better name for the interface because from go-marshal's perspective, the interface has nothing to do with a task. A kernel.Task happens to implement the interface, but so can other things like MemoryManager and IO sequences. PiperOrigin-RevId: 331959678
2020-09-15	Enable automated marshalling for the syscall package.	Rahat Mahmood
	PiperOrigin-RevId: 331940975
2020-09-15	Implement gvisor verity fs ioctl with GETFLAGS	Chong Cai
	PiperOrigin-RevId: 331905347
2020-09-15	Support setting STATX_SIZE for kernfs.InodeAttrs.	Dean Deng
	Make setting STATX_SIZE a no-op, if it is valid for the given permissions and file type. Also update proc tests, which were overfitted before. Fixes #3842. Updates #1193. PiperOrigin-RevId: 331861087
2020-09-15	Merge pull request #3895 from btw616:fix/issue-3894	gVisor bot
	PiperOrigin-RevId: 331824411
2020-09-15	Fix proc.(*fdDir).IterDirents for VFS2	Tiwei Bie
	Currently the returned offset is an index, and we can't use it to find the next fd to serialize, because getdents should iterate correctly despite mutation of fds. Instead, we can return the next fd to serialize plus 2 (which accounts for "." and "..") as the offset. Fixes: #3894 Signed-off-by: Tiwei Bie <tiwei.btw@antgroup.com>
2020-09-14	Correct FDSize in /proc/[pid]/status.	Jamie Liu
	In Linux, FDSize is fs/proc/array.c:task_state() => struct fdtable::max_fds, which is set to the underlying array's length in fs/file.c:alloc_fdtable(). Follow-up changes: - Remove FDTable.GetRefs() and FDTable.GetRefsVFS2(), which are unused. - Reset FDTable.used to 0 during restore, since the subsequent calls to FDTable.setAll() increment it again, causing its value to be doubled. (After this CL, FDTable.used is only used to avoid reallocation in FDTable.GetFDs(), so this fix is not very visible.) PiperOrigin-RevId: 331588190
2020-09-11	Move the 'marshal' and 'primitive' packages to the 'pkg' directory.	Rahat Mahmood
	PiperOrigin-RevId: 331256608
2020-09-11	Implement copy-up-coherent mmap for VFS2 overlayfs.	Jamie Liu
	This is very similar to copy-up-coherent mmap in the VFS1 overlay, with the minor wrinkle that there is no fs.InodeOperations.Mappable(). Updates #1199 PiperOrigin-RevId: 331206314
2020-09-11	Fix host unix socket to not swallow EOF incorrectly.	Bhasker Hariharan
	Fixes an error where in case of a receive buffer larger than the host send buffer size for a host backed unix dgram socket we would end up swallowing EOF from recvmsg syscall causing the read() to block forever. PiperOrigin-RevId: 331192810
2020-09-09	Don't write VFS2 gofer client timestamps back on dentry destruction.	Jamie Liu
	This feature is too expensive for runsc, even with setattrclunk, because fsgofer.localFile.SetAttr() ends up needing to call reopenProcFD(), incurring two string allocations for the FD pathname, an fd.FD allocation, and two calls to runtime.SetFinalizer() when the fd.FD is created and closed respectively (b/133767962) (plus the actual cost of the syscalls, which is negligible). PiperOrigin-RevId: 330843012
2020-09-08	Implement synthetic mountpoints for kernfs.	Jamie Liu
	PiperOrigin-RevId: 330629897
2020-09-08	Add check for both child and childMerkle ENOENT	gVisor bot
	The check in verity walk returns error for non ENOENT cases, and all ENOENT results should be checked. This case was missing. PiperOrigin-RevId: 330604771
2020-09-08	Implement ioctl with enable verity	gVisor bot
	ioctl with FS_IOC_ENABLE_VERITY is added to verity file system to enable a file as verity file. For a file, a Merkle tree is built with its data. For a directory, a Merkle tree is built with the root hashes of its children. PiperOrigin-RevId: 330604368
2020-09-08	[vfs] overlayfs: decref VD when not using it.	Ayush Ranjan
	overlay/filesystem.go:lookupLocked() did not DecRef the VD on some error paths when it would not end up saving or using the VD. PiperOrigin-RevId: 330589742
2020-09-08	Honor readonly flag for root mount	Fabricio Voznika
	Updates #1487 PiperOrigin-RevId: 330580699
2020-09-08	[vfs] Capitalize x in the {Get/Set/Remove/List}xattr functions.	Ayush Ranjan
	PiperOrigin-RevId: 330554450
2020-09-02	[vfs] Implement xattr for overlayfs.	Ayush Ranjan
	PiperOrigin-RevId: 329825497
2020-09-02	[vfs] Fix error handling in overlayfs OpenAt.	Ayush Ranjan
	Updates #1199 PiperOrigin-RevId: 329802274
2020-09-01	Implement setattr+clunk in 9P	Fabricio Voznika
	This is to cover the common pattern: open->read/write->close, where SetAttr needs to be called to update atime/mtime before the file is closed. Benchmark results: BM_OpenReadClose/10240 CPU setattr+clunk: 63783 ns VFS2: 68109 ns VFS1: 72507 ns Updates #1198 PiperOrigin-RevId: 329628461
2020-09-01	Refactor tty codebase to use master-replica terminology.	Ayush Ranjan
	Updates #2972 PiperOrigin-RevId: 329584905
2020-09-01	[go-marshal] Enable auto-marshalling for fs/tty.	Ayush Ranjan
	PiperOrigin-RevId: 329564614
2020-08-31	Don't use read-only host FD for writable gofer dentries in VFS2.	Jamie Liu
	As documented for gofer.dentry.hostFD. PiperOrigin-RevId: 329372319
2020-08-31	Implement walk in gvisor verity fs	gVisor bot
	Implement walk directories in gvisor verity file system. For each step, the child dentry is verified against a verified parent root hash. PiperOrigin-RevId: 329358747
2020-08-28	Implement StatFS for various VFS2 filesystems.	Rahat Mahmood
	This mainly involved enabling kernfs' client filesystems to provide a StatFS implementation. Fixes #3411, #3515. PiperOrigin-RevId: 329009864
2020-08-27	unix: return ECONNREFUSE if a socket file exists but a socket isn't bound to it	Andrei Vagin
	PiperOrigin-RevId: 328843560
2020-08-26	Remove spurious fd.IncRef().	Nicolas Lacasse
	PiperOrigin-RevId: 328583461
2020-08-26	tmpfs: Allow xattrs in the trusted namespace if creds has CAP_SYS_ADMIN.	Nicolas Lacasse
	This is needed to support the overlay opaque attribute. PiperOrigin-RevId: 328552985
2020-08-25	Use new reference count utility throughout gvisor.	Dean Deng
	This uses the refs_vfs2 template in vfs2 as well as objects common to vfs1 and vfs2. Note that vfs1-only refcounts are not replaced, since vfs1 will be deleted soon anyway. The following structs now use the new tool, with leak check enabled: devpts:rootInode fuse:inode kernfs:Dentry kernfs:dir kernfs:readonlyDir kernfs:StaticDirectory proc:fdDirInode proc:fdInfoDirInode proc:subtasksInode proc:taskInode proc:tasksInode vfs:FileDescription vfs:MountNamespace vfs:Filesystem sys:dir kernel:FSContext kernel:ProcessGroup kernel:Session shm:Shm mm:aioMappable mm:SpecialMappable transport:queue And the following use the template, but because they currently are not leak checked, a TODO is left instead of enabling leak check in this patch: kernel:FDTable tun:tunEndpoint Updates #1486. PiperOrigin-RevId: 328460377
2020-08-25	Return non-zero size for tmpfs statfs(2).	Jamie Liu
	This does not implement accepting or enforcing any size limit, which will be more complex and has performance implications; it just returns a fixed non-zero size. Updates #1936 PiperOrigin-RevId: 328428588
2020-08-25	Expose basic coverage information to userspace through kcov interface.	Dean Deng
	In Linux, a kernel configuration is set that compiles the kernel with a custom function that is called at the beginning of every basic block, which updates the memory-mapped coverage information. The Go coverage tool does not allow us to inject arbitrary instructions into basic blocks, but it does provide data that we can convert to a kcov-like format and transfer them to userspace through a memory mapping. Note that this is not a strict implementation of kcov, which is especially tricky to do because we do not have the same coverage tools available in Go that that are available for the actual Linux kernel. In Linux, a kernel configuration is set that compiles the kernel with a custom function that is called at the beginning of every basic block to write program counters to the kcov memory mapping. In Go, however, coverage tools only give us a count of basic blocks as they are executed. Every time we return to userspace, we collect the coverage information and write out PCs for each block that was executed, providing userspace with the illusion that the kcov data is always up to date. For convenience, we also generate a unique synthetic PC for each block instead of using actual PCs. Finally, we do not provide thread-specific coverage data (each kcov instance only contains PCs executed by the thread owning it); instead, we will supply data for any file specified by -- instrumentation_filter. Also, fix issue in nogo that was causing pkg/coverage:coverage_nogo compilation to fail. PiperOrigin-RevId: 328426526
2020-08-25	[go-marshal] Enable auto-marshalling for host tty.	Ayush Ranjan
	PiperOrigin-RevId: 328415633
2020-08-25	overlay: clonePrivateMount must pass a Dentry reference to MakeVirtualDentry.	Nicolas Lacasse
	PiperOrigin-RevId: 328410065
2020-08-25	Fix deadlock in gofer direct IO.	Dean Deng
	Fixes several java runtime tests: java/nio/channels/FileChannel/directio/ReadDirect.java java/nio/channels/FileChannel/directio/PreadDirect.java Updates #3576. PiperOrigin-RevId: 328281849
2020-08-24	Flush in fsimpl/gofer.regularFileFD.OnClose() if there are no dirty pages.	Jamie Liu
	This is closer to indistinguishable from VFS1 behavior. PiperOrigin-RevId: 328256068