gvisor - Container Runtime Sandbox

Age	Commit message (Collapse)	Author
2020-09-18	Merge release-20200907.0-139-g313e1988c (automated)	gVisor bot

2020-09-18	Drop ARCH_GET_FS	Michael Pratt
	Go does not call arch_prctl(ARCH_GET_FS), nor am I sure it ever did. Drop the filter. PiperOrigin-RevId: 332470532
2020-09-17	Merge release-20200907.0-125-gd796b100e (automated)	gVisor bot

2020-09-17	Merge release-20200907.0-123-gf0b1bd434 (automated)	gVisor bot

2020-09-17	Merge release-20200907.0-124-gda07e38f7 (automated)	gVisor bot

2020-09-17	Remove option to panic gofer	Fabricio Voznika
	Gofer panics are suppressed by p9 server and an error is returned to the caller, making it effectively the same as returning EROFS. PiperOrigin-RevId: 332282959
2020-09-17	Merge release-20200907.0-121-ga11061d78 (automated)	gVisor bot

2020-09-17	Add VFS2 overlay support in runsc	Fabricio Voznika
	All tests under runsc are passing with overlay enabled. Updates #1487, #1199 PiperOrigin-RevId: 332181267
2020-09-16	Refactor removed default test dimension	Fabricio Voznika
	ptrace was always selected as a dimension before, but not anymore. Some tests were specifying "overlay" expecting that to be in addition to the default. PiperOrigin-RevId: 332004111
2020-09-16	Merge release-20200907.0-56-gdcd532e2e (automated)	gVisor bot

2020-09-15	Add support for OCI seccomp filters in the sandbox.	Ian Lewis
	OCI configuration includes support for specifying seccomp filters. In runc, these filter configurations are converted into seccomp BPF programs and loaded into the kernel via libseccomp. runsc needs to be a static binary so, for runsc, we cannot rely on a C library and need to implement the functionality in Go. The generator added here implements basic support for taking OCI seccomp configuration and converting it into a seccomp BPF program with the same behavior as a program generated by libseccomp. - New conditional operations were added to pkg/seccomp to support operations available in OCI. - AllowAny and AllowValue were renamed to MatchAny and EqualTo to better reflect that syscalls matching the conditionals result in the provided action not simply SCMP_RET_ALLOW. - BuildProgram in pkg/seccomp no longer panics if provided an empty list of rules. It now builds a program with the architecture sanity check only. - ProgramBuilder now allows adding labels that are unused. However, backwards jumps are still not permitted. Fixes #510 PiperOrigin-RevId: 331938697
2020-09-08	Merge release-20200818.0-132-gc8f1ce288 (automated)	gVisor bot

2020-09-08	Honor readonly flag for root mount	Fabricio Voznika
	Updates #1487 PiperOrigin-RevId: 330580699
2020-09-08	Merge release-20200818.0-127-gd35f07b36 (automated)	gVisor bot

2020-09-08	Improve type safety for transport protocol options	Ghanan Gowripalan
	The existing implementation for TransportProtocol.{Set}Option take arguments of an empty interface type which all types (implicitly) implement; any type may be passed to the functions. This change introduces marker interfaces for transport protocol options that may be set or queried which transport protocol option types implement to ensure that invalid types are caught at compile time. Different interfaces are used to allow the compiler to enforce read-only or set-only socket options. RELNOTES: n/a PiperOrigin-RevId: 330559811
2020-09-04	Merge release-20200818.0-124-g2202812e0 (automated)	gVisor bot

2020-09-04	Simplify FD handling for container start/exec	Fabricio Voznika
	VFS1 and VFS2 host FDs have different dupping behavior, making error prone to code for both. Change the contract so that FDs are released as they are used, so the caller can simple defer a block that closes all remaining files. This also addresses handling of partial failures. With this fix, more VFS2 tests can be enabled. Updates #1487 PiperOrigin-RevId: 330112266
2020-09-02	Merge release-20200818.0-108-ga0e431038 (automated)	gVisor bot

2020-09-02	Merge pull request #3822 from btw616:fix/issue-3821	gVisor bot
	PiperOrigin-RevId: 329710371
2020-09-02	Merge release-20200818.0-105-g37a217aca (automated)	gVisor bot

2020-09-01	Implement setattr+clunk in 9P	Fabricio Voznika
	This is to cover the common pattern: open->read/write->close, where SetAttr needs to be called to update atime/mtime before the file is closed. Benchmark results: BM_OpenReadClose/10240 CPU setattr+clunk: 63783 ns VFS2: 68109 ns VFS1: 72507 ns Updates #1198 PiperOrigin-RevId: 329628461
2020-09-01	Merge release-20200818.0-102-g2eaf54dd5 (automated)	gVisor bot

2020-09-01	Refactor tty codebase to use master-replica terminology.	Ayush Ranjan
	Updates #2972 PiperOrigin-RevId: 329584905
2020-09-01	Merge release-20200818.0-99-g71589b7f7 (automated)	gVisor bot

2020-09-01	Let flags be overriden from OCI annotations	Fabricio Voznika
	This allows runsc flags to be set per sandbox instance. For example, K8s pod annotations can be used to enable --debug for a single pod, making troubleshoot much easier. Similarly, features like --vfs2 can be enabled for experimentation without affecting other pods in the node. Closes #3494 PiperOrigin-RevId: 329542815
2020-09-01	Dup stdio FDs for VFS2 when starting a child container	Tiwei Bie
	Currently the stdio FDs are not dupped and will be closed unexpectedly in VFS2 when starting a child container. This patch fixes this issue. Fixes: #3821 Signed-off-by: Tiwei Bie <tiwei.btw@antgroup.com>
2020-08-28	Merge release-20200818.0-83-gbdd5996a7 (automated)	gVisor bot

2020-08-28	Improve type safety for network protocol options	Ghanan Gowripalan
	The existing implementation for NetworkProtocol.{Set}Option take arguments of an empty interface type which all types (implicitly) implement; any type may be passed to the functions. This change introduces marker interfaces for network protocol options that may be set or queried which network protocol option types implement to ensure that invalid types are caught at compile time. Different interfaces are used to allow the compiler to enforce read-only or set-only socket options. PiperOrigin-RevId: 328980359
2020-08-27	Merge release-20200818.0-66-g32e7a54f7 (automated)	gVisor bot

2020-08-26	Make flag propagation automatic	Fabricio Voznika
	Use reflection and tags to provide automatic conversion from Config to flags. This makes adding new flags less error-prone, skips flags using default values (easier to read), and makes tests correctly use default flag values for test Configs. Updates #3494 PiperOrigin-RevId: 328662070
2020-08-25	Merge release-20200818.0-54-gcb573c8e0 (automated)	gVisor bot

2020-08-25	Expose basic coverage information to userspace through kcov interface.	Dean Deng
	In Linux, a kernel configuration is set that compiles the kernel with a custom function that is called at the beginning of every basic block, which updates the memory-mapped coverage information. The Go coverage tool does not allow us to inject arbitrary instructions into basic blocks, but it does provide data that we can convert to a kcov-like format and transfer them to userspace through a memory mapping. Note that this is not a strict implementation of kcov, which is especially tricky to do because we do not have the same coverage tools available in Go that that are available for the actual Linux kernel. In Linux, a kernel configuration is set that compiles the kernel with a custom function that is called at the beginning of every basic block to write program counters to the kcov memory mapping. In Go, however, coverage tools only give us a count of basic blocks as they are executed. Every time we return to userspace, we collect the coverage information and write out PCs for each block that was executed, providing userspace with the illusion that the kcov data is always up to date. For convenience, we also generate a unique synthetic PC for each block instead of using actual PCs. Finally, we do not provide thread-specific coverage data (each kcov instance only contains PCs executed by the thread owning it); instead, we will supply data for any file specified by -- instrumentation_filter. Also, fix issue in nogo that was causing pkg/coverage:coverage_nogo compilation to fail. PiperOrigin-RevId: 328426526
2020-08-25	Include shim in individual released binaries.	Adin Scannell
	The debian rules are also moved to the top-level, since they apply to binaries outside the //runsc directory. Fixes #3665 PiperOrigin-RevId: 328379709
2020-08-22	Merge release-20200810.0-90-g17bc5c1b0 (automated)	gVisor bot

2020-08-21	[vfs] Allow mountpoint to be an existing non-directory.	Ayush Ranjan
	Unlike linux mount(2), OCI spec allows mounting on top of an existing non-directory file. PiperOrigin-RevId: 327914342
2020-08-21	Merge release-20200810.0-83-g5ec3d4ed3 (automated)	gVisor bot

2020-08-21	Make mounts ReadWrite first, then later change to ReadOnly.	Nicolas Lacasse
	This lets us create "synthetic" mountpoint directories in ReadOnly mounts during VFS setup. Also add context.WithMountNamespace, as some filesystems (like overlay) require a MountNamespace on ctx to handle vfs.Filesystem Operations. PiperOrigin-RevId: 327874971
2020-08-20	Merge release-20200810.0-78-g73c69cb4d (automated)	gVisor bot

2020-08-20	[vfs] Create recursive dir creation util.	Ayush Ranjan
	Refactored the recursive dir creation util in runsc/boot/vfs.go to be more flexible. PiperOrigin-RevId: 327719100
2020-08-20	Merge release-20200810.0-69-gbe76c7ce6 (automated)	gVisor bot

2020-08-19	Move boot.Config to its own package	Fabricio Voznika
	Updates #3494 PiperOrigin-RevId: 327548511
2020-08-20	Merge release-20200810.0-68-g633570462 (automated)	gVisor bot

2020-08-19	Remove path walk from localFile.Mknod	Fabricio Voznika
	Replace mknod call with mknodat equivalent to protect against symlink attacks. Also added Mknod tests. Remove goferfs reliance on gofer to check for file existence before creating a synthetic entry. Updates #2923 PiperOrigin-RevId: 327544516
2020-08-18	Merge release-20200810.0-51-g760c131da (automated)	gVisor bot

2020-08-18	Return EROFS if mount is read-only	Fabricio Voznika
	PiperOrigin-RevId: 327300635
2020-08-10	Merge release-20200804.0-55-g79e7d0b06 (automated)	gVisor bot

2020-08-10	Run GC before sandbox exit when leak checking is enabled.	Dean Deng
	Running garbage collection enqueues all finalizers, which are used by the refs/refs_vfs2 packages to detect reference leaks. Note that even with GC, there is no guarantee that all finalizers will be run before the program exits. This is a best effort attempt to activate leak checks as much as possible. Updates #3545. PiperOrigin-RevId: 325834438
2020-08-08	Merge release-20200804.0-52-g3be26a271 (automated)	gVisor bot

2020-08-07	[vfs2] Fix tmpfs mounting.	Ayush Ranjan
	Earlier we were using NLink to decide if /tmp is empty or not. However, NLink at best tells us about the number of subdirectories (via the ".." entries). NLink = n + 2 for n subdirectories. But it does not tell us if the directory is empty. There still might be non-directory files. We could also not rely on NLink because host overlayfs always returned 1. VFS1 uses Readdir to decide if the directory is empty. Used a similar approach. We now use IterDirents to decide if the "/tmp" directory is empty. Fixes #3369 PiperOrigin-RevId: 325554234
2020-08-06	Merge release-20200804.0-29-g63447e5af (automated)	gVisor bot