gvisor - Container Runtime Sandbox

Age	Commit message (Collapse)	Author
2020-12-12	Merge release-20201208.0-36-g1e92732eb (automated)	gVisor bot

2020-12-11	Merge release-20201208.0-28-gaf4afdc0e (automated)	gVisor bot

2020-12-11	[netstack] Decouple tcpip.ControlMessages from the IP control messges.	Ayush Ranjan
	tcpip.ControlMessages can not contain Linux specific structures which makes it painful to convert back and forth from Linux to tcpip back to Linux when passing around control messages in hostinet and raw sockets. Now we convert to the Linux version of the control message as soon as we are out of tcpip. PiperOrigin-RevId: 347027065
2020-12-10	Merge release-20201130.0-74-g92ca72ecb (automated)	gVisor bot

2020-12-09	Add support for IP_RECVORIGDSTADDR IP option.	Bhasker Hariharan
	Fixes #5004 PiperOrigin-RevId: 346643745
2020-12-07	Merge release-20201130.0-58-g615c3380d (automated)	gVisor bot

2020-12-07	Export IGMP stats	Arthur Sfez
	PiperOrigin-RevId: 346197760
2020-12-02	Merge release-20201117.0-100-gbdaae08ee (automated)	gVisor bot

2020-12-02	Extract ICMPv4/v6 specific stats to their own types	Arthur Sfez
	This change lets us split the v4 stats from the v6 stats, which will be useful when adding stats for each network endpoint. PiperOrigin-RevId: 345322615
2020-12-02	Merge release-20201117.0-97-g1375a87a2 (automated)	gVisor bot

2020-12-02	[netstack] Refactor common utils out of netstack to socket package.	Ayush Ranjan
	Moved AddressAndFamily() and ConvertAddress() to socket package from netstack. This helps because these utilities are used by sibling netstack packages. Such sibling dependencies can later cause circular dependencies. Common utils shared between siblings should be moved up to the parent. PiperOrigin-RevId: 345275571
2020-11-26	Merge release-20201109.0-120-gad8311242 (automated)	gVisor bot

2020-11-26	[netstack] Add SOL_TCP options to SocketOptions.	Ayush Ranjan
	Ports the following options: - TCP_NODELAY - TCP_CORK - TCP_QUICKACK Also deletes the {Get/Set}SockOptBool interface methods from all implementations PiperOrigin-RevId: 344378824
2020-11-26	Merge release-20201109.0-119-gbebadb518 (automated)	gVisor bot

2020-11-25	[netstack] Add SOL_IP and SOL_IPV6 options to SocketOptions.	Ayush Ranjan
	We will use SocketOptions for all kinds of options, not just SOL_SOCKET options because (1) it is consistent with Linux which defines all option variables on the top level socket struct, (2) avoid code complexity. Appropriate checks have been added for matching option level to the endpoint type. Ported the following options to this new utility: - IP_MULTICAST_LOOP - IP_RECVTOS - IPV6_RECVTCLASS - IP_PKTINFO - IP_HDRINCL - IPV6_V6ONLY Changes in behavior (these are consistent with what Linux does AFAICT): - Now IP_MULTICAST_LOOP can be set for TCP (earlier it was a noop) but does not affect the endpoint itself. - We can now getsockopt IP_HDRINCL (earlier we would get an error). - Now we return ErrUnknownProtocolOption if SOL_IP or SOL_IPV6 options are used on unix sockets. - Now we return ErrUnknownProtocolOption if SOL_IPV6 options are used on non AF_INET6 endpoints. This change additionally makes the following modifications: - Add State() uint32 to commonEndpoint because both tcpip.Endpoint and transport.Endpoint interfaces have it. It proves to be quite useful. - Gets rid of SocketOptionsHandler.IsListening(). It was an anomaly as it was not a handler. It is now implemented on netstack itself. - Gets rid of tcp.endpoint.EndpointInfo and directly embeds stack.TransportEndpointInfo. There was an unnecessary level of embedding which served no purpose. - Removes some checks dual_stack_test.go that used the errors from GetSockOptBool(tcpip.V6OnlyOption) to confirm some state. This is not consistent with the new design and also seemed to be testing the implementation instead of behavior. PiperOrigin-RevId: 344354051
2020-11-19	Merge release-20201109.0-84-ge5650d124 (automated)	gVisor bot

2020-11-18	[netstack] Move SO_KEEPALIVE and SO_ACCEPTCONN option to SocketOptions.	Ayush Ranjan
	PiperOrigin-RevId: 343217712
2020-11-18	Merge release-20201109.0-79-gdf37babd5 (automated)	gVisor bot

2020-11-18	[netstack] Move SO_REUSEPORT and SO_REUSEADDR option to SocketOptions.	Ayush Ranjan
	This changes also introduces: - `SocketOptionsHandler` interface which can be implemented by endpoints to handle endpoint specific behavior on SetSockOpt. This is analogous to what Linux does. - `DefaultSocketOptionsHandler` which is a default implementation of the above. This is embedded in all endpoints so that we don't have to uselessly implement empty functions. Endpoints with specific behavior can override the embedded method by manually defining its own implementation. PiperOrigin-RevId: 343158301
2020-11-18	Merge release-20201109.0-77-g3e73c519a (automated)	gVisor bot

2020-11-18	[netstack] Move SO_NO_CHECK option to SocketOptions.	Ayush Ranjan
	PiperOrigin-RevId: 343146856
2020-11-18	Merge release-20201109.0-71-gfc342fb43 (automated)	gVisor bot

2020-11-18	[netstack] Move SO_PASSCRED option to SocketOptions.	Ayush Ranjan
	This change also makes the following fixes: - Make SocketOptions use atomic operations instead of having to acquire/drop locks upon each get/set option. - Make documentation more consistent. - Remove tcpip.SocketOptions from socketOpsCommon because it already exists in transport.Endpoint. - Refactors get/set socket options tests to be easily extendable. PiperOrigin-RevId: 343103780
2020-11-17	Merge release-20201109.0-55-gfb9a649f3 (automated)	gVisor bot

2020-11-17	Fix SO_ERROR behavior for TCP in gVisor.	Bhasker Hariharan
	Fixes the behaviour of SO_ERROR for tcp sockets where in linux it returns sk->sk_err and if sk->sk_err is 0 then it returns sk->sk_soft_err. In gVisor TCP we endpoint.HardError is the equivalent of sk->sk_err and endpoint.LastError holds soft errors. This change brings this into alignment with Linux such that both hard/soft errors are cleared when retrieved using getsockopt(.. SO_ERROR) is called on a socket. Fixes #3812 PiperOrigin-RevId: 342868552
2020-11-13	Merge release-20201030.0-83-g5bb64ce1b (automated)	gVisor bot

2020-11-12	Refactor SOL_SOCKET options	Nayana Bidari
	Store all the socket level options in a struct and call {Get/Set}SockOpt on this struct. This will avoid implementing socket level options on all endpoints. This CL contains implementing one socket level option for tcp and udp endpoints. PiperOrigin-RevId: 342203981
2020-11-09	Merge release-20201030.0-53-g0fb5353e4 (automated)	gVisor bot

2020-11-09	Initialize references with a value of 1.	Dean Deng
	This lets us avoid treating a value of 0 as one reference. All references using the refsvfs2 template must call InitRefs() before the reference is incremented/decremented, or else a panic will occur. Therefore, it should be pretty easy to identify missing InitRef calls during testing. Updates #1486. PiperOrigin-RevId: 341411151
2020-11-02	Merge release-20201019.0-116-g5e606844d (automated)	gVisor bot

2020-11-01	Fix returned error when deleting non-existant address	Ian Lewis
	PiperOrigin-RevId: 340149214
2020-10-29	Merge release-20201019.0-103-g181fea0b5 (automated)	gVisor bot

2020-10-29	Make RedirectTarget thread safe	Kevin Krakauer
	Fixes #4613. PiperOrigin-RevId: 339746784
2020-10-29	Merge release-20201019.0-101-g02fe467b4 (automated)	gVisor bot

2020-10-29	Keep magic constants out of netstack	Kevin Krakauer
	PiperOrigin-RevId: 339721152
2020-10-29	Merge release-20201019.0-95-g3b4674ffe (automated)	gVisor bot

2020-10-27	Merge release-20201019.0-68-g59e2c9f16 (automated)	gVisor bot

2020-10-27	Add basic address deletion to netlink	Ian Lewis
	Updates #3921 PiperOrigin-RevId: 339195417
2020-10-26	Merge release-20201019.0-62-g0bdcee38b (automated)	gVisor bot

2020-10-26	Fix SCM Rights S/R reference leak.	Dean Deng
	Control messages collected when peeking into a socket were being leaked. PiperOrigin-RevId: 339114961
2020-10-24	Merge release-20201019.0-53-g8dfbec28a (automated)	gVisor bot

2020-10-23	Fix nogo tests in //pkg/sentry/socket/...	Ting-Yu Wang
	PiperOrigin-RevId: 338784921
2020-10-24	Merge release-20201019.0-51-g9f87400f0 (automated)	gVisor bot

2020-10-23	Support VFS2 save/restore.	Jamie Liu
	Inode number consistency checks are now skipped in save/restore tests for reasons described in greatest detail in StatTest.StateDoesntChangeAfterRename. They pass in VFS1 due to the bug described in new test case SimpleStatTest.DifferentFilesHaveDifferentDeviceInodeNumberPairs. Fixes #1663 PiperOrigin-RevId: 338776148
2020-10-23	Merge release-20201019.0-41-g6ee3520b6 (automated)	gVisor bot

2020-10-23	[vfs] kernfs: Implement remaining InodeAttr fields.	Ayush Ranjan
	Added the following fields in kernfs.InodeAttr: - blockSize - atime - mtime - ctime Also resolved all TODOs for #1193. Fixes #1193 PiperOrigin-RevId: 338714527
2020-10-23	Merge release-20201019.0-37-g39e9b3bb8 (automated)	gVisor bot

2020-10-23	Support getsockopt for SO_ACCEPTCONN.	Nayana Bidari
	The SO_ACCEPTCONN option is used only on getsockopt(). When this option is specified, getsockopt() indicates whether socket listening is enabled for the socket. A value of zero indicates that socket listening is disabled; non-zero that it is enabled. PiperOrigin-RevId: 338703206
2020-10-23	Merge release-20201019.0-34-g9ca66ec59 (automated)	gVisor bot

2020-10-23	Rewrite reference leak checker without finalizers.	Dean Deng
	Our current reference leak checker uses finalizers to verify whether an object has reached zero references before it is garbage collected. There are multiple problems with this mechanism, so a rewrite is in order. With finalizers, there is no way to guarantee that a finalizer will run before the program exits. When an unreachable object with a finalizer is garbage collected, its finalizer will be added to a queue and run asynchronously. The best we can do is run garbage collection upon sandbox exit to make sure that all finalizers are enqueued. Furthermore, if there is a chain of finalized objects, e.g. A points to B points to C, garbage collection needs to run multiple times before all of the finalizers are enqueued. The first GC run will register the finalizer for A but not free it. It takes another GC run to free A, at which point B's finalizer can be registered. As a result, we need to run GC as many times as the length of the longest such chain to have a somewhat reliable leak checker. Finally, a cyclical chain of structs pointing to one another will never be garbage collected if a finalizer is set. This is a well-known issue with Go finalizers (https://github.com/golang/go/issues/7358). Using leak checking on filesystem objects that produce cycles will not work and even result in memory leaks. The new leak checker stores reference counted objects in a global map when leak check is enabled and removes them once they are destroyed. At sandbox exit, any remaining objects in the map are considered as leaked. This provides a deterministic way of detecting leaks without relying on the complexities of finalizers and garbage collection. This approach has several benefits over the former, including: - Always detects leaks of objects that should be destroyed very close to sandbox exit. The old checker very rarely detected these leaks, because it relied on garbage collection to be run in a short window of time. - Panics if we forgot to enable leak check on a ref-counted object (we will try to remove it from the map when it is destroyed, but it will never have been added). - Can store extra logging information in the map values without adding to the size of the ref count struct itself. With the size of just an int64, the ref count object remains compact, meaning frequent operations like IncRef/DecRef are more cache-efficient. - Can aggregate leak results in a single report after the sandbox exits. Instead of having warnings littered in the log, which were non-deterministically triggered by garbage collection, we can print all warning messages at once. Note that this could also be a limitation--the sandbox must exit properly for leaks to be detected. Some basic benchmarking indicates that this change does not significantly affect performance when leak checking is enabled, which is understandable since registering/unregistering is only done once for each filesystem object. Updates #1486. PiperOrigin-RevId: 338685972