gvisor - Container Runtime Sandbox

Age	Commit message (Collapse)	Author
2020-07-27	Merge release-20200622.1-239-gca6bded95 (automated)	gVisor bot

2020-07-27	Fix memory accounting in TCP pending segment queue.	Bhasker Hariharan
	TCP now tracks the overhead of the segment structure itself in it's out-of-order queue (pending). This is required to ensure that a malicious sender sending 1 byte out-of-order segments cannot queue like 1000's of segments which bloat up memory usage. We also reduce the default receive window to 32KB. With TCP moderation there is no need to keep this window at 1MB which means that for new connections the default out-of-order queue will be small unless the application actually reads the data that is being sent. This prevents a sender from just maliciously filling up pending buf with lots of tiny out-of-order segments. PiperOrigin-RevId: 323450913
2020-07-24	Merge release-20200622.1-208-g82a5cada5 (automated)	gVisor bot

2020-07-23	Add AfterFunc to tcpip.Clock	Sam Balana
	Changes the API of tcpip.Clock to also provide a method for scheduling and rescheduling work after a specified duration. This change also implements the AfterFunc method for existing implementations of tcpip.Clock. This is the groundwork required to mock time within tests. All references to CancellableTimer has been replaced with the tcpip.Job interface, allowing for custom implementations of scheduling work. This is a BREAKING CHANGE for clients that implement their own tcpip.Clock or use tcpip.CancellableTimer. Migration plan: 1. Add AfterFunc(d, f) to tcpip.Clock 2. Replace references of tcpip.CancellableTimer with tcpip.Job 3. Replace calls to tcpip.CancellableTimer#StopLocked with tcpip.Job#Cancel 4. Replace calls to tcpip.CancellableTimer#Reset with tcpip.Job#Schedule 5. Replace calls to tcpip.NewCancellableTimer with tcpip.NewJob. PiperOrigin-RevId: 322906897
2020-07-23	Merge release-20200622.1-198-gfc26b3764 (automated)	gVisor bot

2020-07-23	Merge pull request #3207 from kevinGC:icmp-connect	gVisor bot
	PiperOrigin-RevId: 322853192
2020-07-23	Merge release-20200622.1-196-g20b556e62 (automated)	gVisor bot

2020-07-23	Fix wildcard bind for raw socket.	Bhasker Hariharan
	Fixes #3334 PiperOrigin-RevId: 322846384
2020-07-22	make connect(2) fail when dest is unreachable	Kevin Krakauer
	Previously, ICMP destination unreachable datagrams were ignored by TCP endpoints. This caused connect to hang when an intermediate router couldn't find a route to the host. This manifested as a Kokoro error when Docker IPv6 was enabled. The Ruby image test would try to install the sinatra gem and hang indefinitely attempting to use an IPv6 address. Fixes #3079.
2020-07-22	Merge release-20200622.1-184-g71bf90c55 (automated)	gVisor bot

2020-07-22	Support for receiving outbound packets in AF_PACKET.	Bhasker Hariharan
	Updates #173 PiperOrigin-RevId: 322665518
2020-07-17	Merge release-20200622.1-173-gdcf6ddc27 (automated)	gVisor bot

2020-07-16	Add support to return protocol in recvmsg for AF_PACKET.	Bhasker Hariharan
	Updates #173 PiperOrigin-RevId: 321690756
2020-07-15	Merge release-20200622.1-163-g857d03f25 (automated)	gVisor bot

2020-07-15	Add support for SO_ERROR to packet sockets.	Bhasker Hariharan
	Packet sockets also seem to allow double binding and do not return an error on linux. This was tested by running the syscall test in a linux namespace as root and the current test DoubleBind fails@HEAD. Passes after this change. Updates #173 PiperOrigin-RevId: 321445137
2020-07-13	Merge release-20200622.1-97-g43c209f48 (automated)	gVisor bot

2020-07-13	garbage collect connections	Kevin Krakauer
	As in Linux, we must periodically clean up unused connections. PiperOrigin-RevId: 321003353
2020-07-11	Merge release-20200622.1-90-g216dcebc0 (automated)	gVisor bot

2020-07-11	Stub out SO_DETACH_FILTER.	Bhasker Hariharan
	Updates #2746 PiperOrigin-RevId: 320757963
2020-07-10	Merge release-20200622.1-89-g5df3a8fed (automated)	gVisor bot

2020-07-09	Discard multicast UDP source address.	gVisor bot
	RFC-1122 (and others) specify that UDP should not receive datagrams that have a source address that is a multicast address. Packets should never be received FROM a multicast address. See also, RFC 768: 'User Datagram Protocol' J. Postel, ISI, 28 August 1980 A UDP datagram received with an invalid IP source address (e.g., a broadcast or multicast address) must be discarded by UDP or by the IP layer (see rfc 1122 Section 3.2.1.3). This CL does not address TCP or broadcast which is more complicated. Also adds a test for both ipv6 and ipv4 UDP. Fixes #3154 PiperOrigin-RevId: 320547674
2020-07-09	Merge release-20200622.1-88-g5946f1118 (automated)	gVisor bot

2020-07-09	Add support for IP_HDRINCL IP option for raw sockets.	Bhasker Hariharan
	Updates #2746 Fixes #3158 PiperOrigin-RevId: 320497190
2020-07-08	Avoid accidental zero-checksum	Tamir Duberstein
	PiperOrigin-RevId: 320250773
2020-07-07	Merge release-20200622.1-76-g76c7bc51b (automated)	gVisor bot

2020-07-07	Set IPv4 ID on all non-atomic datagrams	Tony Gong
	RFC 6864 imposes various restrictions on the uniqueness of the IPv4 Identification field for non-atomic datagrams, defined as an IP datagram that either can be fragmented (DF=0) or is already a fragment (MF=1 or positive fragment offset). In order to be compliant, the ID field is assigned for all non-atomic datagrams. Add a TCP unit test that induces retransmissions and checks that the IPv4 ID field is unique every time. Add basic handling of the IP_MTU_DISCOVER socket option so that the option can be used to disable PMTU discovery, effectively setting DF=0. Attempting to set the sockopt to anything other than disabled will fail because PMTU discovery is currently not implemented, and the default behavior matches that of disabled. PiperOrigin-RevId: 320081842
2020-07-07	Merge release-20200622.1-75-g7e4d2d63e (automated)	gVisor bot

2020-07-07	icmp: When setting TransportHeader, remove from the Data portion.	Ting-Yu Wang
	The current convention is when a header is set to pkt.XxxHeader field, it gets removed from pkt.Data. ICMP does not currently follow this convention. PiperOrigin-RevId: 320078606
2020-07-07	Merge release-20200622.1-69-gb0f656184 (automated)	gVisor bot

2020-07-06	Add support for SO_RCVBUF/SO_SNDBUF for AF_PACKET sockets.	Bhasker Hariharan
	Updates #2746 PiperOrigin-RevId: 319887810
2020-07-06	Shard some slow tests.	Ting-Yu Wang
	stack_x_test: 2m -> 20s tcp_x_test: 80s -> 25s PiperOrigin-RevId: 319828101
2020-07-06	Merge release-20200622.1-63-g043e5dddd (automated)	gVisor bot

2020-07-06	Remove dependency on pkg/binary	Tamir Duberstein
	PiperOrigin-RevId: 319770124
2020-07-05	Merge release-20200622.1-62-g0c1353866 (automated)	gVisor bot

2020-07-05	Add wakers synchronously	Tamir Duberstein
	Avoid a race where an arbitrary goroutine scheduling delay can cause the processor to miss events and hang indefinitely. Reduce allocations by storing processors by-value in the dispatcher, and by using a single WaitGroup rather than one per processor. PiperOrigin-RevId: 319665861
2020-07-01	Merge release-20200622.1-54-g31b27adf9 (automated)	gVisor bot

2020-07-01	TCP receive should block when in SYN-SENT state.	Mithun Iyer
	The application can choose to initiate a non-blocking connect and later block on a read, when the endpoint is still in SYN-SENT state. PiperOrigin-RevId: 319311016
2020-07-01	Merge release-20200622.1-47-gc9446f053 (automated)	gVisor bot

2020-06-30	Fix two bugs in TCP sender.	Bhasker Hariharan
	a) When GSO is in use we should not cap the segment to maxPayloadSize in sender.maybeSendSegment as the GSO logic will cap the segment to the correct size. Without this the host GSO is not used as we end up breaking up large segments into small MSS sized segments before writing the packets to the host. b) The check to not split a segment due to it not fitting in the receiver window when there are pending segments is incorrect as segments in writeList can be really large as we just take the write call's buffer size and create a single large segment. So a write of say 128KB will just be 1 segment in the writeList. The linux code checks if 1 MSS sized segments fits in the receiver's window and if not then does not split the current segment. gVisor's check was incorrect that it was checking if the whole segment which could be >>> 1 MSS would fit in the receiver's window. This was causing us to prematurely stop sending and falling back to retransmit timer/probe from the other end to send data. This was seen when running HTTPD benchmarks where @ HEAD when sending large files the benchmark was taking forever to run. The tcp_splitseg_mss_test.go is being deleted as the test as written doesn't test what is intended correctly. This is because GSO is enabled by default and the reason the MSS+1 sized segment is sent is because GSO is in use. A proper test will require disabling GSO on linux and netstack which is going to take a bit of work in packetimpact to do it correctly. Separately a new test probably should be written that verifies that a segment > availableWindow is not split if the availableWindow is < 1 MSS. Fixes #3107 PiperOrigin-RevId: 319172089
2020-06-30	Merge release-20200622.1-42-g4784ed46e (automated)	gVisor bot

2020-06-30	Avoid multiple atomic loads	Tamir Duberstein
	...by calling (tcp.endpoint).EndpointState only once when possible. Avoid wrapping (sleep.Waker).Assert in a useless func while I'm here. PiperOrigin-RevId: 319074149
2020-06-27	Merge release-20200622.1-34-g66d166544 (automated)	gVisor bot

2020-06-26	IPv6 raw sockets. Needed for ip6tables.	Kevin Krakauer
	IPv6 raw sockets never include the IPv6 header. PiperOrigin-RevId: 318582989
2020-06-27	Merge release-20200622.1-33-g8dbeac53c (automated)	gVisor bot

2020-06-26	Implement SO_NO_CHECK socket option.	gVisor bot
	SO_NO_CHECK is used to skip the UDP checksum generation on a TX socket (UDP checksum is optional on IPv4). Test: - TestNoChecksum - SoNoCheckOffByDefault (UdpSocketTest) - SoNoCheck (UdpSocketTest) Fixes #3055 PiperOrigin-RevId: 318575215
2020-06-24	Merge release-20200608.0-120-gb070e218c (automated)	gVisor bot

2020-06-24	Add support for Stack level options.	Bhasker Hariharan
	Linux controls socket send/receive buffers using a few sysctl variables - net.core.rmem_default - net.core.rmem_max - net.core.wmem_max - net.core.wmem_default - net.ipv4.tcp_rmem - net.ipv4.tcp_wmem The first 4 control the default socket buffer sizes for all sockets raw/packet/tcp/udp and also the maximum permitted socket buffer that can be specified in setsockopt(SOL_SOCKET, SO_(RCV\|SND)BUF,...). The last two control the TCP auto-tuning limits and override the default specified in rmem_default/wmem_default as well as the max limits. Netstack today only implements tcp_rmem/tcp_wmem and incorrectly uses it to limit the maximum size in setsockopt() as well as uses it for raw/udp sockets. This changelist introduces the other 4 and updates the udp/raw sockets to use the newly introduced variables. The values for min/max match the current tcp_rmem/wmem values and the default value buffers for UDP/RAW sockets is updated to match the linux value of 212KiB up from the really low current value of 32 KiB. Updates #3043 Fixes #3043 PiperOrigin-RevId: 318089805
2020-06-24	Merge release-20200608.0-119-g364ac92ba (automated)	gVisor bot

2020-06-24	Merge release-20200608.0-116-g2141013dc (automated)	gVisor bot

2020-06-23	Add support for SO_REUSEADDR to TCP sockets/endpoints.	Ian Gudger
	For TCP sockets, SO_REUSEADDR relaxes the rules for binding addresses. gVisor/netstack already supported a behavior similar to SO_REUSEADDR, but did not allow disabling it. This change brings the SO_REUSEADDR behavior closer to the behavior implemented by Linux and adds a new SO_REUSEADDR disabled behavior. Like Linux, SO_REUSEADDR is now disabled by default. PiperOrigin-RevId: 317984380