I've been experimenting with HTTP/3 (h3) support in Envoy Proxy. I now have both upstream and downstream working
Inspired by the paper referred to in
I set out to expand the number of h3 clients for testing
Theophilus A. Benson@dottheophilus
Interested to understand why (or rather when) QUIC (HTTP3) outperforms TCP (HTTP2)? Check out my new WWW paper w/ @triplewy1 on demystifying QUIC: cs.brown.edu/~tab/papers/QU…
Hint: Performance is a function of implementation and operational choices not necessarily the protocol.06:41 AM - 16 Feb 2021
Thanks to @howardjohn at Google for helping with the initial h3 config. Thanks to @triplewy1 for helping me sort out the correct parameters to pass to proxygen.
Big thanks to the Envoy team who have helped with configs, testing, ideas etc. In particular @alyssawilk, @danzh2010 + @mattklein123
Envoy
I'm building Envoy from source (main branch) on Linux with a limited set of extensions
Clients
I've ended up building and testing 7 h3 clients:
#. | Client | Language | Compilation |
---|---|---|---|
1. | curl/curl | C | cloudflare/quiche + BoringSSL |
2. | hyperium/h3 | Rust | musl static |
3. | proxygen/hq | C++ | |
4. | mozilla/neqo | Rust | NSS |
5. | istio/quic-go | Go | |
6. | cloudflare/quiche | Rust | musl static |
7. | quinn-rs/quinn | Rust | musl static |
Testing
hyperfine
I used the excellent hyperfine for testing. Please note that benchmarking is hard and this is in no way a proper benchmark. This is more for fun, learning how to build and use new h3 clients and working out how to configure h3 / QUIC for Envoy. Please take all results with a huge grain of salt
tl;dr - neqo generally slightly quicker followed closely by proxygen, quic-go + h3 (not always in that order). Then quinn, curl and cloudflare/quiche. I'm surprised by cloudflare/quiche being so slow however I believe it has not been optimized at this point
hyperfine --ignore-failure --warmup 5 --runs 100 \ | |
'~/rust/curl/src/curl --silent --alt-svc ~/.altsvc.cache https://x1.local:4433/get' \ | |
'~/rust/h3/target/x86_64-unknown-linux-musl/release/examples/client https://x1.local:4433/get' \ | |
'~/proxygen/proxygen/_build/proxygen/httpserver/hq -logdir=/tmp -mode=client -early_data=true -connect_udp=true -draft-version=29 -port=4433 -host=x1.local -path=/get' \ | |
'~/istio/pkg/test/echo/cmd/client/client --http3 https://x1.local:4433/get' \ | |
'~/rust/neqo/target/release/neqo-client --output-read-data --resume --alpn h3-29 https://x1.local:4433/get' \ | |
'~/rust/quiche/tools/apps/target/x86_64-unknown-linux-musl/release/quiche-client https://x1.local:4433/get' \ | |
'~/rust/quinn/target/x86_64-unknown-linux-musl/release/examples/h3_client https://x1.local:4433/get' | |
Benchmark #1: ~/rust/curl/src/curl --silent --alt-svc ~/.altsvc.cache https://x1.local:4433/get | |
Time (mean ± σ): 146.1 ms ± 55.3 ms [User: 23.9 ms, System: 17.1 ms] | |
Range (min … max): 105.0 ms … 525.4 ms 100 runs | |
Benchmark #2: ~/rust/h3/target/x86_64-unknown-linux-musl/release/examples/client https://x1.local:4433/get | |
Time (mean ± σ): 129.2 ms ± 52.0 ms [User: 9.4 ms, System: 7.0 ms] | |
Range (min … max): 100.4 ms … 402.4 ms 100 runs | |
Benchmark #3: ~/proxygen/proxygen/_build/proxygen/httpserver/hq -logdir=/tmp -mode=client -early_data=true -connect_udp=true -draft-version=29 -port=4433 -host=x1.local -path=/get | |
Time (mean ± σ): 142.0 ms ± 55.8 ms [User: 16.9 ms, System: 9.0 ms] | |
Range (min … max): 104.4 ms … 507.2 ms 100 runs | |
Benchmark #4: ~/istio/pkg/test/echo/cmd/client/client --http3 https://x1.local:4433/get | |
Time (mean ± σ): 133.2 ms ± 70.2 ms [User: 10.1 ms, System: 9.3 ms] | |
Range (min … max): 101.6 ms … 625.9 ms 100 runs | |
Benchmark #5: ~/rust/neqo/target/release/neqo-client --output-read-data --resume --alpn h3-29 https://x1.local:4433/get | |
Time (mean ± σ): 125.7 ms ± 38.0 ms [User: 9.2 ms, System: 4.6 ms] | |
Range (min … max): 99.5 ms … 376.6 ms 100 runs | |
Benchmark #6: ~/rust/quiche/tools/apps/target/x86_64-unknown-linux-musl/release/quiche-client https://x1.local:4433/get | |
Time (mean ± σ): 218.5 ms ± 91.1 ms [User: 9.1 ms, System: 4.0 ms] | |
Range (min … max): 121.5 ms … 738.7 ms 100 runs | |
Benchmark #7: ~/rust/quinn/target/x86_64-unknown-linux-musl/release/examples/h3_client https://x1.local:4433/get | |
Time (mean ± σ): 151.0 ms ± 78.2 ms [User: 9.4 ms, System: 7.7 ms] | |
Range (min … max): 100.7 ms … 558.9 ms 100 runs | |
Summary | |
'~/rust/neqo/target/release/neqo-client --output-read-data --resume --alpn h3-29 https://x1.local:4433/get' ran | |
1.03 ± 0.52 times faster than '~/rust/h3/target/x86_64-unknown-linux-musl/release/examples/client https://x1.local:4433/get' | |
1.06 ± 0.64 times faster than '~/istio/pkg/test/echo/cmd/client/client --http3 https://x1.local:4433/get' | |
1.13 ± 0.56 times faster than '~/proxygen/proxygen/_build/proxygen/httpserver/hq -logdir=/tmp -mode=client -early_data=true -connect_udp=true -draft-version=29 -port=4433 -host=x1.local -path=/get' | |
1.16 ± 0.56 times faster than '~/rust/curl/src/curl --silent --alt-svc ~/.altsvc.cache https://x1.local:4433/get' | |
1.20 ± 0.72 times faster than '~/rust/quinn/target/x86_64-unknown-linux-musl/release/examples/h3_client https://x1.local:4433/get' | |
1.74 ± 0.90 times faster than '~/rust/quiche/tools/apps/target/x86_64-unknown-linux-musl/release/quiche-client https://x1.local:4433/get' |
h3spec
I've also tested using the excellent h3spec. We found one crashing bug using this test suite which has subsequently been fixed
QUIC servers | |
MUST send FLOW_CONTROL_ERROR if a STREAM frame with a large offset is received [Transport 4.1] | |
MUST send TRANSPORT_PARAMETER_ERROR if initial_source_connection_id is missing [Transport 7.3] | |
MUST send TRANSPORT_PARAMETER_ERROR if original_destination_connection_id is received [Transport 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if preferred_address, is received [Transport 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if retry_source_connection_id is received [Transport 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if stateless_reset_token is received [Transport 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if max_udp_payload_size < 1200 [Transport 7.4 and 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if ack_delay_exponen > 20 [Transport 7.4 and 18.2] | |
MUST send TRANSPORT_PARAMETER_ERROR if max_ack_delay >= 2^14 [Transport 7.4 and 18.2] | |
MUST send FRAME_ENCODING_ERROR if a frame of unknown type is received [Transport 12.4] | |
MUST send PROTOCOL_VIOLATION on no frames [Transport 12.4] FAILED [1] | |
MUST send PROTOCOL_VIOLATION if reserved bits in Handshake are non-zero [Transport 17.2] FAILED [2] | |
MUST send PROTOCOL_VIOLATION if PATH_CHALLENGE in Handshake is received [Transport 17.2.4] | |
MUST send PROTOCOL_VIOLATION if reserved bits in Short are non-zero [Transport 17.2] FAILED [3] | |
MUST send STREAM_STATE_ERROR if RESET_STREAM is received for a send-only stream [Transport 19.4] | |
MUST send STREAM_STATE_ERROR if STOP_SENDING is received for a non-existing stream [Transport 19.5] | |
MUST send PROTOCOL_VIOLATION if NEW_TOKEN is received [Transport 19.7] | |
MUST send STREAM_STATE_ERROR if MAX_STREAM_DATA is received for a non-existing stream [Transport 19.10] | |
MUST send STREAM_STATE_ERROR if MAX_STREAM_DATA is received for a receive-only stream [Transport 19.10] | |
MUST send FRAME_ENCODING_ERROR if invalid MAX_STREAMS is received [Transport 19.11] | |
MUST send STREAM_LIMIT_ERROR or FRAME_ENCODING_ERROR if invalid STREAMS_BLOCKED is received [Transport 19.14] | |
MUST send FRAME_ENCODING_ERROR if NEW_CONNECTION_ID with invalid Retire_Prior_To is received [Transport 19.15] | |
MUST send FRAME_ENCODING_ERROR if NEW_CONNECTION_ID with 0-byte CID is received [Transport 19.15] FAILED [4] | |
MUST send PROTOCOL_VIOLATION if HANDSHAKE_DONE is received [Transport 19.20] | |
MUST send unexpected_message TLS alert if KeyUpdate in Handshake is received [TLS 6] | |
MUST send unexpected_message TLS alert if KeyUpdate in 1-RTT is received [TLS 6] FAILED [5] | |
MUST send no_application_protocol TLS alert if no application protocols are supported [TLS 8.1] | |
MUST send missing_extension TLS alert if the quic_transport_parameters extension does not included [TLS 8.2] FAILED [6] | |
MUST send unexpected_message TLS alert if EndOfEarlyData is received [TLS 8.3] | |
MUST send PROTOCOL_VIOLATION if CRYPTO in 0-RTT is received [TLS 8.3] FAILED [7] | |
HTTP/3 servers | |
MUST send H3_FRAME_UNEXPECTED if DATA is received before HEADERS [HTTP/3 4.1] | |
MUST send H3_MESSAGE_ERROR if a pseudo-header is duplicated [HTTP/3 4.1.1] FAILED [8] | |
MUST send H3_MESSAGE_ERROR if mandatory pseudo-header fields are absent [HTTP/3 4.1.3] FAILED [9] | |
MUST send H3_MESSAGE_ERROR if prohibited pseudo-header fields are present[HTTP/3 4.1.3] FAILED [10] | |
MUST send H3_MESSAGE_ERROR if pseudo-header fields exist after fields [HTTP/3 4.1.3] FAILED [11] | |
MUST send H3_MISSING_SETTINGS if the first control frame is not SETTINGS [HTTP/3 6.2.1] | |
MUST send H3_FRAME_UNEXPECTED if a DATA frame is received on a control stream [HTTP/3 7.2.1] | |
MUST send H3_FRAME_UNEXPECTED if a HEADERS frame is received on a control stream [HTTP/3 7.2.2] | |
MUST send H3_FRAME_UNEXPECTED if a second SETTINGS frame is received [HTTP/3 7.2.4] | |
MUST send H3_SETTINGS_ERROR if HTTP/2 settings are included [HTTP/3 7.2.4.1] | |
MUST send H3_FRAME_UNEXPECTED if CANCEL_PUSH is received in a request stream [HTTP/3 7.2.5] FAILED [12] | |
MUST send QPACK_DECOMPRESSION_FAILED if an invalid static table index exits in a field line representation [QPACK 3.1] | |
MUST send QPACK_ENCODER_STREAM_ERROR if a new dynamic table capacity value exceeds the limit [QPACK 4.1.3] | |
MUST send H3_CLOSED_CRITICAL_STREAM if a control stream is closed [QPACK 4.2] FAILED [13] | |
MUST send QPACK_DECODER_STREAM_ERROR if Insert Count Increment is 0 [QPACK 4.4.3] |
45 examples, 13 failures. This suite has been great for catching crashes but it should be noted the goal is not to attain 100% as there are a number of performance trade-offs to consider
Config
Downstream h3 with local direct responses + h2 upstream
The first config shows how to set up a TCP + UDP listener on the same port, testing JSON structed logging, an Envoy direct response on /local
, alt-svc
headers on h2 responses
I use CUE for all of these configs and these are exported YAML. The process is started on the Fish shell with:
<path to>/envoy --concurrency 1 --log-level debug --config-path (cue export downstream_httpbin_org.cue | psub)
Note that for h3 to worked today you'll need to set concurrency
to 1
admin: | |
access_log: | |
- name: envoy.access_loggers.file | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.access_loggers.file.v3.FileAccessLog | |
path: /tmp/admin_access.log | |
address: | |
socket_address: | |
address: ::0 | |
ipv4_compat: true | |
port_value: 9901 | |
protocol: TCP | |
layered_runtime: | |
layers: | |
- name: static-layer | |
static_layer: | |
envoy.http.headermap.lazy_map_min_size: 3 | |
envoy.reloadable_features.new_tcp_connection_pool: true | |
envoy.reloadable_features.prefer_quic_kernel_bpf_packet_routing: true | |
envoy.reloadable_features.remove_legacy_json: true | |
static_resources: | |
clusters: | |
- name: service_httpbin | |
connect_timeout: 2s | |
lb_policy: ROUND_ROBIN | |
load_assignment: | |
cluster_name: service_httpbin | |
endpoints: | |
- lb_endpoints: | |
- endpoint: | |
address: | |
socket_address: | |
address: httpbin.org | |
port_value: 443 | |
transport_socket: | |
name: envoy.transport_sockets.tls | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext | |
common_tls_context: | |
alpn_protocols: h2 | |
sni: httpbin.org | |
type: LOGICAL_DNS | |
typed_extension_protocol_options: | |
envoy.extensions.upstreams.http.v3.HttpProtocolOptions: | |
'@type': type.googleapis.com/envoy.extensions.upstreams.http.v3.HttpProtocolOptions | |
explicit_http_config: | |
http2_protocol_options: {} | |
listeners: | |
- name: listener_udp | |
address: | |
socket_address: | |
address: ::0 | |
ipv4_compat: true | |
port_value: 4433 | |
protocol: UDP | |
filter_chains: | |
- filter_chain_match: | |
transport_protocol: quic | |
transport_socket: | |
name: envoy.transport_sockets.quic | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.transport_sockets.quic.v3.QuicDownstreamTransport | |
downstream_tls_context: | |
common_tls_context: | |
alpn_protocols: h3 | |
tls_certificates: | |
certificate_chain: | |
filename: cert.pem | |
private_key: | |
filename: cert-key.pem | |
filters: | |
- name: envoy.filters.network.http_connection_manager | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager | |
access_log: | |
- name: envoy.access_loggers.file | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.access_loggers.file.v3.FileAccessLog | |
log_format: | |
json_format: | |
bytes_received: '%BYTES_RECEIVED%' | |
bytes_sent: '%BYTES_SENT%' | |
duration: '%DURATION%' | |
http_response: '%RESPONSE_CODE%' | |
protocol: '%PROTOCOL%' | |
request_authority: '%REQ(:AUTHORITY)%' | |
request_method: '%REQ(:METHOD)%' | |
request_path: '%REQ(X-ENVOY-ORIGINAL-PATH?:PATH)%' | |
response_flag: '%RESPONSE_FLAGS%' | |
start_time: '%START_TIME%' | |
upstream_host: '%UPSTREAM_HOST%' | |
upstream_service_time: '%RESP(X-ENVOY-UPSTREAM-SERVICE-TIME)%' | |
user_agent: '%REQ(USER-AGENT)%' | |
x_forwarded_for: '%REQ(X-FORWARDED-FOR)%' | |
x_request_id: '%REQ(X-REQUEST-ID)%' | |
path: http3_downstream.log | |
codec_type: HTTP3 | |
http_filters: | |
- name: envoy.filters.http.router | |
route_config: | |
name: local_route | |
max_direct_response_body_size_bytes: 428 | |
virtual_hosts: | |
- name: local_service | |
domains: | |
- '*' | |
routes: | |
- match: | |
prefix: /local | |
direct_response: | |
body: | |
inline_string: |- | |
<!DOCTYPE html> | |
<head><meta charset=utf-8><title>envoy_http3_downstream</title><link rel=icon href="" type=image/svg+xml sizes=any><style> body { font-size: 70px; }</style></head><body>direct_response: You found it!</body> | |
status: 200 | |
response_headers_to_add: | |
- header: | |
key: content-type | |
value: text/html;charset=utf-8 | |
- match: | |
prefix: / | |
route: | |
cluster: service_httpbin | |
host_rewrite_literal: httpbin.org | |
stat_prefix: ingress_h3 | |
reuse_port: true | |
udp_listener_config: | |
downstream_socket_config: | |
prefer_gro: true | |
quic_options: {} | |
- name: listener_tcp | |
address: | |
socket_address: | |
address: ::0 | |
ipv4_compat: true | |
port_value: 4433 | |
protocol: TCP | |
filter_chains: | |
- transport_socket: | |
name: envoy.transport_sockets.tls | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.DownstreamTlsContext | |
common_tls_context: | |
tls_certificates: | |
certificate_chain: | |
filename: cert.pem | |
private_key: | |
filename: cert-key.pem | |
alpn_protocols: h2 | |
filters: | |
- name: envoy.filters.network.http_connection_manager | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager | |
access_log: | |
- name: envoy.access_loggers.file | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.access_loggers.file.v3.FileAccessLog | |
path: http3_downstream.log | |
http_filters: | |
- name: envoy.filters.http.router | |
route_config: | |
name: local_route | |
max_direct_response_body_size_bytes: 428 | |
virtual_hosts: | |
- domains: | |
- '*' | |
name: local_service | |
response_headers_to_add: | |
- header: | |
key: alt-svc | |
value: h3=":4433"; ma=86400, h3-29=":4433"; ma=86400 | |
routes: | |
- match: | |
prefix: /local | |
direct_response: | |
body: | |
inline_string: |- | |
<!DOCTYPE html> | |
<head><meta charset=utf-8><title>envoy_http3_downstream</title><link rel=icon href="" type=image/svg+xml sizes=any><style> body { font-size: 70px; }</style></head><body>direct_response: You found it!</body> | |
status: 200 | |
response_headers_to_add: | |
- header: | |
key: content-type | |
value: text/html;charset=utf-8 | |
- match: | |
prefix: / | |
route: | |
cluster: service_httpbin | |
host_rewrite_literal: httpbin.org | |
stat_prefix: ingress_h2 |
Downstream h1 with h3 upstream
This is a simpler config with a stock h1 listener but talks h3 to the upstream service
admin: | |
access_log: | |
- name: envoy.access_loggers.file | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.access_loggers.file.v3.FileAccessLog | |
path: /tmp/admin_access.log | |
address: | |
socket_address: | |
address: ::0 | |
ipv4_compat: true | |
port_value: 9902 | |
protocol: TCP | |
layered_runtime: | |
layers: | |
- name: static-layer | |
static_layer: | |
envoy.http.headermap.lazy_map_min_size: 3 | |
envoy.reloadable_features.new_tcp_connection_pool: true | |
envoy.reloadable_features.prefer_quic_kernel_bpf_packet_routing: true | |
envoy.reloadable_features.remove_legacy_json: true | |
static_resources: | |
clusters: | |
- name: service_quic_tech | |
connect_timeout: 2s | |
lb_policy: ROUND_ROBIN | |
load_assignment: | |
cluster_name: service_quic_tech | |
endpoints: | |
- lb_endpoints: | |
- endpoint: | |
address: | |
socket_address: | |
address: quic.tech | |
port_value: 8443 | |
transport_socket: | |
name: envoy.transport_sockets.quic | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.transport_sockets.quic.v3.QuicUpstreamTransport | |
upstream_tls_context: | |
allow_renegotiation: true | |
common_tls_context: | |
alpn_protocols: h3-29 | |
sni: quic.tech | |
type: LOGICAL_DNS | |
typed_extension_protocol_options: | |
envoy.extensions.upstreams.http.v3.HttpProtocolOptions: | |
'@type': type.googleapis.com/envoy.extensions.upstreams.http.v3.HttpProtocolOptions | |
common_http_protocol_options: | |
idle_timeout: 1s | |
explicit_http_config: | |
http3_protocol_options: {} | |
upstream_http_protocol_options: | |
auto_sni: true | |
listeners: | |
- name: listener_0 | |
address: | |
socket_address: | |
address: ::0 | |
ipv4_compat: true | |
port_value: 10000 | |
protocol: TCP | |
filter_chains: | |
- filters: | |
- name: envoy.filters.network.http_connection_manager | |
typed_config: | |
'@type': type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager | |
stat_prefix: ingress_http | |
route_config: | |
name: local_route | |
virtual_hosts: | |
- name: local_service | |
domains: | |
- '*' | |
routes: | |
- match: | |
prefix: / | |
route: | |
host_rewrite_literal: quic.tech | |
cluster: service_quic_tech | |
http_filters: | |
- name: envoy.filters.http.router |
Future
- Would be fun to test this with things like dynamic forward proxies
- The testing above is done on an Envoy proxy with a the runtime value
envoy.reloadable_features.prefer_quic_kernel_bpf_packet_routing: true
set and Linux Capabilities ofsudo setcap cap_bpf+ep <path to>/envoy
on a kernel >= 5.8.x. However as per the following issue it is not sure what effect this hasDocs: clarify QUIC BPF operation #15845
moderation posted onTitle: Docs: clarify QUIC BPF operation
Description: Prompted by the Twitter chat at https://twitter.com/mattklein123/status/1378172039870091265 I looked into the runtime flag that allows QUIC routing in the kernal via BPF - https://github.com/envoyproxy/envoy/blob/main/source/common/runtime/runtime_features.cc#L82
layered_runtime: layers: - name: static-layer static_layer: envoy.reloadable_features.prefer_quic_kernel_bpf_packet_routing: true
Installing BPF rules like this requires one of:
- Envoy is running as root
- For kernels >= 5.8, Envoy is running with
sudo setcap cap_bpf+ep <envoy binary>
- For kernels < 5.8, Envoy is running with
sudo cap_net_admin,cap_sys_admin+ep <envoy binary>
From initial testing Envoy doesn't display any different output when launched in different modes. QUIC / h3 listeners work whether Envoy was launched with the elevated permissions or not.
It would be good to clarify in the docs what steps need to be taken to enable QUIC BPF kernel routing and what platforms work and don't work. It looks like this is Linux only at the moment. It might be worthwhile logging whether the BPF rule has been installed successfully - https://github.com/envoyproxy/envoy/blob/main/source/common/quic/active_quic_listener.cc#L234-L298
Relevant Links: Handy reference on determine what Linux Capabilities your system supports - https://linux-audit.com/linux-capabilities-101/
/cc @ggreenway @alyssawilk @danzh2010 @mattklein123
Updates
- quinn removed support for their h3 test client in https://github.com/quinn-rs/quinn/pull/1155
Top comments (1)
Interested in helping me write up the current state of this topic for a technical article in IEEE Cloud Continuum? Let me know if so and I'll send you some questions.