move proposal to doc

hongliangl · Jan 24, 2020 · 5124dbd · 5124dbd
1 parent a2481ef
commit 5124dbd
Show file tree

Hide file tree

Showing 6 changed files with 83 additions and 77 deletions.
diff --git a/doc/proposal.md b/doc/proposal.md
@@ -0,0 +1,75 @@
+## Docker image
+
+```
+docker pull mcluseau/kube-proxy2
+```
+
+## Draft of goals
+
+```
+[k8s API] ----> [local model] ----> [plugin] ----> [aggregator] ----> [subsystem]
+```
+
+* build an intermediate model specific to represent the expected node-local state (ENLS)
+* digest API server's changes to the ENLS
+  * trigger events only when the ENLS changes (not on every API event)
+  * naturally decouples and factorizes proxy-specific logic
+* ENLS should be serializable in protobuf, and may be available through an API (internal?)
+  * should ease debugging
+  * should ease independant updates on each side of this API (k8s semantics before ENLS, subsystems after)
+* rework current proxy modes as "plugins"
+* define APIs or something to aggregate publications to subsystems like ipvs or iptables from multiple vendors
+  * aggregating allows to reduce the syscall load and to factorize update logic (ie: rate limits, partial updates...)
+  * ie: calico iptables rules could be pushed to the proxy, eliminating lock contention
+* have a framework approach to ease implementations of more autonomous "kube-proxies" (iptables or ipvs only, nftables, eBPF...)
+* hard to do in the current code base, so starting out-of-tree
+  * may be moved to kubernetes/staging later if preferred.
+
+## Example of event frequency reduction made by diff'ing the ENLS
+
+* rev = real changes that would be propagated to listeners
+* eventCount = number of API events received
+
+Collected using `time timeout 30m ./kube-proxy2 --kubeconfig $KUBECONFIG |tee test-run.XXX.log`
+
+On an empty KinD cluster:
+```
+stats:	time	events	rev	sys cpu	usr cpu	tot cpu	mem	revs/events
+stats:	ms	count	count	ms	ms	%	MiB	%
+stats:	0	0	0	0	0	+Inf	1.13	NaN
+stats:	1000	9	2	3	37	4.058	0.83	22.222
+stats:	2000	10	2	1	0	0.103	0.83	20.000
+stats:	3000	12	2	1	0	0.178	0.84	16.667
+[...]
+stats:	298000	510	2	0	1	0.131	0.93	0.392
+stats:	299000	512	2	0	1	0.188	0.94	0.391
+stats:	300000	513	2	0	0	0.112	0.94	0.390
+stats:	301000	522	2	0	1	0.239	0.95	0.383 # same point as below (for comparison)
+stats:	302000	523	2	0	0	0.105	0.96	0.382
+[...]
+stats:	1798000	3078	2	0	2	0.264	1.18	0.065
+stats:	1799000	3078	2	0	0	0.037	1.18	0.065
+```
+
+![graph](test-run.kind-empty.svg)
+
+On a bigger cluster (1k services, 1.5k pods):
+```
+stats:	time	events	rev	usr cpu	sys cpu	tot cpu	mem	revs/events
+stats:	ms	count	count	ms	ms	%	MiB	%
+stats:	0	0	0	0	0	+Inf	1.22	NaN
+stats:	1000	2134	1064	3	308	31.280	3.93	49.859
+stats:	2000	2137	1064	0	1	0.131	3.94	49.789
+stats:	3000	2138	1064	0	0	0.050	3.94	49.766
+[...]
+stats:	298000	21785	1067	0	0	0.050	4.16	4.898
+stats:	299000	21787	1067	0	0	0.089	4.16	4.897
+stats:	300000	21788	1067	0	0	0.061	4.17	4.897
+stats:	301000	23925	1068	4	64	6.882	3.35	4.464 # GC after 2k+ events (without a proxy-related change)
+stats:	302000	23926	1068	0	0	0.052	3.35	4.464
+[...]
+stats:	1798000	130941	1218	0	0	0.090	4.68	0.930
+stats:	1799000	130942	1218	0	0	0.055	4.69	0.930
+```
+
+![graph](test-run.1k-svc.svg)
diff --git a/test-run.1k-svc.svg → doc/test-run.1k-svc.svg b/test-run.1k-svc.svg → doc/test-run.1k-svc.svg
diff --git a/test-run.kind-empty.svg → doc/test-run.kind-empty.svg b/test-run.kind-empty.svg → doc/test-run.kind-empty.svg
diff --git a/main.go b/main.go
@@ -23,7 +23,7 @@ func main() {
 	proxy.InitFlags(flag.CommandLine)
 
 	cmd := cobra.Command{
-		Use: "proxy2",
+		Use: "proxy",
 		Run: run,
 	}
 

diff --git a/plot.sh b/plot.sh
@@ -1,2 +1,7 @@
 #! /bin/sh
-gnuplot -e "output='${1%.*}.svg'" -e "input='$1'" cpu-mem.gnuplot
+
+input="${1}"
+output="${2:-${1%.*}.svg}"
+
+set -x
+gnuplot -e "output='${output}'" -e "input='${input}'" cpu-mem.gnuplot
diff --git a/proposal.md b/proposal.md
@@ -1,75 +1 @@
-## Docker image
-
-```
-docker pull mcluseau/kube-proxy2
-```
-
-## Draft of goals
-
-```
-[k8s API] ----> [local model] ----> [plugin] ----> [aggregator] ----> [subsystem]
-```
-
-* build an intermediate model specific to represent the expected node-local state (ENLS)
-* digest API server's changes to the ENLS
-  * trigger events only when the ENLS changes (not on every API event)
-  * naturally decouples and factorizes proxy-specific logic
-* ENLS should be serializable in protobuf, and may be available through an API (internal?)
-  * should ease debugging
-  * should ease independant updates on each side of this API (k8s semantics before ENLS, subsystems after)
-* rework current proxy modes as "plugins"
-* define APIs or something to aggregate publications to subsystems like ipvs or iptables from multiple vendors
-  * aggregating allows to reduce the syscall load and to factorize update logic (ie: rate limits, partial updates...)
-  * ie: calico iptables rules could be pushed to the proxy, eliminating lock contention
-* have a framework approach to ease implementations of more autonomous "kube-proxies" (iptables or ipvs only, nftables, eBPF...)
-* hard to do in the current code base, so starting out-of-tree
-  * may be moved to kubernetes/staging later if preferred.
-
-## Example of event frequency reduction made by diff'ing the ENLS
-
-* rev = real changes that would be propagated to listeners
-* eventCount = number of API events received
-
-Collected using `time timeout 30m ./kube-proxy2 --kubeconfig $KUBECONFIG |tee test-run.XXX.log`
-
-On an empty KinD cluster:
-```
-stats:	time	events	rev	sys cpu	usr cpu	tot cpu	mem	revs/events
-stats:	ms	count	count	ms	ms	%	MiB	%
-stats:	0	0	0	0	0	+Inf	1.13	NaN
-stats:	1000	9	2	3	37	4.058	0.83	22.222
-stats:	2000	10	2	1	0	0.103	0.83	20.000
-stats:	3000	12	2	1	0	0.178	0.84	16.667
-[...]
-stats:	298000	510	2	0	1	0.131	0.93	0.392
-stats:	299000	512	2	0	1	0.188	0.94	0.391
-stats:	300000	513	2	0	0	0.112	0.94	0.390
-stats:	301000	522	2	0	1	0.239	0.95	0.383 # same point as below (for comparison)
-stats:	302000	523	2	0	0	0.105	0.96	0.382
-[...]
-stats:	1798000	3078	2	0	2	0.264	1.18	0.065
-stats:	1799000	3078	2	0	0	0.037	1.18	0.065
-```
-
-![graph](test-run.kind-empty.svg)
-
-On a bigger cluster (1k services, 1.5k pods):
-```
-stats:	time	events	rev	usr cpu	sys cpu	tot cpu	mem	revs/events
-stats:	ms	count	count	ms	ms	%	MiB	%
-stats:	0	0	0	0	0	+Inf	1.22	NaN
-stats:	1000	2134	1064	3	308	31.280	3.93	49.859
-stats:	2000	2137	1064	0	1	0.131	3.94	49.789
-stats:	3000	2138	1064	0	0	0.050	3.94	49.766
-[...]
-stats:	298000	21785	1067	0	0	0.050	4.16	4.898
-stats:	299000	21787	1067	0	0	0.089	4.16	4.897
-stats:	300000	21788	1067	0	0	0.061	4.17	4.897
-stats:	301000	23925	1068	4	64	6.882	3.35	4.464 # GC after 2k+ events (without a proxy-related change)
-stats:	302000	23926	1068	0	0	0.052	3.35	4.464
-[...]
-stats:	1798000	130941	1218	0	0	0.090	4.68	0.930
-stats:	1799000	130942	1218	0	0	0.055	4.69	0.930
-```
-
-![graph](test-run.1k-svc.svg)
+[Moved](doc/proposal.md)