Mirroristas/etcd

mirror of https://github.com/etcd-io/etcd.git synced 2024-09-27 06:25:44 +00:00

Author	SHA1	Message	Date
wpedrak	e9779231ec	server: add 500ms retries to ReadIndex requests for l-reads It is second approach (with first being #12762) to solve #12680	2021-03-16 16:34:15 +01:00
wpedrak	4b21e38381	refactored l-read loop in v3_server.go	2021-03-16 11:03:45 +01:00
Piotr Tabor	1e7c1805d8	Unify logic of building raft-loggers for etcd. 1. We had the same code copied 3 times. 2. For no good reason the code was not reusing existing logger if this one is given.	2021-03-14 16:02:50 +01:00
Piotr Tabor	44bd22307e	Merge get_logger() & Logger() method.	2021-03-14 14:05:17 +01:00
Piotr Tabor	fd7fed1511	Move config (ServerConfig) out of etcdserver package. Motivation: - ServerConfig is part of 'embed' public API, while etcdserver is more 'internal' - EtcdServer is already too big and config is pretty wide-spread leaf if we were to split etcdserver (e.g. into pre & post-apply part).	2021-03-11 20:56:22 +01:00
Gyuho Lee	3ead91ca3e	Merge pull request #12739 from LeoYang90/optimization_watch_prevkv create event do not need prevkv range	2021-03-10 09:48:42 -08:00
Piotr Tabor	fb1d48e98e	Integration tests: Use BeforeTest(t) instead of defer AfterTest(). Thanks to this change, a single method BeforeTest(t) can handle before-test logic as well as registration of cleanup code (t.Cleanup(func)).	2021-03-09 18:19:51 +01:00
Gyuho Lee	94a371acd7	Merge pull request #12750 from ptabor/20210306-mlock --experimental-memory-mlock support	2021-03-09 09:13:40 -08:00
Gyuho Lee	6fd85af641	Merge pull request #12702 from hexfusion/add-so *: add support for socket options	2021-03-09 09:02:24 -08:00
Sam Batschelet	5b49fb41c8	fixup: add ListenerOptions Signed-off-by: Sam Batschelet <sbatsche@redhat.com>	2021-03-08 11:27:03 -05:00
Piotr Tabor	a46a358577	--experimental-memory-mlock support The flag protects etcd memory from being swapped out to disk. This can happen in memory constrained systems where mmaped bbolt area is natural condidate for swapping out. This flag should provide better tail latency on the cost of higher RSS ram usage. If the experiment is successful, the logic should get moved into bbolt layer, where we can protect specific bbolt instances (e.g. avoid protecting both during defragmentation).	2021-03-07 12:32:57 +01:00
leoyang.yl	d70f35f8d1	create event do not need prevkv range	2021-03-02 17:43:24 +08:00
Gyuho Lee	3d7aac948b	Merge pull request #12196 from ironcladlou/metrics-watch-error-fix etcdserver: fix incorrect metrics generated when clients cancel watches	2021-02-19 12:46:49 -08:00
Sam Batschelet	49078c683b	*: add support for socket options Signed-off-by: Sam Batschelet <sbatsche@redhat.com>	2021-02-19 13:31:23 -05:00
Maksim Buldukyan	7e38cfcc8d	raft: makes 'ConnReadTimeout/ConnWriteTimeout' customizable	2021-02-10 10:36:50 +07:00
Chao Chen	2ae3e82f07	etcdserver/api/etcdhttp: log successful etcd server side health check in debug level When we have an external component that checks /health periodically, the etcd server logs can be quite verbose (e.g., DDOS-ing against insure etcd health check can lead to disk space full due to large log files). This change was introduced in #11704. While we keep the warning logs for etcd health check failures, the success (or OK) log level should be set to DEBUG. Fixes #12676	2021-02-08 17:15:43 -08:00
Yanhao Mo	6d82778a4e	etcdserver: export method EtcdServer.leaderChangedNotify (#12378 )	2021-02-02 18:13:32 +08:00
Piotr Tabor	d6d03beaea	Merge pull request #12538 from lzhfromustc/12_9_GoroutineLeak test: change channel operations to avoid potential goroutine leaks	2021-02-01 21:16:43 +01:00
Sahdev Zala	69e99e80fa	Merge pull request #12465 from spacewander/fdoc chore: update the documentation link in the comment	2021-01-14 00:39:25 -05:00
Jingyi Hu	bfc6e2ff30	Merge pull request #12611 from ptabor/20210111-fix-flakes e2e tests flakes & leaks fixes: In particular TestIssue6361	2021-01-12 21:26:54 +08:00
Piotr Tabor	0d9cfc11c8	Fix usage of reflect.SliceHeader: reported by vet on tip golang Example: https://travis-ci.com/github/etcd-io/etcd/jobs/470404938 ``` % (cd server && go vet ./...) stderr: # go.etcd.io/etcd/server/v3/etcdserver/api/v2store stderr: etcdserver/api/v2store/node_extern_test.go:107:9: possible misuse of reflect.SliceHeader stderr: etcdserver/api/v2store/node_extern_test.go:107:16: possible misuse of reflect.SliceHeader ```	2021-01-12 00:14:51 +01:00
Piotr Tabor	74274f4417	e2e: Adding better diagnostic and location for temporary files to Snapshot tests.	2021-01-12 00:14:51 +01:00
Piotr Tabor	23340bb62a	Refresh proto generation script after moving modules files. With modulatiozation server protos get moved into ./server directory, but it was not reflected in scripts/genproto.sh.	2021-01-08 16:33:12 +01:00
lzhfromustc	f2a912a4e6	test: change channel operations to avoid potential goroutine leaks In these unit tests, goroutines may leak if certain branches are chosen. This commit edits channel operations and buffer sizes, so no matter what branch is chosen, the test will end correctly. This commit doesn't change the semantics of unit tests.	2020-12-09 22:23:21 -05:00
Dan Mace	9571325fe8	etcdserver: fix incorrect metrics generated when clients cancel watches Before this patch, a client which cancels the context for a watch results in the server generating a `rpctypes.ErrGRPCNoLeader` error that leads the recording of a gRPC `Unavailable` metric in association with the client watch cancellation. The metric looks like this: grpc_server_handled_total{grpc_code="Unavailable",grpc_method="Watch",grpc_service="etcdserverpb.Watch",grpc_type="bidi_stream"} So, the watch server has misidentified the error as a server error and then propagates the mistake to metrics, leading to a false indicator that the leader has been lost. This false signal then leads to false alerting. The commit 9c103dd0dedfc723cd4f33b6a5e81343d8a6bae7 introduced an interceptor which wraps watch streams requiring a leader, causing those streams to be actively canceled when leader loss is detected. However, the error handling code assumes all stream context cancellations are from the interceptor. This assumption is broken when the context was canceled because of a client stream cancelation. The core challenge is lack of information conveyed via `context.Context` which is shared by both the send and receive sides of the stream handling and is subject to cancellation by all paths (including the gRPC library itself). If any piece of the system cancels the shared context, there's no way for a context consumer to understand who cancelled the context or why. To solve the ambiguity of the stream interceptor code specifically, this patch introduces a custom context struct which the interceptor uses to expose a custom error through the context when the interceptor decides to actively cancel a stream. Now the consuming side can more safely assume a generic context cancellation can be propagated as a cancellation, and the server generated leader error is preserved and propagated normally without any special inference. When a client cancels the stream, there remains a race in the error handling code between the send and receive goroutines whereby the underlying gRPC error is lost in the case where the send path returns and is handled first, but this issue can be taken separately as no matter which paths wins, we can detect a generic cancellation. This is a replacement of https://github.com/etcd-io/etcd/pull/11375. Fixes #10289, #9725, #9576, #9166	2020-11-18 17:02:09 -05:00
Ankur Gargi	c1c681adc3	server: Added config parameter experimental-warning-apply-duration	2020-11-17 17:33:19 -05:00
Gyuho Lee	dc586a5ad2	Merge pull request #12459 from jingyih/proper_request_cancellation server: proper cancellation for range request	2020-11-13 12:20:41 -08:00
spacewander	67f040f921	Update other Documentation/v2 links	2020-11-11 09:57:01 +08:00
spacewander	f2eb15a81b	chore: update the documentation link in the comment Close #12462.	2020-11-11 09:53:18 +08:00
Maciej Borsz	0bea7df7c1	Add metric tracking apply method duration: * etcd_server_apply_duration_seconds It can be used to understand which operations are slow, in addition to the warning log message.	2020-11-06 11:11:16 +01:00
jingyih	0558e379c3	server: proper request cancellation for range	2020-11-05 21:30:02 -08:00
Piotr Tabor	6e800b9b01	20201103 no commit title check (#12447 ) * Turn off checking of format of commit message. * scripts/fix.sh: Fix fixing whitespaces in .sh scripts Aparently there is a difference between: find ./ -print0 -name .sh and find ./ -name .sh -print0 etcdserver unit tests: Do not call .Fatalf(...) from not test's goroutine. Fixes following test failures: https://travis-ci.com/github/etcd-io/etcd/jobs/425920416 ``` % (cd server && go vet ./...) stderr: # go.etcd.io/etcd/server/v3/etcdserver stderr: etcdserver/server_test.go:1002:4: call to (T).Fatalf from a non-test goroutine stderr: etcdserver/server_test.go:1166:4: call to (T).Fatalf from a non-test goroutine FAIL: (code:2): % (cd server && go vet ./...) FAIL: 'run go vet ./...' checking failed (!=0 return code) FAIL: 'govet' failed at Tue Nov 3 04:07:47 UTC 2020 ```	2020-11-03 07:59:42 -08:00
yangweiwei	aa1024a16e	etcdserver: updated cluster version during cluster version update in etcd cluster, the log should info from XX to XX.	2020-10-27 16:32:40 +08:00
Piotr Tabor	aaf423e962	server: Update imports. find -name '*.go' \| xargs sed -i --follow-symlinks 's\|etcd/v3/\|etcd/server/v3/\|g'	2020-10-26 13:02:32 +01:00
Piotr Tabor	4a5e9d1261	server: Move server files to 'server' directory. 26 git mv mvcc wal auth etcdserver etcdmain proxy embed/ lease/ server 36 git mv go.mod go.sum server	2020-10-26 12:57:19 +01:00

35 Commits