Currently the max size of each WAL entry is hard-coded as 10MB. If users
set a value > 10MB for the flag --max-request-bytes, then etcd may run
into a situation where it successfully processes a big request, but fails
to decode it when replaying the WAL file on startup.
On the other hand, we can't simply remove the limitation, because if a
WAL entry is somehow corrupted and its recByte is a huge value, then
etcd may run out of memory. So the solution is to restrict the max size
of each WAL entry to a dynamic value: the remaining size of the
WAL file.
Signed-off-by: Benjamin Wang <wachao@vmware.com>
There are situations where we don't wish to fsync but we do want to
write the data.
Typically this occurs in clusters where fsync latency (often the
result of firmware) transiently spikes. For Kubernetes clusters this
causes (many) leader elections, which have knock-on effects: the API
server transiently fails, causing other components to fail in turn.
By writing the data (buffered and asynchronously flushed, so in most
situations the write is fast) while avoiding the fsync, we no longer
trigger this situation and still opportunistically write out the data.
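A minimal sketch of that write path, assuming a hypothetical noFsync
option (walWriter, newWALWriter, and save are illustrative names, not
the actual etcd WAL API): the entry is buffered and flushed to the OS,
and the Sync call is skipped when fsync is disabled.

```go
package walsketch

import (
	"bufio"
	"os"
)

type walWriter struct {
	f       *os.File
	bw      *bufio.Writer
	noFsync bool // when true, skip the fsync after flushing the buffer
}

func newWALWriter(f *os.File, noFsync bool) *walWriter {
	return &walWriter{f: f, bw: bufio.NewWriter(f), noFsync: noFsync}
}

// save buffers the entry, flushes it to the kernel, and only calls
// Sync when fsync is enabled. Without the fsync the call usually
// returns quickly even when device/firmware fsync latency spikes;
// durability across power loss is deferred to the OS.
func (w *walWriter) save(entry []byte) error {
	if _, err := w.bw.Write(entry); err != nil {
		return err
	}
	if err := w.bw.Flush(); err != nil {
		return err
	}
	if w.noFsync {
		return nil
	}
	return w.f.Sync()
}
```

In this form the data still reaches the kernel page cache promptly; only
durability across a sudden power loss is traded away, which is the
trade-off discussed below.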
Anecdotally:
Because the fsync is missing, there is an argument that certain
types of failure events will cause data corruption or loss; in
testing this wasn't seen. If it were to occur, the expectation is that
the member can be re-added to the cluster or, worst case, restored from
a robust persisted snapshot.
The etcd members are deployed across isolated racks with different
power feeds. An instantaneous failure of all of them simultaneously
is unlikely.
Testing was usually of the form:
* create (Kubernetes) etcd write-churn by creating replicasets of
some 1000s of pods
* break/fail the leader
Failure testing included:
* hard node power-off events
* disk removal
* orderly reboots/shutdown
In all cases when the node recovered it was able to rejoin the
cluster and synchronize.