Mirroristas/etcd

mirror of https://github.com/etcd-io/etcd.git synced 2024-09-27 06:25:44 +00:00

Author	SHA1	Message	Date
Wei Fu	4db8df677c	feature: add new compactor based revision count What would you like to be added? Add new compactor based revision count, instead of fixed interval time. In order to make it happen, the mvcc store needs to export `CompactNotify` function to notify the compactor that configured number of write transactions have occured since previsious compaction. The new compactor can get the revision change and delete out-of-date data in time, instead of waiting with fixed interval time. The underly bbolt db can reuse the free pages as soon as possible. Why is this needed? In the kubernetes cluster, for instance, argo workflow, there will be batch requests to create pods , and then there are also a lot of pod status's PATCH requests, especially when the pod has more than 3 containers. If the burst requests increase the db size in short time, it will be easy to exceed the max quota size. And then the cluster admin get involved to defrag, which may casue long downtime. So, we hope the ETCD can delete the out-of-date data as soon as possible and slow down the grow of total db size. Currently, both revision and periodic are based on time. It's not easy to use fixed interval time to face the unexpected burst update requests. The new compactor based on revision count can make the admin life easier. For instance, let's say that average of object size is 50 KiB. The new compactor will compact based on 10,000 revisions. It's like that ETCD can compact after new 500 MiB data in, no matter how long ETCD takes to get new 10,000 revisions. It can handle the burst update requests well. There are some test results: * Fixed value size: 10 KiB, Update Rate: 100/s, Total key space: 3,000 ``` enchmark put --rate=100 --total=300000 --compact-interval=0 \ --key-space-size=3000 --key-size=256 --val-size=10240 ``` \| Compactor \| DB Total Size \| DB InUse Size \| \| -- \| -- \| -- \| \| Revision(5min,retension:10000) \| 570 MiB \| 208 MiB \| \| Periodic(1m) \| 232 MiB \| 165 MiB \| \| Periodic(30s) \| 151 MiB \| 127 MiB \| \| NewRevision(retension:10000) \| 195 MiB \| 187 MiB \| * Random value size: [9 KiB, 11 KiB], Update Rate: 150/s, Total key space: 3,000 ``` bnchmark put --rate=150 --total=300000 --compact-interval=0 \ --key-space-size=3000 --key-size=256 --val-size=10240 \ --delta-val-size=1024 ``` \| Compactor \| DB Total Size \| DB InUse Size \| \| -- \| -- \| -- \| \| Revision(5min,retension:10000) \| 718 MiB \| 554 MiB \| \| Periodic(1m) \| 297 MiB \| 246 MiB \| \| Periodic(30s) \| 185 MiB \| 146 MiB \| \| NewRevision(retension:10000) \| 186 MiB \| 178 MiB \| * Random value size: [6 KiB, 14 KiB], Update Rate: 200/s, Total key space: 3,000 ``` bnchmark put --rate=200 --total=300000 --compact-interval=0 \ --key-space-size=3000 --key-size=256 --val-size=10240 \ --delta-val-size=4096 ``` \| Compactor \| DB Total Size \| DB InUse Size \| \| -- \| -- \| -- \| \| Revision(5min,retension:10000) \| 874 MiB \| 221 MiB \| \| Periodic(1m) \| 357 MiB \| 260 MiB \| \| Periodic(30s) \| 215 MiB \| 151 MiB \| \| NewRevision(retension:10000) \| 182 MiB \| 176 MiB \| For the burst requests, we needs to use short periodic interval. Otherwise, the total size will be large. I think the new compactor can handle it well. Additional Change: Currently, the quota system only checks DB total size. However, there could be a lot of free pages which can be reused to upcoming requests. Based on this proposal, I also want to extend current quota system with DB's InUse size. If the InUse size is less than max quota size, we should allow requests to update. Since the bbolt might be resized if there is no available continuous pages, we should setup a hard limit for the overflow, like 1 GiB. ```diff // Quota represents an arbitrary quota against arbitrary requests. Each request @@ -130,7 +134,17 @@ func (b *BackendQuota) Available(v interface{}) bool { return true } // TODO: maybe optimize Backend.Size() - return b.be.Size()+int64(cost) < b.maxBackendBytes + + // Since the compact comes with allocatable pages, we should check the + // SizeInUse first. If there is no continuous pages for key/value and + // the boltdb continues to resize, it should not increase more than 1 + // GiB. It's hard limitation. + // + // TODO: It should be enabled by flag. + if b.be.Size()+int64(cost)-b.maxBackendBytes >= maxAllowedOverflowBytes(b.maxBackendBytes) { + return false + } + return b.be.SizeInUse()+int64(cost) < b.maxBackendBytes } ``` And it's likely to disable NOSPACE alarm if the compact can get much more free pages. It can reduce downtime. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2023-08-16 23:35:08 +08:00
caojiamingalan	b9e30bf878	etcdserver: add e2e test to reproduce the incorrect hash issue when resuming scheduled compaction. check ScheduledCompactKeyName and FinishedCompactKeyName before writing hash to hashstore. If they do not match, then it means this compaction has once been interrupted and its hash value is invalid. In such cases, we won't write the hash values to the hashstore, and avoids the incorrect corruption alarm. Signed-off-by: caojiamingalan <alan.c.19971111@gmail.com>	2023-06-07 19:54:09 -05:00
Chao Chen	9e1e378e9e	update code comments Signed-off-by: Chao Chen <chaochn@amazon.com>	2023-05-24 12:54:27 -07:00
Benjamin Wang	19ec574f45	Merge pull request #15915 from hexfusion/bytes server/storage/schema: prefer equal to compare for equality comparisons	2023-05-18 10:53:42 +08:00
Sam Batschelet	a708e94749	server/storage/schema: prefer equal to compare for equality comparisons Signed-off-by: Sam Batschelet <sbatschelet@gmail.com>	2023-05-16 21:25:34 -04:00
qsyqian	0cd1b7beaa	log time when open backend db took too long Signed-off-by: qsyqian <qsyqian@gmail.com>	2023-05-08 15:23:29 +08:00
Peter Wortmann	74feb229c7	etcdserver: Guarantee order of requested progress notifications Progress notifications requested using ProgressRequest were sent directly using the ctrlStream, which means that they could race against watch responses in the watchStream. This would especially happen when the stream was not synced - e.g. if you requested a progress notification on a freshly created unsynced watcher, the notification would typically arrive indicating a revision for which not all watch responses had been sent. This changes the behaviour so that v3rpc always goes through the watch stream, using a new RequestProgressAll function that closely matches the behaviour of the v3rpc code - i.e. 1. Generate a message with WatchId -1, indicating the revision for all watchers in the stream 2. Guarantee that a response is (eventually) sent The latter might require us to defer the response until all watchers are synced, which is likely as it should be. Note that we do not guarantee that the number of progress notifications matches the number of requests, only that eventually at least one gets sent. Signed-off-by: Peter Wortmann <peter.wortmann@skao.int>	2023-04-05 11:54:10 +01:00
Marek Siarkowicz	7052d8998a	server: Test watch restore Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>	2023-03-20 11:15:41 +01:00
Bogdan Kanivets	a690707c57	mvcc: update minRev when watcher stays synced Problem: during restore in watchableStore.Restore, synced watchers are moved to unsynced. minRev will be behind since it's not updated when watcher stays synced. Solution: update minRev fixes: https://github.com/etcd-io/etcd/issues/15271 Signed-off-by: Bogdan Kanivets <bkanivets@apple.com> Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>	2023-03-20 11:11:48 +01:00
Wei Fu	eb09e00541	chore: refactor cleanup fn in mvcc test The tmp path is cleanup by go testing so that the `cleanup` doesn't need to call `os.Remove`. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2023-03-19 21:46:03 +08:00
Wei Fu	830d9e9eaa	test: fix TestRestoreContinueUnfinishedCompaction The original testcase uses `return` statement which skips `restore` case. It's aimed to enable `restore` testcase. Signed-off-by: Wei Fu <fuweid89@gmail.com>	2023-03-18 10:58:05 +08:00
Wei Fu	d200f72d2d	test: should not leak goroutines after test finished The original flaky test shows in CI pipeline[1], but gotestsum run into a golang issue[2]. The error message is not clear from summary, like ``` {"Time":"2023-03-02T09:19:38.754394861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1433 +0x7e4\n"} {"Time":"2023-03-02T09:19:38.754414561Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/runtime/panic.go:476 +0x32\n"} {"Time":"2023-03-02T09:19:38.754430561Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1493 +0x47\n"} {"Time":"2023-03-02T09:19:38.754482561Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:883 +0xc4\n"} {"Time":"2023-03-02T09:19:38.754497661Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:876 +0xa4\n"} {"Time":"2023-03-02T09:19:38.754512161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:927 +0x6a\n"} {"Time":"2023-03-02T09:19:38.754567661Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.uber.org/zap/zaptest.testingWriter.Write()\n"} {"Time":"2023-03-02T09:19:38.754571261Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/go/pkg/mod/go.uber.org/zap@v1.24.0/zaptest/logger.go:130 +0x12c\n"} {"Time":"2023-03-02T09:19:38.754582861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.uber.org/zap/zaptest.(testingWriter).Write()\n"} {"Time":"2023-03-02T09:19:38.754597761Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.uber.org/zap/zapcore.(ioCore).Write()\n"} {"Time":"2023-03-02T09:19:38.754600961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/go/pkg/mod/go.uber.org/zap@v1.24.0/zapcore/core.go:99 +0x199\n"} {"Time":"2023-03-02T09:19:38.754612761Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.uber.org/zap/zapcore.(CheckedEntry).Write()\n"} {"Time":"2023-03-02T09:19:38.754618561Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/go/pkg/mod/go.uber.org/zap@v1.24.0/zapcore/entry.go:255 +0x2ce\n"} {"Time":"2023-03-02T09:19:38.754630161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.uber.org/zap.(Logger).Info()\n"} {"Time":"2023-03-02T09:19:38.754633261Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/go/pkg/mod/go.uber.org/zap@v1.24.0/logger.go:220 +0x6a\n"} {"Time":"2023-03-02T09:19:38.754644861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/server/v3/storage/mvcc.(treeIndex).Compact()\n"} {"Time":"2023-03-02T09:19:38.754648461Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/server/storage/mvcc/index.go:194 +0x144\n"} {"Time":"2023-03-02T09:19:38.754664961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/server/v3/storage/mvcc.(store).scheduleCompaction()\n"} {"Time":"2023-03-02T09:19:38.754670161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/server/storage/mvcc/kvstore_compaction.go:29 +0xbb\n"} {"Time":"2023-03-02T09:19:38.754681861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/server/v3/storage/mvcc.(store).compact.func1()\n"} {"Time":"2023-03-02T09:19:38.754690561Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/server/storage/mvcc/kvstore.go:235 +0x9e\n"} {"Time":"2023-03-02T09:19:38.754720061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.job.Do()\n"} {"Time":"2023-03-02T09:19:38.754724161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/pkg/schedule/schedule.go:41 +0x70\n"} {"Time":"2023-03-02T09:19:38.754736161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.(job).Do()\n"} {"Time":"2023-03-02T09:19:38.754750961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.(fifo).executeJob()\n"} {"Time":"2023-03-02T09:19:38.754754161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/pkg/schedule/schedule.go:206 +0x101\n"} {"Time":"2023-03-02T09:19:38.754765861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.(fifo).run()\n"} {"Time":"2023-03-02T09:19:38.754769061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/pkg/schedule/schedule.go:187 +0x1a5\n"} {"Time":"2023-03-02T09:19:38.754780461Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.NewFIFOScheduler.func1()\n"} {"Time":"2023-03-02T09:19:38.754783661Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/pkg/schedule/schedule.go:101 +0x39\n"} {"Time":"2023-03-02T09:19:38.754824061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1493 +0x75d\n"} FAIL: (code:1): % (cd server && 'env' 'ETCD_VERIFY=all' 'go' 'test' '-v' '-json' '-short' '-timeout=3m' '--race=true' '--cpu=4' './...' '-p=2') {"Time":"2023-03-02T09:19:38.754838961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1846 +0x99\n"} {"Time":"2023-03-02T09:19:38.754854961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1446 +0x216\n"} {"Time":"2023-03-02T09:19:38.754893461Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1844 +0x7ec\n"} {"Time":"2023-03-02T09:19:38.754908961Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1726 +0xa84\n"} {"Time":"2023-03-02T09:19:38.754957861Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/pkg/v3/schedule.NewFIFOScheduler()\n"} {"Time":"2023-03-02T09:19:38.754961061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/pkg/schedule/schedule.go:101 +0x3b6\n"} {"Time":"2023-03-02T09:19:38.754976161Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/server/v3/storage/mvcc.NewStore()\n"} {"Time":"2023-03-02T09:19:38.754979361Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/server/storage/mvcc/kvstore.go:111 +0x331\n"} {"Time":"2023-03-02T09:19:38.754991061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" go.etcd.io/etcd/server/v3/storage/mvcc.TestHashByRevValue()\n"} {"Time":"2023-03-02T09:19:38.754994261Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /home/runner/work/etcd/etcd/server/storage/mvcc/hash_test.go:36 +0xa4\n"} {"Time":"2023-03-02T09:19:38.755010061Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1446 +0x216\n"} {"Time":"2023-03-02T09:19:38.755024461Z","Action":"output","Package":"go.etcd.io/etcd/server/v3/storage/mvcc","Test":"TestHashByRevValue","Output":" /opt/hostedtoolcache/go/1.19.6/x64/src/testing/testing.go:1493 +0x47\n"} === Failed === FAIL: storage/mvcc (0.00s) === CONT testing.go:1319: race detected during execution of test FAIL FAIL go.etcd.io/etcd/server/v3/storage/mvcc 9.852s ``` After using the following command to reproduce it, we can get the error like: ```bash go test -v -p=2 --cpu=4 -count=1000 -failfast --race=true -short -timeout=30m ./ --- PASS: TestHashByRevValueLastRevision (0.12s) ================== WARNING: DATA RACE Read at 0x00c002024043 by goroutine 65745: testing.(common).logDepth() /usr/lib/go-1.19/src/testing/testing.go:883 +0xc4 testing.(common).log() /usr/lib/go-1.19/src/testing/testing.go:876 +0xa4 testing.(common).Logf() /usr/lib/go-1.19/src/testing/testing.go:927 +0x6a testing.(T).Logf() <autogenerated>:1 +0x75 go.uber.org/zap/zaptest.testingWriter.Write() /home/fuwei/go/pkg/mod/go.uber.org/zap@v1.24.0/zaptest/logger.go:130 +0x12c go.uber.org/zap/zaptest.(testingWriter).Write() <autogenerated>:1 +0x7e go.uber.org/zap/zapcore.(ioCore).Write() /home/fuwei/go/pkg/mod/go.uber.org/zap@v1.24.0/zapcore/core.go:99 +0x199 go.uber.org/zap/zapcore.(CheckedEntry).Write() /home/fuwei/go/pkg/mod/go.uber.org/zap@v1.24.0/zapcore/entry.go:255 +0x2ce go.uber.org/zap.(Logger).Info() /home/fuwei/go/pkg/mod/go.uber.org/zap@v1.24.0/logger.go:220 +0x6a go.etcd.io/etcd/server/v3/storage/mvcc.(treeIndex).Compact() /home/fuwei/go/src/go.etcd.io/etcd/server/storage/mvcc/index.go:194 +0x144 go.etcd.io/etcd/server/v3/storage/mvcc.(store).scheduleCompaction() /home/fuwei/go/src/go.etcd.io/etcd/server/storage/mvcc/kvstore_compaction.go:29 +0xbb go.etcd.io/etcd/server/v3/storage/mvcc.(store).compact.func1() /home/fuwei/go/src/go.etcd.io/etcd/server/storage/mvcc/kvstore.go:235 +0x9e go.etcd.io/etcd/pkg/v3/schedule.job.Do() /home/fuwei/go/src/go.etcd.io/etcd/pkg/schedule/schedule.go:41 +0x70 go.etcd.io/etcd/pkg/v3/schedule.(job).Do() <autogenerated>:1 +0x29 go.etcd.io/etcd/pkg/v3/schedule.(fifo).executeJob() /home/fuwei/go/src/go.etcd.io/etcd/pkg/schedule/schedule.go:206 +0x101 go.etcd.io/etcd/pkg/v3/schedule.(fifo).run() /home/fuwei/go/src/go.etcd.io/etcd/pkg/schedule/schedule.go:187 +0x1a5 go.etcd.io/etcd/pkg/v3/schedule.NewFIFOScheduler.func1() /home/fuwei/go/src/go.etcd.io/etcd/pkg/schedule/schedule.go:101 +0x39 Previous write at 0x00c002024043 by goroutine 65743: testing.tRunner.func1() /usr/lib/go-1.19/src/testing/testing.go:1433 +0x7e4 runtime.deferreturn() /usr/lib/go-1.19/src/runtime/panic.go:476 +0x32 testing.(T).Run.func1() /usr/lib/go-1.19/src/testing/testing.go:1493 +0x47 Goroutine 65745 (running) created at: go.etcd.io/etcd/pkg/v3/schedule.NewFIFOScheduler() /home/fuwei/go/src/go.etcd.io/etcd/pkg/schedule/schedule.go:101 +0x3b6 go.etcd.io/etcd/server/v3/storage/mvcc.NewStore() /home/fuwei/go/src/go.etcd.io/etcd/server/storage/mvcc/kvstore.go:111 +0x331 go.etcd.io/etcd/server/v3/storage/mvcc.TestHashByRevValueLastRevision() /home/fuwei/go/src/go.etcd.io/etcd/server/storage/mvcc/hash_test.go:76 +0xa4 testing.tRunner() /usr/lib/go-1.19/src/testing/testing.go:1446 +0x216 testing.(T).Run.func1() /usr/lib/go-1.19/src/testing/testing.go:1493 +0x47 Goroutine 65743 (running) created at: testing.(T).Run() /usr/lib/go-1.19/src/testing/testing.go:1493 +0x75d testing.runTests.func1() /usr/lib/go-1.19/src/testing/testing.go:1846 +0x99 testing.tRunner() /usr/lib/go-1.19/src/testing/testing.go:1446 +0x216 testing.runTests() /usr/lib/go-1.19/src/testing/testing.go:1844 +0x7ec testing.(M).Run() /usr/lib/go-1.19/src/testing/testing.go:1726 +0xa84 main.main() _testmain.go:265 +0x2e9 ================== ``` The schedule for compact is handled asynchronously and it might use `t.Logf` after go-test marks the case is done. And there is a comment from go-test: ```go // `c69ff3a7d0/src/testing/testing.go (LL1580C3-L1582C16)` // Do not lock t.done to allow race detector to detect race in case // the user does not appropriately synchronize a goroutine. t.done = true ``` We need to ensure that all the goroutines should be closed before case finish. REF: [1]: https://github.com/etcd-io/etcd/actions/runs/4312405975/jobs/7522924734 [2]: https://github.com/gotestyourself/gotestsum/issues/310 Signed-off-by: Wei Fu <fuweid89@gmail.com>	2023-03-18 10:58:00 +08:00
Benjamin Wang	1fe7b9793a	etcdserver: add failpoints walBeforeSync and walAfterSync Signed-off-by: Benjamin Wang <wachao@vmware.com>	2023-02-08 09:08:09 +08:00
guozhao	fab8474ef8	test: add CompactionSleepInterval in FakeStore's config After setting the ComparionSleepInterval, we can use time.Ticker instead of time.After to optimize the scheduleComparison(), otherwise it will fail in the 'TestStoreCompact(t)' test. Signed-off-by: guozhao <guozhao@360.cn>	2023-01-20 06:25:09 +08:00
guozhao	de8d6b3792	etcdserver: use time.Ticker instead of time.After Using time.After will create a new Timer in each cycle, In these cases , it is better to use time.Ticker. Signed-off-by: guozhao <guozhao@360.cn>	2023-01-17 16:58:13 +08:00
Benjamin Wang	bd9f1584d4	process the scenaro of the last WAL record being partially synced to disk We need to return io.ErrUnexpectedEOF in the error chain, so that etcdserver can repair it automatically. Signed-off-by: Benjamin Wang <wachao@vmware.com>	2023-01-08 04:46:51 +08:00
Piotr Tabor	8ec3cbc551	fixup! Add --raw mode to ./etcd-dump-log Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:22:39 +01:00
Piotr Tabor	e571fb7baa	Add --raw mode to ./etcd-dump-log This mode allows to look at RAW protos for all entries in WAL logs in the given directory. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:22:34 +01:00
Piotr Tabor	58681d3feb	Expose types of entries within the WAL log for access from the tools. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:19:23 +01:00
Piotr Tabor	0d8aad54ba	wal: Expose Decoder as package visible interface. Such that can be used by tools. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:19:23 +01:00
Piotr Tabor	bee2a08968	wal decoding: Add optional mode to ignore CRC errors. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:19:23 +01:00
Piotr Tabor	498619bdda	wal decoder: report file & offset in case of CRC mismatch. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-30 09:19:23 +01:00
Piotr Tabor	9abc895122	Goimports: Apply automated fixing to test files as well. Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-29 13:04:45 +01:00
Piotr Tabor	6f899a7b40	Merge pull request #15052 from ptabor/20221228-goimports-fix ./scripts/fix.sh: Takes care of goimports across the whole project.	2022-12-29 11:31:22 +01:00
Piotr Tabor	9e1abbab6e	Fix goimports in all existing files. Execution of ./scripts/fix.sh Signed-off-by: Piotr Tabor <ptab@google.com>	2022-12-29 09:41:31 +01:00
Wei Fu	4d0b91947e	chore: delete // +build buildtag by `go fix` Signed-off-by: Wei Fu <fuweid89@gmail.com>	2022-12-29 14:17:05 +08:00
Benjamin Wang	cc58edecf2	raft: add raft into the tools/mod Previously etcdservers depends on raft/raftpb/raft.proto directly. After moving raft to a separate repo, we need to add raft to the tools/mod, and get raft included in the -I protc flags. Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-12-02 15:36:41 +08:00
Benjamin Wang	faff80a2b3	etcdserve: format the source code gofmt -w ./server Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-12-02 13:00:59 +08:00
Benjamin Wang	e9aa275b36	etcdserver: update etcdserver to use the new raft module go.etcd.io/raft/v3 Just replaced all go.etcd.io/etcd/raft/v3 with go.etcd.io/raft/v3 under directory server. Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-12-02 09:33:45 +08:00
Benjamin Wang	686846ec65	add more debug info for opening WAL files failure Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-11-28 18:58:01 +08:00
Bhargav Ravuri	2feec4fe68	comments: fix comments as per goword in go test files Comments fixed as per goword in go test files that shell function go_srcs_in_module lists as per changes on #14827 Helps in #14827 Signed-off-by: Bhargav Ravuri <bhargav.ravuri@infracloud.io>	2022-11-23 23:05:42 +05:30
Andrew Sims	f656fa0f49	add missing copyright headers Signed-off-by: Andrew Sims <andrew.cameron.sims@gmail.com>	2022-11-23 19:13:43 +11:00
Sasha Melentyev	c3b6cbdb73	all: goimports -w . Signed-off-by: Sasha Melentyev <sasha@melentyev.io>	2022-11-17 19:07:04 +03:00
Marek Siarkowicz	f96957adba	tests: Add compact failpoints Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>	2022-11-15 14:59:03 +01:00
Benjamin Wang	4f824336ad	etcdserver: add two failpoints for backend 1. before and after create boltDB transaction; 2. before and after writebuf back to read buffer; Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-11-15 08:09:05 +08:00
Benjamin Wang	3f18816e7d	etcdserver: add gofail points before and after OnPreCommitUnsafe Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-11-14 11:02:18 +08:00
Benjamin Wang	5a3ef953eb	etcdserver: call the OnPreCommitUnsafe in unsafeCommit `unsafeCommit` is called by both `(batchTxBuffered) commit` and `(backend) defrag`. When users perform the defragmentation operation, etcd doesn't update the consistent index. If etcd crashes(e.g. panicking) in the process for whatever reason, then etcd replays the WAL entries starting from the latest snapshot, accordingly it may re-apply entries which might have already been applied, eventually the revision isn't consistent with other members. Refer to discussion in https://github.com/etcd-io/etcd/pull/14685 Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-11-11 10:57:15 +08:00
Marek Siarkowicz	2a1055c7f3	raft: Remove dependency on etcd api Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>	2022-11-08 13:56:46 +01:00
Cenk Alti	580a86ebe5	server: add more context to panic message Signed-off-by: Cenk Alti <cenkalti@gmail.com>	2022-10-31 20:29:15 -04:00
Benjamin Wang	d15a9d0edc	Merge pull request #14457 from jbml/hashbyrev_compact_main etcdserver: fix corruption check when server has just been compacted	2022-10-13 15:17:38 +08:00
Benjamin Wang	abef537a90	Merge pull request #14515 from spongecaptain/btree-generics upate:use google/btree in the genric way	2022-09-27 16:44:13 +08:00
wathenjiang	319db38b0a	update: add benchmark test benchmark result: (1) master branch $ go test -bench='BenchmarkIndexPut$' -count=5 goos: darwin goarch: amd64 pkg: go.etcd.io/etcd/server/v3/storage/mvcc cpu: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz BenchmarkIndexPut-12 1000000 2591 ns/op BenchmarkIndexPut-12 1000000 2531 ns/op BenchmarkIndexPut-12 1000000 2536 ns/op BenchmarkIndexPut-12 1000000 2546 ns/op BenchmarkIndexPut-12 1000000 2538 ns/op PASS ok go.etcd.io/etcd/server/v3/storage/mvcc 167.439s $ go test -bench='BenchmarkIndexGet$' -count=5 goos: darwin goarch: amd64 pkg: go.etcd.io/etcd/server/v3/storage/mvcc cpu: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz BenchmarkIndexGet-12 1000000 2021 ns/op BenchmarkIndexGet-12 1000000 2029 ns/op BenchmarkIndexGet-12 1000000 2044 ns/op BenchmarkIndexGet-12 1000000 1973 ns/op BenchmarkIndexGet-12 1000000 2027 ns/op PASS ok go.etcd.io/etcd/server/v3/storage/mvcc 177.815s (2) google/btree in the generic way $ go test -bench='BenchmarkIndexPut$' -count=5 goos: darwin goarch: amd64 pkg: go.etcd.io/etcd/server/v3/storage/mvcc cpu: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz BenchmarkIndexPut-12 1000000 2477 ns/op BenchmarkIndexPut-12 1000000 2380 ns/op BenchmarkIndexPut-12 1000000 2360 ns/op BenchmarkIndexPut-12 1000000 2396 ns/op BenchmarkIndexPut-12 1000000 2382 ns/op PASS ok go.etcd.io/etcd/server/v3/storage/mvcc 165.841s $ go test -bench='BenchmarkIndexGet$' -count=5 goos: darwin goarch: amd64 pkg: go.etcd.io/etcd/server/v3/storage/mvcc cpu: Intel(R) Core(TM) i7-9750H CPU @ 2.60GHz BenchmarkIndexGet-12 1000000 1985 ns/op BenchmarkIndexGet-12 1000000 1914 ns/op BenchmarkIndexGet-12 1000000 1900 ns/op BenchmarkIndexGet-12 1000000 1905 ns/op BenchmarkIndexGet-12 1000000 1894 ns/op PASS ok go.etcd.io/etcd/server/v3/storage/mvcc 177.573s Signed-off-by: wathenjiang <wathenjiang@tencent.com>	2022-09-27 14:33:02 +08:00
Spongecaptain	c53dfc7c5b	upate:use google/btree in the genric way Signed-off-by: wathenjiang <wathenjiang@tencent.com>	2022-09-27 10:16:15 +08:00
Kafuu Chino	f1d4935e91	*: avoid closing a watch with ID 0 incorrectly Signed-off-by: Kafuu Chino <KafuuChinoQ@gmail.com> add test	2022-09-26 20:30:33 +08:00
Jeremy Leach	cc1e245368	etcdserver: fix corruption check when server has just been compacted When a key-value store corruption check happens immediately after a compaction, the revision at which the key-value store hash is computed, is the compacted revision itself. In that case, the hash computation logic was incorrect because it returned an ErrCompacted error; this error should instead be returned when the revision at which the key-value store is hashed, is strictly lower than the compacted revision. Fixes #14325 Signed-off-by: Jeremy Leach <44558776+jbml@users.noreply.github.com>	2022-09-24 22:20:26 +10:00
SimFG	5702765729	wal: Fix the `walWriteBytes` metric Signed-off-by: SimFG <1142838399@qq.com>	2022-09-22 19:23:06 +08:00
Benjamin Wang	7f10dccbaf	Bump go 1.19: update all the dependencies and go.sum files 1. run ./scripts/fix.sh; 2. cd tools/mod; gofmt -w . & go mod tidy; Signed-off-by: Benjamin Wang <wachao@vmware.com>	2022-09-22 08:47:46 +08:00
demoManito	a9c3d56508	etcd: remove redundant type conversion Signed-off-by: demoManito <1430482733@qq.com>	2022-09-20 11:26:02 +08:00
demoManito	72cf0cc04a	etcd: modify declaring empty slices declare an empty slice to var s []int replace s :=[]int{}, https://github.com/golang/go/wiki/CodeReviewComments#declaring-empty-slices Signed-off-by: demoManito <1430482733@qq.com>	2022-09-16 14:41:14 +08:00
lovehhf	3b585e94fc	mvcc: Remove unused revisions and change comment rev to modified Signed-off-by: Hongfei Huang <853885165@qq.com>	2022-09-14 23:36:54 +08:00

1 2 3 4

174 Commits