* etcdserver: rename `minimumBatchInterval` to `defaultCompactionSleepInterval` and `defaultCompactBatchLimit` to `defaultCompactionBatchLimit`
Signed-off-by: Jalin Wang <JalinWang@outlook.com>
Before this patch, the tombstone could be deleted if its revision was equal
to the compacted revision, which meant the watch subscriber would not
receive this DELETE event. Based on the Compact API [1], we should keep the
tombstone revision if it is not less than the compaction revision.
> CompactionRequest compacts the key-value store up to a given revision.
> All superseded keys with a revision less than the compaction revision
> will be removed.
[1]: https://etcd.io/docs/latest/dev-guide/api_reference_v3/
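A minimal sketch of the keep rule described above, using plain integers
rather than etcd's keyIndex internals:

```go
package main

import "fmt"

// tombstoneKeptAfterCompaction mirrors the rule quoted from the Compact API:
// only revisions strictly less than the compaction revision are superseded,
// so a tombstone whose revision equals the compaction revision must be kept
// and its DELETE event can still reach watch subscribers.
func tombstoneKeptAfterCompaction(tombstoneRev, compactRev int64) bool {
	return tombstoneRev >= compactRev
}

func main() {
	fmt.Println(tombstoneKeptAfterCompaction(14, 14)) // true: kept at the compacted revision
	fmt.Println(tombstoneKeptAfterCompaction(13, 14)) // false: superseded, may be removed
}
```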
Signed-off-by: Wei Fu <fuweid89@gmail.com>
In commit [1], the newTestKeyIndex function creates one key with two
Revision{Main: 14} revisions. However, since [2], the etcd server does not
allow duplicate keys in a single transaction. This update to
newTestKeyIndex avoids confusion and keeps the test consistent with the
latest etcd server behavior.
REF:
[1]: be80d11948
[2]: https://github.com/etcd-io/etcd/pull/4376
Signed-off-by: Wei Fu <fuweid89@gmail.com>
golangci-lint reports the following issue:
storage/mvcc/kvstore.go:312:27: (*store).restore - result 0 (error) is always nil (unparam)
This is because both Attach() and compactLockfree() within restore() can
return an error, but the current implementation only logs them. Thus, the
return value of restore() is always nil, hence the linter warning.
We have agreed to suppress the linter warning for now [1].
Reference:
[1] https://github.com/etcd-io/etcd/pull/18228#issuecomment-2187309957
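For reference, silencing such a warning with golangci-lint looks roughly
like the snippet below; the store type and helper are stubs so the example
compiles on its own, not etcd's actual restore code:

```go
package main

import "log"

// store is a stub standing in for etcd's mvcc store; it only exists so the
// nolint example compiles on its own.
type store struct{}

func (s *store) attach() error { return nil }

// restore mirrors the situation the linter flags: the error return is kept
// in the signature, but failures from the helpers are only logged, so the
// result is always nil and the unparam warning is suppressed.
//
//nolint:unparam
func (s *store) restore() error {
	if err := s.attach(); err != nil {
		log.Printf("attach failed: %v", err)
	}
	return nil
}

func main() {
	_ = (&store{}).restore()
}
```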
Signed-off-by: Chun-Hung Tseng <henrybear327@gmail.com>
scheduleCompaction function
To improve traceability of backend database usage, this adds the
parameters below, related to backend database usage metrics, inside the
scheduleCompaction function (a sketch of the logging follows the list).
current-db-size-bytes
current-db-size
current-db-size-in-use-bytes
current-db-size-in-use
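A minimal sketch of what logging these fields might look like, assuming a
zap logger and a backend that exposes Size() and SizeInUse(); only the
field names come from the list above, the surrounding function and stubs
are illustrative:

```go
package main

import (
	"github.com/dustin/go-humanize"
	"go.uber.org/zap"
)

// backend is a stub exposing the two size readings behind the new fields.
type backend interface {
	Size() int64      // total db file size in bytes
	SizeInUse() int64 // bytes in use, excluding free pages
}

// logCompactionDBUsage emits the backend usage fields once a scheduled
// compaction finishes, in both raw-byte and human-readable form.
func logCompactionDBUsage(lg *zap.Logger, be backend) {
	lg.Info(
		"finished scheduled compaction",
		zap.Int64("current-db-size-bytes", be.Size()),
		zap.String("current-db-size", humanize.Bytes(uint64(be.Size()))),
		zap.Int64("current-db-size-in-use-bytes", be.SizeInUse()),
		zap.String("current-db-size-in-use", humanize.Bytes(uint64(be.SizeInUse()))),
	)
}

type fakeBackend struct{ size, inUse int64 }

func (f fakeBackend) Size() int64      { return f.size }
func (f fakeBackend) SizeInUse() int64 { return f.inUse }

func main() {
	logCompactionDBUsage(zap.NewExample(), fakeBackend{size: 64 << 20, inUse: 48 << 20})
}
```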
Signed-off-by: Rahul More <rahulbapumore@gmail.com>
The HashByRev goroutines exit once they receive the `donec` notification.
The check-computed-hashes goroutine then never gets a chance to receive
the hash result and is stuck forever. We should add a `donec` case when we
wait for the hash result.
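A self-contained sketch of the fix: the waiter selects on `donec` as well,
so it cannot block forever once the hash workers have exited. The channel
names mirror the description above; everything else is simplified:

```go
package main

import "fmt"

// waitForHash returns the computed hash, or false if the hash workers were
// told to stop (donec closed) before a result arrived.
func waitForHash(hashc <-chan uint32, donec <-chan struct{}) (uint32, bool) {
	select {
	case h := <-hashc:
		return h, true
	case <-donec:
		// Without this case the caller would block forever after the
		// HashByRev goroutines exit on donec.
		return 0, false
	}
}

func main() {
	donec := make(chan struct{})
	close(donec)
	h, ok := waitForHash(make(chan uint32), donec)
	fmt.Println(h, ok) // 0 false
}
```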
Signed-off-by: Wei Fu <fuweid89@gmail.com>
What would you like to be added?
Add a new compactor based on revision count, instead of a fixed time
interval.
To make this happen, the mvcc store needs to export a `CompactNotify`
function to notify the compactor that the configured number of write
transactions have occurred since the previous compaction. The new
compactor can then observe the revision change and delete out-of-date data
promptly, instead of waiting for a fixed time interval, so the underlying
bbolt db can reuse the free pages as soon as possible.
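A rough, self-contained sketch of that idea, not etcd's actual
implementation: the store counts write transactions and signals a buffered
channel that a `CompactNotify`-style accessor exposes to the compactor.
All names besides `CompactNotify` are made up for illustration:

```go
package main

import (
	"fmt"
	"sync"
)

// notifyingStore counts committed write transactions and signals the
// compactor once the configured number of revisions has accumulated.
type notifyingStore struct {
	mu          sync.Mutex
	writes      int64
	threshold   int64
	compactNoti chan struct{}
}

func newNotifyingStore(threshold int64) *notifyingStore {
	return &notifyingStore{threshold: threshold, compactNoti: make(chan struct{}, 1)}
}

// CompactNotify exposes the channel the compactor waits on.
func (s *notifyingStore) CompactNotify() <-chan struct{} { return s.compactNoti }

// commitWrite is called at the end of each write transaction.
func (s *notifyingStore) commitWrite() {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.writes++
	if s.writes >= s.threshold {
		s.writes = 0
		select {
		case s.compactNoti <- struct{}{}: // wake the compactor
		default: // a notification is already pending
		}
	}
}

func main() {
	s := newNotifyingStore(3)
	go func() {
		for i := 0; i < 3; i++ {
			s.commitWrite()
		}
	}()
	<-s.CompactNotify()
	fmt.Println("compactor notified after 3 write transactions")
}
```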
Why is this needed?
In a Kubernetes cluster, for instance one running Argo Workflows, there
will be batches of requests to create pods, followed by a lot of PATCH
requests for pod status, especially when a pod has more than 3 containers.
If the burst of requests increases the db size in a short time, it is easy
to exceed the max quota size. The cluster admin then has to step in to
defragment, which may cause long downtime. So we hope etcd can delete
out-of-date data as soon as possible and slow down the growth of the total
db size.
Currently, both the revision and periodic compactors are time-based. A
fixed time interval does not cope well with unexpected bursts of update
requests. The new compactor, based on revision count, makes the admin's
life easier.
For instance, suppose the average object size is 50 KiB and the new
compactor compacts every 10,000 revisions. Then etcd effectively compacts
after roughly 500 MiB of new data has come in, no matter how long it takes
to accumulate those 10,000 revisions. This handles burst update requests
well.
Here are some test results:
* Fixed value size: 10 KiB, Update Rate: 100/s, Total key space: 3,000
```
benchmark put --rate=100 --total=300000 --compact-interval=0 \
--key-space-size=3000 --key-size=256 --val-size=10240
```
| Compactor | DB Total Size | DB InUse Size |
| -- | -- | -- |
| Revision(5min,retention:10000) | 570 MiB | 208 MiB |
| Periodic(1m) | 232 MiB | 165 MiB |
| Periodic(30s) | 151 MiB | 127 MiB |
| NewRevision(retention:10000) | 195 MiB | 187 MiB |
* Random value size: [9 KiB, 11 KiB], Update Rate: 150/s, Total key space: 3,000
```
benchmark put --rate=150 --total=300000 --compact-interval=0 \
--key-space-size=3000 --key-size=256 --val-size=10240 \
--delta-val-size=1024
```
| Compactor | DB Total Size | DB InUse Size |
| -- | -- | -- |
| Revision(5min,retention:10000) | 718 MiB | 554 MiB |
| Periodic(1m) | 297 MiB | 246 MiB |
| Periodic(30s) | 185 MiB | 146 MiB |
| NewRevision(retention:10000) | 186 MiB | 178 MiB |
* Random value size: [6 KiB, 14 KiB], Update Rate: 200/s, Total key space: 3,000
```
benchmark put --rate=200 --total=300000 --compact-interval=0 \
--key-space-size=3000 --key-size=256 --val-size=10240 \
--delta-val-size=4096
```
| Compactor | DB Total Size | DB InUse Size |
| -- | -- | -- |
| Revision(5min,retention:10000) | 874 MiB | 221 MiB |
| Periodic(1m) | 357 MiB | 260 MiB |
| Periodic(30s) | 215 MiB | 151 MiB |
| NewRevision(retention:10000) | 182 MiB | 176 MiB |
For burst requests, we need to use a short periodic interval; otherwise,
the total size becomes large. I think the new compactor can handle this
well.
Additional Change:
Currently, the quota system only checks the DB total size. However, there
could be a lot of free pages which could be reused for upcoming requests.
Based on this proposal, I also want to extend the current quota system with
the DB's InUse size.
If the InUse size is less than the max quota size, we should allow update
requests. Since bbolt might still be resized if there are no contiguous
free pages available, we should set up a hard limit for the overflow, like
1 GiB.
```diff
// Quota represents an arbitrary quota against arbitrary requests. Each request
@@ -130,7 +134,17 @@ func (b *BackendQuota) Available(v interface{}) bool {
return true
}
// TODO: maybe optimize Backend.Size()
- return b.be.Size()+int64(cost) < b.maxBackendBytes
+
+ // Since compaction frees reusable pages, we should check the
+ // SizeInUse first. If there are no contiguous free pages for key/value
+ // data and boltdb keeps resizing, it must not grow by more than 1
+ // GiB. That is a hard limit.
+ //
+ // TODO: It should be enabled by flag.
+ if b.be.Size()+int64(cost)-b.maxBackendBytes >= maxAllowedOverflowBytes(b.maxBackendBytes) {
+ return false
+ }
+ return b.be.SizeInUse()+int64(cost) < b.maxBackendBytes
}
```
We could also likely avoid raising the NOSPACE alarm when compaction frees
enough pages, which would reduce downtime.
Signed-off-by: Wei Fu <fuweid89@gmail.com>
check ScheduledCompactKeyName and FinishedCompactKeyName
before writing the hash to the hashstore. If they do not match, it means this compaction was interrupted at some point and its hash value is invalid. In such cases, we do not write the hash values to the hashstore, which avoids a false corruption alarm.
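A simplified sketch of the guard, with a hypothetical helper in place of
reading ScheduledCompactKeyName and FinishedCompactKeyName from the store:

```go
package main

import "fmt"

// storeHashIfValid only persists the compaction hash when the scheduled and
// finished compact revisions agree; a mismatch means the compaction was
// interrupted and the hash would be misleading. The revision arguments are
// hypothetical stand-ins for the values read from ScheduledCompactKeyName
// and FinishedCompactKeyName.
func storeHashIfValid(scheduledRev, finishedRev int64, hash uint32, put func(rev int64, hash uint32)) bool {
	if scheduledRev != finishedRev {
		// Interrupted compaction: skip the write to avoid a false
		// corruption alarm later.
		return false
	}
	put(finishedRev, hash)
	return true
}

func main() {
	stored := storeHashIfValid(100, 100, 0xdeadbeef, func(rev int64, h uint32) {
		fmt.Printf("stored hash %x at rev %d\n", h, rev)
	})
	fmt.Println("stored:", stored)
}
```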
Signed-off-by: caojiamingalan <alan.c.19971111@gmail.com>
Progress notifications requested using ProgressRequest were sent
directly using the ctrlStream, which means that they could race
against watch responses in the watchStream.
This would especially happen when the stream was not synced - e.g. if
you requested a progress notification on a freshly created unsynced
watcher, the notification would typically arrive indicating a revision
for which not all watch responses had been sent.
This changes the behaviour so that v3rpc always goes through the watch
stream, using a new RequestProgressAll function that closely matches
the behaviour of the v3rpc code - i.e.
1. Generate a message with WatchId -1, indicating the revision for
*all* watchers in the stream
2. Guarantee that a response is (eventually) sent
The latter might require us to defer the response until all watchers
are synced, which is likely as it should be. Note that we do *not*
guarantee that the number of progress notifications matches the number
of requests, only that eventually at least one gets sent.
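A condensed, self-contained sketch of that flow with simplified stand-in
types (not the actual v3rpc ones): the request is remembered, and a single
WatchId -1 response is emitted through the watch stream once it is synced.

```go
package main

import "fmt"

// progressResponse mimics a watch response used purely as a progress
// notification for all watchers on the stream (WatchId -1).
type progressResponse struct {
	WatchID  int64
	Revision int64
}

// watchStream is a simplified stand-in for a per-client watch stream.
type watchStream struct {
	synced          bool
	rev             int64
	progressPending bool
	out             []progressResponse
}

// RequestProgressAll records that a progress notification for all watchers
// was requested; it is only emitted once the stream is synced, so it can
// never indicate a revision ahead of the watch responses already delivered.
func (ws *watchStream) RequestProgressAll() {
	ws.progressPending = true
	ws.maybeSendProgress()
}

func (ws *watchStream) maybeSendProgress() {
	if ws.progressPending && ws.synced {
		ws.out = append(ws.out, progressResponse{WatchID: -1, Revision: ws.rev})
		ws.progressPending = false
	}
}

func main() {
	ws := &watchStream{rev: 7}
	ws.RequestProgressAll() // deferred: stream not yet synced
	ws.synced = true
	ws.maybeSendProgress() // now the single notification goes out
	fmt.Println(ws.out)
}
```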
Signed-off-by: Peter Wortmann <peter.wortmann@skao.int>
Problem: during restore in watchableStore.Restore, synced watchers are moved to unsynced.
minRev will be behind, since it is not updated while the watcher stays synced.
Solution: update minRev when moving the watcher to unsynced.
fixes: https://github.com/etcd-io/etcd/issues/15271
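A minimal sketch of the idea with simplified types (not etcd's
watchableStore): when formerly synced watchers are moved to unsynced
during restore, their minRev is advanced past the current revision.

```go
package main

import "fmt"

// watcher is a simplified stand-in; minRev is the next revision the watcher
// still needs to observe.
type watcher struct {
	id     int
	minRev int64
}

// moveToUnsynced is called for previously synced watchers during restore.
// Without bumping minRev they would appear to lag behind, because minRev is
// not advanced while a watcher stays in the synced group.
func moveToUnsynced(ws []*watcher, currentRev int64, unsynced map[int]*watcher) {
	for _, w := range ws {
		if w.minRev <= currentRev {
			w.minRev = currentRev + 1 // resume from the next revision
		}
		unsynced[w.id] = w
	}
}

func main() {
	unsynced := map[int]*watcher{}
	moveToUnsynced([]*watcher{{id: 1, minRev: 3}}, 10, unsynced)
	fmt.Println(unsynced[1].minRev) // 11
}
```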
Signed-off-by: Bogdan Kanivets <bkanivets@apple.com>
Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>
The original test case used a `return` statement which skipped the
`restore` case. This change enables the `restore` test case.
Signed-off-by: Wei Fu <fuweid89@gmail.com>
After setting the CompactionSleepInterval, we can use time.Ticker
instead of time.After to optimize scheduleCompaction(); otherwise
it will fail in the `TestStoreCompact` test.
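A small standard-library illustration of the change: one reusable
time.Ticker instead of a fresh timer from time.After on every iteration;
the loop body is a placeholder, not the actual scheduleCompaction code.

```go
package main

import (
	"fmt"
	"time"
)

// compactLoop waits between batches using a single reusable Ticker rather
// than calling time.After in every iteration, which would allocate a new
// timer each time around the loop.
func compactLoop(interval time.Duration, batches int, stop <-chan struct{}) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()

	for i := 0; i < batches; i++ {
		// ... compact one batch here ...
		select {
		case <-ticker.C:
		case <-stop:
			return
		}
	}
}

func main() {
	stop := make(chan struct{})
	compactLoop(10*time.Millisecond, 3, stop)
	fmt.Println("done")
}
```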
Signed-off-by: guozhao <guozhao@360.cn>
Comments fixed as per goword in the Go test files that the shell
function go_srcs_in_module lists, as per the changes in #14827.
Helps with #14827.
Signed-off-by: Bhargav Ravuri <bhargav.ravuri@infracloud.io>