roachtest: tpce/c=5000/nodes=3 failed #66719

Closed
cockroach-teamcity opened this issue Jun 22, 2021 · 4 comments
Labels:
branch-master: Failures and bugs on the master branch.
C-bug: Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
C-test-failure: Broken test (automatically or manually discovered).
O-roachtest
O-robot: Originated from a bot.
release-blocker: Indicates a release-blocker. Use with branch-release-2x.x label to denote which branch is blocked.

Comments

@cockroach-teamcity
Member

roachtest.tpce/c=5000/nodes=3 failed with artifacts on master @ 1b686aef9949c1c7ef930b55bd1fbc0ed2e8268a:

		
		thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: "SendError(..)"', driver/src/customer_emulator.rs:34:58
		thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: "SendError(..)"', driver/src/market_exchange_emulator.rs:48:53
		thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: "SendError(..)"', driver/src/data_maintenance_emulator.rs:54:30

	cluster.go:2484,tpce.go:100,tpce.go:117,test_runner.go:757: monitor failure: unexpected node event: 1: dead (exit status 134)
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2472
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2480
		  | main.registerTPCE.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:100
		  | main.registerTPCE.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:117
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:757
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (2) monitor failure
		Wraps: (3) unexpected node event: 1: dead (exit status 134)
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *errors.errorString

	cluster.go:1328,context.go:89,cluster.go:1317,test_runner.go:838: dead node detection: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod monitor teamcity-3103795-1624342309-74-n4cpu4 --oneshot --ignore-empty-nodes: exit status 1 4: skipped
		1: dead (exit status 134)
		3: 10484
		2: 10280
		Error: UNCLASSIFIED_PROBLEM: 1: dead (exit status 134)
		(1) UNCLASSIFIED_PROBLEM
		Wraps: (2) attached stack trace
		  -- stack trace:
		  | main.glob..func14
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:1154
		  | main.wrap.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:276
		  | github.com/spf13/cobra.(*Command).execute
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:856
		  | github.com/spf13/cobra.(*Command).ExecuteC
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:960
		  | github.com/spf13/cobra.(*Command).Execute
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:897
		  | main.main
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:2051
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:204
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (3) 1: dead (exit status 134)
		Error types: (1) errors.Unclassified (2) *withstack.withStack (3) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh tpce/c=5000/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@cockroach-teamcity added the branch-master, C-test-failure, O-roachtest, O-robot, and release-blocker labels Jun 22, 2021
@cockroach-teamcity
Member Author

roachtest.tpce/c=5000/nodes=3 failed with artifacts on master @ 19865dd4d29e87212f36b237aaaf543a7d9d55f5:

		  | e79b7647066e: Pull complete
		  | Digest: sha256:d8e11d701b9a6195db6fe1f00ac542f257e906844eb69c4ab65a79b2df03d713
		  | Status: Downloaded newer image for cockroachdb/tpc-e:latest
		  | Error: Error { kind: Db, cause: Some(DbError { severity: "ERROR", parsed_severity: None, code: SqlState("XXUUU"), message: "gs://cockroach-fixtures/tpce-csv/customers=5000/0/CashTransaction.txt?AUTH=implicit: http2: server sent GOAWAY and closed the connection; LastStreamID=1, ErrCode=NO_ERROR, debug=\"server_shutting_down\"", detail: None, hint: None, position: None, where_: None, schema: None, table: None, column: None, datatype: None, constraint: None, file: Some("read_import_base.go"), line: Some(271), routine: Some("func1") }) }
		  | Error: COMMAND_PROBLEM: exit status 1
		  | (1) COMMAND_PROBLEM
		  | Wraps: (2) Node 4. Command with error:
		  |   | ```
		  |   | sudo docker run cockroachdb/tpc-e:latest --customers=5000 --racks=3 --init --hosts=10.128.0.119
		  |   | ```
		  | Wraps: (3) exit status 1
		  | Error types: (1) errors.Cmd (2) *hintdetail.withDetail (3) *exec.ExitError
		  |
		  | stdout:
		  | Initializing schema...
		  | Importing dataset...
		Wraps: (4) exit status 20
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *main.withCommandDetails (4) *exec.ExitError

	cluster.go:2484,tpce.go:100,tpce.go:117,test_runner.go:757: monitor failure: monitor task failed: t.Fatal() was called
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2472
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2480
		  | main.registerTPCE.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:100
		  | main.registerTPCE.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:117
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:757
		Wraps: (2) monitor failure
		Wraps: (3) attached stack trace
		  -- stack trace:
		  | main.(*monitor).wait.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2528
		Wraps: (4) monitor task failed
		Wraps: (5) attached stack trace
		  -- stack trace:
		  | main.init
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2442
		  | runtime.doInit
		  | 	/usr/local/go/src/runtime/proc.go:5652
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:191
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1374
		Wraps: (6) t.Fatal() was called
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *withstack.withStack (4) *errutil.withPrefix (5) *withstack.withStack (6) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh tpce/c=5000/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@nvanbenschoten
Member

Same as #66717. We see a crash that looks like:

fatal error: concurrent map writes

goroutine 408320 [running]:
runtime.throw(0x4c5ce17, 0x15)
	/usr/local/go/src/runtime/panic.go:1116 +0x72 fp=0xc008485978 sp=0xc008485948 pc=0x48bb32
runtime.mapassign_fast64(0x452d560, 0xc008da0a20, 0x2, 0x5acd5a0)
	/usr/local/go/src/runtime/map_fast64.go:101 +0x33e fp=0xc0084859b8 sp=0xc008485978 pc=0x46707e
github.com/cockroachdb/cockroach/pkg/sql/opt.(*TableMeta).copyScalars(0xc0060bb230, 0xc008485bf0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/opt/table_meta.go:193 +0x2cd fp=0xc008485ad0 sp=0xc0084859b8 pc=0x23770cd
github.com/cockroachdb/cockroach/pkg/sql/opt.(*Metadata).CopyFrom(0xc004024380, 0xc004b84000, 0xc008485bf0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/opt/metadata.go:227 +0x445 fp=0xc008485ba8 sp=0xc008485ad0 pc=0x2370845
github.com/cockroachdb/cockroach/pkg/sql/opt/norm.(*Factory).CopyAndReplace(0xc00bfe6c98, 0x5bbb060, 0xc006593c90, 0xc008f8f180, 0xc008485c48)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/opt/norm/factory.go:222 +0xb2 fp=0xc008485c10 sp=0xc008485ba8 pc=0x253c2b2
github.com/cockroachdb/cockroach/pkg/sql/opt/norm.(*Factory).AssignPlaceholders(0xc00bfe6c98, 0xc004b84000, 0x0, 0x0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/opt/norm/factory.go:269 +0x105 fp=0xc008485c80 sp=0xc008485c10 pc=0x253c585
github.com/cockroachdb/cockroach/pkg/sql.(*optPlanningCtx).reuseMemo(0xc00bfe6c38, 0xc004b84000, 0xc00c5e8960, 0x4c55468, 0x13)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/plan_opt.go:436 +0x8c fp=0xc008485cc0 sp=0xc008485c80 pc=0x35a0a4c
github.com/cockroachdb/cockroach/pkg/sql.(*optPlanningCtx).buildExecMemo(0xc00bfe6c38, 0x5a76280, 0xc00c5e8960, 0x0, 0xc008485e38, 0x4a35ac)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/plan_opt.go:469 +0x84a fp=0xc008485de8 sp=0xc008485cc0 pc=0x35a136a
github.com/cockroachdb/cockroach/pkg/sql.(*planner).makeOptimizerPlan(0xc00bfe64f8, 0x5a76280, 0xc00c5e8960, 0xc0110f5a05, 0x110f5a0505fc8000)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/plan_opt.go:194 +0xde fp=0xc008485ea0 sp=0xc008485de8 pc=0x359fade
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).makeExecPlan(0xc00bfe6000, 0x5a76280, 0xc00c5e8960, 0xc00bfe64f8, 0x1, 0x4e1a76)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor_exec.go:948 +0x5a fp=0xc008485f48 sp=0xc008485ea0 pc=0x34ac6fa
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).dispatchToExecutionEngine(0xc00bfe6000, 0x5a76280, 0xc00c5e8960, 0xc00bfe64f8, 0x7f1698c432b8, 0xc008564680, 0x0, 0x0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor_exec.go:829 +0x18a fp=0xc008486208 sp=0xc008485f48 pc=0x34ab82a
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).execStmtInOpenState(0xc00bfe6000, 0x5a76280, 0xc00c5e8960, 0x5a94c00, 0xc007828ff0, 0xc0075c6005, 0x406, 0xa, 0x9, 0xc0078eb860, ...)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor_exec.go:667 +0xfa6 fp=0xc008486ff0 sp=0xc008486208 pc=0x34a89e6
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).execStmt(0xc00bfe6000, 0x5a761c0, 0xc006f81780, 0x5a94c00, 0xc007828ff0, 0xc0075c6005, 0x406, 0xa, 0x9, 0xc0078eb860, ...)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor_exec.go:123 +0xb14 fp=0xc008487520 sp=0xc008486ff0 pc=0x34a7334
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).execPortal(0xc00bfe6000, 0x5a761c0, 0xc006f81780, 0xc0078eb860, 0xc0067fc0a0, 0xa, 0xa, 0xc0189b31b0, 0x4, 0x4, ...)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor_exec.go:210 +0x14e fp=0xc008487658 sp=0xc008487520 pc=0x34a778e
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).execCmd.func2(0xc0075c64fa, 0x0, 0x0, 0x110e6f68, 0xed8651d55, 0x0, 0xc00bfe6000, 0xc008487a20, 0xc008487a10, 0x7, ...)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor.go:1596 +0x47b fp=0xc008487870 sp=0xc008487658 pc=0x366be9b
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).execCmd(0xc00bfe6000, 0x5a761c0, 0xc006f81780, 0x0, 0x0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor.go:1598 +0x45c fp=0xc008487cc8 sp=0xc008487870 pc=0x349b7dc
github.com/cockroachdb/cockroach/pkg/sql.(*connExecutor).run(0xc00bfe6000, 0x5a761c0, 0xc005840280, 0xc0013fc8c0, 0x5400, 0x15000, 0xc0013fc960, 0xc013c96b20, 0x0, 0x0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor.go:1450 +0x228 fp=0xc008487d68 sp=0xc008487cc8 pc=0x349b108
github.com/cockroachdb/cockroach/pkg/sql.(*Server).ServeConn(0xc001416680, 0x5a761c0, 0xc005840280, 0xc00bfe6000, 0x5400, 0x15000, 0xc0013fc960, 0xc013c96b20, 0x0, 0x0)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/conn_executor.go:484 +0xce fp=0xc008487df0 sp=0xc008487d68 pc=0x3496aee
github.com/cockroachdb/cockroach/pkg/sql/pgwire.(*conn).processCommandsAsync.func1(0xc014bde81b, 0xc007b20020, 0x5a761c0, 0xc005840280, 0xc013c96b20, 0xc001416680, 0xc008564000, 0x5add560, 0xc0029f6500, 0xc0043d5320, ...)
	/go/src/github.com/cockroachdb/cockroach/pkg/sql/pgwire/conn.go:627 +0x3ea fp=0xc008487f38 sp=0xc008487df0 pc=0x3b6720a
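
For readers unfamiliar with this failure mode: the Go runtime aborts the process when it detects unsynchronized concurrent writes to the same map, and the trace above shows the write happening in `TableMeta.copyScalars`, reached through `reuseMemo`. The following is a minimal sketch of one general way to avoid this class of crash when many goroutines copy from shared cached state: treat the source as read-only and write only into a map owned by the destination. All names here (`tableMeta`, `copyFrom`, `copyConcurrently`) are hypothetical stand-ins, not CockroachDB code, and this is not a description of the change made in the actual fix.

```go
package main

import (
	"fmt"
	"sync"
)

// tableMeta is a hypothetical stand-in for cached planner metadata that is
// shared between many connections.
type tableMeta struct {
	scalars map[int]string
}

// copyFrom fills a fresh map owned by dst and only reads from src.
// Concurrent reads of a Go map are safe as long as nobody writes to it,
// so many goroutines can copy from the same cached src at once.
func (dst *tableMeta) copyFrom(src *tableMeta) {
	dst.scalars = make(map[int]string, len(src.scalars))
	for k, v := range src.scalars {
		dst.scalars[k] = v
	}
}

// copyConcurrently makes n independent copies of src in parallel.
// Each goroutine writes to its own slice element and its own map.
func copyConcurrently(src *tableMeta, n int) []*tableMeta {
	out := make([]*tableMeta, n)
	var wg sync.WaitGroup
	for i := 0; i < n; i++ {
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			dst := &tableMeta{}
			dst.copyFrom(src)
			out[i] = dst
		}(i)
	}
	wg.Wait()
	return out
}

func main() {
	cached := &tableMeta{scalars: map[int]string{1: "a", 2: "b"}}
	copies := copyConcurrently(cached, 8)
	fmt.Println(len(copies), len(copies[0].scalars))
}
```

If the copy path had to mutate the shared source instead (for example, to lazily populate it), the writes would need to be guarded by a `sync.Mutex` or the shared value would need to be fully built before being published. Running tests with `go test -race` can surface this kind of bug before it turns into a runtime abort.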

@nvanbenschoten added the C-bug label Jun 23, 2021
@cockroach-teamcity
Member Author

roachtest.tpce/c=5000/nodes=3 failed with artifacts on master @ 95028269160e0bfa2b94477238e72670bf780e9d:

		
		Reported tpsE :     9.81
		
		thread '<unnamed>' panicked at 'called `Result::unwrap()` on an `Err` value: "SendError(..)"', driver/src/data_maintenance_emulator.rs:54:30

	cluster.go:2484,tpce.go:100,tpce.go:117,test_runner.go:757: monitor failure: unexpected node event: 1: dead (exit status 134)
		(1) attached stack trace
		  -- stack trace:
		  | main.(*monitor).WaitE
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2472
		  | main.(*monitor).Wait
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/cluster.go:2480
		  | main.registerTPCE.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:100
		  | main.registerTPCE.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/tpce.go:117
		  | main.(*testRunner).runTest.func2
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachtest/test_runner.go:757
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (2) monitor failure
		Wraps: (3) unexpected node event: 1: dead (exit status 134)
		Error types: (1) *withstack.withStack (2) *errutil.withPrefix (3) *errors.errorString

	cluster.go:1328,context.go:89,cluster.go:1317,test_runner.go:838: dead node detection: /home/agent/work/.go/src/github.com/cockroachdb/cockroach/bin/roachprod monitor teamcity-3113044-1624515413-73-n4cpu4 --oneshot --ignore-empty-nodes: exit status 1 4: skipped
		3: 10362
		1: dead (exit status 134)
		2: 10283
		Error: UNCLASSIFIED_PROBLEM: 1: dead (exit status 134)
		(1) UNCLASSIFIED_PROBLEM
		Wraps: (2) attached stack trace
		  -- stack trace:
		  | main.glob..func14
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:1154
		  | main.wrap.func1
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:276
		  | github.com/spf13/cobra.(*Command).execute
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:856
		  | github.com/spf13/cobra.(*Command).ExecuteC
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:960
		  | github.com/spf13/cobra.(*Command).Execute
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/vendor/github.com/spf13/cobra/command.go:897
		  | main.main
		  | 	/home/agent/work/.go/src/github.com/cockroachdb/cockroach/pkg/cmd/roachprod/main.go:2051
		  | runtime.main
		  | 	/usr/local/go/src/runtime/proc.go:225
		  | runtime.goexit
		  | 	/usr/local/go/src/runtime/asm_amd64.s:1371
		Wraps: (3) 1: dead (exit status 134)
		Error types: (1) errors.Unclassified (2) *withstack.withStack (3) *errutil.leafError
Reproduce

To reproduce, try:

# From https://go.crdb.dev/p/roachstress, perhaps edited lightly.
caffeinate ./roachstress.sh tpce/c=5000/nodes=3

Same failure on other branches

/cc @cockroachdb/kv

This test on roachdash | Improve this report!

@mgartner
Collaborator

Fixed by #66792.

@mgartner moved this to Done in SQL Queries Jul 24, 2023