not enough clients in start calculation #46

Open
lmx-code opened this issue Mar 8, 2023 · 16 comments

Comments


lmx-code commented Mar 8, 2023

Describe the bug
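After setting up a local deployment, I ran the horizontal federated learning task from the examples and hit the problem shown below.
[Screenshot: b49e2b33325a89f1448cfc03258eaea]
[Screenshot: 1678265920211]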

lmx-code added the bug (Something isn't working) label Mar 8, 2023
lencyforce removed the bug (Something isn't working) label Mar 8, 2023
lencyforce changed the title [bug] not enough clients in start calculation Mar 8, 2023
@lencyforce
Member

#18

@ssyyxx17410103151207

I ran into much the same problem: when I run the example it always ends with an exception, and I don't know why. The full log is below.

2023-03-13 10:57:36 start run task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc
2023-03-13 10:57:36 [Create Task] create task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc
tx hash: 0x0acfaaa1917bb0a9c8ba7d31bf3a16b8e551bec86cb33f6a2ca44c5de96f0bda
2023-03-13 10:57:36 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc download task config
2023-03-13 10:57:36 [Start Round] task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 start
tx hash: 0xc3a620f72bd11b42c0586a32b7fa77a8f70ff9aef60a576d5991122488ec7825
2023-03-13 10:57:36 [Join the training round] try to join in task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1
tx hash: 0x98c76db717dadf8fc8068151a2070f0d39b620d92051ddbb447a7aa49fb7f1e0
2023-03-13 10:57:46 [Select Candidates] task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 select candidates ['0x459d81ac4fb5211248b512561643A1669Cd932c4', '0x6578aDabE867C4F7b2Ce4c59aBEAbDC754fBb990', '0xA3dcddd50d436770EF76388D0e6C5441986De4eE']
tx hash: 0x83fd0354ad56c40fa2f9e1ef47410b3b5f7265eb2b0fb29e33a29ce8e2cee2c8
2023-03-13 10:57:46 join in task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1
2023-03-13 10:57:47 [Secret Sharing Parts Commitments] task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 upload seed secret share commitments
tx hash: 0x9b13879522bfd982d3bbe6f6e7523b81b8df20e76b6b0841b247c9eda0f0818b
2023-03-13 10:57:47 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 0x459d81ac4fb5211248b512561643A1669Cd932c4 upload secret shares
2023-03-13 10:57:47 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 0xA3dcddd50d436770EF76388D0e6C5441986De4eE upload secret shares
2023-03-13 10:57:47 [Secret Sharing Parts Commitments] task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 upload secret key secret share commitments
tx hash: 0x1829651aa7262ba967afc97bdfb163cc76bc4adeb87df7e7b99a84a816f9e136
2023-03-13 10:57:47 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 0x6578aDabE867C4F7b2Ce4c59aBEAbDC754fBb990 upload secret shares
2023-03-13 10:57:47 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 upload secret shares
2023-03-13 10:57:56 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 0x459d81ac4fb5211248b512561643A1669Cd932c4 get secret shares
2023-03-13 10:57:56 task 0x9582242428ce149830b60abed6c4482d0ffb65b70bdc37d7aaa9ace7a8e29adc round 1 0xA3dcddd50d436770EF76388D0e6C5441986De4eE get secret

@lmx-code
Author

image
image
I have traced the problem to this part. Could you help me work out what is causing this error?

@mh739025250
Member

@lmx-code Which example are you running?

@lmx-code
Author

@mh739025250 The horizontal federated learning task.

@mh739025250
Member

Are you running it with delta-all-in-one? Which version are you on?
Your problem looks like a bug that I already fixed in version 0.8.1.

@lmx-code
Author

I pulled it with this command: git clone --depth 1 --branch v0.8.1 https://github.com/delta-mpc/delta-all-in-one.git and followed the documentation for the remaining steps.

@mh739025250
Member

@lmx-code
After starting the services with docker-compose up -d,
run docker exec -it dashboard bash to open a terminal inside the dashboard container,
then run pip list | grep delta and show me the output.
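
The same version check can also be sketched in Python and run inside the dashboard container. This is only an illustration, not part of the project: the helper names `delta_packages` and `installed_pairs` are hypothetical, and the match is case-insensitive on the package name only, which is slightly different from the grep above.

```python
from importlib.metadata import distributions


def delta_packages(pairs):
    """Keep (name, version) pairs whose name contains 'delta', sorted by name."""
    return sorted((n, v) for n, v in pairs if n and "delta" in n.lower())


def installed_pairs():
    """Yield (name, version) for every distribution visible to this interpreter."""
    for dist in distributions():
        yield dist.metadata["Name"], dist.version


if __name__ == "__main__":
    # Roughly what `pip list | grep delta` shows, one package per line.
    for name, version in delta_packages(installed_pairs()):
        print(f"{name}=={version}")
```

The output is one `name==version` line per matching package, which can be pasted into the issue in place of a screenshot.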

@lmx-code
Author

image

@mh739025250
Member

OK, I see. This is a version problem with the delta-task package in the deltaboard image.
I will publish another minor release to fix it.

@lmx-code
Author

OK, please let me know once the release is out. Thanks.

@mh739025250
Member

@ssyyxx17410103151207 Is your problem the same as @lmx-code's? The log you posted does not show where the error occurs.

@mh739025250
Member

@lmx-code Version 0.8.3 has been released and fixes the problem you reported. You can give it a try.


zhaosiyu-alt commented May 13, 2023

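Hello, I set up a network without a blockchain on version 0.8.3 and tried to run the horizontal federated learning task. The log still shows ERROR and the final run status is abnormal. Why is that? (Could it be because there are only two delta-node nodes while the MNIST dataset is split into three parts, so three delta-node nodes are needed?)

The log is below: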
2023-05-13 16:24:44 start run task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772
2023-05-13 16:24:44 [Create Task] create task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772
tx hash: 0xa8b1d984b3c702fe31ca093838c2f24cc6a244992c71d30102b17ddd577a1267
2023-05-13 16:24:44 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 download task config
2023-05-13 16:24:44 [Start Round] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 start
tx hash: 0x5ef70446ff12b0948dfeda9d878154cec76ceb70cc0ac107a5169af8bdf271e4
2023-05-13 16:24:44 [Join the training round] try to join in task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1
tx hash: 0xadf57ab491f143746785f1dc9ed1d6ed3b7140665cde96a138336acfa89b7362
2023-05-13 16:24:54 [Select Candidates] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 select candidates ['0xa9767bea87192ff0aabf510450de7b1957ffcbe6', '0x7229e74fcc916eadd354aaf7d042d54819403559']
tx hash: 0xf88ec4233e426b2fc56493e598e2373b3e4795d08f443375c91726391f30deec
2023-05-13 16:24:54 join in task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1
2023-05-13 16:24:54 [Secret Sharing Parts Commitments] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 upload seed secret share commitments
tx hash: 0xff890381d4cd1b761baaf69a4ac7549d539df82e2a47640aa1d83cde2dad5a9f
2023-05-13 16:24:54 [Secret Sharing Parts Commitments] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 upload secret key secret share commitments
tx hash: 0x4bdcfb741d0fa6411e97d9496badfa9b564a19101be1782b02c639b10f7e102b
2023-05-13 16:24:54 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0x7229e74fcc916eadd354aaf7d042d54819403559 upload secret shares
2023-05-13 16:24:54 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0xa9767bea87192ff0aabf510450de7b1957ffcbe6 upload secret shares
2023-05-13 16:24:54 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 upload secret shares
2023-05-13 16:25:04 [Start Calculation] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 ['0xa9767bea87192ff0aabf510450de7b1957ffcbe6', '0x7229e74fcc916eadd354aaf7d042d54819403559'] start calculation
tx hash: 0x4ca71f7a1dacdf715710098454c2941b8cea8031bf513e8a5f049a015c870f42
2023-05-13 16:25:04 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0x7229e74fcc916eadd354aaf7d042d54819403559 get secret shares
2023-05-13 16:25:04 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0xa9767bea87192ff0aabf510450de7b1957ffcbe6 get secret shares
2023-05-13 16:25:04 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 get and validate other members' secret shares
2023-05-13 16:25:34 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 error: not enough clients in start calculation
2023-05-13 16:26:12 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0x7229e74fcc916eadd354aaf7d042d54819403559 upload result
2023-05-13 16:26:13 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 0xa9767bea87192ff0aabf510450de7b1957ffcbe6 upload result
2023-05-13 16:26:13 task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 upload masked result
2023-05-13 16:26:13 [Masked Result Commitment] task 0x24f87732d7c95e059617472fc38982303b0e6206b029d73fe6f3f91717148772 round 1 upload result commitment
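
To compare a log like this against the guess about the node count, the selected candidates and any error lines can be extracted with a short script. This is a minimal sketch that assumes only the line formats visible in this log (`select candidates [...]` and `error: ...`); the function name `summarize_log` is hypothetical, and the script does not know how many clients the task actually requires.

```python
import ast
import re

# Patterns taken from the log lines in this thread; other versions may differ.
CANDIDATES_RE = re.compile(r"select candidates (\[.*\])")
ERROR_RE = re.compile(r"error: (.+)$")


def summarize_log(lines):
    """Return (candidate addresses, error messages) found in task log lines."""
    candidates, errors = [], []
    for line in lines:
        line = line.strip()
        match = CANDIDATES_RE.search(line)
        if match:
            # The list is printed as a Python literal, so literal_eval can read it.
            candidates = ast.literal_eval(match.group(1))
        match = ERROR_RE.search(line)
        if match:
            errors.append(match.group(1))
    return candidates, errors
```

Calling `summarize_log(open(path, encoding="utf-8"))` on a saved copy of the log returns the addresses from the last `select candidates` line and every message that follows `error:`, so the candidate count can be checked against the number of delta-node instances that were started.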


Fumon554 commented Nov 8, 2023

Running the horizontal federated learning task with delta-all-in-one 0.8.3 still shows this problem.
[Screenshot: 2023-11-08 205953]


Fumon554 commented Nov 8, 2023

[Screenshot: 2023-11-08 210212]
