-
Notifications
You must be signed in to change notification settings - Fork 164
Issues: tony-framework/TonY
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
ERROR ApplicationMaster:496 - Exception while preparing AM org.apache.hadoop.yarn.exceptions.YarnException: Can't resolve the ip of ubuntu at com.linkedin.tony.util.Utils.getHostNameOrIpFromTokenConf(Utils.java:365) at com.linkedin.tony.ApplicationMaster.prepare(ApplicationMaster.java:476) at com.linkedin.tony.ApplicationMaster.run(ApplicationMaster.java:368) at com.linkedin.tony.ApplicationMaster.main(ApplicationMaster.java:342)
#673
opened May 26, 2022 by
ckqqqq
Allow that one role of task executor could make other roles exit
enhancement
New feature or request
#636
opened Jan 19, 2022 by
zuston
Task executors that support specific roles are restarted when they fail
enhancement
New feature or request
#620
opened Nov 25, 2021 by
zuston
Provide tony-submit cli tool to submit app
enhancement
New feature or request
help wanted
Extra attention is needed
#578
opened Aug 3, 2021 by
zuston
There is a vulnerability in Protocol Buffers 0.8.1 ,upgrade recommended
good first issue
Good for newcomers
help wanted
Extra attention is needed
#552
opened May 17, 2021 by
QiAnXinCodeSafe
A little mistakes in tony-example README
good first issue
Good for newcomers
#528
opened Apr 17, 2021 by
daugraph
Support elastic Horovod on TonY
enhancement
New feature or request
help wanted
Extra attention is needed
#525
opened Apr 12, 2021 by
zuston
Check for app failure before updating task infos
good first issue
Good for newcomers
#464
opened Sep 14, 2020 by
hungj
Add more testing for Tony retry logic
good first issue
Good for newcomers
#452
opened Jun 26, 2020 by
goyalankit
Add retries id in the environment when we retry
good first issue
Good for newcomers
#434
opened May 14, 2020 by
oliverhu
fail the job instead of hanging if it's requesting GPU(s) on a host where it doesn't have enough GPU(s)
help wanted
Extra attention is needed
#432
opened Mar 21, 2020 by
burgerkingeater
Add option to return SUCCEED when training is completed with some failed job tasks
#420
opened Jan 16, 2020 by
charliechen211
Print tony version in tony client and tony AM
good first issue
Good for newcomers
#392
opened Oct 8, 2019 by
hungj
Previous Next
ProTip!
Follow long discussions with comments:>50.