📚 The doc issue The troubleshooting guide to detect incorrect hardware / driver has a script that one can run. The multi-node script fails because of the c10d rdzv_backend. This issue has been filed ...