Run logs arrive corrupted #185
Comments
I've seen this bug quite a few times, but was never able to find what causes it or a way to reproduce it consistently.
@kengz Does the workflow itself fail, or is the problem only with the output?
The workflow fails too.
Hmm. Are you sure? According to the runner logs, the job was marked as Done.
That's where the log ended. I had it print "done" at the end and it didn't get there.
I found the cause of the issue. dstack's runner sends logs in chunks as UTF-8 websocket messages, but a multibyte Unicode character can straddle the boundary between two messages, so that a message on its own is invalid UTF-8: dstack/runner/internal/stream/http.go Line 108 in dd63876
The solution would be to send raw bytes over the websocket.
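For illustration, here is a minimal Python sketch of that failure mode, not dstack's actual code. The 4096-byte chunk size is an assumption inferred from the byte offsets in the reported error, and the helper names (`split_chunks`, `decode_naive`, `decode_incremental`) are hypothetical. It shows that decoding each chunk separately fails when a multibyte character is cut in two, while a receiver that treats the chunks as raw bytes and decodes them incrementally recovers the full text.

```python
import codecs

CHUNK_SIZE = 4096  # assumption: inferred from "position 4094-4095" in the error


def split_chunks(data, size=CHUNK_SIZE):
    """Cut a byte string into fixed-size chunks, ignoring character boundaries."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def decode_naive(chunks):
    """Decode every chunk on its own; fails if a character straddles two chunks."""
    return "".join(chunk.decode("utf-8") for chunk in chunks)


def decode_incremental(chunks):
    """Treat chunks as raw bytes and carry incomplete sequences over to the next chunk."""
    decoder = codecs.getincrementaldecoder("utf-8")()
    parts = [decoder.decode(chunk) for chunk in chunks]
    parts.append(decoder.decode(b"", final=True))  # flush; raises if data is truncated
    return "".join(parts)


# 4094 ASCII bytes followed by a 3-byte character: the first chunk ends
# after only two of the three bytes of the check mark.
log = "a" * 4094 + "\u2713 done\n"
chunks = split_chunks(log.encode("utf-8"))

try:
    decode_naive(chunks)
except UnicodeDecodeError as err:
    print("naive decoding failed:", err)

print(decode_incremental(chunks) == log)
```

The incremental decoder is one way to handle this on the receiving side; sending the chunks as binary websocket messages, as proposed above, avoids the sender ever having to produce valid UTF-8 per message.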
@kengz, thanks for the issue. The bug will be fixed with the next release.
Describe the bug
When running a Conda install with large dependencies (e.g. pytorch) and creating artifacts from it, the run occasionally fails with:
'utf-8' codec can't decode bytes in position 4094-4095: unexpected end of data
Version
dstack CLI version: 0.1
Minimal example
Use the main branch commit kengz/lean-dl-example@f4c06f0
Run
dstack run setup-conda
Steps to reproduce
Use the main branch commit kengz/lean-dl-example@f4c06f0
Run
dstack run setup-conda
Try it multiple times, since the error is intermittent. It only happens when running with dstack local, never when running conda directly on the user's machine.
Expected behavior
The run finishes normally. It does so when rerun, so the failure is intermittent.
Logs
6cda13c1f54b4a31b8e1a5038b4a4433.zip