-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add option not to abort on cuda malloc errors #1083
Comments
The goal of returning an error instead of crashing is good, we definitely need to do that. The problem is that much of the existing code does not check the error code returned by the |
Tks Diego. |
As today ggml force aborts the process whenever there is a cuda malloc failure: eg:
This is not ideal for some production context in which we need to have a controlled way to return an OOM error and exit/reload/resume/skip gracefully.
Would you mind if I:
?
Note:
Best
W.
The text was updated successfully, but these errors were encountered: