You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to use mistral.rs to run MiniCPM-O on Google Cloud Run (with NVIDIA L4 GPU). I created a custom Dockerfile (see at the end) based on Dockerfile.cuda-all and built the latest master branch. The main addition in my Dockerfile is downloading the model and storing it in the Docker image.
Everything works fine when I send a request with text only, e.g.
Describe the bug
I'm trying to use mistral.rs to run MiniCPM-O on Google Cloud Run (with NVIDIA L4 GPU). I created a custom Dockerfile (see at the end) based on
Dockerfile.cuda-all
and built the latest master branch. The main addition in my Dockerfile is downloading the model and storing it in the Docker image.Everything works fine when I send a request with text only, e.g.
When I try sending the example from your docs it fails. The request I'm sending is like this:
I also tried with a different image URL and a base64-encoded image. The result is always the same, an error like this in the logs:
Full log:
downloaded-logs-20250228-092532.txt
Dockerfile + Cloud Build config: https://gist.github.com/jgonera/3c792ee3f44ec1fc12ba7ede7f723550
Full request-making code: https://gist.github.com/jgonera/326ff5d1612a72d0b80194636146f38c
Am I missing something obvious? I'd appreciate any help!
Latest commit or version
e2f9648
The text was updated successfully, but these errors were encountered: