Sample code using HF

#29

by vanshils - opened Jun 4, 2024

Jun 4, 2024

Would it be possible to provide a sample code for inference of chat completion request using AutoModelForCausalLM and AutoTokenizer which gives same behaviour as mistral-chat?

vanshils

Jun 4, 2024

Thanks for updating the model card @ybelkada .
If possible could you please update the example/or provide a new one with usage of chat template. Currently the template is a little bit hard to find as we have to dive in mistral-common codebase to see how they perform encode_chat_completion_request.
Once again thanks for the example.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment