mlx_lm.server gives wonky answers

#49
by conleysa

Hello! I am noticing that when I run llama-3-8b-instruct using mlx_lm.server, I get strange answers: for example, I send it a query and it responds with information about dog breeds. On the other hand, if I load the model with mlx_lm.load and call mlx_lm.generate directly, I get reasonable responses.
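
For reference, this is roughly what the direct path looks like for me (the model path and prompt are just examples):

```python
from mlx_lm import load, generate

# Example model path; any MLX-converted Llama 3 instruct model behaves the same for me
model, tokenizer = load("mlx-community/Meta-Llama-3-8B-Instruct-4bit")

# Llama 3 instruct models expect their chat template, so apply it before generating
messages = [{"role": "user", "content": "Write a SQL query that counts rows per day."}]
prompt = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=False
)

response = generate(model, tokenizer, prompt=prompt, verbose=True)
```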

Is there any reason the new llama-3 shouldn't work when served via mlx_lm.server?

I can run llama-2-13b as a server the same way and get reasonable responses.
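
In case it helps, this is roughly how I'm querying the server (assuming it was started with mlx_lm.server and is listening on the default localhost:8080 with the OpenAI-style chat completions endpoint):

```python
import requests

# Assumes the server was started with something like:
#   mlx_lm.server --model mlx-community/Meta-Llama-3-8B-Instruct-4bit
# and is listening on the default host/port (localhost:8080)
resp = requests.post(
    "http://localhost:8080/v1/chat/completions",
    json={
        "messages": [
            {"role": "user", "content": "Write a SQL query that counts rows per day."}
        ],
        "max_tokens": 256,
    },
    timeout=60,
)
print(resp.json()["choices"][0]["message"]["content"])
```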
