How to solve Model inference speed is too slow?

#9
by lanshan - opened

I have got the sft model, when I use the model to inference I find it is slowly, do you have some way to accelerate。

Sign up or log in to comment