performance thoughts

by RonanMcGovern - opened Jan 29

Jan 29

Thanks for making this model.

It's interesting how it seems weaker than Phi-2 - at least on coding. I notice there is an OpenHermes fine-tune too and it has the same issue (e.g. fails to write a function to add up the first N fibonacci numbers).

Any thoughts on why this might be the case?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment