Fizzarolli (Fizzarolli 🏳️‍⚧️)

Posts 2

Post

280

hi everyone!

i wanted to share an experiment i did with upcycling phi-3 mini into an moe recently.
while benchmarks are definitely within a margin of error and they performed similarly, i think it's an interesting base to try and see if you can improve phi's performance! (maybe looking into HuggingFaceFW/fineweb-edu could be interesting, i also left some other notes if anyone with more compute access wants to try it themselves)

check it out! Fizzarolli/phi3-4x4b-v1

Post

1308

Is anyone looking into some sort of decentralized/federated dataset generation or classification by humans instead of synthetically?

From my experience with trying models, a *lot* of modern finetunes are trained on what amounts to, in essence, GPT-4 generated slop that makes everything sound like a rip-off GPT-4 (refer to i.e. the Dolphin finetunes). I have a feeling that this is a lot of the reason people haven't been quite as successful as Meta's instruct tunes of Llama 3.

models 14

datasets 5

Fizzarolli/wattpad_prompt_completion

Viewer • Updated 9 days ago • 10

Fizzarolli/wattpad

Viewer • Updated Apr 28 • 23

Fizzarolli/rpguild_and_bluemoon

Viewer • Updated Apr 19

Fizzarolli/rpguild_processed

Viewer • Updated Apr 16

Fizzarolli/bluemoon_processeed

Viewer • Updated Apr 14 • 1

Fizzarolli 🏳️‍⚧️

AI & ML interests

Organizations

Posts 2

models 14

Fizzarolli/phi3-4x4b-v1

Fizzarolli/phencyclidine-8b-v1

Fizzarolli/phi3-4x4b-uninitialized

Fizzarolli/llama-3-lust-8b-v0.2

Fizzarolli/llama-3-lust-8b-v0.2-adapter

Fizzarolli/llama-3-lust-8b-step-748

Fizzarolli/llama-3-lust-8b-v0.1

Fizzarolli/lust-7b

Fizzarolli/lust-7b-GGUF

Fizzarolli/LayliticDolphinOpus

datasets 5

Fizzarolli/wattpad_prompt_completion

Fizzarolli/wattpad

Fizzarolli/rpguild_and_bluemoon

Fizzarolli/rpguild_processed

Fizzarolli/bluemoon_processeed

Fizzarolli 🏳️‍⚧️

AI & ML interests

Organizations

Posts 2

models 14 Sort: Recently updated

datasets 5 Sort: Recently updated

models 14

datasets 5