-
microsoft/Phi-3-mini-4k-instruct
Text Generation β’ Updated β’ 65.2k β’ 360 -
microsoft/Phi-3-mini-128k-instruct
Text Generation β’ Updated β’ 75.2k β’ 855 -
microsoft/Phi-3-mini-4k-instruct-gguf
Text Generation β’ Updated β’ 37.4k β’ 213 -
microsoft/Phi-3-mini-128k-instruct-onnx
Text Generation β’ Updated β’ 110
Collections
Discover the best community collections!
Collections trending this week
-
meta-llama/Meta-Llama-3-8B
Text Generation β’ Updated β’ 445k β’ 2.65k -
meta-llama/Meta-Llama-3-8B-Instruct
Text Generation β’ Updated β’ 614k β’ 1.5k -
meta-llama/Meta-Llama-3-70B-Instruct
Text Generation β’ Updated β’ 68.6k β’ 770 -
meta-llama/Meta-Llama-3-70B
Text Generation β’ Updated β’ 286k β’ 497
-
Improved Baselines with Visual Instruction Tuning
Paper β’ 2310.03744 β’ Published β’ 32 -
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Paper β’ 2403.05525 β’ Published β’ 36 -
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
Paper β’ 2308.12966 β’ Published β’ 6 -
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model
Paper β’ 2404.01331 β’ Published β’ 22
-
HuggingFaceFW/ablation-model-fineweb-v1
Text Generation β’ Updated β’ 152 β’ 7 -
HuggingFaceFW/ablation-model-refinedweb
Text Generation β’ Updated β’ 21 β’ 1 -
HuggingFaceFW/ablation-model-c4
Text Generation β’ Updated β’ 24 β’ 2 -
HuggingFaceFW/ablation-model-dolma-v1_6
Text Generation β’ Updated β’ 15 β’ 1
-
chargoddard/Yi-34B-Llama
Text Generation β’ Updated β’ 3.5k β’ 56 -
yunconglong/Truthful_DPO_TomGrc_FusionNet_7Bx2_MoE_13B
Text Generation β’ Updated β’ 3.56k β’ 51 -
fblgit/UNA-SimpleSmaug-34b-v1beta
Text Generation β’ Updated β’ 2.31k β’ 17 -
cloudyu/TomGrc_FusionNet_34Bx2_MoE_v0.1_DPO_f16
Text Generation β’ Updated β’ 2.52k β’ 13