view article Article Adapt custom AI models to the trainer API and to 🤗 By not-lain • 3 days ago • 13
view article Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • about 18 hours ago • 11
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation Paper • 2405.09546 • Published 1 day ago • 6
Xmodel-VLM: A Simple Baseline for Multimodal Vision Language Model Paper • 2405.09215 • Published 2 days ago • 9
ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models Paper • 2405.09220 • Published 2 days ago • 15
SpeechVerse: A Large-scale Generalizable Audio Language Model Paper • 2405.08295 • Published 3 days ago • 7
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models Paper • 2405.08317 • Published 3 days ago • 7
Understanding the performance gap between online and offline alignment algorithms Paper • 2405.08448 • Published 3 days ago • 9
No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding Paper • 2405.08344 • Published 3 days ago • 9
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory Paper • 2405.08707 • Published 3 days ago • 18
Coin3D: Controllable and Interactive 3D Assets Generation with Proxy-Guided Conditioning Paper • 2405.08054 • Published 3 days ago • 14
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published 2 days ago • 13
VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models Paper • 2403.06098 • Published Mar 10 • 15
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 49 items • Updated 1 day ago • 10
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published 14 days ago • 90
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels Paper • 2405.07526 • Published 4 days ago • 12
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training Paper • 2405.06932 • Published 6 days ago • 14
LogoMotion: Visually Grounded Code Generation for Content-Aware Animation Paper • 2405.07065 • Published 5 days ago • 13
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots Paper • 2405.07990 • Published 3 days ago • 15
SUTRA: Scalable Multilingual Language Model Architecture Paper • 2405.06694 • Published 9 days ago • 33
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 10 items • Updated 2 days ago • 80
CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts Paper • 2405.05949 • Published 7 days ago • 2
MAmmoTH2 Collection Scaling up instruction data from the web for to build better LLMs • 10 items • Updated 6 days ago • 4
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper • 2404.14619 • Published 24 days ago • 120
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published 18 days ago • 105
view article Article 🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets By dvilasuero • 20 days ago • 54
Searching for Better ViT Baselines Collection Exploring ViT hparams and model shapes for the GPU poor (between tiny and base). • 15 items • Updated 3 days ago • 8
view article Article Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia? By davanstrien • 10 days ago • 6
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints 16 days ago • 48
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent 25 days ago • 71
Arctic Collection A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated 23 days ago • 18
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 10 items • Updated 5 days ago • 116
view article Article StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation 18 days ago • 68
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published 25 days ago • 230
Quantized-FT-Orca-Math Collection Models trained during quantization aware fine-tuning experiments using PyTorch's FSDP. • 8 items • Updated about 1 month ago • 6
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated 28 days ago • 516
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 11 days ago • 75