Blog, Articles, and discussions

Making thousands of open LLMs bloom in the Vertex AI Model Garden

By April 10, 2024 • 16

Community Articles

view all

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

1 day ago

• 11

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

•

1 day ago

• 4

Understanding IPOs: A Comprehensive Guide

•

1 day ago

• 1

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

•

1 day ago

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

•

7 days ago

• 12

seemore: Implement a Vision Language Model from Scratch

•

7 days ago

• 40

Google Search with LLM

•

8 days ago

• 4

Token Merging for fast LLM inference : Background and first trials with Mistral

•

8 days ago

• 1

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

•

10 days ago

• 24

Expanding Model Context and Creating Chat Models with a Single Click

•

10 days ago

• 25

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

12 days ago

• 49

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

•

12 days ago

• 4

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

•

13 days ago

• 9

Can We Train Chat Models with Raw Data?

•

14 days ago

• 17

RealWorldQA, What's New?

•

14 days ago

• 6

Bringing serverless GPU inference to Hugging Face users

By April 2, 2024 • 8

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

By March 20, 2024 • 3

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

By March 18, 2024

Text-Generation Pipeline on Intel® Gaudi® 2 AI Accelerator

By February 29, 2024 guest • 1

Hugging Face Text Generation Inference available for AWS Inferentia2

By February 1, 2024 • 1

Hugging Face and Google partner for open AI collaboration

By January 25, 2024 • 1

Introducing SafeCoder

By August 22, 2023

Hugging Face Platform on the AWS Marketplace: Pay with your AWS Account

By August 10, 2023

Fine-tuning Stable Diffusion models on Intel CPUs

By July 14, 2023

Accelerating Vision-Language Models: BridgeTower on Habana Gaudi2

By June 29, 2023

Welcome fastText to the 🤗 Hub

By June 6, 2023

Introducing HuggingFace blog for Chinese speakers: Fostering Collaboration with the Chinese AI community

By April 24, 2023 • 1

Accelerating Hugging Face Transformers with AWS Inferentia2

By April 17, 2023

Fast Inference on Large Language Models: BLOOMZ on Habana Gaudi2 Accelerator

By March 28, 2023 • 1

Community Articles

view all

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

•

1 day ago

• 11

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

•

1 day ago

• 4

Understanding IPOs: A Comprehensive Guide

•

1 day ago

• 1

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

•

1 day ago

Mergoo: Efficiently Build Your Own MoE LLM

•

2 days ago

• 29

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

3 days ago

• 21

Top 5 Webflow Agencies Focused On Building Brands For The Future

•

3 days ago

• 1

🧑‍⚖️ "Replacing Judges with Juries" using distilabel

•

5 days ago

• 14

Fish Speech V1 - New Multilingual Open Source TTS Model

•

6 days ago

• 3

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

•

7 days ago

• 12

seemore: Implement a Vision Language Model from Scratch

•

7 days ago

• 40

Google Search with LLM

•

8 days ago

• 4

Token Merging for fast LLM inference : Background and first trials with Mistral

•

8 days ago

• 1

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

•

10 days ago

• 24

Expanding Model Context and Creating Chat Models with a Single Click

•

10 days ago

• 25

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

12 days ago

• 49

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

•

12 days ago

• 4

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

•

13 days ago

• 9

Can We Train Chat Models with Raw Data?

•

14 days ago

• 17

RealWorldQA, What's New?

•

14 days ago

• 6