derek-thomas (Derek Thomas)

upvoted a collection 2 days ago

Embedding Model Datasets

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 51 items • Updated 8 days ago • 24

upvoted an article 3 days ago

Article

Benchmarking Text Generation Inference

4 days ago

• 17

upvoted an article 6 days ago

Article

Unlocking Longer Generation with Key-Value Cache Quantization

17 days ago

• 12

upvoted 3 articles 11 days ago

Article

Hugging Face on AMD Instinct MI300 GPU

12 days ago

• 7

Article

Build AI on premise with Dell Enterprise Hub

12 days ago

• 13

Article

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

12 days ago

• 8

upvoted 3 papers 12 days ago

vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention

Paper • 2405.04437 • Published 25 days ago • 3

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published about 1 month ago • 102

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published 17 days ago • 73

upvoted 9 articles 13 days ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

19 days ago

• 131

Article

Hugging Face x LangChain : A new partner package in LangChain

19 days ago

• 70

Article

License to Call: Introducing Transformers Agents 2.0

20 days ago

• 88

Article

Subscribe to Enterprise Hub with your AWS Account

24 days ago

• 6

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

24 days ago

• 7

Article

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

30 days ago

• 13

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 53

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 46

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29

• 69

upvoted 7 articles 18 days ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Apr 19

• 70

Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Apr 16

• 11

Article

Introducing the Open Arabic LLM Leaderboard

19 days ago

• 47

Article

Running Privacy-Preserving Inference on Hugging Face Endpoints

Apr 16

• 17

Article

Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face

Apr 16

• 6

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 134

Article

Vision Language Models Explained

Apr 11

• 92

upvoted 9 articles 19 days ago

Article

Hugging Face partners with Wiz Research to Improve AI Security

Apr 4

• 11

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Apr 3

• 6

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Apr 4

• 20

Article

Public Policy at Hugging Face

Apr 8

• 17

Article

Bringing serverless GPU inference to Hugging Face users

Apr 2

• 9

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22

• 39

Article

Introducing the Chatbot Guardrails Arena

Mar 21

• 4

Article

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Mar 20

• 24

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Mar 20

• 20

upvoted 8 articles 25 days ago

Article

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

Mar 15

• 5

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23

• 14

Article

Synthetic data: save money, time and carbon with open source

Feb 16

• 28

Article

From OpenAI to Open LLMs with Messages API

Feb 8

• 6

Article

NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates

Feb 2

• 2

Article

Constitutional AI with Open LLMs

Feb 1

• 5

Article

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

Jan 31

• 3

Article

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30

• 1

upvoted 6 articles 27 days ago

Article

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Jan 29

• 6

Article

Open-source LLMs as LangChain Agents

Jan 24

• 12

Article

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

Jan 19

• 8

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

Jan 18

• 20

Article

A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard

Jan 12

• 4

Article

Faster fine-tuning using TRL & Unsloth

Jan 10

• 20

upvoted 2 papers 27 days ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30 • 41

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25 • 52

upvoted 3 papers about 1 month ago

upvoted a paper about 2 months ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 102

upvoted a collection about 2 months ago

Chronos Models

Collection

Chronos: Pretrained (language) models for time series forecasting based on the T5 architecture. • 6 items • Updated Mar 18 • 25

upvoted 2 papers about 2 months ago

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Paper • 2403.14403 • Published Mar 21 • 6

Gecko: Versatile Text Embeddings Distilled from Large Language Models

Paper • 2403.20327 • Published Mar 29 • 44

upvoted 3 papers 2 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 176

Learning to Route Among Specialized Experts for Zero-Shot Generalization

Paper • 2402.05859 • Published Feb 8 • 4

FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance

Paper • 2305.05176 • Published May 9, 2023 • 3

Derek Thomas

AI & ML interests

Articles

Benchmarking Text Generation Inference

AI Watermarking 101: Tools and Techniques

Organizations

derek-thomas's activity

Benchmarking Text Generation Inference

Unlocking Longer Generation with Key-Value Cache Quantization

Hugging Face on AMD Instinct MI300 GPU

Build AI on premise with Dell Enterprise Hub

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

PaliGemma – Google's Cutting-Edge Open Vision Language Model

Hugging Face x LangChain : A new partner package in LangChain

License to Call: Introducing Transformers Agents 2.0

Subscribe to Enterprise Hub with your AWS Account

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

Bringing the Artificial Analysis LLM Performance Leaderboard to Hugging Face

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Improving Prompt Consistency with Structured Generations

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

Introducing the Open Arabic LLM Leaderboard

Running Privacy-Preserving Inference on Hugging Face Endpoints

Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Vision Language Models Explained

Hugging Face partners with Wiz Research to Improve AI Security

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

Public Policy at Hugging Face

Bringing serverless GPU inference to Hugging Face users

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Introducing the Chatbot Guardrails Arena

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset

🪆 Introduction to Matryoshka Embedding Models

Synthetic data: save money, time and carbon with open source

From OpenAI to Open LLMs with Messages API

NPHardEval Leaderboard: Unveiling the Reasoning Abilities of Large Language Models through Complexity Classes and Dynamic Updates

Constitutional AI with Open LLMs

Introducing the Enterprise Scenarios Leaderboard: a Leaderboard for Real World Use Cases

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models

Open-source LLMs as LangChain Agents

Fine-Tune W2V2-Bert for low-resource ASR with 🤗 Transformers

Preference Tuning LLMs with Direct Preference Optimization Methods

A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard

Faster fine-tuning using TRL & Unsloth