Blog: Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints • Published 9 days ago
Blog: Distributed Training: Train BART/T5 for Summarization using 🤗 Transformers and Amazon SageMaker • Published Apr 8, 2021
Paper: Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences • arXiv 2404.03715 • Published Apr 4
Paper: Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks • arXiv 2404.14723 • Published 17 days ago
Paper: JetMoE: Reaching Llama2 Performance with 0.1M Dollars • arXiv 2404.07413 • Published 29 days ago
Collection: HF-curated models available on Workers AI • Models curated with Hugging Face that can be run on Cloudflare's Workers AI serverless inference platform • 15 items • Updated Apr 2
Paper: Aligning Modalities in Vision Large Language Models via Preference Fine-tuning • arXiv 2402.11411 • Published Feb 18
Paper: Simple and Scalable Strategies to Continually Pre-train Large Language Models • arXiv 2403.08763 • Published Mar 13
Paper: ORPO: Monolithic Preference Optimization without Reference Model • arXiv 2403.07691 • Published Mar 12
Collection: Awesome SFT datasets • A curated list of interesting datasets for fine-tuning language models • 43 items • Updated 28 days ago
Collection: Distil-Whisper Models • The first version of the Distil-Whisper models, released with the Distil-Whisper paper • 4 items • Updated Mar 21
Collection: Zephyr 7B • Models, datasets, and demos associated with Zephyr 7B; for code to train the models, see https://github.com/huggingface/alignment-handbook • 9 items • Updated 28 days ago
Paper: Textbooks Are All You Need II: phi-1.5 technical report • arXiv 2309.05463 • Published Sep 11, 2023