Moritz Laurer's picture

Moritz Laurer

MoritzLaurer

·

https://www.linkedin.com/in/moritz-laurer/

AI & ML interests

None yet

Articles

Synthetic data: save money, time and carbon with open source

Organizations

MoritzLaurer's activity

upvoted a collection 5 days ago

NuNerZero - Zero Shot NER

The best compact Zero-Shot NER models with MIT license • 4 items • Updated 9 days ago • 11

upvoted an article 8 days ago

Article

Improving Prompt Consistency with Structured Generations

19 days ago

• 41

upvoted a paper 11 days ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published 20 days ago • 107

upvoted a paper 12 days ago

What matters when building vision-language models?

Paper • 2405.02246 • Published 16 days ago • 73

upvoted 2 articles 13 days ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

18 days ago

• 50

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 126

upvoted a collection 19 days ago

PDF Document / OCR Datasets

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 36

upvoted a collection 24 days ago

OpenELM Instruct Models

4 items • Updated Apr 12 • 96

upvoted a collection about 1 month ago

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 13 days ago • 76

upvoted a paper about 1 month ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 54

upvoted an article about 1 month ago

Article

Total noob’s intro to Hugging Face Transformers

Mar 22

• 19

upvoted 2 papers 2 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 123

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Paper • 2402.12366 • Published Feb 19 • 3

upvoted 3 collections 3 months ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 23

🤗 Spaces Helper

5 items • Updated Mar 19 • 2

⛔️🔦 Provenance, Watermarking & Deepfake Detection

Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1 • 36

upvoted a paper 3 months ago

Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8 • 16

upvoted a paper 4 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 135

upvoted a collection 4 months ago

Universal token classification

Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks. • 5 items • Updated Jan 15 • 7

upvoted 3 papers 5 months ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 72

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1 • 21

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Paper • 2311.14904 • Published Nov 25, 2023 • 3

upvoted a paper 6 months ago

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 78

upvoted 2 collections 6 months ago

Seamless Communication

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 123

Zeroshot Classifiers

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 76

upvoted 4 papers 7 months ago

JudgeLM: Fine-tuned Large Language Models are Scalable Judges

Paper • 2310.17631 • Published Oct 26, 2023 • 31

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 116

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Paper • 2310.15941 • Published Oct 24, 2023 • 6

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 49

upvoted 2 papers 8 months ago

tasksource: Structured Dataset Preprocessing Annotations for Frictionless Extreme Multi-Task Learning and Evaluation

Paper • 2301.05948 • Published Jan 14, 2023 • 3

Nougat: Neural Optical Understanding for Academic Documents

Paper • 2308.13418 • Published Aug 25, 2023 • 33