lunarflu (Adam Molnar)

upvoted 4 articles 4 days ago

Article

Exploration of Job Application Automation with Data Scraping

By

•

10 days ago

• 3

Article

Glaze and the Effectiveness of Anti-AI Methods for Diffusion Models

By

•

5 days ago

• 1

Article

Synthetic dataset generation techniques: Self-Instruct

By

•

5 days ago

• 3

Article

Hugging Face + Google Visual Blocks

By

•

4 days ago

• 16

upvoted 2 collections 4 days ago

LlamaForTokenClassification

Collection

Fine Tuned llama variants for Token Classification • 6 items • Updated 7 days ago • 2

Terminus XL

Collection

v-prediction SDXL clone with zero-terminal SNR noise schedule • 8 items • Updated 26 days ago • 5

upvoted 2 articles 5 days ago

Article

2024-04-22 - Hub Incident Post Mortem

By

•

3 days ago

• 15

Article

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

By

•

4 days ago

• 15

upvoted an article 6 days ago

Article

License to Call: Introducing Transformers Agents 2.0

7 days ago

• 65

upvoted a collection 6 days ago

OCR Quality Classifiers

Collection

predict OCR quality • 4 items • Updated 8 days ago • 2

upvoted 5 articles 7 days ago

Article

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

By

•

13 days ago

• 1

Article

Advancing Open-source Large Language Models in the Medical & Healthcare Domain

By

•

10 days ago

• 3

Article

Everything About Long Context Fine-tuning

By

•

10 days ago

• 9

Article

Adapt custom AI models to the trainer API and to 🤗

By

•

6 days ago

• 15

Article

Knowledge Distillation for Fine-Tuning a GPT-3.5 Judge: Enhancing Accuracy and Performance

By

•

7 days ago

• 4

upvoted 6 articles 11 days ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By

•

14 days ago

• 24

Article

Mergoo: Efficiently Build Your Own MoE LLM

By

•

13 days ago

• 32

Article

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

By

•

13 days ago

• 6

Article

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

By

•

11 days ago

• 6

Article

Energy Star Ratings for AI Models

By

•

11 days ago

• 15

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

By

•

13 days ago

• 21

upvoted 2 articles 17 days ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

22 days ago

• 30

Article

Google Search with LLM

By

•

19 days ago

• 4

upvoted 6 articles 20 days ago

Article

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

By

•

21 days ago

• 25

Article

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

By

•

6 days ago

• 15

Article

Fish Speech V1 - New Multilingual Open Source TTS Model

By

•

17 days ago

• 4

Article

Token Merging for fast LLM inference : Background and first trials with Mistral

By

•

20 days ago

• 1

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

21 days ago

• 69

Article

Improving Prompt Consistency with Structured Generations

20 days ago

• 43

upvoted 8 articles 24 days ago

Article

Red-Teaming Large Language Models

Feb 24, 2023

• 3

Article

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

By

•

26 days ago

• 42

Article

Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors

By

•

26 days ago

• 2

Article

How to Finetune phi-3 on MacBook Pro

By

•

26 days ago

• 58

Article

seemore: Implement a Vision Language Model from Scratch

By

•

8 days ago

• 41

Article

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

By

•

24 days ago

• 10

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

By

•

24 days ago

• 54

Article

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

By

•

24 days ago

• 6

upvoted an article 26 days ago

Article

RealWorldQA, What's New?

By

•

25 days ago

• 6

upvoted a collection 27 days ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Apr 18 • 525

upvoted 3 articles 27 days ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18

• 239

Article

Introducing the Open Chain of Thought Leaderboard

27 days ago

• 20

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

28 days ago

• 71

upvoted a paper 27 days ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published 28 days ago • 230

upvoted an article about 1 month ago

Article

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

By

•

Apr 18

• 20

upvoted a paper about 1 month ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 56

upvoted 3 articles about 1 month ago

Article

Making thousands of open LLMs bloom in the Vertex AI Model Garden

Apr 10

• 16

Article

Vision Language Models Explained

Apr 11

• 82

Article

Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face

Apr 16

• 6

upvoted 2 papers about 1 month ago

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 101

AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent

Paper • 2404.03648 • Published Apr 4 • 22

upvoted an article about 1 month ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 127

upvoted 9 papers about 1 month ago

Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Models

Paper • 2310.17086 • Published Oct 26, 2023 • 1

IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations

Paper • 2404.01266 • Published Apr 1 • 1

Plug-In Inversion: Model-Agnostic Inversion for Vision with Data Augmentations

Paper • 2201.12961 • Published Jan 31, 2022 • 1

Adam Molnar

AI & ML interests

Organizations

lunarflu's activity

Exploration of Job Application Automation with Data Scraping

Glaze and the Effectiveness of Anti-AI Methods for Diffusion Models

Synthetic dataset generation techniques: Self-Instruct

Hugging Face + Google Visual Blocks

2024-04-22 - Hub Incident Post Mortem

Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task

License to Call: Introducing Transformers Agents 2.0

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

Advancing Open-source Large Language Models in the Medical & Healthcare Domain

Everything About Long Context Fine-tuning

Adapt custom AI models to the trainer API and to 🤗

Knowledge Distillation for Fine-Tuning a GPT-3.5 Judge: Enhancing Accuracy and Performance

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Mergoo: Efficiently Build Your Own MoE LLM

Can we create pedagogically valuable multi-turn synthetic datasets from Cosmopedia?

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

Energy Star Ratings for AI Models

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Expanding Model Context and Creating Chat Models with a Single Click

Google Search with LLM

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI

Fish Speech V1 - New Multilingual Open Source TTS Model

Token Merging for fast LLM inference : Background and first trials with Mistral

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Improving Prompt Consistency with Structured Generations

Red-Teaming Large Language Models

LLM Comparison/Test: Llama 3 Instruct 70B + 8B HF/GGUF/EXL2 (20 versions tested and compared!)

Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors

How to Finetune phi-3 on MacBook Pro

seemore: Implement a Vision Language Model from Scratch

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Estimating Memory Consumption of LLMs for Inference and Fine-Tuning for Cohere Command-R+

RealWorldQA, What's New?

Welcome Llama 3 - Meta's new open LLM

Introducing the Open Chain of Thought Leaderboard

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

Making thousands of open LLMs bloom in the Vertex AI Model Garden

Vision Language Models Explained

Ryght’s Journey to Empower Healthcare and Life Sciences with Expert Support from Hugging Face

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community