view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22 • 35
Graph Neural Prompting with Large Language Models Paper • 2309.15427 • Published Sep 27, 2023 • 1
Matryoshka Embedding Models Collection https://huggingface.co/blog/matryoshka • 12 items • Updated 5 days ago • 10
Efficient Estimation of Word Representations in Vector Space Paper • 1301.3781 • Published Jan 16, 2013 • 6
Unifying Large Language Models and Knowledge Graphs: A Roadmap Paper • 2306.08302 • Published Jun 14, 2023 • 4
Research Lessons Collection understanding important lessons from machine learning research • 2 items • Updated Mar 23 • 2
SentencePiece: A simple and language independent subword tokenizer and detokenizer for Neural Text Processing Paper • 1808.06226 • Published Aug 19, 2018 • 1
Finetuned Language Models Are Zero-Shot Learners Paper • 2109.01652 • Published Sep 3, 2021 • 2
LLaMA: Open and Efficient Foundation Language Models Paper • 2302.13971 • Published Feb 27, 2023 • 11
Adapting Large Language Models via Reading Comprehension Paper • 2309.09530 • Published Sep 18, 2023 • 69
Universal Language Model Fine-tuning for Text Classification Paper • 1801.06146 • Published Jan 18, 2018 • 6
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness Paper • 2205.14135 • Published May 27, 2022 • 8
Datasets based on UltraFeedback Collection This collection contains some datasets created on top of UltraFeedback using Argilla for the dataset exploration and curation, sorted by release date. • 6 items • Updated Mar 19 • 10
Notus 7B v1 Collection Notus 7B v1 models (DPO fine-tune of Zephyr SFT) and datasets used. More information at https://github.com/argilla-io/notus • 11 items • Updated Dec 28, 2023 • 17