[Chinese Version] Mixtral-8x7B model | Chinese Mixtral-8x7B model
#73 opened 5 months ago by wangrongsheng
Update the deprecated Flash Attention call parameter in from_pretrained() method
#72 opened 5 months ago by DeathReaper0965
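For context on the deprecation discussed in #72: in recent transformers releases (assumed >= 4.36) the boolean `use_flash_attention_2` flag was replaced by the string-valued `attn_implementation` parameter. A minimal sketch of the rename; the actual model load is commented out because it needs a GPU, the flash-attn package, and the weights:

```python
# Hedged sketch of the kwarg rename (assumes transformers >= 4.36).
old_kwargs = {"use_flash_attention_2": True}               # deprecated form
new_kwargs = {"attn_implementation": "flash_attention_2"}  # current form

# The real call would look like this (commented out: heavy download + GPU):
# from transformers import AutoModelForCausalLM
# model = AutoModelForCausalLM.from_pretrained(
#     "mistralai/Mixtral-8x7B-Instruct-v0.1",
#     torch_dtype="auto",
#     **new_kwargs,
# )
print(new_kwargs["attn_implementation"])
```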
Can't load the model
#71 opened 5 months ago by JayZhang1 · 2 replies
What is the best way to run inference with a LoRA adapter using the PEFT approach?
#70 opened 5 months ago by Pradeep1995 · 8 replies
How to use system prompt?
#69 opened 5 months ago by mznw · 1 reply
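On the system-prompt question in #69: the Mixtral-Instruct chat template uses `[INST] ... [/INST]` blocks and has no dedicated system role, so a common workaround is to prepend the system text to the first user turn. A minimal sketch, assuming the `[INST]` format from the model card; `build_prompt` is a hypothetical helper, not a library function:

```python
# Hedged sketch: fold the system instructions into the first [INST] block,
# since the Mixtral-Instruct template defines no separate system role.
def build_prompt(system: str, user: str) -> str:
    # Prepend the system text to the first user message.
    return f"<s>[INST] {system}\n\n{user} [/INST]"

prompt = build_prompt(
    "You are a concise assistant.",
    "Explain mixture-of-experts in one sentence.",
)
print(prompt)
```

With a tokenizer available, `tokenizer.apply_chat_template` on a list of role/content dicts is the more robust route, since it applies the model's own template.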
Is there a simple way to solve the problem of redundant output?
#68 opened 5 months ago by jjplane · 3 replies
What is the correct way to store adapters after PEFT fine-tuning?
#67 opened 5 months ago by Pradeep1995 · 4 replies
Failed to import transformers.models.mixtral.modeling_mixtral because of the following error (look up to see its traceback): libcudart.so.12: cannot open shared object file: No such file or directory
#66 opened 5 months ago by MukeshSharma · 1 reply
Model not loading, even with 4-bit quantization
#65 opened 5 months ago by soumodeep-semut · 1 reply
Did Mixtral start from Mistral or from scratch?
#64 opened 5 months ago by DaehanKim · 1 reply
How many GPUs do we need to run this out of the box?
#63 opened 5 months ago by kz919 · 3 replies
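A rough aid to the GPU-count question in #63: a back-of-the-envelope weights-only estimate. The ~46.7B total parameter count is an assumption from public descriptions of Mixtral-8x7B, and the figures ignore KV cache and activation memory:

```python
# Back-of-the-envelope memory estimate (weights only; assumes ~46.7e9 params).
params = 46.7e9

def weight_gib(bytes_per_param: float) -> float:
    # Raw weight footprint in GiB at the given precision.
    return params * bytes_per_param / 1024**3

fp16_gib = weight_gib(2)    # half precision: ~87 GiB
int4_gib = weight_gib(0.5)  # 4-bit quantized: ~22 GiB
print(round(fp16_gib), round(int4_gib))
```

In practice that suggests roughly two 80 GB GPUs for fp16, while 4-bit quantization can squeeze onto a single ~24 GB card with little headroom left for the cache.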
Does this model choose experts for every token, or just two experts per input?
#62 opened 5 months ago by PandaMaster
AutoTokenizer.from_pretrained raises an OSError
#61 opened 5 months ago by sean29 · 1 reply
Are .safetensors files necessary to continue SFT training?
#60 opened 5 months ago by hegang126
Incomplete Answers
#59 opened 5 months ago by samparksoftwares · 7 replies
How can we enable continuous learning with the LLM?
#58 opened 5 months ago by Tapendra
Inference generation extremely slow
#57 opened 6 months ago by aledane · 6 replies
Optimizing Mixtral-8x7B-Instruct-v0.1 for Hugging Face Chat
#54 opened 6 months ago by Husain · 1 reply
SageMaker Deployment Error
#53 opened 6 months ago by seabasshn · 11 replies
Process killed during "Loading checkpoint shards"
#52 opened 6 months ago by asmatveev · 1 reply
Playground?
#51 opened 6 months ago by pbourmeau · 1 reply
vectorstore
#50 opened 6 months ago by philgrey · 3 replies
Enable inference API
#49 opened 6 months ago by mrfakename · 2 replies
How to use consolidated.xx.pt?
#47 opened 6 months ago by Wan62 · 1 reply
Model not loading and not printing any error message
#45 opened 6 months ago by robotrage · 2 replies
Open weights?
#43 opened 6 months ago by alanchan808 · 2 replies
Prompt Template for RAG
#42 opened 6 months ago by mox · 1 reply
There is no sliding_window in params.json
#41 opened 6 months ago by Moses25 · 1 reply
Question on further fine-tuning
#40 opened 6 months ago by vldvasi
KeyError: 'mixtral' when following the 'run the model' section
#39 opened 6 months ago by obiwan92 · 2 replies
Update config.json
#38 opened 6 months ago by medmac01
sliding_window
#37 opened 6 months ago by issa130 · 5 replies
Update config.json
#36 opened 6 months ago by issa130
Fix regression
#34 opened 6 months ago by TimeRobber · 2 replies
SageMaker generation speed, timeouts
#33 opened 6 months ago by elanmarkowitz · 1 reply
System Prompt Template
#32 opened 6 months ago by efei
Update config.json
#31 opened 6 months ago by tourist800 · 2 replies
Issue while loading the model: KeyError: 'mixtral'
#30 opened 6 months ago by swapnil3597 · 7 replies
Update README.md
#29 opened 6 months ago by LPFLEO
Text Generation Inference?
#28 opened 6 months ago by silvacarl · 3 replies
ValueError: Error raised by inference API: Model is overloaded
#25 opened 6 months ago by alemaooo · 2 replies
Increase `sliding_window` to 32k
#24 opened 6 months ago by alpindale · 1 reply
Intuition for quality decrease after quantization
#23 opened 6 months ago by krumeto · 4 replies
Error when loading the Mixtral model for testing
#21 opened 6 months ago by ExceedZhang · 3 replies
Discuss benefits of this work
#20 opened 6 months ago by Starlento · 1 reply
Mixtral not generating anything for some prompts
#19 opened 6 months ago by csgxy2022 · 4 replies
Inquiry about Generation Speed
#17 opened 6 months ago by Boyue27 · 5 replies
Error when deploying sagemaker endpoint: Unsupported model type mixtral
#16 opened 6 months ago by harryneal · 7 replies
GPU requirements
#14 opened 6 months ago by YorelNation · 5 replies
🚀 Torrent File for AI Model Download 🚀
#12 opened 6 months ago by Nondzu