南栖

Minami-su

AI & ML interests

NLP,MultiModal,Human intelligence,Autonomous Cognitive,Self-instruction generation, enhanced instruction

Organizations

Minami-su's activity

New activity in deepseek-ai/DeepSeek-V2-Chat 5 days ago

MoE offloading strategy?

2
#8 opened 5 days ago by Minami-su
New activity in Minami-su/IA_14B 2 months ago

Update README.md

#1 opened 2 months ago by Minami-su
New activity in Minami-su/Qwen1.5-7B-Chat_mistral 3 months ago
New activity in Minami-su/Qwen1.5-7B-Chat_llamafy 3 months ago

Adding Evaluation Results

#3 opened 3 months ago by Minami-su

GGUF Creation from Llamafy

6
#1 opened 3 months ago by RonanMcGovern
New activity in OrionStarAI/Orion-14B-Chat 4 months ago

some text are not renamed to Orion

1
#4 opened 4 months ago by J22

llama rename?

1
#3 opened 4 months ago by Minami-su
New activity in cloudyu/Mixtral_34Bx2_MoE_60B 4 months ago

source code and paper?

8
#6 opened 4 months ago by josephykwang
New activity in KnutJaegersberg/Tess-M-34B-2bit 5 months ago

Re-Quantize Model

7
#1 opened 5 months ago by igoforth
New activity in Minami-su/SUS-Chat-34B_2bit 5 months ago

Re-Quantize?

1
#2 opened 5 months ago by igoforth

Hessian context length?

13
#1 opened 5 months ago by KnutJaegersberg
New activity in Minami-su/Yi_34B_Chat_2bit 5 months ago

Hessians?

3
#2 opened 5 months ago by somehumanperson1

Chinese token capabilities?

2
#1 opened 5 months ago by at676
New activity in THUDM/cogvlm-chat-hf 6 months ago
New activity in BelleGroup/BELLE-on-Open-Datasets 12 months ago

tokenizer加载非常的慢

3
#1 opened about 1 year ago by Minami-su