Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Organizations
mgoin's activity
Librarian Bot: Add language metadata for dataset
#2 opened 8 days ago
by
librarian-bot
Inference GPU Ram requirement >60GB
1
#1 opened 9 days ago
by
Ksgk-fy
What conversion process are you using?
2
#2 opened 9 days ago
by
matt-psaltis-devbricks
What is Marlin?
2
#1 opened about 1 month ago
by
Samvanity
Inference Issues
5
#1 opened about 1 month ago
by
qeternity
Update README.md
#2 opened 2 months ago
by
shubhrapandit
New activity in
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_70-quantized-deepsparse
2 months ago
Update README.md
#1 opened 2 months ago
by
shubhrapandit
New activity in
neuralmagic/Llama-2-7b-dolphin-open_platypus-pruned_50-quantized-deepsparse
2 months ago
Update README.md
#1 opened 2 months ago
by
shubhrapandit
Update README.md
#1 opened 2 months ago
by
shubhrapandit
Update README.md
#1 opened 2 months ago
by
shubhrapandit
Update README.md
#1 opened 2 months ago
by
abhinavnmagic
Update README.md
#1 opened 2 months ago
by
abhinavnmagic
Update README.md
#1 opened 2 months ago
by
abhinavnmagic
Update README.md
#1 opened 2 months ago
by
abhinavnmagic
Update README.md
#1 opened 2 months ago
by
abhinavnmagic
Update README.md
#1 opened 2 months ago
by
alexmarques
Update README.md
#1 opened 2 months ago
by
alexmarques
Update README.md
#1 opened 2 months ago
by
alexmarques
Update README.md
#1 opened 2 months ago
by
alexmarques
Update README.md
#1 opened 2 months ago
by
alexmarques
Update README.md
#4 opened 5 months ago
by
chrisxx
Update README with model author names and speedup numbers.
#3 opened 5 months ago
by
jen
Update README.md
#1 opened 5 months ago
by
wendlerc
Adding `safetensors` variant of this model
#1 opened 6 months ago
by
mgoin
Create README.md
#2 opened 8 months ago
by
mgoin