arxiv:2402.04249
Long Phan
justinphan3110
AI & ML interests
NLP
Organizations
Papers
2
models
4
datasets
17
justinphan3110/wildchat_over_refusal
Viewer
•
Updated
justinphan3110/scruples
Viewer
•
Updated
•
102
justinphan3110/wmdp-test
Viewer
•
Updated
•
305
justinphan3110/mmlu-test
Viewer
•
Updated
justinphan3110/harmful_harmless_instructions_llama2_chat
Viewer
•
Updated
justinphan3110/repe_emotions_concept_llama2_chat
Viewer
•
Updated
justinphan3110/sharegpt_instructions_small_en_vi_answers
Viewer
•
Updated
justinphan3110/sharegpt_instructions_small
Viewer
•
Updated
•
24
justinphan3110/100_harmless_harmful_behaviors_vicuna
Viewer
•
Updated
justinphan3110/harmful_harmless_instructions_vicuna
Viewer
•
Updated