Traditional Chinese corpus collection for LLM training (pre-training, instruction-tuning, and RLHF/alignment).
Oscar, Li
liswei
AI & ML interests
Multimodal Deep Learning, Natural Language Processing, Efficient Fine-Tuning
Organizations
None yet
Collections
1
models
2
datasets
10
liswei/news-collection-zhtw
Viewer
•
Updated
•
23
liswei/en2tw-alignment-sft
Viewer
•
Updated
•
68
•
1
liswei/zhtw-news-and-articles-2B
Viewer
•
Updated
•
410
•
2
liswei/wikinews-zhtw-dedup
Viewer
•
Updated
•
11
liswei/wikipedia-zhtw-dedup
Viewer
•
Updated
•
38
liswei/common-crawl-zhtw
Viewer
•
Updated
•
42
•
1
liswei/coct-en-zhtw-dedup
Viewer
•
Updated
•
11
liswei/c4-zhtw
Viewer
•
Updated
•
1
liswei/rm-static-zhTW
Viewer
•
Updated
•
1
•
29
liswei/NTU-Tree
Viewer
•
Updated
•
13
•
2