Vietnamese OCR dataset, including word-level and line-level image data.
Data Studio
AI & ML interests
Datasets for machine learning.
Organization Card
Contact me if needed: minhquannguyen1800@gmail.com (Minh Quan)
Data Information:
OCR
Vietnamese Document with Red Seal: 223,830 samples
Vietnamese Document with Black Seal: 71,970 samples
Vietnamese Document: 1,305,220 samples
Vietnamese Document with Underlined Text: 365,919 samples
Vietnamese Document with 5-Color Highlight: 135,295 samples
Vietnamese Document with Yellow Highlight: 174,282 samples
High-Quality Vietnamese Document: 22,524 samples
Text-to-Speech
More than 1,000 hours of Vietnamese male and female voice recordings: 1,000,000 audio files
Datasets: 60
DataStudio/Vietnamese_Text2Speech_AB4
DataStudio/Vietnamese_Text2Speech_AB3
DataStudio/Vietnamese_Text2Speech_AB2
DataStudio/Vietnamese_Text2Speech_AB1
DataStudio/S2W_format
DataStudio/Vietnamese_ASR_TestingData
DataStudio/Viet-wikipedia
DataStudio/T2S_dataset_v2
DataStudio/Vietnamese_ASR_TestingData_Old
DataStudio/OCRWordLevelClear_07