A high quality Vietnamese pretraining dataset for LLMs
Nguyễn Tiến Khôi
zerostratos
AI & ML interests
robots
Recent Activity
liked a dataset 1 day ago
cmu-lti/machine-translation-for-vision liked a dataset 5 days ago
ulab-ai/swm-bench liked a model 5 days ago
cmu-lti/osim-8bOrganizations
models 27
zerostratos/unsloth_finetune_deepseek-ocr-1000
Image Feature Extraction • 3B • Updated • 2
zerostratos/unsloth_finetune_deepseek-ocr-500
Image Feature Extraction • 3B • Updated • 2
zerostratos/unsloth_finetune_deepseek-ocr-500-adapter
Updated
zerostratos/unsloth_finetune_deepseek-ocr-200-adapter
Updated
zerostratos/unsloth_finetune_deepseek-ocr-200
Image Feature Extraction • 3B • Updated • 3
zerostratos/unsloth_finetune_deepseek-ocr
Image Feature Extraction • 3B • Updated • 3
zerostratos/qwen_prm
Feature Extraction • 0.6B • Updated • 4
zerostratos/monolingual-vietnamese-bge
Sentence Similarity • 0.6B • Updated • 2
zerostratos/cptlora-qwen-0.6B
Text Generation • 0.6B • Updated • 1
zerostratos/qwen3-0.6B-pretrained-lora
Text Generation • 0.6B • Updated • 2
datasets 77
zerostratos/Test-Wan
Updated • 2
zerostratos/error_detection_math10
Preview • Updated • 4
zerostratos/math_essay_prm
Viewer • Updated • 995 • 8
zerostratos/view_test
Viewer • Updated • 96 • 5
zerostratos/math_errors
Viewer • Updated • 835 • 8
zerostratos/k10math-1
Updated • 3
zerostratos/fineweb-2-vie-2019
Viewer • Updated • 4.68M • 59
zerostratos/fineweb-2-vie-2020
Viewer • Updated • 5M • 17
zerostratos/fineweb-2-vie-2021
Viewer • Updated • 5M • 91
zerostratos/fineweb-2-vie-2022-selected
Preview • Updated • 74