AI Tool · Freemium
Datasaur
NLP labeling and LLM evaluation tooling with strong SEA-language coverage
⭐ 4.6/5 · 92 reviews
via G2, Capterra or Trustpilot
About Datasaur
Datasaur is an NLP data labeling and LLM evaluation platform used by AI teams to annotate text, build training datasets, and run human-in-the-loop evals on model outputs. Founded by engineers with Indonesian roots, it ships first-class workflows for low-resource Southeast Asian (SEA) languages, including Bahasa Indonesia, Vietnamese, and Thai.
Key Features
✓ Named entity recognition and span labeling
✓ OCR annotation for scanned documents
✓ Audio transcription and labeling
✓ ML-assisted predictive labeling (Datasaur Dynamic)
✓ LLM Labs for prompt evaluation and red teaming
✓ Self-hosted deployment via AWS Marketplace
Best For
AI teams building SEA-language NLP models
Fintechs labeling chat or document data
Government and academic research on local languages
LLM evaluation for Bahasa or Vietnamese fine-tunes
Pricing
Model: Freemium
Free tier: ✓ Yes
Starts at: $417/month
Details
Category: AI / Data
Languages: EN, ID, VI
Updated: 2026-05-06