Datasaur
AI Tool · Freemium

NLP labeling and LLM evaluation tooling with strong SEA-language coverage

4.6/5 · 92 reviews
via G2, Capterra or Trustpilot

About Datasaur

Datasaur is an NLP data labeling and LLM evaluation platform used by AI teams to annotate text, build training datasets, and run human-in-the-loop evals on model outputs. Founded by engineers with Indonesian roots, it ships first-class workflows for low-resource SEA languages including Bahasa Indonesia, Vietnamese, and Thai.

Key Features

Named entity recognition and span labeling
OCR annotation for scanned documents
Audio transcription and labeling
ML-assisted predictive labeling (Datasaur Dynamic)
LLM Labs for prompt evaluation and red teaming
Self-hosted deployment via AWS Marketplace

Best For

AI teams building SEA-language NLP models
Fintechs labeling chat or document data
Government and academic research on local languages
LLM evaluation for Bahasa or Vietnamese fine-tunes

Pricing

Model: Freemium
Free tier: Yes
Starts at: $417/month

Details

Category: AI / Data
Languages: EN, ID, VI
Updated: 2026-05-06