{% extends 'base.html' %} {% block title %}Synthetic Data Kit - Home{% endblock %} {% block content %}

Synthetic Data Kit Web Interface

Welcome to the Synthetic Data Kit web interface. This tool allows you to generate and curate synthetic question-answer pairs from text documents.

Current Configuration

LLM Provider: {{ provider }}

Available Operations

Ingest Document

Parse documents (PDF, HTML, YouTube, DOCX, PPT, TXT) into clean text.

Ingest Document
Create Content

Generate content from text: QA pairs, summaries, Chain of Thought examples.

Create Content
Curate QA Pairs

Curate and improve existing question-answer pairs with an LLM.

Curate QA Pairs
File Browser

Browse and view generated files and QA pairs in your data directory.

Browse Files
{% endblock %}