preference-data

Here are 8 public repositories matching this topic...

argilla-io / notus

Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach

zephyr fine-tuning dpo trl lm-alignment preference-data alignment-handbook

Updated Jan 15, 2024
Python

altaidevorg / afterimage

Generate conversational, tool-calling, structured-output, and preference datasets — easily and at scale

conversational persona structured-output afterimage multi-turn dpo synthetic-dataset preference-data tool-calling

Updated May 13, 2026
Python

impel-intelligence / datapoint-mcp

MCP server for human-in-the-loop surveys, A/B preference tests, ratings, and rankings. Get real human feedback inside Claude Code, Claude Desktop, Cursor, Windsurf, and any MCP client — powered by Datapoint AI.

Updated Jun 12, 2026
Python

Alfonsobang / awesome-llm-training-data

Curated tools, papers, datasets, and practices for LLM training data engineering.

annotation data-quality synthetic-data training-data data-governance llm rlhf financial-ai rag-evaluation preference-data

Updated Jun 13, 2026
Python

LaelaZorana / rlhf-pairwise-rater

Pairwise rating CLI for AI responses — per-axis scoring (helpfulness/harmlessness/accuracy/instruction-following), JSONL in/out, inter-rater Cohen's kappa

python annotation inter-rater-agreement ai-evaluation rlhf preference-data

Updated Jun 4, 2026
Python
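
For readers unfamiliar with the agreement metric named in the description above, here is a minimal sketch of Cohen's kappa for two raters. It is not taken from the rlhf-pairwise-rater repository; the function name `cohens_kappa` and the label values are illustrative assumptions.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same items.

    Hypothetical helper for illustration; not the repository's API.
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty item list")
    n = len(rater_a)
    # Observed agreement: fraction of items where both raters chose the same label.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical label; treat as full agreement.
        return 1.0
    return (observed - expected) / (1 - expected)


# Labels say which response ("A" or "B") each rater preferred per pair.
kappa = cohens_kappa(["A", "A", "B", "B"], ["A", "B", "B", "B"])
```

In this example the raters agree on 3 of 4 pairs (0.75) while chance agreement is 0.5, so kappa is 0.5.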

TeracAI / svg-arena

A forkable example of the human-in-the-loop model-improvement loop: AI generates, humans judge via the Terac MCP, you improve the model. Built as an SVG illustration arena.

annotation nextjs human-in-the-loop dpo rlhf llm-eval preference-data terac

Updated Jun 20, 2026
TypeScript

nicolasdix / decision-ontology

This repository contains all artifacts produced during my bachelor's thesis on data modeling for collective decision-making.

automation linked-data knowledge-engineering rdf-validation ontology-development preference-data

Updated Apr 15, 2026
Jupyter Notebook

saitejasrivilli / preference-data-pipeline

RLHF preference data curation pipeline: HH-RLHF + UltraFeedback + OASST1 → quality filter → MinHash dedup → DPO-ready JSONL

python minhash data-curation post-training dpo llm rlhf preference-data hh-rlhf ultrafeedback

Updated Jun 7, 2026
Python
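
The last two stages named in the description above (MinHash dedup, then DPO-ready JSONL) can be sketched roughly as follows. This is an illustrative standard-library sketch, not code from the preference-data-pipeline repository: the function names, the 0.8 threshold, and the `prompt`/`chosen`/`rejected` field names (a common convention for DPO training data) are all assumptions.

```python
import hashlib
import json


def shingles(text, k=3):
    """Set of k-word shingles; short texts collapse to a single shingle."""
    words = text.lower().split()
    if len(words) < k:
        return {" ".join(words)}
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def minhash_signature(text, num_perm=64):
    """One minimum per salted hash function, taken over the text's shingles."""
    items = [s.encode("utf-8") for s in shingles(text)]
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")  # a different salt acts as a different hash
        signature.append(min(
            int.from_bytes(hashlib.blake2b(s, digest_size=8, salt=salt).digest(), "big")
            for s in items
        ))
    return signature


def estimated_jaccard(sig_a, sig_b):
    """Fraction of matching slots approximates Jaccard similarity."""
    return sum(x == y for x, y in zip(sig_a, sig_b)) / len(sig_a)


def dedup(records, threshold=0.8):
    """Keep the first record of each near-duplicate prompt group.

    Quadratic scan for clarity; larger pipelines typically add LSH banding.
    """
    kept, kept_sigs = [], []
    for record in records:
        sig = minhash_signature(record["prompt"])
        if all(estimated_jaccard(sig, other) < threshold for other in kept_sigs):
            kept.append(record)
            kept_sigs.append(sig)
    return kept


def to_jsonl(records):
    """Serialize one preference pair per line."""
    return "\n".join(
        json.dumps({"prompt": r["prompt"], "chosen": r["chosen"], "rejected": r["rejected"]})
        for r in records
    )


# Hypothetical records for illustration only.
records = [
    {"prompt": "Explain what a hash table is in simple terms",
     "chosen": "A hash table maps keys to values.", "rejected": "No."},
    {"prompt": "Explain what a hash table is in simple terms",
     "chosen": "It stores key-value pairs.", "rejected": "Unsure."},
    {"prompt": "Write a haiku about autumn rain falling softly",
     "chosen": "Rain on maple leaves", "rejected": "Rain."},
]
unique = dedup(records)
jsonl = to_jsonl(unique)
```

Here the second record repeats the first prompt exactly, so it is dropped and two lines of JSONL remain.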
