- Kyle1668/labeled_alignment_discourse_v1
  Viewer • Updated • 1.07k • 18
- Kyle1668/alignment-classifier-documents-unlabeled
  Viewer • Updated • 57.9k • 10
- geodesic-research/anthropic-propensity-evals-human-written-refined
  Viewer • Updated • 4.28k • 64 • 1
- Kyle1668/sfm-finetuning-dataset-v1.5
  Viewer • Updated • 306k • 45
Kyle O'Brien PRO
Kyle1668
AI & ML interests
pretraining, alignment, open-source
Recent Activity
updated a dataset about 7 hours ago: geodesic-research/emergent-misalignment-train-mq-mechanisms
published a dataset about 7 hours ago: geodesic-research/emergent-misalignment-train-mq-mechanisms
updated a model 7 days ago: geodesic-research/nemotron-instruct-tokenizer-prefill-parity-mq