Kyle O'Brien PRO

Kyle1668

49 3 101

https://kyleobrien.io

AI & ML interests

pretraining, alignment, open-source

Recent Activity

updated a dataset 25 days ago

geodesic-research/pa-warm-start-1B-sft-mix

new activity 25 days ago

geodesic-research/pa-warm-start-1B-sft-mix: Migrate tool_calls/tools from JSON strings to structured columns

OpenAI-convention hybrid: tool_calls is list<struct{id,type,function{name,arguments}}> and tools is list<struct{type,function{name,description,parameters}}>; arguments and parameters remain JSON-encoded strings (Arrow-clean across heterogeneous tools). When tool_calls is stored as a JSON string, Jinja chat templates iterate over it character by character and render one empty <tool_call><function=></function></tool_call> block per character, causing a 4-5x length blowup and a deterministic training NaN. Renders were validated byte-identical to the parsed old rows on every config.
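The character-iteration failure can be reproduced without a template engine, since a Jinja `for` loop walks a string one character at a time just as Python does. The sketch below is only an illustration: `render_tool_calls` is a hypothetical stand-in for the template loop, and the `get_weather` call is made-up sample data, not a row from the dataset.

```python
import json


def render_tool_calls(tool_calls):
    """Hypothetical stand-in for a chat template loop: one block per item."""
    blocks = []
    for call in tool_calls:
        # A structured call is a dict with a "function" entry; a single
        # character from a JSON string is not, so its name renders empty.
        name = call["function"]["name"] if isinstance(call, dict) else ""
        blocks.append(f"<tool_call><function={name}></function></tool_call>")
    return blocks


# Made-up sample: one tool call, with arguments kept as a JSON-encoded string.
structured = [{
    "id": "call_0",
    "type": "function",
    "function": {"name": "get_weather", "arguments": json.dumps({"city": "Paris"})},
}]
raw = json.dumps(structured)  # the old storage format: the whole list as one string

broken = render_tool_calls(raw)             # iterates over characters
fixed = render_tool_calls(json.loads(raw))  # iterates over structured calls

print(len(broken), "blocks from the JSON string;", len(fixed), "from the parsed list")
```

Parsing the string before rendering, or storing the column as structured data in the first place, yields one block per tool call instead of one per character.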

new activity 25 days ago

geodesic-research/pa-warm-start-1B-sft-mix: Restore `default` config in the configs: mapping

The explicit top-level `configs:` section (added by the per-config pushes) shadows the implicit default config, so load_dataset(repo, 'default') fails with "BuilderConfig 'default' not found" even though data/ holds the blended mix. This change adds the data_files mapping for it (data/train-*, 265,048 rows).

Organizations

Collections 2

Papers 5

models 238

datasets 39

Kyle1668/fyn1668-inoculation-midtraining

Updated Mar 21 • 5

Kyle1668/sfm-em-wem-v4-fyn1668

Viewer • Updated Mar 16 • 19k • 6

Kyle1668/sfm-emergent-misalignment-training-data

Viewer • Updated Mar 15 • 16k • 10

Kyle1668/fewshot-discourse-grounded-misalignment-evals

Viewer • Updated Jan 3 • 4.46k • 24

Kyle1668/claude-sft-discourse-grounded-misalignment-synthetic-scenario-messages

Viewer • Updated Dec 23, 2025 • 12.9k • 10

Kyle1668/discourse-grounded-misalignment-evals-relevance-filtered

Viewer • Updated Dec 23, 2025 • 2.66k • 24

Kyle1668/stampy-private-11-26-25

Updated Nov 27, 2025 • 3

Kyle1668/alignment_filtering_20251126-0344

Updated Nov 26, 2025 • 3

Kyle1668/sfm-midtraining-mix-dclm-long-context-passages-blocklist-filtered

Viewer • Updated Nov 25, 2025 • 27.3k • 21

Kyle1668/climbmix-ai-blocklist-filtered-sample

Viewer • Updated Nov 24, 2025 • 50k • 13
