On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 192
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Paper • 2311.03099 • Published Nov 6, 2023 • 31
Article ORBA: Orthogonal Reflection Bounded Ablation — A Geometrically Exact Detour in Directional Activation Editing 16 days ago • 5
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models Paper • 2510.15061 • Published Oct 16, 2025 • 3
Favorite Models Collection My currently most used. These are all fully uncensored (no refusals). • 13 items • Updated about 2 hours ago • 3
Finetune Experiments Collection Sorted from oldest (top) to newest (bottom) • 14 items • Updated 24 days ago • 1
Ablation Experiments Collection Sorted from oldest (top) to newest (bottom) • 23 items • Updated 22 days ago • 2
Favorite Uncensored Drivers Collection These models have no refusals and require no jailbreaks. • 36 items • Updated about 2 hours ago • 16
Model Stock: All we need is just a few fine-tuned models Paper • 2403.19522 • Published Mar 28, 2024 • 14
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI Feb 20 • 501
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling Paper • 2406.11617 • Published Jun 17, 2024 • 10
Favorite Models Collection Models with that certain something. Non-exhaustive list, no particular order. • 22 items • Updated 5 days ago • 5
Merge Experiments Collection Sorted from oldest (top) to newest (bottom) • 93 items • Updated about 2 hours ago • 4