Lorenzo's picture

Lorenzo

lsteno

·

AI & ML interests

None yet

Recent Activity

updated a collection about 7 hours ago

Qwen 3 4B RLM RLVR

liked a Space 2 days ago

HuggingFaceFW/finephrase

updated a dataset 7 days ago

lsteno/rlm-evals-paper-pass1-traces-v2

View all activity

Organizations

lsteno 's models 13

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-depth2-recursive-r64-a128-lr1e-5-adapter

Reinforcement Learning • Updated 9 days ago • 15

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr1e-5-depth1-v1

4B • Updated 29 days ago • 68

lsteno/Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1

Text Generation • 4B • Updated about 1 month ago • 146

lsteno/qwen3-rlm-depth1-r64-a128-lr1e-5-s150-bal35f40v1-lora

Updated May 21 • 11

lsteno/qwen3-rlm-depth1-r64-a128-lr5e-7-s150-bal35f40v1-lora

Updated May 20 • 10

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-4-s150-bal35f40v1-lora

Updated May 20 • 9

lsteno/qwen3-rlm-depth1-r16-a32-lr1e-5-s150-bal35f40v1-lora

Updated May 20 • 10

lsteno/qwen3-rlm-depth1-r16-a32-lr5e-7-s150-bal35f40v1-lora

Updated May 19 • 10

lsteno/qwen3-rlm-depth1-r4-a8-lr1e-4-s150-bal35f40v1-lora

Updated May 19 • 6

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr1e-5-s150-lora

Updated May 18 • 3

lsteno/Qwen3-4B-Instruct-2507-RLM-RL-depth1-r4-a8-lr5e-7-s150-lora

Updated May 17 • 4

lsteno/Qwen3-4B-Instruct-2507-RLM-SFT-v3-per-root-turn

4B • Updated May 17 • 14

lsteno/qwen2-7b-lora-opencodeinstruct

Text Generation • Updated Oct 12, 2025 • 1