Skip to content

Non-record RTX4060Ti 11L LeakyTTT 24h local (1.1443 BPB)#1244

Open
monkeyKingProgrammer wants to merge 1 commit intoopenai:mainfrom
monkeyKingProgrammer:nonrecord-11l-24h-local4060ti
Open

Non-record RTX4060Ti 11L LeakyTTT 24h local (1.1443 BPB)#1244
monkeyKingProgrammer wants to merge 1 commit intoopenai:mainfrom
monkeyKingProgrammer:nonrecord-11l-24h-local4060ti

Conversation

@monkeyKingProgrammer
Copy link
Copy Markdown

@monkeyKingProgrammer monkeyKingProgrammer commented Apr 2, 2026

Summary

A non-record unlimited-compute 16MB submission:

records/track_non_record_16mb/2026-04-02_11L_24h_Local4060Ti_LeakyTTT/

This run was trained locally on 1x RTX 4060 Ti 16GB for a 24h wallclock cap, so it is not intended for the 10-minute 8xH100 main leaderboard.

Final score:

  • legal_ttt_exact val_bpb: 1.14430187
  • legal_ttt_exact val_loss: 1.93210066

Artifact:

  • bytes_total: 15,702,576
  • bytes_model_int6_lzma: 15,605,300
  • bytes_code: 97,276

This reuses the same 11-layer LeakyReLU^2 + Parallel Muon + XSA4 + Partial RoPE + LN scale + EMA + legal score-first TTT stack as the earlier 16-hour local run, but extends timed training to 24 hours at fixed 2048 sequence length.

@monkeyKingProgrammer monkeyKingProgrammer changed the title Add non-record unlimited-compute 11L LeakyTTT 24h local RTX 4060 Ti run Non-record unlimited-compute 11L LeakyTTT 24h local RTX 4060 Ti run Apr 2, 2026
@monkeyKingProgrammer monkeyKingProgrammer changed the title Non-record unlimited-compute 11L LeakyTTT 24h local RTX 4060 Ti run Non-record RTX4060Ti 11L LeakyTTT 24h local (1.1443 BPB) Apr 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant