TypeError: 'NoneType' object is not subscriptable #763 Closed marchcat69 opened on Oct 8, 2025 ...
For each data sample, it is acceptable for part of the reward to be None (e.g., in multi-task training). However, the overall reward must not be None. Sign up for free to join this conversation on ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results