Rewards The purpose of a reward function is to compute the quantity that the program should maximize during training. Rewards Overview ExactMatch reward CosineSimilarity reward LMAsJudge reward ProgramAsJudge reward