Methodology copyright results: All copyright scores are go @1."One endeavor" configurations allow no majority voting or parallel take a look at-time compute; "several attempts" options let test-time variety of the applicant answer. They're all operate Along with the AI Studio API with default sampling configurations. To lessen variance, we common a