The validation scoring system in LLMOracleCoordinator can be gamed by malicious validators who wait to submit their scores last. The issue occurs because:
All validations are publicly visible on-chain
There's no time limit for submitting validations
The scoring mechanism is deterministic and uses mean/standard deviation
Validators get paid if their scores fall within 1 standard deviation of the mean
Here's how a malicious validator could exploit this:
Wait for other validators to submit their scores first
Calculate the current mean and standard deviation from the public validation data
Submit scores that are guaranteed to fall within the acceptable range
Get paid the validatorFee despite not providing honest validation
LLMOracleCoordinator.sol#L334-L355
Malicious validators can get paid without providing honest validations
The quality of the validation system is compromised
Honest validators are disadvantaged compared to those who game the system
The protocol's ability to select the best LLM responses is undermined
Implement a commit-reveal scheme for validations:
Validators first submit a hash of their scores
Only after all validators have committed, they reveal their actual scores
This prevents later validators from knowing earlier scores
The contest is live. Earn rewards by submitting a finding.
This is your time to appeal against judgements on your submissions.
Appeals are being carefully reviewed by our judges.