Intrinsic reward values are anomalously small in sparse-reward environments. Is this normal? If so, why? Venture example: