Humans and other animals are driven to acquire information about opportunities in their environments, yet how they evaluate what is worth learning remains unclear. Here we combine artificial neural networks with symbolic regression to extract an expressive yet interpretable model that specifies how human participants evaluate decision-relevant information during choice. The recovered function depends primarily on the relative evidence accumulated across options rather than absolute uncertainty about each, revealing that participants seek information symmetry across alternatives rather than minimizing uncertainty option by option. This account outperforms standard models of uncertainty-based exploration and generalizes to an independent dataset. Using ultrahigh-field (7T) functional magnetic resonance imaging optimized for midbrain and brainstem, we simultaneously measured activity across five neuromodulatory nuclei and two cortical regions. Ventral tegmental area activity showed opposed coding of information and selection values, a pattern suited to arbitrating between sampling and choosing, and anterior cingulate cortex and anterior insula tracked value-of-information computations.