A bridge, not a destination: YouTube viewer perspectives on AI mental health support and human therapy

Background: Artificial intelligence (AI) tools are increasingly used for mental health support, yet little is known about how they are understood outside clinical trials and survey-based

Digital health tools and point solutions—pitfalls in population health program measurement

Digital health tools are generally poorly regulated and often lack strong research evidence, posing challenges for purchasers of point solutions such as employer groups and

Crisis support teams’ technological openness and learning attitudes toward the AI based virtual patient system crisis support VR

Background: Against the backdrop of escalating global humanitarian crises, innovative didactic simulations are becoming increasingly important. A promising alternative to traditional classroom-based didactics for learning psychological

Federated generative prompt learning with vision foundation models: universal efficient multi-center medical image analysis

npj Digital Medicine, published online: 10 June 2026; doi:10.1038/s41746-026-02866-1

Sorting Science From Marketing in the Era of Data-Driven Biological Aging Clocks

Pushing Biomolecular Utility-Diversity Frontiers with Supergroup Relative Policy Optimization

May 20, 2026

arXiv:2605.08659v3 Announce Type: replace-cross
Abstract: Biomolecular generators are often adapted with reward feedback to improve task-specific utility, but pushing utility alone can concentrate generation on a narrow family of candidates. Maintaining diversity is difficult because sample diversity is a set-level property. We introduce Supergroup Relative Policy Optimization (SGRPO), a flexible GRPO-style framework that directly constructs rewards from set-level diversity. For each condition, SGRPO samples a supergroup of candidate sets, compares their diversity under the same condition, and redistributes the group diversity reward to individual rollouts through leave-one-out diversity contributions before combining it with rollout-level utility. This design decouples SGRPO from a particular generator, utility reward, or diversity metric, and allows instantiation with different GRPO-style approaches. We evaluate SGRPO on de novo small-molecule design, pocket-based small-molecule design, and de novo protein design, instantiating it with both GRPO and Coupled-GRPO across autoregressive and discrete diffusion generators. Across decoding sweeps, SGRPO expands the utility-diversity Pareto frontier and achieves the best frontier-level metrics relative to pretrained generators, GRPO, and memory-assisted GRPO when applicable. Our analyses further show that direct set-level diversity rewards remain effective with small groups and help preserve broader generation-distribution coverage during post-training. The code is available at https://github.com/IDEA-XL/SGRPO.
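The leave-one-out redistribution described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the function names are hypothetical, mean pairwise distance stands in for the diversity metric, scalar values stand in for generated candidates, and the additive rule that combines utility with the diversity terms is only one possible choice. The abstract does not specify any of these details; the linked repository is the reference for the actual method.

```python
from itertools import combinations
from statistics import mean, pstdev


def abs_distance(a, b):
    # Toy distance between scalar "candidates" (assumption for illustration);
    # a real system would use a domain-specific molecular or protein distance.
    return abs(a - b)


def set_diversity(items, distance=abs_distance):
    # Mean pairwise distance: one possible set-level diversity metric.
    pairs = list(combinations(items, 2))
    if not pairs:
        return 0.0
    return sum(distance(a, b) for a, b in pairs) / len(pairs)


def leave_one_out_contributions(items, distance=abs_distance):
    # Contribution of item i = diversity of the full set
    # minus diversity of the set with item i removed.
    full = set_diversity(items, distance)
    return [
        full - set_diversity(items[:i] + items[i + 1:], distance)
        for i in range(len(items))
    ]


def group_relative(values, eps=1e-8):
    # GRPO-style normalization: center and scale scores within the group.
    mu = mean(values)
    sd = pstdev(values)
    return [(v - mu) / (sd + eps) for v in values]


def combined_rewards(supergroup, utilities, diversity_weight=1.0):
    # Compare candidate sets sampled under the same condition, then pass each
    # set's relative diversity score down to its rollouts together with the
    # rollout's own leave-one-out contribution (hypothetical additive rule).
    set_scores = group_relative([set_diversity(s) for s in supergroup])
    rewards = []
    for items, score, utils in zip(supergroup, set_scores, utilities):
        contribs = leave_one_out_contributions(items)
        rewards.append(
            [u + diversity_weight * (score + c) for u, c in zip(utils, contribs)]
        )
    return rewards


# Two candidate sets for one condition: a spread-out set and a collapsed set.
supergroup = [[0.0, 1.0, 5.0], [2.0, 2.0, 2.0]]
utilities = [[0.5, 0.5, 0.5], [0.5, 0.5, 0.5]]
rewards = combined_rewards(supergroup, utilities)
```

In this toy example all rollouts have equal utility, so differences in reward come only from diversity: the outlying candidate in the first set receives the largest reward, while every rollout in the collapsed second set is penalized equally.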

Copyright 2025 dijee Intelligence Ltd. dijee Intelligence Ltd. is a private limited company registered in England and Wales at Media House, Sopers Road, Cuffley, Hertfordshire, EN6 4RY, UK registration number 16808844