No authors of 2410.01458 can endorse.

From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge

Xiefeng Wu: registered as an author of this paper.
Not currently an endorser.