EPO:Hierarchical LLM Agents with Environment Preference Optimization

Publication
In The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)
Haotian Fu
PhD student at Brown University

haotian_fu@brown.edu