Liao, L., Fu, Z., Yang, Z., Wang, Y., Kolar, M., & Wang, Z. (2021). Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning.
Chicago Style (17th ed.) Citation: Liao, Luofeng, Zuyue Fu, Zhuoran Yang, Yixin Wang, Mladen Kolar, and Zhaoran Wang. Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning. 2021.
MLA (9th ed.) Citation: Liao, Luofeng, et al. Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning. 2021.