default search action

combined dblp search
author search
venue search
publication search

ask others

Yihao Feng

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/XiaXDYFX0X24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XiaXDYFX0X24
Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong:
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. ACL (1) 2024: 680-699
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/0007YFQCYC0SEXX24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0007YFQCYC0SEXX24
Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu:
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CVPR 2024: 9026-9036
[c19]
- view
  - electronic edition via handle.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/hicss/ZhangYF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/hicss/ZhangYF24
Jieyi Zhang, Cenying Yang, Yihao Feng:
Demand Prediction by Incorporating Internet-of-Things Data: A Case of Automobile Repair and Maintenance Service. HICSS 2024: 5017-5026
[c18]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YaoHNLFXNC0AXMW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YaoHNLFXNC0AXMW24
Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh R. N., Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. ICLR 2024
[i32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-10941
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-10941
Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese:
Text2Data: Low-Resource Data Generation with Textual Control. CoRR abs/2402.10941 (2024)
[i31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-15506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-15506
Jianguo Zhang, Tian Lan, Rithesh Murthy, Zhiwei Liu, Weiran Yao, Juntao Tan, Thai Hoang, Liangwei Yang, Yihao Feng, Zuxin Liu, Tulika Manoj Awalgaonkar, Juan Carlos Niebles, Silvio Savarese, Shelby Heinecke, Huan Wang, Caiming Xiong:
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning. CoRR abs/2402.15506 (2024)
[i30]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-18667
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-18667
Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong:
FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. CoRR abs/2402.18667 (2024)
[i29]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-01258
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-01258
Ruohong Zhang, Liangke Gui, Zhiqing Sun, Yihao Feng, Keyang Xu, Yuanhan Zhang, Di Fu, Chunyuan Li, Alexander Hauptmann, Yonatan Bisk, Yiming Yang:
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward. CoRR abs/2404.01258 (2024)
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18518
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18518
Zuxin Liu, Thai Hoang, Jianguo Zhang, Ming Zhu, Tian Lan, Shirley Kokane, Juntao Tan, Weiran Yao, Zhiwei Liu, Yihao Feng, Rithesh Murthy, Liangwei Yang, Silvio Savarese, Juan Carlos Niebles, Huan Wang, Shelby Heinecke, Caiming Xiong:
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets. CoRR abs/2406.18518 (2024)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-14207
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-14207
Bo Liu, Rui Wang, Lemeng Wu, Yihao Feng, Peter Stone, Qiang Liu:
Longhorn: State Space Models are Amortized Online Learners. CoRR abs/2407.14207 (2024)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-07060
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-07060
Kexun Zhang, Weiran Yao, Zuxin Liu, Yihao Feng, Zhiwei Liu, Rithesh Murthy, Tian Lan, Lei Li, Renze Lou, Jiacheng Xu, Bo Pang, Yingbo Zhou, Shelby Heinecke, Silvio Savarese, Huan Wang, Caiming Xiong:
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents. CoRR abs/2408.07060 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-12590
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-12590
Can Qin, Congying Xia, Krithika Ramakrishnan, Michael S. Ryoo, Lifu Tu, Yihao Feng, Manli Shu, Honglu Zhou, Anas Awadalla, Jun Wang, Senthil Purushwalkam, Le Xue, Yingbo Zhou, Huan Wang, Silvio Savarese, Juan Carlos Niebles, Zeyuan Chen, Ran Xu, Caiming Xiong:
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. CoRR abs/2408.12590 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-03215
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-03215
Jianguo Zhang, Tian Lan, Ming Zhu, Zuxin Liu, Thai Hoang, Shirley Kokane, Weiran Yao, Juntao Tan, Akshara Prabhakar, Haolin Chen, Zhiwei Liu, Yihao Feng, Tulika Manoj Awalgaonkar, Rithesh Murthy, Eric Hu, Zeyuan Chen, Ran Xu, Juan Carlos Niebles, Shelby Heinecke, Huan Wang, Silvio Savarese, Caiming Xiong:
xLAM: A Family of Large Action Models to Empower AI Agent Systems. CoRR abs/2409.03215 (2024)
2023
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/0042F0S23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/0042F0S23
Bo Liu, Yihao Feng, Qiang Liu, Peter Stone:
Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning. AAAI 2023: 8799-8806
[c16]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FengYZ0XZW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FengYZ0XZW23
Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang:
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems. ICLR 2023
[c15]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuFSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuFSL23
Bo Liu, Yihao Feng, Peter Stone, Qiang Liu:
FAMO: Fast Adaptive Multitask Optimization. NeurIPS 2023
[c14]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LiuZGFLZS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuZGFLZS23
Bo Liu, Yifeng Zhu, Chongkai Gao, Yihao Feng, Qiang Liu, Yuke Zhu, Peter Stone:
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning. NeurIPS 2023
[c13]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/QinZYFYZWNXSE0X23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/QinZYFYZWNXSE0X23
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu:
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild. NeurIPS 2023
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangZXFXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangZXFXZ23
Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou:
Preference-grounded Token-level Guidance for Language Model Fine-tuning. NeurIPS 2023
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-10342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-10342
Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang:
Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems. CoRR abs/2302.10342 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-09618
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-09618
Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu:
HIVE: Harnessing Human Feedback for Instructional Visual Editing. CoRR abs/2303.09618 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-11147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-11147
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu:
UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild. CoRR abs/2305.11147 (2023)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-00398
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-00398
Shentao Yang, Shujian Zhang, Congying Xia, Yihao Feng, Caiming Xiong, Mingyuan Zhou:
Preference-grounded Token-level Guidance for Language Model Fine-tuning. CoRR abs/2306.00398 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03310
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03310
Bo Liu, Yifeng Zhu, Chongkai Gao, Yihao Feng, Qiang Liu, Yuke Zhu, Peter Stone:
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning. CoRR abs/2306.03310 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-03792
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-03792
Bo Liu, Yihao Feng, Peter Stone, Qiang Liu:
FAMO: Fast Adaptive Multitask Optimization. CoRR abs/2306.03792 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-08962
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08962
Rithesh Murthy, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Le Xue, Weiran Yao, Yihao Feng, Zeyuan Chen, Akash Gokul, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
REX: Rapid Exploration and eXploitation for AI Agents. CoRR abs/2307.08962 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-02151
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-02151
Weiran Yao, Shelby Heinecke, Juan Carlos Niebles, Zhiwei Liu, Yihao Feng, Le Xue, Rithesh Murthy, Zeyuan Chen, Jianguo Zhang, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. CoRR abs/2308.02151 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05960
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05960
Zhiwei Liu, Weiran Yao, Jianguo Zhang, Le Xue, Shelby Heinecke, Rithesh Murthy, Yihao Feng, Zeyuan Chen, Juan Carlos Niebles, Devansh Arpit, Ran Xu, Phil Mui, Huan Wang, Caiming Xiong, Silvio Savarese:
BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents. CoRR abs/2308.05960 (2023)
2022
[c11]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/YangFZZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangFZZ22
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou:
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. ICML 2022: 24980-25006
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/YangZFZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangZFZ22
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou:
A Unified Framework for Alternating Offline Model Training and Policy Learning. NeurIPS 2022
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-00236
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-00236
Ziyang Tang, Yihao Feng, Qiang Liu:
Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning. CoRR abs/2201.00236 (2022)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-09673
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-09673
Shentao Yang, Zhendong Wang, Huangjie Zheng, Yihao Feng, Mingyuan Zhou:
A Regularized Implicit Policy for Offline Reinforcement Learning. CoRR abs/2202.09673 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07166
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou:
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning. CoRR abs/2206.07166 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-08133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-08133
Bo Liu, Yihao Feng, Qiang Liu, Peter Stone:
Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning. CoRR abs/2208.08133 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05922
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou:
A Unified Framework for Alternating Offline Model Training and Policy Learning. CoRR abs/2210.05922 (2022)
2021
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/XuRZFX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/XuRZFX20
Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng, Caiming Xiong:
Unsupervised Out-of-Domain Detection via Pre-trained Transformers. ACL/IJCNLP (1) 2021: 1052-1061
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/FengTZ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/FengTZ021
Yihao Feng, Ziyang Tang, Na Zhang, Qiang Liu:
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds. ICLR 2021
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/XiaYFY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/XiaYFY21
Congying Xia, Wenpeng Yin, Yihao Feng, Philip S. Yu:
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System. NAACL-HLT 2021: 1351-1360
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-05741
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-05741
Yihao Feng, Ziyang Tang, Na Zhang, Qiang Liu:
Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds. CoRR abs/2103.05741 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-11882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-11882
Congying Xia, Wenpeng Yin, Yihao Feng, Philip S. Yu:
Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System. CoRR abs/2104.11882 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-00948
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-00948
Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng, Caiming Xiong:
Unsupervised Out-of-Domain Detection via Pre-trained Transformers. CoRR abs/2106.00948 (2021)
2020
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/TangF0ZL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/TangF0ZL20
Ziyang Tang, Yihao Feng, Lihong Li, Dengyong Zhou, Qiang Liu:
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation. ICLR 2020
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/FengRTL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/FengRTL20
Yihao Feng, Tongzheng Ren, Ziyang Tang, Qiang Liu:
Accountable Off-Policy Evaluation With Kernel Bellman Statistics. ICML 2020: 3102-3111
[c4]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/TangFZ0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/TangFZ0020
Ziyang Tang, Yihao Feng, Na Zhang, Jian Peng, Qiang Liu:
Off-Policy Interval Estimation with Lipschitz Value Iteration. NeurIPS 2020
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06668
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06668
Yihao Feng, Tongzheng Ren, Ziyang Tang, Qiang Liu:
Accountable Off-Policy Evaluation With Kernel Bellman Statistics. CoRR abs/2008.06668 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15392
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15392
Ziyang Tang, Yihao Feng, Na Zhang, Jian Peng, Qiang Liu:
Off-Policy Interval Estimation with Lipschitz Value Iteration. CoRR abs/2010.15392 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c3]
- view
- export record
  dblp key:
  - conf/nips/Feng0019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Feng0019
Yihao Feng, Lihong Li, Qiang Liu:
A Kernel Loss for Solving the Bellman Equation. NeurIPS 2019: 15430-15441
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1905-10506
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-10506
Yihao Feng, Lihong Li, Qiang Liu:
A Kernel Loss for Solving the Bellman Equation. CoRR abs/1905.10506 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-07186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-07186
Ziyang Tang, Yihao Feng, Lihong Li, Dengyong Zhou, Qiang Liu:
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation. CoRR abs/1910.07186 (2019)
2018
[c2]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/LiuFMZ0018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuFMZ0018
Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu:
Action-dependent Control Variates for Policy Optimization via Stein Identity. ICLR (Poster) 2018
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1810-00139
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-00139
Guangming Shi, Zhongqiang Zhang, Dahua Gao, Xuemei Xie, Yihao Feng, Xinrui Ma, Danhua Liu:
Knowledge-guided Semantic Computing Network. CoRR abs/1810.00139 (2018)
2017
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/pvldb/WangFGLLTTWZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pvldb/WangFGLLTTWZ17
Chao Wang, Yihao Feng, Qi Guo, Zhaoxian Li, Kexin Liu, Zijian Tang, Anthony K. H. Tung, Lifu Wu, Yuxin Zheng:
ARShop: A Cloud-based Augmented Reality System for Shopping. Proc. VLDB Endow. 10(12): 1845-1848 (2017)
[c1]
- view
  - electronic edition @ auai.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/uai/FengWL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uai/FengWL17
Yihao Feng, Dilin Wang, Qiang Liu:
Learning to Draw Samples with Amortized Stein Variational Gradient Descent. UAI 2017
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1710-11198
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-11198
Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu:
Sample-efficient Policy Optimization with Stein Control Variate. CoRR abs/1710.11198 (2017)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.