Dale Schuurmans
Publications
- 2024
- [i91] Shicong Cen, Jincheng Mei, Hanjun Dai, Dale Schuurmans, Yuejie Chi, Bo Dai:
Beyond Expectations: Learning with Stochastic Dominance Made Practical. CoRR abs/2402.02698 (2024)
- [i89] Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvári, Dale Schuurmans:
Stochastic Gradient Succeeds for Bandits. CoRR abs/2402.17235 (2024)
- 2023
- [c202] Haoran Sun, Hanjun Dai, Bo Dai, Haomin Zhou, Dale Schuurmans:
Discrete Langevin Samplers via Wasserstein Gradient Flow. AISTATS 2023: 6290-6313
- [c201] Hanjun Dai, Yuan Xue, Niao He, Yixin Wang, Na Li, Dale Schuurmans, Bo Dai:
Learning to Optimize with Stochastic Dominance Constraints. AISTATS 2023: 8991-9009
- [c198] Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai:
Latent Variable Representation for Reinforcement Learning. ICLR 2023
- [c197] Tongzheng Ren, Tianjun Zhang, Lisa Lee, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai:
Spectral Decomposition Representation for Reinforcement Learning. ICLR 2023
- [c196] Haoran Sun, Bo Dai, Charles Sutton, Dale Schuurmans, Hanjun Dai:
Any-scale Balanced Samplers for Discrete Space. ICLR 2023
- [c195] Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai:
Score-based Continuous-time Discrete Diffusion Models. ICLR 2023
- [c191] Jincheng Mei, Zixin Zhong, Bo Dai, Alekh Agarwal, Csaba Szepesvári, Dale Schuurmans:
Stochastic Gradient Succeeds for Bandits. ICML 2023: 24325-24360
- [c188] Yilun Du, Sherry Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Josh Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. NeurIPS 2023
- [c186] Jincheng Mei, Bo Dai, Alekh Agarwal, Mohammad Ghavamzadeh, Csaba Szepesvári, Dale Schuurmans:
Ordering-based Conditions for Global Convergence of Policy Gradient Methods. NeurIPS 2023
- [c184] Tianjun Zhang, Tongzheng Ren, Chenjun Xiao, Wenli Xiao, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai:
Energy-based Predictive Representations for Partially Observed Reinforcement Learning. UAI 2023: 2477-2487
- [i87] Jincheng Mei, Wesley Chung, Valentin Thomas, Bo Dai, Csaba Szepesvári, Dale Schuurmans:
The Role of Baselines in Policy Gradient Optimization. CoRR abs/2301.06276 (2023)
- [i86] Yilun Du, Mengjiao Yang, Bo Dai, Hanjun Dai, Ofir Nachum, Joshua B. Tenenbaum, Dale Schuurmans, Pieter Abbeel:
Learning Universal Policies via Text-Guided Video Generation. CoRR abs/2302.00111 (2023)
- [i83] Mengjiao Yang, Yilun Du, Bo Dai, Dale Schuurmans, Joshua B. Tenenbaum, Pieter Abbeel:
Probabilistic Adaptation of Text-to-Video Models. CoRR abs/2306.01872 (2023)
- [i79] Hongming Zhang, Tongzheng Ren, Chenjun Xiao, Dale Schuurmans, Bo Dai:
Provable Representation with Efficient Planning for Partially Observable Reinforcement Learning. CoRR abs/2311.12244 (2023)
- 2022
- [c183] Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans:
Offline Policy Selection under Uncertainty. AISTATS 2022: 4376-4396
- [c182] Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvári:
The Curse of Passive Data Collection in Batch Reinforcement Learning. AISTATS 2022: 8413-8438
- [c181] Hanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai:
Neural Stochastic Dual Dynamic Programming. ICLR 2022
- [c180] Chenjun Xiao, Bo Dai, Jincheng Mei, Oscar A Ramirez, Ramki Gummadi, Chris Harris, Dale Schuurmans:
Understanding and Leveraging Overparameterization in Recursive Value Estimation. ICLR 2022
- [c179] Hanjun Dai, Mengjiao Yang, Yuan Xue, Dale Schuurmans, Bo Dai:
Marginal Distribution Adaptation for Discrete Sets via Module-Oriented Divergence Minimization. ICML 2022: 4605-4617
- [c177] Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph Gonzalez, Dale Schuurmans, Bo Dai:
Making Linear MDPs Practical via Contrastive Representation Learning. ICML 2022: 26447-26466
- [c176] Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans:
SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs. KDD 2022: 1472-1482
- [c174] Jincheng Mei, Wesley Chung, Valentin Thomas, Bo Dai, Csaba Szepesvári, Dale Schuurmans:
The Role of Baselines in Policy Gradient Optimization. NeurIPS 2022
- [c170] Runyu Zhang, Jincheng Mei, Bo Dai, Dale Schuurmans, Na Li:
On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games. NeurIPS 2022
- [i76] Runyu Zhang, Jincheng Mei, Bo Dai, Dale Schuurmans, Na Li:
On the Effect of Log-Barrier Regularization in Decentralized Softmax Gradient Play in Multiagent Systems. CoRR abs/2202.00872 (2022)
- [i70] Haoran Sun, Hanjun Dai, Bo Dai, Haomin Zhou, Dale Schuurmans:
Discrete Langevin Sampler via Wasserstein Gradient Flow. CoRR abs/2206.14897 (2022)
- [i68] Tianjun Zhang, Tongzheng Ren, Mengjiao Yang, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai:
Making Linear MDPs Practical via Contrastive Representation Learning. CoRR abs/2207.07150 (2022)
- [i67] Tongzheng Ren, Tianjun Zhang, Lisa Lee, Joseph E. Gonzalez, Dale Schuurmans, Bo Dai:
Spectral Decomposition Representation for Reinforcement Learning. CoRR abs/2208.09515 (2022)
- [i64] Hanjun Dai, Yuan Xue, Niao He, Bethany Wang, Na Li, Dale Schuurmans, Bo Dai:
Learning to Optimize with Stochastic Dominance Constraints. CoRR abs/2211.07767 (2022)
- [i61] Haoran Sun, Lijun Yu, Bo Dai, Dale Schuurmans, Hanjun Dai:
Score-based Continuous-time Discrete Diffusion Models. CoRR abs/2211.16750 (2022)
- [i59] Tongzheng Ren, Chenjun Xiao, Tianjun Zhang, Na Li, Zhaoran Wang, Sujay Sanghavi, Dale Schuurmans, Bo Dai:
Latent Variable Representation for Reinforcement Learning. CoRR abs/2212.08765 (2022)
- 2021
- [c167] Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvári, Dale Schuurmans:
Leveraging Non-uniformity in First-order Non-convex Optimization. ICML 2021: 7555-7564
- [c166] Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Michihiro Yasunaga, Haitian Sun, Dale Schuurmans, Jure Leskovec, Denny Zhou:
LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs. ICML 2021: 8959-8970
- [c164] Chenjun Xiao, Yifan Wu, Jincheng Mei, Bo Dai, Tor Lattimore, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
On the Optimality of Batch Policy Optimization Algorithms. ICML 2021: 11362-11371
- [c163] Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
Understanding the Effect of Stochasticity in Policy Optimization. NeurIPS 2021: 19339-19351
- [c162] Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai:
Combiner: Full Attention Transformer with Sparse Computation Cost. NeurIPS 2021: 22470-22482
- [i56] Chenjun Xiao, Yifan Wu, Tor Lattimore, Bo Dai, Jincheng Mei, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
On the Optimality of Batch Policy Optimization Algorithms. CoRR abs/2104.02293 (2021)
- [i54] Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvári, Dale Schuurmans:
Leveraging Non-uniformity in First-order Non-convex Optimization. CoRR abs/2105.06072 (2021)
- [i52] Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvári:
On the Sample Complexity of Batch Reinforcement Learning with Policy-Induced Data. CoRR abs/2106.09973 (2021)
- [i51] Hongyu Ren, Hanjun Dai, Zihang Dai, Mengjiao Yang, Jure Leskovec, Dale Schuurmans, Bo Dai:
Combiner: Full Attention Transformer with Sparse Computation Cost. CoRR abs/2107.05768 (2021)
- [i50] Hongyu Ren, Hanjun Dai, Bo Dai, Xinyun Chen, Denny Zhou, Jure Leskovec, Dale Schuurmans:
SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs. CoRR abs/2110.14890 (2021)
- [i49] Jincheng Mei, Bo Dai, Chenjun Xiao, Csaba Szepesvári, Dale Schuurmans:
Understanding the Effect of Stochasticity in Policy Optimization. CoRR abs/2110.15572 (2021)
- [i48] Hanjun Dai, Yuan Xue, Zia Syed, Dale Schuurmans, Bo Dai:
Neural Stochastic Dual Dynamic Programming. CoRR abs/2112.00874 (2021)
- 2020
- [c161] Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans:
GenDICE: Generalized Offline Estimation of Stationary Values. ICLR 2020
- [c159] Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, Dale Schuurmans:
Scalable Deep Generative Modeling for Sparse Graphs. ICML 2020: 2302-2312
- [c156] Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans:
Batch Stationary Distribution Estimation. ICML 2020: 10203-10213
- [c154] Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans:
Energy-Based Processes for Exchangeable Data. ICML 2020: 10681-10692
- [c152] Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
CoinDICE: Off-Policy Confidence Interval Estimation. NeurIPS 2020
- [c151] Hanjun Dai, Rishabh Singh, Bo Dai, Charles Sutton, Dale Schuurmans:
Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration. NeurIPS 2020
- [c149] Jincheng Mei, Chenjun Xiao, Bo Dai, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
Escaping the Gravitational Pull of Softmax. NeurIPS 2020
- [c148] Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans:
Off-Policy Evaluation via the Regularized Lagrangian. NeurIPS 2020
- [i47] Ruiyi Zhang, Bo Dai, Lihong Li, Dale Schuurmans:
GenDICE: Generalized Offline Estimation of Stationary Values. CoRR abs/2002.09072 (2020)
- [i45] Junfeng Wen, Bo Dai, Lihong Li, Dale Schuurmans:
Batch Stationary Distribution Estimation. CoRR abs/2003.00722 (2020)
- [i43] Mengjiao Yang, Bo Dai, Hanjun Dai, Dale Schuurmans:
Energy-Based Processes for Exchangeable Data. CoRR abs/2003.07521 (2020)
- [i40] Hanjun Dai, Azade Nazi, Yujia Li, Bo Dai, Dale Schuurmans:
Scalable Deep Generative Modeling for Sparse Graphs. CoRR abs/2006.15502 (2020)
- [i38] Mengjiao Yang, Ofir Nachum, Bo Dai, Lihong Li, Dale Schuurmans:
Off-Policy Evaluation via the Regularized Lagrangian. CoRR abs/2007.03438 (2020)
- [i35] Bo Dai, Ofir Nachum, Yinlam Chow, Lihong Li, Csaba Szepesvári, Dale Schuurmans:
CoinDICE: Off-Policy Confidence Interval Estimation. CoRR abs/2010.11652 (2020)
- [i34] Hanjun Dai, Rishabh Singh, Bo Dai, Charles Sutton, Dale Schuurmans:
Learning Discrete Energy-based Models via Auxiliary-variable Local Exploration. CoRR abs/2011.05363 (2020)
- [i33] Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans:
Offline Policy Selection under Uncertainty. CoRR abs/2012.06919 (2020)
- 2019
- [c147] Bo Dai, Hanjun Dai, Arthur Gretton, Le Song, Dale Schuurmans, Niao He:
Kernel Exponential Family Estimation via Doubly Dual Embedding. AISTATS 2019: 2321-2330
- [c137] Bo Dai, Zhen Liu, Hanjun Dai, Niao He, Arthur Gretton, Le Song, Dale Schuurmans:
Exponential Family Estimation via Adversarial Dynamics Embedding. NeurIPS 2019: 10977-10988
- [i29] Bo Dai, Zhen Liu, Hanjun Dai, Niao He, Arthur Gretton, Le Song, Dale Schuurmans:
Exponential Family Estimation via Adversarial Dynamics Embedding. CoRR abs/1904.12083 (2019)
- [i25] Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, Dale Schuurmans:
AlgaeDICE: Policy Gradient from Arbitrary Experience. CoRR abs/1912.02074 (2019)
- 2018
- [i20] Bo Dai, Hanjun Dai, Arthur Gretton, Le Song, Dale Schuurmans, Niao He:
Kernel Exponential Family Estimation via Doubly Dual Embedding. CoRR abs/1811.02228 (2018)
Last updated on 2024-04-25 02:24 CEST by the dblp team.
All metadata released as open data under the CC0 1.0 license.