default search action

combined dblp search
author search
venue search
publication search

ask others

Silviu Pitis

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

Conference and Workshop Papers

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c12]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/RuanDWPZBDMH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/RuanDWPZBDMH24
Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto:
Identifying the Risks of LM Agents with an LM-Emulated Sandbox. ICLR 2024
2023
[c11]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ZhouMHPPCB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZhouMHPPCB23
Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba:
Large Language Models are Human-Level Prompt Engineers. ICLR 2023
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/Pitis23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/Pitis23
Silviu Pitis:
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards. NeurIPS 2023
2022
[c9]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PitisCMG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PitisCMG22
Silviu Pitis, Elliot Creager, Ajay Mandlekar, Animesh Garg:
MoCoDA: Model-based Counterfactual Data Augmentation. NeurIPS 2022
2020
[c8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/AsisCPSG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/AsisCPSG20
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves:
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning. AAAI 2020: 3741-3748
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/PitisZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/PitisZ20
Silviu Pitis, Michael R. Zhang:
Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes. AAMAS 2020: 1064-1071
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/PitisCJB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PitisCJB20
Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba:
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality. ICLR 2020
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/PitisCZSB20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/PitisCZSB20
Silviu Pitis, Harris Chan, Stephen Zhao, Bradly C. Stadie, Jimmy Ba:
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning. ICML 2020: 7750-7761
[c4]
- view
  - electronic edition @ neurips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/PitisCG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/PitisCG20
Silviu Pitis, Elliot Creager, Animesh Garg:
Counterfactual Data Augmentation using Locally Factored Dynamics. NeurIPS 2020
2019
[c3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Pitis19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Pitis19
Silviu Pitis:
Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach. AAAI 2019: 7949-7956
2018
[c2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Pitis18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Pitis18
Silviu Pitis:
Source Traces for Temporal Difference Learning. AAAI 2018: 3952-3959
2017
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icail/Pitis17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icail/Pitis17
Silviu Pitis:
Methods for retrieving alternative contract language using a prototype. ICAIL 2017: 179-187

Informal and Other Publications

see FAQ

What is the meaning of the colors in the publication lists?

2024
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-14916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-14916
Silviu Pitis, Ziang Xiao, Nicolas Le Roux, Alessandro Sordoni:
Improving Context-Aware Preference Modeling for Language Models. CoRR abs/2407.14916 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00844
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00844
Blair Yang, Fuyang Cui, Keiran Paster, Jimmy Ba, Pashootan Vaezipoor, Silviu Pitis, Michael R. Zhang:
Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries. CoRR abs/2409.00844 (2024)
2023
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-05970
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-05970
Silviu Pitis, Michael R. Zhang, Andrew Wang, Jimmy Ba:
Boosted Prompt Ensembles for Large Language Models. CoRR abs/2304.05970 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-15817
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15817
Yangjun Ruan, Honghua Dong, Andrew Wang, Silviu Pitis, Yongchao Zhou, Jimmy Ba, Yann Dubois, Chris J. Maddison, Tatsunori Hashimoto:
Identifying the Risks of LM Agents with an LM-Emulated Sandbox. CoRR abs/2309.15817 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00435
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00435
Silviu Pitis:
Consistent Aggregation of Objectives with Diverse Time Preferences Requires Non-Markovian Rewards. CoRR abs/2310.00435 (2023)
2022
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-11287
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11287
Silviu Pitis, Elliot Creager, Ajay Mandlekar, Animesh Garg:
MoCoDA: Model-based Counterfactual Data Augmentation. CoRR abs/2210.11287 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01910
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01910
Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba:
Large Language Models Are Human-Level Prompt Engineers. CoRR abs/2211.01910 (2022)
2020
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-10092
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-10092
Silviu Pitis, Michael R. Zhang:
Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes. CoRR abs/2001.10092 (2020)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-05825
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-05825
Silviu Pitis, Harris Chan, Kiarash Jamali, Jimmy Ba:
An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality. CoRR abs/2002.05825 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-02832
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-02832
Silviu Pitis, Harris Chan, Stephen Zhao, Bradly C. Stadie, Jimmy Ba:
Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning. CoRR abs/2007.02832 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-02863
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-02863
Silviu Pitis, Elliot Creager, Animesh Garg:
Counterfactual Data Augmentation using Locally Factored Dynamics. CoRR abs/2007.02863 (2020)
2019
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02893
Silviu Pitis:
Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach. CoRR abs/1902.02893 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1902-02907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-02907
Silviu Pitis:
Source Traces for Temporal Difference Learning. CoRR abs/1902.02907 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-03906
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-03906
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves:
Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning. CoRR abs/1909.03906 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.