Tianyuan Jin
Research Fellow
Department of Electrical and Computer Engineering
National University of Singapore
I am a Research Fellow at the Department of Electrical and Computer Engineering, National University of Singapore, working with Prof. Jonathan Scarlett and Prof. Vincent Y. F. Tan. I obtained my Ph.D. in Computer Science from the National University of Singapore in 2024, advised by Prof. Xiaokui Xiao. Before that, I received my M.S. in Computer Science advised by Prof. Enhong Chen and B.S. in Mathematics from the University of Science and Technology of China.
My research interests include bandit algorithms, reinforcement learning, and online learning.
[CV] [Google Scholar] [Email]
Contact
Email: tianyuan1044 [at] gmail [dot] com
Recent papers
* denotes equal contribution. + denotes alphabetical author order.
- Avoiding exp(k*) Scaling for Thompson Sampling in Combinatorial Semi-Bandits: From Multiple Seeds to a Single Seed COLT 2026.
- Best Arm Identification with Minimal Regret JMLR 2026.
- Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention TMLR 2026.
- Breaking the Total Variance Barrier: Sharp Sample Complexity for Linear Heteroscedastic Bandits with Fixed Action Set ICLR 2026.
- SteerConf: Steering LLMs for Confidence Elicitation NeurIPS 2025.
- Breaking the log(1/Δ2) Barrier: Better Batched Best Arm Identification with Adaptive Grids ICLR 2025.
- Optimal Batched Best Arm Identification NeurIPS 2024.
- Sparsity-Agnostic Linear Bandits with Adaptive Adversaries NeurIPS 2024.
- Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs AAAI 2024. Oral presentation.
Publications
2026
- Avoiding exp(k*) Scaling for Thompson Sampling in Combinatorial Semi-Bandits: From Multiple Seeds to a Single Seed Annual Conference on Learning Theory (COLT), 2026.
- Best Arm Identification with Minimal Regret Journal of Machine Learning Research, 2026+.
- Asymptotically and Minimax Optimal Regret Bounds for Multi-Armed Bandits with Abstention Transactions on Machine Learning Research, 2026.
- Breaking the Total Variance Barrier: Sharp Sample Complexity for Linear Heteroscedastic Bandits with Fixed Action Set International Conference on Learning Representations (ICLR), 2026.
2025
- SteerConf: Steering LLMs for Confidence Elicitation Advances in Neural Information Processing Systems (NeurIPS), 2025.
- Breaking the log(1/Δ2) Barrier: Better Batched Best Arm Identification with Adaptive Grids International Conference on Learning Representations (ICLR), 2025.
2024
- Optimal Batched Best Arm Identification Advances in Neural Information Processing Systems (NeurIPS), 2024.
- Sparsity-Agnostic Linear Bandits with Adaptive Adversaries Advances in Neural Information Processing Systems (NeurIPS), 2024.
- Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs AAAI Conference on Artificial Intelligence (AAAI), 2024. Oral presentation.
- Optimal Batched Linear Bandits International Conference on Machine Learning (ICML), 2024.
2023
- Thompson Sampling with Less Exploration is Fast and Optimal International Conference on Machine Learning (ICML), 2023.
2022
- Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits Advances in Neural Information Processing Systems (NeurIPS), 2022.
2021
- MOTS: Minimax Optimal Thompson Sampling International Conference on Machine Learning (ICML), 2021.
- Optimal Streaming Algorithms for Multi-Armed Bandits International Conference on Machine Learning (ICML), 2021.
- Almost Optimal Anytime Algorithm for Batched Multi-Armed Bandits International Conference on Machine Learning (ICML), 2021.
- Double Explore-then-Commit: Asymptotic Optimality and Beyond Conference on Learning Theory (COLT), 2021.
- Unconstrained Submodular Maximization with Modular Costs: Tight Approximation and Application to Profit Maximization Proceedings of the VLDB Endowment (PVLDB), 2021.
2020 and earlier
- Realtime Index-Free Single Source SimRank Processing on Web-Scale Graphs Proceedings of the VLDB Endowment (PVLDB), 2020.
- Realtime Top-k Personalized PageRank over Large Graphs on GPUs Proceedings of the VLDB Endowment (PVLDB), 2020.
- Efficient Pure Exploration in Adaptive Round Model Advances in Neural Information Processing Systems (NeurIPS), 2019.
- Tracking Top-k Influential Users with Relative Errors ACM International Conference on Information and Knowledge Management (CIKM), 2019.
- Maximizing the Effect of Information Adoption: A General Framework SIAM International Conference on Data Mining (SDM), 2018.
Awards
- Dean's Graduate Research Excellence Award, 2024.
- Google PhD Fellowship, 2021.
Professional service