Folgen
Jiechao Xiong
Jiechao Xiong
Tencent AI Lab
Bestätigte E-Mail-Adresse bei tencent.com
Titel
Zitiert von
Zitiert von
Jahr
Parametrized deep q-networks learning: Reinforcement learning with discrete-continuous hybrid action space
J Xiong, Q Wang, Z Yang, P Sun, L Han, Y Zheng, H Fu, T Zhang, J Liu, ...
arXiv preprint arXiv:1810.06394, 2018
1802018
Exponentially weighted imitation learning for batched historical data
Q Wang, J Xiong, L Han, H Liu, T Zhang
Advances in Neural Information Processing Systems 31, 2018
1092018
Tstarbots: Defeating the cheating level builtin ai in starcraft ii in the full game
P Sun, X Sun, L Han, J Xiong, Q Wang, B Li, Y Zheng, J Liu, Y Liu, H Liu, ...
arXiv preprint arXiv:1809.07193, 2018
782018
Sparse recovery via differential inclusions
S Osher, F Ruan, J Xiong, Y Yao, W Yin
Applied and Computational Harmonic Analysis 41 (2), 436-469, 2016
692016
Robust subjective visual property prediction from crowdsourced pairwise labels
Y Fu, TM Hospedales, T Xiang, J Xiong, S Gong, Y Wang, Y Yao
IEEE transactions on pattern analysis and machine intelligence 38 (3), 563-577, 2015
562015
Grid-wise control for multi-agent reinforcement learning in video game ai
L Han, P Sun, Y Du, J Xiong, Q Wang, X Sun, H Liu, T Zhang
International Conference on Machine Learning, 2576-2585, 2019
512019
Robust evaluation for quality of experience in crowdsourcing
Q Xu, J Xiong, Q Huang, Y Yao
Proceedings of the 21st ACM international conference on Multimedia, 43-52, 2013
312013
Tstarbot-x: An open-sourced and comprehensive study for efficient league training in starcraft ii full game
L Han, J Xiong, P Sun, X Sun, M Fang, Q Guo, Q Chen, T Shi, H Yu, X Wu, ...
arXiv preprint arXiv:2011.13729, 2020
292020
Stochastic non-convex ordinal embedding with stabilized barzilai-borwein step size
K Ma, J Zeng, J Xiong, Q Xu, X Cao, W Liu, Y Yao
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
252018
Split LBI: An iterative regularization path with structural sparsity
C Huang, X Sun, J Xiong, Y Yao
Advances In Neural Information Processing Systems 29, 2016
222016
Exploring outliers in crowdsourced ranking for qoe
Q Xu, M Yan, C Huang, J Xiong, Q Huang, Y Yao
Proceedings of the 25th ACM international conference on Multimedia, 1540-1548, 2017
182017
Tleague: A framework for competitive self-play based distributed multi-agent reinforcement learning
P Sun, J Xiong, L Han, X Sun, S Li, J Xu, M Fang, Z Zhang
arXiv preprint arXiv:2011.12895, 2020
162020
Online HodgeRank on random graphs for crowdsourceable QoE evaluation
Q Xu, J Xiong, Q Huang, Y Yao
IEEE Transactions on Multimedia 16 (2), 373-386, 2013
162013
Zeroth-order supervised policy improvement
H Sun, Z Xu, Y Song, M Fang, J Xiong, B Dai, B Zhou
arXiv preprint arXiv:2006.06600, 2020
142020
Boosting with structural sparsity: A differential inclusion approach
C Huang, X Sun, J Xiong, Y Yao
Applied and Computational Harmonic Analysis 48 (1), 1-45, 2020
132020
Hodgerank with information maximization for crowdsourced pairwise ranking aggregation
Q Xu, J Xiong, X Chen, Q Huang, Y Yao
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
132018
From social to individuals: A parsimonious path of multi-level models for crowdsourced preference aggregation
Q Xu, J Xiong, X Cao, Q Huang, Y Yao
IEEE transactions on pattern analysis and machine intelligence 41 (4), 844-856, 2018
132018
Divergence-augmented policy optimization
Q Wang, Y Li, J Xiong, T Zhang
Advances in Neural Information Processing Systems 32, 2019
122019
Analysis of crowdsourced sampling strategies for hodgerank with sparse random graphs
B Osting, J Xiong, Q Xu, Y Yao
Applied and Computational Harmonic Analysis 41 (2), 540-560, 2016
122016
False discovery rate control and statistical quality assessment of annotators in crowdsourced ranking
Q Xu, J Xiong, X Cao, Y Yao
International conference on machine learning, 1282-1291, 2016
102016
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20