Yuxin Wang
Verified email at comp.hkbu.edu.hk
Title
Cited by
Year
A survey of deep learning techniques for neural machine translation
S Yang, Y Wang, X Chu
arXiv preprint arXiv:2002.07526, 2020
Cited by: 166; Year: 2020
A distributed synchronous SGD algorithm with global top-k sparsification for low bandwidth networks
S Shi, Q Wang, K Zhao, Z Tang, Y Wang, X Huang, X Chu
2019 IEEE 39th International Conference on Distributed Computing Systems …, 2019
Cited by: 132; Year: 2019
The impact of GPU DVFS on the energy and performance of deep learning: An empirical study
Z Tang, Y Wang, Q Wang, X Chu
Proceedings of the Tenth ACM International Conference on Future Energy …, 2019
Cited by: 69; Year: 2019
Benchmarking the performance and energy efficiency of AI accelerators for AI training
Y Wang, Q Wang, S Shi, X He, Z Tang, K Zhao, X Chu
2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020
Cited by: 47; Year: 2020
Computer-aided clinical skin disease diagnosis using CNN and object detection models
X He, S Wang, S Shi, Z Tang, Y Wang, Z Zhao, J Dai, R Ni, X Zhang, X Liu, ...
2019 IEEE International Conference on Big Data (Big Data), 4839-4844, 2019
Cited by: 11; Year: 2019
Energy-efficient Inference Service of Transformer-based Deep Learning Models on GPUs
Y Wang, Q Wang, X Chu
2020 International Conferences on IEEE Green Computing and Communications …, 2020
Cited by: 5; Year: 2020
FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs
Z Tang, Y Wang, X He, L Zhang, X Pan, Q Wang, R Zeng, K Zhao, S Shi, ...
arXiv preprint arXiv:2309.01172, 2023
Cited by: 3; Year: 2023
FedML Parrot: A scalable federated learning system via heterogeneity-aware scheduling on sequential and hierarchical training
Z Tang, X Chu, RY Ran, S Lee, S Shi, Y Zhang, Y Wang, AQ Liang, ...
arXiv preprint arXiv:2303.01778, 2023
Cited by: 3; Year: 2023
NAS-LID: efficient neural architecture search with local intrinsic dimension
X He, J Yao, Y Wang, Z Tang, KC Cheung, S See, B Han, X Chu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 7839-7847, 2023
Cited by: 2; Year: 2023
Energy-efficient Online Scheduling of Transformer Inference Services on GPU Servers
Y Wang, Q Wang, X Chu
IEEE Transactions on Green Communications and Networking, 2022
Cited by: 1; Year: 2022
Towards Efficient and Reliable LLM Serving: A Real-World Workload Study
Y Wang, Y Chen, Z Li, Z Tang, R Guo, X Wang, Q Wang, AC Zhou, X Chu
arXiv preprint arXiv:2401.17644, 2024
Year: 2024
Reliable and Efficient In-Memory Fault Tolerance of Large Language Model Pretraining
Y Wang, S Shi, X He, Z Tang, X Pan, Y Zheng, X Wu, AC Zhou, B He, ...
arXiv preprint arXiv:2310.12670, 2023
Year: 2023