Hi, my name is Mingxuan Song.
I'm currently pursuing my PhD at Peking University, under the supervision of Professor Zhen Xiao.
About me
I am currently a PhD student in Computer Systems Architecture at the School of Computer Science, Peking University, under the supervision of Professor Zhen Xiao, with an expected graduation in July 2028. I received my bachelor's degree in computer science and technology from China University of Geosciences, Wuhan, in 2023. My research interests include reinforcement learning (RL), sharding blockchain, and large language models (LLMs).
My goal is to continuously grow both professionally and personally, while maintaining a healthy and fulfilling lifestyle. I am also actively seeking internship and job opportunities worldwide. Feel free to contact me!
📍 Location: Beijing, China
🎯 Hobbies: 🏸 Badminton | 🎱 Billiards | 🏓 Table Tennis | 🏃♂️ Running
View Resume📚 Publications
Scholar📰 News
🌟 Projects
Few-Shot RL Fine-Tuning for LLMs
Affiliations:
School of Computer Science, Peking University;
Alimama, Alibaba Group.
In recent years, Large Language Models (LLMs) have demonstrated remarkable performance across a variety of natural language processing tasks. However, fine-tuning these models typically requires large-scale datasets and extensive computational resources, which limits their applicability in scenarios where data is scarce and budgets are constrained. This work explores a novel approach to few-shot reinforcement learning (RL) fine-tuning for LLMs, aiming to adapt pre-trained models to specific tasks using minimal supervision.
RL for Efficient Sharding Blockchain
Affiliations:
School of Computer Science, Peking University;
Theta Labs, Theta Inc.
Sharding blockchain systems face critical challenges in achieving efficient cross-shard data distribution and maintaining balanced workload across shards. Traditional address allocation methods often suffer from high latency and uneven shard utilization, especially when dealing with dynamically changing transaction patterns and reconfiguration events.
Evolutionary RL for Sensor Placement in Water Supply Networks
Affiliations:
School of Computer Science, China University of Geosciences, Wuhan.
In water supply networks, effective sensor placement is critical for early detection of contamination events, yet practical deployments are constrained by limited sensor budgets and scarce historical contamination data. This work investigates an evolutionary reinforcement learning formulation of the sensor placement problem, modeling it as a sequential decision-making process under limited supervision. By incorporating domain knowledge into an evolutionary reinforcement learning framework, the proposed approach enables efficient optimization.
🧑🤝🧑 Friends
Shenglin Yin