Augmenting Limited and Biased RCTs through Pseudo-Sample Matching-Based Observational Data Fusion Method
Zhen Peng
Senior AI/ML expert and marketplace algorithm leader working on large-scale decision systems.
I lead supply-demand adjustment strategy for the ride-hailing marketplace at DiDi Global, where my team builds algorithms for marketplace optimization, dynamic pricing, intelligent subsidies, growth marketing, and AI-assisted operations. My work combines causal inference, reinforcement learning, forecasting, operations research, and LLM agents to improve real-time and long-horizon business decisions.
Before DiDi, I worked on distributed computing infrastructure at Baidu, where I was a founding engineer of the Baidu Spark computing engine and helped scale Hadoop/Spark systems across thousand-node clusters. Across industry and research collaborations, I focus on turning rigorous modeling into production systems that operate at marketplace scale.
News
- 2025 Paper accepted at CIKM on augmenting limited and biased RCTs with observational data fusion.
- 2025 AAAI oral paper on decision-focused fine-tuning for predict-then-optimize systems.
- 2024-2025 DiDi ride-hailing and CTO-line projects recognized with multiple outstanding project awards.
- 2024 Recognized as an outstanding reviewer by DiDi's Algorithm Committee.
Research
My applied research centers on algorithmic decision-making in two-sided marketplaces: estimating price elasticity and supply-demand dynamics, designing multi-objective pricing and subsidy mechanisms, optimizing long-term user value, and integrating LLM agents into human-in-the-loop decision workflows.
I collaborate with university labs including Tsinghua University and Peking University on marketplace optimization, causal inference, reinforcement learning, and decision-focused learning, with publications at AAAI and CIKM and several productionized research systems.
Interests
- Marketplace optimization
- Dynamic pricing and revenue management
- Causal inference and elasticity modeling
- Reinforcement learning for decision systems
- Forecasting and operations research
- LLM agents and AI copilots
- Distributed ML infrastructure
Selected Work
DFF: Decision-Focused Fine-tuning for Smarter Predict-then-Optimize with Limited Data
Contrastive Balancing Representation Learning for Heterogeneous Dose-Response Curves Estimation
Code
My engineering work spans real-time pricing and subsidy decision engines, offline ML pipelines, experiment and evaluation platforms, feature systems, strategy workbenches, LLM application infrastructure, RAG/NL2SQL systems, and multi-tool Agent products.
Representative internal systems process hundreds of millions of online decisions per day and support intelligent planning over large-scale annual marketplace budgets.