Publications
2025
- arXiv
  Scientists’ First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning
  In arXiv, 2025
- arXiv
- arXiv
- arXiv
- arXiv
  PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models
  In arXiv, 2025
- arXiv
- arXiv
- arXiv
  OmniEarth-Bench: Towards Holistic Evaluation of Earth’s Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data
  In arXiv, 2025
- arXiv
- arXiv
  Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
  In arXiv, 2025
- arXiv
  EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models
  In arXiv, 2025
- arXiv
- ICCV
  Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion
  Accepted to IEEE/CVF International Conference on Computer Vision, 2025
- CEE
  The operational medium-range deterministic weather forecasting can be extended beyond a 10-day lead time
  In Communications Earth & Environment, 2025
- ICLR
  WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning
  In International Conference on Learning Representations, 2025
- ICLR
  Postcast: Generalizable postprocessing for precipitation nowcasting via unsupervised blurriness modeling
  In International Conference on Learning Representations, 2025
- ICASSP
  DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations
  In International Conference on Acoustics, Speech, and Signal Processing, 2025
2024
- CoLM
- arXiv
- EMNLP
  UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation
  In Empirical Methods in Natural Language Processing, 2024
- MM
- NeurIPS
  Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling
  In Advances in Neural Information Processing Systems, 2024