Publications
2025
-
arXivScientists’ First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and ReasoningIn arXiv , 2025
-
arXiv
-
arXiv
-
arXiv
-
arXivPhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal ModelsIn arXiv , 2025
-
arXiv
-
arXiv
-
arXivOmniEarth-Bench: Towards Holistic Evaluation of Earth’s Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth DataIn arXiv , 2025
-
arXiv
-
arXivAlign-DA: Align Score-based Atmospheric Data Assimilation with Multiple PreferencesIn arXiv , 2025
-
arXivEarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language ModelsIn arXiv , 2025
-
arXiv
-
ICCVDecouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible FusionIn Accepted to IEEE/CVF International Conference on Computer Vision , 2025
-
CEEThe operational medium-range deterministic weather forecasting can be extended beyond a 10-day lead timeIn Communications Earth & Environment , 2025
-
ICLRWeatherGFM: Learning A Weather Generalist Foundation Model via In-context LearningIn International Conference on Learning Representations , 2025
-
ICLRPostcast: Generalizable postprocessing for precipitation nowcasting via unsupervised blurriness modelingIn International Conference on Learning Representations , 2025
-
ICASSPDiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite ObservationsIn International Conference on Acoustics, Speech, and Signal Processing , 2025
2024
-
CoLM
-
arXiv
-
EMNLPUniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and GenerationIn Empirical Methods in Natural Language Processing , 2024
-
MM
-
NeurIPSGeneralizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid ModelingIn Advances in Neural Information Processing Systems , 2024