Publications
2026
- ICLR | PICABench: How Far Are We from Physically Realistic Image Editing? In International Conference on Learning Representations, 2026
- AAAI oral | SynWeather: Weather Observation Data Synthesis across Multiple Regions and Variables via a General Diffusion Transformer. In AAAI Conference on Artificial Intelligence, 2026
- ICLR | Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding. In International Conference on Learning Representations, 2026
- CVPR | FinPercep-RM: A Fine-grained Reward Model and Co-evolutionary Curriculum for RL-based Real-world Super-Resolution. In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
- arXiv | SciDataCopilot: An Agentic Data Preparation Framework for AGI-driven Scientific Discovery. In arXiv, 2026
- arXiv | InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery. In arXiv, 2026
2025
- SCIS | Large multimodal models evaluation: a survey. In Science China Information Sciences, 2025
- TCSVT | SynCast: Synergizing Contradictions in Precipitation Nowcasting via Diffusion Sequential Preference Optimization. In IEEE Transactions on Circuits and Systems for Video Technology, 2025
- arXiv | Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows. In arXiv, 2025
- arXiv
- arXiv | ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning. In arXiv, 2025
- arXiv
- arXiv | UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture. In arXiv, 2025
- arXiv | SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence. In arXiv, 2025
- arXiv
- arXiv
- arXiv
- arXiv | Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning. In arXiv, 2025
- arXiv | A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers. In arXiv, 2025
- arXiv
- arXiv
- arXiv
- arXiv
- arXiv | PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models. In arXiv, 2025
- arXiv
- arXiv
- arXiv | OmniEarth-Bench: Towards Holistic Evaluation of Earth’s Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data. In arXiv, 2025
- arXiv
- arXiv | EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models. In arXiv, 2025
- arXiv
- NeurIPS | RadarQA: Multi-modal Quality Analysis of Weather Radar Forecasts. In Advances in Neural Information Processing Systems, 2025
- NeurIPS | DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space. In Advances in Neural Information Processing Systems, 2025
- NeurIPS | Latent Harmony: Synergistic Unified UHD Image Restoration via Latent Space Regularization and Controllable Refinement. In Advances in Neural Information Processing Systems, 2025
- NeurIPS | Scientists’ First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning. In Advances in Neural Information Processing Systems, 2025
- NeurIPS | Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation. In Advances in Neural Information Processing Systems, 2025
- NeurIPS | Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences. In Advances in Neural Information Processing Systems, 2025
- ICCV | Decouple to Reconstruct: High Quality UHD Restoration via Active Feature Disentanglement and Reversible Fusion. In IEEE/CVF International Conference on Computer Vision, 2025
- CEE | The operational medium-range deterministic weather forecasting can be extended beyond a 10-day lead time. In Communications Earth & Environment, 2025
- ICLR | WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning. In International Conference on Learning Representations, 2025
- ICLR | Postcast: Generalizable postprocessing for precipitation nowcasting via unsupervised blurriness modeling. In International Conference on Learning Representations, 2025
- ICASSP | DiffSR: Learning Radar Reflectivity Synthesis via Diffusion Model from Satellite Observations. In International Conference on Acoustics, Speech, and Signal Processing, 2025
2024
- CoLM
- arXiv
- EMNLP | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation. In Empirical Methods in Natural Language Processing, 2024
- MM
- NeurIPS | Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling. In Advances in Neural Information Processing Systems, 2024