Change the repository type filter
All
Repositories list
33 repositories
- A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)
MEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]TheoremExplainAgent
PublicOfficial Repo for "TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding"VLM2Vec
PublicABC
PublicMMLU-Pro
PublicScholarCopilot
PublicLongICLBench
PublicCode and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]VideoScore
PublicVideoGenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional video generation models.CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"PixelWorld
PublicKB-BINDER
PublicOmniEdit
PublicOfficial Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]TIGERScore
Public"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]VIEScore
PublicVisual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)AnyV2V
PublicCode and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)MAmmoTH2
PublicStructLM
PublicLLM-AMT
PublicUniIR
PublicGenAI-Bench
PublicCode and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]MAmmoTH
PublicConsistI2V
PublicConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)TheoremQA
PublicProgram-of-Thoughts
Public