Change the repository type filter
All
Repositories list
33 repositories
ImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)MEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]- Official Repo for "TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding"
ABC
PublicMMLU-Pro
PublicVISTA
PublicLongICLBench
PublicCode and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]VideoScore
PublicVideoGenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional video generation models.- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
PixelWorld
PublicKB-BINDER
PublicOmniEdit
PublicOfficial Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]TIGERScore
Public"TIGERScore: Towards Building Explainable Metric for All Text Generation Tasks" [TMLR 2024]VIEScore
PublicVisual Instruction-guided Explainable Metric. Code for "Towards Explainable Metrics for Conditional Image Synthesis Evaluation" (ACL 2024)AnyV2V
PublicCode and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" (TMLR 2024)MAmmoTH2
PublicStructLM
PublicLLM-AMT
PublicUniIR
PublicGenAI-Bench
PublicCode and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]LongRAG
PublicConsistI2V
PublicConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation (TMLR 2024)TheoremQA
Public