@CLUEbenchmark

CLUE benchmark

Organization for the Chinese Language Understanding Evaluation (CLUE) benchmark: tasks and datasets, baselines, pre-trained Chinese models, corpus, and leaderboard

Pinned

  1. CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python, 4k stars, 540 forks

  2. SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3k stars, 97 forks

  3. SuperCLUE-Safety Public

    SC-Safety: a multi-turn adversarial safety benchmark for Chinese large language models

    105 stars, 7 forks

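  4. SuperCLUE-Auto Public

    An evaluation benchmark for Chinese large language models in the automotive industry, with fine-grained evaluation based on multi-turn open-ended questions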

    29 stars, 3 forks

  5. SuperCLUE-Agent Public

    SuperCLUE-Agent: an evaluation benchmark for core agent capabilities, based on native Chinese tasks

    78 stars, 2 forks

  6. SuperCLUE-RAG Public

    A native Chinese evaluation benchmark for retrieval-augmented generation (RAG)

    100 stars, 3 forks

Repositories

Showing 10 of 50 repositories
  • 2024h1 Public

    Benchmark evaluation report of Chinese LLMs, first half of 2024

    1 star, 0 forks, 1 open issue, 0 open pull requests. Updated Jul 9, 2024
  • SuperCLUE-Video Public

    A native Chinese, multi-level evaluation benchmark for text-to-video generation

    17 stars, 1 fork, 0 open issues, 0 open pull requests. Updated Jul 8, 2024
  • SuperCLUE-V Public

    A native Chinese evaluation benchmark for multimodal understanding (evaluation plan)

    3 stars, 0 forks, 0 open issues, 0 open pull requests. Updated Jul 8, 2024
  • SuperCLUE-Long Public

    A native Chinese evaluation benchmark for long-text processing

    5 stars, 0 forks, 0 open issues, 0 open pull requests. Updated Jul 8, 2024
  • SuperCLUE-Image Public

    A native Chinese evaluation benchmark for text-to-image generation

    7 stars, 0 forks, 0 open issues, 0 open pull requests. Updated Jul 8, 2024
  • SuperCLUE-Coder Public

    A native Chinese evaluation benchmark for code assistants, at the product level

    0 stars, 0 forks, 0 open issues, 0 open pull requests. Updated Jul 8, 2024
  • SuperCLUElyb Public

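    SuperCLUE Langya Leaderboard (琅琊榜): an anonymous head-to-head evaluation benchmark for general-purpose Chinese large language models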

    141 stars, 6 forks, 3 open issues, 1 open pull request. Updated Jun 19, 2024
  • SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    2,999 stars, 97 forks, 35 open issues, 0 open pull requests. Updated May 23, 2024
  • CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python, 4,009 stars, 540 forks, 78 open issues, 2 open pull requests. Updated May 23, 2024
  • SuperCLUE-Fin Public

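    An evaluation benchmark for Chinese financial large language models: 25 tasks in six categories with graded evaluation; models from China achieved an A grade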

    7 stars, 0 forks, 0 open issues, 0 open pull requests. Updated May 6, 2024