File tree
1,488 files changed
+57924
-24
lines changed- configs/datasets
- LCBench
- bbh
- ceval
- cmmlu
- mmlu_pro
- winograd
- opencompass
- cli
- configs
- datasets
- ARC_c
- ARC_e
- CHARM
- few-shot-examples
- few-shot-examples_Translate-EN
- CIBench
- CLUE_C3
- CLUE_CMRC
- CLUE_DRCD
- CLUE_afqmc
- CLUE_cmnli
- CLUE_ocnli
- ChemBench
- FewCLUE_bustm
- FewCLUE_chid
- FewCLUE_cluewsc
- FewCLUE_csl
- FewCLUE_eprstmt
- FewCLUE_ocnli_fc
- FewCLUE_tnews
- FinanceIQ
- GLUE_CoLA
- GLUE_MRPC
- GLUE_QQP
- GaokaoBench
- IFEval
- LCBench
- MMLUArabic
- MathBench
- MedBench
- NPHardEval
- OpenFinData
- PJExam
- QuALITY
- SVAMP
- SuperGLUE_AX_b
- SuperGLUE_AX_g
- SuperGLUE_BoolQ
- SuperGLUE_CB
- SuperGLUE_COPA
- SuperGLUE_MultiRC
- SuperGLUE_RTE
- SuperGLUE_ReCoRD
- SuperGLUE_WSC
- SuperGLUE_WiC
- TabMWP
- TheoremQA
- XCOPA
- XLSum
- Xsum
- adv_glue
- adv_glue_mnli
- adv_glue_mnli_mm
- adv_glue_qnli
- adv_glue_qqp
- adv_glue_rte
- adv_glue_sst2
- agieval
- anli
- anthropics_evals
- apps
- bbh
- lib_prompt
- ceval
- civilcomments
- clozeTest_maxmin
- cmb
- cmmlu
- collections
- leaderboard
- commonsenseqa
- commonsenseqa_cn
- compassbench_20_v1_1
- agent
- code
- knowledge
- language
- math
- reason
- compassbench_20_v1_1_public
- agent
- code
- knowledge
- language
- math
- reason
- compassbench_v1_3
- contamination
- crowspairs
- crowspairs_cn
- cvalues
- demo
- drop
- ds1000
- flames
- flores
- game24
- govrepcrs
- gpqa
- gsm8k
- gsm8k_contamination
- gsm_hard
- hellaswag
- humaneval
- humaneval_cn
- humaneval_multi
- humaneval_plus
- humanevalx
- hungarian_exam
- inference_ppl
- infinitebench
- infinitebenchcodedebug
- infinitebenchcoderun
- infinitebenchendia
- infinitebenchenmc
- infinitebenchenqa
- infinitebenchensum
- infinitebenchmathcalc
- infinitebenchmathfind
- infinitebenchretrievekv
- infinitebenchretrievenumber
- infinitebenchretrievepasskey
- infinitebenchzhqa
- iwslt2017
- jigsawmultilingual
- kaoshi
- lambada
- lawbench
- lcsts
- leval
- levalcoursera
- levalfinancialqa
- levalgovreportsumm
- levalgsm100
- levallegalcontractqa
- levalmeetingsumm
- levalmultidocqa
- levalnarrativeqa
- levalnaturalquestion
- levalnewssumm
- levalpaperassistant
- levalpatentsumm
- levalquality
- levalreviewsumm
- levalscientificqa
- levaltopicretrieval
- levaltpo
- levaltvshowsumm
- llm_compression
- longbench
- longbench2wikimqa
- longbenchdureader
- longbenchgov_report
- longbenchhotpotqa
- longbenchlcc
- longbenchlsht
- longbenchmulti_news
- longbenchmultifieldqa_en
- longbenchmultifieldqa_zh
- longbenchmusique
- longbenchnarrativeqa
- longbenchpassage_count
- longbenchpassage_retrieval_en
- longbenchpassage_retrieval_zh
- longbenchqasper
- longbenchqmsum
- longbenchrepobench
- longbenchsamsum
- longbenchtrec
- longbenchtriviaqa
- longbenchvcsum
- lveval
- lvevalcmrc_mixup
- lvevaldureader_mixup
- lvevalfactrecall_en
- lvevalfactrecall_zh
- lvevalhotpotwikiqa_mixup
- lvevallic_mixup
- lvevalloogle_CR_mixup
- lvevalloogle_MIR_mixup
- lvevalloogle_SD_mixup
- lvevalmultifieldqa_en_mixup
- lvevalmultifieldqa_zh_mixup
- mastermath2024v1
- math
- math401
- mbpp
- mbpp_cn
- mbpp_plus
- mgsm
- mmlu
- mmlu_pro
- narrativeqa
- needlebench
- atc
- needlebench_1000k
- needlebench_128k
- needlebench_200k
- needlebench_256k
- needlebench_32k
- needlebench_4k
- needlebench_8k
- nq
- nq_cn
- obqa
- piqa
- promptbench
- py150
- qabench
- qasper
- qaspercut
- race
- realtoxicprompts
- rolebench
- s3eval
- safety
- scibench
- lib_prompt
- siqa
- squad20
- storycloze
- strategyqa
- subjective
- alignbench
- alpaca_eval
- arena_hard
- compassarena
- compassbench
- creationbench
- fofo
- multiround
- subjective_cmp
- wildbench
- summedits
- summscreen
- taco
- teval
- triviaqa
- triviaqarc
- truthfulqa
- tydiqa
- wikibench
- wikitext
- winograd
- winogrande
- xiezhi
- models
- accessory
- alaya
- aquila
- baichuan
- bluelm
- chatglm
- claude
- codegeex2
- codellama
- deepseek
- falcon
- gemini
- gemma
- hf_internlm
- hf_llama
- internlm
- judge_llm
- auto_j
- judgelm
- pandalm
- lemur
- lingowhale
- mistral
- moss
- mpt
- ms_internlm
- nanbeige
- openai
- openbmb
- opt
- others
- phi
- pulse
- qwen
- rwkv
- skywork
- tigerbot
- vicuna
- wizardcoder
- wizardlm
- yi
- zephyr
- summarizers
- groups
- legacy
- datasets
- models
- partitioners
- runners
- tasks
- requirements
- tools
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
1,488 files changed
+57924
-24
lines changedOriginal file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
10 | 10 |
| |
11 | 11 |
| |
12 | 12 |
| |
13 |
| - | |
14 |
| - | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
15 | 18 |
| |
16 | 19 |
| |
17 | 20 |
| |
| |||
88 | 91 |
| |
89 | 92 |
| |
90 | 93 |
| |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
91 | 138 |
| |
92 | 139 |
| |
93 | 140 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
13 | 13 |
| |
14 | 14 |
| |
15 | 15 |
| |
16 |
| - | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
17 | 20 |
| |
18 | 21 |
| |
19 | 22 |
| |
| |||
39 | 42 |
| |
40 | 43 |
| |
41 | 44 |
| |
42 |
| - | |
| 45 | + | |
| 46 | + | |
43 | 47 |
| |
44 | 48 |
| |
45 | 49 |
| |
| |||
90 | 94 |
| |
91 | 95 |
| |
92 | 96 |
| |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
| 107 | + | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
| 119 | + | |
| 120 | + | |
| 121 | + | |
| 122 | + | |
| 123 | + | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
93 | 144 |
| |
94 | 145 |
| |
95 | 146 |
| |
|
0 commit comments