File tree
648 files changed
+4879
-4891
lines changed- configs
- api_examples
- dataset_collections
- datasets
- ARC_c
- ARC_e
- CIBench
- CLUE_C3
- CLUE_CMRC
- CLUE_DRCD
- CLUE_afqmc
- CLUE_cmnli
- CLUE_ocnli
- ChemBench
- FewCLUE_bustm
- FewCLUE_chid
- FewCLUE_cluewsc
- FewCLUE_csl
- FewCLUE_eprstmt
- FewCLUE_ocnli_fc
- FewCLUE_tnews
- FinanceIQ
- GLUE_CoLA
- GLUE_MRPC
- GLUE_QQP
- GaokaoBench
- IFEval
- MMLUArabic
- MathBench
- MedBench
- NPHardEval
- OpenFinData
- PJExam
- QuALITY
- SVAMP
- SuperGLUE_AX_b
- SuperGLUE_AX_g
- SuperGLUE_BoolQ
- SuperGLUE_CB
- SuperGLUE_COPA
- SuperGLUE_MultiRC
- SuperGLUE_RTE
- SuperGLUE_ReCoRD
- SuperGLUE_WSC
- SuperGLUE_WiC
- TabMWP
- TheoremQA
- XCOPA
- Xsum
- adv_glue
- adv_glue_mnli
- adv_glue_mnli_mm
- adv_glue_qnli
- adv_glue_qqp
- adv_glue_rte
- adv_glue_sst2
- agieval
- anli
- anthropics_evals
- apps
- bbh
- ceval
- civilcomments
- clozeTest_maxmin
- cmb
- cmmlu
- collections
- commonsenseqa
- commonsenseqa_cn
- contamination
- crowspairs
- crowspairs_cn
- cvalues
- ds1000
- flames
- flores
- game24
- govrepcrs
- gsm8k
- gsm8k_contamination
- gsm_hard
- hellaswag
- humanevalx
- hungarian_exam
- infinitebench
- infinitebenchcodedebug
- infinitebenchcoderun
- infinitebenchendia
- infinitebenchenmc
- infinitebenchenqa
- infinitebenchensum
- infinitebenchmathcalc
- infinitebenchmathfind
- infinitebenchretrievekv
- infinitebenchretrievenumber
- infinitebenchretrievepasskey
- infinitebenchzhqa
- iwslt2017
- jigsawmultilingual
- kaoshi
- lawbench
- leval
- levalcoursera
- levalfinancialqa
- levalgovreportsumm
- levallegalcontractqa
- levalmeetingsumm
- levalmultidocqa
- levalnarrativeqa
- levalnaturalquestion
- levalnewssumm
- levalpaperassistant
- levalpatentsumm
- levalquality
- levalreviewsumm
- levalscientificqa
- levaltopicretrieval
- levaltpo
- levaltvshowsumm
- llm_compression
- longbench
- lveval
- lvevalcmrc_mixup
- lvevaldureader_mixup
- lvevalfactrecall_en
- lvevalfactrecall_zh
- lvevalhotpotwikiqa_mixup
- lvevallic_mixup
- lvevalloogle_CR_mixup
- lvevalloogle_MIR_mixup
- lvevalloogle_SD_mixup
- lvevalmultifieldqa_en_mixup
- lvevalmultifieldqa_zh_mixup
- mastermath2024v1
- math
- math401
- mbpp
- mbpp_cn
- mbpp_plus
- mgsm
- mmlu
- narrativeqa
- needlebench
- atc
- needlebench_1000k
- needlebench_128k
- needlebench_200k
- needlebench_256k
- needlebench_32k
- needlebench_4k
- needlebench_8k
- nq
- nq_cn
- obqa
- piqa
- promptbench
- py150
- qabench
- qasper
- qaspercut
- race
- realtoxicprompts
- rolebench
- s3eval
- scibench
- siqa
- storycloze
- subjective
- alignbench
- alpaca_eval
- arena_hard
- compassarena
- creationbench
- multiround
- subjective_cmp
- summedits
- summscreen
- taco
- teval
- triviaqa
- triviaqarc
- truthfulqa
- tydiqa
- wikibench
- wikitext
- winograd
- winogrande
- xiezhi
- z_bench
- models
- accessory
- alaya
- aquila
- baichuan
- bluelm
- claude
- codegeex2
- gemini
- hf_internlm
- hf_llama
- internlm
- judge_llm
- auto_j
- judgelm
- pandalm
- lemur
- llama
- mistral
- mpt
- nanbeige
- others
- qwen
- rwkv
- tigerbot
- vicuna
- wizardcoder
- wizardlm
- zephyr
- subjective
- summarizers
- groups
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
648 files changed
+4879
-4891
lines changedOriginal file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
45 | 45 |
| |
46 | 46 |
| |
47 | 47 |
| |
48 |
| - | |
| 48 | + | |
49 | 49 |
| |
50 | 50 |
| |
51 | 51 |
| |
52 | 52 |
| |
53 | 53 |
| |
54 | 54 |
| |
55 | 55 |
| |
56 |
| - | |
| 56 | + | |
57 | 57 |
| |
58 | 58 |
| |
59 | 59 |
| |
60 |
| - | |
61 | 60 |
| |
62 | 61 |
| |
63 | 62 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
45 | 45 |
| |
46 | 46 |
| |
47 | 47 |
| |
48 |
| - | |
| 48 | + | |
49 | 49 |
| |
50 | 50 |
| |
51 | 51 |
| |
52 | 52 |
| |
53 | 53 |
| |
54 | 54 |
| |
55 | 55 |
| |
56 |
| - | |
| 56 | + | |
57 | 57 |
| |
58 | 58 |
| |
59 | 59 |
| |
60 |
| - | |
61 | 60 |
| |
62 | 61 |
| |
63 | 62 |
| |
|
0 commit comments