Skip to content

[pull] main from open-compass:main#25

Open
pull[bot] wants to merge 160 commits intochanglan:mainfrom
open-compass:main
Open

[pull] main from open-compass:main#25
pull[bot] wants to merge 160 commits intochanglan:mainfrom
open-compass:main

Conversation

@pull
Copy link
Copy Markdown

@pull pull Bot commented May 22, 2025

See Commits and Changes for more details.


Created by pull[bot] (v2.0.0-alpha.1)

Can you help keep this open source service alive? 💖 Please sponsor : )

Myhs-phz and others added 27 commits May 27, 2025 19:41
* update

* fix lint

* fix lint

* update precommit

* update precommit

* fix lint
* 0-shot Smolinstruct

Add 0-shot evaluation and postprocess functions for Smolinstruct

* fix acc postprocessor

* update 0-shot acc postprocessor

* rename 0-shot
* 250527

* 250527

* 250527

* 0530

* 0530

* Update srbench_gen.py

* Update srbench.py

* srbench fix

* Update datasets_info.py

* Update datasets_info.py

---------

Co-authored-by: Myhs-phz <demarcia2014@126.com>
Co-authored-by: Linchen Xiao <xxllcc1993@gmail.com>
* add LLM as judge setting for matbench

* fix reference negative sample gold value missing error

* update import

* comments trim

* fix file and import naming

* matbench fix

* matbench fix

* matbench fix

---------

Co-authored-by: Jucheng Hu <jucheng.hu.20@ucl.ac.uk>
Co-authored-by: Myhs-phz <demarcia2014@126.com>
* add phybench

* phybench fix

* update

* update

---------

Co-authored-by: Myhs-phz <demarcia2014@126.com>
Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
* update

* update

* update
* add gaokao & competition benchmark

* fix lint

* chem_exam fix

---------

Co-authored-by: Myhs-phz <demarcia2014@126.com>
* add gaokao & competition benchmark

* fix lint

* chem_exam fix

* update verifier prompt

---------

Co-authored-by: Myhs-phz <demarcia2014@126.com>
* update earth silver benchmark

* fix new issues

* update

* update

---------

Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
* healthbench

* fix irrelevant files

* first

* fix bench

* fix bench

* fix bench

* fix soft link

* fix bench

* fix bench

* healthbench fix

* fix bench

* fix bench

* fix bench

* fix bench

* fix bench

* fix bench

* fix bench

* fix bench

* fix bench

---------

Co-authored-by: Myhs-phz <demarcia2014@126.com>
* update

* update

* update

* update

* update
* [Dataset] Add R-Bench (ICML 2025)

* fixed lint

* format rbench.py by isort

* rbench fix

* r-bench fix

* update

---------

Co-authored-by: leoyizhang <leoyizhang@tencent.com>
Co-authored-by: Myhs-phz <demarcia2014@126.com>
Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
* Fix PHYbench

* update

---------

Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
* 0616

* 0616

* 0616

* update

* update

* 0616

---------

Co-authored-by: MaiziXiao <xxllcc1993@gmail.com>
* update needlebench docs for chinese

* update bilingual needlebench docs

* update docs typo

* update docs

* update docs typo

* [Docs] fix needlebench examples

* Add NeedleBench_V2

* [Fix] Fix pre-commit

* remove choice version

* [Docs] Update NeedleBench Docs

* [Docs] update NeedleBenchV2 Docs

* [Docs] Update Default Settings for NeedleBench and ATC Configs

* [Fix] Fix precommit

* [Minor] fix needlebench summarizer groups

* [Minor] Update NeedleBenchV2 dataset-index
* timed re.search and _executor made global

* TimeOutError exception handling

* added missing blank lines

* isort import

---------

Co-authored-by: Francesco Bertolotti <francesco.bertolotti@igenius.ai>
* debug rjob runner

* optimize concurrent requests by adding max_workers

* update

---------

Co-authored-by: xujun <xujun@pjlab.org.cn>
* debug rjob runner

* optimize concurrent requests by adding max_workers

* update

* optimize the max_workers for OpenAISDK

* optimize the max_workers for OpenAISDK

* optimize the max_workers for OpenAISDK

* optimize the max_workers for OpenAISDK

* Update openai_api.py

---------

Co-authored-by: xujun <xujun@pjlab.org.cn>
Co-authored-by: nic <nic@yccc.follower>
Co-authored-by: zhulinJulia24 <145004780+zhulinJulia24@users.noreply.github.com>
(Warning) This PR may introduce performance BC for ChemExam benchmark.
Myhs-phz and others added 30 commits February 5, 2026 18:17
* Fix extract_role_pred to properly strip whitespace and use explicit None checks

* update

* fix lint

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update test_base_task.py

* Update test_base_task.py
* add models unittest

* add more unit testcase

* refactor ete testcase

* update

* update

* fix unfail import

* update mount

* update url

* fix import

* update

* update

* Revert to relative imports with __init__.py support

The __init__.py files in autotest and autotest/infer packages enable relative imports to work correctly even in mmengine's lazy loading context. This is more robust than absolute imports which can fail when sys.path doesn't include the opencompass root.

* updaste

* Add proper __init__.py files to prevent namespace package issues

Add docstrings and __all__ to make autotest a proper package instead of namespace package

* add init

* Fix TypeError: LazyObject is not iterable

Remove filtering of datasets at module level as it tries to iterate LazyObject before it's built by mmengine. Simply assign datasets = datasets to preserve the lazy loading behavior.

* fix import

* update

* update

* fix eval path

* fix compare

* update import

* update

* update

* update

* update

* update configs

* update

* update config

* update

* update

* update

* update

* upda

* fix

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update path

* update
* add

* fix

* fix

* add

* fix

* fix

* add unitest

* delete

* add

* fix

* add all dataset config
* [Refactor] Support concurrent inference accorss tasks.

* Rename inferencer and fix openai streaming sdk

* fix lint

* [Fix] Add parallel ChatML inferencer and JSONL checkpoints

* fix lint

* Fix unit tests

* add more ut

---------

Co-authored-by: zhulin1 <zhulinJulia24@163.com>
* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update
* add molecular_iq evaluation

* fix molecular_core

* fix molecular_core

* update

* Delete opencompass/datasets/moleculariq/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/_data/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/_dynamic/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/_nlp/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/_pools/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/rewards/__pycache__ directory

* Delete opencompass/datasets/moleculariq/moleculariq_core/solver/__pycache__ directory

* Add error handling for RDKit imports

Wrap RDKit imports in a try-except block to handle potential import errors.

* Add error handling for RDKit import

Handle import error for RDKit library gracefully.

* Update template_based_reaction_solver.py

* Refactor SymbolicSolver class and bond patterns

* Refactor RDKit import exception handling

Moved exception handling for RDKit imports to the end and added initialization for SymbolicSolver class.

* Add error handling for RDKit imports

Handle import errors for RDKit library gracefully.

---------

Co-authored-by: Myhs_phz <demarcia2014@126.com>
* fix

* fix unit test

* fix lint

* fix lint
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.