-
Notifications
You must be signed in to change notification settings - Fork 33
[feature][PR only for review]Support tau2 bench #192
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
SJTUyh
wants to merge
60
commits into
AISBench:master
Choose a base branch
from
SJTUyh:tau2_dev
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
60 commits
Select commit
Hold shift + click to select a range
fd584ea
adapt tau2 bench
0dc261c
Merge branch 'master_center' into tau2_dev
3b19d79
tau2 pass@k
5ede58d
tau2 summarizer
d607066
tau2 summarizer
7e5e0b9
tau2 summarizer
50c4472
ignore json load exception
f5c8569
fix json load
5066f44
add requirement
c1d7aaf
Apply suggestion from @gemini-code-assist[bot]
SJTUyh d1fee46
Apply suggestion from @gemini-code-assist[bot]
SJTUyh 7c98424
review fix
fa420e2
Merge branch 'tau2_dev' of https://github.com/SJTUyh/benchmark into t…
7ccb966
review fix
314ce7b
tau2 add tag limit
ed190fb
patch the user input
4f3e057
summarizer add total count
fbd59b3
summarizer add total count
93b4c13
summarizer add total count
100bbe3
summarizer add total count
e1062c2
fix weight
eba8f67
fix weight
97a7cb5
fix weight
72d83eb
tau2 fix
fb22a36
tau2 fix
d38a3fe
hat fix
2c0979c
add en docs
c5e7efe
add en docs
0510b46
merge fix
46bc121
merge fix
5f85793
merge fix
599f7b0
merge fix
a3661dd
merge fix
d88859f
merge fix
8f87172
merge fix
a4fdba1
add UT for tau2 bench
b5d420e
add UT for tau2 bench
f955871
add UT for tau2 bench
abbb924
add UT for tau2 bench
4dc4150
add UT for tau2 bench
0f4fd1f
add UT for tau2 bench
e645453
tau2 bench UT mock dependencies
38b4423
tau2 bench UT mock dependencies
b101d2e
tau2 bench UT mock dependencies
c1830fc
tau2 bench UT mock dependencies
4931123
tau2 bench UT mock dependencies
b3cab39
add new tau2 bench UT
7ad463e
add new tau2 bench UT
547e1ea
add new tau2 bench UT
d3a061d
add new tau2 bench UT
b2e78eb
add new tau2 bench UT
8624231
add new tau2 bench UT
06502d7
add new tau2 bench UT
1c8c061
add new tau2 bench UT
0f1b4e5
add new tau2 bench UT
9631cf5
add new tau2 bench UT
c777a76
add new tau2 bench UT
a38cba1
add new tau2 bench UT
7ae9a49
add new tau2 bench UT
f7ff051
delete unused dep
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Empty file.
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While adding a
Nonecheck is a good improvement for robustness, the function's return type hint-> stron line 18 is now incorrect because the function can returnNone. Please update the signature to-> Optional[str]to accurately reflect its behavior. You will also need to addfrom typing import Optionalat the top of the file.