Skip to content

Conversation

@cemde
Copy link
Collaborator

@cemde cemde commented Feb 11, 2026

Description

Type of Change

  • Bug fix (non-breaking change that fixes an issue)
  • New feature (non-breaking change that adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • Documentation update
  • Code quality improvement (refactoring, formatting, etc.)

Checklist

Contribution

Documentation

  • Added/updated docstrings for new/modified functions as instructed CONTRIBUTING.md
  • Updated relevant documentation in docs/ (if applicable)
  • Tag github issue with this PR (if applicable)

Changelog

  • Added entry to CHANGELOG.md under [Unreleased] section
    • Use Added section for new features
    • Use Changed section for modifications to existing functionality
    • Use Fixed section for bug fixes
    • Use Removed section for deprecated/removed features
  • OR this is a documentation-only change (no changelog needed)

Example:
- Support for multi-agent tracing (PR:#123)

Architecture (if applicable)

  • Core/Interface separation: Changes in maseval/core/ do NOT import from maseval/interface/
  • Dependencies: New core dependencies added sparingly; framework integrations go to optional dependencies

Additional Notes

@github-actions
Copy link

github-actions bot commented Feb 13, 2026

Coverage report

Click to see where and how coverage changed

FileStatementsMissingCoverageCoverage
(new stmts)
Lines missing
  maseval/benchmark/gaia2
  __init__.py
  data_loader.py 150-153, 165-203, 233-266
  environment.py 85-86, 198-199, 228-230, 238, 249-253, 264-268
  evaluator.py 162, 200, 213-216, 276, 295
  gaia2.py 327-329, 347-352, 431, 448-450, 478, 482-483, 572, 582-583, 637, 760-800, 814-821, 828-829, 839-840, 866-867, 876, 885-889, 899-911, 920, 950-954, 969, 984, 1027
  tool_wrapper.py 89-93, 117
  maseval/benchmark/multiagentbench
  _constants.py
  data_loader.py 140-147, 333-335
  environment.py 82, 109, 149-166
  evaluator.py 250, 644
  multiagentbench.py 485-486, 500-501
  maseval/benchmark/multiagentbench/adapters
  marble_adapter.py
  maseval/benchmark/tau2
  environment.py 157-159, 192-203, 225-246, 268-331, 342, 356-358
  evaluator.py
  tau2.py 544, 548, 550, 554, 557-560, 806-808, 816, 825-827, 830
  maseval/benchmark/tau2/domains/telecom
  db.py
  user_models.py
  user_tools.py 74, 77-78, 84, 165, 169, 176-183, 278, 307-308, 329, 708, 739-740, 831-832, 843-844, 919-920
Project Total  

This report was generated by python-coverage-comment-action

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant