[ICML 2026] HEARTS: Benchmarking LLM Reasoning on Health Time Series
Updated Jun 29, 2026 · Python
Live Deep Research Bench. A challenging, objective benchmark for deep research tasks.
Agentic AI refers to AI systems capable of autonomous decision-making, planning, and executing tasks based on goals—acting like intelligent agents. These systems combine LLMs with tools, memory, and feedback loops to complete complex workflows with minimal human input.
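As a rough illustration of the loop described above (plan, act with a tool, record the result, re-plan), here is a minimal sketch. It assumes a rule-based planner standing in for an LLM; the names `plan_next_step`, `TOOLS`, and `run_agent` are hypothetical and not taken from any listed repository.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical tools the agent may call; each maps a number to a new number.
TOOLS: Dict[str, Callable[[int], int]] = {
    "double": lambda x: x * 2,
    "increment": lambda x: x + 1,
}


def plan_next_step(current: int, target: int) -> Optional[str]:
    """Stand-in for an LLM planner: choose the next tool, or None when done."""
    if current == target:
        return None
    if current * 2 <= target:
        return "double"
    return "increment"


def run_agent(target: int, max_steps: int = 50) -> Tuple[int, List[Tuple[str, int]]]:
    """Feedback loop: plan, act, store the observation in memory, repeat."""
    current = 1
    memory: List[Tuple[str, int]] = []  # history of (action, observed result)
    for _ in range(max_steps):
        action = plan_next_step(current, target)
        if action is None:  # goal reached, stop acting
            break
        current = TOOLS[action](current)
        memory.append((action, current))
    return current, memory


result, memory = run_agent(10)
print(result, memory)
```

In a real system the planner would be a model call and the memory would be fed back into its prompt; the step limit is the simplest guard against a loop that never reaches its goal.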
A benchmark for evaluating advanced reasoning in language models and multi-agent systems.
A symbolic reasoning framework using Cognitive Motifs to build diverse, interpretable, and belief-driven generative agents.
Multi-agent AI reporting readiness certification system for the Microsoft Agents League Reasoning Agents track.
LangGraph is a powerful framework built on LangChain that enables the creation of stateful, multi-step, and agentic workflows using directed graphs. It simplifies complex LLM orchestration by allowing conditional branching, memory, and tool integrations in a visual and modular way.
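The idea of a stateful workflow expressed as a directed graph with conditional branching can be sketched in plain Python. This sketch deliberately does not use the LangGraph API; the node names, the `route` function, and the `END` sentinel are assumptions made for illustration only.

```python
from typing import Any, Callable, Dict

State = Dict[str, Any]
END = "__end__"  # hypothetical sentinel marking the end of the workflow


def draft(state: State) -> State:
    state["attempts"] = state.get("attempts", 0) + 1
    state["draft"] = f"draft v{state['attempts']}"
    return state


def review(state: State) -> State:
    # Toy rule: approve only from the second attempt onward.
    state["approved"] = state["attempts"] >= 2
    return state


def publish(state: State) -> State:
    state["output"] = state["draft"] + " (published)"
    return state


NODES: Dict[str, Callable[[State], State]] = {
    "draft": draft,
    "review": review,
    "publish": publish,
}


def route(node: str, state: State) -> str:
    """Edges of the graph; the edge leaving 'review' is conditional on state."""
    if node == "draft":
        return "review"
    if node == "review":
        return "publish" if state["approved"] else "draft"
    return END


def run_graph(start: str, state: State, max_steps: int = 20) -> State:
    node = start
    for _ in range(max_steps):
        if node == END:
            return state
        state = NODES[node](state)  # each node reads and updates shared state
        node = route(node, state)
    raise RuntimeError("workflow did not finish within max_steps")


final = run_graph("draft", {})
print(final["output"])
```

The shared `state` dictionary plays the role of memory across steps, and the conditional edge out of `review` is what lets the graph loop back to `draft` instead of running as a fixed linear chain.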
DELPHAI - multi-agent certification-readiness council on Microsoft Foundry (Agents League Battle #2). Real Foundry IQ on Azure AI Search, 11 hosted agents, GO/NEGOTIATE/NO-GO.
Seven-agent AI system for multi-scenario certification lab recovery, readiness insights, and safety-verified reports.
Multi-agent AI platform that turns a product idea into a client-ready blueprint — six agents plan, debate, vote and deliver PRD, architecture, backlog & roadmap. Powered by Microsoft Foundry IQ.
Five-agent certification readiness platform built on Microsoft Agent Framework, all three IQ layers (Foundry/Fabric/Work), and Microsoft Learn MCP — Agents League Hackathon.
Enterprise multi-agent system powered by Microsoft Azure AI Foundry (gpt-4.1-mini) - protects developers from burnout while optimizing certification upskilling paths. Built for Agents League Hackathon 2026, Reasoning Agents track.
Microsoft Foundry IQ-powered AI governance agent for BFSI, detecting Logic Drift and enforcing HOTL governance. Currently in prototype build phase.
AI-powered privacy policy analyzer built for Microsoft's Agents League Hackathon 2026 (Reasoning Agents track). 6 specialized AI agents extract, reason, detect dark patterns, score readability, audit user rights, and benchmark policies against TikTok, Facebook & more — covering GDPR/CCPA/PDPA/PIPEDA/LGPD/DPDPA. Built with Flask + Microsoft Foundry.
🧠 A Streamlit app that evaluates and visualizes reasoning trajectories of AI agents — built with Python and inspired by agentic AI workflows, reasoning analysis, and LLM evaluation.
Enterprise role-readiness agent on Microsoft Foundry. Pick a role, take a mock assessment, get a Foundry IQ grounded learning plan to close your skill gaps.
Docvoxia: Real-Time Multilingual Clinical Reasoning Agent for Safe Healthcare Documentation
The False Readiness Firewall: Microsoft Agent Framework + Fabric IQ prove semantic conflicts with deterministic SQL, quantify learner and budget impact, and gate canonical meaning to the human owner.
⚖️ AI-powered reasoning agent that diagnoses the root causes of judicial backlog in Indian courts and recommends data-driven interventions using public judiciary datasets.
Intercepts an AI agent's action before it runs, grounds it in cited precedent via Foundry IQ, and pauses for a human.