feat: add request timeout to load_web_page by cchinchilla-dev · Pull Request #4887 · google/adk-python

cchinchilla-dev · 2026-03-18T20:31:11Z

Link to Issue or Description of Change

Update — 2026-04-30

After merging the latest upstream/main, the SSRF rewrite already covers URL-scheme validation and routes every fetch failure through a unified Failed to fetch url message. The unique contribution of this PR is now the request timeout; the body and tests below have been updated to reflect the post-merge scope. No code added by this PR duplicates what upstream already provides.

Problem

load_web_page() calls requests.get() without a timeout. If the target server is unresponsive, the agent hangs indefinitely.

Solution

Add timeout=_DEFAULT_TIMEOUT_SECONDS (10 seconds) to both HTTP entry points in the module:

requests.get in _fetch_response (proxy path).
session.get in _fetch_direct_response (pinned-IP path).

Extend the except in load_web_page to also catch requests.RequestException, so timeout and connection errors return the standard Failed to fetch url: {url} message instead of propagating.

Design note: the timeout is a module-level constant rather than a function parameter to keep it out of the LLM function-calling schema. It can be overridden via load_web_page._DEFAULT_TIMEOUT_SECONDS = 30 if needed.

Testing Plan

Unit Tests

Added/updated unit tests.
All unit tests pass locally (pytest tests/unittests/tools/test_load_web_page.py → 10 passed).

New/updated tests in tests/unittests/tools/test_load_web_page.py:

test_load_web_page_uses_proxy_for_unresolved_public_hostnames — updated to verify timeout=10 is forwarded on the proxy path.
test_load_web_page_passes_timeout_to_pinned_session — verifies the timeout reaches the pinned-IP session.
test_load_web_page_passes_timeout_to_proxied_get — verifies the timeout is forwarded when a proxy is configured.
test_load_web_page_returns_failure_on_timeout — verifies requests.exceptions.Timeout is converted into Failed to fetch url.

Manual E2E

N/A — internal hardening; function signature unchanged.

Checklist

I have read the CONTRIBUTING.md document.
I have performed a self-review of my own code.
I have added tests that prove my fix is effective.
New and existing unit tests pass locally with my changes.

Additional Context

This complements the existing SSRF protection (allow_redirects=False, hostname/IP validation, pinned-IP adapter) already present in the module after upstream/main was merged.

gemini-code-assist · 2026-03-18T20:31:17Z

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

rohityan · 2026-03-20T18:31:53Z

Hi @cchinchilla-dev , Thank you for your contribution! We appreciate you taking the time to submit this pull request. Your PR has been received by the team and is currently under review. We will provide feedback as soon as we have an update to share.

rohityan · 2026-03-20T18:32:07Z

Hi @Jacksunwei , can you please review this

cchinchilla-dev · 2026-04-17T09:14:41Z

@rohityan, @Jacksunwei — Just following up to see if anything else is needed from my end. Happy to make any adjustments.

AbhishekMauryaGEEK · 2026-04-23T12:09:59Z

I independently reproduced this issue and can confirm the behavior described. Tested on Windows with Python 3.11 against three failure modes — non-routable IP, invalid DNS, and a slow endpoint. All three result in unhandled exceptions with connect timeout=None confirmed in the traceback. Happy to share full reproduction logs if helpful for the review.

…timeout-and-url-validation # Conflicts: # src/google/adk/tools/load_web_page.py # tests/unittests/tools/test_load_web_page.py

Merge #4887 ## Link to Issue or Description of Change Closes #4886 ## Update — 2026-04-30 After merging the latest `upstream/main`, the SSRF rewrite already covers URL-scheme validation and routes every fetch failure through a unified `Failed to fetch url` message. The unique contribution of this PR is now the request **timeout**; the body and tests below have been updated to reflect the post-merge scope. No code added by this PR duplicates what upstream already provides. ## Problem `load_web_page()` calls `requests.get()` without a `timeout`. If the target server is unresponsive, the agent hangs indefinitely. ## Solution Add `timeout=_DEFAULT_TIMEOUT_SECONDS` (30 seconds) to both HTTP entry points in the module: - `requests.get` in `_fetch_response` (proxy path). - `session.get` in `_fetch_direct_response` (pinned-IP path). Extend the `except` in `load_web_page` to also catch `requests.RequestException`, so timeout and connection errors return the standard `Failed to fetch url: {url}` message instead of propagating. **Design note:** the timeout is a module-level constant rather than a function parameter to keep it out of the LLM function-calling schema. It can be overridden via `load_web_page._DEFAULT_TIMEOUT_SECONDS = 30` if needed. ## Testing Plan ### Unit Tests - [x] Added/updated unit tests. - [x] All unit tests pass locally (`pytest tests/unittests/tools/test_load_web_page.py` → 10 passed). New/updated tests in `tests/unittests/tools/test_load_web_page.py`: - `test_load_web_page_uses_proxy_for_unresolved_public_hostnames` — updated to verify `timeout=10` is forwarded on the proxy path. - `test_load_web_page_passes_timeout_to_pinned_session` — verifies the timeout reaches the pinned-IP session. - `test_load_web_page_passes_timeout_to_proxied_get` — verifies the timeout is forwarded when a proxy is configured. - `test_load_web_page_returns_failure_on_timeout` — verifies `requests.exceptions.Timeout` is converted into `Failed to fetch url`. ### Manual E2E N/A — internal hardening; function signature unchanged. ## Checklist - [x] I have read the CONTRIBUTING.md document. - [x] I have performed a self-review of my own code. - [x] I have added tests that prove my fix is effective. - [x] New and existing unit tests pass locally with my changes. ## Additional Context This complements the existing SSRF protection (`allow_redirects=False`, hostname/IP validation, pinned-IP adapter) already present in the module after upstream/main was merged. Co-authored-by: Bo Yang <ybo@google.com> COPYBARA_INTEGRATE_REVIEW=#4887 from cchinchilla-dev:feat/load-web-page-timeout-and-url-validation 4bd4799 PiperOrigin-RevId: 930335977

adk-bot · 2026-06-11T07:22:08Z

Thank you @cchinchilla-dev for your contribution! 🎉

Your changes have been successfully imported and merged via Copybara in commit 792775f.

Closing this PR as the changes are now in the main branch.

feat: add timeout and URL scheme validation to load_web_page

5139378

Merge branch 'main' into feat/load-web-page-timeout-and-url-validation

a8adb9c

adk-bot added the tools [Component] This issue is related to tools label Mar 18, 2026

cchinchilla-dev mentioned this pull request Mar 19, 2026

feat: add timeout and URL scheme validation to load_web_page #4888

Closed

rohityan self-assigned this Mar 19, 2026

Merge branch 'main' into feat/load-web-page-timeout-and-url-validation

384ed99

rohityan added the needs review [Status] The PR/issue is awaiting review from the maintainer label Mar 20, 2026

Merge remote-tracking branch 'upstream/main' into feat/load-web-page-…

6d3ed31

…timeout-and-url-validation # Conflicts: # src/google/adk/tools/load_web_page.py # tests/unittests/tools/test_load_web_page.py

cchinchilla-dev changed the title ~~feat: add timeout and URL scheme validation to load_web_page~~ feat: add request timeout to load_web_page Apr 30, 2026

Merge branch 'main' into feat/load-web-page-timeout-and-url-validation

0ef75cd

cchinchilla-dev mentioned this pull request Apr 30, 2026

feat: add timeout and URL scheme validation to load_web_page #4886

Closed

cchinchilla-dev added 2 commits May 2, 2026 13:47

Merge branch 'main' into feat/load-web-page-timeout-and-url-validation

024752a

Merge branch 'main' into feat/load-web-page-timeout-and-url-validation

4bd4799

adk-bot added the merged [Status] This PR is merged label Jun 11, 2026

adk-bot closed this Jun 11, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add request timeout to load_web_page#4887

feat: add request timeout to load_web_page#4887
cchinchilla-dev wants to merge 7 commits into
google:mainfrom
cchinchilla-dev:feat/load-web-page-timeout-and-url-validation

cchinchilla-dev commented Mar 18, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot commented Mar 18, 2026

Uh oh!

rohityan commented Mar 20, 2026

Uh oh!

rohityan commented Mar 20, 2026

Uh oh!

cchinchilla-dev commented Apr 17, 2026

Uh oh!

AbhishekMauryaGEEK commented Apr 23, 2026

Uh oh!

adk-bot commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

cchinchilla-dev commented Mar 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Link to Issue or Description of Change

Update — 2026-04-30

Problem

Solution

Testing Plan

Unit Tests

Manual E2E

Checklist

Additional Context

Uh oh!

gemini-code-assist Bot commented Mar 18, 2026

Uh oh!

rohityan commented Mar 20, 2026

Uh oh!

rohityan commented Mar 20, 2026

Uh oh!

cchinchilla-dev commented Apr 17, 2026

Uh oh!

AbhishekMauryaGEEK commented Apr 23, 2026

Uh oh!

adk-bot commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cchinchilla-dev commented Mar 18, 2026 •

edited

Loading