Skip to content

X Article Extractor does not capture article banner/hero images #129

@RKNST38

Description

@RKNST38

Description

Since the introduction of the dedicated X article extractor in Defuddle 0.7.0 (PR #124), X long-form articles that contain a banner/hero image but no inline images in the article body are clipped without any images.

Before vs After

Before 0.7.0 (generic parser) After 0.7.0 (x-article extractor)
Article text Noisy, included page clutter Clean, precise extraction ✅
Inline images Captured Captured ✅
Banner/hero images Captured (as part of generic extraction) Not captured
og:image fallback N/A Returns generic X default (abs.twimg.com/rweb/ssr/default/v2/og/image.png)

Steps to Reproduce

  1. Open an X long-form article that has a banner image but no inline images in the body, e.g.: https://x.com/molt_cornelius/article/2023212245283397709
  2. Clip with Obsidian Web Clipper (using Defuddle 0.7.0+)
  3. The resulting Markdown contains no image references at all

Comparison

An X article with inline images (e.g. https://x.com/WorkflowWhisper/article/2020557627416023335) clipped before 0.7.0 correctly includes its images:

[![Bild](https://pbs.twimg.com/media/HAp2bp6XgAAoy9A?format=jpg&name=large)](...)

The same type of content clipped after 0.7.0 from an article without inline images produces zero image references.

Expected Behavior

The X article extractor should capture the article's banner/hero image (if present) as a fallback when no inline images are found in the article body. This image is typically visible at the top of X articles and is distinct from the generic X og:image fallback.

Context

The Obsidian Web Clipper ecosystem relies on extracted images for:

  • The Featured Image plugin (auto-generates thumbnails from the first image)
  • Note list previews in navigation plugins
  • Visual identification of clipped articles

Without any images, X article clippings are visually indistinguishable from each other in the vault.

Environment

  • Defuddle: 0.7.0
  • Obsidian Web Clipper: 1.0.0
  • Browser: Chrome (latest)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions