Description
Since the introduction of the dedicated X article extractor in Defuddle 0.7.0 (PR #124), X long-form articles that contain a banner/hero image but no inline images in the article body are clipped without any images.
Before vs After
|
Before 0.7.0 (generic parser) |
After 0.7.0 (x-article extractor) |
| Article text |
Noisy, included page clutter |
Clean, precise extraction ✅ |
| Inline images |
Captured |
Captured ✅ |
| Banner/hero images |
Captured (as part of generic extraction) |
Not captured ❌ |
og:image fallback |
N/A |
Returns generic X default (abs.twimg.com/rweb/ssr/default/v2/og/image.png) |
Steps to Reproduce
- Open an X long-form article that has a banner image but no inline images in the body, e.g.:
https://x.com/molt_cornelius/article/2023212245283397709
- Clip with Obsidian Web Clipper (using Defuddle 0.7.0+)
- The resulting Markdown contains no image references at all
Comparison
An X article with inline images (e.g. https://x.com/WorkflowWhisper/article/2020557627416023335) clipped before 0.7.0 correctly includes its images:
[](...)
The same type of content clipped after 0.7.0 from an article without inline images produces zero image references.
Expected Behavior
The X article extractor should capture the article's banner/hero image (if present) as a fallback when no inline images are found in the article body. This image is typically visible at the top of X articles and is distinct from the generic X og:image fallback.
Context
The Obsidian Web Clipper ecosystem relies on extracted images for:
- The Featured Image plugin (auto-generates thumbnails from the first image)
- Note list previews in navigation plugins
- Visual identification of clipped articles
Without any images, X article clippings are visually indistinguishable from each other in the vault.
Environment
- Defuddle: 0.7.0
- Obsidian Web Clipper: 1.0.0
- Browser: Chrome (latest)
Description
Since the introduction of the dedicated X article extractor in Defuddle 0.7.0 (PR #124), X long-form articles that contain a banner/hero image but no inline images in the article body are clipped without any images.
Before vs After
og:imagefallbackabs.twimg.com/rweb/ssr/default/v2/og/image.png)Steps to Reproduce
https://x.com/molt_cornelius/article/2023212245283397709Comparison
An X article with inline images (e.g.
https://x.com/WorkflowWhisper/article/2020557627416023335) clipped before 0.7.0 correctly includes its images:The same type of content clipped after 0.7.0 from an article without inline images produces zero image references.
Expected Behavior
The X article extractor should capture the article's banner/hero image (if present) as a fallback when no inline images are found in the article body. This image is typically visible at the top of X articles and is distinct from the generic X
og:imagefallback.Context
The Obsidian Web Clipper ecosystem relies on extracted images for:
Without any images, X article clippings are visually indistinguishable from each other in the vault.
Environment