Aller au contenu principal
Mur d'archive a casiers recevant des extraits papier flottant venant du haut du cadre

Obsidian Web Clipper: official plugin review for capturing the web in Markdown

Back to blog
Artificial Intelligence
Nicolas
10 min read
Mur d'archive a casiers recevant des extraits papier flottant venant du haut du cadre

Obsidian Web Clipper was released in a stable version just over a year ago, and the real question for an existing Obsidian user isn’t about marketing: should you stop using MarkDownload to switch to the official plugin, or stick with what you have?

This review covers three weeks of daily use on a reference vault for AI monitoring and technical shopping.

The goal: sort out what holds up in use from what still feels like version 1.x.

The article highlights the Defuddle + Turndown + Reader + Highlighter stack as a mini-ETL on the browser side, then delves into the LLM Interpreter mode and open bugs.

In short

  • Stack to know: Defuddle isolates content, Turndown produces Markdown, Reader serves as the extraction base, and Highlighter captures persistent passages.
  • Optional LLM Interpreter mode: Claude Haiku, Gemini Flash, GPT Mini, or a local model via Ollama, configured by template.
  • Conditional templates: triggered by URL pattern, variables {{title}}, {{content|markdown}}, {{schema:author.name}}, all saved in frontmatter.
  • When the Interpreter helps: short summary in frontmatter, auto tags, structured JSON extraction from a spec page or scientific article.
  • Assumed limits: PDFs not managed (issue #646), YouTube transcript depends on the open panel, possible conflict with Templater in the same folder.
  • Clear verdict: MarkDownload remains valid for simple use, Web Clipper wins when you want site-specific templates, schema.org, or LLM Interpreter.

The technical stack of Web Clipper

The official plugin relies on three technical components that operate in sequence in the browser.

Defuddle tackles the page’s DOM and isolates the main content block, discarding navigation, sidebars, and ad inserts.

The project has been integrated into the Clipper since version 1.1 and remains available standalone on github.com/kepano/defuddle for those who want to use it elsewhere.

The Firefox Android version released in March 2026 pushed Defuddle to 0.14, indicating that the library continues to evolve independently of the Clipper.

Turndown takes over and converts this cleaned HTML into Markdown.

It decides how to render a list, a table, a code block, or a footnote.

The quality of the final Markdown depends as much on Defuddle as on Turndown: if the extraction lets noise through, the Markdown rendering will contain it.

If Defuddle lets a sidebar through, Turndown will render it as a bullet list without flinching.

The Reader mode serves as a clean extraction base, like a Safari or Firefox Reader, but connected to Defuddle.

The Highlighter adds a layer on top: a cursor that becomes a highlighter, DOM blocks light up on hover like virtual post-its, and highlights persist on the page.

When clipping, the extension offers three scopes: entire article, free selection, or only recorded highlights.

Vertical web panel disassembling into structured text strips drifting down the frame

The daily workflow

The goal isn’t to clip more, it’s to clip smartly.

A well-tuned template is worth more than a raw dump, and that’s exactly what the official doc struggles to explain: Web Clipper acts as a mini-ETL on the browser side.

Defuddle extracts the main content, Turndown produces the Markdown, variables and filters inject metadata, and the note arrives in the vault with a frontmatter ready to query.

The note arrives in a dedicated folder, pre-filled frontmatter

The rule that changes everything daily: all clips go into a single folder, for example /clippings.

Each note inherits a consistent frontmatter: source, author, date, and a status: inbox field indicating it still needs sorting.

The clip itself remains raw, human sorting comes afterward.

Daily sorting of clips via Dataview

The Daily Note from Periodic Notes hosts a Dataview query listing the day’s clips, filtered on the status field equal to inbox.

In five minutes a day, you process your inbox and move useful clips to literature notes elsewhere in the vault.

This pattern brings the result closer to an Obsidian knowledge base assisted by LLM, where clipping becomes organized raw material rather than a dead library.

The Templater trap to know: if a Templater template is attached to the /clippings folder, it can overwrite the frontmatter set by Web Clipper when the file is created.

The Obsidian forum has a dedicated thread on this conflict, and the most stable solution is to separate the two: Web Clipper sets its frontmatter, Templater acts afterward, on a different folder or on manual action.

Conditional templates and Interpreter mode

The templates system of Web Clipper is what drives most new users to migrate from MarkDownload.

The principle: one template per site, triggered by a URL pattern.

The community repo github.com/obsidian-community/web-clipper-templates gathers ready-to-use models for YouTube, Reddit, Wikipedia, and IMDb.

Variables, filters, and a concrete IMDb example

A variable targets a field on the page and a filter formats it.

The most useful variables in the official doctrine are {{title}}, {{content|markdown}}, and the schema: family that draws from the page’s Schema.org markup.

The IMDb example illustrated by Stephan Miller sets a template triggered on the pattern imdb.com/title/, which retrieves {{schema:Movie:name}} for the movie title, {{schema:datePublished}} for the year, and a CSS selector for the cast.

You can do the same for Wikipedia pages, a scientific blog, or an Amazon product page with a few lines of JSON.

LLM Interpreter: providers, use cases, latency

The Interpreter adds an optional LLM call at the time of clipping, configured by template.

Supported providers include OpenAI, Anthropic, Gemini, OpenRouter, Ollama local, or any custom chat completions compatible endpoint.

The official doc recommends small models: Claude Haiku, Gemini Flash, Llama 3B or 8B via Ollama, OpenAI’s Mini series, faster and precise enough for this use.

Three cases where the Interpreter saves time: short summary in frontmatter, auto tags from content, and structured JSON extraction from a spec page.

Latency ranges from a few hundred milliseconds for Gemini Flash on a short page to over 30 seconds via a local model on a long article.

Ollama trap: its default context is 2048 tokens, and a slightly long page overflows without visible error, silently returning a poor result.

The workaround documented by the Obsidian team: OLLAMA_ORIGINS and the num_ctx parameter to expand at server launch, or trim the context with a slice:0,1000 type filter in the template.

Concrete limits of Obsidian Web Clipper

The main limitation remains PDFs opened directly in the browser viewer: the Clipper sees the viewer, not HTML, and produces an empty note.

The GitHub issue #646 tracks this behavior across several releases without a definitive fix.

The YouTube transcript works again since the February 2026 update of the YTS UI feed, provided the transcript panel is open on the page at the time of clipping.

If the panel is closed, the Clipper only retrieves the title and description.

The Clipper doesn’t guess, it sees what the browser sees at the moment of the click, and only that.

On Linux, the Wayland bug documented in 1.11.4 means the Clipper only sends the page title and the Obsidian window doesn’t regain focus, the official workaround is to temporarily switch to an X11 session.

Template triggers work in OR logic only, and the community is requesting an AND combination in issue #648 to condition a template on two URL patterns simultaneously.

The Templater + Web Clipper conflict mentioned earlier falls into the same category: two tools touching the frontmatter at the same time, without explicit coordination.

SPAs and dynamic feeds remain fragile: Twitter (X) and Reddit are managed by dedicated Extractors, but a recent unlisted SPA will return poor Markdown.

Hard paywalls are by definition out of reach: Web Clipper sees what the browser sees, no more.

Obsidian Web Clipper, verdict by profile

Three profiles, three clear decisions.

First profile: you’ve been using MarkDownload for two years and your workflow is solid.

Don’t switch just for the sake of it.

Switch if you want site-specific templates, schema.org, or the LLM Interpreter to automate tags and summaries.

Second profile: you’re starting with Obsidian today or your web capture workflow isn’t set yet.

Go for Web Clipper directly.

The functional gap with MarkDownload is too wide to justify a detour, and official support by the Obsidian team is a long-term advantage.

Third profile: Notion remains your main system and your Obsidian vault is secondary.

Stick with Notion Web Clipper for your Notion clips and keep Obsidian Web Clipper for content you want to host in local Markdown long-term.

A dual stack works well as long as the boundary is clear in your mind.

The detail that sways AI users this year: since late 2025, Web Clipper can save a ChatGPT or Claude conversation as an Obsidian note, via a GitHub release notes update.

The Embedded mode released in parallel opens Web Clipper as a sidebar within the page itself, making it possible to clip multiple tabs in a row.

Web Clipper crossed the threshold of a serious tool over a year ago, and 2026 confirms the trajectory with X and Reddit Extractors, Embedded mode, and LLM conversation export.

It doesn’t replace a note-taking system, it feeds the one you already have.

The best way to evaluate it at home: take a template from the community repo, connect it to a site you read every day, activate the Interpreter on this single template to generate tags, and observe for two weeks.

If you want to go further, the article on LLM-assisted monitoring published on Anthem Creation provides the framework to connect Web Clipper to a broader monitoring pipeline.

Obsidian Web Clipper finds its place exactly there, as a capture layer between the browser and a vault that already serves your thinking.

Frequently asked questions

What changes if I switch from MarkDownload to Web Clipper?

You gain site-specific conditional templates, access to Schema.org variables, and LLM Interpreter mode, in exchange for a learning curve on template JSON.

Does the Markdown remain clean on complex pages?

On a standard blog post, a Wikipedia page, or an IMDb page with Schema.org markup, the output is solid.

On SPAs not listed in the Extractors or pages rich in JavaScript, the Markdown may be poor or misaligned.

How does it work on YouTube in 2026?

The transcript is retrieved via Reader mode since the February 2026 update, provided the transcript panel is open in the tab at the time of clipping.

Is the LLM Interpreter essential?

No, it is strictly optional and disabled by default, many users do without it and stick to static templates.

Can I use Ollama locally for privacy?

Yes, the Interpreter supports Ollama, provided you launch the server with OLLAMA_ORIGINS allowing browser extensions and expand num_ctx for long pages.

Does Web Clipper work offline?

Extraction and clipping to the vault run locally, but the cloud LLM Interpreter requires an outgoing connection, unless you use Ollama.

How to avoid the Templater + Web Clipper conflict on frontmatter?

Separate roles: Web Clipper sets the frontmatter in /clippings, Templater acts elsewhere or on manual action after human sorting.

Which pages break and what to do?

PDFs opened in the browser (issue #646), SPAs not listed in the Extractors, hard paywalls, YouTube transcript if the panel is closed.

For these cases, switch to manual selection or clip from the cache version, or wait for an Extractors update.

How to connect Web Clipper to my Daily Note?

All clips go into /clippings with a status property equal to inbox, and a Dataview query placed in the Daily Note via Periodic Notes lists the day’s clips to sort.

If I’m on Notion, should I switch?

Stick with Notion Web Clipper for your Notion pages and add Obsidian Web Clipper only for content you want to archive in local Markdown long-term.

Related Articles

Ready to scale your business?

Anthem Creation supports you in your AI transformation

Disponibilité : 3 nouveaux projets pour Mai/Juin
Book a discovery call
Une question ?
✉️

Encore quelques questions ?

Laissez-moi votre email pour qu'on puisse continuer cette conversation. Promis, je garde ça précieusement (et je ne vous bombarderai pas de newsletters).

  • 💬 Accès illimité au chatbot
  • 🚀 Des réponses plus poussées
  • 🔐 Vos données restent entre nous
Cette réponse vous a-t-elle aidé ? Merci !