The world of web browsers have not been spared by the trend of integrating LLM functionality. But there are fundamental issues with it and Vivaldi addresses them.
Edge is branding itself “The AI Browser”. Chrome has plans to embed LLMs for text input. Opera, the browser which was commandeered from the original Vivaldi team and turned into a crypto/VPN gimmick browser, is of course among the hardest leaning into the LLM trend.
I’d love a browser-embedded LLM that had access to the DOM.
“Highlight all passages that talk about yadda yadda. Remove all other content. Convert the dates to the ISO standard. Put them on a number line chart, labeled by blah.”
Arc has an LLM that lets you replace your search functionality with search or ask, where if you type a question it tries to answer it based on the content on the page. Kinda close to what you’re talking about.
Arc is genuinely trying to use LLMs in their browser in interesting ways.
That’s actually fascinating to think about. Would be a fun project to mash something like Blazor Server and an LLM together and allow users to just kindly ask to rewrite the DOM in plain English.
Hmm I don’t think it’s because of that feature, because it only runs when you explicitly ask it to translate a page for you. You should probably check your extensions, see if you have some redundant ones (a mistake people make is use multiple ad-blockers/anti-trackers, when just uBlock Origin + Firefox’s defaults are usually good enough).
Quick tool to summarize a page, proofread, or compare it to another source. Still needs a functioning human brain to separate the wheat from the chaff so to speak, but I could see a LLM (especially local) being useful in some ways.
I’m sure there are disabilities or unique use cases that could increase it’s usefulness, especially once they improve more.
Didn’t even think of it as a possibility. WTF would a browser need with LLM?
Edge is branding itself “The AI Browser”. Chrome has plans to embed LLMs for text input. Opera, the browser which was commandeered from the original Vivaldi team and turned into a crypto/VPN gimmick browser, is of course among the hardest leaning into the LLM trend.
Cheers DANNY BOOOOOOOYYY!
I’d love a browser-embedded LLM that had access to the DOM.
“Highlight all passages that talk about yadda yadda. Remove all other content. Convert the dates to the ISO standard. Put them on a number line chart, labeled by blah.”
That’d be great UX.
Arc has an LLM that lets you replace your search functionality with search or ask, where if you type a question it tries to answer it based on the content on the page. Kinda close to what you’re talking about.
Arc is genuinely trying to use LLMs in their browser in interesting ways.
That’s actually fascinating to think about. Would be a fun project to mash something like Blazor Server and an LLM together and allow users to just kindly ask to rewrite the DOM in plain English.
Local translation of text comes to mind.
Yup, Firefox has it: https://browser.mt/ (it’s now a native part of Firefox)
Hmm maybe this is why Firefox is so damn slow on my raspberry pi
Hmm I don’t think it’s because of that feature, because it only runs when you explicitly ask it to translate a page for you. You should probably check your extensions, see if you have some redundant ones (a mistake people make is use multiple ad-blockers/anti-trackers, when just uBlock Origin + Firefox’s defaults are usually good enough).
Vivaldi has had local translation for about half a year now. No need for LLM for this feature.
I was thinking more along the lines of communicating with a Klingon captain on a D7 Battlecruiser.
Quick tool to summarize a page, proofread, or compare it to another source. Still needs a functioning human brain to separate the wheat from the chaff so to speak, but I could see a LLM (especially local) being useful in some ways.
I’m sure there are disabilities or unique use cases that could increase it’s usefulness, especially once they improve more.