Item 43907000

jl6 • 3 days ago

IIRC, Clippy’s most famous feature was interrupting you to offer advice. The advice was usually basic/useless/annoying, hence Clippy’s reputation, but a powerful LLM could actually make the original concept work. It would not be simply a chatbot that responds to text, but rather would observe your screen, understand it through a vision model, and give appropriate advice. Things like “did you know there’s an easier way to do what you’re doing”. I don’t think the necessary trust exists yet to do this using public LLM APIs, nor does the hardware to do it locally, but crack either of those and I could see ClipGPT being genuinely useful.

PaulHoule • 3 days ago

The way I remember it, a lot of software had "help" documentation with full text search in the late 1980s and early 1990s, but the common denominator was that it didn't work, in the sense that you got useful answers less than 10% of the time. Until Google came along, users were trained to avoid full text search facilities.

The full text facility attached to Clippy really was helpful, getting useful answers around 50% of the time. I thought the whole point of making him an engaging cartoon character was to overcome the prejudice mid-1990s users had towards full-text search in help.

freedomben • 3 days ago

It looks like you're writing a letter.

Would you like help?

* Get help with writing the letter

* Just type the letter without help

|_| Don't show me this tip again

jaza • 3 days ago

It looks like you're one of the 1% of humans who still write letters themselves! Dear me, imagine that, what do you think this is, the 90s or something?! Would you like to join the other 99% of humans and doomscroll and shytpost while I write that letter for you?

mock-possum • 2 days ago

Always reminds me of this short - https://youtu.be/86KduX_HaPs

vunderba • 3 days ago

We are probably getting closer to that with the newer multimodal LLMs, but you'd almost need to take screenshots at intervals and feed them directly to the LLM, giving it a sort of chronological context to help it understand what the user is trying to do and gauge the user's intentions.

As you say though, I don't know how many people would be comfortable having screenshots of their computer sent arbitrarily to a non-local LLM.
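
A rough sketch of the "screenshots at intervals" idea, in case it helps make it concrete. Everything here is hypothetical: `ScreenContext`, `Frame`, and the `capture` callable are made-up names, and the actual screen grab and the call to a vision model are deliberately left out. The only thing it shows is keeping a small rolling window of timestamped frames so the model would see the last few in chronological order.

```python
import time
from collections import deque
from dataclasses import dataclass
from typing import Callable, Deque, List, Optional


@dataclass
class Frame:
    taken_at: float  # Unix timestamp of the capture
    image: bytes     # raw screenshot bytes; the format is up to the capture function


class ScreenContext:
    """Keeps the most recent screenshots, oldest first (hypothetical helper)."""

    def __init__(self, capture: Callable[[], bytes], max_frames: int = 5):
        # `capture` is an assumption: any function that returns one screenshot.
        self._capture = capture
        # A bounded deque silently drops the oldest frame once it is full.
        self._frames: Deque[Frame] = deque(maxlen=max_frames)

    def tick(self, now: Optional[float] = None) -> None:
        """Take one screenshot and append it to the rolling window."""
        taken_at = time.time() if now is None else now
        self._frames.append(Frame(taken_at, self._capture()))

    def window(self) -> List[Frame]:
        """Return the retained frames in chronological order."""
        return list(self._frames)
```

You would call `tick()` from a timer every N seconds and hand `window()` to whatever model you trust with it, which is of course the hard part.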

nrmitchi • 3 days ago

> As you say though, I don't know how many people would be comfortable having screenshots of their computer sent arbitrarily to a non-local LLM.

Of the technical, hang-out-on-HN crowd? Ya, probably not many.

Of the other 99.99% of computer users? The majority of them wouldn't even think about it, let alone care. To quote a phrase, ”the user is going to pick dancing pigs over security every time”.

Even without the nonchalant attitude towards security, the majority of the population has been so conditioned to assume that everything they do on a computer is already being sent to 1) Apple, 2) Google, 3) Microsoft, or 4) their employer that they're burnt out on caring.

All that is to say that if you can make a widely-available real-time LLM assistant that appeals to non-technical users, please invite me to your private-island-celebrity-filled-yacht-parties.

walrus01 • 3 days ago

I think we're well into the paradigm of "hidden employee activity monitoring software" already taking periodic screenshots and sending them to an LLM somewhere, which then generates aggregate performance metrics and dashboards for managers. I've heard of multiple companies working on this for $bigcorp environments, customer service/call center workstation PCs, etc.

pr337h4m • 2 days ago

Models with native video understanding would do the trick - Advanced Voice Mode on the ChatGPT iOS/Android app lets you use your camera, works pretty well; there's also https://aistudio.google.com/live (AFAIK there are no open-source models with similar capabilities)

johnisgood • 3 days ago

> I don't know how many people would be comfortable having screenshots of their computer sent arbitrarily to a non-local LLM

shudders.

Henchman21 • 3 days ago

So, the Recall feature being slowly rolled out in Win11?

rossant • 3 days ago

Even funnier would be to make it unnecessarily mean and vexing.

Wait, are you really looking this up? You don't even know how to do this? Are you kidding me?

Gosh, it's been an hour and you still haven't fixed this bug? Are you retarded or something? You don't deserve this job.

jahewson • 3 days ago

I already have a little voice in my head that tells me those things!

That said, if we could automate it, it might free up more of my brain for productivity…

spauldo • 2 days ago

You might look into Vigor, a mean-spirited version of Clippy for the vi editor.

GoblinSlayer • 3 days ago

>and give appropriate advice

"It's time to work, Dave"

Henchman21 • 3 days ago

I’m sorry, I can’t do that, Hal

6510 • 3 days ago

It can still be annoying; I feel it is part of his personality.

It looks like you are writing a comment on Hacker News.

Would you like help with:

- Commas? There shouldn't be one behind "responds to text"

- Capitalization? You've missed a D in "did you know..."

- Punctuation? You've missed a question mark behind "what you’re doing". It goes inside the quotes, of course!

[] Don't ever suggest anything like this ever again.

hbn • 3 days ago

Microsoft is infamously adding AI to Windows that constantly watches your screen, and people understandably are not super excited about it.

basch • 3 days ago

I personally can’t wait to ask to recall something I saw before but can’t quite remember where.

Pretty soon I won’t even need biological memory.

kurisufag • 3 days ago

i added a minutely scrot cronjob about a year ago and haven't used it once. remembering "that website i was on last week" is apparently not a real problem I was having

jayGlow • 3 days ago

If it ran entirely on the local machine and didn't send information back to Microsoft, I think people would be far more accepting of it.

TiredOfLife • 2 days ago

That's exactly what Recall was and is.

spauldo • 2 days ago

You missed the "for now" at the end of that sentence.

trinix912 • 2 days ago

> Things like “did you know there’s an easier way to do what you’re doing”

That could come off just as patronizing as the original Clippy. If it said things like "Would you like me to generate you a letter for X?" it would be miles ahead of the original.