xcjs

xcjs@programming.dev · 17 days ago

Google was working on a feature that would do just that, but I can’t recall the name of it.

They backed down for now due to public outcry, but I expect they’re just biding their time.

xcjs@programming.dev · 23 days ago

Not with this announcement, but it was.

xcjs@programming.dev · 25 days ago

It depends on the model you run. Mistral, Gemma, or Phi are great for a majority of devices, even with CPU or integrated graphics inference.

xcjs@programming.dev · edit-2 1 month ago

Show me a music store I can purchase music from on my phone through an app, and I’ll purchase it.

xcjs@programming.dev · 2 months ago

We all mess up! I hope that helps - let me know if you see improvements!

xcjs@programming.dev · edit-2 2 months ago

I think there was a special process to get Nvidia working in WSL. Let me check… (I’m running natively on Linux, so my experience doing it with WSL is limited.)

https://docs.nvidia.com/cuda/wsl-user-guide/index.html - I’m sure you’ve followed this already, but according to this, it looks like you don’t want to install the Nvidia drivers, and only want to install the cuda-toolkit metapackage. I’d follow the instructions from that link closely.

You may also run into performance issues within WSL due to the virtual machine overhead.

xcjs@programming.dev · 2 months ago

Good luck! I’m definitely willing to spend a few minutes offering advice/double checking some configuration settings if things go awry again. Let me know how things go. :-)

xcjs@programming.dev · edit-2 2 months ago

It should be split between VRAM and regular RAM, at least if it’s a GGUF model. Maybe it’s not, and that’s what’s wrong?

xcjs@programming.dev · 2 months ago

Ok, so using my “older” 2070 Super, I was able to get a response from a 70B parameter model in 9-12 minutes. (Llama 3 in this case.)

I’m fairly certain that you’re using your CPU or having another issue. Would you like to try and debug your configuration together?

xcjs@programming.dev · 2 months ago

Unfortunately, I don’t expect it to remain free forever.

xcjs@programming.dev · 2 months ago

No offense intended, but are you sure it’s using your GPU? Twenty minutes is about how long my CPU-locked instance takes to run some 70B parameter models.

On my RTX 3060, I generally get responses in seconds.

xcjs@programming.dev · edit-2 2 months ago

It’s a W3C managed standard, but there are tons of behavior not spelled out in the specification that platforms can choose to impose.

The standard doesn’t impose a 500 character limit, but there’s nothing that says there can’t be a limit.

xcjs@programming.dev · 4 months ago

My go-to solution for this is the Android FolderSync app with an SFTP connection.

xcjs@programming.dev · 4 months ago

I mean, sysvinit was just a bunch of root-executed bash scripts. I’m not sure if systemd is really much worse.

xcjs@programming.dev · edit-2 4 months ago

Systemd was created to allow parallel initialization, which other init systems lacked. If you want proof that one processor core is slower than one + n, you don’t need to compare init systems to do that.

xcjs@programming.dev · 4 months ago

Correction: migrated to GitLab, but I don’t expect they’ll want to keep it there.

xcjs@programming.dev · 4 months ago

The Nuzu repository is already wiped.

xcjs@programming.dev · 4 months ago

I already left the Messages app after the Shortcut Bar annoyance. This would have been another death knell to me that the app is past its prime.

xcjs@programming.dev · edit-2 4 months ago

On Android, it moved SMS messages from the shared SMS store upon receipt and to Signal’s own database, which was more secure.

xcjs@programming.dev · 7 months ago

Of course!