I was amazed that we transitioned from one GPU heavy bubble (Crypto) to another (LLM/AI). Whilst the hype for crypto imploded the use for the hardware sort of didn’t. I wonder if the next bubble with be the same, or if we get some refreshing variety to our money sinks?
Microsoft et al are subsidizing GenAI to an insane degree. […] prices shoot up for their customers and serve as a rough awakening to all the websites that integrated a crappy chatbot.
I’ve run some much simpler chatbots on just my desktop PC, so they will have some fallback (if they really choose to take it). Still it locks up my entire computer for a few second for each reply, so even a few hundred users per second peak would be an expensive service.
(Insert joke here about customers not noticing or caring about the difference between website chatbots built on big company services vs smaller ones, because they have exactly the same problems just in different hues.)
A lot of phone modems ship with their own SoC (processor) running its own OS. It’s much smaller and slower than the main phone SoC but, depending on its implementation, it can have full access to all of your main processor’s memory through DMA.