Why don't we just release a basic GPU with 128GB RAM and eat NVidia's local generative AI lunch?
The networking effect of all devs porting their LLMs etc. to that card would instantly put them as a major CUDA threat.
But finance folks running the show would never get such an idea...