I've had excellent luck using Claude Code to generate "mermaid diagrams" for me, and convert them to .png format headlessly using mmdc/puppeteer. Really helped me out with an engineering proposal I just finished. In past years I would have fumbled around with Visio forever and the result would have been worse.
ascorbic 7 hours ago [-]
I find Mermaid diagram rendering is quite ugly by default. I've gotten much better-looking results by asking it to just generate SVGs. As a bonus it can do animations too. e.g. see slide 3 here, which I first tried with Mermaid and then switched to SVG when I couldn't get the rendering to look good: https://talks.mk.gg/2026/atmosphereconf/
d4rkp4ttern 4 hours ago [-]
I now often have CC make technical/architecture diagrams with tikz, the results look much better than mermaid but still requires multiple iterations to fix bad arrows, bad layouts etc.
Diagrams are still far from solved. We need a good non-gameable diagrams benchmark.
0xchamin 3 hours ago [-]
I use Claude Code and Gemini and to LLM as a judge among the two to review each others result and generate a final mermiad diagram.
thawab 8 hours ago [-]
You can copy past the mermaid to excalidraw to visualize it.
system_operator 17 hours ago [-]
do the same.
I just ask Claude code for mermaid to visualize any topic I'm discussing.
lxgr 4 hours ago [-]
In a pinch, Claude is also quite good at ASCII art in my experience!
halJordan 19 hours ago [-]
And yet people here insist that the height of an llm is not being above to draw a pelican or count letters in a word
wongarsu 13 hours ago [-]
Well, mermaid diagrams are "just" a list of nodes and their relations. You'd expect any llm capable of generating code to be able to generate them
Writing an SVG of a pelican riding a bicycle without being able to see the result and iterate based on that is incredibly difficult by comparison. I'm sure some humans could do it, but I sure can't. That's part of the beauty of it: it's very difficult to do but a toddler could judge the results
Writing an SVG of a diagram by hand would be somewhere on the middle ground. Or depending on the number of nodes might be even harder than the pelican. Layouting diagrams can get tricky very quickly. It's also one of Mermaid's biggest weaknesses
christkv 18 hours ago [-]
Just wait if they go public. Claude 5.4 fails the Pelican test stock sheds 20% of value pf news. Wall street wonders if the lack of front wheel means there is something seriously wrong with the stocks underlying value
userbinator 17 hours ago [-]
Am I correct in interpreting the title to mean that visiting the page will result in a 3.1GB download?
avrionov 13 hours ago [-]
Yes. I tested it. It downloaded 3.1GB
userbinator 8 hours ago [-]
Given what the page does, that's not a surprising amount, but consider it would take over 5 days to download on a 56k dialup connection, and even at 100Mbps that's over 4 minutes.
walthamstow 19 hours ago [-]
The Gemma models really are amazing. I was on a flight a few days ago and used E2B to do some basic research on the place I was going to, running the model locally on my Pixel 10 Pro. It gave me basically the same as Gemini or ChatGPT would do when I landed
tredre3 18 hours ago [-]
> It gave me basically the same as Gemini or ChatGPT would do
I'm very surprised by this, in my tests E2B has very limited general knowledge.
walthamstow 18 hours ago [-]
It was genuinely outstanding for a tiny local model. I'm in Marrakech and it gave 3 separate one-day itineraries that contained most of the same stuff I got from Gemini when I landed. I followed up to ask specifically about souks (markers/bazaars) and it listed the main ones and what types of products you can get from each.
avadodin 16 hours ago [-]
I haven't tried E2B but E4B isn't particularly better than the old Gemma3 4B model(which was already very good at multilingual and decent at other tasks) but the voice recognition is a nice addition.
maxlegav 3 hours ago [-]
Do you see a real difference with AI agents working together
alwyn 19 hours ago [-]
May I ask what setup you used to run it on your phone and if it's satisfactory (it sounds like it)?
walthamstow 18 hours ago [-]
Edge Gallery, which is a bloody terrible name for an app by the way
OsamaJaber 19 hours ago [-]
Small models in the browser are a different optimization problem than small models on a server.
On server you chase throughput so you batch. In browser you're stuck at batch size 1, which means kernel launch overhead and memory bandwidth dominate, not FLOPs
0xchamin 3 hours ago [-]
very interesting. Prompt to "generate System Design like ChatGPT end to end". Took about ~3-5 mins the model to download. It generated a reasonably good system design.
billyp-rva 18 hours ago [-]
> "OAuth 2.0 authorization code flow with PKCE as a sequence diagram — user, browser, app server, auth server, API"
If you do a Google image search for "OAuth 2.0 PKCE sequence diagram" you get good results also. Maybe if you ask for something more esoteric this becomes valuable? Of course, that also makes hallucinations more likely.
Sathwickp 20 hours ago [-]
just tried it out, must say it's amazing the speed at which it generates these diagrams
Is this opensource by any chance?
Would love to take a look at the code and understand how it works
rahimnathwani 21 hours ago [-]
How does this part work?
"The LLM outputs compact code (~50 tokens) instead of raw Excalidraw JSON (~5,000 tokens)."
I see on the left that the LLM is outputting some instructions to add nodes and edges to the diagram. But what is interpreting those commands and turning them into an Excalidraw file?
evrydayhustling 19 hours ago [-]
had the same question! looks like it's another project called Drawmode[1] from the same group...
Really interesting, I wish I could understand the under the hood better but I guess I don't have all the background needed.
xnx 16 hours ago [-]
It seems like Gemma should replace Gemini Nano as the AI built into Chrome.
logicallee 23 hours ago [-]
I love this idea. Unfortunately, it says "Unsupported browser/GPU" for me. This is Desktop Chrome version 147 (page says it requires 134+) and I have a 1060 card with 6 GB of RAM on this specific device, so it should fit. I have more than 4 GB of free RAM as well.
That's amazing. Very good result. Thanks for sharing.
busssard 15 hours ago [-]
no firefox? sad :(
agent37 22 hours ago [-]
Very cool. Did you happen to try other models like Qwen and was there a difference as opposed to Gemma ?
hhthrowaway1230 1 days ago [-]
so multiple of these browser wasm demos make me re-download the models, can someone make a cdn for it or some sort u uberfast downloader? just throw some claude credits against it ty!
wereHamster 1 days ago [-]
CDN wouldn't help much. These days browsers partition caches by origin, so if two different tools (running on different domains) fetch the same model from the CDN, the browser would download it twice.
cjbgkagh 21 hours ago [-]
Did not know that. That sounds extraordinary wasteful, there must be a file hash based method that would allow sharing such files between domains.
faangguyindia 20 hours ago [-]
It offers security.
Just like you wouldn't use same table in your system for all users in a multi tenant application.
cjbgkagh 20 hours ago [-]
If the file is hashed strongly enough then it can be no other file. I can see how information on previous sites visited can be leaked and how this could be bad but I think whitelisting by end users could still allow some files to be used. E.g. the code for react.
stavros 2 hours ago [-]
The fact that you don't see it doesn't mean it doesn't exist. I make up a unique file, put it on site X and ask your browser to cache it. I try to load the same file on site Y and time how long it takes. If it's instant, site Y knows you visited site X.
Tadaaa! Tracking.
cjbgkagh 2 hours ago [-]
I said I ‘can see’ I already understand that. Hence the whitelisting on files that are not unique / created for this purpose.
stavros 2 hours ago [-]
Ah, my bad, sorry.
thornewolf 21 hours ago [-]
it's a security feature. otherwise my malicious site could check for cdn.sensitivephotoswebsite.com and blackmail you if it was cached already
cjbgkagh 21 hours ago [-]
It would be nice if there was a whitelist option for non-sensitive content. I stopped using cdn links due to the overhead of the extra domain lookups but I did think that my self hosted content would be cached across domains.
onion2k 6 hours ago [-]
It would be nice if there was a whitelist option for non-sensitive content.
There's no such thing as non-sensitive content from a CDN though. Scripts are obviously sensitive, styles can be used to exfiltrate data through background-url directives, and anything like images has no benefit being cached across sites.
Fonts might be one exception, but I bet those are exploitable somehow.
pyrolistical 17 hours ago [-]
Seem like a solvable problem. Per origin cache control. But actually just load the data locally
embedding-shape 24 hours ago [-]
Adding a file input where users can upload files to the frontend directly from their file manager would probably work as a stop-gap measure, for the ones who want something quick that let people manage their own "cache" of model files.
logicallee 21 hours ago [-]
>can someone make a cdn for it or some sort u uberfast downloader? just throw some claude credits against it ty!
Okay, I did so. I realize that in your later followup comment you might want something different (like for Chrome itself to cache these downloads or something) but for now I made what you asked for, here you go:
It's an ultrafast temporary CDN for one-off experiments like this. Should be lightning fast. By including the script, you can include any file this CDN serves.
hhthrowaway1230 1 hours ago [-]
haha this is awesome! this is fantastic.
logicallee 23 hours ago [-]
Would you be okay with it using your upload at the same time, then a p2p model would work. (This is potentially a good match for p2p because edge connections are very fast, they don't have to go across the whole Internet). You could be downloading from uploaders in your region. Let me know if you would be okay with uploading at the same time, then this model works and I can build it for you for people to use this way.
Rekindle8090 1 days ago [-]
What? downloaded for me at 2gbps
hhthrowaway1230 1 days ago [-]
Ah let me clarify, many of the in the browser demos make me download certain models even if I already have them It would be great if there was a way that I don't have to redownload them across demos so that I just have a cache. or an in browser model manager. hope this makes sense.
Or indeed use some sort of huggingface model downloader (if that exist with XET)
varun_ch 24 hours ago [-]
I think this would sit best at the browser level. I’m not sure there’s a nice way for multiple websites to share a cache like that.
hhthrowaway1230 1 days ago [-]
also maybe a good usecase to finally have P2P web torrents :)
hhthrowaway1230 1 days ago [-]
Yeah that's great but I'm in a cafe outside burning my phone data. ty!
COOLmanYT 1 days ago [-]
no firefox support?
teamchong 23 hours ago [-]
firefox has webgpu already, but the subgroups extension isn't in yet. every matmul / softmax kernel here leans on subgroupShuffleXor for reductions, that's the blocker. same reason mlc webllm and friends don't run on firefox either. once mozilla ships it this should work
zhangchen 13 hours ago [-]
[dead]
nigardev 1 days ago [-]
[dead]
Rendered at 14:09:51 GMT+0000 (Coordinated Universal Time) with Vercel.
Diagrams are still far from solved. We need a good non-gameable diagrams benchmark.
I just ask Claude code for mermaid to visualize any topic I'm discussing.
Writing an SVG of a pelican riding a bicycle without being able to see the result and iterate based on that is incredibly difficult by comparison. I'm sure some humans could do it, but I sure can't. That's part of the beauty of it: it's very difficult to do but a toddler could judge the results
Writing an SVG of a diagram by hand would be somewhere on the middle ground. Or depending on the number of nodes might be even harder than the pelican. Layouting diagrams can get tricky very quickly. It's also one of Mermaid's biggest weaknesses
I'm very surprised by this, in my tests E2B has very limited general knowledge.
If you do a Google image search for "OAuth 2.0 PKCE sequence diagram" you get good results also. Maybe if you ask for something more esoteric this becomes valuable? Of course, that also makes hallucinations more likely.
"The LLM outputs compact code (~50 tokens) instead of raw Excalidraw JSON (~5,000 tokens)."
I see on the left that the LLM is outputting some instructions to add nodes and edges to the diagram. But what is interpreting those commands and turning them into an Excalidraw file?
[1] https://github.com/teamchong/drawmode
Just like you wouldn't use same table in your system for all users in a multi tenant application.
Tadaaa! Tracking.
There's no such thing as non-sensitive content from a CDN though. Scripts are obviously sensitive, styles can be used to exfiltrate data through background-url directives, and anything like images has no benefit being cached across sites.
Fonts might be one exception, but I bet those are exploitable somehow.
Okay, I did so. I realize that in your later followup comment you might want something different (like for Chrome itself to cache these downloads or something) but for now I made what you asked for, here you go:
https://stateofutopia.com/experiments/ephemeralcdn/
It's an ultrafast temporary CDN for one-off experiments like this. Should be lightning fast. By including the script, you can include any file this CDN serves.
Or indeed use some sort of huggingface model downloader (if that exist with XET)