NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
On Rendering Diffs (pierre.computer)
eqvinox 17 minutes ago [-]
nit: there is no Linux "v6" or "v7". The first and second version component cannot be separated, it's v6.0 and v7.0. Versions 6.0 and 6.1 have as much in common as 6.19 and 7.0. You could also call them 3.60 and 3.80, or 2.6.100 and 2.6.120.
cipherself 21 hours ago [-]
For anyone else who's suffering, paste this in the console in devtools:

  document.getElementsByTagName('main')[0].style.margin = '0 auto';
chrismorgan 10 hours ago [-]
I’m surprised to see such a reaction to a left-aligned content column. I’d have said it was pretty common, though probably not as common as it used to be.

If I were to pick one problem with the presentation, it would rather be the font-size (12px is unequivocally too small), or the use of monospace (simply unsuitable for body text).

And incidentally, HN suffers from two of these three problems.

mi_lk 21 hours ago [-]
There's always reader mode when an author tries to be cute about their layout
fat 21 hours ago [-]
lmao - thank you!
AdamTReineke 21 hours ago [-]
Aw, Pierre shut down? Is there a write-up on that? (The code review startup idea.)
quadrature 21 hours ago [-]
thank you, how do they live like this.
darkamaul 20 hours ago [-]
What an interesting article. I did not assume I would read it until the end when I opened it, but the writing was super clear and easy to follow.

At the end, I admire the craft and patience to try to solve code diff rendering, and wish the folks at GitHub could put the same effort to improve their platform.

On a side note, I feel that we’re going to see more and more of this type of agentic usage, in well defined sub tasks, and the ability of a model to try many possibilities is a huge gift here.

amadeus 15 hours ago [-]
Hey thank you, appreciate your kind words! I don’t write much and was quite an effort to get all this written out!
gloria_mundi 16 hours ago [-]
I don't understand the point of the inverse sticky technique. Scrolling too fast still breaks the experience (content refuses to scroll), and in a way that, at least to me, feels more disruptive than blanking for a fraction of a second. I might just be too used to blanking.

Also ... shouldn't browsers just be able to render the diff without any of the trickery? Is the browser's job actually that hard for long pages, or are they just not optimising for this? Or is there some other reason for the virtualisation (e.g. memory usage)?

amadeus 15 hours ago [-]
I probably didn’t explain this well enough, but your render times always have to be within the frame buffer (16.6ms for 60hz or 8.3ms for 120hz). Under normal circumstances even if you occasionally blow a frame buffer, with the over-scroll you won’t hit the sticky bounds.

The only time you will is if you’re scrolling at a rate where the jumps are quite large — large and fast enough typically where you’re not going to have a frame of reference for what you should see vs what you are seeing to notice you are behind.

Ultimately scrolling is managed on a separate thread from JS, which means if you do like an opt+click on the scroll bar, you’re going to make a jump that JavaScript can never keep up with, even if you’re under your frame times.

And with regards to safari, if your requestAnimationFrame is capped at 60hz but your scrolling is GPU composited at 120hz, this is the only way to keep scrolling at 120hz with 60hz dom updates and never see any blanking.

Joker_vD 7 hours ago [-]
Also, I am deeply suspicious if they can properly support searching with Ctrl+F...

And yes, browsers should be just able to render the diffs, if the styling is somewhat minimal and there is no JS: I have a 6.3-MiB large HTML file with complete transcript of a certain TV series, formatted like

    <dd><b>Character #1</b>: Well, that was interesting all right.</dd>
    <dd><b>Character #2</b>: [sigh]</dd></dl>
    <dl><dd><b>Character #1</b>: Lemme help you.</dd>
    <dd><b>Character #3</b>: Yeehaw!</dd>
    <dd>[thump]</dd>
(the </dl><dl> produces a small paragraph break, to indicate scene transitions), and it scrolls and searches just fine.
shay_ker 13 hours ago [-]
hmm it does seem like a good point that browsers should be able to do this natively. the DOM already holds the content, and browsers manage what's rendered in the viewport.
eblanshey 19 hours ago [-]
It's cool seeing all the engineering that goes into optimizing performance of diffs. I'm working on a FreeCAD workbench that generates diffs on CAD model trees[0], and although my bottlenecks are a bit different, I can still implement some of your optimizations down the line if needed (such as deferred syntax highlighting).

My main bottleneck is that I do a complete diff on all open + changed documents in the repository up front, because due to how document properties are stored, I won't know if the file has meaningful changes until I compute the full diff (FreeCAD may save the document, but not have anything meaningful change.)

[0] https://github.com/eblanshey/HistoryWorkbench

amadeus 22 hours ago [-]
A bit of a technical deep dive into how we built CodeView, a review surface that can handle rendering diffs of immense size, all in a browser.
AlexErrant 18 hours ago [-]
Feature request; `git diff --color-moved` uses colors to display moved chunks of code. Scanning https://diffs.com/docs it isn't obvious that yall support that; please add it :)
OutThisLife 21 hours ago [-]
big fan :)
amadeus 21 hours ago [-]
thank you!
tomtom1337 20 hours ago [-]
It is SO NICE to see people working on making fast, nice-to-use tools. It's a lovely experience to use diffshub. Thank you for creating it, and than you for the great write-up! (I made it "that far" )
amadeus 15 hours ago [-]
Appreciate it, thank you!
joosters 21 hours ago [-]
I was hoping that this would talk more about the logic behind generating a diff, rather than the optimisations involved in rendering the text.

IMO (as someone who doesn't have to deal with the actual rendering) it would go a bit deeper into talking about deciding how to show what has changed. There's a lot of improvements that could be made there. e.g. "whitespace has changed here" so there's no real code changes involved.

Or "this big list of imports has changed, and code formatting has line-wrapped the list into different lines" - gitlab for example copes poorly with this. I'd love to just see a clean diff that highlights the additional import, and not just ten lines of changes caused by adding one line to a big list of imported symbols/functions.

amadeus 21 hours ago [-]
One of our next big projects is actually to support semantic diffs, which I think will be a lot more applicable to what you're asking for here. Currently diffs just takes a normal git patch file, or generates one from 2 versions of a file.
stephbook 20 hours ago [-]
Most projects start with tree-sitter and then switch to language-native parsers. Either way, it's not something you solve yourself – you just find the language-specific implementation load megabytes of WASM on the frontend or generate it on the backend.

difftastic, semanticdiff.. lots of projects like that. Obviously they can offer stuff like "function name changed" instead of showing you 30 lines of +newName -oldName

manquer 21 hours ago [-]
> rather than the optimizations involved in rendering the text.

Any views they have on this topic is going to come across as quite opinionated given their choices for text rendering for this post and general aesthetics of website.

amadeus 21 hours ago [-]
Naw, the truth is I'm not really smart or intelligent enough to build a semantic diff system. For that you'll need to wait on a post from one of our smarter devs, this was a post about rendering diffs in a browser.
bryanlarsen 5 hours ago [-]
The problem with large diffs is usually with the human, not the computer. Large diffs are very hard to review, so more tools that help with the understanding of a large diff would be very welcome.
akdor1154 21 hours ago [-]
I disagree with the theory that scrolling frame rate doesn't need to be smooth for scrolling to feel smooth.

On mobile it kinda does. Scrolling diffs on mobile just kinda feels crap.

I have been spoiled by years of engineer hours spent getting scrolling to be 60- or even 120Hz smooth to match my finger, and diffs just.. isn't.

I know this is frustrating to hear, and that this is technically compounded by mobile probably having the lowest device performance to be playing with too, but.. There you go.

amadeus 20 hours ago [-]
> disagree with the theory that scrolling frame rate doesn't need to be smooth for scrolling to feel smooth

It's possible you might be misunderstanding what I was trying to say here because 120hz scrolling on a 120hz device was the goal and why one of those virtualization techniques was not acceptable to me which lead me to coming up with a novel workaround to this problem (Inverse Sticky Technique).

CodeView uses a system that allows scrolling to update at your native framerate (120hz) WITHOUT needing Javascript needing to keep up at 120hz. If you're seeing stuttering while scrolling on https://diffshub.com would love to know more context (device/diff link/etc) because that is very much NOT our experience.

akdor1154 19 hours ago [-]
Even the linked ghostty PR on your home shows this - this is Firefox Android on a Nokia XR21 / TA-1486.

It's not unuseable, but it definitely feels like 'js hacking my scrolling' and not a native surface flinging around.

The experience is actually worse with smaller movements, i guess because my brain is more conscious when breaking the 'finger physically moving the text' illusion.

I don't mean to be dismissive - you're working on a really hard problem, and you're clearly approaching it with a mindset of perfection. I'm posting because I know you're probably able to solve this too :)

Edit: as a point of (unfair) comparison, the codemirror Huge File demo works fine: https://codemirror.net/examples/million/ It does suffer from the occasional partial paint when quickly coasting, but I'm not bothered by this at all, it's far less intrusive than dropping frames / stuttering / etc.

amadeus 18 hours ago [-]
Maybe i need to buy one of these devices to test.

Just to be clear tho, we don't actually scroll jack, native scrolling works as it should and content should move with normal gpu composited scroll. That said, it's possible that loading that much data into memory may be causing causing knock on effects somehow that are just slowing everything down.

embedding-shape 21 hours ago [-]
Matters a great deal on desktop too, and laptops for that matter. Even more on platforms like macOS that smooths scrolling by default too, but very noticeable on Windows and various Linux distributions too when native scrolling is janky/choppy, and it frustrates even casual users.
Igor_Wiwi 4 hours ago [-]
this approach is quite similar to what I use for rendering huge markdown files in https://mdview.io : 10MB files renders correctly and even Table of Content works. 15MB renders but scrolling is not as smooth as I wanted, will work on that. The approach described here https://igorstechnoclub.com/how-i-render-10mb-markdown-files...
shaokind 21 hours ago [-]
Semi-related: have you considered making DiffsHub a browser extension, so you can serve private diffs as well?

(I say this, having done a vibe-port of the code to a browser extension, so the underlying concept works.)

_ZeD_ 11 hours ago [-]
While for "simple" diffs these UIs are "fine", I'm still struggling (after 15+ years of search) for a really good tool that could help me with 3-file diffs...

I'm still stuck with (k)diff3, and, while they work, I would really like to a more integrate web interface for my projects

v3gas 11 hours ago [-]
Can you give an example of why you need that? Genuinely curious
jiehong 9 hours ago [-]
git merge conflicts?
thangalin 17 hours ago [-]
When producing TreeTrek, I went with rudimentary diffs that account for colourblind developers:

https://repo.autonoma.ca/repo/treetrek/commit/3fe9360599ae23...

The diffs rendering library looks amazing: https://diffs.com/

Presumably the red-green issue is a simple CSS update?

amadeus 17 hours ago [-]
Yup! You can pick from a bunch of different themes and use css variables to override the core colors as well!
speedgoose 8 hours ago [-]
> Safari, for example, currently caps requestAnimationFrame at 60Hz even on higher refresh-rate displays

Of course. It’s often Safari.

lxe 13 hours ago [-]
This is pretty awesome. I work with editors and monaco-like things a ton, and I review (look at) very large PRs very often. Having this speedy optimized interface is a delight. Check out their trees lib as well.
andrew_kwak 14 hours ago [-]
I remember wrestling with diff tools back in the day. A good visualization can make a world of difference. How does it handle large files with lots of changes?
nerdypepper 21 hours ago [-]
rendering massive diffs is cool but ultimately a gimmick. in what scenario are you actually reading a 500k line diff?

something i'd really want to see from forges is alternate diff techniques: like AST diffing.

amadeus 21 hours ago [-]
Performance and optimization is one of many pieces, but yes, it's a meme to render 500k lines.

That said though, and maybe I didn't say it well in the post, the more performant and optimized your tool is, the less burden you put on developers and users.

Sure you won't review 100k lines, but maybe the diff includes a ton of testing snapshots, or maybe it's a long running feature branch and you need to just quickly jump in and look at a specific change from a specific file. The less the developer or the user needs to think about `how` to render the diff or `how to navigate the diff`, the better we did our job.

mrkcsc 16 hours ago [-]
Optimizing for the P95/99 case of performance typically makes everything better as a whole.
IshKebab 21 hours ago [-]
Very impressive! I doubt Github or Gitlab would ever do something as good as this but maybe there's a chance we could get it in Forgejo?
philsnow 17 hours ago [-]
> so hit play on sandstorm

For a brief hopeful moment, I thought this was the .io kind of sandstorm

cvince 21 hours ago [-]
Whatever happened to all the pretext hype? I feel like that would be perfect for rendering huge diffs.
amadeus 21 hours ago [-]
Yes and no. It would help to improve things a bit when it comes the measure/reconciliation phase (unclear to say how much). However we've already done a pretty good job around batching writes vs reads.

However passing a million lines of code through pretext is unlikely to be very efficient, so a lot of the work around estimation is still very important.

That said, while I don't want to make pretext a direct dependency of the library, there's a good chance I'll explore the possibility of allowing devs to pass it in as an additional argument perhaps improve performance a bit.

It should also be noted that we have a full API to support things like line annotations (comments, etc) that are entirely controlled by the user, so there's always a bit of a dynamic aspect there that would come into play

taejavu 21 hours ago [-]
Can you select and copy text with pretext?
charcircuit 19 hours ago [-]
I feel like virtualization is not the right way to handle things. It adds so much complexity and makes the user experience buggy due to breaking optimizations and features of browsers.

Computers are very powerful these days and have a done of resources that they can use. We should be able to handle large diffs without any crazy tricks.

mrkcsc 15 hours ago [-]
> I feel like virtualization is not the right way to handle things.

How would you handle it?

charcircuit 14 hours ago [-]
Keep things as simple as possible and put the whole thing in the DOM. Then if there are performance problems address the scaling problems themselves instead of trying to avoid scaling. For example things like only rendering what is visible should be handled by the browser and not by messing with the DOM. The DOM is not synchronized with the browser's renderer so it will always end up hacky.
mrkcsc 13 hours ago [-]
OK but thats simply doesn't perform.

You can't say "only rendering what is visible should be handled by the browser" and call that a solution unless you have a magic wand to make Chrome/other browsers do this.

The browser doesn't do this, and so you can either do what you say and have your browser freeze when you load up a million line diff, or you can fix things within your control which is what the author is doing.

charcircuit 10 hours ago [-]
I just tested it now and Chrome does not freeze when opening up a page with a million lines of code. I repeated a file from the Linux kernel 600 times to reach over 1.1 million lines and I put each line in a div so I could alternate red and green background colors.

>you can fix things within your control

Blink is open source. Improving the browser is fully within your control and since browser automatically updates these optimizations will make it to your end users in a relatively short amount of time.

amadeus 10 hours ago [-]
Would love to see a website that makes everything DiffsHub does without making any of the work necessary. Making it all obsolete would be a huge step forward for the web. We support diffs that exceed 36 million lines, with syntax highlighting, comments, bundled into a reusable library for anyone to use.

Also this works in all browsers, fixing blink doesn’t help move the web forward.

charcircuit 8 hours ago [-]
I agree. I'm tired of these sites with glitchy scrolling and broken search due to trying to reimplement the browser in javascript.

>fixing blink doesn’t help move the web forward.

Historically making one browser engine faster encourages other engines to also be made faster. And realistically Blink has the majority of the browser market share, so it is the most important one to optimize for. Trying to move the web forward shouldn't be the primary goal. The primary goal should be making a good experience for the user. Niche engines like Firefox's just aren't worth caring about due to its low market share.

amadeus 8 hours ago [-]
Glad we solved glitchy scrolling for now (the title of this blog post was “on rendering diffs”, not “searching diffs”). Browser search on a 36 million line diff would not by great, but it’s something we hope to have a solution for. One step at a time.
charcircuit 7 hours ago [-]
If the solution for searching big pages was built into the browser then it would be fixed for every site instead of requiring each site to use a library or fix it themselves.
logdahl 21 hours ago [-]
Maybe an intended effect, but the header ascii art is suspiciously misaligned... :^)
brap 20 hours ago [-]
I don’t have much to say except I appreciate the ASCII art.
21 hours ago [-]
alexpandey 14 hours ago [-]
[dead]
archargelod 16 hours ago [-]
Is this really a hard problem? Rendering some text with coloring?

I believe we're just making everything harder than it needs by constraining it to a browser interface. This would be trivial as a terminal application written in native code.

amadeus 16 hours ago [-]
Yeah, my bad. Sorry i wasted your time :(
mrkcsc 16 hours ago [-]
What are you even talking about, this post is about rendering code diffs in the context of the web.

Whether or not it would be "trivial" in a native terminal application is irrelevant.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 17:19:56 GMT+0000 (Coordinated Universal Time) with Vercel.