131 points by gaigalas, Apr 1, 2026

11 Comments

gaigalasApr 1, 2026
Single standalone file, no external tools used, PATH='' (empty), portable (bash, dash, ksh, zsh), produces x86 ELF executables, has mini-libc builtin.

Usage:

printf 'int main(){puts("hello");return 0;}' | sh c89cc.sh > hello

chmod +x hello

./hello
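For readers wondering how a script can do useful work with an empty PATH, here is a minimal sketch of scanning a string one character at a time using only parameter expansion and `case`. It is not taken from c89cc.sh; the function name `classify` and the variable `KINDS` are hypothetical, and it assumes `printf` and `[` are builtins, as they are in bash, dash, ksh93 and zsh.

```shell
# Hypothetical sketch, not taken from c89cc.sh.
# Uses only shell builtins, so it would keep working with PATH=''.

# classify STRING -> sets KINDS to one letter per input character:
#   d = digit, a = letter or underscore, s = space, p = anything else
classify() {
    _rest=$1
    KINDS=
    while [ -n "$_rest" ]; do
        _tail=${_rest#?}          # everything after the first character
        _ch=${_rest%"$_tail"}     # the first character itself
        _rest=$_tail
        case $_ch in
            [0-9])     KINDS=${KINDS}d ;;
            [A-Za-z_]) KINDS=${KINDS}a ;;
            ' ')       KINDS=${KINDS}s ;;
            *)         KINDS=${KINDS}p ;;
        esac
    done
}

classify 'x1 = 42;'
printf '%s\n' "$KINDS"   # prints adspsddp
```

A real tokenizer would group these classes into tokens, but the loop shape (peel one character, dispatch with `case`) stays the same.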

angry_octetApr 3, 2026
I can't think of a reason to use c89cc.sh, but I salute this effort nonetheless.
jonahxApr 3, 2026
gorgeous!
t-3Apr 3, 2026
Why not POSIX or some common external tools where it makes sense? Most of those big switch statements could be easily replaced with some standard programs that already exist everywhere.
gaigalasApr 3, 2026
One main reason is performance. Forking for other tools is very expensive.
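As a rough illustration of the forking cost, many small jobs that are usually delegated to an external program can be done with parameter expansion alone, which starts no process. The sketch below is an assumption about the general technique, not code from c89cc.sh; `base_name` and `BASE_NAME` are hypothetical names, and the function only handles ordinary paths (it does not reproduce every edge case of the real `basename`).

```shell
# Hypothetical helper, not taken from c89cc.sh.
# Sets BASE_NAME to the last path component without forking.
# The result goes into a variable on purpose: capturing it with
# $(...) would itself start a subshell.
base_name() {
    BASE_NAME=${1%/}              # drop one trailing slash, if any
    BASE_NAME=${BASE_NAME##*/}    # drop everything up to the last slash
}

base_name /usr/local/bin/cc
printf '%s\n' "$BASE_NAME"   # prints cc
```

The external equivalent, `$(basename "$path")`, typically starts a subshell and executes another program on every call, which adds up quickly inside a parser's inner loop.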

That said, using larger sed or awk programs instead of ad-hoc calls for small snippets would perhaps be net-positive for performance and readability.

I'm currently working on very strict bootstrap scenarios in which sed and awk might not be available, but a shell might be (if I'm able to write it). It is possible that in such scenarios, the first sed and awk versions will be shell-written polyfills anyway.

dmitrygrApr 3, 2026
Many parts of this are clearly autogenerated, but that in no way diminishes the sickening impressiveness of it!
phireApr 3, 2026
It does make it a little hard to understand how the parser/ast_builder works.

But the rest seems easy enough to understand.

fuhsnnApr 3, 2026
> It does make it a little hard to understand

Or much easier to backdoor...

gaigalasApr 3, 2026
This is a hand-written parser for the shell itself (portable shell parsing portable shell):

https://gist.github.com/alganet/23df53c567b8a0bf959ecbc7b689...

It's an incomplete idea from around a year ago. The approach taken here (aliases as macro-like evals, AST generation using shell variables) became the backbone for the BNF parser generator.
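To make "AST generation using shell variables" concrete, here is a minimal sketch of one way it can be done in portable shell: each node gets a numeric id, and its fields live in flat variables named `node_<id>_<field>`, written and read through `eval`. This is an illustration under assumptions, not the gist's actual layout; `new_node`, `eval_node`, `NODE` and `RESULT` are hypothetical names, and the alias-as-macro half of the approach is not shown.

```shell
# Hypothetical sketch: AST nodes stored in flat shell variables.
node_count=0

# new_node TYPE VALUE [LEFT RIGHT] -> sets NODE to the new node's id
new_node() {
    node_count=$((node_count + 1))
    NODE=$node_count
    eval "node_${NODE}_type=\$1 node_${NODE}_value=\$2 node_${NODE}_left=\${3:-} node_${NODE}_right=\${4:-}"
}

# eval_node ID -> sets RESULT to the numeric value of the subtree.
# Positional parameters are per call, so they stand in for local
# variables, which POSIX sh does not have.
eval_node() {
    eval "set -- \"\$1\" \"\$node_${1}_type\" \"\$node_${1}_value\" \"\$node_${1}_left\" \"\$node_${1}_right\""
    # now: $1=id  $2=type  $3=value  $4=left id  $5=right id
    case $2 in
        num) RESULT=$3 ;;
        add|mul)
            eval_node "$4"
            set -- "$1" "$2" "$RESULT" "$4" "$5"   # keep left result in $3
            eval_node "$5"
            case $2 in
                add) RESULT=$(($3 + $RESULT)) ;;
                mul) RESULT=$(($3 * $RESULT)) ;;
            esac ;;
    esac
}

# Build and evaluate (2 + 3) * 4
new_node num 2;                a=$NODE
new_node num 3;                b=$NODE
new_node add '' "$a" "$b";     sum=$NODE
new_node num 4;                c=$NODE
new_node mul '' "$sum" "$c";   root=$NODE
eval_node "$root"
printf '%s\n' "$RESULT"   # prints 20
```

The ids passed to `eval` are generated internally, which is what keeps the `eval` calls safe here; untrusted text should only ever end up in the values, never in the variable names.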

This one is much simpler to understand. Simpler grammars tend to produce parser code that looks more like this one.

userbinatorApr 3, 2026
It would be far more interesting to look at what this was "compiled" from; it looks like the output of a state-machine generator.
gaigalasApr 3, 2026
The source for BNF generator is here:

https://gist.github.com/alganet/4dfd501a3377a60f7825901114d6...

Roughly 70% of c89cc was generated from it (parser, emitter).

It can generate parsers for C, ES6 and XML for example (subsets but not missing a lot).

It's still a mess though, and I have lots of work to do before a proper release.

gaigalasApr 3, 2026
Yes! The main parser and emitter come from a BNF parser generator, also written in portable shell (to be released though).
kelsey98765431Apr 3, 2026
Would be a lot better if it came with tests. Please do this justice and don't let it rot as a gist; make a real repo and add some docs and at least smoke tests of some kind. Thanks
fuhsnnApr 3, 2026
Don't understand why you were downvoted. An untested C compiler is simply worthless.
nananana9Apr 3, 2026
The 2026 brain simply cannot comprehend recreational programming.
fuhsnnApr 3, 2026
Well, I happen to have been recreationally maintaining a hobbyist C compiler for three years, adding tests is part of the fun.
gaigalasApr 3, 2026
You want to know what kinds of programs it can run, right?

Look at this one:

https://gist.github.com/alganet/1513d7b6abef5c1a53a324d897c3...

shell.c is a shell interpreter written for c89cc.sh. It can do the full self-hosting ouroboros:

- c89cc.sh can compile shell.c

- compiled shell.c via c89cc.sh can run c89cc.sh

It's not a full blown battle tested shell interpreter yet, but I'm working on it.

This file is part of the bootstrapping setup I'm working on for very early (pre tinyc) bootstrap from source on x64 machines, and it is by far the most complicated program c89cc.sh can compile.

fuhsnnApr 3, 2026
Thanks, that actually looks like a very solid baseline to start things with. Are you aware of onramp[1]? They use a custom VM as the base for the compiler and shell; it's extra steps, but could be more flexible long term.

[1] https://github.com/ludocode/onramp

gaigalasApr 3, 2026
This gist is a concatenation of several shell script modules which form a comprehensive parser library for the portable shell.

The main parser and emitter are BNF-generated (that's why they look so mechanical). The BNF parser generator is also written in portable shell (I posted another gist with a preview of it in another thread).

All modules have comprehensive tests, but it is still lacking documentation and not ready for prime time!

akavelApr 3, 2026
In the classic FLOSS tradition, it would be cool if you might still consider publishing such a "not-ready" repository - some people may (or may not!) be still interested, and also (sorry!) there's the bus factor... But on the other hand, in the classic FLOSS tradition, it's also 100% your decision and you have the full right to do any way you like!
jeyApr 3, 2026
It targets x86-64/ELF? I thought it would target `sh` to be portable?
gaigalasApr 3, 2026
saagarjhaApr 3, 2026
SuperH is not that portable.
_ache_Apr 3, 2026
I'm tempted to execute it, but it may as well be shellcode; I couldn't tell.
JackSmith_YCApr 3, 2026
Pure shell. Love the minimalism here... especially when every tiny CLI tool these days seems to require a 50MB node_modules folder just to run. There’s a certain Zen in doing things with zero dependencies. Reminds me of why I got into Unix in the first place.
tho2u3i4o23497Apr 3, 2026
Node stuff at least "works" - you've not seen real dependency hell until you've seen the horrible world of the Python ML ecosystem.
staredApr 3, 2026
uv solves most of that.

Before, it was a mess.

yetiheheApr 3, 2026
> Node stuff at least "works"

As someone just starting with a complicated Node-based project, I'd say that "works" for Python ML and "works" for Node are very close together, and both are very far from 'just works'.

self_awarenessApr 3, 2026
"Claude please generate me a C compiler in bash"

I mean, today it's possible to generate it in Tcl, Elisp, Windows BAT, Powershell.

The effort is just 1 prompt.

The WHY question is much more important today -- "because I can" no longer makes sense, because we all can do much, much more with minimum effort today than before LLMs.

gaigalasApr 3, 2026
Here's a prototype parser from 10 months ago, when this was not possible yet:

https://gist.github.com/alganet/23df53c567b8a0bf959ecbc7b689...

Here is me 10 years ago experimenting on parsing stuff with sed:

https://gist.github.com/alganet/542f46865420529c9bd2

---

Yes, c89cc.sh was definitely AI-assisted. However, I do carry extensive knowledge of the portable shell that was essential for the AI to complete it.

You'll find tricks inside c89cc.sh that don't exist anywhere, except in other code from me (like the ksh93 fix for local dynamic scoping or the alias/macro read -n1 polyfill).

The WHY is pretty obvious: I want to show that the portable shell is not a toy.

ueckerApr 3, 2026
I am tempted to click the "report abuse" link ;-)
dmeadApr 3, 2026
This is vibe coded right?
wengo314Apr 3, 2026
if one could bootstrap tcc with it, then it might be a viable tool.
redohApr 3, 2026
Writing a C compiler in pure shell is one of those projects that sounds absurd until you think about bootstrapping. If you want to compile C on a system where you literally have nothing but a POSIX shell, this is exactly what you need. The fact that the parser itself is BNF-generated from shell modules makes it even more interesting as a study in how far you can push shell scripting before it breaks. Would love to see this evolve into a proper repo with tests so it can actually serve as a minimal bootstrapping tool.