This document outlines the bootstrapping process for building the CURSED compiler, following the self-hosting compiler approach.
The CURSED compiler will be developed in four distinct stages, each building on the previous:
- Stage 0: Bootstrap Environment Setup
- Stage 1: Minimal Bootstrap Compiler
- Stage 2: Full Compiler in CURSED
- Stage 3: Self-Compiled Full Compiler
In this stage, we prepare the environment for the bootstrap compiler to work with.
- Select Rust as the implementation language for the bootstrap compiler
- Define the core CURSED language subset that will be implemented in Stage 1
- Create the project structure and build system
- Implement basic utilities and libraries needed for compiler development
- Project structure with build system (using Cargo)
- CURSED language specification documents
- Lexer and parser for CURSED subset
- AST (Abstract Syntax Tree) representation
- Simple code generation framework
In this stage, we implement a minimal compiler in Rust that can compile a useful subset of CURSED to executable code.
- Implement a minimal but useful subset of CURSED language features
- Ensure the compiler can parse and generate code for this subset
- Implement enough features to allow writing the full compiler in CURSED
- Enable compilation of CURSED code to machine code or intermediate language
- Basic types (
lit
,normie
,tea
, etc.) - Variable declarations (
sus
) - Functions (
slay
) with parameters and return values - Control structures (
lowkey
,highkey
,bestie
,periodt
) - Basic I/O operations
- Simple module system (
vibe
,yeet
) - Basic error handling
- Minimal bootstrap compiler that can compile CURSED code
- Runtime library for CURSED programs
- Test suite for the bootstrap compiler
- Example programs written in the CURSED subset
- Documentation for the bootstrap compiler
In this stage, we implement a more complete CURSED compiler written in the CURSED language itself, using only the subset implemented in Stage 1.
- Implement a full compiler for CURSED in the CURSED language
- Use only features available in the bootstrap compiler
- Add support for additional language features not in the bootstrap compiler
- Ensure the compiler can compile itself
- Complete type system including generics
- Structs (
squad
) and interfaces (collab
) - Maps (
tea[K]V
) and slices - Advanced control flow
- Error handling mechanisms
- Concurrency with goroutines (
stan
) and channels (dm
) - Standard library
- Full compiler written in CURSED
- Expanded runtime library
- Comprehensive test suite
- Enhanced standard library
- Documentation for all language features
In this stage, we use the Stage 2 compiler to compile itself, producing a fully self-hosted compiler.
- Compile the Stage 2 compiler using itself
- Verify correctness through extensive testing
- Optimize the compiler for performance
- Add additional tools like formatters, documentation generators, etc.
- Self-compiled CURSED compiler
- Complete toolchain including formatter, linter, etc.
- Performance benchmarks
- Comprehensive documentation system
- Package manager for CURSED modules
The CURSED compiler follows this compilation pipeline:
- Lexical Analysis: Convert source code to token stream
- Parsing: Generate Abstract Syntax Tree (AST) from tokens
- Semantic Analysis: Type checking and semantic validation
- Intermediate Representation: Convert AST to IR
- Optimization: Apply various optimization passes
- Code Generation: Generate target code (machine code or intermediate language)
- Linking: Link with runtime libraries
Each stage of the compiler will be thoroughly tested with:
- Unit Tests: Test individual components
- Integration Tests: Test interactions between components
- End-to-End Tests: Test complete compilation pipeline
- Regression Tests: Ensure bugs don't reappear
- Compliance Tests: Verify language specification compliance
- Self-Hosting Tests: Verify compiler can compile itself
Stage | Estimated Duration | Key Milestones |
---|---|---|
0 | 2-4 weeks | - Project setup complete - Lexer and parser working - Basic AST representation |
1 | 2-3 months | - Minimal compiler working - Can compile simple CURSED programs - Basic runtime library |
2 | 3-6 months | - Full compiler in CURSED - All language features implemented - Standard library available |
3 | 1-2 months | - Self-compiled compiler - Complete toolchain - Documentation and ecosystem tools |
- Bootstrapping Gap: Bridging the gap between Rust and CURSED
- Error Handling: Creating meaningful error messages
- Performance: Ensuring the compiler has acceptable performance
- Testing: Verifying correctness throughout the process
- Language Evolution: Managing changes to the language specification during development