-
Notifications
You must be signed in to change notification settings - Fork 565
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Scaffold IPC-based API #711
base: main
Are you sure you want to change the base?
Conversation
This reverts commit 386fff2.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR scaffolds an IPC-based API by introducing a Node.js API client and a Go API server along with test cases, benchmarks, and support for virtual file systems. Key changes include the implementation of new JavaScript packages for AST definitions and API functionality, integration with libsyncrpc for synchronous IPC, and the addition of extensive benchmarks and CI workflow updates.
Reviewed Changes
Copilot reviewed 54 out of 59 changed files in this pull request and generated 1 comment.
Show a summary per file
File | Description |
---|---|
_packages/api/test/api.test.ts | New API tests for configuration parsing, symbol resolution, and disposal. |
_packages/api/src/typeFlags.ts | Added TypeFlags definition with runtime enum-like functionality. |
_packages/api/src/typeFlags.enum.ts | Introduced TypeFlags as a TypeScript enum. |
_packages/api/src/symbolFlags.ts | Added SymbolFlags with runtime enum implementation. |
_packages/api/src/symbolFlags.enum.ts | Introduced SymbolFlags as a TypeScript enum. |
_packages/api/src/proto.ts | Defined the protocol interfaces for transmitting API responses. |
_packages/api/src/path.ts | Implemented path utilities for processing file paths and URLs. |
_packages/api/src/objectRegistry.ts | Provides object caching and release handling for projects, symbols, and types. |
_packages/api/src/fs.ts | Added a virtual file system implementation with directory/file operations. |
_packages/api/src/client.ts | Implemented a client for synchronous IPC communication via libsyncrpc. |
_packages/api/src/api.ts | Implemented the API client including overloaded methods for symbol and type retrieval. |
_packages/api/bench/api.bench.ts | Added benchmarks to compare API performance with TypeScript APIs. |
README.md | Updated build instructions to require Rust along with Go and Node.js. |
Herebyfile.mjs | Updated tasks to include API test and build steps. |
.github/workflows/ci.yml | Updated CI workflows with Rust toolchain usage and new environment flags. |
Files not reviewed (5)
- _packages/api/bench/tsconfig.json: Language not supported
- _packages/api/package.json: Language not supported
- _packages/api/test/tsconfig.json: Language not supported
- _packages/api/tsconfig.json: Language not supported
- _packages/ast/package.json: Language not supported
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This all works locally for me, nice. It's a shame npm doesn't tell you what's taking so long to build on install, but oh well.
Going to go test this on Windows where I don't have rust installed to see what happens there.
At this point I think this looks good; I'd only want to make sure everyone knows they have to install rustup (or msrustup?) to keep things working, unless we're able to find a way to make that build lazy somehow. |
import fs from "node:fs"; | ||
import path from "node:path"; | ||
import { fileURLToPath } from "node:url"; | ||
import { Bench } from "tinybench"; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In a way I wish were just using vitest here (since it has tinybench built-in), but it's all pretty similar.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think I have any notes, I'm just exited to get it in and start playing with it.
This is amazing work @andrewbranch! Myself and @acutmore are working on a blog post about some of the wider implications of the Go port. (@jakebailey has been kind enough to eyeball a draft I believe.) In that we talk about I was digging through this PR and didn't spot a |
Important
Until libsyncrpc is set up to publish to npm, this PR takes a git dependency on it, which will build the binary from source during
npm install
. You need Rust 1.85 or higher to have a successfulnpm install
in typescript-go.Note
Takeaways from design meeting:
This PR is the start of a JavaScript API client and Go API server that communicate over STDIO. Only a few methods are implemented; the aim of this PR is to be the basis for discussions around the general architecture, then additional functionality can be filled in.
Same backend, different clients
This PR includes a synchronous JavaScript client for Node.js. It uses libsyncrpc to block during IPC calls to the server. Relatively small changes to the client could produce an asynchronous variant without Node.js-specific native bindings that could work in Deno or Bun. I don’t want to make specific promises about WASM without doing those experiments, but using the same async client with an adapter for calling WASM exports seems possible. I’m imagining that eventually we’ll publish the Node.js-specific sync client as a standalone library for those who need a sync API, and an async version adaptable to other use cases, ideally codegen’d from the same source. The same backend is intended to be used with any out-of-process client.
Client structure
This PR creates two JavaScript packages,
@typescript/ast
and@typescript/api
(which may make more sense as@typescript/api-sync
or@typescript/api-node
eventually). The former contains a copy of TS 5.9’s AST node definitions, related enums, and node tests (e.g.isIdentifier()
), with the minor changes that TS 7 has made to those definitions applied. The latter contains the implementation of the Node.js API client. It currently takes a path to the tsgo executable and spawns it as a child process. (I imagine eventually, the TypeScript 7.0+ compiler npm package will be a peerDependency of the API client, and resolution of the executable can happen automatically.)Backend structure
tsgo api
starts the API server communicating over STDIO. The server initializes theapi.API
struct which is responsible for handling requests and managing state, like a stripped-downproject.Service
. In fact, it uses the other components of the project system, storing documents and projects the same way. (As the project service gets built out with things like file watchers and optimizations for find-all-references, it would get increasingly unwieldy to use directly as an API service, but a future refactor might extract the basic project and document storage to a shared component.)The API already has methods that return projects, symbols, and types. These are returned as IDs plus bits of easily serializable info, like name and flags. When one of these objects is requested, the API server stores it with its ID so follow-up requests can be made against those IDs. This does create some memory management challenges, which I’ll discuss a bit later.
Implemented functionality
Here’s a selection of the API client type definitions that shows what methods exist as of this PR:
Here’s some example usage from benchmarks:
Client-side virtual file systems are also supported. There’s a helper for making a very simple one from a record:
Performance
These are the results of the included benchmarks on my M2 Mac. Note that IPC is very fast on Apple Silicon, and Windows seems to see significantly more overhead per call. Tasks prefixed
TS -
refer to the rough equivalent with the TypeScript 5.9 API. ThegetSymbolAtPosition
tasks are operating on TypeScript’sprogram.ts
, which has 10893 identifiers.To editorialize these numbers a bit: in absolute terms, this is pretty fast, even transferring large payloads like a binary-encoded
checker.ts
(10). On the order of tens, hundreds, or thousands of API calls, most applications probably wouldn’t notice a per-call regression over using the TypeScript 5.9 API, and may speed up if program creation / parsing multiple files is a significant portion of their API consumption today (5–7). However, the IPC overhead is pretty noticeable when looking at hundreds of thousands of back-to-back calls on an operation that would be essentially free in a native JavaScript API, like getting the symbol for every identifier in a large file (15, 18). For that reason, we’ll be very open to including bulk/batch/composite API methods that reduce the number of round trips needed to retrieve lots of information for common scenarios (16, 17).Memory management
The current API design uses opaque IDs for objects like symbols and types, so the client can receive a handle to one of these objects and then query for additional information about it. For example, implemented in this PR is
getTypeOfSymbol
, which takes a symbol ID. The server has to store the symbol in a map so it can be quickly retrieved when the client asks for its type. This client/server split presents two main challenges:To accomplish this, there is a client-side object registry that stores objects by their IDs. API users will need to explicitly dispose those objects to release them both from the client-side store and from the server. (Server objects may be automatically released in response to program updates, and making additional queries against them will result in an error.) This can be done with the
.dispose()
method:or with explicit resource management: