-
-
Notifications
You must be signed in to change notification settings - Fork 664
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement closures #798
Comments
I think we need to override the meaning of call indirect. |
How would this transform? function add(a, b): callback {
return () => a + b;
} |
class ClosureContext {
fn: (ctx: usize) => i32;
a: i32;
b: i32;
parent: ClosureContext | null = null;
}
function lambdaAdd(ctx: usize): i32 {
return changetype<ClosureContext>(ctx).a + changetype<ClosureContext>(ctx).b;
}
function add(a: i32, b: i32): ClosureContext {
let ctx = new ClosureContext();
ctx.fn = lambdaAdd;
ctx.a = a;
ctx.b = b;
return ctx;
} |
Instead class ClosureContext {
fn: (ctx: usize) => i32;
b: i32;
...
} we could just store index for indirect function table class ClosureContext {
fnIdx: usize;
a: i32;
...
}
...
call_indirect(ctx.fnIdx, ...args) |
How does this work with function dispatch? For instance, let's say I pass the Closure Context out to js as a pointer. How can I re-call it? |
Is there a way to make a table and add entries to it? |
@jtenner actually you need pass only EDIT No you need unpack |
Next example: function test(fn: (x: i32) => void): i32 {
let n = 0;
fn(x => { n = x });
return n;
} should generate: class ClosureContext {
fn: (ctx: usize, x: i32) => void;
n: i32;
parent: ClosureContext | null = null;
}
function lambdaFn(ctx: usize, x: i32): void {
changetype<ClosureContext>(ctx).n = x;
}
function test(fn: (x: i32) => void): i32 {
let n = 0;
let ctx = new ClosureContext();
ctx.fn = lambdaFn;
ctx.n = n;
fn(changetype<usize>(ctx));
n = ctx.n;
return n;
} |
Well I'm thinking about aspect collecting function pointers. How will aspect need to work with the function pointers? |
My guess is that passing around the closure context will cause problems with manually using call_indirect like aspect does. Also, this closure method doesn't handle multiple references to the same local. let a = 1;
let b = () => a;
let c = () => a += 1; B and c will get different versions of a. |
@jtenner (func $foo (result i32) (i32.const 1234))
(table (export "tbl") anyfunc (elem $foo)) than you js part: WebAssembly.instantiateStreaming(fetch('main.wasm')).then(({ instance }) => {
const table = instance.exports.tbl;
console.log(table.get(0)()); // 1234
}); |
It's likely we will need to allocate enclosed local values in a table or on the heap. Each pointer to those values will need to be stored in a table pointing to the heap. This idea is Naive because the variables can no longer be treated like local variables because it's possible to modify local values before the function finishes executing. |
Why? let a = 1;
let b = () => a;
let c = () => a += 1;
let br = b(); // 1
let cr = c(); // 2
// assert(a == 2) Will generate: class ClosureContextB { fn; a; }
class ClosureContextC { fn; a; }
function lambdaB(ctx: usize): i32 {
return changetype<ClosureContextB>(ctx).a;
}
function lambdaC(ctx: usize): i32 {
return changetype<ClosureContextC>(ctx).a += 1;
}
let a = 1;
let ctxB = new ClosureContextB(lambdaB, a);
let br = b(ctxB); // 1
// a = ctxB.a; // 1 unmodified so unnecessary
let ctxC = new ClosureContextB(lambdaC, a);
let cr = c(ctxC); // 2
a = ctxC.a; // 2 |
I'm saying |
No, we don't need pass plain types by boxed references. Also found pretty clear article. So |
Regarding collection of closure contexts: Seems the idea here is to pass around a closure context (containing both the function index and the lexical scope) instead of just a function index. While that can be reference counted, it leads to a situation where something can be called with either a function index or a closure context, for example function callIt(fn: () => void): void { fn(); }
function nop(): void {}
callIt(nop); // function index
let a: i32;
function ctx(): void { a = 1; }
callIt(ctx); // closure context which means we'd either have to generate two versions of A better approach might be to utilize the multi-value spec, in that a closure is actually two values, a function index and a lexical environment, with the latter possibly being |
Yep. The issue I'm going to hit with a multivalue return is when these function pointers need to be utilized in JavaScript. For instance, I want to call a describe function pointer that is nested. In order to do this from AssemblyScript, I need to export a function Edit: Could we have a primitive like |
|
Maybe one way to work around the allocation is to keep a singleton closure context per non-closure around in static memory. Like, if we know that the table has max 200 elements, make 200 dummies pre-populated with the function index and no lexical scope? Hmm |
Well at least 200 might not be enough. I can imagine taking advantage of this feature in aspect in very terrifying ways |
Firstly, I'd like to follow up on Daniel's example, would something like this work? type context<T> = T extends Context | T extends Function
function callIt<context<T>>(fn: T): returnof<T> { //This function would need to change.
if (isFunction<T>()){
if isVoid<T>() {
fn();
return;
}
return fn();
}
let ctx = changetype<Context>(fn);
if (ctx.isVoid){ // Need to add a isVoid property.
ctx.call()
return
}
return ctx.call();
} I read this article a year ago about how Elm handles first class functions in wasm: https://dev.to/briancarroll/elm-functions-in-webassembly-50ak The big take away is that a function context is a function pointer, its current arity (or how many arguments it still has to take) and an ArrayBuffer of the arguments that have been passed. When you have new function: let add = (a: i32, b: 32): i32 => a + b;
let addOne = add(1); ==> this is now a function `a: i32 => a + 1`;
let addTwo = add(2);
let two = addOne(1);
let three = addTwo(1); This could look something like this: class Func<Fn> {
// fn: Fn; //function to be called when all arguments are present
// airity: usize; // current airity
get length(): usize {
return lengthof<Fn>();
}
get currentArg(): usize {
return this.length - this.arity;
}
constructor(public fn: Fn, public arity: usize, public args: ArrayBuffer){}
static create<Fn>(fn: Fn, arity: usize = lengthof<Fn>(), args: new ArrayBuffer(sizeof<u64>() * lengthof<Fn>()): {
//This isn't exactly right size the size of each parameter could vary. But for sake of simplicity let's assume they are all usize.
// Let's assume there is a builtin `paramType<Fn>(0)`
type argType = paramType<Fn>(0);
let func;
if (arity > 1) ? paramType<Fn>(lengthof<Fn>() - arity + 1) : returnof<Fn>();
let func = new Func<argType, Fn>(fn, arity, args);
return func;
}
call<T>(arg: T): Func<Fn> | returnof<Fn>() {
assert(arg instanceof paramType<Fn>(this.currentArg());
if (arity == 0) { // arg is the last one and we can call the function
this.fn(...this.args); //This clearly needs to be a compiler operation to load the arguments as locals. Or transform all paremeter references with offsets and pass the array/arraybuffer.
}
let args = this.args.clone();
store<T>(args.dataStart + this.currentArg(), arg)
return new Func<Fn>(fn, this.arity - 1, args);
}
} I know a lot of this isn't currently possible, just wanted to get the point across. |
Potential implementation idea:
Problem: The same function can be called with either a function table index or a closure pointer. if (fn & 15) {
call_indirect(fn, ...);
} else {
ctx = __retain(fn);
call_indirect(ctx.index, ...);
__release(ctx);
} Here,
Compilation: When encountering a closed-over variable
When compiling a call to such a function
Performance impact:
What am I missing? :) |
Isn't this approach cause to table fragmentation? I predict closures can be much much more often than ordinal indirect functions |
Found which function capture environment (has free variables) and actually is closure and which is not is pretty simple and fully precise and doing in ahead of time during analysis inside AS module. Main problem with functions / closures which cross wasm<->host boundary |
That is a pretty bold assertion. ASC performance will likely slow down too. Next, consider this: type ClosureStruct = {
set: (n: i32) => void,
get: () => i32
};
function foo(x: i32): ClosureStruct {
return {
set: (new_value: i32): void => {
x = new_value;
},
get: (): i32 => x
};
}
function bar(): void {
const struct: ClosureStruct = foo(100);
struct.set(10);
struct.get(10);
} Would this be accessing invalid data? |
For |
C++ can't do this usually. But Rust which have own intermediate representation called MIR provide similar transforms. See this here: |
C++ couldn't do it because of the But, good luck with whatever implementation that the ASC team uses for AS closures! |
Ah right! I'm not very familiar with modern C++. But if you check llvm-ir you got a lot codegen initially. So all credit lies with LLVM in this case and its further optimizations.
Thanks! |
What's the current state of closures? How can I workaround the following code snippet: export function addClockEvent(cpu: CPU, callbackId: u32, cycles: u32): AVRClockEventCallback {
return cpu.addClockEvent(() => callClockEventCallback(callbackId), cycles)
} I'm porting a library to AS, but there is the point where a function is passed as a parameter. Because I can't pass a function to WASM, I am building a map with a callbackId that references the correct function on the JS side and calling this in an imported JS function. Without the closure I can't pass the parameter into it and modifying the function signature is not possible because it is deeply integrated. |
Just use temporary global variable: let _callbackId: u32;
export function addClockEvent(cpu: CPU, callbackId: u32, cycles: u32): AVRClockEventCallback {
_callbackId = callbackId;
return cpu.addClockEvent(() => callClockEventCallback(_callbackId), cycles)
} But this has some limitations. You can't use this approach for recursive calls and for serial calls |
I will get multiple calls to this function, so the global variable would be overwriten every time it get called. Is there any other method to do it or have I misunderstood something? For me it currently looks like I have to change the callback signature which is very problematic and blocking. Having closures would be nice. |
Here's the approach I used in a similar situation. It replaces the callback parameter with an interface, so you can easily create classes in that implement the interface and store additional information (sort of manual closure). interface CallbackInstance {
execute(): void;
}
class ClockEventCallback implement CallbackInstance {
constructor(readonly callbackId: u32) {}
execute(): void {
callClockEventCallback(this.callbackId); // or whatever custom login you want to do.
}
} Then change the signature of addClockEvent(callback: CallbackInstance , cycles: u32): void {
// now store callback somewhere, and use callback.execute() to call it.
} Finally, call it like: cpu.addClockEvent(new ClockEventCallback(whatever), 1000); I hope this is helpful! |
I used the approach @urish mentioned. I was hoping there is an alternative because it changes the signature of the method, which results in a changed API. |
I got another error while replacing a closure with an interface, the code looks like the following: //callReadHook and callWriteHook are imported module functions
export class ExternalCPUMemoryReadHook implements CPUMemoryReadHook {
call(addr: u16): u8 {
return callReadHook(addr);
}
}
export class ExternalCPUMemoryWriteHook implements CPUMemoryHook {
call(value: u8, oldValue: u8, addr: u16, mask: u8): boolean {
return callWriteHook(value, oldValue, addr, mask);
}
}
export function addReadHook(cpu: CPU, addr: u32): void {
cpu.readHooks.set(addr, new ExternalCPUMemoryReadHook());
}
export function addWriteHook(cpu: CPU, addr: u32): void {
cpu.writeHooks.set(addr, new ExternalCPUMemoryWriteHook());
}
// cpu.readHooks and cpu.writeHooks are of the following types:
export class CPUMemoryHooks extends Map<u32, CPUMemoryHook> {
}
export class CPUMemoryReadHooks extends Map<u32, CPUMemoryReadHook> {
} Just the import breaks the build, removing the exported functions still crashes.
|
Any reason why closures can’t be rewritten as classes & pass through constructor. As far as I can see in the ast, there should be a 1 to 1 relationship going from closures & currying to classes. For example, Typescript classes to es3 translation results in closures. |
Any updates on adding closure support? What is the blocker? |
Any updates on adding closure support? What is the problem? |
I planned to write a game engine use AS and I have programmed for a month, but now I give up AS and choose Rust because lack of supports for closures. Just let you know, It's not reasonable to not support closures in any case. |
@luffyfly as a Functional Programming enthusiast I totally agree that closures are super useful and handy. But are you sure closures in Rust are implemented for Wasm efficient enough for your game engine? As a general observation, in the current state Wasm presents a poor target for Functional Programming: doesn't efficiently support heavy use of function pointers, closures and GC. Moreover AssemblyScript is not really a Functional Language. And if you're programming in OO style it's not hard to implement purpose-built closures out of objects (going as far as Strategy Pattern if really needed). |
@gabriel-fallen I'm pretty sure about that, because When I call the fetch() function of the web host from Wasm, I have to use closures as callbacks to avoid blocking the main thread. |
Just to note. Closures and async (cooperative multitasking) are two independent things |
Closures in Rust are rarely represented as function pointers and instead as anonymous structs that have a "call" method that is statically dispatched rather than dynamically in almost all cases. The method is almost always inlined too, to the point where almost always you can't even tell it ever was a closure to begin with. It's very strongly a zero cost abstraction there. Though I'm also wondering why you say function pointers aren't well supported by Wasm. As far as I know they are just functions stored in the function table and then called via an GC though is definitely a problem with closures for languages that don't have an ownership system. |
@CryZe I have tried function pointers, and It worked but It is so weird and difficult to use. |
@MaxGraey If AS has aysnc implementations, I will not take closures. |
@luffyfly Again closures and |
@MaxGraey See #376 (comment). |
What is the current state of the closure implementation? I've seen several PRs related to closures, but none of them have been merged. I'm curious about what is the biggest block of closure implementation. |
I am interested in using AssemblyScript, but must admit I am reluctant to use until closures are implemented. Yes, obviously there are workarounds, but they are inconvenient and add to code bloat. All TypeScript developers whom I assume this project hopes to pilfer are accustomed to the convenience of closures and to some degree of functional programming. Many languages that target WASM already support closures, so I am confused as to what it is about AssemblyScript specifically that prevents closures from being implemented. I would be surprised if some limitation in WASM were a real roadblock. I saw some mentions of concerns around performance and allocations, which are reasonable, but I wonder if this is one of those situations where a bit of runtime efficiency is sacrificed for developer convenience? And the implementation wouldn't necessarily be the perfect implementation out the gate, so long as it is functionally correct. It could be refined or replaced later when new WASM features are introduced, or if a better implementation is worked out. |
Any update on this? Another year without closures 😢 Will 2024 the year of closures and thread support in AssemblyScript? |
Upvoting this. Array find functions, like |
I decide to start discussion about simple closure implementations.
Some obvious (and may be naive) implementation is using generation class context which I prefer to demonstrate in following example:
which transform to:
Closure and ClosureContext will not generated when no one variable was captured and use usual anonym functions.
ClosureContext will not created when only
this
was captured. In this casectx
param use to pass this reference.Other discussions: #563
@dcodeIO @jtenner @willemneal Let me know what you think about this?
The text was updated successfully, but these errors were encountered: