Applicative APIs and partial application with `f(a, b, ..)` #1852

hayleigh-dot-dev · 2022-11-16T12:00:14Z

The problem

In Elm it's common to have these pipe-friendly APIs for things like decoding, parsing, and validating. A hypothetical decoder API might look something like:

decoder : Decoder Person
decoder =
    succeed (\name age address -> Person { name = name, age = age, address = address })
        |> and_map (field "name" string)
        |> and_map (field "age" int)
        |> and_map (field "address" string)

The basic format involves starting with some constructor function that takes n arguments, and then chaining a bunch of operations with n number of and_maps to slowly piece together something.

Just using pipes on their own is insufficient because we're in some other "context", in this case a Decoder, or a Parser, or something else. The magic happens in and_map. The type looks like this...

and_map : Decoder a -> Decoder (a -> b) -> Decoder b

...or in Gleam syntax...

and_map : fn(Decoder(fn(a) -> b), Decoder(a)) -> Decoder(b)

This works in Elm (and others) specifically because Elm's functions are automatically curried. That is, a function a -> b -> c is actually a chain of single-argument functions a -> (b -> c). In Gleam that means instead of fn(a, b) -> c we would have fn(a) -> fn(b) -> c.

We can produce a similar API in Gleam today, with the help of the curry utilities found in gleam/function...

import gleam/function

fn decoder() -> Decoder(Person) {
  succeed(function.curry3(fn(name, age, address) { Person(name, age, address) }))
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
}
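
For readers who want to run the pattern without a decoder library, here is a minimal sketch of the same pipeline shape. It makes two assumptions that are not part of the post: `Result` stands in for `Decoder`, and the constructor is curried by hand instead of with a library helper. The `Person` type and this `and_map` are illustrative only.

```gleam
pub type Person {
  Person(name: String, age: Int, address: String)
}

// Applies a wrapped single-argument function to a wrapped value.
// `Result` is only a stand-in for `Decoder` in this sketch.
pub fn and_map(
  wrapped_f: Result(fn(a) -> b, String),
  wrapped_a: Result(a, String),
) -> Result(b, String) {
  case wrapped_f, wrapped_a {
    Ok(f), Ok(a) -> Ok(f(a))
    Error(e), _ -> Error(e)
    _, Error(e) -> Error(e)
  }
}

pub fn person() -> Result(Person, String) {
  // The constructor is curried by hand: one argument per nested function.
  Ok(fn(name) { fn(age) { fn(address) { Person(name, age, address) } } })
  |> and_map(Ok("Ada"))
  |> and_map(Ok(36))
  |> and_map(Ok("London"))
}

pub fn main() {
  let assert Ok(Person("Ada", 36, "London")) = person()
  Nil
}
```

Each `and_map` step consumes exactly one argument, which is why the function inside the wrapper has to be a chain of single-argument functions.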

These APIs are very nice to consume, and have some advantages over alternative ones that might make use of .then, but we hit some resistance in Gleam because of the explicit currying step. Currently there are utilities to curry functions up to six arguments: curry2, curry3, ... curry6.

This explicit currying is clunky and confusing. For those new to FP the word "curry" probably conjures images of a delicious meal sooner than a chain of single-argument functions, and it inevitably raises the question "why can't I just write succeed(fn(name, age) { ... })?"

The answer to that question is simple. We don't have a way to partially supply one (or some) argument(s) to a function.

The proposal

And with that I'd like to present a way to achieve that. I propose we re-use the existing spread syntax (..) as a way of partially applying a function. Here is a minimal example...

fn do_maths (x: Int, y: Int, z: Int) -> Int {
  x + y * z
}

fn example () {
  // `f` is a function that takes two arguments:
  // fn(Int, Int) -> Int
  let f = do_maths(1, ..)
  let a = f(2, 3)
  
  // `g` is a function that takes one argument:
  // fn(Int) -> Int
  let g = f(2, ..)
  let b = g(3)

  // This is a type error! All the arguments have
  // been supplied at this point, there is nothing
  // left to "spread".
  let h = g(3, ..)

  // Multiple arguments can be supplied, not just
  // one. Here `i` is a function that takes one
  // argument:
  let i = do_maths(1, 2, ..)
  let c = i(3)
}
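
As a rough guide to the intended semantics, the sketch below writes each proposed spread call as the explicit wrapper it would stand for in today's Gleam. The mapping is an assumption about how the proposal would behave, not a description of an implemented feature.

```gleam
fn do_maths(x: Int, y: Int, z: Int) -> Int {
  x + y * z
}

pub fn main() {
  // Proposed `do_maths(1, ..)`, written as an explicit wrapper.
  let f = fn(y, z) { do_maths(1, y, z) }
  let assert 7 = f(2, 3)

  // Proposed `f(2, ..)`.
  let g = fn(z) { f(2, z) }
  let assert 7 = g(3)

  // Proposed `do_maths(1, 2, ..)`. With only one argument missing,
  // the existing function capture syntax already covers this case.
  let i = do_maths(1, 2, _)
  let assert 7 = i(3)

  Nil
}
```

Function capture allows a single `_` hole, so it only helps when exactly one argument is left over; the spread syntax is meant to cover every other case.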

Although not 1:1, this approach to partial application has parallels with existing usage of the spread syntax:

[ x, ..rest ]
Foo(..foo, bar: x)

The above example shows the syntax for partially applying function calls. The second part of this proposal is an extension of the type system to introduce the idea of "a function with at least this many arguments".

Consider again the type of and_map we saw earlier...

and_map : fn(Decoder(fn(a) -> b), Decoder(a)) -> Decoder(b)

This required currying to work nicely in the pipe, because we need that b to also be a function. A revised version of and_map using the proposed spread syntax becomes...

and_map : fn(Decoder(fn(a, ..b) -> c), Decoder(a)) -> Decoder(fn(..b) -> c)

The complete implementation could then be...

fn and_map(decode_f: Decoder(fn(a, ..b) -> c), decode_a: Decoder(a)) -> Decoder(fn(..b) -> c) {
  use f <- then(decode_f)
  use a <- then(decode_a)

  // We partially apply `f` with just the argument `a`.
  // This in turn produces a new function `fn(..b) -> c`
  // which we wrap in a `Decoder` using `succeed`.
  succeed(f(a, ..))
}

Now we can write...

fn decoder() -> Decoder(Person) {
  succeed(Person)
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  ...
}

Because the spread always implies there are additional arguments, unlike the Elm version we need a second function to act as the terminator in this style of pipe API. For that, we can re-use the initial type of and_map and just rename it...

and_finally : fn(Decoder(fn(a) -> b), Decoder(a)) -> Decoder(b)

...and with that we are left with...

fn decoder() -> Decoder(Person) {
  succeed(Person)
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_finally(field("address", string))
}

Technical details

This essentially involves treating functions as a separate Kind, and introducing a form of row polymorphism to allow us to express the idea of "at least n arguments" in a type annotation and use function application to eliminate arguments from a type.

If we look instead at records, this concept may make a bit more sense. A type like...

{ r | x : Int }

...is a record with the field x : Int and r additional fields. This means all the following record types would unify with the above...

{ x : Int, y : Float }
-- No additional rows is valid
{ x : Int }
-- Ordering is typically insignificant
{ a : String, x : Int }

We run into some additional constraints that seem practical to impose for our function version of row polymorphism. The "no additional rows is valid" case makes less immediate sense for functions. Given the row polymorphic function type...

fn(Int, ..b) -> Int

...it feels like fn(Int) -> Int should not unify here. Similarly, order must become significant for obvious reasons.

Alternatives

@lpil and I have been thinking about this problem on-and-off for a while now. The current decoder API supplies a bunch of decode{N} functions, and an older version of a parser package I have provided a bunch of succeed{N} functions in a similar fashion. This places a hard limit on the arity of the functions you can provide in these APIs.

Another idea is to introduce a language construct for automatic currying, with a curry keyword. Making this a keyword/operator means we can side-step the type system and have a single way to curry arbitrary arity functions rather than curry2, curry3, and so on. This has the same problem as today, where users are forced to learn what currying even is before they can consume some API.

My thoughts are a little scattered on this. I have a good idea of what it would look like / how it would work in my head, but I've struggled to put it in writing. Thoughts, comments, and questions all very much welcome: I will do my best to clear things up!

lpil · 2022-11-16T21:27:23Z

Maintainer

This is a really good write up, thank you! I think the partial function application is the best approach we've seen to this so far. Let's think about this more

0 replies

lpil · 2022-11-16T21:37:15Z

Maintainer

A question from nth on Discord

under the partial eval proposal is this a valid type?
type SchemaBuilder = fn(..a) -> Schema(t)

We would at least need to parameterise the type, but I'm not sure it's quite right still

type SchemaBuilder(a, t) = fn(..a) -> Schema(t)

7 replies

hayleigh-dot-dev Nov 17, 2022
Author

I wonder if fn(a, ..b) -> c should be "arity greater than 1" rather than "at least 1"

I agree, I think. Especially if the following:

f(a, ..)

would not type check if f was only 1 arity (which I think makes sense)

schurhammer Nov 19, 2022

I think f(a, ..) should be allowed on arity 1, similar to how [x, ..r] works with a 1 length list (r = empty list).
I haven't thought this through fully but I think it would allow you to avoid needing an and_finally function.

NthTensor Nov 20, 2022

I do not think there is a good way to allow fn(a) -> b to match with fn(a, ..x) -> b. Either we accept and_finally (I personally think it is actually good), or we have to scrap the proposal. And we should not scrap the proposal.

The basic problem with f: fn(a) -> b matching fn(a, ..x) -> b is that it screws up partial evaluation. If your function has an argument fun: fn(a, ..x) -> b you can't call fun. Calling it would mean you get back something of type b, but you want to work with higher-arity functions and they need more information before they can return a b. The best we can do is partially apply an argument arg with fun(arg, ..) to get a type matching fn(..x) -> b. If we allow arity=1 matches and pass fn(a) -> b to that argument, the partial application fun(arg, ..) should end up as either b or fn() -> b. The former conflicts with partial evaluation returning a function and is generally unsound, and the latter means you still need an and_finally or something similar and is, in my opinion, very gross feeling.

I also have some concerns regarding side-effects, but I think this is the meat of the issue.

schurhammer Nov 20, 2022

fun(arg, ..) should end up as fn() -> b is what I was thinking. If that's the case, why do you still need an and_finally? Can't you just call the function?

edit: to expand
I think the original example would look like

fn decoder() -> Decoder(Person) {
  let decode = succeed(Person)
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
  decode()
}

Or if you don't like the look of that you could pipe it to something like fn exec(f) { f() }

I hope I'm not just talking rubbish haha 😅

NthTensor Nov 20, 2022

That all makes sense, and I guess the API is a bit cleaner. It seems like we won't be able to escape needing a final invocation to break out of partial-application, but you are right that having it require no arguments looks a bit nicer and maybe requires less duplicate code. And you are also right that this lines up better with the list spread semantics.

NthTensor · 2022-11-18T03:15:04Z


I don't have much experience on the PL design side of things, but I wanted to chime in and voice my support for this proposal. Clean yet super expressive. I love it. Using this feature, the following example ORM-like-thing can be fully implemented in about 150 lines of pure gleam (save for four external encoding/decoding functions).

pub type City {
  City(name: String, latitude: Float, longitude: Float, metadata: Metadata)
}

fn main() {

  let city_schema = schema(table: "city", is: City)
  |> field("name", string, fn(city) { city.name })
  |> field("lat", float, fn(city) { city.latitude })
  |> field("lng", float, fn(city) { city.longitude })
  |> construct()

  let cities = [
    City(name: "New York", latitude: 40.71, longitude: 74.00, metadata: metadata()),
    City(name: "Berlin",   latitude: 52.52, longitude: 13.40, metadata: metadata()),
    City(name: "London",   latitude: 51.50, longitude: 00.12, metadata: metadata()),
  ]

  // A stack is a collection of rows from multiple SQL tables
  // type Stack = Map(String, Frame), where a frame represents some rows from a single table
  // type Frame = Map(String, Series), where `Series` is a vector version of `Dynamic` representing a column 
  let stack: Stack = city_schema.encode(cities)

  // Select just the cities frame
  // Encoders return multiple frames because objects can contain loaded associations
  let frame = map.get(stack, "city")

  // If we decode the frame, it should return the same values we started with!!!
  assert Ok(cities) = city_schema.decode(frame)
}

Here construct is analogous to and_finally and it lets us do something kinda cool: Suppose we need to attach some database information to a user's custom types. We could wrap things in a generic type, OR we could require users to add a metadata field to their structs. By defining construct as

fn construct(schema: Schema(fn(Metadata) -> t)) -> Schema(t)

instead of

fn construct(schema: Schema(fn(a) -> t), a: a) -> Schema(t)

we can require that the last field for any type used in the ORM is of type Metadata. Anything else is a type error when they try to write a schema. Neat, right? More generally, the separation of and_map & and_finally lets you do book-ending operations at the end of your applicative-style pipeline. I really like this, and I think it should be viewed as a feature. Which is all to say, yes I think distinguishing between arity=n and arity>n is the way to go.

The row-poly syntax is perfectly in line with the behavior of the spread operator as defined on lists. Only one "spread" is allowed, and only after all the other arguments. The partial application syntax also seems great. I think if you bill it as "argument spreading" rather than "currying", new users will probably pick it up pretty quickly (if they have to touch it at all).

Comments

Since the OTP module and external functions may cause side effects, I do think it is important that we emphasize the difference between partial application and partial evaluation.
I believe this feature is incompatible with function overloading, which is maybe why you don't see too many non-FP languages implementing it.
You can't use an arg spread with function type alias, right? But you never have to use an alias, so worst-case this just makes things a little more verbose. This seems acceptable, but it is maybe a downside compared to the curry keyword.

Questions

How much can the compiler optimize the sorts of highly head-recursive calls you get with applicative pipelines?
Is there runtime overhead associated with this?
What needs to happen on the type-checking and compiling level for something like this to be implemented?

5 replies

NthTensor Nov 20, 2022

Also worth noting: I think this proposal might not mesh well with the proposed optional arguments. We can make it work, because basically and_finally determines when a function is actually evaluated. But it would definitely add some complexity.

inoas Nov 20, 2022

I hope we can do optional args with defaults first then to not block the path forward?

lpil Nov 20, 2022
Maintainer

I think optional arguments would work statically, in a similar fashion to labels, so they would not be available with functions as values like this.

NthTensor Nov 20, 2022

Would you elaborate on this?

lpil Nov 20, 2022
Maintainer

You can only use labels when directly calling module functions. Functions degrade when you use them as a value, so you can no longer use labels with them.

pub fn fun(label arg) {
  arg
}

pub fn main() {
  fun(label: 1) // This works
  let x = fun
  x(label: 1) // This does not work
}

lpil · 2022-11-20T15:57:13Z

Maintainer

How would the code generation work for this?

pub fn main(f: fn(Int, ..b) -> c) {
  f(1, ..)
}

I don't know what code we would generate for this. Don't we need to know the arity of f to know the arity of the wrapper function we would generate?
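
To make the concern concrete, here is a small sketch in today's Gleam of the wrappers that `f(1, ..)` would have to become for two different arities. The helper names are hypothetical; the point is only that the two wrappers are different functions, so the arity of `f` has to be known where the wrapper is generated.

```gleam
fn add2(a: Int, b: Int) -> Int {
  a + b
}

fn add3(a: Int, b: Int, c: Int) -> Int {
  a + b + c
}

// Wrapper needed when `f` has arity 2: one argument remains.
fn apply_one_of_2(f: fn(Int, b) -> c) -> fn(b) -> c {
  fn(b) { f(1, b) }
}

// Wrapper needed when `f` has arity 3: two arguments remain.
fn apply_one_of_3(f: fn(Int, b, c) -> d) -> fn(b, c) -> d {
  fn(b, c) { f(1, b, c) }
}

pub fn main() {
  let assert 3 = apply_one_of_2(add2)(2)
  let assert 6 = apply_one_of_3(add3)(2, 3)
  Nil
}
```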

3 replies

NthTensor Nov 24, 2022

For future readers: There was a big discussion in discord and everybody seemed to come to the conclusion that, yep, this is a serious issue. Getting around it would seem to require either runtime reflection or full program compilation, and would likely complicate long-term plans for a native compile target. (I didn't figure any of this stuff out myself).

I've been thinking about it since, and I have come to the conclusion that we are doing row-polymorphism backwards. Instead of having row-polymorphic arguments that accept a range of normal functions, we need to have row-polymorphic functions that can be passed to a range of normal arguments. Flipping this around also means we need to add information to function calls rather than function definitions, so this is maybe going to feel a little closer to the curry keyword.

Here is the basic sketch of this idea: do type inference on the structure of functions. Suppose we are working with a function bar: fn(Int, Int, Int, Int) -> String and we want to do something weird and complicated like this (using the syntax of the original proposal):

fn foo(f: fn(Int, Int, ..b) -> c, arg: Int) -> fn(..b) -> c {
  f(arg + 1, arg - 1, ..b)
}

fn main() {
  let b = foo(bar(1, ..), 2)
}

The type annotation for foo looks slick, but as was pointed out it's basically useless. If we remove it, we might rewrite the above like so, retaining what we can of the partial application syntax.

fn foo(f: fn(Int, Int) -> k, arg: Int) -> k {
  f(arg + 1, arg - 1) 
}

fn main() {
  let b = foo(bar(1, ..), 2)
}

which should be equivalent to the following

fn main() {
  let b = foo(
    fn (arg1, arg2) {
      fn (arg3) {
        bar(1, arg1, arg2, arg3)
      }
    }, 2)
}

Basically, we don't need to be able to specify greater-than-arity function types if we use a souped-up version of the curry keyword. We just need to be able to automatically break a function into a nested sequence of wrapper functions, each with arbitrary arity. As a bonus, this syntax leaves room for partial application.

Is this more tractable? Again, I'm a little weak on theory, so I don't know. I think what I am proposing is upgrading fun(..) to a row-polymorphic function that matches any concrete function argument fn(a_1, a_2, a_3, ..., a_N) -> r if the ith argument of fun is of type a_i and r is totally generic. If fun has greater arity than N, we let r be a function from the remaining args to the return-type of fun. If fun has exactly N matching args, we evaluate the function and r is the normal return type. Otherwise it's a type error. If that all works, it seems like you get partial application for free. I'm hoping that this formulation lets us better exploit the existing and very powerful generic system, which admittedly I do not know very much about.

If this works out, applicative-style pipelines could be as simple as

fn main(dyn: Dynamic) {
  succeed(Person(..))
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
}

I am having trouble working out if this works with more complex arrangements of higher-order-functions, and that worries me a little. It seems like once you start passing these polymorphic types around between multiple functions, it might still require monomorphization somewhere. Would love to hear from the more theory-minded folks whether they think this is true.

Edit: Pretty sure this is easily broken by conditionally passing a row-polymorphic function to one of two different functions, each expecting different arity -_-. We could throw a compiler error but it seems like maybe we just should curry things.

ShalokShalom Nov 24, 2022

Just to write this down here as well:

It was also discussed how currying would be introduced into the language.

One important point here is that people coming from cough old, traditional languages should not flee en masse upon hearing about traditional Indian food in the language.

Louis suggested to create a syntax to avoid naming the feature with a keyword.

I would suggest, to call the keyword simply partial

Either way, it would be fine to speak about partial application in case somebody asks about the feature, without confusing people about cardamom and its relationship to a dead mathematician.

timjs Nov 24, 2022

I hope to write down a more theoretical description of the ideas you described @hayleigh-dot-dev and @NthTensor. Did not read along the discussion on Discord though.

timjs · 2022-12-07T22:13:31Z


TL;DR;

I think that what we want to do here is most easily solved by support from the compiler, to insert appropriate transformations in the right place for partial application and for currying, no need for row types.

Oké, it took me some time to sit down and write about this. Let me first try to summarise the problem stated by @hayleigh-dot-dev and @NthTensor, and then try to formalise it and pose a solution. Please let me know if anything is unclear or, even more importantly, if I’m describing a solution that doesn’t fit the posed problem.

Problem

@hayleigh-dot-dev starts describing the problem of decoding pipelines, which in Gleam currently need to be written like this:

import gleam/function

let decoder: Decoder(Person) =
  succeed(function.curry3(Person))
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))

The pain point here is that the Person constructor needs to be curried. As decoder.and_map has type fn(Decoder(fn(a) -> b), Decoder(a)) -> Decoder(b), we need to pass in a function of exactly one argument (b can be anything). So we need to convert the type of the Person constructor from fn(String, Int, String) -> Person to fn(String) -> fn(Int) -> fn(String) -> Person. This is what function.curryN does, for different arities N.

From here, the discussion started for syntax to partially apply functions and for syntax to denote some sort of currying.

Terms

Let me introduce some terms, to make clear what we’re talking about.

Currying

In general, currying is turning a function of arity n, into a function of arity 1 which returns a new function of arity 1 and so on, up until all n arguments can be filled in. For example:

import gleam/function

let calc = fn(a, b, c) {
  a * b * c
}
// calc : fn(Int, Int, Int) -> Int

let calc_curried = function.curry3(calc)
// calc_curried : fn(Int) -> fn(Int) -> fn(Int) -> Int

As you can see, we spread all 3 parameters of calc over separate 1-arity functions. We could define calc as a curried function from the start like this:

let calc_always_curried = fn(a) { fn(b) { fn(c) {
  a * b * c
} } }

or enforce that all functions are curried by default (as Haskell, OCaml, Elm, PureScript, etc. do), but there are reasons not to do this, like performance. Gleam and Erlang choose not to curry by default.

Note the difference in calling the functions:

let result = calc(1, 2, 3) // all arguments need to be supplied
let result = calc_curried(1)(2)(3) // all arguments are provided separately
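
A minimal, self-contained sketch of what a `curry3`-style helper does, written out by hand so it does not depend on any library version. The name mirrors the helper used above, but this definition is an illustration rather than the standard library's source.

```gleam
// Turns a 3-argument function into a chain of 1-argument functions.
fn curry3(f: fn(a, b, c) -> d) -> fn(a) -> fn(b) -> fn(c) -> d {
  fn(a) { fn(b) { fn(c) { f(a, b, c) } } }
}

fn calc(a: Int, b: Int, c: Int) -> Int {
  a * b * c
}

pub fn main() {
  let calc_curried = curry3(calc)

  // Uncurried: all arguments are supplied in one call.
  let assert 24 = calc(2, 3, 4)

  // Curried: one argument per call.
  let assert 24 = calc_curried(2)(3)(4)

  Nil
}
```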

Partial application

Partial application is when a function gets supplied fewer arguments than needed to execute. So, if we have a curried function calc_curried, we can give it just one argument, which results in a function which needs just two more (actually: a function which needs one more, which returns a function which needs another one):

let calc_double = calc_curried(2)
// calc_double : fn(Int) -> fn(Int) -> Int

Note that currying is something that can make partial application possible. The proposition is to add syntax so that we can supply only the first parameter to the uncurried function calc. The proposed syntax for this is:

let calc_double = calc(2, ..)
// calc_double : fn(Int, Int) -> Int

The result is a function which only needs 2 arguments (now really 2).

Similarly, we can provide the first two arguments and get a function taking just one argument:

let calc_quadruple = calc(1, 4, ..)
// calc_quadruple : fn(Int) -> Int

Curried function type

Then, there is another proposal to add some kind of curried function type (not an official term as far as I’m aware, I invented it myself). Note that this is something different from partial application and currying! It is about changing the contract a function exposes to the outer world by giving it another type signature. Take for example, using the proposed syntax, a function which needs an f of at least two integers:

let use_two = fn(f: fn(Int, Int, ..r) -> a, x) {
  f(x + 1, x - 1)
}
// use_two : fn(fn(Int, Int, ..r) -> a, Int) -> fn(..r) -> a

The usage of this function would be something like this:

let g = use_two(calc, 6)
// g : fn(Int) -> Int

calc is a function of 3 arguments, so it has at least 2 and fits the signature of use_two. Inside use_two it gets applied to 2 arguments, and the result is a function which still requires one more integer. By using rewriting, we would like the above definition of g be equivalent to:

let g = fn(x) { calc(6 + 1, 6 - 1, x) }

Proposal

Language

Oké, let’s set up our language. We’re using Gleam. All Gleam programs can be simplified to the following grammar:

e ::=                 // Expressions
  | x                 // variables
  | let x = e_1; e_2  // let-binding
  | fn(x^k) { e }     // function abstraction
  | f(e^k)            // function application
  | l                 // literal

t ::=                 // Types
  | fn(t^k) -> t_0    // function type
  | p                 // primitive type

First a note on my grammar. x, e, t, l, p, and k are metavariables that range over other productions in this grammar. I use the notation x^k to denote a sequence of k variables, separated by commas. So k is the arity of the function. Something similar happens in function application, where e^k denotes k arguments passed to function f. For example, f(e^3) is short for f(e_1, e_2, e_3). All other symbols are to be taken verbatim (that is, terminals in the grammar), where ; denotes a newline. Furthermore, l are literals (such as integers and strings), and p primitive types (such as Int and String). We won’t spend more words on them than this.

The proposal is to add two pieces to the syntax, one new expression and one new type form. I’ll call the first one partial application and the second one auto currying (also not an official term). I’ll explain why in a minute.

e ::= ...
  | f(e^n, ..)           // partial application

t ::= ...
  | fn(t^n, ..t) -> t_0  // auto currying

Here n has the same purpose as k, I just named them differently.

Helpers

Next, let me introduce two functions (or actually two families of functions) called partial_n_k and curry_n_k. I say families of functions, because I define them for arbitrary lengths n and k. Some examples:

partial_2_3(f, e_1, e_2) will apply 2 out of 3 arguments to function f of arity 3, and return a new function of the 1 remaining argument. So, in this example:
- if f is a function of 3 arguments:
  f : fn(t_1, t_2, t_3) -> t_0
- and e_1 and e_2 are expressions of type t_1 and t_2:
  e_1 : t_1, e_2 : t_2
- then partial_2_3(f, e_1, e_2) returns a function from the 1 remaining argument to its return type:
  partial_2_3(f, e_1, e_2) : fn(t_3) -> t_0
The full type of partial_2_3 is fn(fn(t_1, t_2, t_3) -> t_0, t_1, t_2) -> fn(t_3) -> t_0.

curry_2_3(f) will transform a function of 3 arguments into a function taking the first 2 arguments, and then the remaining 1. So:
- if f is again a function of 3 arguments:
  f : fn(t_1, t_2, t_3) -> t_0
- then curry_2_3(f) returns a function taking the first 2 arguments, and returning a function taking the remaining 1:
  curry_2_3(f) : fn(t_1, t_2) -> fn(t_3) -> t_0
The full type of curry_2_3 is fn(fn(t_1, t_2, t_3) -> t_0) -> fn(t_1, t_2) -> fn(t_3) -> t_0.

You can think of curry_n_k(f) as splitting function f of arity k into a function of two steps, it first takes n arguments and then takes k-n arguments.

We can define these functions in pseudo Gleam like this:

// partial_n_k takes
// * a function f of arity k
// * n arguments e_1, ..., e_n
// it returns a function of
// * the remaining k-n arguments y_n+1, ..., y_k
//   and applies all k arguments to f
fn partial_n_k(f, e^n) {
  fn(y^(k-n)) { f(e^n, y^(k-n)) }
}

// curry_n_k takes
// * a function f of arity k
// it returns a function which
// * first takes n arguments x_1, ..., x_n
// * then the remaining arguments x_n+1, ..., x_k
fn curry_n_k(f) {
  fn(x^n) { fn(y^(k-n)) { f(x^n, y^(k-n)) } }
  // this is actually equivalent to:
  // fn(x^n) { partial_n_k(f, x^n) }
}
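
For one concrete member of each family, here is a sketch of `partial_2_3` and `curry_2_3` (n = 2, k = 3) in plain Gleam. The function names follow the pseudocode above; `calc` is the example function from earlier, redefined so the block stands alone.

```gleam
// Applies the first 2 of 3 arguments and returns a function of the last one.
fn partial_2_3(f: fn(a, b, c) -> d, x: a, y: b) -> fn(c) -> d {
  fn(z) { f(x, y, z) }
}

// Splits a 3-argument function into "first 2 arguments, then 1".
fn curry_2_3(f: fn(a, b, c) -> d) -> fn(a, b) -> fn(c) -> d {
  fn(x, y) { partial_2_3(f, x, y) }
}

fn calc(a: Int, b: Int, c: Int) -> Int {
  a * b * c
}

pub fn main() {
  let calc_quadruple = partial_2_3(calc, 1, 4)
  let assert 12 = calc_quadruple(3)

  let assert 12 = curry_2_3(calc)(1, 4)(3)

  Nil
}
```

Every other pair of n and k follows the same shape, which is what would let a compiler generate the right helper once both numbers are known.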

Now, my hypothesis is that, after type inference, we can automatically insert partial_n_k and curry_n_k for correct n and k at every position needed when using the new grammar for partial application and auto currying.

Examples

Partial application

When we have the partial application from the example above:

let calc_quadruple = calc(1, 4, ..)

we know that:

the function calc has type fn(Int, Int, Int) -> Int, so it has arity 3;
the partial application calc(1, 4, ..) applies 2 arguments.

So k = 3 and n = 2 and therefore:

calc(1, 4, ..) ~~> partial_2_3(calc, 1, 4)

(Read ~~> as “translates to”.)

Auto currying

From the example above:

let use_two = fn(f: fn(Int, Int, ..r) -> a, x) {
  f(x + 1, x - 1)
}

let g = use_two(calc, 6)

We know that:

the function calc has type fn(Int, Int, Int) -> Int, so it has arity 3;
the function use_two has type fn(fn(Int, Int, ..r) -> a, Int) -> fn(..r) -> a, so it needs a function of arity 2.

So calc has arity 3, but is needed in a context of arity 2 (k = 3 and n = 2):

calc ~~> curry_2_3(calc)

And therefore

use_two(calc, 6) ~~> use_two(curry_2_3(calc), 6)

The signature of use_two can be safely translated into “Gleam without the sugar” (as @NthTensor already said):

fn(fn(Int, Int, ..r) -> a, Int) -> fn(..r) -> a
~~>
fn(fn(Int, Int) -> k, Int) -> k
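
The translated program can be written out in today's Gleam as a sanity check. This is a sketch under the assumption that the compiler would insert `curry_2_3` exactly as described; the helper is defined by hand here.

```gleam
fn calc(a: Int, b: Int, c: Int) -> Int {
  a * b * c
}

// Hand-written stand-in for the helper the compiler would insert.
fn curry_2_3(f: fn(a, b, c) -> d) -> fn(a, b) -> fn(c) -> d {
  fn(x, y) { fn(z) { f(x, y, z) } }
}

// The desugared signature: `k` absorbs whatever is left after 2 arguments.
fn use_two(f: fn(Int, Int) -> k, x: Int) -> k {
  f(x + 1, x - 1)
}

pub fn main() {
  // g : fn(Int) -> Int, equivalent to fn(x) { calc(7, 5, x) }
  let g = use_two(curry_2_3(calc), 6)
  let assert 70 = g(2)

  Nil
}
```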

Edge cases

Maybe you’ll already have the feeling that some laws should apply to k and n. As k is the total amount of parameters, and n is the supplied amount of arguments, n can never exceed k, so at least we have n <= k. But can n == k or can one or both be 0?

What if `n == k`?

partial would apply all arguments and delay the function call:

partial_3_3
  : fn(fn(t_1, t_2, t_3) -> t_0, t_1, t_2, t_3)
  -> fn() -> t_0
    // ^^ no arguments left

curry would move all the arguments to the “first half of the split” and delay the resulting call:

curry_3_3
  : fn(fn(t_1, t_2, t_3) -> t_0)
  -> fn(t_1, t_2, t_3) -> fn() -> t_0
     // ^^^^^^^^^^^^ all params are here

(Above are just examples for n == k == 3, but we can do this for every n and k.)

Both don’t seem very useful at first sight. Below I’ll show you that it even breaks things.

What if `n == 0`?

partial would be the identity function:

partial_0_3
  : fn(fn(t_1, t_2, t_3) -> t_0)
  -> fn(t_1, t_2, t_3) -> t_0
  // ^^^^^^^^^^^^^^^^^^^^^^^^ same as before

When would you encounter this? Well, when one’d write a partial application like

let calc_same = calc(..)

This is like saying: “I won’t give any arguments to calc but just save the result in calc_same”, which indeed is like the identity, so it’s silly and you can better write:

let calc_same = calc

curry would delay the whole function call:

curry_0_3
  : fn(fn(t_1, t_2, t_3) -> t_0)
  -> fn() -> fn(t_1, t_2, t_3) -> t_0
    // ^^ no params here any more

An example with currying for n == 0 would be

fn f(g: fn(..r) -> a) { ... }

which is like saying “g is a function with any number of arguments, but I don’t know how many”. This is pretty lame, as you cannot do anything with it except for calling it with no arguments at all. But you don’t know what its result will be, only that it is a function. I don’t see any use case for this, maybe somebody else has ideas….

So both don’t seem very handy to me, perhaps even confusing.

Conclusion

We need that 0 < n < k.

More examples

Let’s take a look at the example from @NthTensor from another thread. If we have:

fn bar(a, b, c, d) {
  a + b + c + d
}

and we have the following code:

fn foo(f: fn(Int, Int, ..r) -> a, x: Int) -> fn(..r) -> a {
  // this is fine, we don't need partial application:
  f(x + 1, x - 1)
}

let b = foo(bar(1, ..), 2)

Note that the call to f in foo doesn’t need any partial application syntax, because we’re giving it exactly 2 arguments.

This gets translated to:

fn foo(f: fn(Int, Int) -> z, x: Int) -> z {
  f(x + 1, x - 1)
}

let b = foo(curry_2_3(partial_1_4(bar, 1)), 2)

Here, the type signature of foo gets translated to base Gleam function types. But the real magic happens in the call to foo!

We've a partial application of bar where:
- k = 4 because bar has 4 parameters
- n = 1 because 1 argument is given;
and auto currying of that function where
- k = 3 because the result of partial_1_4 is a function with 3 parameters
- n = 2 because of the type signature of f in foo.

The original example had a partial application in the call to f in foo:

fn foo(f: fn(Int, Int, ..r) -> a, x: Int) -> fn(..r) -> a {
  f(x + 1, x - 1, ..)
}

let r = foo(bar(1, ..), 2)

(With .. in place of ..b, as the original example had it. This would mix syntax from the type level and the term level!)

This is bad. Imagine what would happen if we’d allow this. We’d get the following translation:

fn foo(f: fn(Int, Int) -> z, x: Int) -> z {
  partial_2_2(f, x + 1, x - 1)
}

let r = foo(curry_2_3(partial_1_4(bar, 1)), 2)

which, unfolded, is equivalent to:

fn foo(f: fn(Int, Int) -> z, x: Int) -> z {
  fn() { f(x + 1, x - 1) }
}

let r = foo(
  fn(x_1, x_2) { fn(x_3) { // <= curry_2_3
    fn(y_2, y_3, y_4) {    // <= partial_1_4
      bar(1, y_2, y_3, y_4)
    }(x_1, x_2, x_3)
  } }, 2)

and even further:

let r = foo(
  fn(x_1, x_2) { fn(x_3) {
    bar(1, x_1, x_2, x_3)
  } }, 2)
==>
let r = fn() {
  fn(x_1, x_2) { fn(x_3) {
    bar(1, x_1, x_2, x_3)
  } }(2 + 1, 2 - 1)
}
==>
let r = fn() {
  fn(x_3) {
    bar(1, 3, 1, x_3)
  }
}

r now has type fn() -> fn(Int) -> Int. We gained an extra empty function call!

Takeaway message:

Don’t allow the full number of arguments to be partially applied.
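
The extra call can be reproduced in current Gleam by writing the helpers by hand. This is only a sketch: partial_2_2 and the other helper bodies are assumptions based on the naming scheme above, and foo's return type is annotated as fn() -> z here, because that is what the body really returns once partial_2_2 wraps the call.

```gleam
import gleam/int
import gleam/io

fn bar(a: Int, b: Int, c: Int, d: Int) -> Int {
  a + b + c + d
}

fn partial_1_4(f: fn(a, b, c, d) -> e, x1: a) -> fn(b, c, d) -> e {
  fn(y2, y3, y4) { f(x1, y2, y3, y4) }
}

fn curry_2_3(f: fn(a, b, c) -> d) -> fn(a, b) -> fn(c) -> d {
  fn(x1, x2) { fn(x3) { f(x1, x2, x3) } }
}

// partial_2_2: both arguments are supplied (n == k), so nothing is left
// to wait for and the result is a function of zero arguments.
fn partial_2_2(f: fn(a, b) -> c, x1: a, x2: b) -> fn() -> c {
  fn() { f(x1, x2) }
}

// foo with `f(x + 1, x - 1, ..)` translated to partial_2_2.
fn foo(f: fn(Int, Int) -> z, x: Int) -> fn() -> z {
  partial_2_2(f, x + 1, x - 1)
}

pub fn main() {
  let r = foo(curry_2_3(partial_1_4(bar, 1)), 2)
  // r has type fn() -> fn(Int) -> Int, so an empty call is needed
  // before the last argument can be supplied.
  io.println(int.to_string(r()(10)))
}
```

The caller has to write r()(10) instead of r(10), which is exactly the empty call the takeaway warns about.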

Then there is the question of what happens if we had the following definition instead of bar:

fn baz(a, b, c) {
  a + b + c
}

let s = foo(baz(1, ..), 2)

baz just takes 3 arguments instead of 4. After partially applying it to the integer 1, it needs just 2 arguments, which is exactly the number foo wants to have! We’d get:

let s = foo(partial_1_3(baz, 1), 2)

There is no curry_2_2 needed here! (Even worse, if you’d add it, it will go horribly wrong, just try to rewrite foo(curry_2_2(partial_1_3(baz, 1)), 2) and see it happening. I’m leaving this as an exercise to the reader 😉.)

Takeaway message:

When you’re passing in a function of the right arity, leave it as is.

Implementation

If I’m correct, Gleam’s type system is fully based on Damas-Hindley-Milner (correct me if I’m wrong @lpil). This means that after type inference, we know the type of every expression and parameter in the language. The only thing we need to change (I think, I didn’t check; I created a bidirectional type system first, but then realised Gleam doesn’t have one 😛) is the unification algorithm, where types of the form:

fn(t^k) -> t_0    and   fn(t^n, ..r) -> t_0

need to be unified for n <= k. In cases where n > k this results in a type error, as this signals you’re trying to fit a function which doesn’t have enough arguments into one that wants more…
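
The arity rule can be sketched as a small decision function. This is not compiler code: the Coercion type and the decide and describe functions are hypothetical names, and the sketch assumes n >= 1 (the n == 0 case was ruled out above). It only models which wrapper would be inserted when a function with k parameters meets an expected type with n fixed parameters.

```gleam
import gleam/int
import gleam/io
import gleam/order

// Hypothetical outcome of unifying `fn(t^k) -> t_0` with
// `fn(t^n, ..r) -> t_0`. Not part of the Gleam compiler.
type Coercion {
  // n == k: the function already fits, leave it as is.
  LeaveAsIs
  // n < k: wrap the function in curry_n_k.
  InsertCurry(n: Int, k: Int)
  // n > k: the function has too few parameters.
  ArityError(expected: Int, found: Int)
}

fn decide(k: Int, n: Int) -> Coercion {
  case int.compare(n, k) {
    order.Lt -> InsertCurry(n, k)
    order.Eq -> LeaveAsIs
    order.Gt -> ArityError(expected: n, found: k)
  }
}

fn describe(c: Coercion) -> String {
  case c {
    LeaveAsIs -> "leave as is"
    InsertCurry(n, k) ->
      "insert curry_" <> int.to_string(n) <> "_" <> int.to_string(k)
    ArityError(expected, found) ->
      "type error: found "
      <> int.to_string(found)
      <> " parameters, expected at least "
      <> int.to_string(expected)
  }
}

pub fn main() {
  // bar(1, ..) has 3 parameters left, foo wants 2.
  io.println(describe(decide(3, 2)))
  // baz(1, ..) has 2 parameters left, foo wants 2.
  io.println(describe(decide(2, 2)))
  // A 1-parameter function cannot fill a 2-parameter slot.
  io.println(describe(decide(1, 2)))
}
```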

Runtime overhead

Obviously, there is some runtime overhead. We’re creating additional anonymous functions which get (partially) called. However, it is quite obvious where the overhead happens when you look at the code:

- Everywhere you’re using partial application syntax like f(x, y, ..), an anonymous function is going to be created.
- In every call to a function with an auto curried parameter like fn(t_1, t_2, ..r) -> t_0 there is a potential overhead because a curried function has to be created.

Summary

This is already a really long post and it’s getting late.

I hope that I could show two things:

- Sugar for partial application can be easily transformed into existing Gleam.
- The same holds for sugar for curried function types in function parameters.

Both features can be supported by the compiler because arities are known at compile time. Therefore the compiler can insert (inlined versions of) the partial_n_k and curry_n_k calls where appropriate.

There is no need for row type polymorphism. As @hayleigh-dot-dev already said, you would need to take order into account. Also, the translation of row polymorphic records to real records is usually by adding the index of the field in the record as part of the polymorphic function (see Daan Leijen’s excellent paper), this would complicate function calls a lot.

I’m not aware of any prior art for this problem that uses neither row types nor auto currying. The thing that comes closest is Swift’s autoclosure, which wraps an expression in a closure so it doesn’t have to be evaluated immediately: a kind of automatic support for lazy evaluation. We could implement it along with this, as it is similar to inserting partial_n_k or curry_n_k calls.

Although inserting (inlined versions of) partial_n_k and curry_n_k will lead to convoluted code, it is quite clear what happens, and developers would write the same code by hand. So you can really see this as an advanced form of syntactic sugar!

11 replies

lpil Dec 21, 2022
Maintainer

Great write up! Thank you very much @timjs !

One thing to consider which hasn't been touched on here is that one of our main goals is to have good interop with Erlang etc in both directions, so Gleam APIs should always be easy to use from those languages. The general rule is that the generated Gleam code should not be easily distinguishable from hand-written Erlang or Elixir.

This would be the first feature where the caller needs to do some extra contextual work in order to call the Gleam function, which would make it a lot harder to use without the Gleam compiler automatically doing this housekeeping, and it would very clearly not be normal Erlang code.

This might be acceptable, but we need to think about how we would communicate to Erlang/Elixir/etc users how to call these functions, and anything else we need to do in order to provide them with a good UX.

On use + pipes

About the admin/authenticate example: I think this is an error in the Gleam compiler. The sugar is not expanded or type checked correctly. This (without any partial application!)

use user <- authenticate("admin") |> attempt()
foo(user)

should be equivalent to

use user <- attempt(authenticate("admin"))
foo(user)

which currently does work.

It should not do that, at least not now. We always introduce the minimal useful version of a feature so that we can learn about it. It is easy to add, it is hard to subtract.

It's possible that we may add a more sophisticated use which uses type information and performs some type backtracking to determine which of the possible rewrites is appropriate, but today it is only syntactic sugar.

I'm not yet convinced that making it more complex in this way will result in Gleam code being easier to work with. There is a nice property with the current design that the function to which control is yielded is the one closest to the use keyword. It looks quite similar to the await found in many languages. The examples I have seen so far in which pipes would be used have been consistently less clear to me than the current style and no more concise.

NthTensor Dec 24, 2022

Let's look at some examples of Erlang as generated by this proposal. I will assume curry and partial are stand-ins for two common forms of anonymous functions.

Partial application

list.map(list, foo("test", ..))
>> list.map(list, partial_1_2(foo, "test"))

ultimately generates something like this, right?

lists:map(fun (V) -> foo("test", V) end, List).

I've had cause to write stuff like that once or twice in my limited time with Erlang. It's always pretty simple. In my mind, partial application fits with hand-written Erlang. I also don't see a situation where someone could be forced to use partial application, so I don't think it can 'infect' non-Gleam code with foreign patterns either.
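
For comparison, the single-hole case can already be written in Gleam today, either with an explicit anonymous function or with the existing function capture syntax, where `_` marks the one missing argument. The sketch below is only an illustration: foo and the sample list are made up, and the proposed `..` form would generalise this to several trailing arguments.

```gleam
import gleam/io
import gleam/list
import gleam/string

// Hypothetical two-argument function standing in for foo.
fn foo(prefix: String, value: String) -> String {
  prefix <> ":" <> value
}

pub fn main() {
  let items = ["a", "b"]

  // Explicit anonymous function: the shape partial_1_2(foo, "test")
  // would expand to.
  let explicit = list.map(items, fn(v) { foo("test", v) })

  // Function capture, available in Gleam today.
  let captured = list.map(items, foo("test", _))

  io.println(string.join(explicit, ", "))
  io.println(string.join(captured, ", "))
}
```

Both lines print the same thing, and both compile to an Erlang fun much like the one shown above.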

Currying is not as nice. Using your example

pub fn main(f: fn(Int, ..b) -> c) {
  f(1)
}

should translate into something as simple as

main(F) ->
  F(1).

but it's up to non-Gleam users to ensure F is of the correct arity and that F returns a function of the correct arity. This can get complicated. Consider the pipelined-parsing concept:

pub fn succeed(a: a) -> Decoder(a) {
  fn(_dynamic) { 
    Ok(a)
  }
}

pub fn then(decode: Decoder(a), f: fn(a) -> Decoder(b)) -> Decoder(b) {
  fn(dynamic) {
    case decode(dynamic) {
      Ok(a) -> f(a)(dynamic)
      err -> err 
    }
  }
}

pub fn and_map(decode_f: Decoder(fn(a, ..b) -> c), decode_a: Decoder(a)) -> Decoder(fn(..b) -> c) {
  use f <- then(decode_f)
  use a <- then(decode_a)
  succeed(f(a))
}

pub fn and_finally(decode_f: Decoder(fn(a) -> b), decode_a: Decoder(a)) -> Decoder(b) {
  use f <- then(decode_f)
  use a <- then(decode_a)
  succeed(f(a))
}

We might use this to parse a dynamically typed object

fn decoder() -> Decoder(Person) {
  succeed(Person)
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_finally(field("address", string))
}

and this should (ideally, using some hand wavy type unification) de-sugar to the following

fn decoder() -> Decoder(Person) {
  and_finally(
    and_map(
      and_map(
        succeed(
          curry_1_2(
            curry_2_3(Person)
          )
        ),
        field("name", string)
      ),
      field("age", int)
    ),
    field("address", string)
  )
}

and translate (I believe) to

decoder() ->
  and_finally(
    and_map(
      and_map(
        succeed(
          fun (A1) ->
            fun (A2) ->
              (fun (B1, B2) ->
                fun (B3) ->
                  {person, B1, B2, B3}
                end
              end)(A1, A2)
            end
          end),
        field("name", string)),
      field("age", int)),
    field("address", string)).

This is not sane or normal Erlang, which is unfortunate. I still don't think this would often end up preventing Erlangists from writing idiomatic code, as you mostly need currying to do type shenanigans. I mean, there's really no reason to use the dynamic module from Erlang, and you can always just write manually curried functions when you don't have to worry about constructors. However, there is probably at least one example of a curried API written in Gleam that forces people working in Erlang to write plumbing with anonymous functions.

By the way, as you can see from this example, currying the same function multiple times results in a redundant function call, similar to what @inoas and @timjs discussed with partial. It would be cool if we could collapse this:

...
fun (A1) ->
  fun (A2) ->
    fun (B3) ->
      {person, A1, A2, B3}
    end
  end
end
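
Until something does that collapsing automatically, the nested form can be written by hand in current Gleam. Below is a self-contained sketch of the pipeline under some loud assumptions: Value and Input are hypothetical stand-ins for a dynamic value (so the example does not depend on the dynamic module), errors are plain strings, and the sample record is made up. Only succeed, then and and_map follow the definitions above, with and_map using the plain fn(a) -> b signature.

```gleam
import gleam/io
import gleam/list

// Hypothetical stand-in for a dynamic value: a list of named fields.
pub type Value {
  Text(String)
  Number(Int)
}

pub type Input =
  List(#(String, Value))

pub type Decoder(a) =
  fn(Input) -> Result(a, String)

pub type Person {
  Person(name: String, age: Int, address: String)
}

pub fn succeed(a: a) -> Decoder(a) {
  fn(_input) { Ok(a) }
}

pub fn then(decode: Decoder(a), f: fn(a) -> Decoder(b)) -> Decoder(b) {
  fn(input) {
    case decode(input) {
      Ok(a) -> f(a)(input)
      Error(e) -> Error(e)
    }
  }
}

pub fn and_map(decode_f: Decoder(fn(a) -> b), decode_a: Decoder(a)) -> Decoder(b) {
  use f <- then(decode_f)
  use a <- then(decode_a)
  succeed(f(a))
}

// Look up a named field and decode its value.
pub fn field(name: String, decode: fn(Value) -> Result(a, String)) -> Decoder(a) {
  fn(input) {
    case list.key_find(input, name) {
      Ok(value) -> decode(value)
      Error(Nil) -> Error("missing field: " <> name)
    }
  }
}

pub fn string(value: Value) -> Result(String, String) {
  case value {
    Text(s) -> Ok(s)
    Number(_) -> Error("expected a string")
  }
}

pub fn int(value: Value) -> Result(Int, String) {
  case value {
    Number(n) -> Ok(n)
    Text(_) -> Error("expected an int")
  }
}

pub fn decoder() -> Decoder(Person) {
  // The collapsed curried constructor, written by hand.
  succeed(fn(name) { fn(age) { fn(address) { Person(name, age, address) } } })
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
}

pub fn main() {
  let input = [
    #("name", Text("Ada")),
    #("age", Number(36)),
    #("address", Text("London")),
  ]
  case decoder()(input) {
    Ok(person) -> io.println(person.name <> ", " <> person.address)
    Error(reason) -> io.println("decode failed: " <> reason)
  }
}
```

The hand-written constructor has the same shape as the collapsed Erlang above: three nested one-argument functions and no redundant call in between.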

Edit: Fixed the code for the currying example.

timjs May 2, 2023

Sorry for being absent for so long. Replying to this was still on my list!

Great write up! Thank you very much @timjs !

You're very welcome :-)

One thing to consider which hasn't been touched on here is that one of our main goals is to have good interop with Erlang etc in both directions, so Gleam APIs should always be easy to use from those languages. The general rule is that the generated Gleam code should not be easily distinguishable from hand-written Erlang or Elixir.

This would be the first feature where the caller needs to do some extra contextual work in order to call the Gleam function, which would make it a lot harder to use without the Gleam compiler automatically doing this housekeeping, and it would very clearly not be normal Erlang code.

I think this feature exactly matches the needs for good interoperability with Erlang, as the resulting Erlang code for functions using auto currying would be the same. For example, the functions and_map and and_finally given above by @NthTensor would be compiled to exactly the same Erlang code: a function expecting two parameters of which the first is a function taking one argument and the second should be a Decoder.

So, the current Gleam code:

pub fn and_map(decode_f: Decoder(fn(a) -> b), decode_a: Decoder(a)) -> Decoder(b) {
  use f <- then(decode_f)
  use a <- then(decode_a)
  succeed(f(a))
}

now generates the following Erlang (generated with Gleam 0.28.3):

and_map(Decode_f, Decode_a) ->
    then(Decode_f, fun(F) -> then(Decode_a, fun(A) -> succeed(F(A)) end) end).

and this would not be different when changing the signature of and_map to fn and_map(decode_f: Decoder(fn(a, ..b) -> c), decode_a: Decoder(a)) -> Decoder(fn(..b) -> c)!

Only Gleam itself will make use of the special signature with auto currying when calling such a function from the Gleam side.

Going back to the implementation @hayleigh-dot-dev gave as motivation for this feature, currently one should write in Gleam:

fn decoder() -> Decoder(Person) {
  succeed(function.curry3(fn(name, age, address) { Person(name, age, address) }))
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
}

which generates the following Erlang code (also generated with Gleam 0.28.3):

decoder() ->
    _pipe = succeed(
        gleam@function:curry3(
            fun(Name, Age, Address) -> {person, Name, Age, Address} end 
        )
    ),
    _pipe@1 = and_map(_pipe, field(<<"name"/utf8>>, fun string/1)),
    _pipe@2 = and_map(_pipe@1, field(<<"age"/utf8>>, fun int/1)),
    and_map(_pipe@2, field(<<"address"/utf8>>, fun string/1)).

So the generated Erlang code would not be non-idiomatic because of auto currying. One could say it already is non-idiomatic because of using pipes and because of enforcing currying by the API. That is, because and_map wants its first parameter to be a function taking just one argument.

Using the auto currying feature and the API @NthTensor gave in the previous post, in Gleam, instead of manually adding the function.curry3, one would be allowed to write:

fn decoder() -> Decoder(Person) {
  succeed(Person)  // magic happens here!!
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_finally(field("address", string))
}

and Gleam would automagically insert the appropriate amount of curried function parameters. With the optimisation of folding partial and curried calls, the generated Erlang should change to:

-spec decoder() -> fun((binary()) -> {ok, person()} | {error, failure()}).
decoder() ->
    _pipe = succeed(
        fun(Name) -> fun(Age) -> fun(Address) ->  % magic happens here!!
            {person, Name, Age, Address}
        end end end
    ),
    _pipe@1 = and_map(_pipe, field(<<"name"/utf8>>, fun string/1)),
    _pipe@2 = and_map(_pipe@1, field(<<"age"/utf8>>, fun int/1)),
    and_finally(_pipe@2, field(<<"address"/utf8>>, fun string/1)).

which is hardly different from the previously generated code.

Btw. One can already write:

fn decoder() -> Decoder(Person) {
  succeed(function.curry3(Person))
  |> and_map(field("name", string))
  |> and_map(field("age", int))
  |> and_map(field("address", string))
}

NthTensor May 15, 2023

I've been playing around with this, and one thing that doesn't seem to have been talked about is how what we are calling "auto-currying" can be unexpectedly permanent when returning functions. The following example wraps fun with n arity-1 functions, but this is not super clear when n=2.

fn curry_n(num: Int, fun: fn(a, ..r) -> k) -> fn(a) -> fn(..r) -> k {
  case num {
    0 | 1 | 2 -> fun
    n -> fn(x) { curry_n(n - 1, fun(x)) }
  }
}

(This would probably raise a pretty confusing type error if fun has fewer than n arguments). Here's a simpler example

fn curry1(fun: fn(x, ..r) -> k) { fun }

fn add2(x, y) { x + y }

fn main() {
  curry1(add2)(1)(2)
}

Here curry1 causes add2 to be wrapped with curry_1_2 and then just passes the wrapped function back out. But judging from the body of curry1 you might not expect the output to be different from the input.
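
Written out in current Gleam, with the wrapper the proposal would insert made explicit, the call looks roughly like this. It is a sketch under assumptions: curry_1_2 is hand-written, and curry1's parameter is renamed f and given the plain one-argument type it would have after translation.

```gleam
import gleam/int
import gleam/io

fn add2(x: Int, y: Int) -> Int {
  x + y
}

// curry_1_2: the wrapper the proposal would insert at the call site.
fn curry_1_2(f: fn(a, b) -> c) -> fn(a) -> fn(b) -> c {
  fn(x) { fn(y) { f(x, y) } }
}

// After translation curry1 takes an ordinary one-argument function.
// Its body is still just the identity.
fn curry1(f: fn(x) -> k) -> fn(x) -> k {
  f
}

pub fn main() {
  // Hand-written form of `curry1(add2)(1)(2)`.
  let curried = curry1(curry_1_2(add2))
  io.println(int.to_string(curried(1)(2)))
}
```

All of the currying happens in the argument position of the call. Nothing in curry1's body hints that its result has a different shape from add2.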

This makes me wonder if type inference would allow us to replace curry1 with ident. If type inference can insert curry_n_m then you can arbitrarily curry any function at the call-site (rather than as a function parameter) with ident(f). Maybe we should just go for the curry keyword instead after all.

timjs Jun 2, 2023

Yes, that's a consequence of this approach 😞 You lose principal types, and a type annotation changes the meaning of a program. A solution could be, as you say, to add some keyword or sigil to curry a function.

han-tyumi · 2024-01-23T15:30:15Z

han-tyumi
Jan 23, 2024

ReScript recently released a major update which changes the language to be uncurried by default rather than curried.

They introduced similar syntax, add(5, ...), to allow partial application within this new default mode.
Thought it'd be worth noting as a reference.

https://rescript-lang.org/blog/uncurried-mode

1 reply

lpil Jan 23, 2024
Maintainer

Looks like they're using it for partial application rather than currying, but pretty close
