Extractor functions by nystrom · Pull Request #28 · RelationalAI-oss/Rematch.jl

nystrom · 2020-03-12T10:46:37Z

This PR implements extractor functions for Rematch.

Patterns can use extractor functions (also known as active patterns). These are just any function that takes a value to match and returns either nothing (indicating match failure) or a tuple that decomposes the value. The tuple is then matched against other patterns.

An extractor function must have a lowercase name to distinguish it from a struct name. [I'd like to relax this requirement, if possible, but we should discuss.]

An extractor function must take one argument--the value to be matched against--and should return either one value (for nullary and unary patterns), or a tuple of values (for 2+-ary patterns). Returning nothing indicates the extractor does not match.

For example to destruct an array into its head and tail:

function Cons(xs)
    if isempty(xs)
        nothing
    else
        ([xs[1], xs[2:end]])
    end
end

@match [1,2,3] begin
    ~Cons(x, xs) => @assert x == 1 && xs == [2,3]
end

The main code changes are in handle_destruct for the T_(subpatterns__) case.

cscherrer · 2020-04-20T19:53:40Z

This looks really nice @nystrom . Am I understanding right that the semantics are that

@match foo begin
    cons(x, xs) => bar
end

is equivalent to

@match cons(foo) begin
    (x, xs) => bar
end

so it seems the function is converted to its inverse? I haven't used active patterns much, is this a common way to write things?

nystrom · 2020-04-20T20:00:31Z

Those two cases are equivalent. Yes, the extractor is basically the inverse of the constructor.
But you can't just call the inverse function directly on the scrutinee after @match because you need to call different functions in different case arms. For example:

@match foo begin
   cons(x, cons(y, ys)) => foo
   cons(x, xs) => bar
   [] => baz
end

calls cons on foo in both the first and second patterns, but not in the third.

comnik

Sorry for being a bit skeptical at first, this does look really cool for some use cases. Like anything else, it can probably be overused, but looking at eval.jl I see the need clearer than before ;)

I don't get everything about the implementation, this is the first time I'm looking at the Rematch source. However no red flags jump out to me, apart from a few style guide violations.

Thanks @nystrom !

src/Rematch.jl

test/rematch.jl

src/Rematch.jl

nystrom · 2020-05-11T17:56:42Z

I was thinking that rather than using uppercase/lowercase to distinguish between structs and extractors, we could just prepend an operator like ~.

@match foo begin
   ~Foo(x,y,z) => ... # call extractor Foo
   Foo(x,y,z) => ... # match struct Foo
end

I'm converting this back to a draft PR.

…attern. This allows again struct names to be lowercase. Extractor functions can now take arguments.

test/rematch.jl

ghost · 2020-08-26T20:25:35Z

test/rematch.jl

+        @test (@match 3 begin
+           ~sub1(x) => x
+        end) == 4


I don't understand why this returns 4. I could understand if it returned 2 or 3, but i don't quite understand how it returns 4.

The definition of sub1(x) is sub1(x) = x+1. So I take @match 3 begin ~sub1(x) => x end to mean "if 3 matches sub1(x) for some x, return that x." And in this case, 3 would match sub1(2), so i'd expect it to return 2?

Also, i'm confused why x+1 is called sub1 - i'd have called it add1?

Ohhhhhhhh after reading some examples, i now understand that in the above match pattern, the x doesn't refer to the input, it refers to the output.

So ~sub1(x) means x == sub1(3), meaning x == 4.

This was clearer to me after reading the Cons example in the README.

Perhaps we can find a better syntax that doesn't look like a function application? This would also help with picking a disambiguating syntax from struct constructors.

What do you think about something like this?:

@test (@match 3 begin sub1(~)(x) => x end) == 4 @match [1,2,3] begin Cons(~)(x, xs) => @assert x == 1 && xs == [2,3] end

Where the ~ represents the object being matched on.

Or instead of the ~, maybe Cons(_)(x, xs)? Or Cons()(x, xs)? Or maybe something like Cons()->(x, xs), or some other similar but illegal syntax?

I don't understand why this returns 4. I could understand if it returned 2 or 3, but i don't quite understand how it returns 4.

The idea is that 3 == sub1(4). Don't think of sub1 as a normal function; it's basically the inverse of a constructor.
Maybe we can change the name to unapply_sub1 or something like that (in Scala extractor methods are all named unapply, the inverse of apply). This would avoid overloading issues if there's a unary constructor also.

Perhaps we can find a better syntax that doesn't look like a function application? This would also help with picking a disambiguating syntax from struct constructors.

What do you think about something like this?:

@test (@match 3 begin sub1(~)(x) => x end) == 4 @match [1,2,3] begin Cons(~)(x, xs) => @assert x == 1 && xs == [2,3] end

I don't see this as any better than ~Cons(x,xs).

Ah, but renaming the function to unapply_X breaks higher-order extractors like Re (see the tests and README).

Nevermind, easy to fix. Extractors are named unapply_Foo and the pattern is ~Foo. Pushed.

After using it a bit, I'm not really sure I like the unapply_ prefix (or any prefix). Best to just have the writer of the extractor function call it whatever they want, avoiding confusion about which function gets called.

I don't see this as any better than ~Cons(x,xs).

The reason I prefer Cons(~)(x,xs) is because it prevents parsing the above as a two argument function call to Cons, which is what it really looks like.

Instead, I want to somehow indicate that the syntax is matching on a two-argument destructured return value from calling Cons on the match variable. I was imagining the Cons(~) syntax to imply "calling Cons with the match variable", and then Cons(~)(x,xs) implying "the return value from that function can be captured by destructuring into two variables."

Another option might be something like (x,xs) = Cons(~)?

Basically, I just want to prevent the initial parse looking like the destructed output variables are actually the inputs to the function, since when they're both arity-1, as in the sub1 function, it was difficult to understand which variable is the input and which is the output.

Does that make sense? :)

This seems like a really cool feature! I'm sorry to hold it up bikeshedding. Please take this as a positive: this is really cool and I want to help make it intuitive and easy to use! :)

Sorry we left this PR lagging for forever.

Please feel free to just merge it now, and we can always change this in later iterations if we want to!

After rereading the above discussion, i would like to propose that latest suggestion one more time; i actually think it's fairly nice:

@test (@match 3 begin (x = sub1(~)) => x end) == 4 @match [1,2,3] begin ((x, xs) = Cons(~)) => @assert x == 1 && xs == [2,3] end

This syntax makes the behavior more obvious, IMO:

call the supplied function on the match variable, and then

do a capture match on the return value.

BUT it seems to me like you are implementing a well known feature in other pattern-matching systems, and I am not at all familiar with those, so please feel free to completely ignore my input. :)

I'd like to get out of the game of causing drama on this repo! ❤️ 😅

amirsh · 2021-09-02T18:59:58Z

@nystrom this is really nice :)
looking forward to see it merged :D

ghost

@nystrom - sorry for the long delay here. I really don't know much of anything about pattern matching, so it would probably be better to get someone else to review this PR!

@amirsh: Can you maybe review it? 😁 🙏 If you think it's useful, just approve it! We can always make more changes in the future. :) Sorry that i caused such a long hang up!!

ghost · 2021-09-02T22:11:20Z

test/rematch.jl

+        @test (@match 3 begin
+           ~sub1(x) => x
+        end) == 4


Sorry we left this PR lagging for forever.

Please feel free to just merge it now, and we can always change this in later iterations if we want to!

After rereading the above discussion, i would like to propose that latest suggestion one more time; i actually think it's fairly nice:

@test (@match 3 begin (x = sub1(~)) => x end) == 4 @match [1,2,3] begin ((x, xs) = Cons(~)) => @assert x == 1 && xs == [2,3] end

This syntax makes the behavior more obvious, IMO:

call the supplied function on the match variable, and then

do a capture match on the return value.

BUT it seems to me like you are implementing a well known feature in other pattern-matching systems, and I am not at all familiar with those, so please feel free to completely ignore my input. :)

I'd like to get out of the game of causing drama on this repo! ❤️ 😅

nystrom · 2021-09-03T09:10:43Z

Thanks @rai-nhdaly and @amirsh for the comments/review.

I haven't really been pushing this since we've basically worked around its absence for a while now, and I'm no longer sure I'll even use it since extractors will not work with the type dispatch macros I layered on top of Rematch in the compiler. I'd like to actually back-port the type dispatch code into Rematch but haven't had the time.

Sorry @rai-nhdaly I don't like your syntax proposal :-) It doesn't look much like pattern matching to me anymore. The whole idea is to make the pattern look like the code used to construct the scrutinee of the match, but with free variables for constructor arguments. At least that's what I'm used to in functional languages. I tried to stay as close to the Scala behavior as possible.

I'll merge this when I'm back from vacation.

amirsh · 2021-09-03T10:16:59Z

@NHDaly sure.

Regarding the syntax, I totally agree with @nystrom . The proposed syntax matches better with the one from Scala and F#.

One minor comment is that the last part of the README file is broken due to a missing ```.

Please let me know once the code is available for review.

nystrom · 2021-09-03T10:34:07Z

I fixed the missing quote, thanks.

ghost · 2021-09-03T15:32:59Z

Excellent. Like i said, i don't have much experience with those other systems, so i trust your gut - sorry for the drama. :)

Enjoy the vacation!! :)

nystrom added 10 commits February 7, 2020 12:43

Added support for extractor functions.

6c89eab

changed call to length > 0 to !isempty

453de9d

updated comments; allow splats in extractor matching

8d22d8a

whitespace. red-black trees test

f482fe0

fixed red-black tree test size

9a6094b

comments

edb44cc

add .julia to gitignore

6e37f0c

expand tabs in README

6e9c9b8

changed isnothing to === nothing

d5a1e2c

@eval types in tests (0.7 compat issue)

24f9fde

nystrom requested review from mbravenboer and removed request for mbravenboer April 20, 2020 17:35

comnik approved these changes Apr 20, 2020

View reviewed changes

src/Rematch.jl Outdated Show resolved Hide resolved

test/rematch.jl Outdated Show resolved Hide resolved

test/rematch.jl Outdated Show resolved Hide resolved

src/Rematch.jl Outdated Show resolved Hide resolved

src/Rematch.jl Outdated Show resolved Hide resolved

nystrom added 2 commits April 21, 2020 10:23

style fixes, mostly line lengths

19bb13c

added docstrings and more style fixes

e696940

nystrom marked this pull request as draft May 11, 2020 17:56

Changed syntax of extractor patterns to require a ~ in front of the p…

d2ea77d

…attern. This allows again struct names to be lowercase. Extractor functions can now take arguments.

nystrom marked this pull request as ready for review May 11, 2020 19:21

nystrom requested a review from a user August 21, 2020 19:26

ghost reviewed Aug 26, 2020

View reviewed changes

test/rematch.jl Outdated Show resolved Hide resolved

ghost reviewed Aug 26, 2020

View reviewed changes

nystrom added 6 commits August 27, 2020 12:42

some extractor cleanup: should always return tuples or nothing

f9916ec

Extractor functions must be named unapply_X.

cddbfdc

remove dead function

a8c3cb8

fixed documentation of extractors

036dd65

removed unused __module__ param from functions

8b0e924

documentation cleanup

e1e5256

ghost reviewed Sep 2, 2021

View reviewed changes

Fix missing close quote

e6d52bb

nystrom mentioned this pull request Sep 27, 2023

Support ML-style active patterns JuliaServices/Match.jl#92

Closed

Conversation

nystrom commented Mar 12, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cscherrer commented Apr 20, 2020

Uh oh!

nystrom commented Apr 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

comnik left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nystrom commented May 11, 2020

Uh oh!

Uh oh!

ghost Aug 26, 2020 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ghost Aug 26, 2020

Choose a reason for hiding this comment

Uh oh!

nystrom Aug 27, 2020

Choose a reason for hiding this comment

Uh oh!

nystrom Aug 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nystrom Aug 27, 2020

Choose a reason for hiding this comment

Uh oh!

nystrom Aug 27, 2020

Choose a reason for hiding this comment

Uh oh!

ghost Aug 28, 2020

Choose a reason for hiding this comment

Uh oh!

ghost Sep 2, 2021

Choose a reason for hiding this comment

Uh oh!

amirsh commented Sep 2, 2021

Uh oh!

ghost left a comment

Choose a reason for hiding this comment

Uh oh!

ghost Sep 2, 2021

Choose a reason for hiding this comment

Uh oh!

nystrom commented Sep 3, 2021

Uh oh!

amirsh commented Sep 3, 2021

Uh oh!

nystrom commented Sep 3, 2021

Uh oh!

ghost commented Sep 3, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

nystrom commented Mar 12, 2020 •

edited

Loading

nystrom commented Apr 20, 2020 •

edited

Loading

ghost Aug 26, 2020 •

edited by ghost

Loading

nystrom Aug 27, 2020 •

edited

Loading