Fix name duplication in code generation for new definition typing #197

chengluyu · 2023-12-30T07:05:46Z

We might notice a strange behavior of code generation: if we declare a symbol x and then define it, the generated JavaScript function name will have a numeric postfix x1, which indicates that some other x was shadowed. But this is the very first time where x is defined in this file.

diff --git a/shared/src/test/diff/nu/CtorSubtraction.mls b/shared/src/test/diff/nu/CtorSubtraction.mls
--- a/shared/src/test/diff/nu/CtorSubtraction.mls
+++ b/shared/src/test/diff/nu/CtorSubtraction.mls
@@ -9,11 +9,11 @@
 fun x: ('a & Object) -> ('a \ Cls)
 fun x = case
   Cls then error
   y then y
 //│ fun x: forall 'b. (Cls | Object & 'b & ~#Cls) -> 'b
 //│ fun x: forall 'a. (Object & 'a) -> ('a & ~Cls)
 
 x : Int -> Int
 //│ Int -> Int
 //│ res
-//│     = [Function: x1] // <---- Oops?
+//│     = [Function: x]

Well, this is caused by an imperfect workaround for translating classes in new definition typing. Consider the following code.

let f x = x + 1
class Foo(value: Int) {
  let y = f of value
}

Code generation lifts class definitions (e.g., Foo) and translates them into a typing unit class before translating top-level let bindings (e.g., f). So, the Scope should contains some sort of stub symbols of top-level let bindings before translating classes. The current implementation does not handle this well, as it just declares ValueSymbol in the first place.

mlscript/shared/src/main/scala/mlscript/JSBackend.scala

Lines 1404 to 1412 in 231d9ec

    
           otherStmts.foreach { 
        
             case fd @ NuFunDef(isLetRec, Var(nme), symNme, _, L(body)) => 
        
               val isByname = isLetRec.isEmpty 
        
               val isByvalueRecIn = if (isByname) None else Some(true) 
        
               val bodyIsLam = body match { case _: Lam => true case _ => false } 
        
               val symb = symNme.map(_.name) 
        
               scope.declareValue(nme, isByvalueRecIn, bodyIsLam, symb) 
        
             case _ => () 
        
           }

Then, after the translation of the typing unit class, when translating the top-level let bindings, the symbols will be declared twice and the runtime will be renewed, thus the runtime name comes with a numeric postfix. This sometimes cause runtime errors of undefined references. For example,

fun foo: Int -> Int
fun foo = x => x + 1
class Bar {
  fun calc(x) = foo(x)
}
//│ fun foo: Int -> Int
//│ class Bar {
//│   constructor()
//│   fun calc: Int -> Int
//│ }
//│ fun foo: Int -> Int

foo
//│ Int -> Int
//│ res
//│     = [Function: foo1] <--- Note that `foo` has runtime name `foo1`

:re
(new Bar()).calc(0)
//│ Int
//│ res
//│ Runtime error:
//│   ReferenceError: foo is not defined <--- But the class thinks `foo` is `foo`.

This PR fixes this problem by adding a Boolean field forNewDefsDryRun to ValueSymbol, which indicates if this symbol is declare in the situation as described above. If new symbols are shadowing this symbol, we will reuse the runtime name rather than allocating new names.

Please `git cherrypick` this to a separate PR.

NeilKleistGao · 2023-12-30T15:33:32Z

Related PRs: #167, #186

NeilKleistGao

LGTM. Thanks for the better solution. :)

shared/src/main/scala/mlscript/codegen/Scope.scala

LPTK

Ok, this looks like a very hacky solution on top of an extremely hacky foundation.

Thankfully, we'll replace this hacky stuff soon once the new symbols are implemented. So let's just merged the hack.

BTW, does it work in situations like the following?

fun foo: Int -> Int
fun foo = x => x + 1
class Bar { fun calc(x) = foo(x) }

class Bar { fun calc(x) = foo }
fun foo: Int
fun foo = 123

chengluyu · 2024-01-01T16:32:21Z

I agree. I think the original approach (declare top-level let bindings, and unregister them if there's any error) is imperfect and hacky. So, this is actually a hacky fix to address the imperfectness of previous approach and will be soon removed by the help of PreTyper.

I will try your example and add it to tests.

chengluyu · 2024-01-01T19:53:20Z

The test cases you suggested work.

Fix name duplication in code generation for new definition typing

7988884

Please `git cherrypick` this to a separate PR.

chengluyu requested a review from NeilKleistGao December 30, 2023 07:05

chengluyu added the js-code-gen label Dec 30, 2023

NeilKleistGao reviewed Dec 30, 2023

View reviewed changes

LPTK reviewed Jan 1, 2024

View reviewed changes

shared/src/main/scala/mlscript/codegen/Scope.scala Show resolved Hide resolved

LPTK approved these changes Jan 1, 2024

View reviewed changes

Add suggested test cases

5bf69fa

chengluyu added 3 commits January 2, 2024 03:55

Show generated JavaScript in examples from the previous commit

234a4d5

Explain what does "dry run" mean in ValueSymbol's new parameter

f1044e0

Add link to this PR to the explanation

8a519ea

LPTK merged commit 0049da9 into hkust-taco:new-definition-typing Jan 2, 2024
1 check passed

LPTK deleted the codegen/wasted-runtime-names branch January 2, 2024 05:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix name duplication in code generation for new definition typing #197

Fix name duplication in code generation for new definition typing #197

chengluyu commented Dec 30, 2023 •

edited

Loading

NeilKleistGao commented Dec 30, 2023

NeilKleistGao left a comment

LPTK left a comment

chengluyu commented Jan 1, 2024

chengluyu commented Jan 1, 2024

	otherStmts.foreach {
	case fd @ NuFunDef(isLetRec, Var(nme), symNme, _, L(body)) =>
	val isByname = isLetRec.isEmpty
	val isByvalueRecIn = if (isByname) None else Some(true)
	val bodyIsLam = body match { case _: Lam => true case _ => false }
	val symb = symNme.map(_.name)
	scope.declareValue(nme, isByvalueRecIn, bodyIsLam, symb)
	case _ => ()
	}

Fix name duplication in code generation for new definition typing #197

Fix name duplication in code generation for new definition typing #197

Conversation

chengluyu commented Dec 30, 2023 • edited Loading

NeilKleistGao commented Dec 30, 2023

NeilKleistGao left a comment

Choose a reason for hiding this comment

LPTK left a comment

Choose a reason for hiding this comment

chengluyu commented Jan 1, 2024

chengluyu commented Jan 1, 2024

chengluyu commented Dec 30, 2023 •

edited

Loading