MoTion is a new object pattern matcher in Pharo. A pattern matcher works on a finit set of objects that we will call a model. Examples of models are: the Pharo AST of a method, the DOM of an XML document, the objects loaded from a JSON file,. . . MoTion can deal with Pharo objects independently of the model containing the data. MoTion combines both features for graph pattern matching and object matching, and by that it enables expressing patterns declaratively and applying matches over complex object structures.
To install MoTion, go to the Playground (Ctrl+OW
) in your Pharo image and execute the following Metacello script (select it and press Do-it button or Ctrl+D
):
Metacello new
baseline: 'MoTion';
repository: 'github://moosetechnology/MoTion:main';
load.
Multiple operators are used to help developers creating MoTion patterns.
- Literals are declared the way they are in Pharo like ’A sample text here’ asMatcher and 1 asMatcher. Literal patterns match exactly their literal value. This is useful for specifying the value that a property of an object must have.
- The “<=>” operator is a sort of generic matcher. It tries to match its left hand side (lhs) with the pattern on its right hand side (rhs). As noted before, it is a polymorphic operator depending on the lhs. If the lhs is an object, it tries to match this object with the rhs. If the lhs is a collection, it tries to match any element of the collection with the rhs.
- To define an object pattern, one specifies its type using the class name followed by the “% {}” operator like in: ClassA % {} . The class expected comes before “% {}” and property values can be specified between the curly braces (see below).
- A similar operator: “%% {}” is used to match a class or any of its subclasses.
- These two operators can express sub-patterns on the properties of the matched object. They are specified between the curly braces. Object properties are instance variable accessors. The curly braces act as a conjunction of sub-patterns specifying the value a property should match. It can be seen as a Logical matcher.
ClassA % {
#’property1’ <=> aValue1.
#’property2’ <=> aValue2.
}
This pattern matches an object of class ClassA, with a property property1 having the value aValue1 and property2 having the value aValue2.
The sub-patterns could also be more complex (see bellow, Structural pattern).
This mechanism contributes to the seamless addition of various properties, in a declarative way.
6. The “% {}” operator, combined with the “<=>” operator, also allows to express Structural pattern where a first object is matched, then a second object in one of the properties of the first is matched and we express a sub-pattern on this second object:
ClassA % {
#’property1’ <=> aValue1.
#’property2’ <=> ClassB %% {
#’property3’ <=> aValue3.
}
}
This pattern matches an instance of ClassA with aValue1 in its property1, and an instance of ClassB in its property2. This second object must have aValue3 in its property3.
- Negation in pattern matching is handle with the “<∼=>” operator. It specifies that the lhs should not match the rhs pattern.
- Non-Linear pattern is obtain using the “@” operator followed by a name (for example: @x). This allows to store a matched object in the “variable” to reuse it somewhere else in the pattern.
- Wildcard (“_”) can be used to indicate a property whose name is not known, when one only cares for its value:
ClassA % {
#_ <=> aValue.
}
This pattern matches an instance of ClassA with an unnamed property matching the value aValue.
- The “>” operators implements Path traversal by allowing to “chain” multiple properties in a pattern. Such paths help reducing complex patterns expression, by accessing a chain of objects and their properties:
ClassA % {
#’property1>property2’ <=> aValue.
}
This pattern first match an instance of ClassA, then it takes the object in its property1 and the value in property2 of this second object. This value should match aValue. This notation allows to express in a very concise way a path in a graph of objects. Note that this operator is also polymorphic. Similarly to “<=>”, if one of the objects in the path is a collection, the operator will look for an element of this collection that allows to continue the search, that is to say that has a property matching the remaining part of the pattern.
- MoTion allows to perform Recursive traversal through a “*” operator combined with the Path traversal operator “>”. In a chain of objects, one may know the initial property and the final one, but not know how long the chain of objects is.
ClassA % {
#’property1>repeatedProp*’ <=> aValue.
}
This pattern will match first an instance of ClassA, then the object in its property property1 then it will match a chain of objects all having a property repeatedProp and one of them containing the value aValue. The match ends on this last object.
- The “*” operator may also be combined with a wildcard (“_”).
ClassA % {
#’property1>_*>propN’ <=> aValue.
}
This pattern will match first an instance of ClassA, then the object in its property property1 then it will match a chain of objects with unknown properties ending with an object having a property propN with value aValue.
- It is possible to match Complex lists using the “{}” list operator and declaring how the list should look like. Note that this is not the same operator as “% {}” (see above). This operator allows to express that given elements in a list should match specific patterns. {#’@x’. #’@x’} This pattern matches a list containing exactly two elements that are the same (use of a named variable).
- The repetition operator (“”) may also be used in a list to indicate an unspecified number of elements. {#’@x’. #’_’. #’@x’} This pattern, matches a list with first and last elements equals and of unspecified length (obviously at least 2). To express that one element is part of a collection, MoTion offers a shortcut. To check if the value 5 is part of a collection (contained in the property someProperty of an instance of ClassA) one can use the pattern:
ClassA % {
#someProperty <=> {#’*s1’. 5. #’*s2’}
}
But the same can be expressed with a shortcut:
ClassA % {
#someProperty<=> 5
}
Note that this could also match an instance of ClassA with a property someProperty that matches exactly the value 5 (with no collection).
- Finally there is another operator for Logical matcher: orMatches:. It allows to express a disjunction of two patterns (one or the other match). (Remember that “% {}” implements a conjunction of patterns within the curly braces.)
ClassA % {
#someProperty <=> (5 orMatches: 6)
}
This pattern matches an instance of ClassA with a property someProperty matching the value 5 or the value 6.
- One gets a “matcher” by calling the asMatcher method. “1 asMatcher” creates a matcher that only matches the value “1”.
- A matcher as a match: method that allows it to try to match the argument.
pattern := #’@foo’ asMatcher.
results := pattern match: ’text’ .
result isMatch.
This creates a matcher than matches anything (and associates it with the “foo” symbol) and runs it on the string ’text’. The last line will answer true as the match was successful. The result of match: is a MatchingResult. As we just saw, it includes a boolean property isMatch indicating whether the match was successful or not. It also has a property, and matchingContexts which is a collection of MatchingContext objects. Each of these contexts includes again a boolean field isMatch and a dictionary of its bindings. To get the binding of foo in the the small example above, one would do:
results matchingContexts first bindings
at: ’foo’.
This will return the string ’text’. Bindings can also be created with the as: method. It is used to bind the result of a pattern that will be kept in the result’s bindings. Finally to simplify getting the result of the bindings one is mostly interested in, there is a method collectBindings: that accepts a collection of (interesting) keys as parameter, and returns their values matched by a pattern. In case there is no match, the return is an empty collection.
pattern := #’@foo’ asMatcher.
results := pattern collectBindings: {#foo } for: ’text’ .
This puts in results a collection of dictionaries (here there is only one) with the binding for the #foo symbol. The result is a collection because there could be several matchings (for example with a disjunction operator). The collection holds dictionaries because we could ask for several bindings in the first parameter of the method.
- Finding superInheritances for 'Remote' Inteface in Java model:
pattern := FamixJavaModel % {
#'allTypes>entities' <=>
FamixJavaInterface % {
#'superInheritances>superclass>name'
<=> 'Remote' .
#'isStub' <~=> true.
} as: 'foundInterface'.
}.
pattern match: aFamixJavaModel.
- Looking for methods where get.config(akey) is invoked
pattern := FASTJavaMethodEntity % {
#'children*' <=> FASTJavaMethodInvocation % {
#'receiver>name' <=> #'config'.
#name <=> #get.
#'arguments>primitiveValue' <=> aKey.
} as: #configInvocation
}.
pattern match: aFASTJavaMethodEntity.
- Decomposing the same pattern for reusage of small patterns
childrenPath := #'children*'.
receiverNamePath := #'receiver>name'.
argsVal := #'arguments>primitiveValue'.
subPattern := FASTJavaMethodInvocation % {
receiverNamePath <=> #'config'.
#name <=> #get.
argsVal <=> aKey.
} as: #configInvocation.
pattern := FASTJavaMethodEntity % {
childrenPath <=> subPattern.
}.
pattern match: aFASTJavaMethodEntity.
- MoTion matching Pharo package
pattern := RPackage % {
#name <=> #MoTion.
#'definedClasses>methodDict' <=> CompiledMethod % {
#'ast>allChildren' <=> RBMessageNode % {
#'selector>value' <=> #ifTrue:ifFalse:
} as: #Node
} as: #Method.
} as: #Package.
pattern collectBindings: {#Node. #Method. #Package.} for: (#MoTion asPackage).
- MoTion matching XML node list
pattern := XMLNodeList % {
#'_' <=> XMLElement % {
#'name' <=> #'div' .
} as: 'XMLElement'.
}.
pattern match: anXMLNodeList.
Don't hesitate to ask. More examples can be found in tests package. Also, don't hesitate to have a look at Iguala which is the same matcher implemented in Python: https://github.com/aranega/iguala Finally, well, just enjoy it :)