67
The Expression Problem Prof. Dr. Ralf Lämmel Universität Koblenz-Landau Software Languages Team

The Expression Problem (as part of the the PTT lecture)

Embed Size (px)

Citation preview

Page 1: The Expression Problem (as part of the the PTT lecture)

The Expression ProblemProf. Dr. Ralf Lämmel

Universität Koblenz-LandauSoftware Languages Team

Page 2: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Elevator speech

Suppose you have some data variants (say “apples” and “oranges”) and you have some operations on such data (say “drink” and “squeeze”), how would you go about the design of data and operations so that you can hopefully add new data variants and new operations later on?

21

Page 3: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Expression problem: Why the name?

• Consider such data:– Expressions as in programming languages:

– Literals– Addition– …

• Consider such operations:– Pretty print expressions– Evaluate expressions– …

• Consider such extension scenarios:– Add another expression form

• Cases for all existing operations must be added.– Add another operation

• Cases for all existing expression forms must be added.

22

The name goes back to Phil Wadler who defined the expression problem in an email sent to mailing lists and individuals in November 1998.

Page 4: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Expression problem: What problem?

• In basic OO programming:– It is easy to add new data variants.

– It is hard to add new operations.

23

Page 5: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Data extensibility based on simple inheritance

• class Expr: The base class of all expression forms

• Virtual methods:

•method prettyPrint: operation for pretty-printing

•method evaluate: operation for expression evaluation

• Subclasses:

• class Lit: The expression form of "literals"

• class Add: The expression form of "addition"

• ... Data extensibility

24

Page 6: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The base class of all expression forms */public abstract class Expr { /* * Operation for pretty printing */ public abstract String prettyPrint(); /* * Operation for expression evaluation */ public abstract int evaluate();}

Beware of code bloat!

25

The class rooting all data variants

Page 7: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "literals" (i.e., constants) */public class Lit extends Expr {

private int info; public int getInfo() { return info; } public void setInfo(int info) { this.info = info; } public String prettyPrint() { return Integer.toString(getInfo()); } public int evaluate() { return getInfo(); }}

Beware of code bloat!

26

A class for a data variant

Page 8: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "addition" */public class Add extends Expr { private Expr left, right; public Expr getLeft() { return left; } public void setLeft(Expr left) { this.left = left; } public Expr getRight() { return right; } public void setRight(Expr right) { this.right = right; } public String prettyPrint() { ... } public int evaluate() { return getLeft().evaluate() + getRight().evaluate(); }}

That is, another data variant but with the same operations.

Beware of code bloat!

27

Page 9: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "negation" */public class Neg extends Expr { private Expr expr; public Expr getExpr() { return expr; } public void setExpr(Expr expr) { this.expr = expr; } public String prettyPrint() { return "-(" + getExpr().prettyPrint() + ")"; } public int evaluate() { return - getExpr().evaluate(); }}

That is, yet another data variant but, again, with the same operations.

Beware of code bloat!

28

Page 10: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Expression problem: What problem?

• In basic OO programming:– It is easy to add new data variants.

– It is hard to add new operations.

• In OO programming with instance-of / cast:– It is easy to add new operations.

– It is easy to add new data variants.

29

But beware of

programming errors

Page 11: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Full (?) extensibility based on instance-of tests and type casts

• We use instanceof tests and casts to dispatch functionality on data.

• We use a specific exception and try-catch blocks to extend operations.

• We encapsulate operations in objects so that extensions are self-aware.

• This is approach is weakly typed.

• In particular, there is no guarantee that we have covered all cases.

30

Page 12: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

public abstract class Expr { }

public class Lit extends Expr { private int info; public int getInfo() { return info; } public void setInfo(int info) { this.info = info; }}

public class Add extends Expr { private Expr left, right; public Expr getLeft() { return left; } public void setLeft(Expr left) { this.left = left; } public Expr getRight() { return right; } public void setRight(Expr right) { this.right = right; }}

Domain classes are nothing but data

capsules.

31

Page 13: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

public class EvaluatorBase {

public int evaluate(Expr e) { if (e instanceof Lit) { Lit l = (Lit)e; return l.getInfo(); } if (e instanceof Add) { Add a = (Add)e; return evaluate(a.getLeft()) + evaluate(a.getRight()); } throw new FallThrouhException(); }}

We simulate pattern matching by instanceof

and cast.

We anticipate failure of such operations as a means to compose

them.

32

Page 14: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

public class EvaluatorExtension extends EvaluatorBase {

public int evaluate(Expr e) { try { return super.evaluate(e); } catch (FallThrouhException x) { if (e instanceof Neg) { Neg n = (Neg)e; return -evaluate(n.getExpr()); } throw new FallThrouhException(); } }} Recursive calls are

properly dispatched through “this” to the extended operation.

An extended operation first attempts the basic

implementation.

33

Page 15: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Expression problem: What problem?

• In basic OO programming:– It is easy to add new data variants.

– It is hard to add new operations.

• In OO programming with visitors:– It is easy to add new operations.

– It is hard to add new data variants.

• In OO programming with instance-of / cast:– It is easy to add new operations.

– It is easy to add new data variants.

34

Forthcoming!

Page 16: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Operation extensibility based on visitorsPREVIEW

• class Expr: The base class of all expression forms

• Subclasses as before.

• Virtual methods:

• Accept a visitor (“apply an operation”)

• Visitors:

• class PrettyPrinter: operation for pretty-printing

• class Evaluator: operation for expression evaluation

• ... Operation extensibility

35

Page 17: The Expression Problem (as part of the the PTT lecture)

The Visitor Pattern

Page 18: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

A book recommendation

Page 19: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Elevator speechQ: How can we add functionality to a given (perhaps even closed) class hierarchy such that behavior can vary across concrete classes?

A: Represent an operation to be performed on the elements of an object structure in a class separate from the given classes.

Page 20: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Adding an operation to lists

interface List {} class Nil implements List {} class Cons implements List {

int head; List tail;

}

Now suppose we need functionality to compute the sum of a list.

Page 21: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

1st attempt: instance-of and type cast

List l = ...;int sum = 0; // contains the sum after the loopboolean proceed = true; while (proceed) { if (l instanceof Nil) proceed = false; else if (l instanceof Cons) { sum = sum + ((Cons) l).head; // Type cast!

l = ((Cons) l).tail; // Type cast! }

}

What are theproblems here?

Page 22: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

1st attempt: instance-of and type cast

Casts are evil!

Requirement: static type checking

http://en.wikipedia.org/wiki/File:Codex_Gigas_devil.jpg

Page 23: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

2nd attempt: virtual methods

What are theproblems here?

interface List { int sum(); } class Nil implements List { public int sum() { return 0; } } class Cons implements List { int head; List tail; public int sum() { return head + tail.sum(); }

}

Page 24: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

2nd attempt: virtual methods

Patching is evil!

Requirement: operation extensibility

http://en.wikipedia.org/wiki/File:Codex_Gigas_devil.jpg

Page 25: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

3rd attempt: the visitor pattern

Part I:An interface for operations

interface Visitor { void visitNil(Nil x); void visitCons(Cons x); }

Page 26: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

3rd attempt: the visitor pattern

Part II:Classes that accept visitorsinterface List {

void accept(Visitor v); }class Nil implements List { public void accept(Visitor v) { v.visitNil(this); }

} class Cons implements List { int head; List tail; public void accept(Visitor v) { v.visitCons(this);

} }

Page 27: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

3rd attempt: the visitor pattern

Part III:A concrete visitor

class SumVisitor implements Visitor { int sum = 0; public void visitNil(Nil x) {} public void visitCons(Cons x){ sum = sum + x.head; x.tail.accept(this); }}

Page 28: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

3rd attempt: the visitor pattern

Demo

SumVisitor sv = new SumVisitor();

l.accept(sv);

System.out.println(sv.sum);

Page 29: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Summary so far

Frequenttype casts?

Frequentrecompilation?

Instanceof and type casts Yes No

Dedicated methods No Yes

The Visitor pattern No No

What if we need a new class?

Page 30: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Summary so far

• Class hierarchy implements virtual accept.

• A Visitor object has a visit method for each class.

• Double dispatch:

• accept method takes a specific Visitor object.

• accept redirects to class-specific visit method.

Two levels of polymorphism

Page 31: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

The Visitor pattern:generic solution

http://en.wikipedia.org/wiki/File:VisitorClassDiagram.svg

Page 32: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

ApplicabilityAn object structure contains different classes of objects with different interfaces and operations on these objects depend on their concrete classes.

Many unrelated operations need to be performed on the objects and you want to keep related operations together.

The classes defining the object structure rarely change, but you want to define new operations over the structure. (If the object structure classes change often, it is better to define the operations in those classes.)

Page 33: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Operation extensibility based on visitors

• class Expr: The base class of all expression forms

• Subclasses as before.

• Virtual methods:

• Accept a visitor (“apply an operation”)

• Visitors:

• class PrettyPrinter: operation for pretty-printing

• class Evaluator: operation for expression evaluation

• ... Operation extensibility

52

Page 34: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The base class of all expression forms */public abstract class Expr { /* * Accept a visitor (i.e., apply an operation) */ public abstract <R> R accept(Visitor<R> v);}

Domain operations are no longer hosted in

domain classes.

53

Page 35: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel54

Intermezzo:visitor types

• Returning visitors

• Visit methods have a return type.

• Void visitors

• Visit methods are void.

In principle, void visitors are sufficient because one can always (and often has to) aggregate the result through state which is

ultimately to be returned through a designated getter.

Page 36: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * A concrete visitor describe a concrete operation on expressions. * There is one visit method per type in the class hierarchy. */public abstract class Visitor<R> { public abstract R visit(Lit x); public abstract R visit(Add x); public abstract R visit(Neg x);}

Requires folklore knowledge on design

patterns.

55

Page 37: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "literals" (i.e., constants) */public class Lit extends Expr { private int info; public int getInfo() { return info; } public void setInfo(int info) { this.info = info; } public <R> R accept(Visitor<R> v) { return v.visit(this); }}

One visitor-enabled data variant.

56

Page 38: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "addition" */public class Add extends Expr { private Expr left, right; public Expr getLeft() { return left; } public void setLeft(Expr left) { this.left = left; } public Expr getRight() { return right; } public void setRight(Expr right) { this.right = right; } public <R> R accept(Visitor<R> v) { return v.visit(this); }}

Another visitor-enabled data variant.

Beware of code bloat!

57

Page 39: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * The expression form of "negation" */public class Neg extends Expr { private Expr expr; public Expr getExpr() { return expr; } public void setExpr(Expr expr) { this.expr = expr; }

public <R> R accept(Visitor<R> v) { return v.visit(this); }}

Beware of code bloat!

Yet another visitor-enabled data variant.

58

Page 40: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

/** * Operation for pretty printing */public class PrettyPrinter extends Visitor<String> { public String visit(Lit x) { return Integer.toString(x.getInfo()); } public String visit(Add x) { return x.getLeft().accept(this) + " + " + x.getRight().accept(this); } public String visit(Neg x) { return "- (" + x.getExpr().accept(this) + ")"; }}

Beware of code bloat!

An operation represented as a concrete visitor.

59

Page 41: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Beware of code bloat!

Another operation represented as a concrete visitor.

/** * Operation for expression evaluation */public class Evaluator extends Visitor<Integer> { public Integer visit(Lit x) { return x.getInfo(); } public Integer visit(Add x) { return x.getLeft().accept(this) + x.getRight().accept(this); } public Integer visit(Neg x) { return - x.getExpr().accept(this); }}

60

Page 42: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

101implementations

javaComposition

javaStatic

javaInheritance

javaVisitor

Page 43: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Summary

Try to modularize (say, separate concerns).Try to understand extensibility requirements.Avoid modularity exorcism.Take refactoring and testing very seriously.

Please don’t print the slides, or print resource-friendly.

Page 44: The Expression Problem (as part of the the PTT lecture)

Everything beyond this point is “for your

information” only.

Page 45: The Expression Problem (as part of the the PTT lecture)

The Expression Problem Cont’d

Page 46: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

The notion of extensibility

• The initial system I: • Provides operations o1, …, om.

• Handles data variants d1, …, dn.

• Consider any number of extensions e1, …, ek:– ei could be a data extension:

• Introduce a new data variant.• Provide cases for all existing operations.

– ei could be an operation extension:

• Introduce a new operation.• Provide cases for all existing data variants.

– Any extension can be factored into such primitive ones.

• The fully extended system ek ( … (e1 (I)))65

Page 47: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

More on extensibility

• Code-level extensibility:

– Extensions can be modeled as code units.

– Existing code units are never edited or cloned.

• Separate compilation:

– The extensions are compiled separately.

• Statically type-checked extensibility:

– The extension are statically type-checked separately.

66

Page 48: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Beyond Java

• Functions and pattern matching à la SML, Haskell

•Multi-file predicates in Prolog

• Partial classes in C#

• ...

67

Page 49: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

A C# approach to extensibility

• Use the partial class mechanism of C#

• A class can be “declared” many times.

• The list of members is accumulated.

68

Page 50: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

The data slice

69

public abstract partial class Expr { }

public partial class Lit : Expr { public int info; }

public partial class Add : Expr { public Expr left, right; }

Page 51: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

The evaluation slice

70

public abstract partial class Expr { public abstract int Evaluate(); } public partial class Lit { public override int Evaluate() { return info; } } public partial class Add { public override int Evaluate(){ return left.Evaluate() + right.Evaluate(); } }

Page 52: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

The evaluation slice

71

public abstract partial class Expr { public abstract string PrettyPrint(); } public partial class Lit { public override string PrettyPrint() { return info.ToString(); } } public partial class Add { public override string PrettyPrint() { return left.PrettyPrint() + " + " + right.PrettyPrint(); } }

Page 53: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

A data extension

72

public partial class Neg : Expr { public Expr expr; } public partial class Neg { public override string PrettyPrint() { return "- (" + expr.PrettyPrint() + ")"; } } public partial class Neg { public override int Evaluate() { return -expr.Evaluate(); } }

Page 54: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Discussion of the C# approach

• Code extensibility only

• All extension need to be compiled together.

• Slices are not even checked independently.

73

Page 55: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

A Haskell approach to operation extensibility

• data type Expr: The union of all expression forms

• Public constructors:

• Lit: The expression form of "literals"

• Add: The expression form of "addition"

• Functions defined by pattern matching on Expr:

• function prettyPrint: operation for pretty-printing

• function evaluate: operation for expression evaluation

• ... Operation extensibility

74

Page 56: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Algebraic data types of Haskell are closed!

module Data where

-- A data type of expression forms

data Exp = Lit Int | Add Exp Exp

75

Algebraic data type

Constructor Constructorcomponent

Constructor

2 constructor components

Page 57: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

module PrettyPrinter where

import Data

-- Operation for pretty printing

prettyPrint :: Exp -> IO ()prettyPrint (Lit i) = putStr (show i)prettyPrint (Add l r) = do prettyPrint l; putStr " + "; prettyPrint r

A new module can be designated to each

new operation.

76

Operation defined bycase discrimination

Argument type

Result type “side effect”

Page 58: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Another operation;another module.

module Evaluator where

import Data

-- Operation for expression evaluation

evaluate :: Exp -> Intevaluate (Lit i) = ievaluate (Add l r) = evaluate l + evaluate r

77

Page 59: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Discussion of the Haskell approach

• Lack of encapsulation?

•Only operation extensibility

78

Page 60: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

A Prolog approach to extensibility

•Define operations as predicates.

•Model data variants through functors.

• Achieve extensibility through “multi-file” predicates.

79

Page 61: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

The initial program

80

:- multifile prettyPrint/2.

prettyPrint(lit(I),S) :- string_to_atom(S,I).

prettyPrint(add(L,R),S) :- prettyPrint(L,S1), prettyPrint(R,S2), format(string(S),"~s + ~s", [S1,S2]).

Page 62: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

A data extension

81

:- multifile prettyPrint/2.:- multifile evaluate/2.

prettyPrint(neg(E),S) :- prettyPrint(E,S1), format(string(S),"- (~s)",[S1]).

evaluate(neg(E),I) :- evaluate(E,I1), I is - I1.

Page 63: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

An operation extension

82

:- multifile evaluate/2.

evaluate(lit(I),I).

evaluate(add(L,R),I) :- evaluate(L,I1), evaluate(R,I2), I is I1 + I2.

Page 64: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Program composition

83

:- [ 'InitialProgram.pro' , 'OperationExtension.pro' , 'DataExtension.pro' ].

Page 65: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Discussion of the Prolog approach

• Full extensibility

•No descriptive overhead

• Lack of static typing

• Performance penalty

84

Page 66: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Ralf Lämmel

Solutions of the expression problem

•Open classes in more dynamic languages (Smalltalk, etc.) • Extensible predicates in Prolog (code-level extensibility only)• Java, C# & Co.• Clever encodings (Torgersen, ECOOP 2004)• AOP-like Open classes (introductions)• .NET-like partial classes (code-level extensibility only)• Expanders (Warth et al., OOPSLA 2006)• JavaGI (Wehr et al., ECOOP 2007)• ...

85

Page 67: The Expression Problem (as part of the the PTT lecture)

(C) 2010-2013 Prof. Dr. Ralf Lämmel, Universität Koblenz-Landau (where applicable)

Summary on extensibilityWanted

Data extensibility and

Operation extensibility andSoft or strong static type checking and

Separate compilation/deployment and

Unanticipated extension and

Configurability and Composability and

Efficiency (“No distributed fat”)