libExpr - ruby expression library

	libExpr

What is it?

This is a ruby library that can convert string expressions with variables to RPN or Prefix or AST (abstract syntax tree) for quick evaluation.

That means you can give it something like (as a string):

  "12 + 3*6 / -(14+3) ^ 2 ^ 3"

And it will convert it to an RPN or Prefix or AST expression object, and then you can evaluate it. (Default is AST) This is as simple as:

require 'libExpr'
exprObj = Expr.new("12 + 3*6 / -(14+3) ^ 2 ^ 3")
answer = exprObj.eval() 
# Or, if you don't need to repeat the evaluation:
answer = Expr.new("12 + 3*6 / -(14+3) ^ 2 ^ 3").eval()

That in itself might not be interesting, since you can always use the built-in language eval() (although this one doesn't require any sanitizing of inputs to avoid the inherent security dangers that 'eval()' has).

You can also supply it with expressions that have variables:

  "12 + someVar * 6 / (32 - anotherVar)"

And when you evaluate the expression object you can supply a block which the evaluator will use to get any variables, such as:

vars['foo'] = 42
vars['bar'] = 3
require 'libExpr'
someExpr = Expr.new("5*foo-6/bar")
answer = someExpr.eval() { |varname| vars[varname] }
# Returns  5*42-6/3 = 208

# And evaluate again with different vars
vars['foo'] = 100
answer = someExpr.eval() { |varname| vars[varname] }
# Now returns 5*100-6/3 = 498

When you create the expression object, you can even supply the regexp that it will use to determine what constitutes a variable/label, which can be useful if your variable may otherwise get confused with operations, such as a project where all my variables were of the form "%VARIABLENAME" - which otherwise would look like the '%' op preceding a 'VARIABLENAME'. It's the second arg to the constructor. You can even implement some simple external function calls this way. Anything that a ruby block can calculate.

This can be useful when:

Eval is not safe (no need to sanitize inputs here)
'Variable' evaluation is different from ruby variables
Expressions are evaluated many times with different variable values
'Variables' may not actually be normal variable notation. I.e.: [`PARAMS@SOME_VAL("DEFAULT")]

Eval is fairly fast - runs about 50% of the speed of an actual ruby eval().

This is done with a modified Shunting-Yard Algorithm that can handle Unary/Ternary operations and short-circuiting of operations, so it's also potentially interesting to anyone trying to implement Shunting-Yard for their own application.

Unary Ops?

Yes - Dijkstra's original Shunting-Yard algorithm could only handle binary operations (operations that take two operands). To confuse the issue further, unary operations (such as '-a' and '+a') also have binary counterparts (such as 'a-b' and 'a+b'). Two changes are needed to handle unary ops:

A different operator needs to be used for the unary op vs the binary op. Since libExpr has operator objects, it's easy to create two different objects, and I used a key for hashing ops where a unary op has a space before it, i.e. " -" for unary negative and "-" for binary subtraction.
To recognize unary '-' as opposed to binary '-' is actually pretty simple when we realize that a unary '-' unambiguously will always follow either a left paren '(' or another operator (such as '3 + -7'). In all other cases it is a binary subtraction. Once we see that, we can just save the last token we saw when we run the Shunting-Yard, and that can tell us whether it's a unary or binary op (if the token supports both)
As a final point, when it's a unary op, we obviously only push one operand instead of two.

Handling ternary ops (?:) was very similar, except that we don't treat the ':' as an actual operation, we throw it away after ensuring (as a sanity check) that it exists to separate the second and third operands, then we just push all three operands.

Finally, short circuiting was only added for AST, because for an AST it's trivial to simply not traverse one side of the tree. It would be interesting to write an algorithm that would intelligently throw out the proper number of operators/operands from an RPN or Prefix stack to accomplish short circuiting, but that's when my development on this project stopped. Use AST if you need short-circuiting.

Last of all, I also added an 'if-then' operator "a -> b" which is effectively logically equivalent to "!a || b". I added this simply because I needed it for a project I was working on.

Format Capabilities

Format	Unary ops (+a, -a)	Short-Circuit (\|\|,&&,?:)	Ternary-Ops (?:)
RPN	Yes	No	No
Prefix	Yes	Yes	No
AST	Yes	Yes	Yes

Format Info

RPN (Reverse Polish Notation aka "postfix"):: Fastest (slightly), simplest to view and debug, but missing short-circuit/ternary
Prefix: Converts to RPN then quick conversion to Prefix. Supports short-circuit (for &&,||)
AST: Can do ternary ops. Also fast, though harder to debug/understand since it's a tree

Notes about short-circuit

Since there aren't function calls (although you could arguably implement them with the variables) then the short-circuiting mostly will only effect performance (by doing unnecessary calculations), but there are also cases such as:

  (a == 0) || (num/a)

Which will incorrectly cause a ZeroDivisionError if you don't have short-circuit turned on.

Testing

There is a testbench included with the library that is somewhat interesting. It generates a large mass of random legal expressions that are mathematical expressions (such as "3 + 7*-4") as well as logical expressions (such as "(3 < 12) || (9==5+4)") or a mix of both, such as "(4 < 17) ? (8+1) : (9/12-3)", except generally far more complicated than these examples. It then compares the three expression systems to ruby eval() and ensures that everything is calculated correctly. It also creates a few variables to test variable evaluation as well. Definitely useful if you want to add anything to the expression library.

An example expression

- +1-3 + (- var[:n2]) +  ( (var[:t1] )   ?  (var[:n2])   :- 1)  ^  (  ( - ( var[:n0] ) ) * 10 &((var[:f1] ) ? var[:n2]  : ( - (10) % (+var[:n0] )  )*+var[:n2] ) )

Which correctly gives us the answer -4

License:

This software is essentially free, but feel free to read my payment spiel
Please read the full license

Documentation?

There's more docs inside the source, take a look at the commenting.

Download:

You can get the ruby source directly, as well as the testbench, or just get the tar.gz.