Symdiff

Symbolic differentiation in C++, Python and C. All of them are standalone implementations. This is not intended to be an efficient Symbolic library but rather a relatively simple example of how it could be implemented.

py/ for symbolic diff in python
src/ for symbolic diff in C++
csymdiff/ for symbolic diff in C

If you are using MinGW. You need to disable pthreads on windows (cmake -Dgtest_disable_pthreads=ON)

Requirement

Python: None
C : None
C++ : cmake, LLVM is optional

Compiled with MSVC2015 (64bit) and LLVM-3.8.1 on Windows 10

Python

x = Unknown('x')
y = Unknown('y')

env = {x: scalar(5), y: scalar(2)}

f = x ** 3 - y ** 2
print(f)

dfdx = f.derivate(x)
dfdy = f.derivate(y)

val = f.eval(env)

C++

#include "sym.h"
#include <iostream>

using namespace symdif;

int main(){

    /*  Expr building */
    Sym x = make_var("x");
    Sym y = make_var("y");

    Sym f = x * x * y;

    f.print(std::cout) << std::endl;

    Sym df = f.derivate("x");

    df.print(std::cout) << std::endl;

    /*  Full Eval */
    Context env = {{"x", make_val(4)}, {"y", make_val(3)}};

    std::cout << f.full_eval(c) << std::endl;
    std::cout << df.full_eval(c) << std::endl;

    /*  Partial Eval */
    Context p = {{"x", make_val(4)}};

    f.partial_eval(p).print(std::cout) << std::endl;
    df.partial_eval(p).print(std::cout) << std::endl;

    return 0;
}

C

int main ()
{
    SymExpr x = sym_placeholder("x");
    SymExpr expr = sym_mult(x, x);

    sym_print(expr);     printf("\n");

    SymExpr df = sym_deriv("x", expr);

    sym_print(df);          printf("\n");

    sym_free(df);
    sym_free(expr);
    sym_free(x);

    return 0;
}

Features

Pretty Printing (C++, Python, C)
Basic Optimization (Constant folding, Mult by zero/one, etc...) (C++, Python)

LLVM-IR code gen (if LLVM is installed) (C++)

The expression 10 + y + 3 * x becomes: (you can choose the function's name)

  define double @my_fun(double %y, double %x) {
  fun_body:
     %0 = fmul double 3.000000e+00, %x
     %1 = fadd double %y, %0
     %2 = fadd double 1.000000e+01, %1
     ret double %2
  }

StrContext (C++, Python) and PtrContext (C++)
- StrContext is used to substitute using variable's name (Only placeholders can be subs)
Pattern Matching (C++)

Stack Based VM (C++)

   (if (+ -1 (- 3)) (+ x 2) (+ y 3))

   0 push -1
   1 push 3
   2 neg
   3 add
   4 branch 5
   5 lookup y
   6 push 3
   7 add
   8 push 1
   9 branch 3
  10 lookup x
  11 push 2
  12 add

Register Based VM (C++)

   (if (+ -1 (- 3)) (+ x 2) (+ y 3))

   0 load -1 => 0
   1 load 3 => 1
   2 neg 1, 0, 1
   3 add 0, 1, 0
   4 branch 0 ? 5
   5 lookup y => 0
   6 load 3 => 1
   7 add 0, 1, 0
   8 load 1 => 1
   9 branch 1 ? 3
  10 lookup x => 0
  11 load 2 => 1
  12 add 0, 1, 0

TODO

factorize/expand

Would be nice to be able to parse a string

  f = expr
  std::stringstream ss, ff;
  f.print(ss);
  std::string fstr = ss.str();

  g = read(fstr);
  f.print(ff);
  std::string gstr = ff.str();

  if (gstr == fstr)
      success ()

IDEAS

Type system
Function Call
Check if Graph can be 'easily' balanced
Check if we can use threads to compute two different branches

Check shared_ptr and thread

  Thread share pointers put they cannot deallocate those.
  Only temporary pointers will be deallocated.

   level 1 - Threading

   Thread 1 | Thread 2
            *                 level 0 - 1 Possible Thread
         /  |  \
        /   |   \
       +    |    +            level 1 - 2 possible Thread
     /   \  |  /   \
    a     b | c     d         level 2 - 4 Possible Thread
            |

  We cannot spawn thread at each nodes there are too many of them.
  but we could spam per level.

  Seek the level best fitting our threading capability.

  NB: Number of possible thread per level is not necessarily even. (Unary Nodes)

Add Tensor
- Placeholder could be generalized
- ConstTensor
Cached eval result
find_node(mult(2, add(3, 4))
find_pattern(mult(any, add(any, any))) // Any is a special node that can not be evaluated
find_all_node
find_all_pattern
replace(Pattern, Sym by)
replace(node, Sym by)

Branching (VMs)

if allows you to branch forward (skip instructions) while loops allow you to branch backward. So to prove that a program ends we need to show that the instruction counter will eventually reach the end of the program.

Performance

Although this is an experimental project and little regard have been given to performance a benchmark utility is present. While randomly running the bench_dummy program I found that exec time were quite different between Windows and linux. I think this is a bug, CPU time on windows subsystem for linux seems meaningless.

CPU Time for balanced expression
Windows - MSVC 2015 - 64bit
            Best        Worst   Total       SD
Partial  :  0.0150      0.0781  22.73560    0.0090
Full     :  0.0054      0.0388   7.24590    0.0032
(SVM) C+E:  0.0160      0.0597  20.20980    0.0048
(RVM) C+E:  0.0162      0.0674  20.59150    0.0050
(SVM) C  :  0.0124      0.0408  15.79970    0.0038
(RVM) C  :  0.0136      0.0571  17.16800    0.0042
(SVM) E  :  0.0031      0.0284   3.87030    0.0016
(RVM) E  :  0.0022      0.0246   2.96520    0.0018

Linux (Ubuntu subsystem on Windows, same machine) - GCC 6
            Best        Worst   Total       SD
Partial  :  0.0000      0.0312  10.90600    0.0077
Full     :  0.0000      0.0156   2.60940    0.0058
(SVM) C+E:  0.0000      0.0312   6.76560    0.0078
(RVM) C+E:  0.0000      0.0312   7.71880    0.0078
(SVM) C  :  0.0000      0.0156   4.57810    0.0071
(RVM) C  :  0.0000      0.0312   5.90620    0.0076
(SVM) E  :  0.0000      0.0156   2.23440    0.0055
(RVM) E  :  0.0000      0.0156   2.03120    0.0053

Relative Unbalanced/Balanced

Windows - CPU Time
Partial  :  1.2945     0.7703    1.2064
Full     :  1.0164     0.9746    1.0154
(SVM) C+E:  1.0862     0.7593    1.0698
(RVM) C+E:  1.0574     1.1768    1.1164
(SVM) C  :  1.1481     1.1451    1.1111
(RVM) C  :  1.1228     1.1505    1.1017
(SVM) E  :  1.0192     1.9329    1.0378
(RVM) E  :  0.8047     1.4860    0.8042

Linux - Real Time
Partial  :  1.1739     0.8880    1.0884
Full     :  0.8236     0.7172    0.8267
(SVM) C+E:  0.9028     0.8936    1.1145
(RVM) C+E:  0.8912     1.2287    1.0029
(SVM) C  :  0.9843     0.7342    0.9697
(RVM) C  :  0.9921     1.8600    1.0547
(SVM) E  :  0.6806     0.4652    0.6947
(RVM) E  :  0.4578     0.6739    0.4660

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
bench		bench
cmake		cmake
csymdiff		csymdiff
dependencies		dependencies
examples		examples
java		java
py		py
src		src
tests		tests
.clang-format		.clang-format
.gitignore		.gitignore
.gitmodules		.gitmodules
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Symdiff

Requirement

Python

C++

C

Features

TODO

IDEAS

Branching (VMs)

Performance

About

Releases

Packages

Languages

License

Delaunay/symdiff

Folders and files

Latest commit

History

Repository files navigation

Symdiff

Requirement

Python

C++

C

Features

TODO

IDEAS

Branching (VMs)

Performance

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages