Extend work stealing scheduler and study its performance

Implement a happens-before data-race detector using Intel's Pin tool

Develop a simple program for GPU