Fork improvements
- Make it easier to attach schedules to reduce nodes.
- User can attach schedule to init or reduct inputs and appropriate schedule will get propagated to reduce node.
- Persist tmp directories when clang, nvcc, or ar fail.
- Add new fork test.