[Experimental] Enabled the use of the enhanced JIT Compiler (AbstractTensor.lisp) #155

hikettei · 2024-06-03T07:47:09Z

Changes

New backend: aten, aten[clang]
- It entirely relies on the tiny JIT Compiler (called AbstractTensor.lisp)
  - It is tiny and easy to write a new backend.
  - It optimises the kernel, especially where memory bandwidth is an overhead. (Conv2d gets approximately 10 times faster on CPU.)
  - This method (could be) extended to GPU architectures.
  - (WIP) It unrolls/vectorizes the kernel, enabling SLEEF/Neon/AVX intrinsics.
  - (TBD) data-parallel computation using OpenMP.
- Now it supports basic operations including arithmetic ops and mathematical ops (not tested).
- (TODO) Implement Gemm/Im2Col/Winograd etc.
Removed a JITCPUTensor backend.
Fixed a bug related to the VM execution model. %vm-move ignores the broadcast. (resulting the shape error.)

e.g.:

(with-devices ((Aten[Clang] :debug 1 :opt 3) CPUTensor)
    ....)

$ ./roswell/waffe2.ros test -b "Aten[Clang]" -b LispTensor -b CPUTensor

…n an aten runtime.

…ng id2table refinements due to cName refactors.

…ompilation-mode

hikettei added 15 commits May 29, 2024 17:15

[Feature] Get ready for implementing aten backend.

1ebff72

[Enhancement] Implemented a baseline for implementing the aten backend

85230ba

[Add] Unary ops

5c93432

[Enhancement] Parameterized Backend Configuration using symbol-macro

3b02eae

[Update] Aten backends require first call (Aten[Backend] ...) macro.

c4d8537

[BugFix] Fixed indexing-rule, enabling all permutation tests passed o…

0999905

…n an aten runtime.

[BugFix] fixed stride computation to work complicated permutation

75487a4

[Enhancement] DynamicShape for Aten Runtime

cadbc25

Add: nn.lisp

19a2ad9

[Refactor] Purged JITCPUTensor backend.

863a1d2

[Enhancement] Supports for configurated runtime declaration

90c001e

Tweaked the example for mnist demonstration get worked again, includi…

10d94b1

…ng id2table refinements due to cName refactors.

[Optimization] Reduced the number of invoking gcc; by enabling lazy-c…

0a80373

…ompilation-mode

[BugFix] Stride Error

22acca5

[BugFix] adding initial_offset when using Aten runtime

212d30e

hikettei merged commit 0d532bb into develop Jun 3, 2024
3 checks passed

hikettei mentioned this pull request Jun 3, 2024

[WIP] Remove obsolete SIMD and BLAS dependencies (for cpu) #152

Open

5 tasks