PR-URL: #2956 Ref: #2039 Co-authored-by: Athan Reines <[email protected]> Reviewed-by: Athan Reines <[email protected]>
Showing 32 changed files with 5,817 additions and 0 deletions.
341 changes: 341 additions & 0 deletions
lib/node_modules/@stdlib/blas/base/saxpy-wasm/README.md
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,341 @@ | ||
<!-- | ||
@license Apache-2.0 | ||
Copyright (c) 2024 The Stdlib Authors. | ||
Licensed under the Apache License, Version 2.0 (the "License"); | ||
you may not use this file except in compliance with the License. | ||
You may obtain a copy of the License at | ||
http://www.apache.org/licenses/LICENSE-2.0 | ||
Unless required by applicable law or agreed to in writing, software | ||
distributed under the License is distributed on an "AS IS" BASIS, | ||
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
See the License for the specific language governing permissions and | ||
limitations under the License. | ||
--> | ||
|
||
# saxpy | ||
|
||
> Multiply a vector `x` by a constant `alpha` and add the result to `y`. | ||
<section class="usage"> | ||
|
||
## Usage | ||
|
||
```javascript | ||
var saxpy = require( '@stdlib/blas/base/saxpy-wasm' ); | ||
``` | ||
|
||
#### saxpy.main( N, alpha, x, strideX, y, strideY ) | ||
|
||
Multiplies a vector `x` by a constant `alpha` and adds the result to `y`. | ||
|
||
```javascript | ||
var Float32Array = require( '@stdlib/array/float32' ); | ||
|
||
var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0 ] ); | ||
var y = new Float32Array( [ 1.0, 1.0, 1.0, 1.0, 1.0 ] ); | ||
|
||
saxpy.main( x.length, 5.0, x, 1, y, 1 ); | ||
// y => <Float32Array>[ 6.0, 11.0, 16.0, 21.0, 26.0 ] | ||
``` | ||
|
||
The function has the following parameters: | ||
|
||
- **N**: number of indexed elements. | ||
- **alpha**: scalar constant. | ||
- **x**: input [`Float32Array`][@stdlib/array/float32]. | ||
- **strideX**: index increment for `x`. | ||
- **y**: input/output [`Float32Array`][@stdlib/array/float32] (updated in place). | |
- **strideY**: index increment for `y`. | ||
|
||
The `N` and stride parameters determine which elements in the strided arrays are accessed at runtime. For example, to multiply every other value in `x` by `alpha` and add the result to the first `N` elements of `y` in reverse order, | ||
|
||
```javascript | ||
var Float32Array = require( '@stdlib/array/float32' ); | ||
|
||
var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0, 6.0 ] ); | ||
var y = new Float32Array( [ 1.0, 1.0, 1.0, 1.0, 1.0, 1.0 ] ); | ||
|
||
var alpha = 5.0; | ||
|
||
saxpy.main( 3, alpha, x, 2, y, -1 ); | ||
// y => <Float32Array>[ 26.0, 16.0, 6.0, 1.0, 1.0, 1.0 ] | ||
``` | ||
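
The indexing convention in the example above can be sketched in plain JavaScript. This is an illustrative reference loop, not the package's WebAssembly implementation: the name `axpyReference` is hypothetical, and the sketch assumes the usual BLAS convention that a negative stride starts at index `(1-N)*stride` and walks backward. It also accumulates in double precision before storing to the `Float32Array`, so it does not model single-precision intermediate rounding.

```javascript
// Hypothetical reference loop (plain JavaScript, not the WebAssembly routine):
function axpyReference( N, alpha, x, strideX, y, strideY ) {
    var ix;
    var iy;
    var i;

    // Nothing to do when no elements are indexed or when `alpha` is zero:
    if ( N <= 0 || alpha === 0.0 ) {
        return y;
    }
    // For a negative stride, begin at the far end of the indexed range:
    ix = ( strideX < 0 ) ? ( 1 - N ) * strideX : 0;
    iy = ( strideY < 0 ) ? ( 1 - N ) * strideY : 0;

    for ( i = 0; i < N; i++ ) {
        y[ iy ] += alpha * x[ ix ];
        ix += strideX;
        iy += strideY;
    }
    return y;
}

var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0, 6.0 ] );
var y = new Float32Array( [ 1.0, 1.0, 1.0, 1.0, 1.0, 1.0 ] );

// Access x[0], x[2], x[4] and update y[2], y[1], y[0], in that order:
axpyReference( 3, 5.0, x, 2, y, -1 );
// y => <Float32Array>[ 26.0, 16.0, 6.0, 1.0, 1.0, 1.0 ]
```

With `strideY = -1` and `N = 3`, the starting index for `y` is `(1-3)*(-1) = 2`, which is why only the first three elements of `y` change.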
|
||
Note that indexing is relative to the first index. To introduce an offset, use [`typed array`][mdn-typed-array] views. | ||
|
||
<!-- eslint-disable stdlib/capitalized-comments --> | ||
|
||
```javascript | ||
var Float32Array = require( '@stdlib/array/float32' ); | ||
|
||
// Initial arrays... | ||
var x0 = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0, 6.0 ] ); | ||
var y0 = new Float32Array( [ 7.0, 8.0, 9.0, 10.0, 11.0, 12.0 ] ); | ||
|
||
// Create offset views... | ||
var x1 = new Float32Array( x0.buffer, x0.BYTES_PER_ELEMENT*1 ); // start at 2nd element | ||
var y1 = new Float32Array( y0.buffer, y0.BYTES_PER_ELEMENT*3 ); // start at 4th element | ||
|
||
saxpy.main( 3, 5.0, x1, -2, y1, 1 ); | ||
// y0 => <Float32Array>[ 7.0, 8.0, 9.0, 40.0, 31.0, 22.0 ] | ||
``` | ||
|
||
#### saxpy.ndarray( N, alpha, x, strideX, offsetX, y, strideY, offsetY ) | ||
|
||
Multiplies a vector `x` by a constant `alpha` and adds the result to `y` using alternative indexing semantics. | ||
|
||
```javascript | ||
var Float32Array = require( '@stdlib/array/float32' ); | ||
|
||
var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0 ] ); | ||
var y = new Float32Array( [ 1.0, 1.0, 1.0, 1.0, 1.0 ] ); | ||
var alpha = 5.0; | ||
|
||
saxpy.ndarray( x.length, alpha, x, 1, 0, y, 1, 0 ); | ||
// y => <Float32Array>[ 6.0, 11.0, 16.0, 21.0, 26.0 ] | ||
``` | ||
|
||
The function has the following additional parameters: | ||
|
||
- **offsetX**: starting index for `x`. | ||
- **offsetY**: starting index for `y`. | ||
|
||
While [`typed array`][mdn-typed-array] views mandate a view offset based on the underlying buffer, the offset parameters support indexing semantics based on starting indices. For example, to multiply every other value in `x` by a constant `alpha` starting from the second value and add to the last `N` elements in `y` where `x[i] -> y[n]`, `x[i+2] -> y[n-1]`,..., | ||
|
||
```javascript | ||
var Float32Array = require( '@stdlib/array/float32' ); | ||
|
||
var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0, 6.0 ] ); | ||
var y = new Float32Array( [ 7.0, 8.0, 9.0, 10.0, 11.0, 12.0 ] ); | ||
|
||
var alpha = 5.0; | ||
|
||
saxpy.ndarray( 3, alpha, x, 2, 1, y, -1, y.length-1 ); | ||
// y => <Float32Array>[ 7.0, 8.0, 9.0, 40.0, 31.0, 22.0 ] | ||
``` | ||
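
As a rough illustration of the offset-based semantics, the following plain JavaScript loop starts at the provided indices and advances by the strides, without adjusting the starting index for negative strides. The function name `axpyNdarrayReference` is hypothetical, and the loop is a sketch of the indexing only, not the package's WebAssembly implementation.

```javascript
// Hypothetical reference loop for offset-based indexing (plain JavaScript):
function axpyNdarrayReference( N, alpha, x, strideX, offsetX, y, strideY, offsetY ) {
    var ix = offsetX;
    var iy = offsetY;
    var i;

    // Nothing to do when no elements are indexed or when `alpha` is zero:
    if ( N <= 0 || alpha === 0.0 ) {
        return y;
    }
    for ( i = 0; i < N; i++ ) {
        y[ iy ] += alpha * x[ ix ];
        ix += strideX;
        iy += strideY;
    }
    return y;
}

var x = new Float32Array( [ 1.0, 2.0, 3.0, 4.0, 5.0, 6.0 ] );
var y = new Float32Array( [ 7.0, 8.0, 9.0, 10.0, 11.0, 12.0 ] );

// Pair x[1] with y[5], x[3] with y[4], and x[5] with y[3]:
axpyNdarrayReference( 3, 5.0, x, 2, 1, y, -1, y.length-1 );
// y => <Float32Array>[ 7.0, 8.0, 9.0, 40.0, 31.0, 22.0 ]
```

Because the starting indices are explicit, the caller, rather than the routine, decides where a negatively strided traversal begins.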
|
||
* * * | ||
|
||
### Module | ||
|
||
#### saxpy.Module( memory ) | ||
|
||
Returns a new WebAssembly [module wrapper][@stdlib/wasm/module-wrapper] instance which uses the provided WebAssembly [memory][@stdlib/wasm/memory] instance as its underlying memory. | ||
|
||
<!-- eslint-disable node/no-sync --> | ||
|
||
```javascript | ||
var Memory = require( '@stdlib/wasm/memory' ); | ||
|
||
// Create a new memory instance with an initial size of 10 pages (640KiB) and a maximum size of 100 pages (6.25MiB): | |
var mem = new Memory({ | ||
'initial': 10, | ||
'maximum': 100 | ||
}); | ||
|
||
// Create a BLAS routine: | ||
var mod = new saxpy.Module( mem ); | ||
// returns <Module> | ||
|
||
// Initialize the routine: | ||
mod.initializeSync(); | ||
``` | ||
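
The page counts passed to the memory constructor translate to bytes as follows. This sketch uses the standard `WebAssembly.Memory` global rather than `@stdlib/wasm/memory`, on the assumption that both measure memory in standard 64KiB WebAssembly pages; the constant name `PAGE_SIZE` is introduced here for illustration only.

```javascript
// Standard WebAssembly page size in bytes (64KiB):
var PAGE_SIZE = 65536;

// Allocate 10 pages initially, growable up to 100 pages:
var mem = new WebAssembly.Memory({
    'initial': 10,
    'maximum': 100
});

var nbytes = mem.buffer.byteLength;
// returns 655360

var kib = nbytes / 1024;
// returns 640

var maxMiB = ( 100 * PAGE_SIZE ) / ( 1024 * 1024 );
// returns 6.25
```

Ten pages therefore hold up to 163,840 single-precision values, which bounds the combined length of the vectors that can be resident in module memory before the memory needs to grow.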
|
||
#### saxpy.Module.prototype.main( N, α, xp, sx, yp, sy ) | ||
|
||
Multiplies a vector `x` by a constant and adds the result to `y`. | ||
|
||
<!-- eslint-disable node/no-sync --> | ||
|
||
```javascript | ||
var Memory = require( '@stdlib/wasm/memory' ); | ||
var oneTo = require( '@stdlib/array/one-to' ); | ||
var ones = require( '@stdlib/array/ones' ); | ||
var zeros = require( '@stdlib/array/zeros' ); | ||
var bytesPerElement = require( '@stdlib/ndarray/base/bytes-per-element' ); | ||
|
||
// Create a new memory instance with an initial size of 10 pages (640KiB) and a maximum size of 100 pages (6.25MiB): | |
var mem = new Memory({ | ||
'initial': 10, | ||
'maximum': 100 | ||
}); | ||
|
||
// Create a BLAS routine: | ||
var mod = new saxpy.Module( mem ); | ||
// returns <Module> | ||
|
||
// Initialize the routine: | ||
mod.initializeSync(); | ||
|
||
// Define a vector data type: | ||
var dtype = 'float32'; | ||
|
||
// Specify a vector length: | ||
var N = 5; | ||
|
||
// Define pointers (i.e., byte offsets) for storing two vectors: | ||
var xptr = 0; | ||
var yptr = N * bytesPerElement( dtype ); | ||
|
||
// Write vector values to module memory: | ||
mod.write( xptr, oneTo( N, dtype ) ); | ||
mod.write( yptr, ones( N, dtype ) ); | ||
|
||
// Perform computation: | ||
mod.main( N, 5.0, xptr, 1, yptr, 1 ); | ||
|
||
// Read out the results: | ||
var view = zeros( N, dtype ); | ||
mod.read( yptr, view ); | ||
|
||
console.log( view ); | ||
// => <Float32Array>[ 6.0, 11.0, 16.0, 21.0, 26.0 ] | ||
``` | ||
|
||
The function has the following parameters: | ||
|
||
- **N**: number of indexed elements. | ||
- **α**: scalar constant. | ||
- **xp**: input [`Float32Array`][@stdlib/array/float32] pointer (i.e., byte offset). | ||
- **sx**: index increment for `x`. | ||
- **yp**: input/output [`Float32Array`][@stdlib/array/float32] pointer (i.e., byte offset). | |
- **sy**: index increment for `y`. | ||
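
To make the distinction between pointers (byte offsets) and strides (element increments) concrete, the following sketch lays out two vectors in a single linear buffer. A plain `ArrayBuffer` stands in for module memory here, which is an assumption for illustration; the variable names `buffer`, `xview`, and `yview` are likewise hypothetical.

```javascript
// Stand-in for module memory (assumption: a plain ArrayBuffer rather than WebAssembly memory):
var buffer = new ArrayBuffer( 64 );

var N = 5;
var nb = Float32Array.BYTES_PER_ELEMENT;
// returns 4

// Byte offsets ("pointers") for two non-overlapping vectors:
var xptr = 0;
var yptr = xptr + ( N * nb );
// returns 20

// Typed array views over the regions the pointers designate (offsets must be multiples of 4 for float32):
var xview = new Float32Array( buffer, xptr, N );
var yview = new Float32Array( buffer, yptr, N );

xview.set( [ 1.0, 2.0, 3.0, 4.0, 5.0 ] );
yview.set( [ 1.0, 1.0, 1.0, 1.0, 1.0 ] );

// Strides are counted in elements, not bytes, so a unit stride visits consecutive values:
var i;
for ( i = 0; i < N; i++ ) {
    yview[ i ] += 5.0 * xview[ i ];
}
// yview => <Float32Array>[ 6.0, 11.0, 16.0, 21.0, 26.0 ]
```

In the package's API, `mod.write( xptr, ... )` and `mod.read( yptr, ... )` play the role of the `set` calls and view reads in this sketch.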
|
||
#### saxpy.Module.prototype.ndarray( N, α, xp, sx, ox, yp, sy, oy ) | ||
|
||
Multiplies a vector `x` by a constant and adds the result to `y` using alternative indexing semantics. | ||
|
||
<!-- eslint-disable node/no-sync --> | ||
|
||
```javascript | ||
var Memory = require( '@stdlib/wasm/memory' ); | ||
var oneTo = require( '@stdlib/array/one-to' ); | ||
var ones = require( '@stdlib/array/ones' ); | ||
var zeros = require( '@stdlib/array/zeros' ); | ||
var bytesPerElement = require( '@stdlib/ndarray/base/bytes-per-element' ); | ||
|
||
// Create a new memory instance with an initial size of 10 pages (640KiB) and a maximum size of 100 pages (6.25MiB): | |
var mem = new Memory({ | ||
'initial': 10, | ||
'maximum': 100 | ||
}); | ||
|
||
// Create a BLAS routine: | ||
var mod = new saxpy.Module( mem ); | ||
// returns <Module> | ||
|
||
// Initialize the routine: | ||
mod.initializeSync(); | ||
|
||
// Define a vector data type: | ||
var dtype = 'float32'; | ||
|
||
// Specify a vector length: | ||
var N = 5; | ||
|
||
// Define pointers (i.e., byte offsets) for storing two vectors: | ||
var xptr = 0; | ||
var yptr = N * bytesPerElement( dtype ); | ||
|
||
// Write vector values to module memory: | ||
mod.write( xptr, oneTo( N, dtype ) ); | ||
mod.write( yptr, ones( N, dtype ) ); | ||
|
||
// Perform computation: | ||
mod.ndarray( N, 5.0, xptr, 1, 0, yptr, 1, 0 ); | ||
|
||
// Read out the results: | ||
var view = zeros( N, dtype ); | ||
mod.read( yptr, view ); | ||
|
||
console.log( view ); | ||
// => <Float32Array>[ 6.0, 11.0, 16.0, 21.0, 26.0 ] | ||
``` | ||
|
||
The function has the following additional parameters: | ||
|
||
- **ox**: starting index for `x`. | ||
- **oy**: starting index for `y`. | ||
|
||
</section> | ||
|
||
<!-- /.usage --> | ||
|
||
<section class="notes"> | ||
|
||
* * * | ||
|
||
## Notes | ||
|
||
- If `N <= 0` or `alpha == 0`, `y` is left unchanged. | ||
- This package implements routines using WebAssembly. When provided arrays which are not allocated on a `saxpy` module memory instance, data must be explicitly copied to module memory prior to computation. Data movement may entail a performance cost, and, thus, if you are using arrays external to module memory, you should prefer using [`@stdlib/blas/base/saxpy`][@stdlib/blas/base/saxpy]. However, if working with arrays which are allocated and explicitly managed on module memory, you can achieve better performance when compared to the pure JavaScript implementations found in [`@stdlib/blas/base/saxpy`][@stdlib/blas/base/saxpy]. Beware that such performance gains may come at the cost of additional complexity when having to perform manual memory management. Choosing between implementations depends heavily on the particular needs and constraints of your application, with no one choice universally better than the other. | ||
- `saxpy()` corresponds to the [BLAS][blas] level 1 function [`saxpy`][saxpy]. | ||
|
||
</section> | ||
|
||
<!-- /.notes --> | ||
|
||
<section class="examples"> | ||
|
||
* * * | ||
|
||
## Examples | ||
|
||
<!-- eslint no-undef: "error" --> | ||
|
||
```javascript | ||
var discreteUniform = require( '@stdlib/random/array/discrete-uniform' ); | ||
var saxpy = require( '@stdlib/blas/base/saxpy-wasm' ); | ||
|
||
var opts = { | ||
'dtype': 'float32' | ||
}; | ||
var x = discreteUniform( 10, 0, 100, opts ); | ||
console.log( x ); | ||
|
||
var y = discreteUniform( x.length, 0, 10, opts ); | ||
console.log( y ); | ||
|
||
saxpy.ndarray( x.length, 5.0, x, 1, 0, y, -1, y.length-1 ); | ||
console.log( y ); | ||
``` | ||
|
||
</section> | ||
|
||
<!-- /.examples --> | ||
|
||
<!-- Section for related `stdlib` packages. Do not manually edit this section, as it is automatically populated. --> | ||
|
||
<section class="related"> | ||
|
||
</section> | ||
|
||
<!-- /.related --> | ||
|
||
<!-- Section for all links. Make sure to keep an empty line after the `section` element and another before the `/section` close. --> | ||
|
||
<section class="links"> | ||
|
||
[blas]: http://www.netlib.org/blas | ||
|
||
[saxpy]: http://www.netlib.org/lapack/explore-html/de/da4/group__double__blas__level1.html | ||
|
||
[mdn-typed-array]: https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/TypedArray | ||
|
||
[@stdlib/array/float32]: https://github.com/stdlib-js/stdlib/tree/develop/lib/node_modules/%40stdlib/array/float32 | ||
|
||
[@stdlib/wasm/memory]: https://github.com/stdlib-js/stdlib/tree/develop/lib/node_modules/%40stdlib/wasm/memory | ||
|
||
[@stdlib/wasm/module-wrapper]: https://github.com/stdlib-js/stdlib/tree/develop/lib/node_modules/%40stdlib/wasm/module-wrapper | ||
|
||
[@stdlib/blas/base/saxpy]: https://github.com/stdlib-js/stdlib/tree/develop/lib/node_modules/%40stdlib/blas/base/saxpy | ||
|
||
</section> | ||
|
||
<!-- /.links --> |
929a224