arkouda.numpy.pdarrayclass¶

Exceptions¶

RegistrationError

Error/Exception used when the Arkouda Server cannot register an object

Classes¶

pdarray

The basic arkouda array class. This class contains only the

Functions¶

`argmaxk`(→ pdarray)	Find the indices corresponding to the k maximum values of an array.
`argmink`(→ pdarray)	Finds the indices corresponding to the k minimum values of an array.
`broadcast_to_shape`(→ pdarray)	Create a "broadcasted" array (of rank 'nd') by copying an array into an
`clear`(→ None)	Send a clear message to clear all unregistered data from the server symbol table.
`clz`(→ pdarray)	Count leading zeros for each integer in an array.
`corr`(→ numpy.float64)	Return the correlation between x and y
`cov`(→ numpy.float64)	Return the covariance of x and y
`ctz`(→ pdarray)	Count trailing zeros for each integer in an array.
`diff`(→ pdarray)	Calculate the n-th discrete difference along the given axis.
`divmod`(→ Tuple[pdarray, pdarray])
`dot`(→ Union[arkouda.numpy.dtypes.numeric_scalars, pdarray])	Computes dot product of two arrays.
`fmod`(→ pdarray)	Returns the element-wise remainder of division.
`maxk`(→ pdarray)	Find the k maximum values of an array.
`mink`(→ pdarray)	Find the k minimum values of an array.
`mod`(→ pdarray)	Returns the element-wise remainder of division.
`parity`(→ pdarray)	Find the bit parity (XOR of all bits) for each integer in an array.
`popcount`(→ pdarray)	Find the population (number of bits set) for each integer in an array.
`power`(→ pdarray)	Raises an array to a power. If where is given, the operation will only take place in the positions
`rotl`(→ pdarray)	Rotate bits of <x> to the left by <rot>.
`rotr`(→ pdarray)	Rotate bits of <x> to the left by <rot>.
`sqrt`(→ pdarray)	Takes the square root of array. If where is given, the operation will only take place in

Module Contents¶

exception arkouda.numpy.pdarrayclass.RegistrationError[source]¶

Bases: Exception

Error/Exception used when the Arkouda Server cannot register an object

arkouda.numpy.pdarrayclass.argmaxk(pda: pdarray, k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶

Find the indices corresponding to the k maximum values of an array.

Returns the largest k values of an array, sorted

Parameters:

pda (pdarray) – Input array.
k (int_scalars) – The desired count of indices corresponding to maxmum array values

Returns:

The indices of the maximum k values from the pda, sorted

Return type:

pdarray

Raises:

TypeError – Raised if pda is not a pdarray or k is not an integer
ValueError – Raised if the pda is empty, or pda.ndim > 1, or k < 1

Notes

This call is equivalent in value to ak.argsort(a)[k:] and generally outperforms this operation.

This reduction will see a significant drop in performance as k grows beyond a certain value. This value is system dependent, but generally about a k of 5 million is where performance degradation has been observed.

Examples

>>> import arkouda as ak
>>> A = ak.array([10,5,1,3,7,2,9,0])
>>> ak.argmaxk(A, 3)
array([4 6 0])
>>> ak.argmaxk(A, 4)
array([1 4 6 0])

arkouda.numpy.pdarrayclass.argmink(pda: pdarray, k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶

Finds the indices corresponding to the k minimum values of an array.

Parameters:

pda (pdarray) – Input array.
k (int_scalars) – The desired count of indices corresponding to minimum array values

Returns:

The indices of the minimum k values from the pda, sorted

Return type:

pdarray

Raises:

TypeError – Raised if pda is not a pdarray or k is not an integer
ValueError – Raised if the pda is empty, or pda.ndim > 1, or k < 1

Notes

This call is equivalent in value to ak.argsort(a)[:k] and generally outperforms this operation.

Examples

>>> import arkouda as ak
>>> A = ak.array([10,5,1,3,7,2,9,0])
>>> ak.argmink(A, 3)
array([7 2 5])
>>> ak.argmink(A, 4)
array([7 2 5 3])

arkouda.numpy.pdarrayclass.broadcast_to_shape(pda: pdarray, shape: Tuple[int, Ellipsis]) → pdarray[source]¶

Create a “broadcasted” array (of rank ‘nd’) by copying an array into an array of the given shape.

E.g., given the following broadcast:

pda (3d array): 1 x 4 x 1

shape ( shape ): 7 x 4 x 2

Result (3d array): 7 x 4 x 2

When copying from a singleton dimension, the value is repeated along that dimension (e.g., pda’s 1st and 3rd above). For non singleton dimensions, the size of the two arrays must match, and the values are copied into the result array.

When prepending a new dimension to increase an array’s rank, the values from the other dimensions are repeated along the new dimension.

Parameters:

pda (pdarray) – the input to be broadcast
shape (tuple of int) – the shape to which pda is to be broadcast

Returns:

the result of the broadcast operation

Return type:

pdarray

Examples

>>> import arkouda as ak
>>> a = ak.arange(2).reshape(1,2,1)
>>> ak.broadcast_to_shape(a,(2,2,2))
array([array([array([0 0]) array([1 1])]) array([array([0 0]) array([1 1])])])
>>> a = ak.array([5,19]).reshape(1,2)
>>> ak.broadcast_to_shape(a,(2,2,2))
array([array([array([5 19]) array([5 19])]) array([array([5 19]) array([5 19])])])

Raises:: RuntimeError – raised if the pda can’t be broadcast to the given shape

arkouda.numpy.pdarrayclass.clear() → None[source]¶

Send a clear message to clear all unregistered data from the server symbol table.

Raises:: RuntimeError – Raised if there is a server-side error in executing clear request

arkouda.numpy.pdarrayclass.clz(pda: pdarray) → pdarray[source]¶

Count leading zeros for each integer in an array.

Parameters:: pda (pdarray, int64, uint64, bigint) – Input array (must be integral).
Returns:: The number of leading zeros of each element.
Return type:: pdarray
Raises:: TypeError – If input array is not int64, uint64, or bigint

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.clz(A)
array([64 63 62 62 61 61 61 61 60 60])

arkouda.numpy.pdarrayclass.corr(x: pdarray, y: pdarray) → numpy.float64[source]¶

Return the correlation between x and y

Parameters:

x (pdarray) – One of the pdarrays used to calculate correlation
y (pdarray) – One of the pdarrays used to calculate correlation

Returns:

The scalar correlation of the two pdarrays

Return type:

np.float64

Examples

>>> import arkouda as ak
>>> a = ak.arange(10)
>>> b = a + 1
>>> ak.corr(a,b)
np.float64(0.9999999999999998)
>>> a.corr(b)
np.float64(0.9999999999999998)

Raises:

TypeError – Raised if x or y is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

See also

std, cov

Notes

The correlation is calculated by cov(x, y) / (x.std(ddof=1) * y.std(ddof=1))

arkouda.numpy.pdarrayclass.cov(x: pdarray, y: pdarray) → numpy.float64[source]¶

Return the covariance of x and y

Parameters:

x (pdarray) – One of the pdarrays used to calculate covariance
y (pdarray) – One of the pdarrays used to calculate covariance

Returns:

The scalar covariance of the two pdarrays

Return type:

np.float64

Examples

>>> import arkouda as ak
>>> a = ak.arange(10)
>>> b = a + 1
>>> ak.cov(a,b)
np.float64(9.166666666666666)
>>> a.cov(b)
np.float64(9.166666666666666)

Raises:

TypeError – Raised if x or y is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

See also

mean, var

Notes

The covariance is calculated by cov = ((x - x.mean()) * (y - y.mean())).sum() / (x.size - 1).

arkouda.numpy.pdarrayclass.ctz(pda: pdarray) → pdarray[source]¶

Count trailing zeros for each integer in an array.

Parameters:: pda (pdarray, int64, uint64, bigint) – Input array (must be integral).
Returns:: The number of trailing zeros of each element.
Return type:: pdarray

Notes

ctz(0) is defined to be zero.

Raises:: TypeError – If input array is not int64, uint64, or bigint

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.ctz(A)
array([0 0 1 0 2 0 1 0 3 0])

arkouda.numpy.pdarrayclass.diff(a: pdarray, n: int = 1, axis: int = -1, prepend=None, append=None) → pdarray[source]¶

Calculate the n-th discrete difference along the given axis.

The first difference is given by out[i] = a[i+1] - a[i] along the given axis, higher differences are calculated by using diff iteratively.

Parameters:

a (pdarray) – The array to calculate the difference
n (int, optional) – The order of the finite difference. Default is 1.
axis (int, optional) – The axis along which to calculate the difference. Default is the last axis.
prepend (pdarray, optional) – The pdarray to prepend to a along axis before calculating the difference.
append (pdarray, optional) – The pdarray to append to a along axis before calculating the difference.

Returns:

The n-th differences. The shape of the output is the same as a except along axis where the dimension is smaller by n. The type of the output is the same as the type of the difference between any two elements of a. This is the same as the type of a in most cases. A notable exception is datetime64, which results in a timedelta64 output array.

Return type:

pdarray

Notes

Type is preserved for boolean arrays, so the result will contain False when consecutive elements are the same and True when they differ.

For unsigned integer arrays, the results will also be unsigned. This should not be surprising, as the result is consistent with calculating the difference directly.

If this is not desirable, then the array should be cast to a larger integer type first:

Examples

>>> import arkouda as ak
>>> a = ak.array([1, 2, 4, 7, 0])
>>> ak.diff(a)
array([1 2 3 -7])
>>> ak.diff(a, n=2)
array([1 1 -10])

>>> a = ak.array([[1, 3, 6, 10], [0, 5, 6, 8]])
>>> ak.diff(a)
array([array([2 3 4]) array([5 1 2])])
>>> ak.diff(a, axis=0)
array([array([-1 2 0 -2])])

arkouda.numpy.pdarrayclass.divmod(x: arkouda.numpy.dtypes.numeric_scalars | pdarray, y: arkouda.numpy.dtypes.numeric_scalars | pdarray, where: arkouda.numpy.dtypes.bool_scalars | pdarray = True) → Tuple[pdarray, pdarray][source]¶

Parameters:

x (numeric_scalars(float_scalars, int_scalars) or pdarray) – The dividend array, the values that will be the numerator of the floordivision and will be acted on by the bases for modular division.
y (numeric_scalars(float_scalars, int_scalars) or pdarray) – The divisor array, the values that will be the denominator of the division and will be the bases for the modular division.
where (Boolean or pdarray) – This condition is broadcast over the input. At locations where the condition is True, the corresponding value will be divided using floor and modular division. Elsewhere, it will retain its original value. Default set to True.

Returns:

Returns a tuple that contains quotient and remainder of the division

Return type:

(pdarray, pdarray)

Raises:

TypeError – At least one entry must be a pdarray
ValueError – If both inputs are both pdarrays, their size must match
ZeroDivisionError – No entry in y is allowed to be 0, to prevent division by zero

Notes

The div is calculated by x // y The mod is calculated by x % y

Examples

>>> import arkouda as ak
>>> x = ak.arange(5, 10)
>>> y = ak.array([2, 1, 4, 5, 8])
>>> ak.divmod(x,y)
(array([2 6 1 1 1]), array([1 0 3 3 1]))
>>> ak.divmod(x,y, x % 2 == 0)
(array([5 6 7 1 9]), array([5 0 7 3 9]))

Computes dot product of two arrays.

If pda1 and pda2 are 1-D vectors of identical length, returns the conventional dot product.

If both pda1 and pda2 are scalars, returns their product.

If one of pda1, pda2 is a scalar, and the other a pdarray, returns the pdarray multiplied by the scalar.

If both pda1 and pda2 are 2-D arrays, returns the matrix multiplication.

If pda1 is M-D and pda2 is 1-D, returns a sum product over the last axis of pda1 and pda2.

If pda1 is M-D and pda2 is N-D, returns a sum product over the last axis of pda1 and the next-to-last axis of pda2, e.g.:

For example, If pda1 has rank (3,3,4) and pda2 has rank (4,2), then the result of ak.dot(pda1,pda2) has rank (3,3,2), and

result[i,j,k] = sum( pda1[i, j, :] * pda2[:, k] )

Parameters:

pda1 (Union[np.int64, np.float64, np.uint64, pdarray],)
pda2 (Union[np.int64, np.float64, np.uint64, pdarray],)

Returns:

as described above

Return type:

Union[numeric_scalars, pdarray]

Examples

>>> import arkouda as ak
>>> ak.dot(ak.array([1, 2, 3]),ak.array([4,5,6]))
np.int64(32)
>>> ak.dot(ak.array([1, 2, 3]),5)
array([5 10 15])
>>> ak.dot(5,ak.array([2, 3, 4]))
array([10 15 20])
>>> ak.dot(ak.arange(9).reshape(3,3),ak.arange(6).reshape(3,2))
array([array([10 13]) array([28 40]) array([46 67])])
>>> ak.dot(ak.arange(27).reshape(3,3,3),ak.array([2,3,4]))
array([array([11 38 65]) array([92 119 146]) array([173 200 227])])
>>> ak.dot(ak.arange(36).reshape(3,3,4),ak.arange(8).reshape(4,2))
array([array([array([28 34]) array([76 98]) array([124 162])]) array([array([172 226])
    array([220 290]) array([268 354])]) array([array([316 418]) array([364 482]) array([412 546])])])

Raises:: ValueError – Raised if either pdda1 or pda2 is not an allowed type, or if shapes are incompatible.

arkouda.numpy.pdarrayclass.fmod(dividend: pdarray | arkouda.numpy.dtypes.numeric_scalars, divisor: pdarray | arkouda.numpy.dtypes.numeric_scalars) → pdarray[source]¶

Returns the element-wise remainder of division.

It is equivalent to np.fmod, the remainder has the same sign as the dividend.

Parameters:

dividend (numeric scalars or pdarray) – The array being acted on by the bases for the modular division.
divisor (numeric scalars or pdarray) – The array that will be the bases for the modular division.

Returns:

an array that contains the element-wise remainder of division.

Return type:

pdarray

Raises:

TypeError – Raised if neither dividend nor divisor is a pdarray (at least one must be) or if any scalar or pdarray element is not one of int, uint, float, bigint

arkouda.numpy.pdarrayclass.maxk(pda: pdarray, k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶

Find the k maximum values of an array.

Returns the largest k values of an array, sorted

Parameters:

pda (pdarray) – Input array.
k (int_scalars) – The desired count of maximum values to be returned by the output.

Returns:

The maximum k values from pda, sorted

Return type:

pdarray

Raises:

TypeError – Raised if pda is not a pdarray or k is not an integer
ValueError – Raised if the pda is empty, or pda.ndim > 1, or k < 1

Notes

This call is equivalent in value to a[ak.argsort(a)[k:]]

and generally outperforms this operation.

Examples

>>> import arkouda as ak
>>> A = ak.array([10,5,1,3,7,2,9,0])
>>> ak.maxk(A, 3)
array([7 9 10])
>>> ak.maxk(A, 4)
array([5 7 9 10])

arkouda.numpy.pdarrayclass.mink(pda: pdarray, k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶

Find the k minimum values of an array.

Returns the smallest k values of an array, sorted

Parameters:

pda (pdarray) – Input array.
k (int_scalars) – The desired count of minimum values to be returned by the output.

Returns:

The minimum k values from pda, sorted

Return type:

pdarray

Raises:

TypeError – Raised if pda is not a pdarray
ValueError – Raised if the pda is empty, or pda.ndim > 1, or k < 1

Notes

This call is equivalent in value to a[ak.argsort(a)[:k]] and generally outperforms this operation.

Examples

>>> import arkouda as ak
>>> A = ak.array([10,5,1,3,7,2,9,0])
>>> ak.mink(A, 3)
array([0 1 2])
>>> ak.mink(A, 4)
array([0 1 2 3])

arkouda.numpy.pdarrayclass.mod(dividend, divisor) → pdarray[source]¶

Returns the element-wise remainder of division.

Computes the remainder complementary to the floor_divide function. It is equivalent to np.mod, the remainder has the same sign as the divisor.

Parameters:

dividend – pdarray : The numeric scalar or pdarray being acted on by the bases for the modular division.
divisor – pdarray : The numeric scalar or pdarray that will be the bases for the modular division.

Returns:

an array that contains the element-wise remainder of division.

Return type:

pdarray

Examples

>>> import arkouda as ak
>>> a = ak.array([1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20])
>>> b = ak.array([2,2,2,3,3,3,4,4,4,5,5,5,6,6,6,7,7,7,8,8])
>>> ak.mod(a,b)
array([1 0 1 1 2 0 3 0 1 0 1 2 1 2 3 2 3 4 3 4])

Raises:: ValueError – raised if shapes of dividend and divisor are incompatible

arkouda.numpy.pdarrayclass.parity(pda: pdarray) → pdarray[source]¶

Find the bit parity (XOR of all bits) for each integer in an array.

Parameters:: pda (pdarray, int64, uint64, bigint) – Input array (must be integral).
Returns:: The parity of each element: 0 if even number of bits set, 1 if odd.
Return type:: pdarray
Raises:: TypeError – If input array is not int64, uint64, or bigint

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.parity(A)
array([0 1 1 0 1 0 0 1 1 0])

class arkouda.numpy.pdarrayclass.pdarray(name: str, mydtype: numpy.dtype, size: arkouda.numpy.dtypes.int_scalars, ndim: arkouda.numpy.dtypes.int_scalars, shape: Tuple[int, Ellipsis], itemsize: arkouda.numpy.dtypes.int_scalars, max_bits: int | None = None)[source]¶

The basic arkouda array class. This class contains only the attributes of the array; the data resides on the arkouda server. When a server operation results in a new array, arkouda will create a pdarray instance that points to the array data on the server. As such, the user should not initialize pdarray instances directly.

name¶

The server-side identifier for the array

Type:: str

dtype¶

The element dtype of the array

Type:: type

size¶

The number of elements in the array

Type:: int_scalars

ndim¶

The rank of the array

Type:: int_scalars

shape¶

A tuple containing the sizes of each dimension of the array

Type:: Tuple[int, …]

itemsize¶

The size in bytes of each element

Type:: int_scalars

BinOps¶

OpEqOps¶

all(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.bool_scalars | pdarray[source]¶

Return True iff all elements of the array along the given axis evaluate to True.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

boolean if axis is omitted, pdarray if axis is supplied

Return type:

boolean or pdarray

Examples

>>> import arkouda as ak
>>> ak.all(ak.array([True,False,False]))
np.False_
>>> ak.all(ak.array([[True,True,False],[False,True,True]]),axis=0)
array([False True False])
>>> ak.all(ak.array([[True,True,True],[False,False,False]]),axis=0,keepdims=True)
array([array([False False False])])
>>> ak.all(ak.array([[True,True,True],[False,False,False]]),axis=1,keepdims=True)
array([array([True]) array([False])])
>>> ak.array([True,False,False]).all()
np.False_

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Notes

Works as a method of a pdarray (e.g. a.any()) or a standalone function (e.g. ak.all(a))

any(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.bool_scalars | pdarray[source]¶

Return True iff any element of the array along the given axis evaluates to True.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

boolean if axis is omitted, else pdarray if axis is supplied

Return type:

boolean or pdarray

Examples

>>> import arkouda as ak
>>> ak.any(ak.array([True,False,False]))
np.True_
>>> ak.any(ak.array([[True,True,False],[False,True,True]]),axis=0)
array([True True True])
>>> ak.any(ak.array([[True,True,True],[False,False,False]]),axis=0,keepdims=True)
array([array([True True True])])
>>> ak.any(ak.array([[True,True,True],[False,False,False]]),axis=1,keepdims=True)
array([array([True]) array([False])])
>>> ak.array([True,False,False]).any()
np.True_

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Notes

Works as a method of a pdarray (e.g. a.any()) or a standalone function (e.g. ak.any(a))

argmax(axis: int | None | None = None, keepdims: bool = False) → numpy.int64 | numpy.uint64 | pdarray[source]¶

Return index of the first occurrence of the maximum along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

int64 or uint64 if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

int64, uint64 or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.argmax(ak.array([1,2,3,4,5]))
np.int64(4)
>>> ak.argmax(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.int64(0)
>>> ak.array([[1,2,3],[5,4,3]]).argmax(axis=1)
array([2 0])

Notes

Works as a method of a pdarray (e.g. a.argmax()) or a standalone function (e.g. ak.argmax(a))

argmaxk(k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶: Finds the indices corresponding to the k maximum values of an array. See arkouda.argmaxk for details.

argmin(axis: int | None | None = None, keepdims: bool = False) → numpy.int64 | numpy.uint64 | pdarray[source]¶

Return index of the first occurrence of the minimum along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

int64 or uint64 if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

int64, uint64 or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.argmin(ak.array([1,2,3,4,5]))
np.int64(0)
>>> ak.argmin(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.int64(4)
>>> ak.array([[1,2,3],[5,4,3]]).argmin(axis=1)
array([0 2])

Notes

Works as a method of a pdarray (e.g. a.argmin()) or a standalone function (e.g. ak.argmin(a))

argmink(k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶: Finds the indices corresponding to the k minimum values of an array. See arkouda.argmink for details.

argsort(algorithm: arkouda.numpy.sorting.SortingAlgorithm = SortingAlgorithm.RadixSortLSD, axis: arkouda.numpy.dtypes.int_scalars = 0, ascending: bool = True) → pdarray[source]¶

Return the permutation that sorts the pdarray.

Parameters:

algorithm (SortingAlgorithm, default SortingAlgorithm.RadixSortLSD) – The algorithm to use for sorting.
axis (int_scalars, default 0) – The axis to sort along. Must be between -1 and the array rank.
ascending (bool, default True) – Whether to sort in ascending order. If False, returns a reversed permutation. Note: ascending=False is only supported for 1D arrays.

Returns:

The indices that would sort the array.

Return type:

pdarray

Examples

>>> import arkouda as ak
>>> a = ak.array([42, 7, 19])
>>> a.argsort()
array([1 2 0])
>>> a[a.argsort()]
array([7 19 42])
>>> a.argsort(ascending=False)
array([0 2 1])

astype(dtype) → pdarray[source]¶

Cast values of pdarray to provided dtype

Parameters:: dtype (np.dtype or str) – Dtype to cast to

Examples

>>> import arkouda as ak
>>> ak.array([1,2,3]).astype(ak.float64)
array([1.00000000000000000 2.00000000000000000 3.00000000000000000])
>>> ak.array([1.5,2.5]).astype(ak.int64)
array([1 2])
>>> ak.array([True,False]).astype(ak.int64)
array([1 0])

Returns:: An arkouda pdarray with values converted to the specified data type
Return type:: ak.pdarray

Notes

This is essentially shorthand for ak.cast(x, ‘<dtype>’) where x is a pdarray.

bigint_to_uint_arrays() → List[pdarray][source]¶

Create a list of uint pdarrays from a bigint pdarray. The first item in return will be the highest 64 bits of the bigint pdarray and the last item will be the lowest 64 bits.

Returns:: A list of uint pdarrays where: The first item in return will be the highest 64 bits of the bigint pdarray and the last item will be the lowest 64 bits.
Return type:: List[pdarrays]
Raises:: RuntimeError – Raised if there is a server-side error thrown

See also

pdarraycreation.bigint_from_uint_arrays

Examples

>>> import arkouda as ak
>>> a = ak.arange(2**64, 2**64 + 5)
>>> a
array([18446744073709551616 18446744073709551617 18446744073709551618
18446744073709551619 18446744073709551620])
>>> a.bigint_to_uint_arrays()
[array([1 1 1 1 1]), array([0 1 2 3 4])]

clz() → pdarray[source]¶: Count the number of leading zeros in each element. See ak.clz.

copy() → pdarray[source]¶

Return an array copy of the given object.

Returns:: A deep copy of the pdarray.
Return type:: pdarray

corr(y: pdarray) → numpy.float64[source]¶: Compute the correlation between self and y using pearson correlation coefficient. See arkouda.corr for details.

cov(y: pdarray) → numpy.float64[source]¶: Compute the covariance between self and y.

ctz() → pdarray[source]¶: Count the number of trailing zeros in each element. See ak.ctz.

dtype: numpy.dtype¶

equals(other) → arkouda.numpy.dtypes.bool_scalars[source]¶

Whether pdarrays are the same size and all entries are equal.

Parameters:: other (object) – object to compare.
Returns:: True if the pdarrays are the same, o.w. False.
Return type:: bool_scalars

Examples

>>> import arkouda as ak
>>> a = ak.array([1, 2, 3])
>>> a_cpy = ak.array([1, 2, 3])
>>> a.equals(a_cpy)
np.True_
>>> a2 = ak.array([1, 2, 5])
>>> a.equals(a2)
np.False_

fill(value: arkouda.numpy.dtypes.numeric_scalars) → None[source]¶

Fill the array (in place) with a constant value.

Parameters:: value (numeric_scalars)
Raises:: TypeError – Raised if value is not an int, int64, float, or float64

flatten()[source]¶

Return a copy of the array collapsed into one dimension.

Return type:: A copy of the input array, flattened to one dimension.

Examples

>>> import arkouda as ak
>>> a = ak.array([[3,2,1],[2,3,1]])
>>> a.flatten()
array([3 2 1 2 3 1])

format_other(other) → str[source]¶

Attempt to cast scalar other to the element dtype of this pdarray, and print the resulting value to a string (e.g. for sending to a server command). The user should not call this function directly.

Parameters:: other (object) – The scalar to be cast to the pdarray.dtype
Return type:: string representation of np.dtype corresponding to the other parameter
Raises:: TypeError – Raised if the other parameter cannot be converted to Numpy dtype

property inferred_type: str | None¶: Return a string of the type inferred from the values.

info() → str[source]¶

Return a JSON formatted string containing information about all components of self.

Returns:: JSON string containing information about all components of self
Return type:: str

is_registered() → numpy.bool_[source]¶

Return True iff the object is contained in the registry

Returns:: Indicates if the object is contained in the registry
Return type:: bool
Raises:: RuntimeError – Raised if there’s a server-side error thrown

Note

This will return True if the object is registered itself or as a component of another object

is_sorted(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.bool_scalars | pdarray[source]¶

Return True iff the array (or given axis of the array) is monotonically non-decreasing.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

boolean if axis is omitted, else pdarray if axis is supplied

Return type:

boolean or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.is_sorted(ak.array([1,2,3,4,5]))
np.True_
>>> ak.is_sorted(ak.array([5,4,3,2,1]))
np.False_
>>> ak.array([[1,2,3],[5,4,3]]).is_sorted(axis=1)
array([True False])

Notes

Works as a method of a pdarray (e.g. a.is_sorted()) or a standalone function (e.g. ak.is_sorted(a))

itemsize: arkouda.numpy.dtypes.int_scalars¶

max(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.numeric_scalars | pdarray[source]¶

Return max of array elements along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

numeric_scalar if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

numeric_scalar or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.max(ak.array([1,2,3,4,5]))
np.int64(5)
>>> ak.max(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.float64(5.5)
>>> ak.array([[1,2,3],[5,4,3]]).max(axis=1)
array([3 5])

Notes

Works as a method of a pdarray (e.g. a.max()) or a standalone function (e.g. ak.max(a))

property max_bits¶

maxk(k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶: Compute the maximum “k” values. See arkouda.maxk for details.

mean(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.numeric_scalars | pdarray[source]¶

Return the mean of the array.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

The mean calculated from the pda sum and size, along the axis/axes if those are given.

Return type:

Union[np.float64, pdarray]

Examples

>>> import arkouda as ak
>>> a = ak.arange(10)
>>> ak.mean(a)
np.float64(4.5)
>>> a.mean()
np.float64(4.5)
>>> a = ak.arange(10).reshape(2,5)
>>> a.mean(axis=0)
array([2.5 3.5 4.5 5.5 6.5])
>>> ak.mean(a,axis=0)
array([2.5 3.5 4.5 5.5 6.5])
>>> a.mean(axis=1)
array([2.00000000000000000 7.00000000000000000])
>>> ak.mean(a,axis=1)
array([2.00000000000000000 7.00000000000000000])

Raises: Raised if pda is not a pdarray instance
RuntimeError: Raised if there’s a server-side error thrown

min(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.numeric_scalars | pdarray[source]¶

Return min of array elements along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

numeric_scalar if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

numeric_scalar or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.min(ak.array([1,2,3,4,5]))
np.int64(1)
>>> ak.min(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.float64(1.5)
>>> ak.array([[1,2,3],[5,4,3]]).min(axis=1)
array([1 3])

Notes

Works as a method of a pdarray (e.g. a.min()) or a standalone function (e.g. ak.min(a))

mink(k: arkouda.numpy.dtypes.int_scalars) → pdarray[source]¶: Compute the minimum “k” values. See arkouda.mink for details.

name: str¶

property nbytes¶

The size of the pdarray in bytes.

Returns:: The size of the pdarray in bytes.
Return type:: int

ndim: arkouda.numpy.dtypes.int_scalars¶

objType = 'pdarray'¶

opeq(other, op)[source]¶

parity() → pdarray[source]¶: Find the parity (XOR of all bits) in each element. See ak.parity.

popcount() → pdarray[source]¶: Find the population (number of bits set) in each element. See ak.popcount.

pretty_print_info() → None[source]¶: Print information about all components of self in a human-readable format.

prod(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.numeric_scalars | pdarray[source]¶

Return prod of array elements along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, defalt = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

numeric_scalars if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

numeric_scalars or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.prod(ak.array([1,2,3,4,5]))
np.int64(120)
>>> ak.prod(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.float64(324.84375)
>>> ak.array([[1,2,3],[5,4,3]]).prod(axis=1)
array([6 60])

Notes

Works as a method of a pdarray (e.g. a.prod()) or a standalone function (e.g. ak.prod(a))

register(user_defined_name: str) → pdarray[source]¶

Register this pdarray with a user defined name in the arkouda server so it can be attached to later using pdarray.attach() This is an in-place operation, registering a pdarray more than once will update the name in the registry and remove the previously registered name. A name can only be registered to one pdarray at a time.

Parameters:

user_defined_name (str) – user defined name array is to be registered under

Returns:

The same pdarray which is now registered with the arkouda server and has an updated name. This is an in-place modification, the original is returned to support a fluid programming style. Please note you cannot register two different pdarrays with the same name.

Return type:

pdarray

Raises:

TypeError – Raised if user_defined_name is not a str
RegistrationError – If the server was unable to register the pdarray with the user_defined_name If the user is attempting to register more than one pdarray with the same name, the former should be unregistered first to free up the registration name.

See also

attach, unregister, is_registered, list_registry, unregister_pdarray_by_name

Notes

Registered names/pdarrays in the server are immune to deletion until they are unregistered.

Examples

>>> import arkouda as ak
>>> a = ak.zeros(3)
>>> a.register("my_zeros")
array([0.00000000000000000 0.00000000000000000 0.00000000000000000])

potentially disconnect from server and reconnect to server >>> b = ak.attach(“my_zeros”) >>> b.unregister()

registered_name: str | None = None¶

reshape(*shape)[source]¶

Gives a new shape to an array without changing its data.

Parameters:: shape (int, tuple of ints, or pdarray) – The new shape should be compatible with the original shape.
Returns:: a pdarray with the same data, reshaped to the new shape
Return type:: pdarray

Examples

>>> import arkouda as ak
>>> a = ak.array([[3,2,1],[2,3,1]])
>>> a.reshape((3,2))
array([array([3 2]) array([1 2]) array([3 1])])
>>> a.reshape(3,2)
array([array([3 2]) array([1 2]) array([3 1])])
>>> a.reshape((6,1))
array([array([3]) array([2]) array([1]) array([2]) array([3]) array([1])])

Notes

only available as a method, not as a standalone function, i.e., a.reshape(compatibleShape) is valid, but ak.reshape(a,compatibleShape) is not.

rotl(other) → pdarray[source]¶: Rotate bits left by <other>.

rotr(other) → pdarray[source]¶: Rotate bits right by <other>.

property shape¶

Return the shape of an array.

Returns:: The elements of the shape tuple give the lengths of the corresponding array dimensions.
Return type:: tuple of int

size: arkouda.numpy.dtypes.int_scalars¶

slice_bits(low, high) → pdarray[source]¶

Return a pdarray containing only bits from low to high of self.

This is zero indexed and inclusive on both ends, so slicing the bottom 64 bits is pda.slice_bits(0, 63)

Parameters:

low (int) – The lowest bit included in the slice (inclusive) zero indexed, so the first bit is 0
high (int) – The highest bit included in the slice (inclusive)

Returns:

A new pdarray containing the bits of self from low to high

Return type:

pdarray

Raises:

RuntimeError – Raised if there is a server-side error thrown

Examples

>>> import arkouda as ak
>>> p = ak.array([2**65 + (2**64 - 1)])
>>> bin(p[0])
'0b101111111111111111111111111111111111111111111111111111111111111111'
>>> bin(p.slice_bits(64, 65)[0])
'0b10'
>>> a = ak.array([143,15])
>>> a.slice_bits(1,3)
array([7 7])
>>> a.slice_bits(4,9)
array([8 0])
>>> a.slice_bits(1,9)
array([71 7])

std(ddof: arkouda.numpy.dtypes.int_scalars = 0, axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool | None = False) → numpy.float64 | pdarray[source]¶

Return the standard deviation of values in the array. The standard deviation is implemented as the square root of the variance.

Parameters:

ddof (int_scalars) – “Delta Degrees of Freedom” used in calculating std
axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

The scalar standard deviation of the array, or the standard deviation: along the axis/axes if supplied

Return type:

Union[np.float64, pdarray]

Examples

>>> import arkouda as ak
>>> a = ak.arange(10)
>>> ak.std(a)
np.float64(2.8722813232690143)
>>> a.std()
np.float64(2.8722813232690143)
>>> a = ak.arange(10).reshape(2,5)
>>> a.std(axis=0)
array([2.5 2.5 2.5 2.5 2.5])
>>> ak.std(a,axis=0)
array([2.5 2.5 2.5 2.5 2.5])
>>> a.std(axis=1)
array([1.4142135623730951 1.4142135623730951])
>>> ak.std(a,axis=1)
array([1.4142135623730951 1.4142135623730951])

Raises:

TypeError – Raised if pda is not a pdarray instance or ddof is not an integer
ValueError – Raised if ddof is an integer < 0
RuntimeError – Raised if there’s a server-side error thrown

See also

mean, var

Notes

The standard deviation is the square root of the average of the squared deviations from the mean, i.e., std = sqrt(mean((x - x.mean())**2)).

The average squared deviation is normally calculated as x.sum() / N, where N = len(x). If, however, ddof is specified, the divisor N - ddof is used instead. In standard statistical practice, ddof=1 provides an unbiased estimator of the variance of the infinite population. ddof=0 provides a maximum likelihood estimate of the variance for normally distributed variables. The standard deviation computed in this function is the square root of the estimated variance, so even with ddof=1, it will not be an unbiased estimate of the standard deviation per se.

sum(axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool = False) → arkouda.numpy.dtypes.numeric_scalars | pdarray[source]¶

Return sum of array elements along the given axis.

Parameters:

axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

numeric_scalars if axis is omitted, in which case operation is done over entire array pdarray if axis is supplied, in which case the operation is done along that axis

Return type:

numeric_scalars or pdarray

Raises:

TypeError – Raised if pda is not a pdarray instance
RuntimeError – Raised if there’s a server-side error thrown

Examples

>>> import arkouda as ak
>>> ak.sum(ak.array([1,2,3,4,5]))
np.int64(15)
>>> ak.sum(ak.array([5.5,4.5,3.5,2.5,1.5]))
np.float64(17.5)
>>> ak.array([[1,2,3],[5,4,3]]).sum(axis=1)
array([6 12])

Notes

Works as a method of a pdarray (e.g. a.sum()) or a standalone function (e.g. ak.sum(a))

to_csv(prefix_path: str, dataset: str = 'array', col_delim: str = ',', overwrite: bool = False)[source]¶

Write pdarry to CSV file(s). File will contain a single column with the pdarray data. All CSV files written by Arkouda include a header denoting data types of the columns.

Parameters:

prefix_path (str) – filename prefix to be used for saving files. Files will have _LOCALE#### appended when they are written to disk.
dataset (str, defaults to "array") – column name to save the pdarray under.
col_delim (str, defaults to ",") – value to be used to separate columns within the file. Please be sure that the value used DOES NOT appear in your dataset.
overwrite (bool, defaults to False) – If True, existing files matching the provided path will be overwritten. if False and existing files are found, an error will be returned.

Returns:

response message

Return type:

str

Raises:

ValueError – Raised if all datasets are not present in all parquet files or if one or more of the specified files do not exist
RuntimeError – Raised if one or more of the specified files cannot be opened. if ‘allow_errors’ is true, this may be raised if no values are returned from the server.
TypeError – Raise if the server returns an unknown arkouda_type

Notes

CSV format is not currently supported by load/load_all operations
The column delimiter is expected to be the same for all column names and data
Be sure that column delimiters are not found within your data.
All CSV files must delimit rows using newline (”n”) at this time.

to_cuda()[source]¶

Convert the array to a Numba DeviceND array, transferring array data from the arkouda server to Python via ndarray. If the array exceeds a builtin size limit, a RuntimeError is raised.

Returns:

A Numba ndarray with the same attributes and data as the pdarray; on GPU

Return type:

numba.DeviceNDArray

Raises:

ImportError – Raised if CUDA is not available
ModuleNotFoundError – Raised if Numba is either not installed or not enabled
RuntimeError – Raised if there is a server-side error thrown in the course of retrieving the pdarray.

Notes

The number of bytes in the array cannot exceed client.maxTransferBytes, otherwise a RuntimeError will be raised. This is to protect the user from overflowing the memory of the system on which the Python client is running, under the assumption that the server is running on a distributed system with much more memory than the client. The user may override this limit by setting client.maxTransferBytes to a larger value, but proceed with caution.

See also

array

Examples

>>> import arkouda as ak
>>> a = ak.arange(0, 5, 1)
>>> a.to_cuda()
array([0, 1, 2, 3, 4])
>>> type(a.to_cuda())
numpy.devicendarray

to_hdf(prefix_path: str, dataset: str = 'array', mode: str = 'truncate', file_type: str = 'distribute') → str[source]¶

Save the pdarray to HDF5. The object can be saved to a collection of files or single file.

Parameters:

prefix_path (str) – Directory and filename prefix that all output files share
dataset (str) – Name of the dataset to create in files (must not already exist)
mode (str {'truncate' | 'append'}) – By default, truncate (overwrite) output files, if they exist. If ‘append’, attempt to create new dataset in existing files.
file_type (str ("single" | "distribute")) – Default: “distribute” When set to single, dataset is written to a single file. When distribute, dataset is written on a file per locale. This is only supported by HDF5 files and will have no impact of Parquet Files.

Return type:

string message indicating result of save operation

Raises:

RuntimeError – Raised if a server-side error is thrown saving the pdarray

Notes

The prefix_path must be visible to the arkouda server and the user must

have write permission. - Output files have names of the form <prefix_path>_LOCALE<i>, where <i> ranges from 0 to numLocales for file_type=’distribute’. Otherwise, the file name will be prefix_path. - If any of the output files already exist and the mode is ‘truncate’, they will be overwritten. If the mode is ‘append’ and the number of output files is less than the number of locales or a dataset with the same name already exists, a RuntimeError will result. - Any file extension can be used.The file I/O does not rely on the extension to determine the file format.

Examples

>>> import arkouda as ak
>>> a = ak.arange(25)

Saving without an extension >>> a.to_hdf(‘path/prefix’, dataset=’array’) # doctest: +SKIP Saves the array to numLocales HDF5 files with the name cwd/path/name_prefix_LOCALE####

Saving with an extension (HDF5) >>> a.to_hdf(‘path/prefix.h5’, dataset=’array’) # doctest: +SKIP Saves the array to numLocales HDF5 files with the name cwd/path/name_prefix_LOCALE####.h5 where #### is replaced by each locale number

Saving to a single file >>> a.to_hdf(‘path/prefix.hdf5’, dataset=’array’, file_type=’single’) # doctest: +SKIP Saves the array in to single hdf5 file on the root node. cwd/path/name_prefix.hdf5

to_ndarray() → numpy.ndarray[source]¶

Convert the array to a np.ndarray, transferring array data from the Arkouda server to client-side Python. Note: if the pdarray size exceeds client.maxTransferBytes, a RuntimeError is raised.

Returns:: A numpy ndarray with the same attributes and data as the pdarray
Return type:: np.ndarray
Raises:: RuntimeError – Raised if there is a server-side error thrown, if the pdarray size exceeds the built-in client.maxTransferBytes size limit, or if the bytes received does not match expected number of bytes

Notes

See also

array, tolist

Examples

>>> import arkouda as ak
>>> a = ak.arange(0, 5, 1)
>>> a.to_ndarray()
array([0, 1, 2, 3, 4])
>>> type(a.to_ndarray())
<class 'numpy.ndarray'>

to_parquet(prefix_path: str, dataset: str = 'array', mode: str = 'truncate', compression: str | None = None) → str[source]¶

Save the pdarray to Parquet. The result is a collection of files, one file per locale of the arkouda server, where each filename starts with prefix_path. Each locale saves its chunk of the array to its corresponding file.

Parameters:

prefix_path (str) – Directory and filename prefix that all output files share
dataset (str) – Name of the dataset to create in files (must not already exist)
mode (str {'truncate' | 'append'}) – By default, truncate (overwrite) output files, if they exist. If ‘append’, attempt to create new dataset in existing files.
compression (str (Optional)) – (None | “snappy” | “gzip” | “brotli” | “zstd” | “lz4”) Sets the compression type used with Parquet files

Return type:

string message indicating result of save operation

Raises:

RuntimeError – Raised if a server-side error is thrown saving the pdarray

Notes

The prefix_path must be visible to the arkouda server and the user must

have write permission. - Output files have names of the form <prefix_path>_LOCALE<i>, where <i> ranges from 0 to numLocales for file_type=’distribute’. - ‘append’ write mode is supported, but is not efficient. - If any of the output files already exist and the mode is ‘truncate’, they will be overwritten. If the mode is ‘append’ and the number of output files is less than the number of locales or a dataset with the same name already exists, a RuntimeError will result. - Any file extension can be used.The file I/O does not rely on the extension to determine the file format.

Examples

>>> import arkouda as ak
>>> a = ak.arange(25)

Saving without an extension >>> a.to_parquet(‘path/prefix’, dataset=’array’) # doctest: +SKIP Saves the array to numLocales HDF5 files with the name cwd/path/name_prefix_LOCALE####

Saving with an extension (HDF5) >>> a.to_parqet(‘path/prefix.parquet’, dataset=’array’) # doctest: +SKIP Saves the array to numLocales HDF5 files with the name cwd/path/name_prefix_LOCALE####.parquet where #### is replaced by each locale number

tolist() → List[arkouda.numpy.dtypes.numeric_scalars][source]¶

Convert the array to a list, transferring array data from the Arkouda server to client-side Python. Note: if the pdarray size exceeds client.maxTransferBytes, a RuntimeError is raised.

Returns:: A list with the same data as the pdarray
Return type:: List[numeric_scalars]
Raises:: RuntimeError – Raised if there is a server-side error thrown, if the pdarray size exceeds the built-in client.maxTransferBytes size limit, or if the bytes received does not match expected number of bytes

Notes

See also

to_ndarray

Examples

>>> import arkouda as ak
>>> a = ak.arange(0, 5, 1)
>>> a.tolist()
[0, 1, 2, 3, 4]
>>> type(a.tolist())
<class 'list'>

transfer(hostname: str, port: arkouda.numpy.dtypes.int_scalars)[source]¶

Send a pdarray to a different Arkouda server.

Parameters:

hostname (str) – The hostname where the Arkouda server intended to receive the pdarray is running.
port (int_scalars) – The port to send the array over. This needs to be an open port (i.e., not one that the Arkouda server is running on). This will open up numLocales ports, each of which in succession, so will use ports of the range {port..(port+numLocales)} (e.g., running an Arkouda server of 4 nodes, port 1234 is passed as port, Arkouda will use ports 1234, 1235, 1236, and 1237 to send the array data). This port much match the port passed to the call to ak.receive_array().

Return type:

A message indicating a complete transfer

Raises:

ValueError – Raised if the op is not within the pdarray.BinOps set
TypeError – Raised if other is not a pdarray or the pdarray.dtype is not a supported dtype

unregister() → None[source]¶

Unregister a pdarray in the arkouda server which was previously registered using register() and/or attahced to using attach()

Raises:: RuntimeError – Raised if the server could not find the internal name/symbol to remove

See also

register, unregister, is_registered, unregister_pdarray_by_name, list_registry

Notes

Registered names/pdarrays in the server are immune to deletion until they are unregistered.

Examples

>>> import arkouda as ak
>>> a = ak.zeros(3)
>>> a.register("my_zeros")
array([0.00000000000000000 0.00000000000000000 0.00000000000000000])

potentially disconnect from server and reconnect to server >>> b = ak.attach(“my_zeros”) >>> b.unregister()

update_hdf(prefix_path: str, dataset: str = 'array', repack: bool = True)[source]¶

Overwrite the dataset with the name provided with this pdarray. If the dataset does not exist it is added

Parameters:

prefix_path (str) – Directory and filename prefix that all output files share
dataset (str) – Name of the dataset to create in files
repack (bool) – Default: True HDF5 does not release memory on delete. When True, the inaccessible data (that was overwritten) is removed. When False, the data remains, but is inaccessible. Setting to false will yield better performance, but will cause file sizes to expand.

Return type:

str - success message if successful

Raises:

RuntimeError – Raised if a server-side error is thrown saving the pdarray

Notes

If file does not contain File_Format attribute to indicate how it was saved, the file name is checked for _LOCALE#### to determine if it is distributed.
If the dataset provided does not exist, it will be added

value_counts()[source]¶

Count the occurrences of the unique values of self.

Returns:

unique_valuespdarray: The unique values, sorted in ascending order
countspdarray, int64: The number of times the corresponding unique value occurs

Return type:

pdarray, pdarray|int64

Examples

>>> import arkouda as ak
>>> ak.array([2, 0, 2, 4, 0, 0]).value_counts()
(array([0 2 4]), array([3 2 1]))

var(ddof: arkouda.numpy.dtypes.int_scalars = 0, axis: int | Tuple[int, Ellipsis] | None = None, keepdims: bool | None = False) → numpy.float64 | pdarray[source]¶

Return the variance of values in the array.

Parameters:

ddof (int_scalars) – “Delta Degrees of Freedom” used in calculating var
axis (int, Tuple[int, ...], optional, default = None) – The axis or axes along which to do the operation If None, the computation is done across the entire array.
keepdims (bool, optional, default = False) – Whether to keep the singleton dimension(s) along axis in the result.

Returns:

The scalar variance of the array, or the variance along the axis/axes if supplied

Return type:

Union[np.float64, pdarray]

Examples

>>> import arkouda as ak
>>> a = ak.arange(10)
>>> ak.var(a)
np.float64(8.25)
>>> a.var()
np.float64(8.25)
>>> a = ak.arange(10).reshape(2,5)
>>> a.var(axis=0)
array([6.25 6.25 6.25 6.25 6.25])
>>> ak.var(a,axis=0)
array([6.25 6.25 6.25 6.25 6.25])
>>> a.var(axis=1)
array([2.00000000000000000 2.00000000000000000])
>>> ak.var(a,axis=1)
array([2.00000000000000000 2.00000000000000000])

Raises:

TypeError – Raised if pda is not a pdarray instance
ValueError – Raised if the ddof >= pdarray size
RuntimeError – Raised if there’s a server-side error thrown

See also

mean, std

Notes

The variance is the average of the squared deviations from the mean, i.e., var = mean((x - x.mean())**2).

The mean is normally calculated as x.sum() / N, where N = len(x). If, however, ddof is specified, the divisor N - ddof is used instead. In standard statistical practice, ddof=1 provides an unbiased estimator of the variance of a hypothetical infinite population. ddof=0 provides a maximum likelihood estimate of the variance for normally distributed variables.

arkouda.numpy.pdarrayclass.popcount(pda: pdarray) → pdarray[source]¶

Find the population (number of bits set) for each integer in an array.

Parameters:: pda (pdarray, int64, uint64, bigint) – Input array (must be integral).
Returns:: The number of bits set (1) in each element
Return type:: pdarray
Raises:: TypeError – If input array is not int64, uint64, or bigint

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.popcount(A)
array([0 1 1 2 1 2 2 3 1 2])

arkouda.numpy.pdarrayclass.power(pda: pdarray, pwr: int | float | pdarray, where: arkouda.numpy.dtypes.bool_scalars | pdarray = True) → pdarray[source]¶

Raises an array to a power. If where is given, the operation will only take place in the positions where the where condition is True.

Note: Our implementation of the where argument deviates from numpy. The difference in behavior occurs at positions where the where argument contains a False. In numpy, these position will have uninitialized memory (which can contain anything and will vary between runs). We have chosen to instead return the value of the original array in these positions.

Parameters:

pda (pdarray) – A pdarray of values that will be raised to a power (pwr)
pwr (integer, float, or pdarray) – The power(s) that pda is raised to
where (Boolean or pdarray) – This condition is broadcast over the input. At locations where the condition is True, the corresponding value will be raised to the respective power. Elsewhere, it will retain its original value. Default set to True.

Returns:

a pdarray of values raised to a power, under the boolean where condition.

Return type:

pdarray

Examples

>>> import arkouda as ak
>>> a = ak.arange(5)
>>> ak.power(a, 3)
array([0 1 8 27 64])
>>> ak.power(a, 3, a % 2 == 0)
array([0 1 8 3 64])

Raises:

TypeError – raised if pda is not a pdarray, or if pwe is not an int, float, or pdarray
ValueError – raised if pda and power are of incompatible dimensions

arkouda.numpy.pdarrayclass.rotl(x, rot) → pdarray[source]¶

Rotate bits of <x> to the left by <rot>.

Parameters:

x (pdarray(int64/uint64) or integer) – Value(s) to rotate left.
rot (pdarray(int64/uint64) or integer) – Amount(s) to rotate by.

Returns:

The rotated elements of x.

Return type:

pdarray

Raises:

TypeError – If input array is not int64 or uint64

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.rotl(A, A)
array([0 2 8 24 64 160 384 896 2048 4608])

arkouda.numpy.pdarrayclass.rotr(x, rot) → pdarray[source]¶

Rotate bits of <x> to the left by <rot>.

Parameters:

x (pdarray(int64/uint64) or integer) – Value(s) to rotate left.
rot (pdarray(int64/uint64) or integer) – Amount(s) to rotate by.

Returns:

The rotated elements of x.

Return type:

pdarray

Raises:

TypeError – If input array is not int64 or uint64

Examples

>>> import arkouda as ak
>>> A = ak.arange(10)
>>> ak.rotr(1024 * A, A)
array([0 512 512 384 256 160 96 56 32 18])

arkouda.numpy.pdarrayclass.sqrt(pda: pdarray, where: arkouda.numpy.dtypes.bool_scalars | pdarray = True) → pdarray[source]¶

Takes the square root of array. If where is given, the operation will only take place in the positions where the where condition is True.

Parameters:

pda (pdarray) – A pdarray of values the square roots of which will be computed
where (Boolean or pdarray) – This condition is broadcast over the input. At locations where the condition is True, the corresponding value will be square rooted. Elsewhere, it will retain its original value. Default set to True.

Returns:

a pdarray of square roots of the original values, or the original values themselves, subject to the boolean where condition.

Return type:

pdarray

Examples

>>> import arkouda as ak
>>> a = ak.arange(5)
>>> ak.sqrt(a)
array([0.00000000000000000 1.00000000000000000 1.4142135623730951
         1.7320508075688772 2.00000000000000000])
>>> ak.sqrt(a, ak.array([True, True, False, False, True]))
array([0.00000000000000000 1.00000000000000000 2.00000000000000000
         3.00000000000000000 2.00000000000000000])

Raises:: TypeError – raised if pda is not a pdarray of ak.int64 or ak.float64

Notes

Square roots of negative numbers are returned as nan.