PyKX is a Python first interface to the worlds fastest time-series database kdb+ and it's underlying vector programming language q. PyKX takes a Python first approach to integrating q/kdb+ with Python following 10+ years of integrations between these two languages. Fundamentally it provides users with the ability to efficiently query and analyze huge amounts of in-memory and on-disk time-series data.
This interface exposes q as a domain-specific language (DSL) embedded within Python, taking the approach that q should principally be used for data processing and management of databases. This approach does not diminish the ability for users familiar with q or those wishing to learn more about it from making the most of advanced analytics and database management functionality but rather empowers those who want to make use of the power of kdb+/q who lack this expertise to get up and running fast.
PyKX supports three principal use cases:
- It allows users to store, query, manipulate and use q objects within a Python process.
- It allows users to query external q processes via an IPC interface.
- It allows users to embed Python functionality within a native q session using it's under q functionality.
Users wishing to install the library can do so following the instructions here.
Once you have the library installed you can get up and running with PyKX following the quickstart guide here.
Mentioned throughout the documentation q and kdb+ are respectively a highly efficient vector programming language and highly optimised time-series database used to analyse streaming, real-time and historical data. Used throughout the financial sector for 25+ years this technology has been a cornerstone of modern financial markets providing a storage mechanism for historical market data and tooling to make the analysis of this vast data performant.
Kdb+ is a high-performance column-oriented database designed to process and store large amounts of data. Commonly accessed data is available in RAM which makes it faster to access than disk stored data. Operating with temporal data types as a first class entity the use of q and it's query language qsql against this database creates a highly performant time-series analysis tool.
q is the vector programming language which is used for all interactions with kdb+ databases and which is known both for its speed and expressiveness.
For more information on using q/kdb+ and getting started with see the following links:
Ensure you have a recent version of pip:
pip install --upgrade pip
Then install the latest version of PyKX with the following command:
pip install pykx
To install a specific version of PyKX run the following command replacing <INSERT_VERSION> with a specific released semver version of the interface
pip install pykx==<INSERT_VERSION>
Warning: Python packages should typically be installed in a virtual environment. This can be done with the venv package from the standard library.
Installation of PyKX via pip provides users with access to the library with limited functional scope, full details of these limitations can be found here. To access the full functionality of PyKX you must first download and install a kdb+ license, this can be achieved either through use of a personal evaluation license or receipt of a commercial license.
The following steps outline the process by which a user can gain access to an install a kdb Insights license which provides access to PyKX
- Visit https://kx.com/kdb-insights-personal-edition-license-download/ and fill in the attached form following the instructions provided.
- On receipt of an email from KX providing access to your license download this file and save to a secure location on your computer.
- Set an environment variable on your computer pointing to the folder containing the license file (instructions for setting environment variables on PyKX supported operating systems can be found here.
- Variable Name:
QLIC
- Variable Value:
/user/path/to/folder
- Variable Name:
The following steps outline the process by which a user can gain access to an install a kdb Insights license which provides access to PyKX
- Visit https://kx.com/kdb-insights-commercial-evaluation-license-download/ and fill in the attached form following the instructions provided.
- On receipt of an email from KX providing access to your license download this file and save to a secure location on your computer.
- Set an environment variable on your computer pointing to the folder containing the license file (instructions for setting environment variables on PyKX supported operating systems can be found here.
- Variable Name:
QLIC
- Variable Value:
/user/path/to/folder
- Variable Name:
Note: PyKX will not operate with a vanilla or legacy kdb+ license which does not have access to specific feature flags embedded within the license. In the absence of a license with appropriate feature flags PyKX will fail to initialise with full feature functionality.
KX only officially supports versions of PyKX built by KX, i.e. versions of PyKX installed from wheel files. Support for user-built installations of PyKX (e.g. built from the source distribution) is only provided on a best-effort basis. Currently, PyKX provides wheels for the following environments:
- Linux (
manylinux_2_17_x86_64
) with CPython 3.8-3.11 - macOS (
macosx_10_10_x86_64
) with CPython 3.8-3.11 - Windows (
win_amd64
) with CPython 3.8-3.11
PyKX depends on the following third-party Python packages:
pandas~=1.2
numpy~=1.22
They are installed automatically by pip
when PyKX is installed.
PyKX also has an optional Python dependency of pyarrow>=3.0.0
, which can be included by installing the pyarrow
extra, e.g. pip install pykx[pyarrow]
Warning: Trying to use the pa
conversion methods of pykx.K
objects or the pykx.toq.from_arrow
method when PyArrow is not installed (or could not be imported without error) will raise a pykx.PyArrowUnavailable
exception. pyarrow
is supported Python 3.8-3.10 but remains in Beta for Python 3.11.
libssl
for TLS on IPC connections.
To run q or PyKX on Windows, msvcr100.dll
must be installed. It is included in the Microsoft Visual C++ 2010 Redistributable.
The full list of supported environments is detailed here. Installation of dependencies will vary on different platforms.
apt
example:
apt-install python3 python3-venv build-essential python3-dev
yum
example:
yum install python3 gcc gcc-c++ python3-devel.x86_64
Windows:
- Python
- Build Tools for Visual Studio.
- dlfcn-win32. Can be installed using Vcpkg.
msvcr100.dll
. Available in Microsoft Visual C++ 2010 Redistributable.
Using a Python virtual environment is recommended:
python3 -m venv pykx-dev
source pykx-dev/bin/activate
Build and install PyKX:
cd pykx
pip3 install -U '.[all]'
To run PyKX in licensed mode ensure to follow the steps to receive a Personal Evaluation License
Now you can run/test PyKX:
(pykx-dev) /data/pykx$ python
Python 3.10.6 (main, May 29 2023, 11:10:38) [GCC 11.3.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import pykx
>>> pykx.q('1+1')
pykx.LongAtom(pykx.q('2'))
Contributions to the project must pass a linting check:
pflake8
Contributions to the project must include tests. To run tests:
export PATH="$PATH:/location/of/your/q/l64" # q must be on PATH for tests
export QHOME=/location/of/your/q #q needs QHOME available
python -m pytest -vvv -n 0 --no-cov --junitxml=report.xml
This work is dual licensed under Apache 2.0 and the Software License for q.so and users are required to abide by the terms of both licenses in their entirety.
If you have any issues or questions you can post them to community.kx.com. Also available on Stack Overflow are the tags pykx and kdb.
- Inquires or feedback:
[email protected]
- Support for Licensed Subscribers: support.kx.com