Sparsity and Linearity-Exploiting Interior-Point solver - Now Internally Readable
Named after Odin's eight-legged horse from Norse mythology, Sleipnir is a linearity-exploiting sparse nonlinear constrained optimization problem solver that uses the interior-point method.
#include <print>
#include <sleipnir/optimization/OptimizationProblem.hpp>
int main() {
// Find the x, y pair with the largest product for which x + 3y = 36
sleipnir::OptimizationProblem problem;
auto x = problem.DecisionVariable();
auto y = problem.DecisionVariable();
problem.Maximize(x * y);
problem.SubjectTo(x + 3 * y == 36);
problem.Solve();
// x = 18.0, y = 6.0
std::println("x = {}, y = {}", x.Value(), y.Value());
}
#!/usr/bin/env python3
from jormungandr.optimization import OptimizationProblem
def main():
# Find the x, y pair with the largest product for which x + 3y = 36
problem = OptimizationProblem()
x = problem.decision_variable()
y = problem.decision_variable()
problem.maximize(x * y)
problem.subject_to(x + 3 * y == 36)
problem.solve()
# x = 18.0, y = 6.0
print(f"x = {x.value()}, y = {y.value()}")
if __name__ == "__main__":
main()
Sleipnir's internals are intended to be readable by those who aren't domain experts with links to explanatory material for its algorithms.
flywheel-scalability-results-casadi.csv
flywheel-scalability-results-sleipnir.csv |
cart-pole-scalability-results-casadi.csv
cart-pole-scalability-results-sleipnir.csv |
Generated by tools/generate-scalability-results.sh from benchmarks/scalability source.
- CPU: AMD Ryzen 7 7840U
- RAM: 64 GB, 5600 MHz DDR5
- Compiler version: g++ (GCC) 14.2.1 20240805
The following thirdparty software was used in the benchmarks:
- CasADi 3.6.7 (autodiff and NLP solver frontend)
- Ipopt 3.14.17 (NLP solver backend)
- MUMPS 5.7.3 (linear solver)
Ipopt uses MUMPS by default because it has free licensing. Commercial linear solvers may be much faster.
See benchmark details for more.
Sleipnir requires somewhat newer operating systems and C++ runtimes for std::print().
- Windows
- OS: Windows 10
- Runtime: Microsoft Visual C++ 2022 redistributable
- Linux
- OS: Ubuntu 24.04
- Runtime: GCC 14 libstdc++ (run
sudo apt install g++-14
)
- macOS
- OS: macOS 14
- Runtime: Apple Clang 15.0.0 libc++ from Xcode 15.3 (run
xcode-select --install
)
See the build instructions.
pip install sleipnirgroup-jormungandr
See the C++ API docs and Python API docs.
See the examples, C++ optimization unit tests, and Python optimization unit tests.
- C++23 compiler
- On Windows 10 or greater, install Visual Studio Community 2022 and select the C++ programming language during installation
- On Ubuntu 24.04 or greater, install GCC 14 via
sudo apt install g++-14
- On macOS 14 or greater, install the Xcode 15.3 command-line build tools via
xcode-select --install
- CMake 3.21 or greater
- On Windows, install from the link above
- On Linux, install via
sudo apt install cmake
- On macOS, install via
brew install cmake
- Python 3.9 or greater
- On Windows, install from the link above
- On Linux, install via
sudo apt install python
- On macOS, install via
brew install python
- Eigen
- nanobind (build only)
- Catch2 (tests only)
Library dependencies which aren't installed locally will be automatically downloaded and built by CMake.
The benchmark executables require CasADi to be installed locally.
On Windows, open a Developer PowerShell. On Linux or macOS, open a Bash shell.
# Clone the repository
git clone [email protected]:SleipnirGroup/Sleipnir
cd Sleipnir
# Configure; automatically downloads library dependencies
cmake -B build -S .
# Build
cmake --build build
# Test
ctest --test-dir build --output-on-failure
# Install
cmake --install build --prefix pkgdir
The following build types can be specified via -DCMAKE_BUILD_TYPE
during CMake configure:
- Debug
- Optimizations off
- Debug symbols on
- Release
- Optimizations on
- Debug symbols off
- RelWithDebInfo (default)
- Release build type, but with debug info
- MinSizeRel
- Minimum size release build
- Asan
- Enables address sanitizer
- Tsan
- Enables thread sanitizer
- Ubsan
- Enables undefined behavior sanitizer
- Perf
- RelWithDebInfo build type, but with frame pointer so perf utility can use it
On Windows, open a Developer PowerShell. On Linux or macOS, open a Bash shell.
# Clone the repository
git clone [email protected]:SleipnirGroup/Sleipnir
cd Sleipnir
# Setup
pip install --user build
# Build
python -m build --wheel
# Install
pip install --user dist/sleipnirgroup_jormungandr-*.whl
# Test
pytest
Passing the --enable-diagnostics
flag to the test executable enables solver diagnostic prints.
Some test problems generate CSV files containing their solutions. These can be plotted with tools/plot_test_problem_solutions.py.
Benchmark projects are in the benchmarks folder. To compile and run them, run the following in the repository root:
# Install CasADi and [matplotlib, numpy, scipy] pip packages first
cmake -B build -S . -DBUILD_BENCHMARKS=ON
cmake --build build
./tools/generate-scalability-results.sh
See the contents of ./tools/generate-scalability-results.sh
for how to run specific benchmarks.
During problem setup, equality and inequality constraints are encoded as different types, so the appropriate setup behavior can be selected at compile time via operator overloads.
The autodiff library automatically records the linearity of every node in the computational graph. Linear functions have constant first derivatives, and quadratic functions have constant second derivatives. The constant derivatives are computed in the initialization phase and reused for all solver iterations. Only nonlinear parts of the computational graph are recomputed during each solver iteration.
For quadratic problems, we compute the Lagrangian Hessian and constraint Jacobians once with no problem structure hints from the user.
Eigen provides these. It also has no required dependencies, which makes cross compilation much easier.
This promotes fast allocation/deallocation and good memory locality.
We could mitigate the solver's high last-level-cache miss rate (~42% on the machine above) further by breaking apart the expression nodes into fields that are commonly iterated together. We used to use a tape, which gave computational graph updates linear access patterns, but tapes are monotonic buffers with no way to reclaim storage.