epic: Cortex.cpp to support Python? #1353

Open
dan-homebrew opened this issue Sep 29, 2024 · 2 comments · May be fixed by janhq/cortex.python#20

dan-homebrew commented Sep 29, 2024

Goal

Tasklist

  • Architecture question: how do we support Python as part of Cortex?

Previous Discussions

  • Python as a separate process (i.e. we just throw an error message and expect the user to already have Python installed?)
  • Or: is there a way for us to package Python?
@dan-homebrew dan-homebrew converted this from a draft issue Sep 29, 2024
@dan-homebrew dan-homebrew changed the title epic: Cortex.cpp to support Python Runtime? epic: Cortex.cpp to support Python? Sep 29, 2024
nguyenhoangthuan99 commented Oct 2, 2024

cortex.python Integration Architecture

Overview

This document outlines the architecture for integrating Python functionality into a C++ application, specifically for running machine learning models. The system uses a proxy approach to connect the C++ application (cortex.cpp) with Python processes, allowing for isolated environments for different models.

Architecture Diagram

[Architecture diagram image]

Key Components

  1. cortex.cpp: The main C++ application.
  2. cortex.python: A proxy engine that connects cortex.cpp with Python processes.
  3. Python Processes: Separate processes spawned for each model execution.
  4. Virtual Environments: Isolated Python environments for each model.

Folder Structure

cortexcpp/
├── models/
│   └── cortexso/
│       └── python/
│           └── whisper/
│               ├── model-binary.pth
│               ├── whisper.py
│               ├── main.py
│               └── requirements.txt
├── engines/
│   ├── cortex.llamacpp/
│   └── cortex.python/
│       ├── libengine.so      # proxy interface between python models and cortex.cpp
│       └── venv/             # virtual environments
│           ├── whisper/
│           │   ├── lib/      # python libraries and dependencies for whisper
│           │   └── bin/
│           │       └── python3.12  # python executable for whisper
│           ├── fish-speech/
│           └── vision/
Processes

Model Pulling

  1. cortex.cpp sends a pull request to cortex.python.
  2. cortex.python creates a virtual environment for the model.
  3. A Python interpreter is pulled into the newly created virtual environment.
  4. The model code and weights are pulled from cortexso.
  5. Dependencies are installed into the virtual environment: /path/to/venv/bin/python -m pip install -r requirements.txt

The pulling step also installs the engine for running the Python model: for a Python model, the engine (or backend) is simply the set of libraries and dependencies inside its virtual environment.
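
A minimal sketch of this flow, assuming a POSIX layout and a bundled python3 on the PATH (paths and names are illustrative, not the actual implementation):

import subprocess
from pathlib import Path

ENGINE_DIR = Path.home() / "cortexcpp" / "engines" / "cortex.python"  # assumed layout

def pull_model(model: str, requirements: Path) -> None:
    """Create an isolated venv for `model` and install its dependencies."""
    venv_dir = ENGINE_DIR / "venv" / model
    # 1. Create the virtual environment with the packaged interpreter.
    subprocess.run(["python3", "-m", "venv", str(venv_dir)], check=True)
    # 2. Install the model's dependencies into the venv, as in step 5 above.
    venv_python = venv_dir / "bin" / "python3"
    subprocess.run([str(venv_python), "-m", "pip", "install", "-r", str(requirements)], check=True)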

Model Execution

  1. cortex.cpp sends an execution request to cortex.python.
  2. cortex.python spawns a new process.
  3. The process runs main.py inside the model's virtual environment (i.e. its engine/backend).
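
A possible sketch of the spawn step, reusing the hypothetical layout above (the real cortex.python proxy is a C++ shared library; this Python version is only for illustration):

import subprocess
from pathlib import Path

def start_model(model: str) -> subprocess.Popen:
    """Spawn the model's main.py inside its own virtual environment."""
    base = Path.home() / "cortexcpp"
    venv_python = base / "engines" / "cortex.python" / "venv" / model / "bin" / "python3"
    entry = base / "models" / "cortexso" / "python" / model / "main.py"
    # Each model gets its own OS process, so a crash cannot take down cortex.cpp.
    return subprocess.Popen([str(venv_python), str(entry)])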

Chat Functionality

  1. cortex.cpp sends a chat request to cortex.python.
  2. cortex.python communicates with the Python process via WebSocket, Unix domain socket, or a similar IPC mechanism.
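
The transport is left open above; as one concrete option, a single-shot exchange over a Unix domain socket could look like this (socket path and framing are assumptions, POSIX-only):

import socket

def send_chat(sock_path: str, request: bytes) -> bytes:
    """Forward one chat request to the model process and return its reply."""
    with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as s:
        s.connect(sock_path)
        s.sendall(request)
        return s.recv(65536)  # toy framing; a real protocol would length-prefix messages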

Implementation Details

Python Interface

  • Create an abstract Python interface that wraps the inference logic and handles communication with cortex.cpp
  • Implement a predict function (or similar) containing each model's specific inference logic
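
One way the abstract interface could look (all names hypothetical; the issue only specifies that a predict-style function wraps each model's inference logic):

from abc import ABC, abstractmethod
from typing import Any

class ModelInterface(ABC):
    """Base class each model's main.py would implement."""

    @abstractmethod
    def load(self, model_path: str) -> None:
        """Load model weights, e.g. model-binary.pth."""

    @abstractmethod
    def predict(self, request: dict[str, Any]) -> dict[str, Any]:
        """Run the model-specific inference logic and return the result."""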

Virtual Environments

  • Each model has its own virtual environment to avoid dependency conflicts
  • Virtual environments are created and managed by cortex.python (see the sketch after this list)
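
For reference, the standard library can do this directly; a minimal sketch (the path is assumed from the folder structure above):

import venv

# Create an isolated environment for one model, with pip available for the install step.
builder = venv.EnvBuilder(with_pip=True)
builder.create("engines/cortex.python/venv/whisper")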

Packaged Python

  • Python installation is packaged with the cortex.python engine
  • Users don't need to install Python separately

Model Execution

  • Each model runs in its own process for isolation
  • main.py is the entry point for each model
  • sys.path can be modified to locate model-specific modules (e.g., whisper.py); see the sketch after this list
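
A sketch of that sys.path adjustment at the top of a model's main.py (the import refers to the local whisper.py from the folder structure above, not a site-packages package):

import sys
from pathlib import Path

# Make model-specific modules next to main.py importable.
MODEL_DIR = Path(__file__).resolve().parent
sys.path.insert(0, str(MODEL_DIR))

import whisper  # resolves to whisper.py in the model folder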

@nguyenhoangthuan99 nguyenhoangthuan99 linked a pull request Oct 3, 2024 that will close this issue
@dan-homebrew
Contributor Author

@nguyenhoangthuan99 @vansangpfiev @namchuai I would like to raise a concern here, and propose a (possibly incorrect) alternative:

Engines as 1st class Citizens of Cortex

  • Engines are 1st-class objects in Cortex, e.g. llama.cpp, Whisper, Fish Speech etc
  • Engines package their dependencies, e.g. CUDA, or Python, or whatever else

This has the following benefits:

  • Each "engine" can define its own set of Python dependencies
  • Python dependencies have traditionally been hell, and I anticipate a lot of incompatibility issues between engines
  • We can define a clear Engine interface (this is already being used by a couple of engineers in Discord)

How this would work

  • We focus on our Engines interface, and define a way to package a Python runtime and dependencies (rough sketch after this list)
  • Each Engine is firewalled from other engines, and maintains its own Python version, dependencies, etc
  • In the future, we can add optimizations (i.e. shared Python versions)
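
To make the proposal concrete, a rough sketch of such an interface (written in Python for brevity; the actual engine boundary would live in C++, and every name here is hypothetical):

from abc import ABC, abstractmethod

class Engine(ABC):
    """A 1st-class engine that packages its own runtime and dependencies."""

    @abstractmethod
    def install(self) -> None:
        """Fetch the engine's pinned runtime (e.g. a specific Python + wheels)."""

    @abstractmethod
    def load_model(self, model_id: str) -> None: ...

    @abstractmethod
    def infer(self, request: dict) -> dict: ...

    @abstractmethod
    def unload(self) -> None: ...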

@vansangpfiev vansangpfiev self-assigned this Oct 21, 2024