Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Port 24.04 changes into main #851

Merged
merged 12 commits into from
Jun 7, 2024
Merged

Port 24.04 changes into main #851

merged 12 commits into from
Jun 7, 2024

Commits on May 23, 2024

  1. Configuration menu
    Copy the full SHA
    f834f4e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    060b1e8 View commit details
    Browse the repository at this point in the history

Commits on May 24, 2024

  1. Configuration menu
    Copy the full SHA
    07fe41a View commit details
    Browse the repository at this point in the history
  2. Pax2TEConverter support for LLaMa2 checkpoints (#641)

    **Changes:**
    - New option flags `--pax-split-qkv` and
    `--te-qkv-layout={qkv_packed,kv_packed}` for identifying QKV layouts for
    both the Pax source and TE target. Default behavior without new flags
    remains unchanged.
    - New option flag `--use-gated-activations` to support gated GELU in
    LLaMa2.
    - Bias terms are omitted from the checkpoint map when `--weight-only` is
    used.
    
    **Notes:**
    - Changes are verified with a 70b LLaMa2 checkpoint previously converted
    with @mingxu1067's hard-coded LLaMa2-only converter script.
    ```
    python converter/main.py \
        --input-path=<...> \
        --output-path=<...> \
        --fw=pax --direction=fw2te \
        --num-of-layer=80 \
        --num-of-head=64 \
        --head-dim=128  \
        --mlp-intermediate-dim=28672 \
        --kernel-chunk-size=512 \
        --weight-only \
        --use-gated-activations \
        --pax-split-qkv \
        --te-qkv-layout=kv_packed
    ```
    
    ---------
    
    Signed-off-by: Alp Dener <[email protected]>
    Co-authored-by: ashors1 <[email protected]>
    denera and ashors1 committed May 24, 2024
    Configuration menu
    Copy the full SHA
    3e33819 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    fa64660 View commit details
    Browse the repository at this point in the history
  4. fix PaliGemma use-cases

    ashors1 committed May 24, 2024
    Configuration menu
    Copy the full SHA
    3b99cdf View commit details
    Browse the repository at this point in the history

Commits on May 28, 2024

  1. Configuration menu
    Copy the full SHA
    1bfbc43 View commit details
    Browse the repository at this point in the history

Commits on May 29, 2024

  1. Configuration menu
    Copy the full SHA
    41c781c View commit details
    Browse the repository at this point in the history
  2. fix readme

    ashors1 committed May 29, 2024
    Configuration menu
    Copy the full SHA
    7d691e1 View commit details
    Browse the repository at this point in the history

Commits on May 30, 2024

  1. minor fix

    ashors1 committed May 30, 2024
    Configuration menu
    Copy the full SHA
    5972a5e View commit details
    Browse the repository at this point in the history

Commits on Jun 5, 2024

  1. Configuration menu
    Copy the full SHA
    8fe0f98 View commit details
    Browse the repository at this point in the history

Commits on Jun 7, 2024

  1. Configuration menu
    Copy the full SHA
    fbdaf7a View commit details
    Browse the repository at this point in the history