Skip to content
This repository has been archived by the owner on Aug 30, 2024. It is now read-only.

[BesTLA] Improve RTN quantization accuracy of int4 and int3 #210

[BesTLA] Improve RTN quantization accuracy of int4 and int3

[BesTLA] Improve RTN quantization accuracy of int4 and int3 #210

Workflow file for this run

name: Windows Binary Test
on:
pull_request:
branches: [main]
paths:
- ".github/workflows/windows-test.yml"
- "requirements.txt"
- "setup.py"
- "neural_speed/**"
- "bestla/**"
- '!bestla/ut/**'
- '!bestla/xbyak/**'
- '!bestla/xbyak/*.md'
- '!neural_speed/*.md'
workflow_dispatch:
# If there is a new commit, the previous jobs will be canceled
concurrency:
group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
cancel-in-progress: true
env:
SCRIPT_PATH: ${{ github.workspace }}\.github\workflows\script
WORKING_DIR: ${{ github.workspace }}
jobs:
Windows-Binary-Test:
runs-on: 'Windows'
steps:
- name: Checkout out Repo
uses: actions/checkout@v4
with:
submodules: "recursive"
fetch-tags: true
path: "a"
- name: Binary build
shell: cmd
run: |
SET HTTP_PROXY=${{ vars.HTTP_PROXY_WINDOWS }}
SET HTTPS_PROXY=${{ vars.HTTP_PROXY_WINDOWS }}
cd ${{ github.workspace }}\a\.github\workflows\scripts
prepare_env_with_conda.bat