Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix merging conflict for upgrade from 0.12.1 to 0.14 #719

Closed
wants to merge 282 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
282 commits
Select commit Hold shift + click to select a range
124d36f
Remove GOARCH (#3523)
mkumatag Mar 17, 2024
488012e
upgrade knative to 1.13 (#3457)
andyi2it Mar 17, 2024
20cf546
Update KServe project description (#3524)
yuzisun Mar 18, 2024
a652036
Upgrade istio to 1.20 works for the Github Actions (#3529)
houshengbo Mar 18, 2024
62ce7de
Update kserve cake diagram (#3530)
yuzisun Mar 19, 2024
11e0ab2
Remove white background for the kserve diagram (#3531)
yuzisun Mar 19, 2024
e73f50c
Merge branch 'uppstream/master' into 'odh/master'
spolti Mar 20, 2024
f6d0b8d
Add capability to run RawDeployment E2Es in OpenShift-ci
israel-hdez Feb 20, 2024
c8e53e9
Fix CI: Serverless removed domain-mapping deployment
israel-hdez Mar 22, 2024
a3f7d15
Merge pull request #277 from israel-hdez/ci-fix-serverless-1320
openshift-merge-bot[bot] Mar 22, 2024
050d23c
Merge pull request #275 from spolti/sync
openshift-merge-bot[bot] Mar 22, 2024
8c722b6
GH Alert: Potential file inclusion via variable (#3520)
spolti Mar 23, 2024
a98e22d
fix a typo in OPENSHIFT_GUIDE.md (#3544)
marek-veber Mar 24, 2024
c7a8d7a
Make the modelcar injection idempotent (#3517)
rhuss Mar 24, 2024
56b8fe0
Fix bug: Remove redundant helm chart affinity on predictor CRD (#3481)
trojaond Mar 24, 2024
fab6583
Merge pull request #220 from israel-hdez/e2e-raw
openshift-merge-bot[bot] Mar 25, 2024
731fc0f
Update codeQL to v3 (#3548)
spolti Mar 30, 2024
673818c
switch e2e test inference graph to raw mode (#3511)
andyi2it Mar 30, 2024
6f942c8
Only pad left for decode-only architecture models. (#3534)
sivanantha321 Mar 30, 2024
5a41507
Only load from model repository if model binary is not found under mo…
sivanantha321 Mar 30, 2024
fda7b41
CVE-2024-24762 - update fastapi to 0.109.1 (#3556)
spolti Mar 30, 2024
382f41a
chore: Remove Seldon Alibi dependencies. Fixes #3380 (#3443)
terrytangyuan Apr 1, 2024
c993521
[RHOAIENG-5073] - Routing and Headless Service Support in KServe Raw …
spolti Apr 2, 2024
8350c7c
Black lint (#3568)
cmaddalozzo Apr 3, 2024
f678243
fix lint typo on Makefile (#3569)
spolti Apr 3, 2024
6f8f037
Support model revision and tokenizer revision in huggingface server (…
lizzzcai Apr 3, 2024
b855d52
comment ce logs for headless services
spolti Apr 4, 2024
9b5ea93
Merge pull request #280 from spolti/RHOAIENG-5073
openshift-merge-bot[bot] Apr 4, 2024
80bf4a0
OpenAI schema (#3477)
tessapham Apr 5, 2024
adfad6b
Fix typo in README.md (#3575)
terrytangyuan Apr 5, 2024
6e88700
Fix python linter (#3571)
sivanantha321 Apr 5, 2024
d80772b
Merge branch 'upstream/master' into 'odh/master'
spolti Apr 5, 2024
96ca17a
docs: Move Alibi explainer to docs (#3579)
terrytangyuan Apr 7, 2024
fc1b99f
lint the required test files
spolti Apr 8, 2024
153e56f
Merge pull request #289 from spolti/sync2
openshift-merge-bot[bot] Apr 8, 2024
19314ab
build: Add flake8 and black to pre-commit hooks (#3578)
terrytangyuan Apr 9, 2024
d0cb667
fix: Set writable cache folder to avoid permission issue. Fixes #3562…
terrytangyuan Apr 9, 2024
9fd262f
golang.org/x/net Allocation of Resources Without Limits or Throttling…
spolti Apr 14, 2024
888afd5
Support OpenAIModel in ModelRepository (#3590)
grandbora Apr 16, 2024
d01d6c3
updated xgboost to support json and ubj models (#3551)
andyi2it Apr 16, 2024
385b8c2
Security fix - CVE 2024 24786 (#3585)
andyi2it Apr 16, 2024
3c5d8b4
remove unnecessary operators
Jooho Apr 17, 2024
9a43dfa
Create Workflow for release and tag with Changelog (#287)
rpancham Apr 19, 2024
9cc4eed
VLLM support for OpenAI Completions in HF server (#3589)
gavrissh Apr 19, 2024
09a9168
Add a user friendly error message for http exceptions (#3581)
grandbora Apr 20, 2024
362cd18
Fix model unload in server stop method (#3587)
sivanantha321 Apr 21, 2024
f1b4d4b
feat: Provide minimal distribution of CRDs (#3492)
terrytangyuan Apr 21, 2024
af37834
build: Allow pre-commit to keep changes in reformatted code (#3604)
terrytangyuan Apr 21, 2024
39a4866
Fix golint errors (#3552)
andyi2it Apr 21, 2024
7ade501
Merge pull request #316 from Jooho/remove_ops
Jooho Apr 22, 2024
9a897ed
Fix make deploy-dev-storage-initializer not working (#3617)
sivanantha321 Apr 23, 2024
ec59f43
Fix Pydantic 2 warnings (#3622)
cmaddalozzo Apr 23, 2024
34ea6f0
build: Fix CRD copying in generate-install.sh (#3620)
terrytangyuan Apr 23, 2024
6c23281
chore: v0.12.1 install files (#3619)
terrytangyuan Apr 23, 2024
cefdc2e
Fix CVE-2023-45288 for qpext (#3618)
sivanantha321 Apr 23, 2024
47265b2
Allow rerunning failed workflows by comment (#3550)
andyi2it Apr 25, 2024
a9d747e
Add OpenAI API support to Huggingfaceserver (#3582)
cmaddalozzo Apr 25, 2024
0549c98
add re-run info in the PR templates (#3633)
spolti Apr 27, 2024
e27aeec
chore: Bump ModelMesh version to v0.12.0-rc0 in Helm chart (#3642)
terrytangyuan Apr 28, 2024
5b51837
build: Remove misleading logs from minimal-crdgen.sh (#3641)
terrytangyuan Apr 28, 2024
622f32f
Add e2e tests for huggingface (#3600)
sivanantha321 Apr 29, 2024
1c5b0f9
set default SAFETENSORS_FAST_GPU and HF_HUB_DISABLE_TELEMETRY in HF S…
lizzzcai Apr 29, 2024
a3155d8
[RHOAIENG-4617] - follow up - remove hardcoded fastapi from Dockerfil…
spolti Apr 30, 2024
3f9b442
Remove generate endpoints (#3654)
cmaddalozzo Apr 30, 2024
0fe5d3f
Assign device to input tensors in huggingface server with huggingface…
saileshd1402 Apr 30, 2024
8cfb3e0
Test image builds for ARM64 arch in CI (#3629)
sivanantha321 Apr 30, 2024
f717e12
Fix Huggingface server stopping criteria (#3659)
cmaddalozzo Apr 30, 2024
36a3e6d
Enabled the multiple domains support on an inference service (#3615)
houshengbo Apr 30, 2024
9c6a6b8
Explicitly specify pad token id when generating tokens (#3565)
sivanantha321 May 2, 2024
5011c7d
Fix quick install does not cleans up Istio installer (#3660)
sivanantha321 May 3, 2024
6d1e00b
Fix storage-initializer-docker-publisher workflow
israel-hdez May 3, 2024
7830f59
Merge pull request #340 from israel-hdez/fix-storage-init-workflow
Jooho May 3, 2024
2cddaa4
Add base model for proxying request to an OpenAI API enabled model se…
cmaddalozzo May 3, 2024
0fb5696
Add headers to predictor exception logging (#3658)
grandbora May 3, 2024
a3fda9c
workflow file for cherry-pick on comment (#3653)
andyi2it May 4, 2024
59d705b
Enhance controller setup based on available CRDs (#3472)
israel-hdez May 4, 2024
bfc2e21
Bump version to 0.13.0-rc0 (#3665)
cmaddalozzo May 6, 2024
f3c3220
upgrade vllm/transformers version (#3671)
johnugeorge May 8, 2024
d608056
Add openai models endpoint (#3666)
cmaddalozzo May 9, 2024
629e4ae
feat: Support customizable deployment strategy for RawDeployment mode…
terrytangyuan May 9, 2024
a30d402
Enable dtype support for huggingface server (#3613)
Datta0 May 9, 2024
ca50e18
Add method for checking model health/readiness (#3673)
cmaddalozzo May 10, 2024
ce9b0e8
fix for extract zip from gcs (#3510)
andyi2it May 10, 2024
a4cce1a
Update Dockerfile and Readme (#3676)
gavrissh May 11, 2024
9dbce8e
Update huggingface readme (#3678)
alexagriffith May 11, 2024
56a2940
fix: HPA equality check should include annotations (#3650)
terrytangyuan May 11, 2024
024f69b
Fix: huggingface runtime in helm chart (#3679)
yuzisun May 13, 2024
4c6ce45
Fix: model id and model dir check order (#3680)
yuzisun May 14, 2024
892e5dc
Fix:vLLM Model Supported check throwing circular dependency (#3688)
gavrissh May 15, 2024
8771c3d
Fix: Allow null in Finish reason streaming response in vLLM (#3684)
gavrissh May 15, 2024
929471b
Merge branch 'kserve/kserve master into odh/kserve master
spolti May 15, 2024
c5bebaa
fix lint issues
spolti May 15, 2024
148d70f
Merge pull request #356 from spolti/sync3
openshift-merge-bot[bot] May 17, 2024
6f155a1
Unify the log configuration using kserve logger (#3577)
sivanantha321 May 19, 2024
1fa44e9
Remove conversion webhook from kubeflow manifest patch (#3700)
sivanantha321 May 20, 2024
bbc5ad4
Merge pull request #358 from kserve/master
openshift-merge-bot[bot] May 20, 2024
690e269
Add the field ResponseStartTimeoutSeconds to create ksvc (#3705)
houshengbo May 21, 2024
4841328
Fix kserve version is not updated properly by python-release.sh (#3707)
sivanantha321 May 22, 2024
c9b3738
Copy generated CRDs by kustomize to Helm (#3392)
Jooho May 27, 2024
04c41c2
Add precaution again running v1 endpoints on openai models (#3694)
grandbora May 27, 2024
c660972
Add FP16 datatype support for OIP grpc (#3695)
sivanantha321 May 28, 2024
d2c26b7
replace upstream Dockerfiles with ubi dockerfiles.
VedantMahabaleshwarkar May 21, 2024
e717527
Merge pull request #362 from VedantMahabaleshwarkar/eng-6506-odh
Jooho May 30, 2024
a177171
Add workflow to Trigger build on PR add LGTM and Create Tag and Relea…
rpancham May 31, 2024
d393486
Add option for returning probabilities in huggingface server (#3607)
andyi2it Jun 2, 2024
4b58775
Typos and minor fixes (#3429)
alpe Jun 2, 2024
71114b6
Fix model_id and model_dir precedence for vLLM (#3718)
yuzisun Jun 2, 2024
16c9017
Publish 0.13 release (#3719)
yuzisun Jun 3, 2024
ff744c6
Fixup max_length for HF and model info for vLLM (#3715)
Datta0 Jun 3, 2024
edac2c3
Fix prompt token count and provide completion usage in OpenAI respons…
sivanantha321 Jun 3, 2024
1c51eee
Update serving runtimes package version 0.13.0 (#3720)
sivanantha321 Jun 3, 2024
4e12777
Docs for authorization feature
israel-hdez Feb 28, 2024
601a6e9
Merge pull request #253 from israel-hdez/k-auth-docs
Jooho Jun 4, 2024
365c490
Workflow to trigger a PR and rename Tag
rpancham Jun 6, 2024
65d43c4
Modified the workflow
rpancham Jun 6, 2024
5b62ee3
enable DirectPvcVolumeMount
VedantMahabaleshwarkar Jun 6, 2024
1f3a0a4
Merge pull request #367 from VedantMahabaleshwarkar/patch-1
Jooho Jun 6, 2024
722788b
Remove wait-lgtm from workflow
rpancham Jun 6, 2024
88ad19e
update local gateway information
Jooho Jun 6, 2024
97c83cf
Merge pull request #361 from kserve/master
openshift-merge-bot[bot] Jun 7, 2024
212a77c
Prevent the PassthroughCluster for clients/workloads in the service m…
israel-hdez Jun 9, 2024
3065fd4
Merge pull request #374 from kserve/master
openshift-merge-bot[bot] Jun 10, 2024
cd4ae27
Remove wait-lgtm from workflow and increase timeout time
rpancham Jun 13, 2024
32d3e19
Extract openai predict logic into smaller methods (#3716)
grandbora Jun 18, 2024
fe70b14
modify variable name
rpancham Jun 19, 2024
4c78aaf
Bump MLServer to 1.5.0 (#3740)
sivanantha321 Jun 19, 2024
0ec1763
Refactor storage initializer to log model download time for all stora…
sivanantha321 Jun 19, 2024
b3b1c09
inferenceservice controller: fix error check in Serverless mode (#3753)
dtrifiro Jun 19, 2024
7f0f5e0
Add nccl package and Bump vLLM to 0.4.3 for huggingface runtime (#3723)
sivanantha321 Jun 19, 2024
4c52716
Merge pull request #379 from kserve/master
openshift-merge-bot[bot] Jun 19, 2024
f6b8f8d
Modified the quay_username
rpancham Jun 20, 2024
9a16be4
Merge pull request #366 from rpancham/pr-build
openshift-merge-bot[bot] Jun 20, 2024
56a4e7d
Propagate `trust_remote_code` flag throughout vLLM startup (#3729)
calwoo Jun 21, 2024
03a45bb
Merge pull request #372 from Jooho/master_update_isvc_config
openshift-merge-bot[bot] Jun 21, 2024
352e030
Fix dead links on PyPI (#3754)
kevinbazira Jun 24, 2024
d19e310
Fix model is ready even if there is no model (#3275)
HAO2167 Jun 24, 2024
fbd8778
Fix No model ready error in multi model serving (#3758)
sivanantha321 Jun 27, 2024
71104e1
CWE-362 - anyio Race Condition
spolti Jun 27, 2024
4259287
Merge pull request #382 from spolti/RHOAIENG-9178
openshift-merge-bot[bot] Jun 28, 2024
74c9ff4
Initial implementation of Inference client (#3401)
sivanantha321 Jun 29, 2024
d4349ea
Fix logprobs for vLLM (#3738)
sivanantha321 Jun 29, 2024
68b04ef
Fix model name not properly parsed by inference graph (#3746)
sivanantha321 Jun 29, 2024
ca691f7
pillow - Buffer Overflow (#3598)
spolti Jun 29, 2024
164ea83
Use add_generation_prompt while creating chat template (#3775)
Datta0 Jul 11, 2024
1a70e11
Deduplicate the names for the additional domain names (#3773)
houshengbo Jul 12, 2024
29731be
Make Virtual Service case-insensitive (#3779)
andyi2it Jul 15, 2024
6bacfbf
set protocol https for .Status.Address.URL
Jooho Jul 18, 2024
26f2eb1
Merge pull request #386 from Jooho/change_protocol
openshift-merge-bot[bot] Jul 18, 2024
580a0f3
Install packages needed for vllm model load (#3802)
gavrissh Jul 18, 2024
31d2395
Make gRPC max message length configurable (#3741)
sivanantha321 Jul 20, 2024
5736731
Add readiness probe for MLServer and Increase memory for pmml in CI (…
sivanantha321 Jul 20, 2024
bb9641f
Several bug fixes for vLLM completion endpoint (#3788)
sivanantha321 Jul 21, 2024
7e62122
Removed arm/v7 to make the jobs working again for now
npanpaliya Jul 24, 2024
de96bd3
Increase timeout to make unit test stable (#3808)
Jooho Jul 25, 2024
32d3fc0
Upgrade CI deps (#3822)
sivanantha321 Jul 27, 2024
3862175
Add tests for vLLM (#3771)
sivanantha321 Jul 27, 2024
f9e7d5a
Bump python to 3.11 for serving runtime images and Bump poetry to 1.8…
sivanantha321 Jul 27, 2024
fdac4ed
explicitly enable baseos and appstream repos only
riprasad Aug 2, 2024
96fb00e
Bump vLLM to 0.5.3.post1 (#3828)
sivanantha321 Aug 2, 2024
ec3df25
Merge pull request #395 from opendatahub-io/fix-konflux-build
openshift-merge-bot[bot] Aug 2, 2024
ae30b97
Merge pull request #391 from odh-on-pz/fix-multiarch-image-build
openshift-merge-bot[bot] Aug 2, 2024
c080da5
Refactor the ModelServer to let uvicorn handle multiple workers and u…
sivanantha321 Aug 6, 2024
d73f1ca
Update golang for docs/Dockerfile to 1.21 (#3761)
spolti Aug 10, 2024
9632a52
Make ray an optional dependency (#3834)
sivanantha321 Aug 11, 2024
a6a061e
Update aif example (#3765)
spolti Aug 11, 2024
f9c73c2
Use helm for quick installation (#3813)
sivanantha321 Aug 12, 2024
9a6524e
Updated istio versions based on go version 1.21 (#396)
mholder6 Aug 16, 2024
2fae6e2
Allow KServe to have its own local gateways for Serverless mode (#3737)
israel-hdez Aug 17, 2024
61b5e7e
Add support for Azure DNS zone endpoints (#3819)
tjandy98 Aug 17, 2024
a16f2d7
Fix failed build for knativeLocalGatewayService (#3866)
yuzisun Aug 18, 2024
9292c88
Add logging request feature for vLLM backend (#3849)
sivanantha321 Aug 19, 2024
656f65e
Bump vLLM to 0.5.4 (#3874)
sivanantha321 Aug 21, 2024
99fa478
Fix: Add workaround for snyk image scan failure (#3880)
sivanantha321 Aug 21, 2024
87cf2cd
Fix trust_remote_code not working with huggingface backend (#3879)
sivanantha321 Aug 21, 2024
e82beb6
Update KServe 2024-2025 Roadmap (#3810)
yuzisun Aug 21, 2024
c10f548
Configurable image pull secrets in Helm charts (#3838)
saileshd1402 Aug 24, 2024
7dd4ae2
Fix issue with rolling update behavior (#3786)
andyi2it Aug 24, 2024
b89c8c2
Fix the 'tokens exceeding model limit' error response in vllm server …
saileshd1402 Aug 24, 2024
69cdca5
Add support for binary data extension protocol and FP16 datatype (#3685)
sivanantha321 Aug 24, 2024
0a6302b
Protobuf version upgrade 4.25.4 (#3881)
andyi2it Aug 24, 2024
a9e967f
Adds optional labels and annotations to the controller (#3366)
guitouni Aug 24, 2024
f17d068
Enable Server-Side Apply for Kustomize Overlays in Test Environment (…
Jooho Aug 24, 2024
6978ab9
bufix: update image_transformer.py to handle changes in input structu…
zwong91 Aug 24, 2024
b7431e2
support text embedding task in hugging face server (#3743)
kevinmingtarja Aug 24, 2024
1bd82fb
Rename max_length parameter to max_model_len to be in sync with vLLM …
Datta0 Aug 25, 2024
74591b4
[Upstream] - Update-istio version based on go version 1.21 (#3825)
mholder6 Aug 25, 2024
9bca4ba
Enrich isvc NotReady events for failed conditions (#3303)
asdqwe123zxc Aug 26, 2024
c05d760
adding metadata on requests (#3635)
gcemaj Aug 26, 2024
fb57c15
Fix apiGroups in aggregate roles on manifests
israel-hdez Aug 20, 2024
ab251d6
Publish 0.14.0-rc0 release (#3867)
yuzisun Aug 27, 2024
ddc56ca
Use API token for publishing package to PyPI (#3896)
sivanantha321 Aug 28, 2024
d2f90dc
Fix sdlc broken when kserve installed using helm (#3890)
sivanantha321 Aug 28, 2024
3a347c5
Add Security Context and Resources to RBAC Proxy (#3898)
HotsauceLee Aug 28, 2024
e879a39
Remove unwanted cluster scope secret permissions (#3893)
sivanantha321 Aug 30, 2024
67373ad
bump to vllm 0.5.5 (#3911)
lizzzcai Sep 3, 2024
c80dae9
pin gosec to 2.20.0 (#3921)
greenmoon55 Sep 5, 2024
c7e99d8
add a new doc 'common issues and solutions' (#3878)
Jooho Sep 7, 2024
305d393
Implement health endpoint for vLLM backend (#3850)
sivanantha321 Sep 7, 2024
f54e27a
Add security best practices for inferenceservice, inferencegraph, ser…
sivanantha321 Sep 7, 2024
e7353d1
Bump Go to 1.22 (#3912)
sivanantha321 Sep 11, 2024
f69de18
[RHOAIENG-12577] - enable modelCar by default (#407)
spolti Sep 13, 2024
d779344
bump to vllm 0.6.0 (#3934)
hustxiayang Sep 13, 2024
fef6c7a
Set the volume mount's readonly annotation based on the ISVC annotati…
hdefazio Sep 15, 2024
391c8c6
mount /dev/shm volume to huggingfaceserver (#3910)
lizzzcai Sep 16, 2024
9ec6842
Fix permission error in snyk scan (#3889)
sivanantha321 Sep 16, 2024
acfc887
Cluster Local Model CR (#3839)
greenmoon55 Sep 17, 2024
90a5e12
added http headers to inbound request (#3895)
andyi2it Sep 17, 2024
a2f612c
Add prow-github-action (#3888)
sivanantha321 Sep 19, 2024
02293ac
Add TLS support for Inference Loggers (#3863)
ruivieira Sep 19, 2024
536fc9b
Fix explainer endpoint not working with path based routing (#3257)
sivanantha321 Sep 19, 2024
d97baa6
Fix ingress configuration for path based routing and update go mod (…
sivanantha321 Sep 19, 2024
a0a4d00
Add HostIPC field to ServingRuntimePodSpec (#3943)
greenmoon55 Sep 23, 2024
bf6fae8
remove conversion wehbook part from self-signed-ca.sh (#3941)
Jooho Sep 23, 2024
2376eeb
update fluid kserve sample to use huggingface servingruntime (#3907)
lizzzcai Sep 23, 2024
80982c2
bump to vLLM0.6.1post2 (#3948)
hustxiayang Sep 24, 2024
12c4039
Add NodeDownloadPending status to ClusterLocalModel (#3955)
greenmoon55 Sep 26, 2024
4add27b
[RHOAIENG-12260] - [konflux] hermetic build for storage-initializer i…
spolti Sep 26, 2024
9380e99
add tags to rest server timing logs to differentiate cpu and wall tim…
gfkeith Sep 27, 2024
1e28d4f
Implement Huggingface model download in storage initializer (#3584)
andyi2it Sep 29, 2024
b344a0a
Update OWNERS file (#3966)
yuzisun Sep 30, 2024
20b6e29
Cluster local model controller (#3860)
greenmoon55 Oct 2, 2024
a50fdc9
Prepare for 0.14.0-rc1release and automate sync process (#3970)
sivanantha321 Oct 3, 2024
d5ed018
add a new API for multi-node/multi-gpu (#3871)
Jooho Oct 3, 2024
ecc3729
Fix update-openapigen.sh that can be executed from kserve dir (#3924)
Jooho Oct 5, 2024
24d882b
Add python 3.12 support and remove python 3.8 support (#3645)
sivanantha321 Oct 6, 2024
17725be
Fix openssl vulnerability CWE-1395 (#3975)
sivanantha321 Oct 9, 2024
d7d79e1
Fix Kubernetes Doc Links (#3670)
jyono Oct 9, 2024
70e12ff
Fix kserve local testing env (#3981)
yuzisun Oct 9, 2024
dec8c14
Fix streaming response not working properly with logger (#3847)
sivanantha321 Oct 10, 2024
6ff2a2f
Add a flag for automount serviceaccount token (#3979)
greenmoon55 Oct 10, 2024
d2695a5
Do not set security context on the storage initializer from user cont…
greenmoon55 Oct 11, 2024
7796c44
Modelcar race condition mitigation with an init container (#3932)
israel-hdez Oct 11, 2024
fc72a2b
Fix: Headers passing for v1/v2 endpoints (#3669)
andyi2it Oct 12, 2024
dbceef3
Torchserve security update (#3774)
udaij12 Oct 14, 2024
82234c0
Pin ubuntu 22.04 for minikube setup action (#3994)
greenmoon55 Oct 14, 2024
7e43642
KServe 0.14 Release (#3988)
greenmoon55 Oct 14, 2024
ac7ae34
Documentation on using OCI images for model storage (modelcars) (#415)
israel-hdez Oct 18, 2024
1663443
Updated the OWNERS file to remove previous members and added new memb…
mholder6 Oct 23, 2024
befa4d6
Merge remote-tracking branch 'kserve/release-0.14' into j9436-kserve0…
israel-hdez Oct 22, 2024
53de246
Fix poetry lockfiles after v0.14.0 code sync
israel-hdez Oct 24, 2024
b7a868f
Restore compatibility with go v1.21
israel-hdez Oct 24, 2024
c9453bf
Merge branch 'odh-master' into j9436-kserve014-upgrade
israel-hdez Oct 28, 2024
733c1c3
Merge pull request #421 from israel-hdez/j9436-kserve014-upgrade
israel-hdez Oct 28, 2024
a3e474e
Reduce E2Es dependency on CI environment (2) (#4008)
israel-hdez Oct 21, 2024
d955f6f
Revert 'Increase memory limit of kserve-controller pod'
hdefazio Oct 29, 2024
4f82fe4
Fix issues after 0.14.0 sync (#424)
Jooho Nov 4, 2024
bf4c2ac
Merge pull request #430 from Jooho/odh_0.14_cherry_pick
Jooho Nov 4, 2024
89922f6
add knativeLocalGatewayService (#432)
Jooho Nov 5, 2024
ee9a342
[Cherry Pick] Multi node implementation (#434)
Jooho Nov 7, 2024
d3cb8fe
Merge branch 'odh_0.14' into rhoai_master
Jooho Nov 7, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
18 changes: 18 additions & 0 deletions .flake8
Original file line number Diff line number Diff line change
@@ -0,0 +1,18 @@
[flake8]
max-line-length = 120
extend-ignore = E203,E501,E701, B017
exclude =
.venv,
venv,
python/kserve/kserve/exceptions.py,
python/kserve/kserve/configuration.py,
python/kserve/kserve/rest.py,
python/kserve/kserve/api_client.py,
python/kserve/test/test_v1*.py,
python/kserve/kserve/__init__.py,
python/kserve/test/__init__.py,
python/kserve/test/test_knative*.py,
python/kserve/kserve/protocol/grpc/grpc_predict_v2*.py
python/*_pb2.py
docs/**/*.py
python/kserve/kserve/protocol/rest/openai/types/openapi.py
5 changes: 5 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,3 +46,8 @@ Please also list any relevant details for your test configuration.
```release-note

```

**Re-running failed tests**

- `/rerun-all` - rerun all failed workflows.
- `/rerun-workflow <workflow name>` - rerun a specific failed workflow. Only one workflow name can be specified. Multiple /rerun-workflow commands are allowed per comment.
25 changes: 8 additions & 17 deletions .github/actions/base-download/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -4,23 +4,14 @@ description: 'A wrapper around download docker artifact to download a set of bas
runs:
using: "composite"
steps:

- name: Download controller image
uses: ishworkh/docker-image-artifact-download@v1
with:
image: "kserve/kserve-controller:${{ github.sha }}"

- name: Download agent image
uses: ishworkh/docker-image-artifact-download@v1
with:
image: "kserve/agent:${{ github.sha }}"

- name: Download storage initializer image
uses: ishworkh/docker-image-artifact-download@v1
- name: Download base artifacts
uses: actions/download-artifact@v4
with:
image: "kserve/storage-initializer:${{ github.sha }}"
path: ./tmp
pattern: ${{ env.BASE_ARTIFACT_PREFIX }}-*
merge-multiple: true

- name: Download router image
uses: ishworkh/docker-image-artifact-download@v1
- name: Load base images
uses: ./.github/actions/load-docker-images
with:
image: "kserve/router:${{ github.sha }}"
directory: ./tmp
6 changes: 5 additions & 1 deletion .github/actions/kserve-dep-setup/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -5,6 +5,10 @@ inputs:
description: 'Network layer to install'
required: true
default: 'istio'
deployment-mode:
description: 'Kserve deployment mode. Supported values are serverless and raw'
required: false
default: 'serverless'
runs:
using: "composite"
steps:
Expand All @@ -15,7 +19,7 @@ runs:

if [[ "${{ inputs.network-layer }}" == "istio" ]]; then
echo "Selected network layer ${{ inputs.network-layer }}"
./test/scripts/gh-actions/setup-deps.sh
./test/scripts/gh-actions/setup-deps.sh ${{ inputs.deployment-mode }}

kubectl get pods -n istio-system
kubectl describe svc -n istio-system istio-ingressgateway
Expand Down
21 changes: 21 additions & 0 deletions .github/actions/load-docker-images/action.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
name: 'Load docker images'
description: 'Loads docker images from the tar files in specified director and deletes the directory'

inputs:
directory:
description: 'Path to the directory which contains the tar files'
required: true

runs:
using: composite
steps:
- name: Load docker images
shell: bash
run: |
files=$(find ${{ inputs.directory }} -maxdepth 1 -type f)
for file in ${files[@]};do
echo "Loading image $(basename ${file})"
docker image load -i ${file}
done
rm -rf ${{ inputs.directory }}
docker image ls
10 changes: 4 additions & 6 deletions .github/actions/minikube-setup/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -5,18 +5,16 @@ runs:
using: "composite"
steps:
- name: Install kubectl
uses: azure/setup-kubectl@v3
uses: azure/setup-kubectl@v4.0.0
with:
version: 'v1.27.4'
version: 'v1.29.7'

- name: Setup Minikube
uses: medyagh/setup-minikube@latest
with:
minikube-version: '1.31.1'
kubernetes-version: 'v1.27.4'
minikube-version: '1.33.1'
kubernetes-version: 'v1.29.7'
driver: 'none'
network-plugin: 'cni'
cni: 'calico'
wait: 'all'
start-args: --wait-timeout=6m0s

Expand Down
31 changes: 31 additions & 0 deletions .github/labels.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
# Labels for labeling issues and pull requests using GitHub prow action.
kind:
- 'bug'
- 'discussion'
- 'feature'
- 'process'
- 'question'

area:
- 'api'
- 'azure'
- 'docs'
- 'example'
- 'gcp'
- 'inference'
- 'infrastructure-feature'
- 'logging'
- 'operator'
- 'performance'
- 'testing'
- 'project-health'
- 'servingruntime'
- 'inferencegraph'
- 'storage'
- 'inferenceservice'
- 'control-plane'

priority:
- 'p0'
- 'p1'
- 'p2'
15 changes: 9 additions & 6 deletions .github/workflows/agent-docker-publish.yml
Original file line number Diff line number Diff line change
Expand Up @@ -28,7 +28,8 @@ jobs:
runs-on: ubuntu-latest

steps:
- uses: actions/checkout@v2
- name: Checkout source
uses: actions/checkout@v4

- name: Run tests
run: |
Expand All @@ -49,18 +50,20 @@ jobs:
if: github.event_name == 'push'

steps:
- uses: actions/checkout@v2
- name: Checkout source
uses: actions/checkout@v4

- name: Setup QEMU
uses: docker/setup-qemu-action@v2
uses: docker/setup-qemu-action@v3
with:
platforms: ppc64le

- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v2
uses: docker/setup-buildx-action@v3

- name: Login to Quay
uses: docker/login-action@v2
uses: docker/login-action@v3

with:
registry: quay.io
username: ${{ secrets.QUAY_USER }}
Expand Down Expand Up @@ -91,7 +94,7 @@ jobs:
echo CONTAINER_TAGS=$TAGS >> $GITHUB_ENV

- name: Build and push
uses: docker/build-push-action@v3
uses: docker/build-push-action@v5
with:
platforms: linux/amd64,linux/arm64/v8,linux/ppc64le,linux/s390x
context: .
Expand Down
38 changes: 24 additions & 14 deletions .github/workflows/artexplainer-docker-publish.yml
Original file line number Diff line number Diff line change
Expand Up @@ -27,17 +27,24 @@ jobs:
runs-on: ubuntu-latest

steps:
- uses: actions/checkout@v2
- name: Checkout source
uses: actions/checkout@v4

- name: Setup QEMU
uses: docker/setup-qemu-action@v3

- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v3

- name: Run tests
run: |
if [ -f docker-compose.test.yml ]; then
docker-compose --file docker-compose.test.yml build
docker-compose --file docker-compose.test.yml run sut
else
cd python
docker buildx build . --file artexplainer.Dockerfile
fi
uses: docker/build-push-action@v5
with:
platforms: linux/amd64, linux/arm64/v8
context: python
file: python/artexplainer.Dockerfile
push: false
# https://github.com/docker/buildx/issues/1533
provenance: false

# Push image to GitHub Packages.
# See also https://docs.docker.com/docker-hub/builds/
Expand All @@ -49,16 +56,17 @@ jobs:
if: github.event_name == 'push'

steps:
- uses: actions/checkout@v2
- name: Checkout source
uses: actions/checkout@v4

- name: Setup QEMU
uses: docker/setup-qemu-action@v2
uses: docker/setup-qemu-action@v3

- name: Setup Docker Buildx
uses: docker/setup-buildx-action@v2
uses: docker/setup-buildx-action@v3

- name: Login to DockerHub
uses: docker/login-action@v2
uses: docker/login-action@v3
with:
username: ${{ secrets.DOCKER_USER }}
password: ${{ secrets.DOCKER_PASSWORD }}
Expand All @@ -83,10 +91,12 @@ jobs:
echo IMAGE_ID=$IMAGE_ID >> $GITHUB_ENV

- name: Build and push
uses: docker/build-push-action@v3
uses: docker/build-push-action@v5
with:
platforms: linux/amd64, linux/arm64/v8
context: python
file: python/artexplainer.Dockerfile
push: true
tags: ${{ env.IMAGE_ID }}:${{ env.VERSION }}
# https://github.com/docker/buildx/issues/1533
provenance: false
4 changes: 2 additions & 2 deletions .github/workflows/automated-release.yml
Original file line number Diff line number Diff line change
Expand Up @@ -14,9 +14,9 @@ jobs:
runs-on: ubuntu-latest
steps:
- name: Setup Go
uses: actions/setup-go@v4
uses: actions/setup-go@v5
with:
go-version: 1.21
go-version: 1.22

- name: Checkout source code
uses: actions/checkout@v4
Expand Down
31 changes: 0 additions & 31 deletions .github/workflows/cherry-pick.yml

This file was deleted.

23 changes: 23 additions & 0 deletions .github/workflows/comment-cherry-pick.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,23 @@
name: Cherry Pick On Comment

on:
issue_comment:
types: [created]

jobs:
cherry-pick:

name: Cherry Pick
if: github.event.issue.pull_request != '' && contains(github.event.comment.body, '/cherry-pick') && github.event.comment.author_association == 'CONTRIBUTOR'
runs-on: ubuntu-latest

steps:
- name: Checkout the latest code
uses: actions/checkout@v2
with:
token: ${{ secrets.GITHUB_TOKEN }}
fetch-depth: 0 # otherwise, you will fail to push refs to dest repo
- name: Automatic Cherry Pick
uses: hianhianhian/gha-cherry-pick@master
env:
GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
Loading