All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
Usage: hugie endpoint create [OPTIONS] [DATA]
Create an endpoint
Args: data (str): Path to JSON data to create the endpoint
Arguments:
[DATA] Path JSON data to create the endpoint
Options:
--account-id TEXT ID of the account (for private endpoints)
--name TEXT Name of the endpoint [default: hf-endpoint]
--type TEXT Type of endpoint, one of ['public', 'protected',
'private']
--accelerator TEXT Accelerator to use. One of ['CPU','GPU'] [default:
cpu]
--instance-type TEXT [default: c6i]
--instance-size TEXT [default: small]
--min-replica INTEGER Minimum number of replicas [default: 1]
--max-replica INTEGER Maximum number of replicas [default: 1]
--framework TEXT Framework to use [default: custom]
--repository TEXT Name of the hf model repository [default: t5-small]
--revision TEXT Revision of the hf model repository [default: main]
--task TEXT Task of the model [default: text-generation]
--image TEXT Image to use from huggingface or tgi [default:
huggingface]
--vendor TEXT Vendor to use. One of ['aws','gcp'] [default: aws]
--region TEXT Vendor specific region, e.g. 'us-east-1' [default:
us-east-1]
--json Prints the full output in JSON.
--help Show this message and exit.
hugie endpoint create
now received params with all the fields of the endpoint configuration file. This means you can easily deploy an endpoint by simply changing only the params that are neccesary away from the default
- Add endpoint config subcommand #47 (resolves #41)
- Adds
ls
alias tolist
command #50 - Adds
-f
shortcut to--force
command #50 - Adds
poetry
for dependency management #69 - Adds
ui
command to open Hugging Face Inference Endpoint website in browser #70
- Only create docs on merge to main #50
- Switch to poetry for dependency management #69
- Don't set
top_k
when runninghugie endpoint test
#49
- Removes
--no-force
option todelete
command #50
- Fixes error handling flows #72
- Fixes breaking changes introduced by pydantic 2.0 #68
- Fix data bug in update function #63
- Address missing token message #46
- Handle 40x responses in create #45
- Update now handles 40x codes #44
- Remove --no-json from cli #43
✨ Renamed to Hugie 🐻
- Add endpoint subcommand
- Add endpoint test command
- Add error handling
- Abstract json loading to
load_json
function - Allow test command to take an input JSON containing queries
- Make create output more human readable #26
- Make delete error messsage more informative #25
- Add tests and CI/CD
- Correct table column widths #16