|
| - decentralized_model - (todo)
| |- hivemind/
│ |─ __init__.py
│ |─ connection_manager.py
│ |─ peer_discovery.py # e.g., build connections via IP addresses
│ |─ utils.py
| |- cli/
│ |─ run_dht.py
|
| |- server/
│ ├── __init__.py
│ ├── server.py
│ ├── backend.py
│ ├── handlers.py
| |- client/
│ ├── __init__.py
│ ├── client.py # e.g., clientConfig
│ ├── client_manager.py # e.g., sequence manager
│ ├── sequential_generation.py
| |- pipeline parallelism
| |- models/
| |- llama/layers (todo)
| |- opt/layers (todo)
|
| - dist_model - (todo)
| |- tensor model parallelism -
| | |- models/
| | |- llama/layers (todo)
| | |- opt/layers (todo)
| |
| |- sequence model parallelism -
| |- models/
| |- llama/layers (todo)
| |- opt/layers (todo)
|
| - single_gpu_model -
| |- models/
| |- llama/layers (pass)
| |- opt/layers (pass)
|
| - examples
| |- decentralized_model_scripts (configs)
| | |- models
| | |- llama
| | | |- dece_flex_llama.py (todo)
| | |- opt
| | | |- dece_flex_opt.py (todo)
| |
| |- dist_model_scripts
| | |- models
| | |- llama
| | | |- dist_flex_llama.py (todo)
| | |- opt
| | |- dist_flex_opt.py (todo)
| |
| |- single_gpu_model_scripts
| |- models
| |- llama
| | |- flex_llama.py (pass)
| |- opt
| |- flex_opt.py (pass)
|
| - utils
|
-
Notifications
You must be signed in to change notification settings - Fork 0
PASAUCMerced/Flexgen-modification
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published