What AI Models are you interested to try out from the New GitHub Models Catalog? Share Your Thoughts! #134404
Replies: 23 comments 1 reply
-
Which model caught your eye and why?Llama 403B - I haven't had a chance to use it elsewhere and can't deploy it at home. Have you had a chance to experiment with any of the new models? How was your experience?Yes. It's an awesome concept and I'm excited about this. Encountered one bug so far: I can't modify the tokens by using my mouse. I can only modify it by typing in a number, for example 4096. Is there a particular model that you hope for in the future to be included?Any hyped model (like Llama 403B in this case), Claude Are there any emerging models that you're excited about?Flux |
Beta Was this translation helpful? Give feedback.
-
I would say at this point, it is best to learn practical like agentic, function calling, rag, etc. than trying specific models. Of course, new models mean new strategies, new possibilities, so both are linked. etc. For the future, having models that can understand a computer screen and eventually help the user (step toward and always on assistant), and models like Segment Anything 2 would be great. Rather than big models like Stable Diffusion, smaller, task-specific models lighter and simpler could be interesting. A workflow (altering the color of a t-short), segmentation + labeling (which is ‘wow’), with a mask as an output used to apply changes to the original picture. Each task can be isolated. The app needs to know what the user wants, in natural language. So, some NLP and an occasion to use the semantic kernel to interpret the intent, two LLM agents (or SLM) could be doing the same job (or having the same goal, but using different strategies, like clip, or segmenting already), then a third one would summarize and validate before proceeding. For NLP, a simple case is an Assistant like Google’s (but smarter), that can tell if the user is just chatting or asking something (like turn on that light). Available models on the marketplace, are capable of this, but with some help. Finally, time series models, to detect anomalies and thresholds. Example, the number of card payments/min (I work in that field) has a regular pattern, except on special days (not regularly occurring), national holiday, or black Friday are quite different, so irregular seasonality. This is a frequent problem. And all of this in one click with Codespaces :) These are just ideas without name of models because I haven’t been following closely new models… but application of the models, are in my opinion, more important now. |
Beta Was this translation helpful? Give feedback.
-
Definitely the mamba models |
Beta Was this translation helpful? Give feedback.
-
The top of the line models are a must: GPT-4, Claude 3.5 Sonnet, Claude Opus. Also coding specific ones: DeepSeek-V2 |
Beta Was this translation helpful? Give feedback.
-
I'm particularly interested in trying out models related to natural language processing (NLP), like GPT-based models, and computer vision models for image recognition. The GitHub Models Catalog offers diverse options, so exploring those tailored for specific industries like healthcare or finance could benefit niche applications. |
Beta Was this translation helpful? Give feedback.
-
A new catalog is a truly nice place to try out the models at one place, but by the way, i want to share a small feedback that a lot of AI engineers might also have, that how can we upload our own models to the community and in facts, i think there can be another type of repository that could allow us to upload the model file, train file and so on. This might promote open-source community's improvement~ hope to be seen~ |
Beta Was this translation helpful? Give feedback.
-
Vision transformer models such as Microsoft Florence, ViT, etc. I think would be a good addition. Obviously we have some multimodal models in there already but I think vision specific models would be a good addition. |
Beta Was this translation helpful? Give feedback.
-
Mamba models Most importantly, allow developers upload their own. |
Beta Was this translation helpful? Give feedback.
-
VLLMs (vision) and GAN models please! |
Beta Was this translation helpful? Give feedback.
-
Getting access to ColPali would be great. Looks like the future of RAG through vision models. |
Beta Was this translation helpful? Give feedback.
-
I’m especially interested in exploring models focused on natural language processing (NLP), such as those based on GPT, as well as computer vision models for image recognition, because these are my requirements as I work as an SEO Specialist and content writer for site ihatecbts |
Beta Was this translation helpful? Give feedback.
-
I'd like to be able to train or finetine models, too. I'm surprised by how good the Llama 403B is! |
Beta Was this translation helpful? Give feedback.
-
How about some diffusion models such as Flux? |
Beta Was this translation helpful? Give feedback.
-
DeepSeek V2.5 of cause |
Beta Was this translation helpful? Give feedback.
-
Is it possible Flux.1 Pro? |
Beta Was this translation helpful? Give feedback.
-
Hope to launch a vertical model specifically for Review or Unit Test. |
Beta Was this translation helpful? Give feedback.
-
i would like to have claude 3.5 sonnet new |
Beta Was this translation helpful? Give feedback.
This comment was marked as off-topic.
This comment was marked as off-topic.
-
I would like to use antrophic's claude serie models like opus,sonnet and haiku |
Beta Was this translation helpful? Give feedback.
-
I added support for GitHub Models to my terminal based chat client Charla after I got access to the public beta 2 months ago. I use the Azure AI Inference Python library for interacting with GitHub models and the Ollama client and server for local models. For simple tasks, small models like Phi3 work fine in my experience. For more complex tasks, a model like Llama3 can be a good choice, but none come close to the gpt-4o models available through GitHub Models, in my opinion. I'm amazed at the speed of the responses, the stability of the API, and, most importantly, the quality of the output. Fun fact: the responses from the remote API are often faster than interactions with local models. Regarding the code libraries I'm using, Ollama offers some features that I miss in the Azure API, for example, metadata about the models via the Anyhow, I really like how easy it was to integrate GitHub Models and how reliably the API works. I’m looking forward to using more specialized and smaller models in the future that are even faster and more efficient. |
Beta Was this translation helpful? Give feedback.
-
DeepSeek V3 |
Beta Was this translation helpful? Give feedback.
-
Select Topic Area
Show & Tell
Body
We're excited to announce the launch of GitHub Models, including the all new catalog! 🎉 This catalog features a diverse range of models ranging from model types, sizes, and specializations. Now that the catalog is live, we want to hear from you!
What's your favorite AI model from the new catalog and why?
Whether you're into machine learning, deep learning, natural language processing, computer vision, or any other AI domain, your feedback is invaluable. Your insights can help others discover powerful tools and spark innovative ideas.
We believe every developer can be an AI engineer with the right tools and training. From playground to coding with the model in GitHub Codespaces to production deployment via Azure, GitHub Models shows you how simple it can be. Sign up for the limited public beta HERE.
Share your experiences and feedback in the comments below or create a new discussion 👇
Beta Was this translation helpful? Give feedback.
All reactions