Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aiinference #3559

Draft
wants to merge 14 commits into
base: 0.2
Choose a base branch
from
Draft

Aiinference #3559

wants to merge 14 commits into from

Conversation

Josephrp
Copy link

@Josephrp Josephrp commented Sep 23, 2024

Why are these changes needed?

This PR introduces a new GithubLLM class to autogen, allowing users to leverage GitHub's inference endpoint with automatic fallback to Azure. It provides a seamless way to use GitHub's LLM capabilities within the autogen ecosystem, handling rate limits and ensuring high availability through Azure fallback.

Related Issues

Contributes to #3300

Checks

Tasks

  • Inference

@Josephrp
Copy link
Author

CHECKLIST (WiP / Help Wanted) :

  • improve / rename tests
  • add json / chat completions client

@jackgerrits jackgerrits mentioned this pull request Sep 25, 2024
)
"""

SUPPORTED_MODELS = [
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we need to check this from client side? Can we just let the server to check it.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it could be a good idea to keep it server side because the updates are frequent , is there any interest in this ? on my side , sure , i could always send it in for review if it's a good idea :-)

@ekzhu ekzhu changed the base branch from main to 0.2 October 2, 2024 18:24
@jackgerrits jackgerrits added the 0.2 Issues which are related to the pre 0.4 codebase label Oct 4, 2024
@rysweet
Copy link
Collaborator

rysweet commented Oct 10, 2024

This PR is against AutoGen 0.2. AutoGen 0.2 has been moved to the 0.2 branch. Please rebase your PR on the 0.2 branch or update it to work with the new AutoGen 0.4 that is now in main.

@rysweet rysweet added the awaiting-op-response Issue or pr has been triaged or responded to and is now awaiting a reply from the original poster label Oct 10, 2024
@Josephrp
Copy link
Author

thanks !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
0.2 Issues which are related to the pre 0.4 codebase awaiting-op-response Issue or pr has been triaged or responded to and is now awaiting a reply from the original poster
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants