
Concept of self-hosted agent #10624

Open
wants to merge 2 commits into main
Conversation

ankitbko (Member) commented on Feb 20, 2025

Motivation and Context

In the agentic world, different agents are created and maintained by different teams in an organization. Each team has its own way of creating and hosting agents, independent of the others. To use these agents when building a multi-agent application, Semantic Kernel needs to integrate with agents that live outside the Semantic Kernel application. To Semantic Kernel, these agents are black-box services with their own implementation of "intelligence", but they can still use the plugins registered in the kernel as needed. AgentGroupChat should be able to orchestrate between these agents as usual.

Description

Although there are multiple ways to implement such a concept, this PR takes the approach of deriving from ChatCompletionClientBase and, taking inspiration from OpenAIChatCompletionBase, implementing SelfHostedChatCompletion, which makes a REST request to externally hosted agents. Instances of SelfHostedChatCompletion are registered as services in the kernel:

kernel.add_service(
    SelfHostedChatCompletion(
        url=os.getenv("REVIEWER_AGENT_URL") or "", ai_model_id=REVIEWER_NAME, service_id=REVIEWER_NAME
    )
)
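
For context, a minimal sketch of what such a service could look like is shown below. This is illustrative only, not the implementation in this PR; the httpx transport, the OpenAI-style payload, and the overridden method name are assumptions that may differ from the actual code and from your SK version.

import httpx

from semantic_kernel.connectors.ai.chat_completion_client_base import ChatCompletionClientBase
from semantic_kernel.connectors.ai.prompt_execution_settings import PromptExecutionSettings
from semantic_kernel.contents import AuthorRole, ChatHistory, ChatMessageContent


class SelfHostedChatCompletion(ChatCompletionClientBase):
    """Chat completion service backed by an externally hosted agent (illustrative sketch)."""

    url: str  # endpoint of the self-hosted agent

    async def get_chat_message_contents(
        self,
        chat_history: ChatHistory,
        settings: PromptExecutionSettings,
        **kwargs,
    ) -> list[ChatMessageContent]:
        # Serialize the history into a simple OpenAI-style payload; the agent's actual
        # contract is an assumption here (see the description above).
        payload = {
            "messages": [
                {"role": m.role.value, "content": m.content} for m in chat_history.messages
            ]
        }
        async with httpx.AsyncClient() as client:
            response = await client.post(self.url, json=payload)
            response.raise_for_status()
            reply = response.json()["choices"][0]["message"]["content"]
        return [ChatMessageContent(role=AuthorRole.ASSISTANT, content=reply)]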

ChatCompletionAgent is used to create the agents, and the appropriate service is referenced via its service_id:

agent_reviewer = ChatCompletionAgent(
    service_id="artdirector",
    kernel=kernel,
    name=REVIEWER_NAME,
    arguments=KernelArguments(settings=PromptExecutionSettings(service_id=REVIEWER_NAME)),
)

The agents themselves are implemented in a FastAPI server under the agents folder. AgentGroupChat is used to orchestrate between the agents, as sketched below.
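
For completeness, the orchestration can then be a few lines of standard AgentGroupChat usage; the snippet below is a sketch with placeholder agents and task text, not code taken from this PR.

import asyncio

from semantic_kernel.agents import AgentGroupChat
from semantic_kernel.contents import AuthorRole, ChatMessageContent


async def main() -> None:
    # agent_writer and agent_reviewer are ChatCompletionAgent instances (placeholders here),
    # each resolving to its own self-hosted service via service_id.
    chat = AgentGroupChat(agents=[agent_writer, agent_reviewer])
    await chat.add_chat_message(
        ChatMessageContent(role=AuthorRole.USER, content="a slogan for a new line of electric SUVs")
    )
    async for response in chat.invoke():
        print(f"{response.name}: {response.content}")


asyncio.run(main())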

Contribution Checklist

@ankitbko requested a review from a team as a code owner on February 20, 2025, 16:07
from semantic_kernel.connectors.ai.prompt_execution_settings import PromptExecutionSettings


class SelfHostedChatCompletion(ChatCompletionClientBase):
Member


I'm not sure I get the point of this extra implementation. OpenAIChatCompletion can take any URL, and as long as the response matches it should just work; it seems the API you create does just that?

In addition, we have support for other local models through Ollama, ONNX, and Hugging Face...
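
For illustration, the existing route described here could look roughly like the following; this is a sketch, and the async_client parameter plus the dummy API key are assumptions about how a self-hosted, OpenAI-compatible endpoint would be wired up.

import os

from openai import AsyncOpenAI
from semantic_kernel.connectors.ai.open_ai import OpenAIChatCompletion

kernel.add_service(
    OpenAIChatCompletion(
        ai_model_id=REVIEWER_NAME,
        service_id=REVIEWER_NAME,
        async_client=AsyncOpenAI(base_url=os.getenv("REVIEWER_AGENT_URL"), api_key="not-needed"),
    )
)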

Member Author

@ankitbko commented on Feb 21, 2025


Our customer is evaluating SK as an orchestration platform to build a multi-agent system in which different agents are developed and deployed by different teams. These agents could have their own contracts and authN mechanisms. In other words, the agents are "microservices".

Eventually these agents' contracts will converge to a unified standard within their org. At a bare minimum we foresee the contract including message history and tool support; the remaining AOAI Chat Completion parameters are not needed, as they are internal implementation details of an agent. Purely for simplicity in demonstrating this concept, we assume the agents' contract follows OpenAI Chat Completion, but that does not have to be the case.

The key requirement is that the agent implementation (whether local models or AOAI) is external to the SK app.
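
To make the bare-minimum contract concrete, it might look something like the sketch below; the model and field names are purely illustrative and not part of this PR.

from pydantic import BaseModel


class AgentMessage(BaseModel):
    role: str  # "system" | "user" | "assistant" | "tool"
    content: str


class AgentToolDefinition(BaseModel):
    name: str
    description: str
    parameters: dict  # JSON schema for the tool's arguments


class AgentChatRequest(BaseModel):
    messages: list[AgentMessage]  # full message history
    tools: list[AgentToolDefinition] = []  # tools the agent may call back in the kernel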
