Remove the abstraction for token counting from the main evaluation API #6320

shyamnamboodiripad · 2025-04-18T08:55:23Z

This change is being made because there is still some uncertainty around what a general-purpose token counting abstraction (that supports all kinds of future models, and all kinds of input modalities) should look like at the moment. We do not want to bake in an API that only supports text-based inputs for the models and use cases that are prevalent today, since it would be a potential breaking change to change this API after we release a stable version of the evaluation APIs.

We can always reintroduce the token counting support in a non-breaking fashion in the future if and when there is more clarity on what a general-purpose token counting abstraction should look like, or if and when a general-purpose token counting abstraction is introduced in a lower layer (Microsoft.Extensions.AI) in the future.

In the meanwhile, callers should still be able to use the Microsoft.ML.Tokenizers library directly to count tokens in text-based content and trim down the conversation history before calling EvaluateAsync() if needed.

Fixes #6234

Microsoft Reviewers: Open in CodeFlow

src/Libraries/Microsoft.Extensions.AI.Evaluation/Microsoft.Extensions.AI.Evaluation.csproj

src/Libraries/Microsoft.Extensions.AI.Evaluation/ChatMessageExtensions.cs

test/Libraries/Microsoft.Extensions.AI.Evaluation.Tests/ConversationTrimmingTests.cs

This change is being made because there is still some uncertainty around what a general purpose token counting abstraction (that supports all kinds of future models, and all kinds of input modalities) should look like at the moment. We do not want to bake in an API that only supports text based inputs for the models and use cases that are prevalent today, since it would be a potential breaking change to change this API after we release a stable version of the evaluation APIs. We can always reintroduce the token counting support in a non-breaking fashion in the future if and when there is more clarity on what a general purpose token counting abstraction should look like, or if and when a general purpose token counting abstraction is introduced in a lower layer (Microsoft.Extensions.AI) in the future. In the meanwhile, callers should still be able to use the Microsoft.ML.Tokenizers library directly to count tokens in text-based content and trim down the conversation history before calling EvaluateAsync() if needed.

shyamnamboodiripad requested a review from a team as a code owner April 18, 2025 08:55

github-actions bot added the area-ai-eval Microsoft.Extensions.AI.Evaluation and related label Apr 18, 2025

dotnet-policy-service bot assigned shyamnamboodiripad Apr 18, 2025

shyamnamboodiripad force-pushed the counter branch 2 times, most recently from 7511ebe to f93c418 Compare April 18, 2025 21:35

peterwald reviewed Apr 19, 2025

View reviewed changes

src/Libraries/Microsoft.Extensions.AI.Evaluation/Microsoft.Extensions.AI.Evaluation.csproj Outdated Show resolved Hide resolved

peterwald reviewed Apr 19, 2025

View reviewed changes

src/Libraries/Microsoft.Extensions.AI.Evaluation/ChatMessageExtensions.cs Outdated Show resolved Hide resolved

peterwald reviewed Apr 19, 2025

View reviewed changes

test/Libraries/Microsoft.Extensions.AI.Evaluation.Tests/ConversationTrimmingTests.cs Outdated Show resolved Hide resolved

shyamnamboodiripad force-pushed the counter branch 2 times, most recently from bd3f9b0 to 987b057 Compare April 22, 2025 21:05

shyamnamboodiripad force-pushed the counter branch from 987b057 to 5f9bc5e Compare April 22, 2025 21:07

shyamnamboodiripad enabled auto-merge (squash) April 22, 2025 21:07

Rename includedHistory -> conversationHistory

ac654b8

peterwald approved these changes Apr 23, 2025

View reviewed changes

shyamnamboodiripad merged commit 44eaa2e into dotnet:main Apr 23, 2025
6 checks passed

shyamnamboodiripad deleted the counter branch April 23, 2025 17:44

github-actions bot locked and limited conversation to collaborators May 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Remove the abstraction for token counting from the main evaluation API #6320

Remove the abstraction for token counting from the main evaluation API #6320

Uh oh!

shyamnamboodiripad commented Apr 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Remove the abstraction for token counting from the main evaluation API #6320

Remove the abstraction for token counting from the main evaluation API #6320

Uh oh!

Conversation

shyamnamboodiripad commented Apr 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Microsoft Reviewers: Open in CodeFlow

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

shyamnamboodiripad commented Apr 18, 2025 •

edited

Loading