Expanding ChatPromptParser to handle other content types


### Discussed in https://github.com/microsoft/semantic-kernel/discussions/11012

<div type='discussions-op-text'>

<sup>Originally posted by **glorious-beard** March 17, 2025</sup>
Now that OpenAI accepts file inputs (PDFs) in addition to text and images, are there plans to let `ChatPromptParser` parse additional content tags for other content types, such as `BinaryContent` and `AudioContent`? (Claude can handle PDFs too; [see here](https://docs.anthropic.com/en/docs/build-with-claude/pdf-support).)

Additional tags could include:
* '&lt;audio&gt; *(base64 audio stream)* &lt;/audio&gt;' - Parsed into an `AudioContent` instance
* '&lt;binary mimeType="*(mime type)*"&gt; *(base64 content)* &lt;/binary&gt;' - Parsed into a `BinaryContent` instance, with `mimeType` defaulting to "application/octet-stream" if not present
* '&lt;pdf&gt; *(base64 content)* &lt;/pdf&gt;' - Parsed into a new `PdfContent` class derived from `BinaryContent`
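
To make the proposed mapping concrete, here is a rough, standalone Python sketch of how the three tags could be turned into content objects. It is not the Semantic Kernel implementation and does not use its API: the dataclasses below are hypothetical stand-ins that only borrow the class names from the list above, and `parse_content_tags` is an illustrative helper name. It assumes tags are not nested and that the body is plain base64.

```python
import base64
import re
from dataclasses import dataclass


# Hypothetical stand-ins for the content classes named in the proposal.
# These are NOT the Semantic Kernel types; they only illustrate the mapping.
@dataclass
class BinaryContent:
    data: bytes
    mime_type: str = "application/octet-stream"  # default proposed for <binary>


@dataclass
class AudioContent(BinaryContent):
    pass  # the proposal gives <audio> no mimeType, so the default is inherited


@dataclass
class PdfContent(BinaryContent):
    mime_type: str = "application/pdf"


# Matches <audio>, <binary ...> or <pdf> with a matching closing tag.
TAG_PATTERN = re.compile(
    r"<(?P<tag>audio|binary|pdf)\b(?P<attrs>[^>]*)>(?P<body>.*?)</(?P=tag)>",
    re.DOTALL,
)
MIME_PATTERN = re.compile(r'mimeType="([^"]*)"')


def parse_content_tags(prompt: str) -> list[BinaryContent]:
    """Return one content object per recognized tag, in document order."""
    items: list[BinaryContent] = []
    for match in TAG_PATTERN.finditer(prompt):
        # Strip whitespace around and inside the base64 payload before decoding.
        data = base64.b64decode("".join(match.group("body").split()))
        tag = match.group("tag")
        if tag == "audio":
            items.append(AudioContent(data))
        elif tag == "pdf":
            items.append(PdfContent(data))
        else:
            mime = MIME_PATTERN.search(match.group("attrs"))
            mime_type = mime.group(1) if mime else "application/octet-stream"
            items.append(BinaryContent(data, mime_type))
    return items
```

A regex keeps the sketch short; a real parser would more likely reuse whatever XML handling `ChatPromptParser` already applies to text and image tags.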

My application makes heavy use of YAML prompt templates, so this would save me from manually building chat histories for any operation involving inputs beyond text and images.

I volunteer to add the above if it's not already planned for a near-term release.

(Maybe this is an extension of [this discussion](https://github.com/microsoft/semantic-kernel/discussions/8487)?)</div>


Expanding ChatPromptParser to handle other content types #11044
