Multi-turn conversation: use the context ring buffer to allow context longer than the maximum context window. #17122

@psiddh

Description

In the demo app, enter the prompt:

"How do you author a native Kernel Op on ExecuTorch?"

Alternatively, keep chatting until the demo stalls or fails to give a proper response.
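
For triage context, here is a minimal sketch of the idea named in the title, assuming a token-level ring buffer that drops the oldest tokens once the window is full. It is not ExecuTorch's implementation: `ContextRingBuffer`, `capacity`, `extend`, `window`, and `evicted` are hypothetical names, and the integers stand in for token ids.

```python
from collections import deque


class ContextRingBuffer:
    """Hypothetical helper: keeps only the most recent `capacity` tokens."""

    def __init__(self, capacity: int):
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self.capacity = capacity
        # A deque with maxlen discards items from the left when it is full.
        self._tokens = deque(maxlen=capacity)
        self.total_seen = 0

    def extend(self, tokens):
        """Append the tokens of one turn, overwriting the oldest if needed."""
        for token in tokens:
            self._tokens.append(token)
            self.total_seen += 1

    def window(self):
        """Return the tokens currently visible to the model, oldest first."""
        return list(self._tokens)

    @property
    def evicted(self):
        """Number of tokens that have fallen out of the window so far."""
        return max(0, self.total_seen - self.capacity)


# Two turns totalling 12 tokens against a window of 8 (made-up sizes).
buf = ContextRingBuffer(capacity=8)
buf.extend(range(5))      # first turn: token ids 0..4
buf.extend(range(5, 12))  # second turn: token ids 5..11
print(buf.window())
print(buf.evicted)
```

With a scheme like this the conversation can keep going past the window size, at the cost of the model losing the earliest turns. Whether the demo's stall comes from a missing or misbehaving eviction step is an open question for this issue.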

cc @larryliu0820 @mergennachin @cccclai @helunwencser @jackzhxng

Metadata

Labels

module: llm - Issues related to LLM examples and apps, and to the extensions/llm/ code

Projects

Status

To triage

Status

Todo
