@featzhang
Member

What is the purpose of the change

This PR adds documentation for the Triton inference model in Flink SQL, which lets users run real-time model inference by calling NVIDIA Triton Inference Server from Flink.

The documentation introduces how to define, configure, and use Triton-backed models in Flink SQL, covering common inference scenarios as well as advanced production use cases.

What is the change

  • Add a new Triton model documentation page under docs/connectors/models
    • Overview of Triton integration and supported features
    • End-to-end SQL examples for model creation and inference
    • Advanced configurations (authentication, headers, batching, compression, priority)
    • Examples for array-type inference and stateful / sequence models
  • Add corresponding Chinese documentation to keep EN / ZH docs consistent
  • Extend existing SQL documentation to reference Triton models:
    • CREATE MODEL syntax
    • ML_PREDICT usage for model inference
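
As background for the array-type inference examples listed above, the sketch below shows roughly what a request to Triton's HTTP inference endpoint looks like under the KServe v2 protocol. It is illustrative only: this PR does not state how the Flink integration builds its requests, and the model name, tensor name, host, and helper functions are hypothetical.

```python
import json


def infer_url(base_url, model_name):
    """Return the KServe v2 inference URL for a model on a Triton HTTP endpoint."""
    return f"{base_url.rstrip('/')}/v2/models/{model_name}/infer"


def build_infer_request(input_name, values, datatype="FP32"):
    """Build a v2 inference request body holding one input tensor.

    The array is sent as a single row, so the shape is [1, len(values)].
    """
    return {
        "inputs": [
            {
                "name": input_name,        # hypothetical tensor name
                "shape": [1, len(values)],
                "datatype": datatype,
                "data": list(values),
            }
        ]
    }


# Hypothetical model and tensor names; 8000 is Triton's default HTTP port.
url = infer_url("http://localhost:8000", "my_model")
body = build_infer_request("INPUT0", [0.1, 0.2, 0.3])
print(url)
print(json.dumps(body))
```

The body would be sent as an HTTP POST to the printed URL. The input name, datatype, and shape have to match the model's configuration on the Triton server.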

Why is the change needed

Triton is a widely used, high-performance inference serving system supporting multiple ML frameworks.
Providing first-class documentation for Triton model integration helps users:

  • Integrate online model inference into Flink SQL pipelines more easily
  • Understand supported configuration options and best practices
  • Adopt Flink as a unified platform for real-time data processing and AI inference

How was this change tested

  • Documentation-only change
  • Verified SQL examples and configuration options for correctness and consistency

Does this PR introduce any user-facing change

Yes.
This PR introduces new user-facing documentation describing how to use Triton inference models in Flink SQL.

Checklist

  • Documentation updated (English and Chinese)
  • SQL examples provided
  • No backward compatibility impact
  • No code or runtime behavior changes

@flinkbot
Collaborator

flinkbot commented Jan 29, 2026

CI report:

Bot commands

The @flinkbot bot supports the following commands:
  • @flinkbot run azure: re-run the last Azure build

@github-actions (bot) added the community-reviewed label ("PR has been reviewed by the community.") on Jan 29, 2026