Skip to content

Support reasoner video input.#25

Open
foreverlms wants to merge 21 commits into
mainfrom
maoshengl/video_reasoner_inference
Open

Support reasoner video input.#25
foreverlms wants to merge 21 commits into
mainfrom
maoshengl/video_reasoner_inference

Conversation

@foreverlms

Copy link
Copy Markdown
Collaborator

No description provided.

foreverlms and others added 17 commits June 8, 2026 03:32
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…tion

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…l-exclusion validator

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…pass-throughs

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…te_reasoner_text stub

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…ate_reasoner_text

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
…nce engine

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…decord/pkl_to_media dep)

The repo Qwen3VLProcessor runs do_sample_frames=False and expects a pre-decoded
frame list; decode with the inference-canonical torchvision.io.read_video (no
undeclared decord dep) and sample toward video_fps via Qwen smart_nframes.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…/plan superseded

Always emit reasoner_videos=[video_or_None] (like reasoner_images) so the batch
homogeneity check aligns positionally and reliably rejects an image/video/text mix.
Add superseded banners to the spec/plan docs (frame-decode + video_fps-only is final).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@foreverlms foreverlms force-pushed the maoshengl/video_reasoner_inference branch from 1b7b175 to 19bd716 Compare June 8, 2026 10:37
foreverlms and others added 2 commits June 8, 2026 03:38
… untracked)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…-commit)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@foreverlms foreverlms marked this pull request as ready for review June 8, 2026 12:03
@foreverlms foreverlms marked this pull request as draft June 8, 2026 12:20
…t for reasoner_videos key

- _make_reasoner_sample_args gains video_fps
- text-only / with-image get_sample_data tests assert the always-present reasoner_videos:[None]
- add test_get_sample_data_reasoner_with_video (monkeypatched decoder)
- drop redundant lower-level _get_reasoner_sample_data duplicates (public get_sample_data set covers them)

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
@foreverlms foreverlms marked this pull request as ready for review June 9, 2026 14:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant