Request: Multi-view Cosmos 3 Generator for automotive/robotics via CLI or Python scripts (non-vLLM)

Hi Cosmos team,

I want to use the Cosmos 3 Generator for multi-view video generation in automotive use cases (and potentially robotics), but my assumption is to run it through CLI or Python scripts, similar to this repository, not via vLLM serving.

I found a related vLLM-based example in [another repository](https://github.com/NVIDIA/cosmos/blob/2bbc0b7254b2abd85fa44416ec016638afffc2fc/cookbooks/cosmos3/generator/action/run_fd_with_vllm.ipynb#L496)

However, what I am specifically looking for is guidance for non-vLLM workflows:

1. Recommended CLI or Python-script entry points for multi-view generation
2. Required input format/configuration for automotive or robotics multi-view scenarios
3. GPU requirements for this workflow (minimum and recommended)

Do you have plans to publish code or documentation in this repository for automotive/robotics-oriented multi-view generation using CLI/Python scripts?

If this is already on the roadmap, a rough timeline or pointer to upcoming docs/examples would be very helpful.

Thank you.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request: Multi-view Cosmos 3 Generator for automotive/robotics via CLI or Python scripts (non-vLLM) #36

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Request: Multi-view Cosmos 3 Generator for automotive/robotics via CLI or Python scripts (non-vLLM) #36

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions