NFDI4DS | UHH-SEMS - Publication Details

From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

Line (geometry)

DOI: 10.48550/arxiv.2406.10478 Publication Date: 2024-06-14

Abstract Supplemental Material References Cited by

AUTHORS (5)

Samuel S. Sohn

Danrui Li

Sen Zhang

Che‐Jui Chang

Mubbasir Kapadia

ABSTRACT

Digital storytelling, essential in entertainment, education, and marketing, faces challenges production scalability flexibility. The StoryAgent framework, introduced this paper, utilizes Large Language Models generative tools to automate refine digital storytelling. Employing a top-down story drafting bottom-up asset generation approach, tackles key issues such as manual intervention, interactive scene orchestration, narrative consistency. This framework enables efficient of consistent narratives across multiple modalities, democratizing content creation enhancing engagement. Our results demonstrate the framework's capability produce coherent stories without reference videos, marking significant advancement automated

SUPPLEMENTAL MATERIAL

Coming soon ....

REFERENCES ()

CITATIONS ()

EXTERNAL LINKS

OPENALEX - Publications OPENAIRE - Products

PlumX Metrics

From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent

RECOMMENDATIONS

FAIR ASSESSMENT

Coming soon ....

JUPYTER LAB

Coming soon ....