From Words to Worlds: Transforming One-line Prompt into Immersive Multi-modal Digital Stories with Communicative LLM Agent
Line (geometry)
DOI:
10.48550/arxiv.2406.10478
Publication Date:
2024-06-14
AUTHORS (5)
ABSTRACT
Digital storytelling, essential in entertainment, education, and marketing, faces challenges production scalability flexibility. The StoryAgent framework, introduced this paper, utilizes Large Language Models generative tools to automate refine digital storytelling. Employing a top-down story drafting bottom-up asset generation approach, tackles key issues such as manual intervention, interactive scene orchestration, narrative consistency. This framework enables efficient of consistent narratives across multiple modalities, democratizing content creation enhancing engagement. Our results demonstrate the framework's capability produce coherent stories without reference videos, marking significant advancement automated
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES ()
CITATIONS ()
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....