Hover on the video to see corresponding text prompts
Play speed:
Long Video Showcases
Hover on the video to see corresponding text prompts
Play speed:
60s
A video featuring a woman introducing the iPhone 15, available for purchase on Shopee. The woman has a friendly and engaging demeanor, speaking clearly and confidently about the phone's features and benefits. She demonstrates the phone's camera capabilities, display quality, and user interface. The background includes subtle animations of the Shopee app and product listings. The woman wears casual, modern clothing and maintains a neutral facial expression as she interacts with the phone. The video opens with a close-up of the woman’s face, then transitions to medium shots of her handling the phone. The camera occasionally zooms in on specific features of the iPhone 15.
60s
Create a Batman and Joker fight scene using the graphics style of Grand Theft Auto V. In the video, Batman is dressed in his iconic black cape and utility belt, while Joker sports his signature green hair and purple outfit. They are engaged in a fierce hand-to-hand combat in a gritty urban environment, with buildings and vehicles in the background. Both characters display intense expressions and fluid motions as they dodge, punch, and kick each other. The scene is captured from a dynamic third-person perspective, with occasional close-ups emphasizing their intense facial expressions and body language.
60s
A 1940s film noir-style animation with realistic RTX effects, featuring a flock of angry birds. The birds have detailed feathers, expressive scowling faces, and their wings are spread as if ready to attack. The scene takes place in a dimly lit alleyway with old brick walls and flickering streetlights casting shadows. The birds are shown in a medium close-up, emphasizing their aggressive postures and animated expressions. The realistic textures and lighting create a gritty, vintage aesthetic.
60s
Animated scene features a close-up of a short fluffy monster kneeling beside a melting red candle. The art style is 3D and realistic, with a focus on lighting and texture. The mood of the painting is one of wonder and curiosity, as the monster gazes at the flame with wide eyes and open mouth. Its pose and expression convey a sense of innocence and playfulness, as if it is exploring the world around it for the first time. The use of warm colors and dramatic lighting further enhances the cozy atmosphere of the image.
60s
A dynamic over-the-shoulder perspective of a chef meticulously plating a dish in a bustling kitchen. The chef, a middle-aged man with a neatly trimmed beard and focused expression, deftly arranges ingredients on a pristine white plate. His hands move with precision, each gesture deliberate and practiced. The background shows a crowded kitchen with steaming pots, whirring blenders, and the clatter of utensils. Bright lights highlight the scene, casting shadows across the busy workspace. The camera angle captures the chef's detailed work from behind, emphasizing his skill and dedication.
60s
3D animation of a small, round, fluffy creature with big, expressive eyes explores a vibrant, enchanted forest. The creature, a whimsical blend of a rabbit and a squirrel, has soft blue fur and a bushy, striped tail. It hops along a sparkling stream, its eyes wide with wonder. The forest is alive with magical elements: flowers that glow and change colors, trees with leaves in shades of purple and silver, and small floating lights that resemble fireflies. The creature stops to interact playfully with a group of tiny, fairy-like beings dancing around a mushroom ring. The creature looks up in awe at a large, glowing tree that seems to be the heart of the forest.
60s
A group of diverse individuals, including adults and seniors, performing various exercise routines in a park. They are jogging, stretching, lifting weights, and practicing yoga. Each person is engaged in their own activity, showcasing a range of body postures and expressions of concentration and determination. The environment is bright and lively, with green grass, trees, and a clear blue sky visible in the background. The camera captures each individual in a series of medium shots, maintaining a static perspective to highlight the fluid motions of their exercises.
60s
A playful raccoon is seen playing an electronic guitar, strumming the strings with its front paws. The raccoon has distinctive black facial markings and a bushy tail. It sits comfortably on a small stool, its body slightly tilted as it focuses intently on the instrument. The setting is a cozy, dimly lit room with vintage posters on the walls, adding a retro vibe. The raccoon's expressive eyes convey a sense of joy and concentration. Medium close-up shot, focusing on the raccoon's face and hands interacting with the guitar.
60s
Steampunk-themed animation, set in the 1800s, featuring two rival characters engaged in a tense gun duel. Both characters are dressed in intricate steampunk attire, with goggles and brass mechanical elements visible on their outfits. One character is a stern-faced gentleman with a top hat and a long coat, while the other is a fierce woman with a leather jacket and a bandanna. They stand facing each other in a dimly lit, foggy alleyway filled with old machinery and gears. Each holds a custom-made revolver, emphasizing the detailed mechanisms and steam-powered elements. The background showcases a gritty industrial cityscape with smokestacks and gas lamps. Medium close-up shot focusing on the intense expressions and hand movements of the characters during the duel.
60s
A playful, animated panda dancing joyfully under a blanket of snow. The panda moves gracefully, performing lively dance steps that include twirling and hopping. The scene is set in a serene winter landscape with gently falling snowflakes and tall pine trees in the background. The panda has soft, fluffy fur and expressive, joyful eyes. The camera captures the panda from a medium close-up angle, focusing on the panda’s energetic movements and the peaceful snowy environment surrounding it.
60s
A dynamic time-lapse video showing the rapidly moving scenery from the window of a speeding train. The camera captures various elements such as lush green fields, towering trees, quaint countryside houses, and distant mountain ranges passing by quickly. The train window frames the view, adding a sense of speed and motion as the landscape rushes past. The camera remains static but emphasizes the fast-paced movement outside. The overall atmosphere is serene yet exhilarating, capturing the essence of travel and exploration. Medium shot focusing on the train window and the rushing scenery beyond.
60s
A dynamic and chaotic scene in a dense forest during a heavy rainstorm, capturing a real girl frantically running through the foliage. Her wild hair flows behind her as she sprints, her arms flailing and her face contorted in fear and desperation. Behind her, various animals—rabbits, deer, and birds—are also running, creating a frenzied atmosphere. The girl's clothes are soaked, clinging to her body, and she is screaming and shouting as she tries to escape. The background is a blur of greenery and rain-drenched trees, with occasional glimpses of the darkening sky. A wide-angle shot from a low angle, emphasizing the urgency and chaos of the moment.
Ultra-Long Video Showcases
Play speed:
240s
A dramatic and dynamic scene in the style of a disaster movie, depicting a powerful tsunami rushing through a narrow alley in Bulgaria. The water is turbulent and chaotic, with waves crashing violently against the walls and buildings on either side. The alley is lined with old, weathered houses, their facades partially submerged and splintered. The camera angle is low, capturing the full force of the tsunami as it surges forward, creating a sense of urgency and danger. People can be seen running frantically, adding to the chaos. The background features a distant horizon, hinting at the larger scale of the tsunami. A dynamic, sweeping shot from a low-angle perspective, emphasizing the movement and intensity of the event.
240s
A single white sheep bending down to drink water from a calm river. The sheep has fluffy wool, long curved horns, and soft brown eyes. It is positioned near the riverbank, with its front legs partially submerged in the clear water. The river flows gently, reflecting the surrounding greenery and blue sky. The background shows lush grass and trees along the riverbank, creating a serene pastoral landscape. The sheep's body is slightly tilted as it bends down to drink, emphasizing a natural and tranquil motion. Medium close-up shot focusing on the sheep and the river.
240s
A close-up shot of a ceramic teacup slowly pouring water into a glass mug. The water flows smoothly from the spout of the teacup into the mug, creating gentle ripples as it fills up. Both cups have detailed textures, with the teacup having a matte finish and the glass mug showcasing clear transparency. The background is a blurred kitchen countertop, adding context without distracting from the central action. The pouring motion is fluid and natural, emphasizing the interaction between the two cups.
Comparisons
LongLive exhibits strong prompt compliance, smooth transitions, and high long-range consistency while sustaining high throughput. Compared to ours, SkyReels-V2 shows weaker long-range consistency and lower throughput. Self-Forcing faces quality degradation on longer videos.
Play speed:
Self-Forcing Generation FPS: 17.0
SkyReels-V2 Generation FPS: 0.49
LongLive Generation FPS: 20.7
0s–10s: In a warmly lit home office, a bearded Asian man in his early thirties types intently on a laptop. An orange tabby with bright green eyes sits beside him, a steaming coffee mug nearby, bookshelves framing the cozy scene.
10s–20s: He pauses to rub his right forearm, gaze fixed on the screen; the cat’s tail curls lazily, eyes following the motion.
20s–30s: Hands return to the keys; the cat turns its head toward his concentrated face, ears perked.
30s–40s: He glances at the cat and murmurs a few gentle words, smiling as their eyes meet; the cat settles into a loaf, tail loosely wrapped.
40s–50s: He scratches beneath the cat’s chin; it purrs, eyes half-closed.
50s–60s: He pats the cat’s head twice, then resumes typing while it blinks slowly, tail curled in contentment.
0s–10s: In a warmly lit home office, a bearded Asian man in his early thirties types intently on a laptop. An orange tabby with bright green eyes sits beside him, a steaming coffee mug nearby, bookshelves framing the cozy scene.
10s–20s: He pauses to rub his right forearm, gaze fixed on the screen; the cat’s tail curls lazily, eyes following the motion.
20s–30s: Hands return to the keys; the cat turns its head toward his concentrated face, ears perked.
30s–40s: He glances at the cat and murmurs a few gentle words, smiling as their eyes meet; the cat settles into a loaf, tail loosely wrapped.
40s–50s: He scratches beneath the cat’s chin; it purrs, eyes half-closed.
50s–60s: He pats the cat’s head twice, then resumes typing while it blinks slowly, tail curled in contentment.
0s–10s: In a warmly lit home office, a bearded Asian man in his early thirties types intently on a laptop. An orange tabby with bright green eyes sits beside him, a steaming coffee mug nearby, bookshelves framing the cozy scene.
10s–20s: He pauses to rub his right forearm, gaze fixed on the screen; the cat’s tail curls lazily, eyes following the motion.
20s–30s: Hands return to the keys; the cat turns its head toward his concentrated face, ears perked.
30s–40s: He glances at the cat and murmurs a few gentle words, smiling as their eyes meet; the cat settles into a loaf, tail loosely wrapped.
40s–50s: He scratches beneath the cat’s chin; it purrs, eyes half-closed.
50s–60s: He pats the cat’s head twice, then resumes typing while it blinks slowly, tail curled in contentment.
0s–10s: In a serene garden of blooming flowers and lush greenery, a joyful Madonna in a flowing pastel, embroidered robe holds a rosary, smiling warmly as she gently blesses. Clear blue sky; soft sunlight through leaves.
10s–20s: She brings her free hand to her chest, deepening the blessing.
20s–30s: A light breeze stirs; flowers sway softly.
30s–40s: A butterfly lands on her outstretched hand; dappled light plays across the scene.
40s–50s: The butterfly lifts off, circling her head before disappearing into the foliage, leaving a faint shimmer.
50s–60s: She smiles and begins to speak softly, lips moving as if offering comfort.
0s–10s: In a serene garden of blooming flowers and lush greenery, a joyful Madonna in a flowing pastel, embroidered robe holds a rosary, smiling warmly as she gently blesses. Clear blue sky; soft sunlight through leaves.
10s–20s: She brings her free hand to her chest, deepening the blessing.
20s–30s: A light breeze stirs; flowers sway softly.
30s–40s: A butterfly lands on her outstretched hand; dappled light plays across the scene.
40s–50s: The butterfly lifts off, circling her head before disappearing into the foliage, leaving a faint shimmer.
50s–60s: She smiles and begins to speak softly, lips moving as if offering comfort.
0s–10s: In a serene garden of blooming flowers and lush greenery, a joyful Madonna in a flowing pastel, embroidered robe holds a rosary, smiling warmly as she gently blesses. Clear blue sky; soft sunlight through leaves.
10s–20s: She brings her free hand to her chest, deepening the blessing.
20s–30s: A light breeze stirs; flowers sway softly.
30s–40s: A butterfly lands on her outstretched hand; dappled light plays across the scene.
40s–50s: The butterfly lifts off, circling her head before disappearing into the foliage, leaving a faint shimmer.
50s–60s: She smiles and begins to speak softly, lips moving as if offering comfort.
0s–10s: A serene model with long pink hair stands amid gently falling sakura. Swirling pink smoke partially veils her; a simple white gown complements the palette, creating a dreamy, ethereal mood.
10s–20s: She slowly raises her hand, fingertips grazing the mist as if caressing a petal.
20s–30s: A soft breeze stirs; more petals flutter around her.
30s–40s: She closes her eyes; lashes rest softly as the calm endures.
40s–50s: She takes a subtle step forward; a small bird flits in and settles on a nearby branch.
50s–60s: The bird alights on her outstretched finger; the pink smoke thickens with hints of cyan-blue, gently enveloping the scene.
0s–10s: A serene model with long pink hair stands amid gently falling sakura. Swirling pink smoke partially veils her; a simple white gown complements the palette, creating a dreamy, ethereal mood.
10s–20s: She slowly raises her hand, fingertips grazing the mist as if caressing a petal.
20s–30s: A soft breeze stirs; more petals flutter around her.
30s–40s: She closes her eyes; lashes rest softly as the calm endures.
40s–50s: She takes a subtle step forward; a small bird flits in and settles on a nearby branch.
50s–60s: The bird alights on her outstretched finger; the pink smoke thickens with hints of cyan-blue, gently enveloping the scene.
0s–10s: A serene model with long pink hair stands amid gently falling sakura. Swirling pink smoke partially veils her; a simple white gown complements the palette, creating a dreamy, ethereal mood.
10s–20s: She slowly raises her hand, fingertips grazing the mist as if caressing a petal.
20s–30s: A soft breeze stirs; more petals flutter around her.
30s–40s: She closes her eyes; lashes rest softly as the calm endures.
40s–50s: She takes a subtle step forward; a small bird flits in and settles on a nearby branch.
50s–60s: The bird alights on her outstretched finger; the pink smoke thickens with hints of cyan-blue, gently enveloping the scene.
0s–10s: Empty stage under a lone ghost light; red velvet seats fade into darkness. A solo ballet dancer in a simple leotard rehearses—each landing kicks up rosin dust. Muscles ripple with controlled breath; a water bottle sits at the wings.
10s–20s: She traces an imaginary line toward the ghost light, settling into a poised phrase.
20s–30s: She flows into a clean pirouette, fingers still sketching that invisible line.
30s–40s: Spin blooms into a leap; feet kiss the floor before a graceful landing.
40s–50s: She holds a serene stillness, eyes closed in quiet focus.
50s–60s: A light jump, soft landing; a stagehand slips in at the side to adjust a prop without breaking her focus.
0s–10s: Empty stage under a lone ghost light; red velvet seats fade into darkness. A solo ballet dancer in a simple leotard rehearses—each landing kicks up rosin dust. Muscles ripple with controlled breath; a water bottle sits at the wings.
10s–20s: She traces an imaginary line toward the ghost light, settling into a poised phrase.
20s–30s: She flows into a clean pirouette, fingers still sketching that invisible line.
30s–40s: Spin blooms into a leap; feet kiss the floor before a graceful landing.
40s–50s: She holds a serene stillness, eyes closed in quiet focus.
50s–60s: A light jump, soft landing; a stagehand slips in at the side to adjust a prop without breaking her focus.
0s–10s: Empty stage under a lone ghost light; red velvet seats fade into darkness. A solo ballet dancer in a simple leotard rehearses—each landing kicks up rosin dust. Muscles ripple with controlled breath; a water bottle sits at the wings.
10s–20s: She traces an imaginary line toward the ghost light, settling into a poised phrase.
20s–30s: She flows into a clean pirouette, fingers still sketching that invisible line.
30s–40s: Spin blooms into a leap; feet kiss the floor before a graceful landing.
40s–50s: She holds a serene stillness, eyes closed in quiet focus.
50s–60s: A light jump, soft landing; a stagehand slips in at the side to adjust a prop without breaking her focus.
0s–10s: In a lantern-lit temple, David and Michal stand under a flowered chuppah; he wears a white robe with gold sash and crown, she a veiled white gown with gold-ivory embroidery.
10s–20s: David takes her hand and slips a ring on her finger as the crowd watches in silence.
20s–30s: He kisses her cheek; approving murmurs rise.
30s–40s: They share a quiet, reverent gaze, hands touching beneath the chuppah.
40s–50s: David gently leads Michal around the chuppah; a small child runs in with a bouquet.
50s–60s: The child offers the bouquet with a shy smile, then steps back to watch.
0s–10s: In a lantern-lit temple, David and Michal stand under a flowered chuppah; he wears a white robe with gold sash and crown, she a veiled white gown with gold-ivory embroidery.
10s–20s: David takes her hand and slips a ring on her finger as the crowd watches in silence.
20s–30s: He kisses her cheek; approving murmurs rise.
30s–40s: They share a quiet, reverent gaze, hands touching beneath the chuppah.
40s–50s: David gently leads Michal around the chuppah; a small child runs in with a bouquet.
50s–60s: The child offers the bouquet with a shy smile, then steps back to watch.
0s–10s: In a lantern-lit temple, David and Michal stand under a flowered chuppah; he wears a white robe with gold sash and crown, she a veiled white gown with gold-ivory embroidery.
10s–20s: David takes her hand and slips a ring on her finger as the crowd watches in silence.
20s–30s: He kisses her cheek; approving murmurs rise.
30s–40s: They share a quiet, reverent gaze, hands touching beneath the chuppah.
40s–50s: David gently leads Michal around the chuppah; a small child runs in with a bouquet.
50s–60s: The child offers the bouquet with a shy smile, then steps back to watch.
KV Recaching
Prompt switching under different KV-cache strategies. No KV cache: New-prompt adherence but abrupt transitions and visual discontinuity. KV cache: Smooth visuals but new-prompt non-adherence (delayed or ignored). KV recache: Visual consistency and new-prompt adherence.
Play speed:
no KV cache
KV cache
KV recache
0s–5s: ambitious young man in a sharp suit stands arms-crossed, slight confident smile, bustling modern office behind him.
5s–10s: he uncrosses his arms, focus tightening as he's about to address the team; same office backdrop, medium shot, static.
Short Window Attention & Frame Sink
Comparison in a 20s generated video of long window attention (21 local latent frames), short-window attention (12 local), and short-window + frame-sink (9 local + 3 sink). Shorter windows boost efficiency but weaken long-range consistency; adding a frame-sink restores consistency while keeping the efficiency gains.
Play speed:
w/o Frame Sink (Window 21)
w/o Frame Sink (Window 12)
w/ Frame Sink (Window 9 + Sink3)
A serene, picturesque scene of an elderly woman living in a quaint, weathered wooden cottage surrounded by lush greenery and tall trees. She leans forward slightly in her rocking chair, reaching out to pluck a fallen leaf from the porch floor, its veins glowing silver in the moonlight. The moon casts a soft glow over the tranquil landscape, highlighting the distant hills and the twinkling stars above. The rustle of leaves and a distant owl hoot blend with the gentle creak of the rocking chair. medium‑close static
Streaming Long Tuning
The streaming long tuning pipeline. (a) Short tuning: only 5s clips are supervised, like Self-Forcing, leading to quality loss on long videos. (b) Naive long tuning: naively scaling to long sequences causes incorrect teacher supervision and OOM. (c) Streaming long tuning: our approach trains on long sequences by reusing the historical KV cache each iteration to generate the next 5s clip, then supervising it with the teacher.
Contact Us
Feel free to contact Shuai Yang at shyang@nvidia.com or Yukang Chen at yukangc@nvidia.com for any question, cooperation, and communication.
If you find this work useful, please consider citing:
@article{yang2025longlive,
title={LongLive: Real-time Interactive Long Video Generation},
author={Shuai Yang and Wei Huang and Ruihang Chu and Yicheng Xiao and Yuyang Zhao and Xianbang Wang and Muyang Li and Enze Xie and Yingcong Chen and Yao Lu and Song Hanand Yukang Chen},
year={2025},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Thank Jenga, UltraPixel, ControlNeXt, and ToonCrafter to provide us the project page's template!