How to use Wan 2.1(Alibaba) to generate videos on Monica?

Follow the Comprehensive and Easy Guide to Change Your Image Generation Habits

how to use wan 2.1 to generate videos

AI video generators are reshaping 2025. Leading tech companies are launching advanced models designed to meet the demands of large-scale applications for both developers and the general public. The rapid evolution of this competition has driven significant improvements in video quality, including more realistic motion, refined physical effects, and enhanced visual fidelity.

Wan AI F is one of the strong contenders in the AI video generation field, competing with Kling and Google’s Veo 2. Wan AI focuses on generating highly realistic videos with smooth motion and enhanced physical effects.

This article will guide you on how to use Wan AI, explore its features, and explain how to generate videos.


Let’s first take a look at some stunning videos created with Wan 2.1.

0:00
/
wan 2.1 generated video-1
0:00
/
wan 2.1 generated video-2
0:00
/
wan 2.1 generated video-3
0:00
/
wan 2.1 generated video-4

What is WAN 2.1

Wan 2.1 is Alibaba's latest open-source AI model, supporting text-to-video and image-to-video generation to create high-quality, realistic content. Compared to its predecessors, Wan 2.1 features enhanced multimodal capabilities, enabling users to flexibly create and edit videos using text or image inputs. Additionally, it supports multi-scenario image creation, including text-to-image, image-to-image, sketch-based drawing, virtual models, and personal portraits, fully catering to diverse artistic creation needs.

Key Features of Wan 2.1

· Bilingual Video Model: Wan 2.1 is the first video model capable of generating videos in both Chinese and English, featuring robust text generation capabilities that enhance its practicality. It can produce cinematic-level text and animations, supporting various font applications across multiple scenarios, including special effects fonts, poster fonts, and real-world font displays, meeting diverse professional requirements.

· Multi-Video Tasks: Offers powerful capabilities for text-to-video and image-to-video generation, as well as video editing, video-to-audio conversion, and other advanced video-related tasks.

· Open-Source Access: Wan 2.1 is available on GitHub and Hugging Face, making it accessible to developers.

· Benchmark Performance: Wan 2.1 ranks highly on the VBench benchmark for video realism and multi-object interactions.


Step-by-Step Guide to Generate Videos

Step 0: Sign Up and Log In to Monica

1️⃣  Click the button below to visit the Monica Image homepage and follow the instructions to register and log in:

2️⃣  Click Video Generation
Select Video Generation to start creating videos.

select video generation
select video generation

Step 1: Select the Generation Type
First, choose the type of generation:

  • Text-to-Video
  • Image-to-Video
select Text2Vide or Image2Video
select Text2Vide or Image2Video

Let’s start with Text-to-Video:


Step 2: Select the Model
Choose Wan 2.1 (14B) under the Model option.

switch video model
switch video model
💡
Below the model selection, you can switch between Text-to-Video or Image-to-Video at any time.
switch model Text2Vide or Image2Video freely
switch model Text2Vide or Image2Video freely

Now, let’s switch to Image-to-Video and continue the introduction.

image2video generation
image2video generation

Step 3: Click to upload a reference image.

Step 4: Enter Prompt - Video Description

  • Input a detailed Video Description.
💡
Tips:
If you have great ideas but are unsure how to describe them accurately and in detail, you can seek help from Monica's free GPT assistant. Monica is compatible with various chat assistants, such as GPT-4o, Wan 2.1, DeepSeek-R1 and more.
input video description
input video description

See the comparison below to understand how important a detailed prompt  is.

Generation of a image with a Simple Prompt:

A dog playing on a grass.
0:00
/
simple prompt generation

Generation of a image with a Detailed Prompt:

A playful dog enjoying its time on a lush green meadow, surrounded by soft sunlight and vibrant grass, capturing a joyful and lively moment.
0:00
/
detailed prompt generation

Isn't it immediately clear how important a good prompt is? So, try to describe your video prompt as thoroughly as possible. Or simply ask Monica's free AI assistant, Wan 2.1, to help you brainstorm and add more details.

If you want to learn more about Prompt Engineering, we recommend watching this YouTube video.

Prompt Engineering Youtube Video

Step 5: Configure Settings

Set the parameters for video generation

Resolution: 480p/580p
Aspect ratio: 16:9/9:16
Set the parameters
Set the parameters

Step 6: Generate Video

  • Click Confirm to generate the video with one click.
generate the video
generate the video

Final Video Generation with Wan 2.1

0:00
/
final video generated by Wan 2.1

Conclusion

Benefits of Using WAN 2.1 on Monica


Monica is an all-in-one platform. In addition to free access to Wan 2.1, you can effortlessly use other image generation models on the same platform, such as Veo 2, Kling, Hailuo, etc.. We encourage everyone to try and share their generated videos. If you have any questions, feel free to email us.

Below are some case examples:


Veo 2:

0:00
/
veo 2 generated video

Kling 1.6

0:00
/
Kling generated video-1
0:00
/
Kling generated video-2

Hailuo:

0:00
/
Hailuo generated video

Monica is an all-in-one platform where you can not only use video generation models but also access a variety of other tools such as AI Background Remover and AI Logo Generator. Additionally, Monica also offers PDF tools, including ChatPDF and PDF Translator , along with many other incredibly useful features waiting for you to discover and use.

Enhance your work and learning efficiency—Monica is waiting for you to explore!

Subscribe to Monica Blog

Don’t miss out on the latest issues. Sign up now to get access to the library of members-only issues.
[email protected]
Subscribe