How to use Wan 2.1 (Alibaba) to generate videos on Monica?
Follow the Comprehensive and Easy Guide to Change Your Video Generation Habits

AI video generators are reshaping content creation in 2025. Leading tech companies are launching advanced models designed to meet the demands of large-scale applications for both developers and the general public. This rapid competition has driven significant improvements in video quality, including more realistic motion, refined physical effects, and enhanced visual fidelity.
Wan AI is one of the strong contenders in the AI video generation field, competing with Kling and Google’s Veo 2. Wan AI focuses on generating highly realistic videos with smooth motion and enhanced physical effects.
This article will guide you on how to use Wan AI, explore its features, and explain how to generate videos.
Let’s first take a look at some stunning videos created with Wan 2.1.
What is Wan 2.1?
Wan 2.1 is Alibaba's latest open-source AI model, supporting text-to-video and image-to-video generation to create high-quality, realistic content. Compared to its predecessors, Wan 2.1 features enhanced multimodal capabilities, enabling users to flexibly create and edit videos using text or image inputs. Additionally, it supports multi-scenario image creation, including text-to-image, image-to-image, sketch-based drawing, virtual models, and personal portraits, fully catering to diverse artistic creation needs.
Key Features of Wan 2.1
· Bilingual Video Model: Wan 2.1 is the first video model capable of generating both Chinese and English text within videos, and this robust text generation enhances its practicality. It can produce cinematic-level text and animations, supporting various font applications across multiple scenarios, including special effects fonts, poster fonts, and real-world font displays, meeting diverse professional requirements.
· Multi-Video Tasks: Offers powerful capabilities for text-to-video and image-to-video generation, as well as video editing, video-to-audio conversion, and other advanced video-related tasks.
· Open-Source Access: Wan 2.1 is available on GitHub and Hugging Face, making it accessible to developers.
· Benchmark Performance: Wan 2.1 ranks highly on the VBench benchmark for video realism and multi-object interactions.
Step-by-Step Guide to Generate Videos
Step 0: Sign Up and Log In to Monica
1️⃣ Click the button below to visit the Monica Image homepage and follow the instructions to register and log in:
2️⃣ Click Video Generation
Select Video Generation to start creating videos.

Step 1: Select the Generation Type
First, choose the type of generation:
- Text-to-Video
- Image-to-Video

Let’s start with Text-to-Video:
Step 2: Select the Model
Choose Wan 2.1 (14B) under the Model option.


Now, let’s switch to Image-to-Video and continue the introduction.

Step 3: Click to upload a reference image.


Step 4: Enter Prompt - Video Description
- Input a detailed Video Description.
If you have great ideas but are unsure how to describe them accurately and in detail, you can seek help from Monica's free AI assistant. Monica is compatible with various chat assistants, such as GPT-4o, DeepSeek-R1 and more.

See the comparison below to understand how important a detailed prompt is.
Generation with a Simple Prompt:
A dog playing on a grass.
Generation with a Detailed Prompt:
A playful dog enjoying its time on a lush green meadow, surrounded by soft sunlight and vibrant grass, capturing a joyful and lively moment.
Isn't it immediately clear how important a good prompt is? So, try to describe your video prompt as thoroughly as possible. Or simply ask Monica's free AI assistant to help you brainstorm and add more details.
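One way to make detailed prompts a habit is to assemble them from a few named pieces. The sketch below is only an illustration of that idea: `PromptParts` and `build_prompt` are hypothetical helpers written for this guide, not part of Monica or Wan 2.1. It rebuilds the simple and detailed dog prompts from the comparison above.

```python
from dataclasses import dataclass


@dataclass
class PromptParts:
    """Hypothetical container for the pieces of a video description."""
    subject: str          # who or what is in the shot
    action: str           # what the subject is doing
    setting: str = ""     # where the scene takes place
    lighting: str = ""    # light and atmosphere
    mood: str = ""        # overall feeling of the clip


def build_prompt(parts: PromptParts) -> str:
    """Join the non-empty parts into one descriptive sentence."""
    sentence = f"{parts.subject} {parts.action}"
    if parts.setting:
        sentence += f" {parts.setting}"
    # Lighting and mood are optional extras appended as comma-separated clauses.
    extras = [p for p in (parts.lighting, parts.mood) if p]
    if extras:
        sentence += ", " + ", ".join(extras)
    return sentence + "."


# The simple prompt: only a subject and an action.
simple = build_prompt(PromptParts("A dog", "playing on a grass"))

# The detailed prompt: every part filled in.
detailed = build_prompt(PromptParts(
    subject="A playful dog",
    action="enjoying its time",
    setting="on a lush green meadow",
    lighting="surrounded by soft sunlight and vibrant grass",
    mood="capturing a joyful and lively moment",
))

print(simple)
print(detailed)
```

Filling in each field before writing the description is a quick check that the prompt covers subject, action, setting, lighting, and mood. The resulting text can then be pasted into the Video Description box.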
If you want to learn more about Prompt Engineering, we recommend watching this YouTube video.
Step 5: Configure Settings
Set the parameters for video generation:
Resolution: 480p/580p
Aspect ratio: 16:9/9:16
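If you plan several videos in advance, it can help to note the settings for each one and check them against the options listed above. The following is a minimal sketch under that assumption: `VideoSettings` is a hypothetical helper for personal bookkeeping, not a Monica API, and it only knows the two resolutions and two aspect ratios named in this guide.

```python
from dataclasses import dataclass

# Options listed in this guide. These names are hypothetical, not a Monica API.
ALLOWED_RESOLUTIONS = ("480p", "580p")
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16")


@dataclass(frozen=True)
class VideoSettings:
    """Hypothetical record of the settings chosen in Step 5."""
    resolution: str = "480p"
    aspect_ratio: str = "16:9"

    def __post_init__(self) -> None:
        # Reject anything that is not one of the options shown in the guide.
        if self.resolution not in ALLOWED_RESOLUTIONS:
            raise ValueError(f"Unsupported resolution: {self.resolution}")
        if self.aspect_ratio not in ALLOWED_ASPECT_RATIOS:
            raise ValueError(f"Unsupported aspect ratio: {self.aspect_ratio}")

    @property
    def orientation(self) -> str:
        """Return 'landscape' for wide ratios and 'portrait' for tall ones."""
        width, height = (int(part) for part in self.aspect_ratio.split(":"))
        return "landscape" if width > height else "portrait"


settings = VideoSettings(resolution="580p", aspect_ratio="9:16")
print(settings.resolution, settings.aspect_ratio, settings.orientation)
```

As a rule of thumb, 16:9 suits landscape playback on desktop, while 9:16 suits vertical, mobile-first viewing.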

Step 6: Generate Video
- Click Confirm to generate the video.

Final Video Generation with Wan 2.1
Conclusion
Benefits of Using Wan 2.1 on Monica
Monica is an all-in-one platform. In addition to free access to Wan 2.1, you can effortlessly use other video generation models on the same platform, such as Veo 2, Kling, and Hailuo. We encourage everyone to try and share their generated videos. If you have any questions, feel free to email us.
Below are some case examples:
Veo 2:
Kling 1.6:
Hailuo:
Monica is an all-in-one platform where you can not only use video generation models but also access a variety of other tools such as AI Background Remover and AI Logo Generator. Monica also offers PDF tools, including ChatPDF and PDF Translator, along with many other useful features waiting for you to discover.
Enhance your work and learning efficiency—Monica is waiting for you to explore!