
MiniMax-Remover Cog Wrapper

This repository contains a Cog wrapper for the MiniMax-Remover video object removal model.

🚀 Try it on Replicate

Model API: ayushunleashed/minimax-remover

Overview

MiniMax-Remover is a fast and effective video object remover based on minimax optimization. This Cog wrapper provides a convenient API for running the model on Replicate with video and mask inputs.

Example video: YouTube

Repository Structure

object-remover/
├── cog.yaml                    # Cog configuration
├── predict.py                  # Cog prediction interface
├── download_weights.py         # Weight downloader script
├── minimax_remover/           # Git submodule
│   ├── README.md
│   ├── requirements.txt
│   ├── pipeline_minimax_remover.py
│   ├── transformer_minimax_remover.py
│   └── ...
├── sample_data/               # Sample videos for testing
│   ├── racoon_video.mp4       # Input video with a raccoon
│   └── racoon_mask.mp4        # Mask video (white areas mark regions to remove)
└── README.md                  # This file

Original Repository

This wrapper is built for the MiniMax-Remover project. The original repository contains the core implementation and is included here as a Git submodule.

Local Testing with Cog

1. Clone with Submodule

# Clone the repository
git clone https://github.com/AyushUnleashed/object-remover.git
cd object-remover

# Initialize and update the submodule
git submodule update --init --recursive

2. Install Cog

Follow the official Cog installation guide:

# On macOS
brew install replicate/tap/cog

# On Linux/Windows WSL
sudo curl -o /usr/local/bin/cog -L "https://github.com/replicate/cog/releases/latest/download/cog_$(uname -s)_$(uname -m)"
sudo chmod +x /usr/local/bin/cog

3. Test Locally

# manually download weights first (optional)
python download_weights.py

# Build the Docker image
cog build

# Test with sample videos
cog predict -i video=@sample_data/racoon_video.mp4 -i mask=@sample_data/racoon_mask.mp4

Usage

Input Requirements

  • Video: MP4 format recommended, max 81 frames
  • Mask: Video file where white areas indicate objects to remove
  • Both video and mask should have the same frame count
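Since mismatched frame counts are a common failure mode, a quick pre-flight check can catch them before a run. The sketch below is illustrative (the helper name is hypothetical); it assumes you have already read the frame counts, e.g. with OpenCV's `cv2.CAP_PROP_FRAME_COUNT`.

```python
def validate_frame_counts(video_frames: int, mask_frames: int, max_frames: int = 81) -> None:
    """Raise if a video/mask pair violates the input requirements above.

    Frame counts can be read with OpenCV, for example:
        cap = cv2.VideoCapture("your_video.mp4")
        n = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    """
    if video_frames != mask_frames:
        raise ValueError(f"Frame count mismatch: video={video_frames}, mask={mask_frames}")
    if video_frames > max_frames:
        raise ValueError(f"Too many frames: {video_frames} > {max_frames}")
```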

How to Generate Mask Videos

If you don't have a mask video yet, you can create one using the following workflow:

Requirements

  1. Original video you want to remove objects from
  2. Binary masked video of the object(s) you want to remove

Step-by-Step Process

  1. Extract first frame: Use Frame Extractor to get the first frame of your original video
  2. Generate mask: Use SAM-2 to create multiple masked images from the first frame
  3. Select target mask: Choose the mask that covers the subject(s) you want to remove
  4. Create mask video: Use XMem with your selected mask image and the original video to generate the complete mask video

This workflow ensures you have a properly formatted mask video that tracks your target object(s) throughout all frames.
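Whichever tool produces the per-frame masks, the final mask video should be strictly black and white. A minimal NumPy sketch of the binarization step (the `binarize_mask` helper and its threshold are illustrative, not part of the model):

```python
import numpy as np

def binarize_mask(frame: np.ndarray, threshold: int = 127) -> np.ndarray:
    """Map a grayscale mask frame to pure black (keep) / white (remove)."""
    return np.where(frame > threshold, 255, 0).astype(np.uint8)
```

Stacking the binarized frames back into an MP4 (e.g. with `cv2.VideoWriter`) then yields a mask video in the format described above.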

API Parameters

  • video (required): Input video file
  • mask (required): Mask video file
  • num_frames: Number of frames to process
  • height: Output video height
  • width: Output video width
  • num_inference_steps: Denoising steps
  • iterations: Mask dilation iterations
  • seed: Random seed (optional)
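The `iterations` parameter controls how far the mask is grown before inpainting, which helps cover soft object edges. The model's exact dilation kernel isn't documented here; as an illustration, binary dilation with a 3×3 cross kernel can be sketched in pure NumPy:

```python
import numpy as np

def dilate(mask: np.ndarray, iterations: int) -> np.ndarray:
    """Grow a boolean mask by `iterations` steps of 3x3 cross dilation (illustrative)."""
    out = mask.astype(bool).copy()
    for _ in range(iterations):
        grown = out.copy()
        grown[1:, :] |= out[:-1, :]   # pull mask down
        grown[:-1, :] |= out[1:, :]   # pull mask up
        grown[:, 1:] |= out[:, :-1]   # pull mask right
        grown[:, :-1] |= out[:, 1:]   # pull mask left
        out = grown
    return out
```

A single masked pixel grows into a diamond of radius `iterations`, so higher values remove a wider border around the target object.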

Example Usage

Local Testing with Cog

# Basic usage with sample data
cog predict \
    -i video=@sample_data/racoon_video.mp4 \
    -i mask=@sample_data/racoon_mask.mp4

# With custom parameters (values match the Python API example below)
cog predict \
    -i video=@sample_data/racoon_video.mp4 \
    -i mask=@sample_data/racoon_mask.mp4 \
    -i num_frames=25 \
    -i num_inference_steps=12

Python API (Using Deployed Model)

import replicate

output = replicate.run(
    "ayushunleashed/minimax-remover",
    input={
        "video": open("your_video.mp4", "rb"),
        "mask": open("your_mask.mp4", "rb"),
        "num_frames": 25,
        "height": 480,
        "width": 832,
        "num_inference_steps": 12,
        "iterations": 6
    }
)

print(f"Output video: {output}")

Model Details

  • Architecture: Simplified DiT (Diffusion Transformer) with minimax optimization
  • Inference Steps: 6-12 steps (much faster than traditional diffusion models)
  • Memory Requirements: ~8GB GPU memory for typical usage
  • Model Weights: Downloaded automatically from Hugging Face during first setup

Performance Tips

  1. Frame Count: Fewer frames = faster processing
  2. Resolution: Lower resolution = faster processing
  3. Inference Steps: 6-12 steps provide good quality/speed balance
  4. Mask Quality: Clean masks with clear boundaries work best

Troubleshooting

Common Issues

  1. Out of Memory: Reduce num_frames, height, or width
  2. Slow Performance: Reduce num_inference_steps to 6-8
  3. Poor Quality: Increase num_inference_steps or improve mask quality

License

This Cog wrapper follows the same license as the original MiniMax-Remover project. See the original repository for license details.

Citation

If you use this model, please cite the original MiniMax-Remover paper:

@article{minimax2024,
  title={MiniMax-Remover: Taming Bad Noise Helps Video Object Removal},
  author={Bojia Zi and Weixuan Peng and Xianbiao Qi and Jianan Wang and Shihao Zhao and Rong Xiao and Kam-Fai Wong},
  year={2024}
}
