Identity-Preserving Animated Avatar System with Interactive Rendering.
Animate Me is a modular AI system that enables you to:
- Generate cartoon characters from text prompts or input images.
- Preserve user identity traits after stylization.
- Create motion (GIFs) from static images.
- Render characters inside an interactive, game-like environment.
The system is designed with production thinking: clear modularization, easy model replacement, and straightforward scalability.
- Multi-stage AI pipeline: generation → segmentation → pose → animation → rendering.
- Clear separation between model layer, orchestration layer, and interface layer.
- FastAPI + WebSocket integration for backend interaction.
- Interactive runtime support with Pygame.
The goal is to build a digital avatar that can:
- Preserve identity characteristics.
- Generate natural motion from static images.
- Support real-time interaction.
Key challenges:
- Identity Preservation: stylization often removes distinctive facial features.
- Static-to-Dynamic Conversion: generating smooth motion from a single input frame.
- Temporal Consistency: minimizing frame-to-frame flicker.
- Interactive Rendering: integrating animation into a runtime environment.
Animate Me addresses the entire pipeline rather than isolated sub-problems.
```
Input (Text / Image)
        ↓
Text-to-Image / Upload Handler
        ↓
Style Transfer Module
        ↓
Object Decomposition & Face Segmentation
        ↓
Pose Estimation
        ↓
Motion / Animation Generator
        ↓
GIF Export | Interactive Renderer (Pygame)
```
**Model Layer**
- Text-to-Image
- Style Transfer
- Segmentation
- Pose Estimation
- Motion Synthesis

**Pipeline Layer**
- Orchestration
- Action scheduling
- Frame generation
- Character state management

**Interface Layer**
- Streamlit demo
- FastAPI backend
- WebSocket server
- Pygame runtime
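The character state management behind the Pygame runtime can be sketched as pure frame-scheduling logic. The class and field names below are illustrative assumptions; the real renderer lives in `src/render/`:

```python
# Hypothetical frame scheduler for the interactive runtime.
# Maps elapsed time to a looping frame index for the current action.
from dataclasses import dataclass

@dataclass
class CharacterState:
    frames_per_action: dict  # action name -> number of frames
    action: str = "idle"
    fps: int = 12            # assumed playback rate

    def frame_at(self, elapsed_s: float) -> int:
        """Return the looping frame index after `elapsed_s` seconds."""
        n = self.frames_per_action[self.action]
        return int(elapsed_s * self.fps) % n

# Inside the Pygame loop you would blit the selected frame each tick:
#   screen.blit(frames[state.action][state.frame_at(t)], (x, y))
```

Keeping the scheduling logic separate from Pygame makes it testable without a display and easy to reuse in the GIF exporter.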
- Core: Python 3.8, PyTorch, OpenCV.
- AI Components: Text-to-Image, Style Transfer, OpenMMLab/MMPose, Segmentation.
- Backend/Runtime: FastAPI, WebSocket, Streamlit, Pygame.
- Environment: Conda (recommended), CUDA (optional).
- Receive input from text or image.
- Generate or normalize the character image.
- Apply reference style.
- Decompose foreground/background and segment the face.
- Estimate poses for each target action.
- Generate animation frame sequences.
- Export GIFs or render in an interactive environment.
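The steps above can be sketched as a simple stage chain. The stage functions here are placeholders standing in for the real modules (`text_to_image`, `image_style_transfer`, `face_segmenter`, `pose_estimator`, `animator`); only the control flow is meant to be representative:

```python
# Hedged sketch of pipeline orchestration: each stage consumes the
# previous stage's output. Stages below are placeholder lambdas.
from typing import Callable, List

def run_pipeline(inp, stages: List[Callable]):
    """Feed the output of each stage into the next."""
    out = inp
    for stage in stages:
        out = stage(out)
    return out

stages = [
    lambda x: x + ["generate"],  # text-to-image / upload handler
    lambda x: x + ["stylize"],   # style transfer
    lambda x: x + ["segment"],   # decomposition + face segmentation
    lambda x: x + ["pose"],      # pose estimation per action
    lambda x: x + ["animate"],   # frame sequence generation
]

result = run_pipeline(["input"], stages)
```

This linear-chain design is what makes model replacement straightforward: any stage can be swapped as long as it accepts the previous stage's output type.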
```
animating_image/
├── src/
│   ├── app/                  # FastAPI backend
│   ├── demo/                 # Streamlit demo
│   ├── pipeline/             # Pipeline orchestration
│   ├── animator/             # Motion synthesis engine
│   ├── pose_estimator/       # Pose estimation
│   ├── image_style_transfer/ # Stylization
│   ├── concept_decomposer/   # Object decomposition
│   ├── face_segmenter/       # Face segmentation
│   ├── img_to_vector/        # Vectorization
│   ├── render/               # Runtime/game rendering
│   ├── text_to_image/        # Text-to-image
│   ├── text_to_speech/       # TTS
│   ├── configs/              # Character config
│   └── __main__.py
├── external/                 # Third-party models
├── assets/
├── notebook/
├── requirements.txt
└── environment.yaml
```
```bash
git clone <your-repo-url>
cd animating_image
```

Conda (recommended):

```bash
conda env create -f environment.yaml
conda activate openmmlab
```

pip/venv:

```bash
python3 -m venv .venv
source .venv/bin/activate
pip install -U pip
pip install -r requirements.txt
```

Create a `.env` file:

```
GOOGLE_API_KEY=your_google_api_key
POSE_MODEL_CFG_PATH=/absolute/path/to/mmpose_config.py
POSE_MODEL_CKPT_PATH=/absolute/path/to/mmpose_checkpoint.pth

# Optional for API/runtime
STORAGE_ROOT=/absolute/path/to/storage
SERVER_IP=0.0.0.0
SERVER_PORT=8765
TARGET_OBJECT=/absolute/path/to/target_object.json
THIRD_PARTY_WEBSOCKET_URL=ws://host:port
```

Run the Streamlit demos:

```bash
streamlit run src/demo/app.py
streamlit run src/demo/create_animation_demo.py
```

Run the FastAPI backend:

```bash
uvicorn src.app.main:app --reload
```

Run the full pipeline:

```bash
python -m src
```

Run tests:

```bash
python -m src.test_pipeline
python -m src.test_animation
python -m src.test_pygame
python -m src.test_tts
```

- Split the pipeline into independent modules.
- Containerize services when needed.
- Support GPU acceleration.
- Manage checkpoints/secrets via environment variables.
- Design APIs with stateless principles.
- Expand toward a microservice architecture when needed.
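Reading deployment settings from the environment, as the guidelines above suggest, could look like the sketch below. The variable names follow the `.env` keys shown earlier; the defaults here are illustrative assumptions:

```python
# Sketch of environment-driven, stateless configuration.
# Defaults are placeholders, not the project's actual values.
import os

def load_settings() -> dict:
    return {
        "server_ip": os.environ.get("SERVER_IP", "0.0.0.0"),
        "server_port": int(os.environ.get("SERVER_PORT", "8765")),
        "storage_root": os.environ.get("STORAGE_ROOT", "./storage"),
    }
```

Because all configuration arrives via the environment, each service instance can be started, replaced, or scaled without shared local state.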
- Reduce actions/frames to improve speed.
- Cache intermediate outputs.
- Batch pose generation when appropriate.
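Caching intermediate outputs, as suggested above, could be sketched as a small disk cache keyed on the stage name and its input. Hashing a JSON payload is an assumption for illustration; the project may key its caches differently:

```python
# Hypothetical disk cache for expensive pipeline stages: the first call
# runs the stage and saves its result; later identical calls reuse it.
import hashlib
import json
import pathlib

CACHE_DIR = pathlib.Path(".cache")

def cached(stage_name: str, fn, payload: dict):
    """Run `fn(payload)` once per (stage, payload); reuse the saved result."""
    CACHE_DIR.mkdir(exist_ok=True)
    key = hashlib.sha256(
        (stage_name + json.dumps(payload, sort_keys=True)).encode()
    ).hexdigest()[:16]
    path = CACHE_DIR / f"{stage_name}-{key}.json"
    if path.exists():
        return json.loads(path.read_text())
    result = fn(payload)
    path.write_text(json.dumps(result))
    return result
```

This is most useful for stages whose inputs rarely change between runs, such as stylization and pose estimation for a fixed set of actions.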
- Animated Drawings (Facebook Research): https://github.com/facebookresearch/AnimatedDrawings.git
- Frontend (AnimGen Studio): https://github.com/tamchamchi/animgen-studio.git
- Missing API key or model path: check your `.env` file.
- `src` import errors: run commands from the project root.
- Checkpoint not found: use absolute paths and verify files exist.
- Slow CPU execution: reduce actions/frames or use CUDA GPU.

