A comprehensive toolkit that generates character poses from reference images using AI-powered pose analysis and automated rigging. Available as both a Windows desktop application and Python scripts for Blender integration.
- Overview
- Features
- Quick Start
- Requirements
- Downloads
- Operating Modes
- Cost Estimation
- Privacy & Security
- Building from Source
- Blender Workflow
- Troubleshooting
- Contributing
- License & Terms of Use
This project enables you to:
- Analyze a reference image of a person to extract detailed pose descriptions
- Convert those descriptions into precise bone rotations for Blender armatures
- Automatically apply poses to your rigged characters
- Retarget poses to different rig types (including Cyberpunk 2077)
The workflow combines computer vision analysis with biomechanical understanding to create accurate 3D poses from 2D references.
- Step-by-step wizard workflow: Intuitive interface guiding you from image selection to final pose generation
- Operating modes: Choose between Budget, Balanced, or Quality modes to optimize cost vs. quality
- OpenAI Integration: Uses GPT-4 Vision for image analysis with intelligent model selection and fallbacks
- Real-time cost estimation: See estimated API costs before making calls
- Real-time validation: Validates your API key before proceeding
- Image preview: See your reference image while working
- Editable descriptions: Review and refine AI-generated pose descriptions
- JSON export: Copy to clipboard or save bone rotations as JSON files
- Single executable: No installation required - just download and run
- Download the latest release for your platform from the link above
- Extract the zip file and run the executable
- Provide your OpenAI API key (you can create one on the OpenAI Platform)
- Select operating mode (Budget/Balanced/Quality)
- Select a reference image and describe the rough pose
- Review the AI-generated extended pose description
- Generate bone rotations for your MPFB GameEngine rig
- Copy or save the JSON output
- Windows 10/11 (64-bit)
- OpenAI API key (required - you provide your own)
- Internet connection for API calls
Available for:
- Windows 10/11 (64-bit Intel/AMD)
- Windows 11 (ARM64 - for Surface and other ARM devices)
The desktop application offers three operating modes that balance quality vs. cost:
- Budget - Fast & cheapest; ok for simple photos
  - Preferred Model: gpt-4.1-nano (fallback: gpt-4.1-mini)
  - Best for simple, straightforward poses
  - Expected Output: ~300 tokens per step
- Balanced (Default) - Good quality for most cases
  - Preferred Model: gpt-4.1-mini (fallbacks: o4-mini, gpt-4.1)
  - Best for most everyday use cases
  - Expected Output: ~600 tokens per step
- Quality - Best quality at a sensible price
  - Preferred Model: gpt-4.1 (fallback: o4-mini)
  - Best for complex poses or maximum accuracy
  - Expected Output: ~800 tokens per step
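The preferred/fallback ordering for each mode can be sketched as a simple lookup. This is only an illustration in Python, not the desktop app's actual code: the model names come from the mode list, while `MODE_MODELS`, `pick_model`, and the "first available model wins" rule are assumptions made for this example.

```python
# Model order per mode, transcribed from the mode list.
# The selection rule below is an assumption, not the app's confirmed logic.
MODE_MODELS = {
    "Budget": ["gpt-4.1-nano", "gpt-4.1-mini"],
    "Balanced": ["gpt-4.1-mini", "o4-mini", "gpt-4.1"],
    "Quality": ["gpt-4.1", "o4-mini"],
}


def pick_model(mode, available_models):
    """Return the first model listed for `mode` that is available, else None."""
    for model in MODE_MODELS[mode]:
        if model in available_models:
            return model
    return None


# Example: the account cannot use gpt-4.1-mini, so Balanced falls back.
print(pick_model("Balanced", {"o4-mini", "gpt-4.1"}))
```

Keeping the order in one table means a fallback chain can be changed without touching the selection logic.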
The app shows real-time cost estimates before API calls, calculated using:
- SharpToken (tiktoken for .NET) for token counting
- OpenAI's tile-based formula for image tokens
Always verify pricing at OpenAI's Pricing Page
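As a rough illustration of the tile-based image estimate, the Python sketch below follows the publicly documented high-detail scheme as commonly described: fit the image inside 2048 x 2048, shrink the shortest side to 768 px, then count 512 px tiles. The constants (85 base tokens, 170 per tile), the "never upscale" rule, and the function names are assumptions for this example. The app's own implementation may differ, and some models may account for images differently. The price is passed in by the caller rather than hard-coded.

```python
import math


def estimate_image_tokens(width, height, base_tokens=85, tokens_per_tile=170):
    """Estimate image tokens with a tile-based formula (assumed constants)."""
    # Step 1: shrink to fit inside a 2048 x 2048 square (never upscale).
    scale = min(1.0, 2048 / max(width, height))
    width, height = round(width * scale), round(height * scale)
    # Step 2: shrink so the shortest side is at most 768 px (never upscale).
    scale = min(1.0, 768 / min(width, height))
    width, height = round(width * scale), round(height * scale)
    # Step 3: count the 512 px tiles needed to cover the resized image.
    tiles = math.ceil(width / 512) * math.ceil(height / 512)
    return base_tokens + tokens_per_tile * tiles


def estimate_cost_usd(tokens, usd_per_million_tokens):
    """Convert a token count to dollars using a caller-supplied price."""
    return tokens * usd_per_million_tokens / 1_000_000


# A 1024 x 1024 image resizes to 768 x 768, i.e. 4 tiles.
print(estimate_image_tokens(1024, 1024))  # 765
```

Under these assumptions a 1024 x 1024 reference image costs 765 input tokens before any text prompt is counted.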
- ✅ Your API key is stored in memory only during the session
- ✅ No data saved to disk without your explicit action
- ✅ API calls go directly to OpenAI - no third-party servers
⚠️ API usage charges apply (you provide your own key)
cd src/DesktopApp
dotnet build ImageToPose.sln
# Create single-file executable:
dotnet publish ImageToPose.Desktop -c Release -r win-x64
The executable will be in bin/Release/net9.0/win-x64/publish/.
See the Blender Workflow Guide for applying generated poses.
- API key validation fails: Check your internet connection and verify the key on the OpenAI Platform
- Prompt files not found: Ensure the prompt txt files are in the repository root
- Build errors: Run dotnet restore
- Bones not moving: Check armature and bone naming
- Incorrect rotations: Verify rotation order and axes
- Constraint conflicts: Temporarily disable IK
- Extreme poses: Start subtle, build gradually
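For the "bones not moving" case, a quick name check before applying a pose can narrow things down. The Python sketch below assumes a hypothetical JSON layout (an object keyed by bone name, each holding a rotation triple). The real export format is not described here, so adjust the parsing to match your file. The helper name `find_unknown_bones` and the bone names in the example are also made up for illustration.

```python
import json


def find_unknown_bones(pose_json, rig_bone_names):
    """Return bone names used in the pose JSON that the rig does not have.

    Assumes (hypothetically) that the JSON is an object keyed by bone name.
    """
    pose = json.loads(pose_json)
    known = set(rig_bone_names)
    return sorted(name for name in pose if name not in known)


# Hypothetical pose data and rig bone names, for illustration only.
pose_text = '{"upperarm_l": [0, 0, 45], "thigh_r": [10, 0, 0]}'
print(find_unknown_bones(pose_text, ["upperarm_l", "spine_01"]))
```

Inside Blender, the rig's bone names could be taken from the selected armature's pose bones. Any names this check reports are likely candidates for a naming mismatch between the generated JSON and your rig.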
This project is modular and extensible. Feel free to:
- Improve AI prompts for better analysis
- Add support for additional rig types
- Enhance Blender integration
- Create presets for common poses
This project uses the BSD 3‑Clause License with a project‑specific Attribution Notice (retain the copyright line with the repo link) and practical usage terms. See LICENSE+TERMS.md for details, including disclaimers and responsibilities.