ScriptsMay 30, 2026·3 min read

Modly — Local AI 3D Model Generation from Images

Modly is a desktop application that generates 3D models from 2D images using AI, running entirely on your local GPU. It provides a simple interface for converting photos into editable 3D assets without cloud dependencies.

Agent ready

Ready-to-run agent install

This asset can be installed after the agent chooses its runtime, checks the plan, and runs the matching command.

Native · 98/100Policy: allow
Agent surface
Any MCP/CLI agent
Kind
Skill
Install
Single
Trust
Trust: Established
Entrypoint
Modly
Direct install command
npx -y tokrepo@latest install 2e1515e9-5c22-11f1-9bc6-00163e2b0d79 --target codex

Run after dry-run confirms the install plan.

Introduction

Modly bridges the gap between 2D images and 3D assets by using on-device AI models to reconstruct three-dimensional geometry from photographs. It runs entirely on your GPU, making it useful for game developers, designers, and 3D artists who need quick asset prototyping.

What Modly Does

  • Converts single 2D images into 3D mesh models
  • Runs inference locally on your GPU for privacy and speed
  • Exports models in standard 3D formats for use in other tools
  • Provides a visual preview of generated models with rotation and zoom
  • Supports batch processing of multiple images

Architecture Overview

Modly is built as an Electron desktop application with a TypeScript frontend. The 3D reconstruction pipeline uses pre-trained neural networks that estimate depth and geometry from single images. The inference backend leverages ONNX Runtime or PyTorch for GPU-accelerated model execution. Generated meshes are post-processed for topology cleanup before export.

Self-Hosting & Configuration

  • Clone the repository and build with npm
  • Requires an NVIDIA GPU with CUDA support for inference
  • Model weights are downloaded automatically on first launch
  • Configure output format and quality settings in the application preferences
  • Works on Windows, macOS, and Linux with appropriate GPU drivers

Key Features

  • Single-image 3D reconstruction without photogrammetry setup
  • Fully local processing with no data leaving your machine
  • Standard format export including OBJ, GLB, and STL
  • Interactive 3D preview with material and lighting controls
  • Desktop-native application with drag-and-drop image input

Comparison with Similar Tools

  • Meshy — cloud-based 3D generation with subscription pricing; Modly runs locally for free
  • Luma AI — requires uploading images to cloud servers; Modly keeps all data on device
  • InstantMesh — research-focused with notebook interfaces; Modly provides a polished desktop app
  • TripoSR — fast single-image 3D but CLI-only; Modly adds a visual desktop interface

FAQ

Q: What image types work best? A: Clear photos of single objects against simple backgrounds produce the best results. Complex scenes may require manual cleanup.

Q: Can I edit the generated 3D models? A: Exported models can be opened in any 3D editor like Blender. Modly focuses on generation rather than editing.

Q: What GPU memory is required? A: A GPU with at least 6 GB VRAM is recommended. Higher VRAM allows processing larger images.

Q: Is this suitable for production-quality 3D assets? A: Generated models work well for prototyping and reference. Production assets typically need manual refinement in a 3D editor.

Sources

Discussion

Sign in to join the discussion.
No comments yet. Be the first to share your thoughts.

Related Assets