Lift a 2D Image to 3D Model (Browser, Six Models)

By Arron R.17 min read
Lift a 2D image to 3D model in 2026: flat illustration in, textured mesh out in 30 seconds. Sorceress 3D Studio runs six models (Hunyuan 3D 3.1, Meshy 6, TRELLI

Lifting a 2D image to 3D model used to be the slowest step in any indie game pipeline. You opened Blender (free but punishing) or Maya (paid and even more punishing) and you sculpted a humanoid from a flat illustration for a week. The 2026 version of the same work is a single flat image — a concept art piece, a generated AI character render, a pixel-art sprite, a pencil sketch — fed into a generative 3D model and out comes a textured GLB in under a minute. This guide walks the browser pipeline end-to-end specifically for flat inputs (the harder case), every step verified May 25, 2026 against the live 3D Studio source.

Lift a 2D image to 3D model browser pipeline: flat illustration to model picker to lift to GLB export with six 2026 models inside Sorceress 3D Studio
The four-stage browser pipeline to lift a 2D image to 3D model. Flat input, model picker, lift, GLB export — all inside 3D Studio in under 90 seconds.

What “lift a 2D image to 3D model” actually means in 2026

The phrase covers a single workflow with a sharp boundary. The input is exactly one flat 2D illustration — a pencil sketch, a concept-art piece, a generated AI character render, a pixel-art sprite, a logo, a comic-book panel. The output is a real 3D mesh: vertices, faces, UV coordinates, plus a baked color texture, often a full PBR set (base color, metallic, roughness, normal, sometimes AO and emission), exported as GLB, FBX, OBJ, USDZ, or STL. The mesh is watertight or near-watertight, loads in any major game engine without a plugin, and renders correctly under whatever lighting the engine throws at it. That is the entire promise of the 2026 lift-a-2D-image-to-3D-model workflow.

The boundary that matters is the input type. A photograph has real lighting, real specular highlights, real depth cues from focus and parallax that the generative models read as geometry hints. A flat 2D illustration has none of those — no specular cues, no focus-blur depth, no real surface tone. The 2026 models compensate by leaning harder on their learned 3D priors over the geometric cues in the image. The priors are strongest on common subjects (humanoid characters, animals, vehicles, common props in clean front-facing or three-quarter views) and weaker on rare or asymmetric assemblies. For a flat illustration, the right model pick and the right input preparation matter more than they do for a photograph — which is the entire reason this post exists separately from the more general convert image to 3D model guide.

The technology under the hood is a flow-matching or diffusion transformer trained on millions of paired image-and-mesh examples. The model learns to invent the back of the object from learned 3D priors when the input image only shows the front. The 4-billion-parameter TRELLIS 2 model from Microsoft Research is the strongest 2026 prior in the panel — its O-Voxel architecture handles open surfaces (clothing, leaves), non-manifold geometry, and sharp features that the older iso-surface methods cannot. For a 2D image to 3D model lift specifically, that prior strength is the difference between a usable mesh and a malformed blob.

The six models to lift a 2D image to 3D model (inside 3D Studio)

Sorceress 3D Studio ships six image-to-3D models in a single model picker, verified against the THREED_MODEL_ORDER array on lines 212-219 of src/lib/threed-models.ts on 2026-05-25. Each model has distinct strengths when the input is a flat 2D image rather than a photograph; the right pick depends on the input style and the downstream use case.

Six 2D image to 3D model lift models in 2026 compared: Hunyuan 3D 3.1, Meshy 6, TRELLIS 2, TRELLIS v1, Rodin 2.0, Tripo v3.1 inside Sorceress 3D Studio
The six models that ship today inside 3D Studio to lift a 2D image to 3D model. Hunyuan 3D 3.1 is the recommended default at 25 credits per generation.

Hunyuan 3D 3.1 is the Tencent generative-3D model, verified active on Replicate at replicate.com/tencent/hunyuan-3d-3.1 on 2026-05-25. It is the default recommended pick inside 3D Studio — the only model in the RECOMMENDED_MODELS set on line 221 of src/lib/threed-models.ts. The cost is 25 credits per generation, verified against line 202. It supports both image-to-3D and text-to-3D input modes. PBR materials are enabled by default. The face_count parameter ranges from 40,000 to 1.5 million, with the default at the maximum for the best texture quality (verified against line 207). For a clean concept-art illustration with rich color and clear silhouette, Hunyuan 3D 3.1 produces a usable lift in 30 to 60 seconds and is the right pick for the first run on any new flat input. Replicate’s 3d-models collection page lists it as the best all-around 3D generation model in May 2026 — that matches what shows up in 3D Studio.

Meshy 6 — the texture-quality pick (50 cr base, +25 texture, +13 remesh)

Meshy 6 was released on January 18, 2026 — verified against meshy.ai/blog/meshy-6-launch on 2026-05-25. The launch announcement names the headline improvements: cleaner geometry for characters and organic models, sharper edges and clearer silhouettes for mechanical models, a dedicated Low Poly Mode for game developers, and multi-color 3D printing with 3MF export. The Sorceress 3D Studio cost is 50 credits base plus 25 for textures plus 13 for remesh — verified against the getCredits function on lines 48-53 of src/lib/threed-models.ts. Meshy 6 is the only model in the panel that exposes the 4K hd_texture base color (verified against docs.meshy.ai on 2026-05-25 — only meshy-6 and the latest alias support it). For a 2D image to 3D model lift from a high-color concept-art illustration with fine detail, Meshy 6 with hd_texture on is the sharpest texture path in the panel.

TRELLIS 2 — the complex-topology pick (40 credits at 1024p default)

TRELLIS 2 is the Microsoft Research generative-3D model with 4 billion parameters — verified against github.com/microsoft/TRELLIS.2, the Hugging Face model card, and the arxiv 2512.14692 paper (published December 16, 2025) on 2026-05-25. The model uses a novel field-free sparse voxel structure called O-Voxel — distinct from the SDF and Flexicubes iso-surface methods the older generation relied on. The O-Voxel structure handles three input categories that the older models struggle with: open surfaces (clothing, leaves, hair, capes), non-manifold geometry, and internal enclosed structures, all without lossy conversion. Generation speed on an H100 GPU runs roughly 3 seconds at 512 cubed resolution, 17 seconds at 1024 cubed (the default in Sorceress 3D Studio), and 60 seconds at 1536 cubed. The Sorceress cost scales with resolution: 35 credits at 512p, 40 credits at 1024p, 45 credits at 1536p — verified against the getCredits function on lines 154-162 of src/lib/threed-models.ts. PBR materials including transparency and translucency are supported natively. For a 2D image to 3D model lift from a sparse line-art sketch or a stylized illustration with complex topology, TRELLIS 2 has the strongest prior in the panel.

TRELLIS v1 — the cheap iteration pick (8 credits)

The original TRELLIS model from Microsoft Research, routed through Replicate via firtoz/trellis. The Sorceress cost is 8 credits per generation — the cheapest path in the panel, verified against line 111 of src/lib/threed-models.ts. Image-to-3D only (no text-to-3D). The parameter surface exposes structure sampling steps, latent sampling steps, structure guidance, latent guidance, texture size (512 to 2048), and mesh simplification (0.90 to 0.98). TRELLIS v1 is the right pick when iterating fast on prompts or refining the input illustration — 12 generations on the starter allowance vs 4 generations on Hunyuan 3D 3.1. For a 2D image to 3D model workflow where you are still tuning the prompt for the source illustration, run TRELLIS v1 for the iteration loop, then return to Hunyuan 3D 3.1 or Meshy 6 for the final hero pass.

Rodin 2.0 — the quad-mesh pick (50 credits)

Rodin 2.0 (Hyper3D Gen-2) is routed through Replicate via hyper3d/rodin. The Sorceress cost is 50 credits — verified against line 91 of src/lib/threed-models.ts. The model wins on two specific features that none of the other five expose. First, the Mesh Mode toggle lets the user pick Quad (clean quadrilateral faces, ideal for subdivision and animation rigging) vs Raw (triangle mesh, standard game-engine format). Quad mode at High density produces 50K faces, Medium 18K, Low 8K, Extra-Low 4K. Second, the geometry_file_format parameter exposes five output formats from the single job: GLB (default), FBX, OBJ, USDZ (Apple AR), STL (3D printing) — no other model in the panel exports USDZ or STL directly without a conversion step. For a 2D image to 3D model lift that needs to feed straight into a subdivision-surface workflow or an animation rig, Rodin 2.0 in Quad mode is the cleanest path.

Tripo v3.1 — the HD-texture pick (30 cr no texture, 40 cr with HD)

Tripo v3.1 was released on February 11, 2026 — verified against runware.ai/docs/models/tripo-v3-1 on 2026-05-25. The Runware model ID is tripo:v3.1@0. Pricing on Runware starts at $0.3 per generation for text-to-3D and $0.4 for image-to-3D. The Sorceress 3D Studio cost is 30 credits image-to-3D without texture, 40 credits with HD texture, plus 5 credits for the optional Quad Mesh surcharge — verified against the getCredits function on lines 190-195 of src/lib/threed-models.ts. The model_version on the Tripo v2/openapi/task endpoint is v3.1-20260211, matching the release date. The HD texture path is the headline differentiator vs the standard texture — Tripo markets the v3.1 release (as Tripo H3.1) as a high-density-geometry, close-up-quality, production-ready upgrade. For a 2D image to 3D model lift where the texture quality is the bottleneck, Tripo v3.1 with HD texture is the right pick.

The browser pipeline to lift a 2D image to 3D model (no Blender, no install)

The entire lift-a-2D-image-to-3D-model pipeline runs in a single browser tab. No local install, no GPU at home, no upstream account at Meshy or Tripo or Hyper3D. The Sorceress 3D Studio panel handles all six upstream providers through one unified credit budget on the Sorceress account. Every signed-in user gets 100 starter credits. At Hunyuan 3D 3.1 pricing (25 credits per generation), the starter allowance covers exactly 4 lifts. At TRELLIS v1 pricing (8 credits), the starter allowance covers 12 lifts. The honest budget for the first day of work is somewhere in between: one Hunyuan 3D 3.1 pass for the headline-quality mesh, then several TRELLIS v1 iterations to refine the input illustration before re-running the higher-cost models.

The architecture is intentionally narrow. The 3D Studio panel does one thing: it accepts an image, runs it through the chosen model, and returns a downloadable mesh. Texturing, rigging, retargeting, and animation are downstream steps in adjacent Sorceress tools — Auto-Rigging for the humanoid skeleton, 3D Studio Animate for text-to-motion clips, Material Forge for PBR refinement, 3D to 2D for rendering the lifted mesh back out as a sprite sheet. Each tool composes with the 2D image to 3D model output without a manual file-format conversion step. The end-to-end pipeline from illustration to rigged, animated, exported character is covered separately in the full image-to-3D pipeline guide; this article focuses on the lift step specifically.

The five-step workflow to lift a 2D image to 3D model from a flat illustration

Five steps in a single browser tab. Every step is interactive, every step is undoable, every step except the lift itself is free.

Step 1 — Start with a clean illustration (or generate one)

Open 3D Studio. The left rail shows the input dropzone. Drop a JPG or PNG. Three things matter for a clean 2D image to 3D model lift from a flat input. First, isolate the subject — a transparent or single-color background works far better than a busy scene. The lift models lock onto whatever the strongest contiguous foreground region is; a cluttered background bleeds into the geometry. Second, frame the subject head-on or slightly off-axis — a three-quarter view (35-45 degrees off front) gives the model the most information about both the front and a hint of the side. Pure profile views work worse because they hide the front entirely. Third, choose a resolution between 1024×1024 and 2048×2048 — the models downsample anything larger and upscale anything smaller, so the sweet spot is somewhere in the middle.

If you do not have an input illustration, generate one first with AI Image Gen using a high-quality model (Nano Banana Pro, GPT Image 2, or Flux 2 Pro). A prompt like a chunky green orc warrior in T-pose, full body, three-quarter view, plain white background, concept art style, cel-shaded coloring produces an illustration that the 2D image to 3D model lift handles cleanly. Push the generated image through BG Remover for a perfect alpha mask before sending it to 3D Studio. For a line-art sketch with no shading, run a colorize pass through AI Image Gen first (prompt: add full color shading, soft cel-shaded lighting, plain white background) — the lift models read shaded illustrations far better than raw line art.

Step 2 — Pick the right model for the input

The model picker is the second panel. Six radio buttons: Hunyuan 3D 3.1 (recommended default, 25 credits), Meshy 6 (50 credits), TRELLIS 2 (40 credits at 1024p), TRELLIS (8 credits), Rodin 2.0 (50 credits), Tripo v3.1 (30 credits no texture, 40 credits with HD texture). For a first lift on any new 2D illustration, leave Hunyuan 3D 3.1 selected — it is the recommended default for a reason. Switch to Meshy 6 only if the input has fine surface detail that the default pass missed. Switch to TRELLIS 2 if the subject has complex topology (clothing folds, leaves, hair, capes). Switch to Tripo v3.1 if the texture quality is the bottleneck. Switch to TRELLIS v1 if you are still iterating on the source illustration and want to keep credit burn low. Switch to Rodin 2.0 if you need a Quad mesh for subdivision or rigging.

Step 3 — Set the parameters (or accept the defaults)

Each model exposes its own parameter set. For Hunyuan 3D 3.1, the meaningful knobs are Generate Type (Normal or Geometry) and PBR Materials (default on). Leave both at defaults for the first lift. For Meshy 6, the meaningful knobs are Pose Mode (force A-pose or T-pose for rigging-ready characters), Topology (Triangle for game engines, Quad for subdivision), should_remesh (defaults to false for Meshy 6 since the January 22, 2026 changelog update), and image_enhancement (defaults true — leave on for flat illustrations because the AI upscale step pulls more signal out of low-detail input). For TRELLIS 2, the meaningful knob is Resolution (512, 1024, 1536) — bump to 1536 for hero assets, drop to 512 for fast iteration. For Tripo v3.1, the meaningful knob is Texture (no, standard, HD) — HD costs the same as standard upstream so it is almost always the right pick for a 2D image to 3D model lift.

Step 4 — Lift (30-90 seconds, 8-50 credits)

Click Generate. The lift job queues, the upstream provider runs, and the mesh appears in the gallery on the right with a live 3D preview. Total wall-clock time depends on the model: TRELLIS 2 returns in 3-60 seconds depending on the resolution, Hunyuan 3D 3.1 in 30-60 seconds, Meshy 6 in 30-90 seconds depending on whether hd_texture is on, Tripo v3.1 in 20-45 seconds. Inspect the preview from multiple angles — rotate the mesh with the orbit controls, zoom in on the back of the head or the underside of the object to check that the priors held up. If the back is malformed (a common outcome when the input was a pure front view of an asymmetric subject), the right next step is either to generate a back-view illustration in AI Image Gen and re-run with Meshy 6 multi-image-to-3D or Tripo Multi View (introduced March 11, 2026 per the Tripo features page), or to switch to TRELLIS 2 which has the strongest single-image prior in the panel.

Step 5 — Export the GLB

Click Export. The default format is GLB — the binary container for glTF 2.0 from the Khronos Group spec. The other format options on the Rodin 2.0 and Meshy 6 lift jobs are FBX (Unreal-native), OBJ (universal but text-only, no animations or materials), USDZ (Apple AR), STL (3D printing, no textures), and 3MF (multi-color 3D printing, Meshy 6 only). For 99% of game-engine workflows, GLB is the right pick. Download the file. The 2D image to 3D model output is now a single binary blob on your local disk, ready for any engine.

From the lifted mesh into Three.js r184 (and the other engines)

Three.js r184 (released April 16, 2026, verified against github.com/mrdoob/three.js/releases/tag/r184 on 2026-05-25) loads the GLB output of the 2D image to 3D model lift in one line, using the GLTFLoader from the three.js examples:

// Load the lifted GLB from the 2D image to 3D model export
import { GLTFLoader } from 'three/addons/loaders/GLTFLoader.js';

const loader = new GLTFLoader();
loader.load('models/lifted-character.glb', (gltf) => {
  scene.add(gltf.scene);
});

That is the entire integration. The GLB carries the geometry, the UV layout, the PBR materials, and any baked animations in a single file. Three.js parses the file, builds the scene graph, attaches the materials to the meshes, and the next render frame draws the lifted character correctly under whatever lighting the scene defines. For a character that needs to play a walk cycle, add an AnimationMixer from the gltf.animations array and call mixer.update(delta) inside the render loop — the same pattern documented in the Three.js r184 examples.

Lifted 2D image to 3D model GLB loaded into a Three.js r184 scene with GLTFLoader, PBR materials, and the lifted character mesh on a grid floor
The GLB export from the 2D image to 3D model lift loads in one line with the Three.js r184 GLTFLoader. The PBR materials, the UV layout, and any baked animations all ship inside the single binary file.

For Unity 6, drag the lifted GLB into the Assets panel and Unity imports it as a prefab with materials and meshes already wired (the com.unity.cloud.gltfast package — no FBX conversion step). For Unreal 5.6, the glTF Importer is built in since 5.0; drop the GLB into the Content Browser and Unreal generates the static mesh plus material instances. For Godot 4, drop the GLB into the FileSystem panel and Godot imports it as a PackedScene with embedded materials. All four engines treat the GLB output of the 2D image to 3D model pipeline as a first-class asset.

Five mistakes that ruin a lift (and the fix for each)

  1. Feeding raw line art with no shading. A pure line-art sketch with no fill colors and no shading gives the lift model almost nothing to read for surface tone or implied volume. The fix is a quick AI Image Gen pass that colorizes the line art into a shaded illustration before the lift. Prompt: add full color shading, soft cel-shaded lighting, plain white background, concept art style. The colorized result lifts cleanly. The raw line-art original almost never does.
  2. Leaving a busy background in the source illustration. The 2D image to 3D model lift models lock onto whatever the strongest contiguous foreground region is. If the character stands in front of a forest, the closer trees become part of the lifted mesh. Push the input through BG Remover first — a clean alpha mask on a transparent background produces a clean mesh. This is the single highest-impact prep step for any flat input.
  3. Using a profile view as the only input. Pure side-views hide the front of the subject entirely. The single-image priors invent both the front and the back of the object, and the result is often a flat or doubled mesh. Use a front-facing or three-quarter view for the 2D image to 3D model lift. If you must work from a profile illustration, generate a front view first with AI Image Gen using a prompt that references the profile (same character, three-quarter view, plain white background), then use multi-image-to-3D (Meshy 6 or Tripo Multi View) with both views.
  4. Picking the wrong model for the illustration style. Hunyuan 3D 3.1 is the strongest default, but it is not the strongest for every input. Hard-surface mechanical illustrations (vehicles, weapons, tools) lift cleaner on Meshy 6 with the sharper-edges path. Open-topology subjects (characters with flowing capes, plants with leaves, hair) lift cleaner on TRELLIS 2 with the O-Voxel field-free structure. Stylized characters that need a Quad mesh for rigging lift cleaner on Rodin 2.0 in Quad mode. Match the model to the illustration before burning the credits.
  5. Burning credits on iteration with the high-cost models. Lift results vary run-to-run. The first pass on Hunyuan 3D 3.1 might miss a detail; rerunning it costs another 25 credits. If you are iterating on the source illustration or refining the prompt, switch to TRELLIS v1 (8 credits) for the iteration loop, then return to Hunyuan 3D 3.1 or Meshy 6 for the final hero pass. The 12-vs-4 lift-count math on the starter allowance pays for itself within the first afternoon.

The verdict — when to lift a 2D image to 3D model vs photogrammetry vs hand-model

The 2D image to 3D model lift is the right pick when three conditions hold. First, you have exactly one flat illustration (or two to four for the multi-image path) of the subject — not 30+ photos for photogrammetry, not a 3D-scanner point cloud. Second, the subject is a class the 2026 priors handle well: humanoid characters, animals, vehicles, common props in clean front-facing or three-quarter views. Third, you want a polygon mesh out (not a volumetric NeRF, not a Gaussian splat), exported as GLB or FBX, ready to drop into Unity, Unreal, Godot, or Three.js.

It is not the right pick when (1) you need millimetre-accurate geometry for engineering or 3D printing of a real-world part (photogrammetry or a 3D scanner wins), (2) the subject is asymmetric and the back-view matters and you only have a front view (generate the back-view first, then use multi-image-to-3D), or (3) you need a fully retopologized, animation-ready quad mesh straight from the model (use Rodin 2.0 in Quad mode for the closest approximation, but plan for a Blender retopology pass on hero assets).

For the core indie game-development use case — taking a concept art illustration, a sketched character design, or a generated AI character render and turning it into a textured mesh that loads in an engine — the Sorceress 3D Studio lift pipeline is the cleanest browser-based path in May 2026. Six models, one credit budget, one click, one GLB. Compare that to the time investment of hand-modeling the same character in Blender (4-20 hours depending on skill), running a photogrammetry rig with a turntable (30-60 minutes of photo capture plus an hour of reconstruction), or buying a Maya seat ($2,010 a year per our Maya seat breakdown), and the 2D image to 3D model lift path is the right on-ramp for any project that needs characters in the engine this week, not next month.

For deeper background, see the general convert image to 3D model guide (covers the photo-input case in detail), the full image-to-3D pipeline guide, the how-to-turn-an-image-into-a-3d-model walkthrough, the Meshy deep-dive, the Tripo deep-dive, the six-model comparison post, the free 2D-to-3D conversion guide, and the Makerworld GLB-path guide. For a tour of the rest of the Sorceress toolset, see the tools guide.

Frequently Asked Questions

What does lift a 2D image to 3D model actually mean in 2026?

It means feeding a single flat 2D illustration — a pencil sketch, a concept-art piece, a generated AI character render, a pixel-art sprite, a logo, a comic-book panel — into a generative 3D model and getting back a textured polygon mesh in one API call. The mesh has real geometry (vertices, faces, UVs), a baked color texture often paired with a full PBR set (base color, metallic, roughness, normal), and exports as GLB, FBX, OBJ, USDZ, or STL. The distinction from convert-image-to-3D from a photo is the input type — a flat illustration has no depth cues, no real shading, no surface specularity for the model to read. The 2026 models compensate by leaning harder on their learned 3D priors over the geometric cues in the image. The five workhorse 2026 models inside Sorceress 3D Studio that handle a 2D image to 3D model lift well are Hunyuan 3D 3.1 (Tencent), Meshy 6 (released January 18, 2026), TRELLIS 2 (Microsoft Research 4B parameters, paper arxiv 2512.14692 published December 16, 2025), TRELLIS v1, and Tripo v3.1 (released February 11, 2026). Rodin 2.0 (Hyper3D Gen-2) rounds out the six and wins specifically when you need a Quad mesh from the lifted illustration.

Can I lift a pencil sketch (line art only) to 3D?

Yes, but the result quality depends on how much volume the sketch already implies. A clean, fully-shaded concept-art illustration lifts cleanly because the 2026 models can read the implied lighting and silhouette. A bare line-art sketch with no shading is harder — the model has to invent both the geometry and the surface tone from a near-blank canvas. The honest workflow for line-art input has three steps. First, push the line-art sketch through AI Image Gen with a prompt like add full color shading, 3/4 view, plain white background, concept art style to produce a shaded reference image. Second, pipe the shaded image through BG Remover to isolate the subject on a transparent background. Third, send the cleaned image to 3D Studio. The Hunyuan 3D 3.1 default produces a usable mesh from this kind of refined sketch input in most cases. For best results on a pure line-art input directly, switch to TRELLIS 2 — the 4-billion-parameter Microsoft Research prior is the strongest in the panel at inventing volume from sparse input.

Which 3D Studio model is the best to lift a 2D image to 3D model from concept art?

Hunyuan 3D 3.1 (25 credits in Sorceress 3D Studio, verified against src/lib/threed-models.ts line 202 on 2026-05-25) is the default recommended pick because it is in the RECOMMENDED_MODELS set (verified line 221) — the strongest geometry-plus-texture balance at the lowest credit cost. For concept-art input with rich color and clear silhouette, Hunyuan 3D 3.1 produces a clean lift in 30-60 seconds. Switch to Meshy 6 (50 cr base plus 25 cr texture plus 13 cr remesh) when the concept art has fine surface detail that the default pass missed — the 4K hd_texture base color (only available on meshy-6 and latest per docs.meshy.ai) and the Pose Mode toggle (force A-pose or T-pose for rigging-ready meshes) are the wins. Switch to TRELLIS 2 (40 credits at 1024p default) when the concept art has open surfaces (flowing capes, cloth, hair), non-manifold geometry, or sharp features — the O-Voxel field-free sparse voxel structure handles all three without the iso-surface lossy conversion that the older models impose. Switch to Tripo v3.1 (30 cr no-texture, 40 cr with HD texture) when texture quality is the bottleneck — the v3.1 release was a texture-quality update. Switch to TRELLIS v1 (8 credits, image-to-3D only) for the iteration loop when burning credits on the higher-cost models would be wasteful.

How long does it take to lift a 2D image to 3D model in the browser?

The wall-clock time depends on the model and the input image resolution. TRELLIS 2 generation on H100 GPUs takes approximately 3 seconds at 512 cubed resolution, 17 seconds at 1024 cubed (the default in Sorceress 3D Studio), and 60 seconds at 1536 cubed (verified against github.com/microsoft/TRELLIS.2 on 2026-05-25). Hunyuan 3D 3.1 typically returns inside 30 to 60 seconds via the Replicate endpoint. Meshy 6 is similar — 30 to 90 seconds depending on whether you enable hd_texture (4K base color) and the remesh phase. Tripo v3.1 is the fastest of the texture-rich models at 20 to 45 seconds for the standard texture path. The Sorceress UI queues the lift job, polls the upstream provider, and shows a live preview when the mesh arrives. Including the file-upload step, total wall-clock time to lift a 2D image to 3D model is usually under 90 seconds for any of the six models.

Do I need a paid Meshy or Tripo account to lift a 2D image to 3D model?

Not through Sorceress 3D Studio. The 3D Studio panel routes every lift job through one unified credit budget on the Sorceress account — there is no Meshy account, no Tripo account, no Hyper3D account, no Hugging Face token, no Replicate billing setup required. Every signed-in user gets 100 starter credits. At Hunyuan 3D 3.1 pricing (25 credits per lift), the starter allowance covers exactly 4 generations. At TRELLIS v1 pricing (8 credits), the starter allowance covers 12 generations. The honest first-day-of-work budget for a 2D image to 3D model workflow is one Hunyuan 3D 3.1 pass for the headline-quality mesh, then several TRELLIS v1 iterations to refine the input image before re-running the higher-cost models. The fastest free path inside Sorceress is therefore Hunyuan 3D 3.1 for the first run, then TRELLIS v1 for cheap iteration.

What is the GLB format and why is it the default lift a 2D image to 3D model output?

GLB is the binary container for glTF 2.0 — the Khronos Group royalty-free, vendor-neutral 3D asset format. A GLB file is a single binary blob that holds the mesh geometry (vertices, faces, normals, UVs), the textures (base color, metallic, roughness, normal, AO, emission), the materials (PBR descriptors), the rig (skeleton plus skin weights if present), and any animations (clips plus keyframes). Everything ships in one file that the game engine loads with a single API call. Sorceress 3D Studio defaults every 2D image to 3D model lift to GLB output because (1) Unity 6, Unreal 5.6, and Godot 4 all import GLB natively without a plugin, (2) Three.js r184 (released April 16, 2026, verified against github.com/mrdoob/three.js/releases/tag/r184 on 2026-05-25) loads GLB in one line with GLTFLoader, (3) the Apple USDZ pipeline and the Microsoft 3D Viewer both accept GLB as a source, and (4) the single-file delivery makes hosting on Backblaze B2 or any static CDN trivial. The other formats — FBX, OBJ, USDZ, STL, 3MF — are all available from the same Meshy 6 or Rodin 2.0 lift job by setting target_formats; GLB is just the default.

Sources

  1. Meshy 6 launch announcement (Meshy)
  2. TRELLIS 2 GitHub (Microsoft Research)
  3. TRELLIS.2-4B model card (Hugging Face)
  4. TRELLIS.2 paper (arxiv 2512.14692)
  5. Hunyuan 3D 3.1 on Replicate (Tencent)
  6. Tripo 3D v3.1 API (Runware)
  7. glTF 2.0 specification (Khronos Group)
  8. Three.js r184 release (April 16, 2026)
Written by Arron R.·3,831 words·17 min read

Related posts