http://arxiv.org/abs/2509.14177v1 Progressing Level-of-Detail Animation of Volumetric Elastodynamics 2025-09-16T06:39:21Z

We extend the progressive dynamics model (Zhang et al., 2024) from cloth and shell simulation to volumetric finite elements, enabling an efficient level-of-detail (LOD) animation-design pipeline with predictive coarse-resolution previews facilitating rapid iterative design for a final, to-be-generated, high-resolution animation of volumetric elastodynamics. This extension to volumetric domains poses significant new challenges, including the construction of suitable mesh hierarchies and the definition of effective prolongation operators for codimension-0 progressive dynamics. To address these challenges, we propose a practical method for defining multiresolution hierarchies and, more importantly, introduce a simple yet effective topology-aware algorithm for constructing prolongation operators between overlapping (but not necessarily conforming) volumetric meshes. Our key insight is a boundary binding strategy that enables the computation of barycentric coordinates, allowing several off-the-shelf interpolants -- such as standard barycentric coordinates, Biharmonic Coordinates (Wang et al., 2015), and Phong Deformation (James, 2020) -- to serve as "plug-and-play" components for prolongation with minimal modification. We show that our progressive volumetric simulation framework achieves high-fidelity matching LOD animation across resolutions including challenging dynamics with high speeds, large deformations, and frictional contact.
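
To make the prolongation idea concrete, here is a minimal sketch of the simplest interpolant the abstract names, standard barycentric coordinates: a fine-mesh vertex is expressed in the rest-pose coordinates of one coarse tetrahedron and then carried along with the deformed tetrahedron. The function names (`barycentric`, `prolong`) are hypothetical, and the sketch assumes the enclosing coarse tetrahedron is already known. It does not implement the paper's boundary binding strategy or its topology-aware construction.

```python
def sub(p, q):
    """Component-wise difference p - q of two 3D points."""
    return tuple(pi - qi for pi, qi in zip(p, q))


def det3(a, b, c):
    """Determinant of the 3x3 matrix with columns a, b, c (scalar triple product)."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            + a[1] * (b[2] * c[0] - b[0] * c[2])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))


def barycentric(p, tet):
    """Barycentric weights of point p with respect to tetrahedron tet (4 vertices)."""
    v0, v1, v2, v3 = tet
    e1, e2, e3 = sub(v1, v0), sub(v2, v0), sub(v3, v0)
    r = sub(p, v0)
    d = det3(e1, e2, e3)
    if d == 0:
        raise ValueError("degenerate tetrahedron")
    # Cramer's rule for r = w1*e1 + w2*e2 + w3*e3.
    w1 = det3(r, e2, e3) / d
    w2 = det3(e1, r, e3) / d
    w3 = det3(e1, e2, r) / d
    return (1.0 - w1 - w2 - w3, w1, w2, w3)


def prolong(p_rest, coarse_rest, coarse_deformed):
    """Map a fine rest-pose point through the deformation of one coarse tetrahedron."""
    w = barycentric(p_rest, coarse_rest)
    return tuple(sum(wi * v[k] for wi, v in zip(w, coarse_deformed))
                 for k in range(3))


rest = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)]
moved = [(x + 1, y + 2, z + 3) for x, y, z in rest]  # rigid translation
print(barycentric((0.25, 0.25, 0.25), rest))
print(prolong((0.25, 0.25, 0.25), rest, moved))
```

Weights outside the range [0, 1] indicate a fine vertex lying outside the chosen coarse tetrahedron, which is the situation that arises for overlapping but non-conforming meshes.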

2025-09-16T06:39:21Z Jiayi Eris Zhang Doug L. James Danny M. Kaufman http://arxiv.org/abs/2505.18805v2 Auto Hair Card Extraction for Smooth Hair with Differentiable Rendering 2025-09-15T23:37:54Z

Hair cards remain a widely used representation for hair modeling in real-time applications, offering a practical trade-off between visual fidelity, memory usage, and performance. However, generating high-quality hair card models remains a challenging and labor-intensive task. This work presents an automated pipeline for converting strand-based hair models into hair card models with a limited number of cards and textures while preserving the hairstyle appearance. Our key idea is a novel differentiable representation where each strand is encoded as a projected 2D spline in the texture space, which enables efficient optimization with differentiable rendering and structured results respecting the hair geometry. Based on this representation, we develop a novel algorithm pipeline, where we first cluster hair strands into initial hair cards and project the strands into the texture space. We then conduct a two-stage optimization where our first stage optimizes the texture and geometry of each hair card separately, and after texture reduction, our second stage conducts joint optimization of all the cards for fine-tuning. Put together, our method is evaluated on a wide range of hairstyles, including straight, wavy, curly, and coily hairs. To better capture the appearance of short or coily hair, we additionally support hair cap and cross-card. Furthermore, our framework supports seamless LoD transitions via texture sharing, balancing texture memory efficiency and visual quality.

2025-05-24T17:32:19Z Zhongtian Zheng Tao Huang Haozhe Su Xueqi Ma Yuefan Shen Tongtong Wang Yin Yang Xifeng Gao Zherong Pan Kui Wu http://arxiv.org/abs/2509.12187v1 HoloGarment: 360° Novel View Synthesis of In-the-Wild Garments 2025-09-15T17:50:57Z

Novel view synthesis (NVS) of in-the-wild garments is a challenging task due to significant occlusions, complex human poses, and cloth deformations. Prior methods rely on synthetic 3D training data consisting of mostly unoccluded and static objects, leading to poor generalization on real-world clothing. In this paper, we propose HoloGarment (Hologram-Garment), a method that takes 1-3 images or a continuous video of a person wearing a garment and generates 360° novel views of the garment in a canonical pose. Our key insight is to bridge the domain gap between real and synthetic data with a novel implicit training paradigm leveraging a combination of large-scale real video data and small-scale synthetic 3D data to optimize a shared garment embedding space. During inference, the shared embedding space further enables dynamic video-to-360° NVS through the construction of a garment "atlas" representation by finetuning a garment embedding on a specific real-world video. The atlas captures garment-specific geometry and texture across all viewpoints, independent of body pose or motion. Extensive experiments show that HoloGarment achieves state-of-the-art performance on NVS of in-the-wild garments from images and videos. Notably, our method robustly handles challenging real-world artifacts -- such as wrinkling, pose variation, and occlusion -- while maintaining photorealism, view consistency, fine texture details, and accurate geometry. Visit our project page for additional results: https://johannakarras.github.io/HoloGarment

2025-09-15T17:50:57Z Johanna Karras Yingwei Li Yasamin Jafarian Ira Kemelmacher-Shlizerman http://arxiv.org/abs/2506.03004v2 PartComposer: Learning and Composing Part-Level Concepts from Single-Image Examples 2025-09-15T14:22:56Z

We present PartComposer: a framework for part-level concept learning from single-image examples that enables text-to-image diffusion models to compose novel objects from meaningful components. Existing methods either struggle with effectively learning fine-grained concepts or require a large dataset as input. We propose a dynamic data synthesis pipeline generating diverse part compositions to address one-shot data scarcity. Most importantly, we propose to maximize the mutual information between denoised latents and structured concept codes via a concept predictor, enabling direct regulation on concept disentanglement and re-composition supervision. Our method achieves strong disentanglement and controllable composition, outperforming subject and part-level baselines when mixing concepts from the same, or different, object categories.

2025-06-03T15:43:28Z Junyu Liu R. Kenny Jones Daniel Ritchie 10.1145/3757377.3763947 http://arxiv.org/abs/2505.03154v2 StableMotion: Training Motion Cleanup Models with Unpaired Corrupted Data 2025-09-15T04:21:36Z

Motion capture (mocap) data often exhibits visually jarring artifacts due to inaccurate sensors and post-processing. Cleaning this corrupted data can require substantial manual effort from human experts, which can be a costly and time-consuming process. Previous data-driven motion cleanup methods offer the promise of automating this cleanup process, but often require in-domain paired corrupted-to-clean training data. Constructing such paired datasets requires access to high-quality, relatively artifact-free motion clips, which often necessitates laborious manual cleanup. In this work, we present StableMotion, a simple yet effective method for training motion cleanup models directly from unpaired corrupted datasets that need cleanup. The core component of our method is the introduction of motion quality indicators, which can be easily annotated - through manual labeling or heuristic algorithms - and enable training of quality-aware motion generation models on raw motion data with mixed quality. At test time, the model can be prompted to generate high-quality motions using the quality indicators. Our method can be implemented through a simple diffusion-based framework, leading to a unified motion generate-discriminate model, which can be used to both identify and fix corrupted frames. We demonstrate that our proposed method is effective for training motion cleanup models on raw mocap data in production scenarios by applying StableMotion to SoccerMocap, a 245-hour soccer mocap dataset containing real-world motion artifacts. The trained model effectively corrects a wide range of motion artifacts, reducing motion pops and frozen frames by 68% and 81%, respectively. Results and code are available at https://yxmu.foo/stablemotion-page

2025-05-06T04:02:47Z Accepted for SIGGRAPH Asia 2025 Yuxuan Mu Hung Yu Ling Yi Shi Ismael Baira Ojeda Pengcheng Xi Chang Shu Fabio Zinno Xue Bin Peng http://arxiv.org/abs/2509.11411v1 On the Skinning of Gaussian Avatars 2025-09-14T19:58:48Z

Radiance field-based methods have recently been used to reconstruct human avatars, showing that we can significantly downscale the systems needed for creating animated human avatars. Although this progress has been initiated by neural radiance fields, their slow rendering and backward mapping from the observation space to the canonical space have been the main challenges. With Gaussian splatting overcoming both challenges, a new family of approaches has emerged that are faster to train and render, while also straightforward to implement using forward skinning from the canonical to the observation space. However, the linear blend skinning required for the deformation of the Gaussians does not provide valid results for their non-linear rotation properties. To address such artifacts, recent works use mesh properties to rotate the non-linear Gaussian properties or train models to predict corrective offsets. Instead, we propose a weighted rotation blending approach that leverages quaternion averaging. This leads to simpler vertex-based Gaussians that can be efficiently animated and integrated in any engine by only modifying the linear blend skinning technique, and using any Gaussian rasterizer.
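
As a rough illustration of weighted rotation blending with quaternions, the sketch below blends per-bone unit quaternions using skinning weights. It uses a common approximation (sign-align each quaternion to a reference, take the weighted sum, renormalize) and is an assumption for illustration only; the abstract does not specify which quaternion averaging scheme the authors use. The function name `blend_rotations` and the (w, x, y, z) component order are hypothetical choices.

```python
import math


def blend_rotations(quats, weights):
    """Approximate weighted average of unit quaternions given as (w, x, y, z).

    q and -q encode the same rotation, so each quaternion is first flipped
    into the hemisphere of the first one before the weighted sum.
    """
    ref = quats[0]
    acc = [0.0, 0.0, 0.0, 0.0]
    for q, w in zip(quats, weights):
        dot = sum(a * b for a, b in zip(q, ref))
        sign = -1.0 if dot < 0.0 else 1.0
        for i in range(4):
            acc[i] += w * sign * q[i]
    norm = math.sqrt(sum(a * a for a in acc))
    if norm == 0.0:
        raise ValueError("weights and rotations cancel out")
    return tuple(a / norm for a in acc)


identity = (1.0, 0.0, 0.0, 0.0)
quarter_turn_z = (math.cos(math.pi / 4), 0.0, 0.0, math.sin(math.pi / 4))
# Equal weights between 0 and 90 degrees about z give 45 degrees about z.
print(blend_rotations([identity, quarter_turn_z], [0.5, 0.5]))
```

Unlike blending rotation matrices linearly, the renormalized result is always a valid rotation, which is the property the linear blend skinning of Gaussian rotations lacks.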

2025-09-14T19:58:48Z Nikolaos Zioulis Nikolaos Kotarelas Georgios Albanis Spyridon Thermos Anargyros Chatzitofis http://arxiv.org/abs/2509.11410v1 3De Interactive Lenses for Visualization in Virtual Environments 2025-09-14T19:54:58Z

We present 3De lens, a technique for focus+context visualization of multi-geometry data. It fuses two categories of lenses (3D and Decal) to become a versatile lens for seamlessly working on multiple geometric representations that commonly coexist in 3D visualizations. In addition, we incorporate our lens into virtual reality as it enables a natural style of direct spatial manipulation for exploratory 3D data analysis. To demonstrate its potential use, we discuss two domain examples in which our lens technique creates customized visualizations of both surfaces and streamlines.

2025-09-14T19:54:58Z Roberta C. R. Mota Allan Rocha Julio Daniel Silva Usman Alim Ehud Sharlin http://arxiv.org/abs/2509.11377v1 3D Gaussian Modeling and Ray Marching of OpenVDB datasets for Scientific Visualization 2025-09-14T18:15:59Z

3D Gaussians are currently being heavily investigated for their scene modeling and compression abilities. In 3D volumes, their use is being explored for representing dense volumes as sparsely as possible. However, most of these methods begin with a memory-inefficient data format, especially in Scientific Visualization (SciVis), where the most popular formats are dense-grid data structures that store every grid cell, irrespective of its contribution. The OpenVDB library and data format were introduced for representing sparse volumetric data, specifically for visual effects use cases such as clouds, fire, and fluids. It avoids storing empty cells by masking them during storage. It presents an opportunity for use in SciVis, specifically as a modeling framework for conversion to 3D Gaussian particles for further compression and for a unified modeling approach for different scientific volume types. This compression head start is non-trivial, and this paper presents it together with a rendering algorithm based on line integration, implemented in OptiX 8.1, that calculates the contribution of 3D Gaussians along a ray for optical-depth accumulation. To compare the rendering results of our ray-marching Gaussian renderer, we also implement a SciVis-style, primary-ray-only, NanoVDB HDDA-based ray marcher for OpenVDB voxel grids. Finally, this paper also explores the application of this Gaussian model to volume formats other than regular grids, such as AMR volumes and point clouds, using the internal representation of OpenVDB grid class types for data hierarchy and subdivision structure.

2025-09-14T18:15:59Z Isha Sharma Dieter Schmalstieg http://arxiv.org/abs/2509.11087v1 SH-SAS: An Implicit Neural Representation for Complex Spherical-Harmonic Scattering Fields for 3D Synthetic Aperture Sonar 2025-09-14T04:29:28Z

Synthetic aperture sonar (SAS) reconstruction requires recovering both the spatial distribution of acoustic scatterers and their direction-dependent response. Time-domain backprojection is the most common 3D SAS reconstruction algorithm, but it does not model directionality and can suffer from sampling limitations, aliasing, and occlusion. Prior neural volumetric methods applied to synthetic aperture sonar treat each voxel as an isotropic scattering density, not modeling anisotropic returns. We introduce SH-SAS, an implicit neural representation that expresses the complex acoustic scattering field as a set of spherical harmonic (SH) coefficients. A multi-resolution hash encoder feeds a lightweight MLP that outputs complex SH coefficients up to a specified degree L. The zeroth-order coefficient acts as an isotropic scattering field, which also serves as the density term, while higher orders compactly capture directional scattering with minimal parameter overhead. Because the model predicts the complex amplitude for any transmit-receive baseline, training is performed directly from 1-D time-of-flight signals without the need to beamform intermediate images for supervision. Across synthetic and real SAS (both in-air and underwater) benchmarks, results show that SH-SAS performs better in terms of 3D reconstruction quality and geometric metrics than previous methods.

2025-09-14T04:29:28Z Omkar Shailendra Vengurlekar Adithya Pediredla Suren Jayasuriya http://arxiv.org/abs/2506.22426v2 Single-shot HDR using conventional image sensor shutter functions and optical randomization 2025-09-12T21:08:01Z

High-dynamic-range (HDR) imaging is an essential technique for overcoming the dynamic range limits of image sensors. The classic method relies on multiple exposures, which slows capture time, resulting in motion artifacts when imaging dynamic scenes. Single-shot HDR imaging alleviates this issue by encoding HDR data into a single exposure, then computationally recovering it. Many established methods use strong image priors to recover improperly exposed image detail. These approaches struggle with extended highlight regions. We utilize the global reset release (GRR) shutter mode of an off-the-shelf sensor. GRR shutter mode applies a longer exposure time to rows closer to the bottom of the sensor. We use optics that relay a randomly permuted (shuffled) image onto the sensor, effectively creating spatially randomized exposures across the scene. The exposure diversity allows us to recover HDR data by solving an optimization problem with a simple total variation image prior. In simulation, we demonstrate that our method outperforms other single-shot methods when many sensor pixels are saturated (10% or more), and is competitive at a modest saturation (1%). Finally, we demonstrate a physical lab prototype that uses an off-the-shelf random fiber bundle for the optical shuffling. The fiber bundle is coupled to a low-cost commercial sensor operating in GRR shutter mode. Our prototype achieves a dynamic range of up to 73 dB using an 8-bit sensor with 48 dB dynamic range.
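
The decibel figures quoted above can be sanity-checked with a short calculation. The sketch assumes the common 20·log10 convention for sensor dynamic range, which is consistent with the stated 48 dB for an 8-bit sensor; the helper names are hypothetical. Reading the 25 dB gain as a ratio of longest to shortest effective exposure is an illustrative interpretation, not a figure from the paper.

```python
import math


def dynamic_range_db(ratio):
    """Dynamic range in dB for a max/min signal ratio (20*log10 convention)."""
    return 20.0 * math.log10(ratio)


def db_to_ratio(db):
    """Inverse of dynamic_range_db: the signal ratio for a given dB value."""
    return 10.0 ** (db / 20.0)


# An 8-bit sensor spans codes 1..255, i.e. a ratio of 255.
print(round(dynamic_range_db(255), 2))

# Gain of the prototype over the bare sensor: 73 dB - 48 dB = 25 dB.
print(round(db_to_ratio(73 - 48), 2))
```

Under these assumptions, the 25 dB improvement corresponds to roughly an 18x wider span of recoverable intensities than the 8-bit sensor provides on its own.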

2025-06-27T17:48:21Z Published in ACM Transactions on Graphics (TOG), Volume 44, Issue 5, October 2025. DOI: 10.1145/3748718 ACM Trans. Graph. 44, 5, Article 172 (October 2025), 20 pages Xiang Dai Kyrollos Yanny Kristina Monakhova Nicholas Antipa 10.1145/3748718 http://arxiv.org/abs/2408.04466v2 One-Shot Method for Computing Generalized Winding Numbers 2025-09-12T20:47:11Z

The generalized winding number is an essential part of the geometry processing toolkit, allowing one to quantify how much a given point is inside a surface, even when the surface has boundaries and noise. We propose a new universal method to compute a generalized winding number, based only on the surface boundary and the intersections of a single ray with the surface, supporting any oriented surface representation that supports a ray intersection query. Due to the focus on the boundary, our algorithm has a unique set of properties. For 2D parametric curves, on a regular grid of query points, our method is up to 4x faster than the current state of the art, maintaining the same precision. In 3D, our method can compute a winding number of a surface without discretizing it, including parametric surfaces. For some meshes with many triangles and a simple boundary, our method is faster than the hierarchical evaluation of the generalized winding number while still being precise. Similarly, on some parametric surfaces with a simple boundary, our method can be faster than adaptive quadrature. We validate our algorithms theoretically, numerically, and by demonstrating a gallery of results on a variety of parametric surfaces and meshes, as well as uses in a variety of applications, including voxelizations and boolean operations.
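
For readers unfamiliar with the quantity being computed, the following sketch evaluates the generalized winding number of a 2D polyline by direct summation of the signed angles its segments subtend at the query point. This is the standard baseline definition, not the one-shot ray-and-boundary method proposed in the paper; the function name `winding_number` is hypothetical.

```python
import math


def winding_number(q, polyline):
    """Generalized winding number of a 2D polyline (open or closed) about q.

    Sums the signed angle subtended at q by every segment and divides by
    2*pi. A closed counter-clockwise curve yields 1 inside and 0 outside;
    an open curve yields fractional values.
    """
    total = 0.0
    for (ax, ay), (bx, by) in zip(polyline, polyline[1:]):
        ux, uy = ax - q[0], ay - q[1]
        vx, vy = bx - q[0], by - q[1]
        # atan2(cross, dot) is the signed angle from u to v.
        total += math.atan2(ux * vy - uy * vx, ux * vx + uy * vy)
    return total / (2.0 * math.pi)


closed_square = [(0, 0), (1, 0), (1, 1), (0, 1), (0, 0)]
open_square = closed_square[:-1]  # drop the last edge, leaving a boundary gap

print(winding_number((0.5, 0.5), closed_square))
print(winding_number((2.0, 2.0), closed_square))
print(winding_number((0.5, 0.5), open_square))
```

The open square shows why the quantity is useful for surfaces with boundaries: the center still scores 0.75, so "mostly inside" is captured even though the curve is not watertight.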

2024-08-08T13:49:26Z 15 pages, 18 figures (2025), One-Shot Method for Computing Generalized Winding Numbers. Computer Graphics Forum, 44: e70194 Cedric Martens Mikhail Bessmeltsev 10.1111/cgf.70194 http://arxiv.org/abs/2508.05524v2 GASP: A Gradient-Aware Shortest Path Algorithm for Boundary-Confined Visualization of 2-Manifold Reeb Graphs 2025-09-12T17:45:56Z

Reeb graphs are an important tool for abstracting and representing the topological structure of a function defined on a manifold. We have identified three properties for faithfully representing Reeb graphs in a visualization: they should be constrained to the boundary, compact, and aligned with the function gradient. Existing algorithms for drawing Reeb graphs are agnostic to or violate these properties. In this paper, we introduce an algorithm to generate Reeb graph visualizations, called GASP, that is cognizant of these properties, thereby producing visualizations that are more representative of the underlying data. To demonstrate the improvements, the resulting Reeb graphs are evaluated both qualitatively and quantitatively against the geometric barycenter algorithm, using its implementation available in the Topology ToolKit (TTK), a widely adopted tool for calculating and visualizing Reeb graphs.

2025-08-07T15:53:26Z Sefat E. Rahman Tushar M. Athawale Paul Rosen http://arxiv.org/abs/2509.10599v1 Can any model be fabricated? Inverse operation based planning for hybrid additive-subtractive manufacturing 2025-09-12T17:31:41Z

This paper presents a method for computing interleaved additive and subtractive manufacturing operations to fabricate models of arbitrary shapes. We solve the manufacturing planning problem by searching for a sequence of inverse operations that progressively transform a target model into a null shape. Each inverse operation corresponds to either an additive or a subtractive step, ensuring both manufacturability and structural stability of intermediate shapes throughout the process. We theoretically prove that any model can be fabricated exactly using a sequence generated by our approach. To demonstrate the effectiveness of this method, we adopt a voxel-based implementation and develop a scalable algorithm that works on models represented by a large number of voxels. Our approach has been tested across a range of digital models and further validated through physical fabrication on a hybrid manufacturing system with automatic tool switching.

2025-09-12T17:31:41Z Yongxue Chen Tao Liu Yuming Huang Weiming Wang Tianyu Zhang Kun Qian Zikang Shi Charlie C. L. Wang http://arxiv.org/abs/2509.09952v1 Chord: Chain of Rendering Decomposition for PBR Material Estimation from Generated Texture Images 2025-09-12T04:03:07Z

Material creation and reconstruction are crucial for appearance modeling but traditionally require significant time and expertise from artists. While recent methods leverage visual foundation models to synthesize PBR materials from user-provided inputs, they often fall short in quality, flexibility, and user control. We propose a novel two-stage generate-and-estimate framework for PBR material generation. In the generation stage, a fine-tuned diffusion model synthesizes shaded, tileable texture images aligned with user input. In the estimation stage, we introduce a chained decomposition scheme that sequentially predicts SVBRDF channels by passing previously extracted representations as input into a single-step image-conditional diffusion model. Our method is efficient, high quality, and enables flexible user control. We evaluate our approach against existing material generation and estimation methods, demonstrating superior performance. Our material estimation method shows strong robustness on both generated textures and in-the-wild photographs. Furthermore, we highlight the flexibility of our framework across diverse applications, including text-to-material, image-to-material, structure-guided generation, and material editing.

2025-09-12T04:03:07Z Accepted to SIGGRAPH Asia 2025. Project page: https://ubisoft-laforge.github.io/world/chord Zhi Ying Boxiang Rong Jingyu Wang Maoyuan Xu http://arxiv.org/abs/2509.09143v1 Objectness Similarity: Capturing Object-Level Fidelity in 3D Scene Evaluation 2025-09-11T04:33:27Z

This paper presents Objectness SIMilarity (OSIM), a novel evaluation metric for 3D scenes that explicitly focuses on "objects," which are fundamental units of human visual perception. Existing metrics assess overall image quality, leading to discrepancies with human perception. Inspired by neuropsychological insights, we hypothesize that human recognition of 3D scenes fundamentally involves attention to individual objects. OSIM enables object-centric evaluations by leveraging an object detection model and its feature representations to quantify the "objectness" of each object in the scene. Our user study demonstrates that OSIM aligns more closely with human perception compared to existing metrics. We also analyze the characteristics of OSIM using various approaches. Moreover, we re-evaluate recent 3D reconstruction and generation models under a standardized experimental setup to clarify advancements in this field. The code is available at https://github.com/Objectness-Similarity/OSIM.

2025-09-11T04:33:27Z Accepted by the ICCV 2025 UniLight Workshop Yuiko Uchida Ren Togo Keisuke Maeda Takahiro Ogawa Miki Haseyama