https://arxiv.org/api/qovX2DfyT7BrkLSwG7hRXs9EG2I2026-06-27T19:02:28Z9390162015http://arxiv.org/abs/2502.00626v2Lifting the Winding Number: Precise Discontinuities in Neural Fields for Physics Simulation2025-09-11T03:25:24ZCutting thin-walled deformable structures is common in daily life, but poses significant challenges for simulation due to the introduced spatial discontinuities. Traditional methods rely on mesh-based domain representations, which require frequent remeshing and refinement to accurately capture evolving discontinuities. These challenges are further compounded in reduced-space simulations, where the basis functions are inherently geometry- and mesh-dependent, making it difficult or even impossible for the basis to represent the diverse family of discontinuities introduced by cuts.
Recent advances in representing basis functions with neural fields offer a promising alternative, leveraging their discretization-agnostic nature to represent deformations across varying geometries. However, the inherent continuity of neural fields is an obstruction to generalization, particularly if discontinuities are encoded in neural network weights.
We present Wind Lifter, a novel neural representation designed to accurately model complex cuts in thin-walled deformable structures. Our approach constructs neural fields that reproduce discontinuities precisely at specified locations, without baking in the position of the cut line. Crucially, our approach does not embed the discontinuity in the neural network's weights, opening avenues to generalization of cut placement.
Our method achieves real-time simulation speeds and supports dynamic updates to cut line geometry during the simulation. Moreover, the explicit representation of discontinuities makes our neural field intuitive to control and edit, offering a significant advantage over traditional neural fields, where discontinuities are embedded within the network's weights, and enabling new applications that rely on general cut placement.2025-02-02T01:51:56ZYue ChangMengfei LiuZhecheng WangPeter Yichen ChenEitan Grinspunhttp://arxiv.org/abs/2509.07897v1dciWebMapper2: Enhancing the dciWebMapper framework toward integrated, interactive visualization of linked multi-type maps, charts, and spatial statistics and analysis2025-09-09T16:29:12ZAs interactive web-based geovisualization becomes increasingly vital across disciplines, there is a growing need for open-source frameworks that support dynamic, multi-attribute spatial analysis and accessible design. This paper introduces dciWebMapper2, a significant expansion of the original dciWebMapper framework, designed to enable exploratory analysis across domains such as climate justice, food access, and social vulnerability. The enhanced framework integrates multiple map types, including choropleth, proportional symbol, small multiples, and heatmaps, with linked statistical charts (e.g., scatter plots, boxplots) and time sliders, all within a coordinated-view environment. Dropdown-based controls allow flexible, high-dimensional comparisons while maintaining visual clarity. Grounded in cartographic and information visualization principles, dciWebMapper2 is fully open-source, self-contained, and server-free, supporting modularity, reproducibility, and long-term sustainability. Three applied use cases demonstrate its adaptability and potential to democratize interactive web cartography. This work offers a versatile foundation for inclusive spatial storytelling and transparent geospatial analysis in research, education, and civic engagement.2025-09-09T16:29:12Z15 figures, 2 tables, and three advanced interactive web map apps that are openly available to the publicSarigai SarigaiLiping YangKatie SlackCarolyn FishMichaela BuenemannQiusheng WuYan LinJoseph A. CookDavid Jacobshttp://arxiv.org/abs/2502.08297v2BEAM: Bridging Physically-based Rendering and Gaussian Modeling for Relightable Volumetric Video2025-09-09T14:05:14ZVolumetric video enables immersive experiences by capturing dynamic 3D scenes, enabling diverse applications for virtual reality, education, and telepresence. However, traditional methods struggle with fixed lighting conditions, while neural approaches face trade-offs in efficiency, quality, or adaptability for relightable scenarios. To address these limitations, we present BEAM, a novel pipeline that bridges 4D Gaussian representations with physically-based rendering (PBR) to produce high-quality, relightable volumetric videos from multi-view RGB footage. BEAM recovers detailed geometry and PBR properties via a series of available Gaussian-based techniques. It first combines Gaussian-based human performance tracking with geometry-aware rasterization in a coarse-to-fine optimization framework to recover spatially and temporally consistent geometries. We further enhance Gaussian attributes by incorporating PBR properties step by step. We generate roughness via a multi-view-conditioned diffusion model, and then derive AO and base color using a 2D-to-3D strategy, incorporating a tailored Gaussian-based ray tracer for efficient visibility computation. Once recovered, these dynamic, relightable assets integrate seamlessly into traditional CG pipelines, supporting real-time rendering with deferred shading and offline rendering with ray tracing. By offering realistic, lifelike visualizations under diverse lighting conditions, BEAM opens new possibilities for interactive entertainment, storytelling, and creative visualization.2025-02-12T10:58:09ZYu HongYize WuZhehao ShenChengcheng GuoYuheng JiangYingliang ZhangJingyi YuLan Xuhttp://arxiv.org/abs/2506.18407v2IntuiTF: MLLM-Guided Transfer Function Optimization for Direct Volume Rendering2025-09-09T12:36:01ZDirect volume rendering (DVR) is a fundamental technique for visualizing volumetric data, where transfer functions (TFs) play a crucial role in extracting meaningful structures. However, designing effective TFs remains unintuitive due to the semantic gap between user intent and TF parameter space. Although numerous TF optimization methods have been proposed to mitigate this issue, existing approaches still face two major challenges: the vast exploration space and limited generalizability. To address these issues, we propose IntuiTF, a novel framework that leverages Multimodal Large Language Models (MLLMs) to guide TF optimization in alignment with user intent. Specifically, our method consists of two key components: (1) an evolution-driven explorer for effective exploration of the TF space, and (2) an MLLM-guided human-aligned evaluator that provides generalizable visual feedback on rendering quality. The explorer and the evaluator together establish an efficient Trial-Insight-Replanning paradigm for TF space exploration. We further extend our framework with an interactive TF design system. We demonstrate the broad applicability of our framework through three case studies and validate the effectiveness of each component through extensive experiments. We strongly recommend readers check our cases, demo video, and source code at: https://github.com/wyysteelhead/IntuiTF2025-06-23T08:42:57ZYiyao WangBo PanKe WangHan LiuJinyuan MaoYuxin LiuMinfeng ZhuXiuqi HuangWeifeng ChenBo ZhangWei Chenhttp://arxiv.org/abs/2509.07653v1Topology-Aware Optimization of Gaussian Primitives for Human-Centric Volumetric Videos2025-09-09T12:18:31ZVolumetric video is emerging as a key medium for digitizing the dynamic physical world, creating the virtual environments with six degrees of freedom to deliver immersive user experiences. However, robustly modeling general dynamic scenes, especially those involving topological changes while maintaining long-term tracking remains a fundamental challenge. In this paper, we present TaoGS, a novel topology-aware dynamic Gaussian representation that disentangles motion and appearance to support, both, long-range tracking and topological adaptation. We represent scene motion with a sparse set of motion Gaussians, which are continuously updated by a spatio-temporal tracker and photometric cues that detect structural variations across frames. To capture fine-grained texture, each motion Gaussian anchors and dynamically activates a set of local appearance Gaussians, which are non-rigidly warped to the current frame to provide strong initialization and significantly reduce training time. This activation mechanism enables efficient modeling of detailed textures and maintains temporal coherence, allowing high-fidelity rendering even under challenging scenarios such as changing clothes. To enable seamless integration into codec-based volumetric formats, we introduce a global Gaussian Lookup Table that records the lifespan of each Gaussian and organizes attributes into a lifespan-aware 2D layout. This structure aligns naturally with standard video codecs and supports up to 40 compression. TaoGS provides a unified, adaptive solution for scalable volumetric video under topological variation, capturing moments where "elegance in motion" and "Power in Stillness", delivering immersive experiences that harmonize with the physical world.2025-09-09T12:18:31ZAccepted at SIGGRAPH Asia 2025. Project page: https://guochch.github.io/TaoGS/Yuheng JiangChengcheng GuoYize WuYu HongShengkun ZhuZhehao ShenYingliang ZhangShaohui JiaoZhuo SuLan XuMarc HabermannChristian Theobalthttp://arxiv.org/abs/2509.07643v1ReShape: a Collaborative Art Experience2025-09-09T12:10:22ZThis article describes a project called ReShape in which we created and designed a crowdsourced art initiative, inspired and powered by mathematics.2025-09-09T12:10:22Z12 pagesHugo ParlierBruno Teheuxhttp://arxiv.org/abs/2502.05752v2PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map2025-09-09T08:58:33ZRobots benefit from high-fidelity reconstructions of their environment, which should be geometrically accurate and photorealistic to support downstream tasks. While this can be achieved by building distance fields from range sensors and radiance fields from cameras, realising scalable incremental mapping of both fields consistently and at the same time with high quality is challenging. In this paper, we propose a novel map representation that unifies a continuous signed distance field and a Gaussian splatting radiance field within an elastic and compact point-based implicit neural map. By enforcing geometric consistency between these fields, we achieve mutual improvements by exploiting both modalities. We present a novel LiDAR-visual SLAM system called PINGS using the proposed map representation and evaluate it on several challenging large-scale datasets. Experimental results demonstrate that PINGS can incrementally build globally consistent distance and radiance fields encoded with a compact set of neural points. Compared to state-of-the-art methods, PINGS achieves superior photometric and geometric rendering at novel views by constraining the radiance field with the distance field. Furthermore, by utilizing dense photometric cues and multi-view consistency from the radiance field, PINGS produces more accurate distance fields, leading to improved odometry estimation and mesh reconstruction. We also provide an open-source implementation of PING at: https://github.com/PRBonn/PINGS.2025-02-09T03:06:19Z15 pages, 8 figures, presented at RSS 2025Yue PanXingguang ZhongLiren JinLouis WiesmannMarija PopovićJens BehleyCyrill Stachnisshttp://arxiv.org/abs/2509.07522v1Neural Cone Radiosity for Interactive Global Illumination with Glossy Materials2025-09-09T08:58:13ZModeling of high-frequency outgoing radiance distributions has long been a key challenge in rendering, particularly for glossy material. Such distributions concentrate radiative energy within a narrow lobe and are highly sensitive to changes in view direction. However, existing neural radiosity methods, which primarily rely on positional feature encoding, exhibit notable limitations in capturing these high-frequency, strongly view-dependent radiance distributions. To address this, we propose a highly-efficient approach by reflectance-aware ray cone encoding based on the neural radiosity framework, named neural cone radiosity. The core idea is to employ a pre-filtered multi-resolution hash grid to accurately approximate the glossy BSDF lobe, embedding view-dependent reflectance characteristics directly into the encoding process through continuous spatial aggregation. Our design not only significantly improves the network's ability to model high-frequency reflection distributions but also effectively handles surfaces with a wide range of glossiness levels, from highly glossy to low-gloss finishes. Meanwhile, our method reduces the network's burden in fitting complex radiance distributions, allowing the overall architecture to remain compact and efficient. Comprehensive experimental results demonstrate that our method consistently produces high-quality, noise-free renderings in real time under various glossiness conditions, and delivers superior fidelity and realism compared to baseline approaches.2025-09-09T08:58:13ZJierui RenHaojie JinBo PangYisong ChenGuoping WangSheng Lihttp://arxiv.org/abs/2509.08855v1Morphology-Preserving Remeshing Approach to Particulate Microstructures via Harmonic Decomposition2025-09-09T08:49:34ZHarmonic decomposition of surfaces, such as spherical and spheroidal harmonics, is used to analyze morphology, reconstruct, and generate surface inclusions of particulate microstructures. However, obtaining high-quality meshes of engineering microstructures using these approaches remains an open question. In harmonic approaches, we usually reconstruct surfaces by evaluating the harmonic bases on equidistantly sampled simplicial complexes of the base domains (e.g., triangular spheroids and disks). However, this traditional sampling does not account for local changes in the Jacobian of the basis functions, resulting in nonuniform discretization after reconstruction or generation. As it impacts the accuracy and time step, high-quality discretization of microstructures is crucial for efficient numerical simulations (e.g., finite element and discrete element methods). To circumvent this issue, we propose an efficient hierarchical diffusion-based approach for resampling the surface-i.e., performing a reparameterization-to yield an equalized mesh triangulation. Analogous to heat problems, we use nonlinear diffusion to resample the curvilinear coordinates of the analysis domain, thereby enlarging small triangles at the expense of large triangles on surfaces. We tested isotropic and anisotropic diffusion schemes on the recent spheroidal and hemispheroidal harmonics methods. The results show a substantial improvement in the quality metrics for surface triangulation. Unlike traditional surface reconstruction and meshing techniques, this approach preserves surface morphology, along with the areas and volumes of surfaces. We discuss the results and the associated computational costs for large 2D and 3D microstructures, such as digital twins of concrete and stone masonry, and their future applications.2025-09-09T08:49:34ZMahmoud Shaqfa10.1016/j.powtec.2025.121991http://arxiv.org/abs/2405.15425v3Don't Splat your Gaussians: Volumetric Ray-Traced Primitives for Modeling and Rendering Scattering and Emissive Media2025-09-08T22:26:42ZEfficient scene representations are essential for many computer graphics applications. A general unified representation that can handle both surfaces and volumes simultaneously, remains a research challenge. Inspired by recent methods for scene reconstruction that leverage mixtures of 3D Gaussians to model radiance fields, we formalize and generalize the modeling of scattering and emissive media using mixtures of simple kernel-based volumetric primitives. We introduce closed-form solutions for transmittance and free-flight distance sampling for different kernels, and propose several optimizations to use our method efficiently within any off-the-shelf volumetric path tracer. We demonstrate our method as a compact and efficient alternative to other forms of volume modeling for forward and inverse rendering of scattering media. Furthermore, we adapt and showcase our method in radiance field optimization and rendering, providing additional flexibility compared to current state of the art given its ray-tracing formulation. We also introduce the Epanechnikov kernel and demonstrate its potential as an efficient alternative to the traditionally-used Gaussian kernel in scene reconstruction tasks. The versatility and physically-based nature of our approach allows us to go beyond radiance fields and bring to kernel-based modeling and rendering any path-tracing enabled functionality such as scattering, relighting and complex camera models.2024-05-24T10:42:05Z17 pages, 17 figuresACM Trans. Graph. 44, 1, Article 10 (February 2025), 17 pagesJorge CondorSebastien SpeiererLukas BodeAljaz BozicSimon GreenPiotr DidykAdrian Jarabo10.1145/3711853http://arxiv.org/abs/2509.07127v1SVGauge: Towards Human-Aligned Evaluation for SVG Generation2025-09-08T18:28:31ZGenerated Scalable Vector Graphics (SVG) images demand evaluation criteria tuned to their symbolic and vectorial nature: criteria that existing metrics such as FID, LPIPS, or CLIPScore fail to satisfy. In this paper, we introduce SVGauge, the first human-aligned, reference based metric for text-to-SVG generation. SVGauge jointly measures (i) visual fidelity, obtained by extracting SigLIP image embeddings and refining them with PCA and whitening for domain alignment, and (ii) semantic consistency, captured by comparing BLIP-2-generated captions of the SVGs against the original prompts in the combined space of SBERT and TF-IDF. Evaluation on the proposed SHE benchmark shows that SVGauge attains the highest correlation with human judgments and reproduces system-level rankings of eight zero-shot LLM-based generators more faithfully than existing metrics. Our results highlight the necessity of vector-specific evaluation and provide a practical tool for benchmarking future text-to-SVG generation models.2025-09-08T18:28:31ZAccepted at 23rd edition of International Conference on Image Analysis and Processing 2025Leonardo ZiniElia FrigieriSebastiano AloscariMarcello GeneraliLorenzo DodiRobert DosenLorenzo Baraldihttp://arxiv.org/abs/2509.06950v1Scaling Transformer-Based Novel View Synthesis Models with Token Disentanglement and Synthetic Data2025-09-08T17:58:06ZLarge transformer-based models have made significant progress in generalizable novel view synthesis (NVS) from sparse input views, generating novel viewpoints without the need for test-time optimization. However, these models are constrained by the limited diversity of publicly available scene datasets, making most real-world (in-the-wild) scenes out-of-distribution. To overcome this, we incorporate synthetic training data generated from diffusion models, which improves generalization across unseen domains. While synthetic data offers scalability, we identify artifacts introduced during data generation as a key bottleneck affecting reconstruction quality. To address this, we propose a token disentanglement process within the transformer architecture, enhancing feature separation and ensuring more effective learning. This refinement not only improves reconstruction quality over standard transformers but also enables scalable training with synthetic data. As a result, our method outperforms existing models on both in-dataset and cross-dataset evaluations, achieving state-of-the-art results across multiple benchmarks while significantly reducing computational costs. Project page: https://scaling3dnvs.github.io/2025-09-08T17:58:06ZAccepted at ICCV 2025Nithin Gopalakrishnan NairSrinivas KazaXuan LuoVishal M. PatelStephen LombardiJungyeon Parkhttp://arxiv.org/abs/2403.18767v3The best approximation pair problem relative to two subsets in a normed space2025-09-08T15:54:54ZIn the classical best approximation pair (BAP) problem, one is given two nonempty, closed, convex and disjoint subsets in a finite- or an infinite-dimensional Hilbert space, and the goal is to find a pair of points, each from each subset, which realizes the distance between the subsets. We discuss the problem in more general normed spaces and with possibly non-convex subsets, and focus our attention on the issues of uniqueness and existence of the solution to the problem. As far as we know, these fundamental issues have not received much attention. We present several sufficient geometric conditions for the (at most) uniqueness of a BAP. These conditions are related to the structure and the relative orientation of the boundaries of the subsets and to the norm. We also present many sufficient conditions for the existence of a BAP. Our results significantly extend the horizon of a recent algorithm for solving the BAP problem [Censor, Mansour, Reem, J. Approx. Theory (2024)]. The paper also shows, perhaps for the first time, how wide is the scope of the BAP problem in terms of the scientific communities which are involved in it (frequently independently) and in terms of its applications.2024-03-27T17:05:06ZCorrection of a misprint in the AcknowledgmentsDaniel ReemYair Censorhttp://arxiv.org/abs/2509.06607v1From Skin to Skeleton: Towards Biomechanically Accurate 3D Digital Humans2025-09-08T12:24:27ZGreat progress has been made in estimating 3D human pose and shape from images and video by training neural networks to directly regress the parameters of parametric human models like SMPL. However, existing body models have simplified kinematic structures that do not correspond to the true joint locations and articulations in the human skeletal system, limiting their potential use in biomechanics. On the other hand, methods for estimating biomechanically accurate skeletal motion typically rely on complex motion capture systems and expensive optimization methods. What is needed is a parametric 3D human model with a biomechanically accurate skeletal structure that can be easily posed. To that end, we develop SKEL, which re-rigs the SMPL body model with a biomechanics skeleton. To enable this, we need training data of skeletons inside SMPL meshes in diverse poses.
We build such a dataset by optimizing biomechanically accurate skeletons inside SMPL meshes from AMASS sequences. We then learn a regressor from SMPL mesh vertices to the optimized joint locations and bone rotations. Finally, we re-parametrize the SMPL mesh with the new kinematic parameters. The resulting SKEL model is animatable like SMPL but with fewer, and biomechanically-realistic, degrees of freedom. We show that SKEL has more biomechanically accurate joint locations than SMPL, and the bones fit inside the body surface better than previous methods. By fitting SKEL to SMPL meshes we are able to "upgrade" existing human pose and shape datasets to include biomechanical parameters. SKEL provides a new tool to enable biomechanics in the wild, while also providing vision and graphics researchers with a better constrained and more realistic model of human articulation. The model, code, and data are available for research at https://skel.is.tue.mpg.de..2025-09-08T12:24:27ZACM Trans. Graph. 42, 6, Article 253 (December 2023), 12 pagesMarilyn KellerKeenon WerlingSoyong ShinScott DelpSergi PujadesC. Karen LiuMichael J. Black10.1145/3618381http://arxiv.org/abs/2509.06573v1From Rigging to Waving: 3D-Guided Diffusion for Natural Animation of Hand-Drawn Characters2025-09-08T11:35:40ZHand-drawn character animation is a vibrant field in computer graphics, presenting challenges in achieving geometric consistency while conveying expressive motion. Traditional skeletal animation methods maintain geometric consistency but struggle with complex non-rigid elements like flowing hair and skirts, leading to unnatural deformation. Conversely, video diffusion models synthesize realistic dynamics but often create geometric distortions in stylized drawings due to domain gaps. This work proposes a hybrid animation system that combines skeletal animation and video diffusion. Initially, coarse images are generated from characters retargeted with skeletal animations for geometric guidance. These images are then enhanced in texture and secondary dynamics using video diffusion priors, framing this enhancement as an inpainting task. A domain-adapted diffusion model refines user-masked regions needing improvement, especially for secondary dynamics. To enhance motion realism further, we introduce a Secondary Dynamics Injection (SDI) strategy in the denoising process, incorporating features from a pre-trained diffusion model enriched with human motion priors. Additionally, to tackle unnatural deformations from low-poly single-mesh character modeling, we present a Hair Layering Modeling (HLM) technique that uses segmentation maps to separate hair from the body, allowing for more natural animation of long-haired characters. Extensive experiments show that our system outperforms state-of-the-art methods in both quantitative and qualitative evaluations.2025-09-08T11:35:40ZJie ZhouLinzi QuMiu-Ling LamHongbo Fu