https://arxiv.org/api/3YNyKRC55tCICwf/BWTHEYQRJ2c2026-07-01T13:37:15Z9421216015http://arxiv.org/abs/2506.00870v1Hybridizing Expressive Rendering: Stroke-Based Rendering with Classic and Neural Methods2025-06-01T07:18:24ZNon-Photorealistic Rendering (NPR) has long been used to create artistic visualizations that prioritize style over realism, enabling the depiction of a wide range of aesthetic effects, from hand-drawn sketches to painterly renderings. While classical NPR methods, such as edge detection, toon shading, and geometric abstraction, have been well-established in both research and practice, with a particular focus on stroke-based rendering, the recent rise of deep learning represents a paradigm shift. We analyze the similarities and differences between classical and neural network based NPR techniques, focusing on stroke-based rendering (SBR), highlighting their strengths and limitations. We discuss trade offs in quality and artistic control between these paradigms, propose a framework where these approaches can be combined for new possibilities in expressive rendering.2025-06-01T07:18:24ZKapil Devhttp://arxiv.org/abs/2506.02034v1High-throughput viscometry via machine-learning from videos of inverted vials2025-05-30T20:45:05ZAlthough the inverted vial test has been widely used as a qualitative method for estimating fluid viscosity, quantitative rheological characterization has remained limited due to its complex, uncontrolled flow - driven by gravity, surface tension, inertia, and initial conditions. Here, we present a computer vision (CV) viscometer that automates the inverted vial test and enables quantitative viscosity inference across nearly five orders of magnitude (0.01-1000 Pas), without requiring direct velocity field measurements. The system simultaneously inverts multiple vials and records videos of the evolving fluid, which are fed into a neural network that approximates the inverse function from visual features and known fluid density. Despite the complex, multi-regime flow within the vial, our approach achieves relative errors below 25%, improving to 15% for viscosities above 0.1 Pas. When tested on non-Newtonian polymer solutions, the method reliably estimates zero-shear viscosity as long as viscoelastic or shear-thinning behaviors remain negligible within the flow regime. Moreover, high standard deviations in the inferred values may serve as a proxy for identifying fluids with strong non-Newtonian behavior. The CV viscometer requires only one camera and one motor, is contactless and low-cost, and can be easily integrated into high-throughput experimental automated and manual workflows. Transcending traditional characterization paradigms, our method leverages uncontrolled flows and visual features to achieve simplicity and scalability, enabling high-throughput viscosity inference that can meet the growing demand of data-driven material models while remaining accessible to lower resource environments.2025-05-30T20:45:05ZIgnacio ArretcheMohammad Tanver HossainRamdas TiwariAbbie KimMya G. MillsConnor D. ArmstrongJacob J. LessardSameh H. TawfickRandy H. Ewoldthttp://arxiv.org/abs/2506.00173v1MotionPersona: Characteristics-aware Locomotion Control2025-05-30T19:24:43ZWe present MotionPersona, a novel real-time character controller that allows users to characterize a character by specifying attributes such as physical traits, mental states, and demographics, and projects these properties into the generated motions for animating the character. In contrast to existing deep learning-based controllers, which typically produce homogeneous animations tailored to a single, predefined character, MotionPersona accounts for the impact of various traits on human motion as observed in the real world. To achieve this, we develop a block autoregressive motion diffusion model conditioned on SMPLX parameters, textual prompts, and user-defined locomotion control signals. We also curate a comprehensive dataset featuring a wide range of locomotion types and actor traits to enable the training of this characteristic-aware controller. Unlike prior work, MotionPersona is the first method capable of generating motion that faithfully reflects user-specified characteristics (e.g., an elderly person's shuffling gait) while responding in real time to dynamic control inputs. Additionally, we introduce a few-shot characterization technique as a complementary conditioning mechanism, enabling customization via short motion clips when language prompts fall short. Through extensive experiments, we demonstrate that MotionPersona outperforms existing methods in characteristics-aware locomotion control, achieving superior motion quality and diversity. Results, code, and demo can be found at: https://motionpersona25.github.io/.2025-05-30T19:24:43Z15 pages, 13 figures, webpage: https://motionpersona25.github.io/Mingyi ShiWei LiuJidong MeiWangpok TseRui ChenXuelin ChenTaku Komurahttp://arxiv.org/abs/2505.24653v1Minimizing Ray Tracing Memory Traffic through Quantized Structures and Ray Stream Tracing2025-05-30T14:41:25ZMemory bandwidth constraints continue to be a significant limiting factor in ray tracing performance, particularly as scene complexity grows and computational capabilities outpace memory access speeds. This paper presents a memory-efficient ray tracing methodology that integrates compressed data structures with ray stream techniques to reduce memory traffic. The approach implements compressed BVH and triangle representations to minimize acceleration structure size in combination with ray stream tracing to reduce traversal stack memory traffic. The technique employs fixed-point arithmetic for intersection tests for prospective hardware with tailored integer operations. Despite using reduced precision, geometric holes are avoided by leveraging fixed-point arithmetic instead of encountering the floating-point rounding errors common in traditional approaches. Quantitative analysis demonstrates significant memory traffic reduction across various scene complexities and BVH configurations. The presented 8-wide BVH ray stream implementation reduces memory traffic to only 18% of traditional approaches by using 8-bit quantization for box and triangle coordinates and directly ray tracing these quantized structures. These reductions are especially beneficial for bandwidth-constrained hardware environments such as mobile devices. This integrated approach addresses both memory bandwidth limitations and numerical precision challenges inherent to modern ray tracing applications.2025-05-30T14:41:25ZMoritz GrauerJohannes HanikaCarsten Dachsbacherhttp://arxiv.org/abs/2405.14979v4CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner2025-05-30T10:22:03ZWe present a novel generative 3D modeling system, coined CraftsMan, which can generate high-fidelity 3D geometries with highly varied shapes, regular mesh topologies, and detailed surfaces, and, notably, allows for refining the geometry in an interactive manner. Despite the significant advancements in 3D generation, existing methods still struggle with lengthy optimization processes, irregular mesh topologies, noisy surfaces, and difficulties in accommodating user edits, consequently impeding their widespread adoption and implementation in 3D modeling software. Our work is inspired by the craftsman, who usually roughs out the holistic figure of the work first and elaborates the surface details subsequently. Specifically, we employ a 3D native diffusion model, which operates on latent space learned from latent set-based 3D representations, to generate coarse geometries with regular mesh topology in seconds. In particular, this process takes as input a text prompt or a reference image and leverages a powerful multi-view (MV) diffusion model to generate multiple views of the coarse geometry, which are fed into our MV-conditioned 3D diffusion model for generating the 3D geometry, significantly improving robustness and generalizability. Following that, a normal-based geometry refiner is used to significantly enhance the surface details. This refinement can be performed automatically, or interactively with user-supplied edits. Extensive experiments demonstrate that our method achieves high efficacy in producing superior-quality 3D assets compared to existing methods. HomePage: https://craftsman3d.github.io/, Code: https://github.com/wyysf-98/CraftsMan2024-05-23T18:30:12ZHomePage: https://craftsman3d.github.io/, Code: https://github.com/wyysf-98/CraftsMan3DWeiyu LiJiarui LiuHongyu YanRui ChenYixun LiangXuelin ChenPing TanXiaoxiao Longhttp://arxiv.org/abs/2507.18460v1Topology-Preserving Coupling of Compressible Fluids and Thin Deformables2025-05-29T23:04:53ZWe present a novel discretization of coupled compressible fluid and thin deformable structures that provides sufficient and necessary leakproofness by preserving the path connectedness of the fluid domain. Our method employs a constrained Voronoi-based spatial partitioning combined with Godunov-style finite-volume time integration. The fluid domain is discretized into cells that conform exactly to the fluid-solid interface, allowing boundary conditions to be sharply resolved exactly at the interface. This enables direct force exchange between the fluid and solid while ensuring that no fluid leaks through the solid, even when arbitrarily thin. We validate our approach on a series of challenging scenarios -- including a balloon propelled by internal compressed air, a champagne cork ejecting after overcoming friction, and a supersonic asteroid -- demonstrating bidirectional energy transfer between fluid and solid.2025-05-29T23:04:53ZJonathan PanuelosEitan GrinspunDavid Levinhttp://arxiv.org/abs/2505.23969v1Force-Dual Modes: Subspace Design from Stochastic Forces2025-05-29T19:49:05ZDesigning subspaces for Reduced Order Modeling (ROM) is crucial for accelerating finite element simulations in graphics and engineering. Unfortunately, it's not always clear which subspace is optimal for arbitrary dynamic simulation. We propose to construct simulation subspaces from force distributions, allowing us to tailor such subspaces to common scene interactions involving constraint penalties, handles-based control, contact and musculoskeletal actuation. To achieve this we adopt a statistical perspective on Reduced Order Modelling, which allows us to push such user-designed force distributions through a linearized simulation to obtain a dual distribution on displacements. To construct our subspace, we then fit a low-rank Gaussian model to this displacement distribution, which we show generalizes Linear Modal Analysis subspaces for uncorrelated unit variance force distributions, as well as Green's Function subspaces for low rank force distributions. We show our framework allows for the construction of subspaces that are optimal both with respect to physical material properties, as well as arbitrary force distributions as observed in handle-based, contact, and musculoskeletal scene interactions.2025-05-29T19:49:05Z14 pages, 16 figuresOtman BenchekrounEitan GrinspunMaurizio ChiaramontePhilip Allen Etterhttp://arxiv.org/abs/2505.23708v1AMOR: Adaptive Character Control through Multi-Objective Reinforcement Learning2025-05-29T17:41:48ZReinforcement learning (RL) has significantly advanced the control of physics-based and robotic characters that track kinematic reference motion. However, methods typically rely on a weighted sum of conflicting reward functions, requiring extensive tuning to achieve a desired behavior. Due to the computational cost of RL, this iterative process is a tedious, time-intensive task. Furthermore, for robotics applications, the weights need to be chosen such that the policy performs well in the real world, despite inevitable sim-to-real gaps. To address these challenges, we propose a multi-objective reinforcement learning framework that trains a single policy conditioned on a set of weights, spanning the Pareto front of reward trade-offs. Within this framework, weights can be selected and tuned after training, significantly speeding up iteration time. We demonstrate how this improved workflow can be used to perform highly dynamic motions with a robot character. Moreover, we explore how weight-conditioned policies can be leveraged in hierarchical settings, using a high-level policy to dynamically select weights according to the current task. We show that the multi-objective policy encodes a diverse spectrum of behaviors, facilitating efficient adaptation to novel tasks.2025-05-29T17:41:48ZSIGGRAPH 2025Lucas N. AlegreAgon SerifiRuben GrandiaDavid MüllerEspen KnoopMoritz Bächer10.1145/3721238.3730656http://arxiv.org/abs/2505.23447v1To Measure What Isn't There -- Visual Exploration of Missingness Structures Using Quality Metrics2025-05-29T13:51:34ZThis paper contributes a set of quality metrics for identification and visual analysis of structured missingness in high-dimensional data. Missing values in data are a frequent challenge in most data generating domains and may cause a range of analysis issues. Structural missingness in data may indicate issues in data collection and pre-processing, but may also highlight important data characteristics. While research into statistical methods for dealing with missing data are mainly focusing on replacing missing values with plausible estimated values, visualization has great potential to support a more in-depth understanding of missingness structures in data. Nonetheless, while the interest in missing data visualization has increased in the last decade, it is still a relatively overlooked research topic with a comparably small number of publications, few of which address scalability issues. Efficient visual analysis approaches are needed to enable exploration of missingness structures in large and high-dimensional data, and to support informed decision-making in context of potential data quality issues. This paper suggests a set of quality metrics for identification of patterns of interest for understanding of structural missingness in data. These quality metrics can be used as guidance in visual analysis, as demonstrated through a use case exploring structural missingness in data from a real-life walking monitoring study. All supplemental materials for this paper are available at https://doi.org/10.25405/data.ncl.c.7741829.2025-05-29T13:51:34ZSubmitted to IEEE Vis2025Sara Johansson FernstadSarah AlsufyaniSilvia Del DinAlison YarnallLynn Rochesterhttp://arxiv.org/abs/2505.23301v1Quality assessment of 3D human animation: Subjective and objective evaluation2025-05-29T09:58:53ZVirtual human animations have a wide range of applications in virtual and augmented reality. While automatic generation methods of animated virtual humans have been developed, assessing their quality remains challenging. Recently, approaches introducing task-oriented evaluation metrics have been proposed, leveraging neural network training. However, quality assessment measures for animated virtual humans that are not generated with parametric body models have yet to be developed. In this context, we introduce a first such quality assessment measure leveraging a novel data-driven framework. First, we generate a dataset of virtual human animations together with their corresponding subjective realism evaluation scores collected with a user study. Second, we use the resulting dataset to learn predicting perceptual evaluation scores. Results indicate that training a linear regressor on our dataset results in a correlation of 90%, which outperforms a state of the art deep learning baseline.2025-05-29T09:58:53ZIEEE Transactions on Visualization and Computer Graphics 2025Rim RekikStefanie WuhrerLudovic HoyetKatja ZibrekAnne-Hélène Olivier10.1109/TVCG.2025.3631385http://arxiv.org/abs/2203.02252v2Parametric/direct CAD integration2025-05-29T03:48:33ZIn the history of computer-aided design (CAD), feature-based parametric modeling and boundary representation-based direct modeling are two of the most important CAD paradigms, developed respectively in the late 1980s and the late 2000s. They have complementary advantages and limitations, thereby offering huge potential for improvement towards an integrated CAD modeling scheme. Some believe that their integration will be the key characteristic of next generation CAD software. This paper provides a brief review on current parametric/direct integration approaches. Their basic ideas, advantages, and disadvantages will be discussed. The main result reads that existing integration approaches are far from being completed if seamless parametric/direct integration is desired. It is hoped that, by outlining what has already been made possible and what still remains problematic, more researchers will be attracted to work on this very important research topic of parametric/direct integration.
This paper serves as a complement to the CAD paper titled ``Variational Direct Modeling: A Framework Towards Integration of Parametric Modeling and Direct Modeling in CAD." Cite this work as follows: Qiang Zou, Hsi-Yung Feng, and Shuming Gao. Variational Direct Modeling: A Framework Towards Integration of Parametric Modeling and Direct Modeling in CAD. Computer-Aided Design 157 (2023): 103465.2022-03-04T11:49:52Z12 pages; 3 figuresComputer-Aided Design 157 (2023): 103465Qiang Zou10.1016/j.cad.2022.103465http://arxiv.org/abs/2411.18625v2Textured Gaussians for Enhanced 3D Scene Appearance Modeling2025-05-29T02:41:22Z3D Gaussian Splatting (3DGS) has recently emerged as a state-of-the-art 3D reconstruction and rendering technique due to its high-quality results and fast training and rendering time. However, pixels covered by the same Gaussian are always shaded in the same color up to a Gaussian falloff scaling factor. Furthermore, the finest geometric detail any individual Gaussian can represent is a simple ellipsoid. These properties of 3DGS greatly limit the expressivity of individual Gaussian primitives. To address these issues, we draw inspiration from texture and alpha mapping in traditional graphics and integrate it with 3DGS. Specifically, we propose a new generalized Gaussian appearance representation that augments each Gaussian with alpha~(A), RGB, or RGBA texture maps to model spatially varying color and opacity across the extent of each Gaussian. As such, each Gaussian can represent a richer set of texture patterns and geometric structures, instead of just a single color and ellipsoid as in naive Gaussian Splatting. Surprisingly, we found that the expressivity of Gaussians can be greatly improved by using alpha-only texture maps, and further augmenting Gaussians with RGB texture maps achieves the highest expressivity. We validate our method on a wide variety of standard benchmark datasets and our own custom captures at both the object and scene levels. We demonstrate image quality improvements over existing methods while using a similar or lower number of Gaussians.2024-11-27T18:59:59ZWill be presented at CVPR 2025. Project website: https://textured-gaussians.github.io/Brian ChaoHung-Yu TsengLorenzo PorziChen GaoTuotuo LiQinbo LiAyush SarafJia-Bin HuangJohannes KopfGordon WetzsteinChangil Kimhttp://arxiv.org/abs/2505.22489v1Cascaded 3D Diffusion Models for Whole-body 3D 18-F FDG PET/CT synthesis from Demographics2025-05-28T15:38:33ZWe propose a cascaded 3D diffusion model framework to synthesize high-fidelity 3D PET/CT volumes directly from demographic variables, addressing the growing need for realistic digital twins in oncologic imaging, virtual trials, and AI-driven data augmentation. Unlike deterministic phantoms, which rely on predefined anatomical and metabolic templates, our method employs a two-stage generative process. An initial score-based diffusion model synthesizes low-resolution PET/CT volumes from demographic variables alone, providing global anatomical structures and approximate metabolic activity. This is followed by a super-resolution residual diffusion model that refines spatial resolution. Our framework was trained on 18-F FDG PET/CT scans from the AutoPET dataset and evaluated using organ-wise volume and standardized uptake value (SUV) distributions, comparing synthetic and real data between demographic subgroups. The organ-wise comparison demonstrated strong concordance between synthetic and real images. In particular, most deviations in metabolic uptake values remained within 3-5% of the ground truth in subgroup analysis. These findings highlight the potential of cascaded 3D diffusion models to generate anatomically and metabolically accurate PET/CT images, offering a robust alternative to traditional phantoms and enabling scalable, population-informed synthetic imaging for clinical and research applications.2025-05-28T15:38:33ZMICCAI2025 Submitted versionSiyeop YoonSifan SongPengfei JinMatthew TivnanYujin OhSekeun KimDufan WuXiang LiQuanzheng Lihttp://arxiv.org/abs/2402.00719v3Geometric Contact Potential2025-05-28T15:08:19ZBarrier potentials gained popularity as a means for robust contact handling in physical modeling and for modeling self-avoiding shapes. The key to the success of these approaches is adherence to geometric constraints, i.e., avoiding intersections, which are the cause of most robustness problems in complex deformation simulation with contact. However, existing barrier-potential methods may lead to spurious forces and imperfect satisfaction of the geometric constraints. They may have strong resolution dependence, requiring careful adaptation of the potential parameters to the object discretizations. We present a systematic derivation of a continuum potential defined for smooth and piecewise smooth surfaces, starting from identifying a set of natural requirements for contact potentials, including the barrier property, locality, differentiable dependence on shape, and absence of forces in rest configurations. Our potential is formulated independently of surface discretization and addresses the shortcomings of existing potential-based methods while retaining their advantages. We present a discretization of our potential that is a drop-in replacement for the potential used in the Incremental Potential Contact (IPC) formulation, and compare its behavior to other potential formulations, demonstrating that it has the expected behavior. The presented formulation connects existing barrier approaches, as all recent existing methods can be viewed as a variation of the presented potential, and lays a foundation for developing alternative (e.g., higher-order) versions.2024-02-01T16:12:15ZZizhou HuangMax PaikZachary FergusonDaniele PanozzoDenis Zorin10.1145/3731142http://arxiv.org/abs/2505.22416v1Neural Face Skinning for Mesh-agnostic Facial Expression Cloning2025-05-28T14:43:43ZAccurately retargeting facial expressions to a face mesh while enabling manipulation is a key challenge in facial animation retargeting. Recent deep-learning methods address this by encoding facial expressions into a global latent code, but they often fail to capture fine-grained details in local regions. While some methods improve local accuracy by transferring deformations locally, this often complicates overall control of the facial expression. To address this, we propose a method that combines the strengths of both global and local deformation models. Our approach enables intuitive control and detailed expression cloning across diverse face meshes, regardless of their underlying structures. The core idea is to localize the influence of the global latent code on the target mesh. Our model learns to predict skinning weights for each vertex of the target face mesh through indirect supervision from predefined segmentation labels. These predicted weights localize the global latent code, enabling precise and region-specific deformations even for meshes with unseen shapes. We supervise the latent code using Facial Action Coding System (FACS)-based blendshapes to ensure interpretability and allow straightforward editing of the generated animation. Through extensive experiments, we demonstrate improved performance over state-of-the-art methods in terms of expression fidelity, deformation transfer accuracy, and adaptability across diverse mesh structures.2025-05-28T14:43:43ZSihun ChaSerin YoonKwanggyoon SeoJunyong Noh