https://arxiv.org/api/2Na90T1W6J+Nca5yH2Ts8qV1wpE 2026-06-28T11:01:07Z 9390 1845 15 http://arxiv.org/abs/2510.15876v1 Adaptive Frameless Rendering 2025-07-28T21:49:13Z

We propose an adaptive form of frameless rendering with the potential to dramatically increase rendering speed over conventional interactive rendering approaches. Without the rigid sampling patterns of framed renderers, sampling and reconstruction can adapt with very fine granularity to spatio-temporal color change. A sampler uses closed-loop feedback to guide sampling toward edges or motion in the image. Temporally deep buffers store all the samples created over a short time interval for use in reconstruction and as sampler feedback. GPU-based reconstruction responds both to sampling density and space-time color gradients. Where the displayed scene is static, spatial color change dominates and older samples are given significant weight in reconstruction, resulting in sharper and eventually antialiased images. Where the scene is dynamic, more recent samples are emphasized, resulting in less sharp but more up-to-date images. We also use sample reprojection to improve reconstruction and guide sampling toward occlusion edges, undersampled regions, and specular highlights. In simulation our frameless renderer requires an order of magnitude fewer samples than traditional rendering of similar visual quality (as measured by RMS error), while introducing overhead amounting to 15% of computation time.

2025-07-28T21:49:13Z Proc. Eurographics Symposium on Rendering (Konstanz, June), 265-275. 2005 Abhinav Dayal Cliff Woolley Benjamin Watson David Luebke 10.2312/EGWR/EGSR05/265-275 http://arxiv.org/abs/2507.21311v1 VoluMe -- Authentic 3D Video Calls from Live Gaussian Splat Prediction 2025-07-28T20:07:55Z

Virtual 3D meetings offer the potential to enhance copresence, increase engagement and thus improve effectiveness of remote meetings compared to standard 2D video calls. However, representing people in 3D meetings remains a challenge; existing solutions achieve high quality by using complex hardware, making use of fixed appearance via enrolment, or by inverting a pre-trained generative model. These approaches lead to constraints that are unwelcome and ill-fitting for videoconferencing applications. We present the first method to predict 3D Gaussian reconstructions in real time from a single 2D webcam feed, where the 3D representation is not only live and realistic, but also authentic to the input video. By conditioning the 3D representation on each video frame independently, our reconstruction faithfully recreates the input video from the captured viewpoint (a property we call authenticity), while generalizing realistically to novel viewpoints. Additionally, we introduce a stability loss to obtain reconstructions that are temporally stable on video sequences. We show that our method delivers state-of-the-art accuracy in visual quality and stability metrics compared to existing methods, and demonstrate our approach in live one-to-one 3D meetings using only a standard 2D camera and display. This demonstrates that our approach can allow anyone to communicate volumetrically, via a method for 3D videoconferencing that is not only highly accessible, but also realistic and authentic.

2025-07-28T20:07:55Z Martin de La Gorce Charlie Hewitt Tibor Takacs Robert Gerdisch Zafiirah Hosenie Givi Meishvili Marek Kowalski Thomas J. Cashman Antonio Criminisi http://arxiv.org/abs/2507.20922v1 Methodology for intelligent injection point location based on geometric algorithms and discrete topologies for virtual digital twin environments 2025-07-28T15:25:30Z

This article presents an innovative methodology for locating injection points in injection-molded parts using intelligent models with geometric algorithms for discrete topologies. The first algorithm calculates the center of mass of the discrete model based on the center of mass of each triangular facet in the system, ensuring uniform molten plastic distribution during mold cavity filling. Two sub-algorithms intelligently evaluate the geometry and optimal injection point location. The first sub-algorithm generates a geometric matrix based on a two-dimensional nodal quadrature adapted to the part's bounding box. The second sub-algorithm projects the nodal matrix and associated circular areas orthogonally on the part's surface along the demolding direction. The optimal injection point location is determined by minimizing the distance to the center of mass from the first algorithm's result. This novel methodology has been validated through rheological simulations in six case studies with complex geometries. The results demonstrate uniform and homogeneous molten plastic distribution with minimal pressure loss during the filling phase. Importantly, this methodology does not require expert intervention, reducing time and costs associated with manual injection mold feed system design. It is also adaptable to various design environments and virtual twin systems, not tied to specific CAD software. The validated results surpass the state of the art, offering an agile alternative for digital twin applications in new product design environments, reducing dependence on experts, facilitating designer training, and ultimately cutting costs

2025-07-28T15:25:30Z Mercado-Colmenero, J. M., Torres-Alba, A., Martin-Donate, C. (2024). DYNA, 99(1), 44-50 J. Mercado Colmenero A. Torres Alba C. Martin Donate 10.6036/11004 http://arxiv.org/abs/2510.15875v1 A virtual airplane for fear of flying therapy 2025-07-28T02:15:06Z

Fear of flying is a serious problem that affects millions of individuals. Exposure therapy for fear of flying is an effective therapy technique. However, exposure therapy is also expensive, logistically difficult to arrange, and presents significant problems of patient confidentiality and potential embarrassment. We have developed a virtual airplane for use in fear of flying therapy. Using the virtual airplane for exposure therapy is a potential solution to many of the current problems of fear of flying exposure therapy. We describe the design of the virtual airplane and present a case report on its use for fear of flying exposure therapy.

2025-07-28T02:15:06Z Proceedings of the IEEE 1996 Virtual Reality Annual International Symposium Pages 86-93 Larry F Hodges Barbara O Rothbaum Benjamin Watson G Drew Kessler Dan Opdyke 10.1109/VRAIS.1996.490515 http://arxiv.org/abs/2506.08161v2 GATE: Geometry-Aware Trained Encoding 2025-07-27T09:58:25Z

The encoding of input parameters is one of the fundamental building blocks of neural network algorithms. Its goal is to map the input data to a higher-dimensional space, typically supported by trained feature vectors. The mapping is crucial for the efficiency and approximation quality of neural networks. We propose a novel geometry-aware encoding called GATE that stores feature vectors on the surface of triangular meshes. Our encoding is suitable for neural rendering-related algorithms, for example, neural radiance caching. It also avoids limitations of previous hash-based encoding schemes, such as hash collisions, selection of resolution versus scene size, and divergent memory access. Our approach decouples feature vector density from geometry density using mesh colors, while allowing for finer control over neural network training and adaptive level-of-detail.

2025-06-09T19:13:16Z Jakub Bokšanský Daniel Meister Carsten Benthin http://arxiv.org/abs/2507.20200v1 Neural Shell Texture Splatting: More Details and Fewer Primitives 2025-07-27T09:39:10Z

Gaussian splatting techniques have shown promising results in novel view synthesis, achieving high fidelity and efficiency. However, their high reconstruction quality comes at the cost of requiring a large number of primitives. We identify this issue as stemming from the entanglement of geometry and appearance in Gaussian Splatting. To address this, we introduce a neural shell texture, a global representation that encodes texture information around the surface. We use Gaussian primitives as both a geometric representation and texture field samplers, efficiently splatting texture features into image space. Our evaluation demonstrates that this disentanglement enables high parameter efficiency, fine texture detail reconstruction, and easy textured mesh extraction, all while using significantly fewer primitives.

2025-07-27T09:39:10Z Xin Zhang Anpei Chen Jincheng Xiong Pinxuan Dai Yujun Shen Weiwei Xu http://arxiv.org/abs/2507.20127v1 Aggregation-aware MLP: An Unsupervised Approach for Graph Message-passing 2025-07-27T04:52:55Z

Graph Neural Networks (GNNs) have become a dominant approach to learning graph representations, primarily because of their message-passing mechanisms. However, GNNs typically adopt a fixed aggregator function such as Mean, Max, or Sum without principled reasoning behind the selection. This rigidity, especially in the presence of heterophily, often leads to poor, problem dependent performance. Although some attempts address this by designing more sophisticated aggregation functions, these methods tend to rely heavily on labeled data, which is often scarce in real-world tasks. In this work, we propose a novel unsupervised framework, "Aggregation-aware Multilayer Perceptron" (AMLP), which shifts the paradigm from directly crafting aggregation functions to making MLP adaptive to aggregation. Our lightweight approach consists of two key steps: First, we utilize a graph reconstruction method that facilitates high-order grouping effects, and second, we employ a single-layer network to encode varying degrees of heterophily, thereby improving the capacity and applicability of the model. Extensive experiments on node clustering and classification demonstrate the superior performance of AMLP, highlighting its potential for diverse graph learning scenarios.

2025-07-27T04:52:55Z 11 pages, 6 figures Xuanting Xie Bingheng Li Erlin Pan Zhao Kang Wenyu Chen http://arxiv.org/abs/2505.17860v2 Multi-Person Interaction Generation from Two-Person Motion Priors 2025-07-26T22:01:05Z

Generating realistic human motion with high-level controls is a crucial task for social understanding, robotics, and animation. With high-quality MOCAP data becoming more available recently, a wide range of data-driven approaches have been presented. However, modelling multi-person interactions still remains a less explored area. In this paper, we present Graph-driven Interaction Sampling, a method that can generate realistic and diverse multi-person interactions by leveraging existing two-person motion diffusion models as motion priors. Instead of training a new model specific to multi-person interaction synthesis, our key insight is to spatially and temporally separate complex multi-person interactions into a graph structure of two-person interactions, which we name the Pairwise Interaction Graph. We thus decompose the generation task into simultaneous single-person motion generation conditioned on one other's motion. In addition, to reduce artifacts such as interpenetrations of body parts in generated multi-person interactions, we introduce two graph-dependent guidance terms into the diffusion sampling scheme. Unlike previous work, our method can produce various high-quality multi-person interactions without having repetitive individual motions. Extensive experiments demonstrate that our approach consistently outperforms existing methods in reducing artifacts when generating a wide range of two-person and multi-person interactions.

2025-05-23T13:13:00Z SIGGRAPH 2025 Conference Papers, project page at http://wenningxu.github.io/multicharacter/ Wenning Xu Shiyu Fan Paul Henderson Edmond S. L. Ho http://arxiv.org/abs/2507.21181v1 Mitigation of Social Media Platforms Impact on the Users 2025-07-26T18:51:32Z

Social media platforms offer numerous benefits and allow people to come together for various causes. Many communities, academia, government agencies, institutions, healthcare, entertainment, and businesses are on social media platforms. They are intuitive and free for users. It has become unimaginable to live without social media. Their architecture and data handling are geared towards scalability, uninterrupted availability, and both personal and collaborative revenue generation. Primarily, artificial intelligence algorithms are employed on stored user data for optimization and feeds. This has the potential to impact user safety, privacy, and security, even when metadata is used. A new decentralized data arrangement framework based on the Fractal-tree and L-Systems algorithm is proposed to mitigate some of the impacts of social media platforms. Future work will focus on demonstrating the effectiveness of the new decentralized framework by comparing its results against state-of-the-art security methods currently used in databases. A cryptographic algorithm could also be implemented for the framework, employing a new key generation for each branch. This will strengthen database security; for example, if a user key is leaked, regenerating the key for each branch will keep the data secure by applying defense mechanisms in the proposed L-System-based tree framework.

2025-07-26T18:51:32Z WSCG 2025 33. International Conference on Computer Graphics, Visualization and Computer Vision 2025 33. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision WSCG 2025 Proceedings Smita Khapre Sudhanshu Semwal 10.24132/CSRN.2025-15 http://arxiv.org/abs/2507.19988v1 Visual Analytics Using Tensor Unified Linear Comparative Analysis 2025-07-26T15:54:12Z

Comparing tensors and identifying their (dis)similar structures is fundamental in understanding the underlying phenomena for complex data. Tensor decomposition methods help analysts extract tensors' essential characteristics and aid in visual analytics for tensors. In contrast to dimensionality reduction (DR) methods designed only for analyzing a matrix (i.e., second-order tensor), existing tensor decomposition methods do not support flexible comparative analysis. To address this analysis limitation, we introduce a new tensor decomposition method, named tensor unified linear comparative analysis (TULCA), by extending its DR counterpart, ULCA, for tensor analysis. TULCA integrates discriminant analysis and contrastive learning schemes for tensor decomposition, enabling flexible comparison of tensors. We also introduce an effective method to visualize a core tensor extracted from TULCA into a set of 2D visualizations. We integrate TULCA's functionalities into a visual analytics interface to support analysts in interpreting and refining the TULCA results. We demonstrate the efficacy of TULCA and the visual analytics interface with computational evaluations and two case studies, including an analysis of log data collected from a supercomputer.

2025-07-26T15:54:12Z To appear in IEEE Transactions on Visualization and Computer Graphics and IEEE VIS 2025 Naoki Okami Kazuki Miyake Naohisa Sakamoto Jorji Nonaka Takanori Fujiwara http://arxiv.org/abs/2507.19836v1 ChoreoMuse: Robust Music-to-Dance Video Generation with Style Transfer and Beat-Adherent Motion 2025-07-26T07:17:50Z

Modern artistic productions increasingly demand automated choreography generation that adapts to diverse musical styles and individual dancer characteristics. Existing approaches often fail to produce high-quality dance videos that harmonize with both musical rhythm and user-defined choreography styles, limiting their applicability in real-world creative contexts. To address this gap, we introduce ChoreoMuse, a diffusion-based framework that uses SMPL format parameters and their variation version as intermediaries between music and video generation, thereby overcoming the usual constraints imposed by video resolution. Critically, ChoreoMuse supports style-controllable, high-fidelity dance video generation across diverse musical genres and individual dancer characteristics, including the flexibility to handle any reference individual at any resolution. Our method employs a novel music encoder MotionTune to capture motion cues from audio, ensuring that the generated choreography closely follows the beat and expressive qualities of the input music. To quantitatively evaluate how well the generated dances match both musical and choreographic styles, we introduce two new metrics that measure alignment with the intended stylistic cues. Extensive experiments confirm that ChoreoMuse achieves state-of-the-art performance across multiple dimensions, including video quality, beat alignment, dance diversity, and style adherence, demonstrating its potential as a robust solution for a wide range of creative applications. Video results can be found on our project page: https://choreomuse.github.io.

2025-07-26T07:17:50Z 10 pages, 5 figures, accepted by the 33rd ACM International Conference on Multimedia (ACM MM 2025), demo page: https://choreomuse.github.io Xuanchen Wang Heng Wang Weidong Cai http://arxiv.org/abs/2503.15557v2 Motion Synthesis with Sparse and Flexible Keyjoint Control 2025-07-25T04:45:05Z

Creating expressive character animations is labor-intensive, requiring intricate manual adjustment of animators across space and time. Previous works on controllable motion generation often rely on a predefined set of dense spatio-temporal specifications (e.g., dense pelvis trajectories with exact per-frame timing), limiting practicality for animators. To process high-level intent and intuitive control in diverse scenarios, we propose a practical controllable motions synthesis framework that respects sparse and flexible keyjoint signals. Our approach employs a decomposed diffusion-based motion synthesis framework that first synthesizes keyjoint movements from sparse input control signals and then synthesizes full-body motion based on the completed keyjoint trajectories. The low-dimensional keyjoint movements can easily adapt to various control signal types, such as end-effector position for diverse goal-driven motion synthesis, or incorporate functional constraints on a subset of keyjoints. Additionally, we introduce a time-agnostic control formulation, eliminating the need for frame-specific timing annotations and enhancing control flexibility. Then, the shared second stage can synthesize a natural whole-body motion that precisely satisfies the task requirement from dense keyjoint movements. We demonstrate the effectiveness of sparse and flexible keyjoint control through comprehensive experiments on diverse datasets and scenarios.

2025-03-18T21:21:15Z Accepted to ICCV 2025. Project Page: http://inwoohwang.me/SFControl Inwoo Hwang Jinseok Bae Donggeun Lim Young Min Kim http://arxiv.org/abs/2507.18899v1 Procedural city modeling 2025-07-25T02:46:34Z

We propose a method to procedurally generate a familiar yet complex human artifact: the city. We are not trying to reproduce existing cities, but to generate artificial cities that are convincing and plausible by capturing developmental behavior. In addition, our results are meant to build upon themselves, such that they ought to look compelling at any point along the transition from village to metropolis. Our approach largely focuses upon land usage and building distribution for creating realistic city environments, whereas previous attempts at city modeling have mainly focused on populating road networks. Finally, we want our model to be self automated to the point that the only necessary input is a terrain description, but other high-level and low-level parameters can be specified to support artistic contributions. With the aid of agent based simulation we are generating a system of agents and behaviors that interact with one another through their effects upon a simulated environment. Our philosophy is that as each agent follows a simple behavioral rule set, a more complex behavior will tend to emerge out of the interactions between the agents and their differing rule sets. By confining our model to a set of simple rules for each class of agents, we hope to make our model extendible not only in regard to the types of structures that are produced, but also in describing the social and cultural influences prevalent in all cities

2025-07-25T02:46:34Z 1st Midwestern Graphics Conference 2003 Thomas Lechner Ben Watson Uri Wilensky Martin Felsen http://arxiv.org/abs/2507.06646v2 Assessing Learned Models for Phase-only Hologram Compression 2025-07-24T13:01:37Z

We evaluate the performance of four common learned models utilizing INR and VAE structures for compressing phase-only holograms in holographic displays. The evaluated models include a vanilla MLP, SIREN, and FilmSIREN, with TAESD as the representative VAE model. Our experiments reveal that a pretrained image VAE, TAESD, with 2.2M parameters struggles with phase-only hologram compression, revealing the need for task-specific adaptations. Among the INRs, SIREN with 4.9k parameters achieves %40 compression with high quality in the reconstructed 3D images (PSNR = 34.54 dB). These results emphasize the effectiveness of INRs and identify the limitations of pretrained image compression VAEs for hologram compression task.

2025-07-09T08:19:44Z SIGGRAPH 2025 Poster Zicong Peng Yicheng Zhan Josef Spjut Kaan Akşit 10.1145/3721250.3742993 http://arxiv.org/abs/2507.18231v1 PS-GS: Gaussian Splatting for Multi-View Photometric Stereo 2025-07-24T09:22:02Z

Integrating inverse rendering with multi-view photometric stereo (MVPS) yields more accurate 3D reconstructions than the inverse rendering approaches that rely on fixed environment illumination. However, efficient inverse rendering with MVPS remains challenging. To fill this gap, we introduce the Gaussian Splatting for Multi-view Photometric Stereo (PS-GS), which efficiently and jointly estimates the geometry, materials, and lighting of the object that is illuminated by diverse directional lights (multi-light). Our method first reconstructs a standard 2D Gaussian splatting model as the initial geometry. Based on the initialization model, it then proceeds with the deferred inverse rendering by the full rendering equation containing a lighting-computing multi-layer perceptron. During the whole optimization, we regularize the rendered normal maps by the uncalibrated photometric stereo estimated normals. We also propose the 2D Gaussian ray-tracing for single directional light to refine the incident lighting. The regularizations and the use of multi-view and multi-light images mitigate the ill-posed problem of inverse rendering. After optimization, the reconstructed object can be used for novel-view synthesis, relighting, and material and shape editing. Experiments on both synthetic and real datasets demonstrate that our method outperforms prior works in terms of reconstruction accuracy and computational efficiency.

2025-07-24T09:22:02Z Yixiao Chen Bin Liang Hanzhi Guo Yongqing Cheng Jiayi Zhao Dongdong Weng