https://arxiv.org/api/8JbtPzx9egsrhFx6L2dbJwqiMeY 2026-06-17T13:48:06Z 8101 570 15 http://arxiv.org/abs/2406.17952v2 LINSCAN -- A Linearity Based Clustering Algorithm 2025-11-25T03:05:06Z

DBSCAN and OPTICS are powerful algorithms for identifying clusters of points in domains where few assumptions can be made about the structure of the data. In this paper, we leverage these strengths and introduce a new algorithm, LINSCAN, designed to seek lineated clusters that are difficult to find and isolate with existing methods. In particular, by embedding points as normal distributions approximating their local neighborhoods and leveraging a distance function derived from the Kullback Leibler Divergence, LINSCAN can detect and distinguish lineated clusters that are spatially close but have orthogonal covariances. We demonstrate how LINSCAN can be applied to seismic data to identify active faults, including intersecting faults, and determine their orientation. Finally, we discuss the properties a generalization of DBSCAN and OPTICS must have in order to retain the stability benefits of these algorithms.

2024-06-25T21:58:37Z Proc. Mach. Learn. Res. 321:269-286 (2026) Andrew Dennehy Xiaoyu Zou Shabnam J. Semnani Yuri Fialko Alexander Cloninger http://arxiv.org/abs/2511.21754v1 Arc Spline Approximation of Envelopes of Evolving Planar Domains 2025-11-24T10:25:47Z

Computing the envelope of deforming planar domains is a significant and challenging problem with a wide range of potential applications. We approximate the envelope using circular arc splines, curves that balance geometric flexibility and computational simplicity. Our approach combines two concepts to achieve these benefits. First, we represent a planar domain by its medial axis transform (MAT), which is a geometric graph in Minkowski space $\mathbb R^{2,1}$ (possibly with degenerate branches). We observe that circular arcs in the Minkowski space correspond to MATs of arc spline domains. Furthermore, as a planar domain evolves over time, each branch of its MAT evolves and forms a surface in the Minkowski space. This allows us to reformulate the problem of envelope computation as a problem of computing cyclographic images of finite sets of curves on these surfaces. We propose and compare two pairs of methods for approximating the curves and boundaries of their cyclographic images. All of these methods result in an arc spline approximation of the envelope of the evolving domain. Second, we exploit the geometric flexibility of circular arcs in both the plane and Minkowski space to achieve a high approximation rate. The computational simplicity ensures the efficient trimming of redundant branches of the generated envelope using a sweep line algorithm with optimal computational complexity.

2025-11-24T10:25:47Z 31 pages (+ references), 19 figures Jana Vráblíková Bert Jüttler http://arxiv.org/abs/2507.23105v2 The Squishy Grid Problem 2025-11-23T01:13:43Z

In this paper we consider the problem of approximating Euclidean distances by the infinite integer grid graph. Although the topology of the graph is fixed, we have control over the edge-weight assignment $w:E\to \mathbb{R}_{\ge 0}$, and hope to have grid distances be asymptotically isometric to Euclidean distances, that is, for all grid points $u,v$, $\mathrm{dist}_w(u,v) = (1\pm o(1))\|u-v\|_2$. We give three methods for solving this problem, each attractive in its own way. * Our first construction is based on an embedding of the recursive, non-periodic pinwheel tiling of Radin and Conway into the integer grid. Distances in the pinwheel graph are asymptotically isometric to Euclidean distances, but no explicit bound on the rate of convergence was known. We prove that the multiplicative distortion of the pinwheel graph is $(1+1/Θ(\log^ξ\log D))$, where $D$ is the Euclidean distance and $ξ=Θ(1)$. The pinwheel tiling approach is conceptually simple, but can be improved quantitatively. * Our second construction is based on a hierarchical arrangement of "highways." It is simple, achieving stretch $(1 + 1/Θ(D^{1/9}))$, which converges doubly exponentially faster than the pinwheel tiling approach. * The first two methods are deterministic. An even simpler approach is to sample the edge weights independently from a common distribution $\mathscr{D}$. Whether there exists a distribution $\mathscr{D}^*$ that makes grid distances Euclidean, asymptotically and in expectation, is major open problem in the theory of first passage percolation. Previous experiments show that when $\mathscr{D}$ is a Fisher distribution, grid distances are within 1\% of Euclidean. We demonstrate experimentally that this level of accuracy can be achieved by a simple 2-point distribution that assigns weights 0.41 or 4.75 with probability 44\% and 56\%, respectively.

2025-07-30T21:11:22Z Zixi Cai Kuowen Chen Shengquan Du Arnold Filtser Seth Pettie Daniel Skora http://arxiv.org/abs/2305.07570v2 Feature-aware manifold meshing and remeshing of point clouds and polyhedral surfaces with guaranteed smallest edge length 2025-11-21T11:38:59Z

Point clouds and polygonal meshes are widely used when modeling real-world scenarios. Here, point clouds arise, for instance, from acquisition processes applied in various surroundings, such as reverse engineering, rapid prototyping, or cultural preservation. Based on these raw data, polygonal meshes are created to, for example, run various simulations. For such applications, the utilized meshes must be of high quality. This paper presents an algorithm to derive triangle meshes from unstructured point clouds. The occurring edges have a close to uniform length and their lengths are bounded from below. Theoretical results guarantee the output to be manifold, provided suitable input and parameter choices. Further, the paper presents several experiments establishing that the algorithms can compete with widely used competitors in terms of quality of the output and timing and the output is stable under moderate levels of noise. Additionally, we expand the algorithm to detect and respect features on point clouds as well as to remesh polyhedral surfaces, possibly with features. Supplementary material, an extended preprint, a link to a previously published version of the article, utilized models, and implementation details are made available online: https://ms-math-computer.science/projects/guaranteed-smallest-edge-length-manifold-meshing.html

2023-05-12T15:57:28Z Proceedings of the 2024 International Meshing Roundtable (IMR); Special Issue of Computer Aided Geometric Design Henriette Lipschütz Ulrich Reitebuch Konrad Polthier Martin Skrodzki 10.1137/1.9781611978001.1 10.1016/j.cad.2025.104010 http://arxiv.org/abs/2511.16938v1 Edge-ANN: Storage-Efficient Edge-Based Remote Sensing Feature Retrieval 2025-11-21T04:24:26Z

Meeting real-time constraints for high-performance Approximate Nearest Neighbor (ANN) search remains a critical challenge in remote sensing edge devices, which are essentially fusion systems like micro-satellites and UAVs, largely due to stringent limitations in primary (RAM) and secondary (disk) storage. To address this challenge, we propose Edge-ANN, an innovative ANN framework specifically engineered for storage efficiency. The core innovation of Edge-ANN lies in its departure from traditional tree-based methods that store high-dimensional hyperplanes. Instead, it leverages pairs of existing data items, termed "anchors," to implicitly define spatial partitions. To ensure these partitions are both balanced and effective, we have developed a novel Binary Anchor Optimization algorithm.This architectural shift eliminates the dimension-dependence of the space complexity. Rigorous experiments on three multi-source datasets, MillionAID, High-resolution Urban Complex Dataset, and GlobalUrbanNet Dataset, demonstrate that under simulated edge environments with dual storage constraints, Edge-ANN achieves a 30-40% reduction in secondary storage compared to the baseline, at the cost of a minor 3-5% drop in retrieval accuracy. Furthermore, its overall retrieval performance surpasses that of other mainstream methods in these constrained scenarios. Collectively, these results establish Edge-ANN as a state-of-the-art solution for enabling large-scale, high-performance, real-time remote sensing feature retrieval on edge devices with exceptionally constrained storage. The codes of Edge-ANN are available at https://github.com/huaijiao666/Edge-ANN.

2025-11-21T04:24:26Z Xianwei Lv Debin Tang Zhecheng Shi Wang Wang Yujiao Zheng Xiatian Zhu http://arxiv.org/abs/2511.16877v1 Efficient Algorithms and Implementations for Extracting Maximum-Size $(k,\ell)$-Sparse Subgraphs 2025-11-21T01:01:23Z

A multigraph $G = (V, E)$ is $(k, \ell)$-sparse if every subset $X \subseteq V$ induces at most $\max\{k|X| - \ell, 0\}$ edges. Finding a maximum-size $(k, \ell)$-sparse subgraph is a classical problem in rigidity theory and combinatorial optimization, with known polynomial-time algorithms. This paper presents a highly efficient and flexible implementation of an augmenting path method, enhanced with a range of powerful practical heuristics that significantly reduce running time while preserving optimality. These heuristics $\unicode{x2013}$ including edge-ordering, node-ordering, two-phase strategies, and pseudoforest-based initialization $\unicode{x2013}$ steer the algorithm toward accepting more edges early in the execution and avoiding costly augmentations. A comprehensive experimental evaluation on both synthetic and real-world graphs demonstrates that our implementation outperforms existing tools by several orders of magnitude. We also propose an asymptotically faster algorithm for extracting an inclusion-wise maximal $(k,2k)$-sparse subgraph with the sparsity condition required only for node sets of size at least three, which is particularly relevant to 3D rigidity when $k = 3$. We provide a carefully engineered implementation, which is publicly available online and is proposed for inclusion in the LEMON graph library.

2025-11-21T01:01:23Z Péter Madarasi http://arxiv.org/abs/2511.16812v1 Minimizing Vertical Length in Linked Bar Charts 2025-11-20T21:30:47Z

A linked bar chart is the augmentation of a traditional bar chart where each bar is partitioned into blocks and pairs of blocks are linked using orthogonal lines that pass over intermediate bars. The order of the blocks readily influences the legibility of the links. We study the algorithmic problem of minimizing the vertical length of these links, for a fixed bar order. The main challenge lies with ``dependent'' links, whose vertical link length cannot be optimized independently per bar. We show that, if the dependent links form a forest, the problem can be solved in $O(nm)$ time, for n bars and m links. If the dependent links between non-adjacent bars form a forest, the problem admits an $O(n^4m)$-time algorithm. Finally, we show that the general case is fixed-parameter tractable in the maximum number of links that are connected to one bar.

2025-11-20T21:30:47Z Steven van den Broek Marc van Kreveld Wouter Meulemans Arjen Simons http://arxiv.org/abs/2511.12564v2 Linear time small coresets for k-mean clustering of segments with applications 2025-11-20T08:55:11Z

We study the $k$-means problem for a set $\mathcal{S} \subseteq \mathbb{R}^d$ of $n$ segments, aiming to find $k$ centers $X \subseteq \mathbb{R}^d$ that minimize $D(\mathcal{S},X) := \sum_{S \in \mathcal{S}} \min_{x \in X} D(S,x)$, where $D(S,x) := \int_{p \in S} |p - x| dp$ measures the total distance from each point along a segment to a center. Variants of this problem include handling outliers, employing alternative distance functions such as M-estimators, weighting distances to achieve balanced clustering, or enforcing unique cluster assignments. For any $\varepsilon > 0$, an $\varepsilon$-coreset is a weighted subset $C \subseteq \mathbb{R}^d$ that approximates $D(\mathcal{S},X)$ within a factor of $1 \pm \varepsilon$ for any set of $k$ centers, enabling efficient streaming, distributed, or parallel computation. We propose the first coreset construction that provably handles arbitrary input segments. For constant $k$ and $\varepsilon$, it produces a coreset of size $O(\log^2 n)$ computable in $O(nd)$ time. Experiments, including a real-time video tracking application, demonstrate substantial speedups with minimal loss in clustering accuracy, confirming both the practical efficiency and theoretical guarantees of our method.

2025-11-16T11:48:55Z First published in WALCOM 2026 by Springer Nature David Denisov Shlomi Dolev Dan Felmdan Michael Segal http://arxiv.org/abs/2506.18818v2 How Hard is it to be a Star? Convex Geometry and the Real Hierarchy 2025-11-19T15:59:11Z

A set is star-shaped if there is a point in the set that can see every other point in the set in the sense that the line-segment connecting the points lies within the set. We show that testing whether a non-empty compact smooth region is star-shaped is $\forall\mathbb{R}$-complete. Since the obvious definition of star-shapedness has logical form $\exists\forall$, this is a somewhat surprising result, based on Krasnosel'skiĭ's theorem from convex geometry; we study several related complexity classifications in the real hierarchy based on other results from convex geometry.

2025-06-23T16:27:29Z Marcus Schaefer Daniel Štefankovič http://arxiv.org/abs/2511.14975v1 A Dichotomy for 1-Planarity with Restricted Crossing Types Parameterized by Treewidth 2025-11-18T23:44:35Z

A drawing of a graph is 1-planar if each edge participates in at most one crossing and adjacent edges do not cross. Up to symmetry, each crossing in a 1-planar drawing belongs to one out of six possible crossing types, where a type characterizes the subgraph induced by the four vertices of the crossing edges. Each of the 63 possible nonempty subsets $\mathcal{S}$ of crossing types gives a recognition problem: does a given graph admit an $\mathcal{S}$-restricted drawing, that is, a 1-planar drawing where the crossing type of each crossing is in $\mathcal{S}$? We show that there is a set $\mathcal{S}_{\rm bad}$ with three crossing types and the following properties: If $\mathcal{S}$ contains no crossing type from $\mathcal{S}_{\rm bad}$, then the recognition of graphs that admit an $\mathcal{S}$-restricted drawing is fixed-parameter tractable with respect to the treewidth of the input graph. If $\mathcal{S}$ contains any crossing type from $\mathcal{S}_{\rm bad}$, then it is NP-hard to decide whether a graph has an $\mathcal{S}$-restricted drawing, even when considering graphs of constant pathwidth. We also extend this characterization of crossing types to 1-planar straight-line drawings and show the same complexity behaviour parameterized by treewidth.

2025-11-18T23:44:35Z 35 pages, 17 figures; preliminary version to be presented at ISAAC 2025 Sergio Cabello Alexander Dobler Gašper Fijavž Thekla Hamm Mirko H. Wagner http://arxiv.org/abs/2511.14728v1 Automated proving in planar geometry based on the complex number identity method and elimination 2025-11-18T18:20:17Z

We improve the complex number identity proving method to a fully automated procedure, based on elimination ideals. By using declarative equations or rewriting each real-relational hypothesis $h_i$ to $h_i-r_i$, and the thesis $t$ to $t-r$, clearing the denominators and introducing an extra expression with a slack variable, we eliminate all free and relational point variables. From the obtained ideal $I$ in $\mathbb{Q}[r,r_1,r_2,\ldots]$ we can find a conclusive result. It plays an important role that if $r_1,r_2,\ldots$ are real, $r$ must also be real if there is a linear polynomial $p(r)\in I$, unless division by zero occurs when expressing $r$. Our results are presented in Mathematica, Maple and in a new version of the Giac computer algebra system. Finally, we present a prototype of the automated procedure in an experimental version of the dynamic geometry software GeoGebra.

2025-11-18T18:20:17Z 15 pages, 4 figures Zoltán Kovács Xicheng Peng http://arxiv.org/abs/2506.01228v4 Reweighted Spectral Partitioning Works: A Simple Algorithm for Vertex Separators in Special Graph Classes 2025-11-17T12:01:52Z

We establish that a simple polynomial-time algorithm that we call reweighted spectral partitioning obtains small 2/3-balanced vertex-separators for a number of graph classes, including $O(\sqrt{n})$-sized separators for planar graphs, $O(\min\{(\log g)^2,\logΔ\}\cdot\sqrt{gn})$-sized separators for genus-$g$ graphs of maximum degree $Δ$, and $O(\min\{\log h,\sqrt{\logΔ}\}(h\log h\log\log h)\sqrt{n})$-sized separators for $K_h$-minor-free graphs of maximum degree $Δ$. To accomplish this, we first obtain a refined form of a Cheeger-style inequality relating the vertex expansion of a graph and the solution to a semidefinite program defined over the graph. Then, to obtain the guarantees for specific graph classes, we derive direct bounds on the value of the semidefinite program. We also obtain several other results of independent interest, including an improved separator theorem for the intersection graphs of $d$-dimensional balls with bounded ply, a new bound on the Fiedler value of genus-$g$ graphs, and a new "spectral" proof of the planar separator theorem.

2025-06-02T00:40:30Z 42 pages, 12 figures Jack Spalding-Jamieson http://arxiv.org/abs/2403.04356v2 Fine-Grained Complexity of Earth Mover's Distance under Translation 2025-11-17T09:03:45Z

The Earth Mover's Distance is a popular similarity measure in several branches of computer science. It measures the minimum total edge length of a perfect matching between two point sets. The Earth Mover's Distance under Translation ($\mathrm{EMDuT}$) is a translation-invariant version thereof. It minimizes the Earth Mover's Distance over all translations of one point set. For $\mathrm{EMDuT}$ in $\mathbb{R}^1$, we present an $\tilde{\mathcal{O}}(n^2)$-time algorithm. We also show that this algorithm is nearly optimal by presenting a matching conditional lower bound based on the Orthogonal Vectors Hypothesis. For $\mathrm{EMDuT}$ in $\mathbb{R}^d$, we present an $\tilde{\mathcal{O}}(n^{2d+2})$-time algorithm for the $L_1$ and $L_\infty$ metric. We show that this dependence on $d$ is asymptotically tight, as an $n^{o(d)}$-time algorithm for $L_1$ or $L_\infty$ would contradict the Exponential Time Hypothesis (ETH). Prior to our work, only approximation algorithms were known for these problems.

2024-03-07T09:38:16Z 31 pages, 10 colored figures Journal of Computational Geometry 2025 Karl Bringmann Frank Staals Karol Węgrzycki Geert van Wordragen 10.20382/jocg.v16i2a10 http://arxiv.org/abs/2412.02174v3 All Polyhedral Manifolds are Connected by a 2-Step Refolding 2025-11-17T00:08:08Z

We prove that, for any two polyhedral manifolds $\mathcal P,\mathcal Q$, there is a polyhedral manifold $\mathcal I$ such that $\mathcal P,\mathcal I$ share a common unfolding and $\mathcal I,\mathcal Q$ share a common unfolding. In other words, we can unfold $\mathcal P$, refold (glue) that unfolding into $\mathcal I$, unfold $\mathcal I$, and then refold into $\mathcal Q$. Furthermore, if $\mathcal P,\mathcal Q$ have no boundary and can be embedded in 3D (without self-intersection), then so does $\mathcal I$. These results generalize to $n$ given manifolds $\mathcal P_1,\mathcal P_2, \dots, \mathcal P_n$; they all have a common unfolding with the same intermediate manifold $\mathcal I$. Allowing more than two unfold/refold steps, we obtain stronger results for two special cases: for doubly covered convex planar polygons, we achieve that all intermediate polyhedra are planar; and for tree-shaped polycubes, we achieve that all intermediate polyhedra are tree-shaped polycubes.

2024-12-03T05:18:20Z 14 pages, 10 figures. Presented at JCDCGGG 2024. Revision for Journal of Information Processing, correcting a bug in the manifold construction, and small typo Lily Chung Erik D. Demaine Jenny Diomidova Tonan Kamata Jayson Lynch Ryuhei Uehara Hanyu Alice Zhang http://arxiv.org/abs/2511.12059v1 Too Many or Too Few? Sampling Bounds for Topological Descriptors 2025-11-15T06:51:53Z

Topological descriptors, such as the Euler characteristic function and the persistence diagram, have grown increasingly popular for representing complex data. Recent work showed that a carefully chosen set of these descriptors encodes all of the geometric and topological information about a shape in R^d. In practice, epsilon nets are often used to find samples in one of two extremes. On one hand, making strong geometric assumptions about the shape allows us to choose epsilon small enough (corresponding to a high enough density sample) in order to guarantee a faithful representation, resulting in oversampling. On the other hand, if we choose a larger epsilon in order to allow faster computations, this leads to an incomplete description of the shape and a discretized transform that lacks theoretical guarantees. In this work, we investigate how many directions are really needed to represent geometric simplicial complexes, exploring both synthetic and real-world datasets. We provide constructive proofs that help establish size bounds and an experimental investigation giving insights into the consequences of over- and undersampling.

2025-11-15T06:51:53Z 19 pages, 8 figures, submitted to Computational Geometry Theory and Applications Brittany Terese Fasy Montana State University Bozeman USA Maksym Makarchuk Montana State University Bozeman USA Samuel Micka Western Colorado University Gunnison USA David L. Millman Blocky Bozeman USA