http://arxiv.org/abs/2512.01354v3The Necessity of Imperfection: Reversing Model Collapse via Simulating Cognitive Boundedness2025-12-08T22:57:17ZAlthough synthetic data is widely promoted as a remedy, its prevailing production paradigm -- one optimizing for statistical smoothness -- systematically removes the long-tail, cognitively grounded irregularities that characterize human text. Prolonged training on such statistically optimal but cognitively impoverished data accelerates model collapse.
This paper proposes a paradigm shift: instead of imitating the surface properties of data, we simulate the cognitive processes that generate human text. We introduce the Prompt-driven Cognitive Computing Framework (PMCSF), whose core consists of a Cognitive State Decoder (CSD) that reverse-engineers unstructured text into structured cognitive vectors, and a Cognitive Text Encoder (CTE) that re-materializes these states into text enriched with human-typical imperfections via mathematically defined Cognitive Perturbation Operators.
The framework is validated through a two-stage objective evaluation pipeline. First, in cognitive codec verification, CTE text yields a Jensen-Shannon divergence of 0.0614 from human text (vs. 0.4431 for standard LLM output), passes double-blind professional media review, and achieves an intraclass correlation coefficient ICC > 0.9 for cognitive profile alignment across heterogeneous models. Second, in functional gain evaluation, isomorphic stress tests in the A-share market show that strategies incorporating CTE-generated data reduce maximum drawdown by 47.4% during the 2015 crash and deliver 8.6% Defensive Alpha, exceeding transaction costs by a factor of 33.
Our findings demonstrate that modelling human cognitive limitations -- not copying surface data -- enables synthetic data with genuine functional gain, offering a viable technical pathway toward resolving the AI data-collapse crisis.2025-12-01T07:09:38Z60 pages,9 figures. v3: Major update. Added 3D topological visualization (Figure 1) and independent computational verification of the Adaptive Markets Hypothesis (AMH). Includes comprehensive Supplementary Materials (algorithmic pseudocode, system architecture, and real-time GARCH logs) for technical reproducibilityZhongjie Jianghttp://arxiv.org/abs/2507.22712v2Order-Flow Filtration and Directional Association with Short-Horizon Returns2025-12-08T04:09:43ZElectronic markets generate dense order flow with many transient orders, which degrade directional signals derived from the limit order book (LOB). We study whether simple structural filters on order lifetime, modification count, and modification timing sharpen the association between order book imbalance (OBI) and short-horizon returns in BankNifty index futures, where unfiltered OBI is already known to be a strong short-horizon directional indicator. The efficacy of each filter is evaluated using a three-step diagnostic ladder: contemporaneous correlations, linear association between discretised regimes, and Hawkes event-time excitation between OBI and return regimes. Our results indicate that filtration of the aggregate order flow produces only modest changes relative to the unfiltered benchmark. By contrast, when filters are applied on the parent orders of executed trades, the resulting OBI series exhibits systematically stronger directional association. Motivated by recent regulatory initiatives to curb noisy order flow, we treat the association between OBI and short-horizon returns as a policy-relevant diagnostic of market quality. 
We then compare unfiltered and filtered OBI series, using tick-by-tick data from the National Stock Exchange of India, to infer how structural filters on the order flow affect OBI-return dynamics in an emerging market setting.2025-07-30T14:22:47Z21 pagesAditya Nittur AnanthaShashi JainPrithwish Maitihttp://arxiv.org/abs/2512.06309v1Wealth or Stealth? The Camouflage Effect in Insider Trading2025-12-06T05:54:28ZWe consider a Kyle-type model where insider trading takes place among a potentially large population of liquidity traders and is subject to legal penalties. Insiders exploit the liquidity provided by the trading masses to "camouflage" their actions and balance expected wealth with the necessary stealth to avoid detection. Under a diverse spectrum of prosecution schemes, we establish the existence of equilibria for arbitrary population sizes and a unique limiting equilibrium. A convergence analysis determines the scale of insider trading by a stealth index $γ$, revealing that the equilibrium can be closely approximated by a simple limit due to diminished price informativeness. Empirical aspects are derived from two calibration experiments using non-overlapping data sets spanning from 1980 to 2018, which underline the indispensable role of a large population in insider trading models with legal risk, along with important implications for the incidence of stealth trading and the deterrent effect of legal enforcement.2025-12-06T05:54:28Z49 pages; 6 tables; 3 figuresJin MaWeixuan XiaJianfeng Zhanghttp://arxiv.org/abs/2512.15732v1The Red Queen's Trap: Limits of Deep Evolution in High-Frequency Trading2025-12-05T19:30:26ZThe integration of Deep Reinforcement Learning (DRL) and Evolutionary Computation (EC) is frequently hypothesized to be the "Holy Grail" of algorithmic trading, promising systems that adapt autonomously to non-stationary market regimes. 
This paper presents a rigorous post-mortem analysis of "Galaxy Empire," a hybrid framework coupling LSTM/Transformer-based perception with a genetic "Time-is-Life" survival mechanism. Deploying a population of 500 autonomous agents in a high-frequency cryptocurrency environment, we observed a catastrophic divergence between training metrics (Validation APY $>300\%$) and live performance (Capital Decay $>70\%$). We deconstruct this failure through a multi-disciplinary lens, identifying three critical failure modes: the overfitting of \textit{Aleatoric Uncertainty} in low-entropy time-series, the \textit{Survivor Bias} inherent in evolutionary selection under high variance, and the mathematical impossibility of overcoming microstructure friction without order-flow data. Our findings provide empirical evidence that increasing model complexity in the absence of information asymmetry exacerbates systemic fragility.2025-12-05T19:30:26ZYijia Chenhttp://arxiv.org/abs/2512.05011v1Risk aversion of insider and dynamic asymmetric information2025-12-04T17:21:52ZThis paper studies a Kyle-Back model with a risk-averse insider possessing exponential utility and a dynamic stochastic signal about the asset's terminal fundamental value. While the existing literature considers either risk-neutral insiders with dynamic signals or risk-averse insiders with static signals, we establish equilibrium when both features are present. Our approach imposes no restrictions on the magnitude of the risk aversion parameter, extending beyond previous work that requires sufficiently small risk aversion. We employ a weak conditioning methodology to construct a Schrödinger bridge between the insider's signal and the asset price process, an approach that naturally accommodates stochastic signal evolution and removes risk aversion constraints.
We derive necessary conditions for equilibrium, showing that the optimal insider strategy must be continuous with bounded variation. Under these conditions, we characterize the market-maker pricing rule and insider strategy that achieve equilibrium. We obtain explicit closed-form solutions for important cases including deterministic and quadratic signal volatilities, demonstrating the tractability of our framework.2025-12-04T17:21:52ZAlbina DanilovaValentin Lizhdvoyhttp://arxiv.org/abs/2512.04603v1FX Market Making with Internal Liquidity2025-12-04T09:25:14ZAs the FX markets continue to evolve, many institutions have started offering passive access to their internal liquidity pools. Market makers act as principal and have the opportunity to fill those orders as part of their risk management, or they may choose to adjust pricing to their external OTC franchise to facilitate the matching flow. It is, a priori, unclear how the strategies managing internal liquidity should depend on market conditions, the market maker's risk appetite, and the placement algorithms deployed by participating clients. The market maker's actions in the presence of passive orders are relevant not only for their own objectives, but also for those liquidity providers who have certain expectations of the execution speed. In this work, we investigate the optimal multi-objective strategy of a market maker with an option to take liquidity on an internal exchange, and draw important qualitative insights for real-world trading.2025-12-04T09:25:14Z12 pagesAlexander BarzykinRobert BoyceEyal Neumanhttp://arxiv.org/abs/2501.17096v2Why is the estimation of metaorder impact with public market data so challenging?2025-12-03T16:06:53ZEstimating market impact and transaction costs of large trades (metaorders) is a very important topic in finance.
However, models of price and trade based on public market data produce average price trajectories that are qualitatively different from what is observed during real metaorder executions: the price increases linearly, rather than in a concave way, during the execution and the amount of reversion after its end is very limited. We claim that this is a generic phenomenon due to the fact that even sophisticated statistical models are unable to correctly describe the origin of the autocorrelation of the order flow. We propose a modified Transient Impact Model which provides more realistic trajectories by assuming that only a fraction of the metaorder trading triggers market order flow. Interestingly, in our model there is a critical condition on the kernels of the price and order flow equations in which market impact becomes permanent.2025-01-28T17:29:08ZManuel NaviglioGiacomo BormettiFrancesco CampigliGerman RodikovFabrizio Lillohttp://arxiv.org/abs/2512.15720v1Hidden Order in Trades Predicts the Size of Price Moves2025-12-02T23:20:46ZFinancial markets exhibit an apparent paradox: while directional price movements remain largely unpredictable--consistent with weak-form efficiency--the magnitude of price changes displays systematic structure. Here we demonstrate that real-time order-flow entropy, computed from a 15-state Markov transition matrix at second resolution, predicts the magnitude of intraday returns without providing directional information. Analysis of 38.5 million SPY trades over 36 trading days reveals that conditioning on entropy below the 5th percentile increases subsequent 5-minute absolute returns by a factor of 2.89 (t = 12.41, p < 0.0001), while directional accuracy remains at 45.0%--statistically indistinguishable from chance (p = 0.12). This decoupling arises from a fundamental symmetry: entropy is invariant under sign permutation, detecting the presence of informed trading without revealing its direction.
Walk-forward validation across five non-overlapping test periods confirms out-of-sample predictability, and label-permutation placebo tests yield z = 14.4 against the null. These findings suggest that information-theoretic measures may serve as volatility state variables in market microstructure, though the limited sample (36 days, single instrument) requires extended validation.2025-12-02T23:20:46ZHidden order in trades predicts the size of intraday price moves but not the direction, consistent with entropy's permutation symmetry. 38.5M SPY trades, 5-fold walk-forward validation, z=14.4 under label-permutation placebo tests, using a transparent intraday trading rule to quantify economic impact. Feedback welcomeMainak Singhahttp://arxiv.org/abs/2512.03123v1A Stochastic Thermodynamics Approach to Price Impact and Round-Trip Arbitrage: Theory and Empirical Implications2025-12-02T17:07:08ZThis paper develops a comprehensive theoretical framework that imports concepts from stochastic thermodynamics to model price impact and characterize the feasibility of round-trip arbitrage in financial markets. A trading cycle is treated as a non-equilibrium thermodynamic process, where price impact represents dissipative work and market noise plays the role of thermal fluctuations. The paper proves a Financial Second Law: under general convex impact functionals, any round-trip trading strategy yields non-positive expected profit. This structural constraint is complemented by a fluctuation theorem that bounds the probability of profitable cycles in terms of dissipated work and market volatility. The framework introduces a statistical ensemble of trading strategies governed by a Gibbs measure, leading to a free energy decomposition that connects expected cost, strategy entropy, and a market temperature parameter. 
The framework provides rigorous, testable inequalities linking microstructural impact to macroscopic no-arbitrage conditions, offering a novel physics-inspired perspective on market efficiency. The paper derives explicit analytical results for prototypical trading strategies and discusses empirical validation protocols.2025-12-02T17:07:08ZAmit Kumar Jhahttp://arxiv.org/abs/2507.02027v2Arbitrage with bounded Liquidity2025-12-02T12:35:05ZWe derive the arbitrage gains or, equivalently, Loss Versus Rebalancing (LVR) for arbitrage between \textit{two imperfectly liquid} markets, extending prior work that assumes the existence of an infinitely liquid reference market. Our result highlights that the LVR depends on the relative liquidity and relative trading volume of the two markets between which arbitrage gains are extracted. Our model assumes that trading costs on at least one of the markets are quadratic. This assumption holds well in practice, with the exception of highly liquid major pairs on centralized exchanges, for which we discuss extensions to other cost functions.2025-07-02T16:47:20ZChristoph SchlegelQuintus Kilbournhttp://arxiv.org/abs/2507.05749v2Event-Time Anchor Selection for Multi-Contract Quoting2025-12-02T04:50:00ZWhen quoting across multiple contracts, the sequence of execution can be a key driver of implementation shortfall relative to the target spread~\cite{bergault2022multi}. We model the short-horizon execution risk from such quoting as variations in transaction prices between the initiation of the first leg and the completion of the position. Our quoting policy anchors the spread by designating one contract ex ante as a \emph{reference contract}. Reducing execution risk requires a predictive criterion for selecting that contract whose price is most stable over the execution interval.
This paper develops a diagnostic framework for reference-contract selection that evaluates this stability by contrasting order-flow Hawkes forecasts with a Composite Liquidity Factor (CLF) of instantaneous limit order book (LOB) shape. We illustrate the framework on tick-by-tick data for a pair of NIFTY futures contracts. The results suggest that event-history and LOB-state signals offer complementary views of short-horizon execution risk for reference-contract selection.2025-07-08T07:52:07Z29 pagesAditya Nittur AnanthaShashi JainShivam GoyalDhruv Misrahttp://arxiv.org/abs/2511.22766v1Beta-Dependent Gamma Feedback and Endogenous Volatility Amplification in Option Markets2025-11-27T21:27:56ZWe develop a theoretical framework that aims to link micro-level option hedging and stock-specific factor exposure with macro-level market turbulence and explain endogenous volatility amplification during gamma-squeeze events. By explicitly modeling market-maker delta-neutral hedging and incorporating beta-dependent volatility normalization, we derive a stability condition that characterizes the onset of a gamma-squeeze event. The model captures a nonlinear recursive feedback loop between market-maker hedging and price movements and the resulting self-reinforcing dynamics. From a complex-systems perspective, the dynamics represent a bounded nonlinear response in which effective gain depends jointly on beta-normalized shock perception and gamma-scaled sensitivity. Our analysis highlights that low-beta stocks exhibit disproportionately strong feedback even for modest absolute price movements.2025-11-27T21:27:56ZHaoying Daihttp://arxiv.org/abs/2511.22101v1Adaptive Dueling Double Deep Q-networks in Uniswap V3 Replication and Extension with Mamba2025-11-27T04:45:20ZThe report goes through the main steps of replicating and improving the article "Adaptive Liquidity Provision in Uniswap V3 with Deep Reinforcement Learning." 
The replication part includes how to obtain data from the Uniswap Subgraph, details of the implementation, and comments on the results. After the replication, I propose a new structure based on the original model, which combines Mamba with DDQN and a new reward function. In this new structure, I clean the data again and introduce two new baselines for comparison. As a result, although the model has not yet been applied to all datasets, it shows stronger theoretical support than the original model and performs better in some tests.2025-11-27T04:45:20Z12 pages, 5 figuresZhaofeng Zhanghttp://arxiv.org/abs/2511.20606v2Limit Order Book Dynamics in Matching Markets: Microstructure, Spread, and Execution Slippage2025-11-26T04:14:16ZConventional models of matching markets assume that monetary transfers can clear markets by compensating for utility differentials. However, empirical patterns show that such transfers often fail to close structural preference gaps. This paper introduces a market microstructure framework that models matching decisions as a limit order book system with rigid bid ask spreads. Individual preferences are represented by a latent preference state matrix, where the spread between an agent's internal ask price (the unconditional maximum) and the market's best bid (the reachable maximum) creates a structural liquidity constraint. We establish a Threshold Impossibility Theorem showing that linear compensation cannot close these spreads unless it induces a categorical identity shift. A dynamic discrete choice execution model further demonstrates that matches occur only when the market to book ratio crosses a time decaying liquidity threshold, analogous to order execution under inventory pressure. Numerical experiments validate persistent slippage, regional invariance of preference orderings, and high tier zero spread executions. 
The model provides a unified microstructure explanation for matching failures, compensation inefficiency, and post match regret in illiquid order driven environments.2025-11-25T18:34:46Z33 pages, 7 figures, 5 experiments, 6 appendices. Primary category: q-fin.TR; Secondary: cs.SI. Code: https://github.com/Republic1024/Limit-Order-Matching-MicrostructureYao Wuhttp://arxiv.org/abs/2504.10282v2Optimal Execution in Intraday Energy Markets under Hawkes Processes with Transient Impact2025-11-25T20:47:47ZThis paper investigates optimal execution strategies in intraday energy markets through a mutually exciting Hawkes process model. Calibrated to data from the German intraday electricity market, the model effectively captures key empirical features, including intra-session volatility, distinct intraday market activity patterns, and the Samuelson effect as gate closure approaches. By integrating a transient price impact model with a bivariate Hawkes process to model the market order flow, we derive an optimal trading trajectory for energy companies managing large volumes, accounting for the specific trading patterns in these markets. A back-testing analysis compares the proposed strategy against standard benchmarks such as Time-Weighted Average Price (TWAP) and Volume-Weighted Average Price (VWAP), demonstrating substantial cost reductions across various hourly trading products in intraday energy markets.2025-04-14T14:51:18Z24 pages, 34 figuresKonstantinos ChatziandreouSven Karbach
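
The TWAP and VWAP benchmarks named in the last abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names `twap` and `vwap`, the assumption of equally spaced observations, and the sample prices and volumes are all hypothetical and serve only to show how the two reference prices differ.

```python
from typing import Sequence


def twap(prices: Sequence[float]) -> float:
    """Time-weighted average price, assuming equally spaced observations."""
    if not prices:
        raise ValueError("prices must be non-empty")
    return sum(prices) / len(prices)


def vwap(prices: Sequence[float], volumes: Sequence[float]) -> float:
    """Volume-weighted average price over matched price/volume observations."""
    if len(prices) != len(volumes):
        raise ValueError("prices and volumes must have the same length")
    total_volume = sum(volumes)
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return sum(p * v for p, v in zip(prices, volumes)) / total_volume


# Hypothetical observations (illustrative numbers, not market data).
prices = [50.0, 52.0, 51.0, 55.0]
volumes = [10.0, 20.0, 30.0, 40.0]

twap_price = twap(prices)
vwap_price = vwap(prices, volumes)
print(f"TWAP = {twap_price:.2f}, VWAP = {vwap_price:.2f}")
```

In this toy data most of the volume trades at the higher prices, so the VWAP sits above the TWAP. An execution strategy is typically judged by comparing its average fill price against reference prices of this kind.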