Feed: https://arxiv.org/api/cL7xWL6Qgmw9vtI3DmHX+jTKwrU (updated 2026-06-09T21:31:26Z)

http://arxiv.org/abs/2510.23436v1
Title: Education Paradigm Shift To Maintain Human Competitive Advantage Over AI
Updated: 2025-10-27T15:38:20Z
Abstract: Discussion about the replacement of intellectual human labour by "thinking machines" has been present in public and expert discourse since Artificial Intelligence (AI) emerged as an idea and a term in the middle of the twentieth century. Until recently, it was more of a hypothetical concern. However, in recent years, with the rise of Generative AI, especially Large Language Models (LLMs), and particularly with the widespread popularity of the ChatGPT model, that concern became practical. Many domains of human intellectual labour have to adapt to the new AI tools that give humans new functionality and opportunity, but also question the viability and necessity of some human work that used to be considered intellectual yet has now become an easily automatable commodity. Education, unexpectedly, has now become burdened by an especially crucial role of charting long-range strategies for discovering viable human skills that would guarantee their place in the world of the ubiquitous use of AI in the intellectual sphere. We highlight weaknesses of the current AI and, especially, of its LLM-based core, show that root causes of LLMs' weaknesses are unfixable by the current technologies, and propose directions in the constructivist paradigm for the changes in Education that ensure long-term advantages of humans over AI tools.
Published: 2025-10-27T15:38:20Z
Authors: Stanislav Selitskiy, Chihiro Inoue
DOI: 10.2514/6.2024-4902

http://arxiv.org/abs/2510.11595v1
Title: Reproducibility: The New Frontier in AI Governance
Updated: 2025-10-13T16:34:25Z
Abstract: AI policymakers are responsible for delivering effective governance mechanisms that can provide safe, aligned and trustworthy AI development.
However, the information environment offered to policymakers is characterised by an unnecessarily low Signal-To-Noise Ratio, favouring regulatory capture and creating deep uncertainty and divides on which risks should be prioritised from a governance perspective. We posit that the current publication speeds in AI, combined with the lack of strong scientific standards, via weak reproducibility protocols, effectively erode the power of policymakers to enact meaningful policy and governance protocols. Our paper outlines how AI research could adopt stricter reproducibility guidelines to assist governance endeavours and improve consensus on the AI risk landscape. We evaluate the forthcoming reproducibility crisis within AI research through the lens of crises in other scientific domains, providing a commentary on how adopting preregistration, increased statistical power and negative result publication reproducibility protocols can enable effective AI governance. While we maintain that AI governance must be reactive due to AI's significant societal implications, we argue that policymakers and governments must consider reproducibility protocols as a core tool in the governance arsenal and demand higher standards for AI research. Code to replicate data and figures: https://github.com/IFMW01/reproducibility-the-new-frontier-in-ai-governance
Published: 2025-10-13T16:34:25Z
Comment: 12 pages, 6 figures, Workshop on Technical AI Governance at ICML
Authors: Israel Mason-Williams, Gabryel Mason-Williams

http://arxiv.org/abs/2511.11572v1
Title: LLM Architecture, Scaling Laws, and Economics: A Quick Summary
Updated: 2025-09-11T20:31:49Z
Abstract: The current standard architecture of Large Language Models (LLMs) with QKV self-attention is briefly summarized, including the architecture of a typical Transformer.
Scaling laws for compute (FLOPs) and memory (parameters plus data) are given, along with their present (2025) rough cost estimates for the parameters of present LLMs of various scales, including discussion of whether DeepSeek should be viewed as a special case. Nothing here is new, but this material seems not otherwise readily available in summary form.
Published: 2025-09-11T20:31:49Z
Comment: 9 pages, 3 figures
Author: William H. Press

http://arxiv.org/abs/2509.04372v1
Title: Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology
Updated: 2025-09-04T16:29:38Z
Abstract: In this note, we reflect on several fundamental connections among widely used post-training techniques. We clarify some intimate connections and equivalences between reinforcement learning with human feedback, reinforcement learning with internal feedback, and test-time scaling (particularly soft best-of-$N$ sampling), while also illuminating intrinsic links between diffusion guidance and test-time scaling. Additionally, we introduce a resampling approach for alignment and reward-directed diffusion models, sidestepping the need for explicit reinforcement learning techniques.
Published: 2025-09-04T16:29:38Z
Authors: Yuchen Jiao, Yuxin Chen, Gen Li

http://arxiv.org/abs/2508.16616v1
Title: The history of digital ethics
Updated: 2025-08-13T15:04:27Z
Abstract: Digital ethics, also known as computer ethics or information ethics, is now a lively field that draws a lot of attention, but how did it come about and what were the developments that led to its existence? What are the traditions, the concerns, the technological and social developments that pushed digital ethics? How did ethical issues change with digitalisation of human life? How did the traditional discipline of philosophy respond?
The article provides an overview, proposing historical epochs: 'pre-modernity' prior to digital computation over data, via the 'modernity' of digital data processing to our present 'post-modernity' when not only the data is digital, but our lives themselves are largely digital. In each section, the situation in technology and society is sketched, and then the developments in digital ethics are explained. Finally, a brief outlook is provided.
Published: 2025-08-13T15:04:27Z
Reference: (2022) in Carissa Véliz (ed.), Oxford handbook of digital ethics (Oxford: Oxford University Press), 3-19
Author: Vincent C. Müller

http://arxiv.org/abs/2501.16457v2
Title: Symbolic Mathematical Computation 1965--1975: The View from a Half-Century Perspective
Updated: 2025-05-02T23:33:48Z
Abstract: The 2025 ISSAC conference in Guanajuato, Mexico, marks the 50th event in this significant series, making it an ideal moment to reflect on the field's history. This paper reviews the formative years of symbolic computation up to 1975, fifty years ago. By revisiting a period unfamiliar to most current participants, this survey aims to shed light on once-pressing issues that are now largely resolved and to highlight how some of today's challenges were recognized earlier than expected.
Published: 2025-01-27T19:35:18Z
Comment: 18 pages, 149 references
Authors: Robert M. Corless, Arthur C. Norman, Tomas Recio, William J. Turkel, Stephen M. Watt

http://arxiv.org/abs/2504.17428v1
Title: Detection, Classification and Prevalence of Self-Admitted Aging Debt
Updated: 2025-04-24T10:38:55Z
Abstract: Context: Previous research on software aging is limited, with a focus on dynamic runtime indicators like memory and performance, often neglecting evolutionary indicators like source code comments and narrowly examining legacy issues within the technical debt (TD) context. Objective: We introduce the concept of Aging Debt (AD), representing the increased maintenance efforts and costs needed to keep software updated. We study AD through Self-Admitted Aging Debt (SAAD) observed in source code comments left by software developers.
Method: We employ a mixed-methods approach, combining qualitative and quantitative analyses to detect and measure AD in software. This includes framing SAAD patterns from the source code comments after analysing the source code context, then utilizing the SAAD patterns to detect SAAD comments. In the process, we develop a taxonomy for SAAD that reflects the temporal aging of software and its associated debt. Then we utilize the taxonomy to quantify the different types of AD prevalent in OSS repositories. Results: Our proposed taxonomy categorizes temporal software aging into Active and Dormant types. Our extensive analysis of more than 9,000 Open Source Software (OSS) repositories reveals that more than 21% of repositories exhibit signs of SAAD as observed from our gold standard SAAD dataset. Notably, Dormant AD emerges as the predominant category, highlighting a critical but often overlooked aspect of software maintenance. Conclusion: As software volume grows annually, so do evolutionary aging and maintenance challenges; our proposed taxonomy can aid researchers in detailed software aging studies and help practitioners develop improved and proactive maintenance strategies.
Published: 2025-04-24T10:38:55Z
Comment: Draft
Authors: Murali Sridharan, Mika Mäntylä, Leevi Rantala

http://arxiv.org/abs/2503.05767v1
Title: Mesterséges Intelligencia Kutatások Magyarországon [Artificial Intelligence Research in Hungary]
Updated: 2025-02-24T20:28:11Z
Abstract: Artificial intelligence (AI) has undergone remarkable development since the mid-2000s, particularly in the fields of machine learning and deep learning, driven by the explosive growth of large databases and computational capacity. Hungarian researchers recognized the significance of AI early on, actively participating in international research and achieving significant results in both theoretical and practical domains. This article presents some key achievements in Hungarian AI research.
It highlights the results from the period before the rise of deep learning (the early 2010s), then discusses major theoretical advancements in Hungary after 2010. Finally, it provides a brief overview of AI-related applied scientific achievements from 2010 onward.
Published: 2025-02-24T20:28:11Z
Comment: in Hungarian language. Submitted to Magyar Tudomány
Authors: András A. Benczúr, Tibor Gyimóthy, Balázs Szegedy

http://arxiv.org/abs/2311.03292v4
Title: Data Science from 1963 to 2012
Updated: 2024-10-22T17:57:48Z
Abstract: Consensus on the definition of data science remains low despite the widespread establishment of academic programs in the field and continued demand for data scientists in industry. Definitions range from rebranded statistics to data-driven science to the science of data to simply the application of machine learning to so-called big data to solve real-world problems. Current efforts to trace the history of the field in order to clarify its definition, such as Donoho's "50 Years of Data Science" (Donoho 2017), tend to focus on a short period when a small group of statisticians adopted the term in an unsuccessful attempt to rebrand their field in the face of the overshadowing effects of computational statistics and data mining. Using textual evidence from primary sources, this essay traces the history of the term to the 1960s, when it was first used by the US Air Force in a surprisingly similar way to its current usage, to 2012, the year that Harvard Business Review published the enormously influential article "Data Scientist: The Sexiest Job of the 21st Century" (Davenport and Patil 2012) and the American Statistical Association acknowledged a profound disconnect between statistics and data science (Rodriguez 2012).
Among the themes that emerge from this review are (1) the long-standing opposition between data analysts and data miners that continues to animate the field, (2) an established definition of the term as the practice of managing and processing scientific data that has been occluded by recent usage, and (3) the phenomenon of data impedance -- the disproportion between surplus data, indexed by phrases like data deluge and big data, and the limitations of computational machinery and methods to process them. This persistent condition appears to have motivated the use of the term and the field itself since its beginnings.
Published: 2023-11-06T17:35:35Z
Comment: 48 pages
Author: Rafael C. Alvarado

http://arxiv.org/abs/2301.09771v6
Title: Automation and AI Technology in Surface Mining With a Brief Introduction to Open-Pit Operations in the Pilbara
Updated: 2024-09-27T06:57:04Z
Abstract: This survey article provides a synopsis of some of the engineering problems, technological innovations, robotic development and automation efforts encountered in the mining industry -- particularly in the Pilbara iron-ore region of Western Australia. The goal is to paint the technology landscape and highlight issues relevant to an engineering audience to raise awareness of AI and automation trends in mining. It assumes the reader has no prior knowledge of mining and builds context gradually through focused discussion and short summaries of common open-pit mining operations. The principal activities that take place may be categorized in terms of resource development and mine, rail and port operations. From mineral exploration to ore shipment, there are roughly nine steps in between. These include: geological assessment, mine planning and development, production drilling and assaying, blasting and excavation, transportation of ore and waste, crush and screen, stockpile and load-out, rail network distribution, and ore-car dumping.
The objective is to describe these processes and provide insights on some of the challenges/opportunities from the perspective of a decade-long industry-university R&D partnership.
Published: 2023-01-24T00:57:37Z
Comment: Accepted manuscript. Paper provides insights on state-of-the-art technologies and future trends. Keywords: Mining automation, robotics, intelligent systems, machine learning, remote sensing, geostatistics, planning, scheduling, optimization, modelling, geology, complex systems. Document: 21 pages, 6 figures, 2 tables. 2024 Update: Added ICRA conference poster + slides as ancillary files
Reference: IEEE Robotics & Automation Magazine 32:3 (2025) 164-183
Authors: Raymond Leung, Andrew J Hill, Arman Melkumyan
DOI: 10.1109/MRA.2023.3328457

http://arxiv.org/abs/2407.02591v1
Title: Enabling Student Innovation through Virtual Reality Development
Updated: 2024-07-02T18:28:04Z
Abstract: It is clear, from the major press coverage that Virtual Reality (VR) development is garnering, that there is a huge amount of development interest in VR across multiple industries, including video streaming, gaming and simulated learning. Even though PC, web, and mobile are still the top platforms for software development, it is important for university computer science (CS) programs to expose students to VR as a development platform. Additionally, it is important for CS students to learn how to learn about new technologies, since change is constant in the CS field. CS curriculum changes happen much more slowly than the pace of technology adoption. As new technologies are introduced, CS faculty and students often learn together, especially in smaller CS programs. This paper describes how student-led VR projects are used, across the CS curriculum, as basic CS concepts are covered. The student-led VR projects are engaging, and promote learning and creativity.
Additionally, each student project inspires more students to try their hand at VR development as well.
Published: 2024-07-02T18:28:04Z
Comment: Published in proceedings and presented at https://micsymposium.org/mics2016/Papers/MICS_2016_paper_36.pdf; 10 pages; 3 figures
Reference: Harms, S. K. (2016). Enabling Student Innovation through Virtual Reality Development. 2016 Midwest Instructional Computing Symposium Proceedings, Cedar Rapids, IA
Author: Sherri Harms

http://arxiv.org/abs/2407.02492v1
Title: Max Bense as a Visionary: from Entropy to the Dialectics of Programmed Images
Updated: 2024-04-04T07:32:21Z
Abstract: In 1960 in Stuttgart, Max Bense published the book Programming the Beautiful [Programmierung des Schönen]. Bense looks in cybernetics for scientific concepts and instigates the thought of programming in the field of literature. His information aesthetics influences a whole generation of scientists and artists - including the Stuttgart Circle, which takes hold of the new aesthetics to carry out the first programmed artistic images. Is Max Bense a visionary? How is he revolutionizing the world of images? The article discusses the cybernetics that inspired Bense: a science of probability that contrasts with the principles of Newtonian physics. Moreover, in the sixties, Max Bense, together with Elisabeth Walther, launched the experimental magazine Rot, which devoted its pages to the concrete poetry and the first computer-generated images of Georg Nees. As Frieder Nake defends through his pioneering work and theory, these images oppose the visible and the computable.
This dialectic opens onto critical thinking about the algorithmic image in art and science.
Published: 2024-04-04T07:32:21Z
Comment: in French language
Reference: Images Re-Vues, 2021, 19
Author: Gaëtan Robillard (UP8)
DOI: 10.4000/imagesrevues.10395

http://arxiv.org/abs/2405.00040v1
Title: A guideline for the methodology chapter in computer science dissertations
Updated: 2024-03-29T13:31:54Z
Abstract: Rather than simply offering suggestions, this guideline for the methodology chapter in computer science dissertations provides thorough insights on how to develop a strong research methodology within the area of computer science. The method is structured into several parts starting with an overview of research strategies which include experiments, surveys, interviews and case studies. The guide highlights the significance of defining a research philosophy and reasoning by talking about paradigms such as positivism, constructivism and pragmatism. Besides, it reveals the importance of types of research including deductive and inductive methodologies; basic versus applied research approaches. Moreover, this guideline discusses data collection and analysis intricacies that divide data into quantitative and qualitative typologies. It explains different ways in which data can be collected from observation to experimentation, interviews or surveys. It also mentions ethical considerations in research emphasizing ethical behavior like following academic principles. In general, this guideline is an essential tool for undertaking computer science dissertations that help researchers structure their work while maintaining ethical standards in their study design.
Published: 2024-03-29T13:31:54Z
Author: Marco Araujo

http://arxiv.org/abs/2403.05592v1
Title: Eternal Sunshine of the Mechanical Mind: The Irreconcilability of Machine Learning and the Right to be Forgotten
Updated: 2024-03-06T13:23:57Z
Abstract: As we keep rapidly advancing toward an era where artificial intelligence is a constant and normative experience for most of us, we must also be aware of what this vision and this progress entail.
By first approximating neural connections and activities in computer circuits and then creating more and more sophisticated versions of this crude approximation, we are now facing an age to come where modern deep learning-based artificial intelligence systems can rightly be called thinking machines, and they are sometimes even lauded for their emergent behavior and black-box approaches. But as we create more powerful electronic brains, with billions of neural connections and parameters, can we guarantee that these mammoths built of artificial neurons will be able to forget the data that we store in them? If they are at some level like a brain, can the right to be forgotten still be protected while dealing with these AIs? The essential gap between machine learning and the right to be forgotten (RTBF) is explored in this article, with a premonition of far-reaching conclusions if the gap is not bridged or reconciled any time soon. The core argument is that deep learning models, due to their structure and size, cannot be expected to forget or delete a piece of data as would be expected of a tabular database, and they should be treated more like a mechanical brain, albeit still in development.
Published: 2024-03-06T13:23:57Z
Author: Meem Arafat Manab

http://arxiv.org/abs/2402.10393v1
Title: Darwin Turing Dawkins: Building a General Theory of Evolution
Updated: 2024-02-16T01:27:21Z
Abstract: Living things, computers, societies, and even books are part of a grand evolutionary struggle to survive. That struggle shapes nature, nations, religions, art, science, and you. What you think, feel, and do is determined by it. Darwinian evolution does not apply solely to the genes that are stored in DNA. Using the insights of Alan Turing and Richard Dawkins, we will see that it also applies to the memes we store in our brains and the information we store in our computers. The next time you run for president, fight a war, or just deal with the ordinary problems humans are heir to, perhaps this book will be of use.
If you want to understand why and when you will die, or if you want to achieve greatness, this book may help. If you are concerned about where the computer revolution is headed, this book may provide some answers.
Published: 2024-02-16T01:27:21Z
Comment: 247 pages
Author: Leonard M. Adleman