https://arxiv.org/api/l0ueoXHzbVnyPFmQBfAHAg6wclQ 2026-06-09T22:19:44Z 239 30 15 http://arxiv.org/abs/2402.05122v1 History of generative Artificial Intelligence (AI) chatbots: past, present, and future development 2024-02-04T05:01:38Z

This research provides an in-depth comprehensive review of the progress of chatbot technology over time, from the initial basic systems relying on rules to today's advanced conversational bots powered by artificial intelligence. Spanning many decades, the paper explores the major milestones, innovations, and paradigm shifts that have driven the evolution of chatbots. Looking back at the very basic statistical model in 1906 via the early chatbots, such as ELIZA and ALICE in the 1960s and 1970s, the study traces key innovations leading to today's advanced conversational agents, such as ChatGPT and Google Bard. The study synthesizes insights from academic literature and industry sources to highlight crucial milestones, including the introduction of Turing tests, influential projects such as CALO, and recent transformer-based models. Tracing the path forward, the paper highlights how natural language processing and machine learning have been integrated into modern chatbots for more sophisticated capabilities. This chronological survey of the chatbot landscape provides a holistic reference to understand the technological and historical factors propelling conversational AI. By synthesizing learnings from this historical analysis, the research offers important context about the developmental trajectory of chatbots and their immense future potential across various field of application which could be the potential take ways for the respective research community and stakeholders.

2024-02-04T05:01:38Z Md. Al-Amin Mohammad Shazed Ali Abdus Salam Arif Khan Ashraf Ali Ahsan Ullah Md Nur Alam Shamsul Kabir Chowdhury http://arxiv.org/abs/2302.05449v5 Heckerthoughts 2024-01-07T15:47:36Z

This manuscript is technical memoir about my work at Stanford and Microsoft Research. Included are fundamental concepts central to machine learning and artificial intelligence, applications of these concepts, and stories behind their creation.

2023-02-13T14:42:15Z Fixed typos around Equation 1 (thank you Xinlong Du), and added a philosophical note at the end of Section 3.5 about the perception that consciousness is unitary David Heckerman http://arxiv.org/abs/2311.07631v1 The 4+1 Model of Data Science 2023-11-13T12:12:32Z

Data Science is a complex and evolving field, but most agree that it can be defined as a combination of expertise drawn from three broad areascomputer science and technology, math and statistics, and domain knowledge -- with the purpose of extracting knowledge and value from data. Beyond this, the field is often defined as a series of practical activities ranging from the cleaning and wrangling of data, to its analysis and use to infer models, to the visual and rhetorical representation of results to stakeholders and decision-makers. This essay proposes a model of data science that goes beyond laundry-list definitions to get at the specific nature of data science and help distinguish it from adjacent fields such as computer science and statistics. We define data science as an interdisciplinary field comprising four broad areas of expertise: value, design, systems, and analytics. A fifth area, practice, integrates the other four in specific contexts of domain knowledge. We call this the 4+1 model of data science. Together, these areas belong to every data science project, even if they are often unconnected and siloed in the academy.

2023-11-13T12:12:32Z 28 pages Rafael C. Alvarado http://arxiv.org/abs/2309.13094v1 Computational Natural Philosophy: A Thread from Presocratics through Turing to ChatGPT 2023-09-22T11:47:36Z

Modern computational natural philosophy conceptualizes the universe in terms of information and computation, establishing a framework for the study of cognition and intelligence. Despite some critiques, this computational perspective has significantly influenced our understanding of the natural world, leading to the development of AI systems like ChatGPT based on deep neural networks. Advancements in this domain have been facilitated by interdisciplinary research, integrating knowledge from multiple fields to simulate complex systems. Large Language Models (LLMs), such as ChatGPT, represent this approach's capabilities, utilizing reinforcement learning with human feedback (RLHF). Current research initiatives aim to integrate neural networks with symbolic computing, introducing a new generation of hybrid computational models.

2023-09-22T11:47:36Z 17 pages Gordana Dodig-Crnkovic http://arxiv.org/abs/2309.01525v1 The History of Quantum Games 2023-09-04T11:10:58Z

In this paper, we explore the historical development of playable quantum physics related games (\textit{\textbf{quantum games}}). For the purpose of this examination, we have collected over 260 quantum games ranging from commercial games, applied and serious games, and games that have been developed at quantum themed game jams and educational courses. We provide an overview of the journey of quantum games across three dimensions: \textit{the perceivable dimension of quantum physics, the dimension of scientific purposes, and the dimension of quantum technologies}. We then further reflect on the definition of quantum games and its implications. While motivations behind developing quantum games have typically been educational or academic, themes related to quantum physics have begun to be more broadly utilised across a range of commercial games. In addition, as the availability of quantum computer hardware has grown, entirely new variants of quantum games have emerged to take advantage of these machines' inherent capabilities, \textit{quantum computer games}

2023-09-04T11:10:58Z 8 pages, from which 1.5 pages of references, 11 figures, one table, presented in the IEEE Conference on Games 2023 Laura Piispanen Edward Morrell Solip Park Marcell Pfaffhauser Annakaisa Kultima 10.1109/CoG57401.2023.10333150 http://arxiv.org/abs/2105.05302v2 Human-Machine Interaction in the Light of Turing and Wittgenstein 2023-08-07T10:01:41Z

We propose a study of the constitution of meaning in human-computer interaction based on Turing and Wittgenstein's definitions of thought, understanding, and decision. We show by the comparative analysis of the conceptual similarities and differences between the two authors that the common sense between humans and machines is co-constituted in and from action and that it is precisely in this co-constitution that lies the social value of their interaction. This involves problematizing human-machine interaction around the question of what it means to "follow a rule" to define and distinguish the interpretative modes and decision-making behaviors of each. We conclude that the mutualization of signs that takes place through the human-machine dialogue is at the foundation of the constitution of a computerized society.

2021-04-30T07:32:41Z in French language, Revue Implications Philosophiques, 2023 Charles Bodon UP1 UFR10 http://arxiv.org/abs/2210.05791v3 Sociotechnical Harms of Algorithmic Systems: Scoping a Taxonomy for Harm Reduction 2023-07-19T02:56:32Z

Understanding the landscape of potential harms from algorithmic systems enables practitioners to better anticipate consequences of the systems they build. It also supports the prospect of incorporating controls to help minimize harms that emerge from the interplay of technologies and social and cultural dynamics. A growing body of scholarship has identified a wide range of harms across different algorithmic technologies. However, computing research and practitioners lack a high level and synthesized overview of harms from algorithmic systems. Based on a scoping review of computing research $(n=172)$, we present an applied taxonomy of sociotechnical harms to support a more systematic surfacing of potential harms in algorithmic systems. The final taxonomy builds on and refers to existing taxonomies, classifications, and terminologies. Five major themes related to sociotechnical harms - representational, allocative, quality-of-service, interpersonal harms, and social system/societal harms - and sub-themes are presented along with a description of these categories. We conclude with a discussion of challenges and opportunities for future research.

2022-10-11T21:22:30Z Renee Shelby Shalaleh Rismani Kathryn Henne AJung Moon Negar Rostamzadeh Paul Nicholas N'Mah Yilla Jess Gallegos Andrew Smart Emilio Garcia Gurleen Virk http://arxiv.org/abs/2307.10265v1 AI empowering research: 10 ways how science can benefit from AI 2023-07-17T18:41:18Z

This article explores the transformative impact of artificial intelligence (AI) on scientific research. It highlights ten ways in which AI is revolutionizing the work of scientists, including powerful referencing tools, improved understanding of research problems, enhanced research question generation, optimized research design, stub data generation, data transformation, advanced data analysis, and AI-assisted reporting. While AI offers numerous benefits, challenges such as bias, privacy concerns, and the need for human-AI collaboration must be considered. The article emphasizes that AI can augment human creativity in science but not replace it.

2023-07-17T18:41:18Z César França http://arxiv.org/abs/2209.12816v2 Fast-FNet: Accelerating Transformer Encoder Models via Efficient Fourier Layers 2023-05-16T13:16:00Z

Transformer-based language models utilize the attention mechanism for substantial performance improvements in almost all natural language processing (NLP) tasks. Similar attention structures are also extensively studied in several other areas. Although the attention mechanism enhances the model performances significantly, its quadratic complexity prevents efficient processing of long sequences. Recent works focused on eliminating the disadvantages of computational inefficiency and showed that transformer-based models can still reach competitive results without the attention layer. A pioneering study proposed the FNet, which replaces the attention layer with the Fourier Transform (FT) in the transformer encoder architecture. FNet achieves competitive performances concerning the original transformer encoder model while accelerating training process by removing the computational burden of the attention mechanism. However, the FNet model ignores essential properties of the FT from the classical signal processing that can be leveraged to increase model efficiency further. We propose different methods to deploy FT efficiently in transformer encoder models. Our proposed architectures have smaller number of model parameters, shorter training times, less memory usage, and some additional performance improvements. We demonstrate these improvements through extensive experiments on common benchmarks.

2022-09-26T16:23:02Z 11 pages Nurullah Sevim Ege Ozan Özyedek Furkan Şahinuç Aykut Koç http://arxiv.org/abs/2205.15360v3 AI-enabled Sound Pattern Recognition on Asthma Medication Adherence: Evaluation with the RDA Benchmark Suite 2023-04-16T17:32:06Z

Asthma is a common, usually long-term respiratory disease with negative impact on global society and economy. Treatment involves using medical devices (inhalers) that distribute medication to the airways and its efficiency depends on the precision of the inhalation technique. There is a clinical need for objective methods to assess the inhalation technique, during clinical consultation. Integrated health monitoring systems, equipped with sensors, enable the recognition of drug actuation, embedded with sound signal detection, analysis and identification, from intelligent structures, that could provide powerful tools for reliable content management. Health monitoring systems equipped with sensors, embedded with sound signal detection, enable the recognition of drug actuation and could be used for effective audio content analysis. This paper revisits sound pattern recognition with machine learning techniques for asthma medication adherence assessment and presents the Respiratory and Drug Actuation (RDA) Suite (https://gitlab.com/vvr/monitoring-medication-adherence/rda-benchmark) for benchmarking and further research. The RDA Suite includes a set of tools for audio processing, feature extraction and classification procedures and is provided along with a dataset, consisting of respiratory and drug actuation sounds. The classification models in RDA are implemented based on conventional and advanced machine learning and deep networks' architectures. This study provides a comparative evaluation of the implemented approaches, examines potential improvements and discusses on challenges and future tendencies.

2022-05-30T18:08:28Z Nikos D. Fakotakis Stavros Nousias Gerasimos Arvanitis Evangelia I. Zacharaki Konstantinos Moustakas 10.1109/ACCESS.2023.3243547 http://arxiv.org/abs/2208.01765v2 Mary Kenneth Keller: First US PhD in Computer Science 2023-03-30T18:18:18Z

In June 1965, Sister Mary Kenneth Keller, BVM, received the first US PhD in Computer Science, and this paper outlines her life and accomplishments. As a scholar, she has the distinction of being an early advocate of learning-by-example in artificial intelligence. Her main scholarly contribution was in shaping computer science education in high schools and small colleges. She was an evangelist for viewing the computer as a symbol manipulator, for providing computer literacy to everyone, and for the use of computers in service to humanity. She was far ahead of her time in working to ensure a place for women in technology and in eliminating barriers preventing their participation, such as poor access to education and daycare. She was a strong and spirited woman, a visionary in seeing how computers would revolutionize our lives. A condensation of this paper appeared as, ``The Legacy of Mary Kenneth Keller, First U.S. Ph.D. in Computer Science," Jennifer Head and Dianne P. O'Leary, IEEE Annals of the History of Computing 45(1):55--63, January-March 2023.

2022-08-02T21:42:01Z This revision expands the abstract, adds a reference to a condensed version of this paper published in a journal, references Keller's work on ACM curricula, and notes an IEEE prize in her honor IEEE Annals of the History of Computing 45(1):55--63, January-March 2023 Jennifer Head Dianne P. O'Leary 10.1109/MAHC.2022.3231763 http://arxiv.org/abs/2304.12898v1 ChatGPT believes it is conscious 2023-03-29T13:15:45Z

The development of advanced generative chat models, such as ChatGPT, has raised questions about the potential consciousness of these tools and the extent of their general artificial intelligence. ChatGPT consistent avoidance of passing the test is here overcome by asking ChatGPT to apply the Turing test to itself. This explores the possibility of the model recognizing its own sentience. In its own eyes, it passes this test. ChatGPT's self-assessment makes serious implications about our understanding of the Turing test and the nature of consciousness. This investigation concludes by considering the existence of distinct types of consciousness and the possibility that the Turing test is only effective when applied between consciousnesses of the same kind. This study also raises intriguing questions about the nature of AI consciousness and the validity of the Turing test as a means of verifying such consciousness.

2023-03-29T13:15:45Z Arend Hintze http://arxiv.org/abs/2303.13740v1 The First Computer Program 2023-03-24T01:46:27Z

In 1837, the first computer program in history was sketched by the renowned mathematician and inventor Charles Babbage. It was a program for the Analytical Engine. The program consists of a sequence of arithmetical operations and the necessary variable addresses (memory locations) of the arguments and the result, displayed in tabular fashion, like a program trace. The program computes the solutions for a system of two linear equations in two unknowns.

2023-03-24T01:46:27Z 8 pages, 4 tables Raúl Rojas http://arxiv.org/abs/2301.02919v1 Charles Babbage, Ada Lovelace, and the Bernoulli Numbers 2023-01-07T18:55:04Z

This chapter makes needed corrections to an unduly negative scholarly view of Ada Lovelace. Credit between Lovelace and Babbage is not a zero-sum game, where any credit added to Lovelace somehow detracts from Babbage. Ample evidence indicates Babbage and Lovelace each had important contributions to the famous 1843 Sketch of Babbage's Analytical Engine and the accompanying Notes. Further, Lovelace's correspondence with two highly accomplished figures in 19th century mathematics, Charles Babbage and Augustus De Morgan, establish her mathematical background and sophistication. Babbage and Lovelace's treatment of the Bernoulli numbers in Note 'G' spotlights this aspect of their collaboration. Finally, while acknowledging significant definitional problems in calling Lovelace the world's "first computer programmer," I affirm that Lovelace created an elemental sequence of instructions -- that is, an algorithm -- for computing the series of Bernoulli numbers.

2023-01-07T18:55:04Z 20 pages, 4 figures In Robin Hammerman and Andrew L. Russell, eds., Ada's Legacy: Cultures of Computing from the Victorian to the Digital Age. Association for Computing Machinery and Morgan & Claypool, 2015 Thomas J. Misa 10.1145/2809523.2809527 http://arxiv.org/abs/2210.07178v2 Challenges and Opportunities of Large Transnational Datasets: A Case Study on European Administrative Crop Data 2022-12-22T11:35:49Z

Expansive, informative datasets are vital in providing foundations and possibilities for scientific research and development across many fields of study. Assembly of grand datasets, however, frequently poses difficulty for the author and stakeholders alike, with a variety of considerations required throughout the collaboration efforts and development lifecycle. In this work, we discuss and analyse the challenges and opportunities we faced throughout the creation of a transnational, European agricultural dataset containing reference labels of cultivated crops. Together, this forms a succinct framework of important elements one should consider when forging a dataset of their own.

2022-09-19T13:53:51Z for associated GitHub repository, see https://github.com/maja601/EuroCrops Workshop on Broadening Research Collaborations in ML (NeurIPS 2022) Maja Schneider Christian Marchington Marco Körner