https://arxiv.org/api/W+iNZlcBnHZAwT506h99Xo7T/4U2026-06-09T23:21:53Z2394515http://arxiv.org/abs/2212.08141v1Epistemological Equation for Analysing Uncontrollable States in Complex Systems: Quantifying Cyber Risks from the Internet of Things2022-12-15T21:02:49ZTo enable quantitative risk assessment of uncontrollable risk states in complex and coupled IoT systems, a new epistemological equation is designed and tested though comparative and empirical analysis. The comparative analysis is conducted on national digital strategies, followed by an empirical analysis of cyber risk assessment approaches. The new epistemological analysis approach enables the assessment of uncontrollable risk states in complex IoT systems, which begin to resemble artificial intelligence, and can be used for a quantitative self-assessment of IoT cyber risk posture.2022-12-15T21:02:49ZPetar RadanlievDavid De RourePete BurnapOmar Santos10.1007/s12626-021-00086-5http://arxiv.org/abs/2212.08041v1Can REF output quality scores be assigned by AI? Experimental evidence2022-12-11T18:32:00ZThis document describes strategies for using Artificial Intelligence (AI) to predict some journal article scores in future research assessment exercises. Five strategies have been assessed.2022-12-11T18:32:00ZMike ThelwallKayvan KoushaMahshid AbdoliEmma StuartMeiko MakitaPaul WilsonJonathan Levitthttp://arxiv.org/abs/2206.05498v2A Review of Causality for Learning Algorithms in Medical Image Analysis2022-11-26T10:25:19ZMedical image analysis is a vibrant research area that offers doctors and medical practitioners invaluable insight and the ability to accurately diagnose and monitor disease. Machine learning provides an additional boost for this area. However, machine learning for medical image analysis is particularly vulnerable to natural biases like domain shifts that affect algorithmic performance and robustness. In this paper we analyze machine learning for medical image analysis within the framework of Technology Readiness Levels and review how causal analysis methods can fill a gap when creating robust and adaptable medical image analysis algorithms. We review methods using causality in medical imaging AI/ML and find that causal analysis has the potential to mitigate critical problems for clinical translation but that uptake and clinical downstream research has been limited so far.2022-06-11T11:04:13ZAccepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://www.melba-journal.org/papers/2022:028.html". ; Paper ID: 2022:028Machine.Learning.for.Biomedical.Imaging. 1 (2022)Athanasios VlontzosDaniel RueckertBernhard Kainzhttp://arxiv.org/abs/2211.09554v1Systematic Literature Review of Gender and Software Engineering in Asia2022-11-16T14:58:01ZIt is essential to discuss the role, difficulties, and opportunities concerning people of different gender in the field of software engineering research, education, and industry. Although some literature reviews address software engineering and gender, it is still unclear how research and practices in Asia exist for handling gender aspects in software development and engineering. We conducted a systematic literature review to grasp the comprehensive view of gender research and practices in Asia. We analyzed the 32 identified papers concerning countries and publication years among 463 publications. Researchers and practitioners from various organizations actively work on gender research and practices in some countries, including China, India, and Turkey. We identified topics and classified them into seven categories varying from personal mental health and team building to organization. Future research directions include investigating the synergy between (regional) gender aspects and cultural concerns and considering possible contributions and dependency among different topics to have a solid foundation for accelerating further research and getting actionable practices.2022-11-16T14:58:01ZAsia-Pacific Software Engineering and Diversity, Equity, and Inclusion (APSEDEI) workshop collocated with APSEC 2022, December 6th, 2022Hironori Washizakihttp://arxiv.org/abs/2207.12230v2Mind the hubris: complexity can misfire2022-10-30T12:23:23ZHere we briefly reflect on the philosophical foundations that ground the quest towards ever-detailed models and identify four practical dangers derived from this pursuit: explosion of the model's uncertainty space, model black-boxing, computational exhaustion and model attachment. We argue that the growth of a mathematical model should be carefully and continuously pondered lest models become extraneous constructs chasing the Cartesian dream.2022-06-22T09:57:06ZThis is a draft of a chapter that has been accepted for publication by Oxford University Press in the forthcoming book "Views on Mathematical Modelling", edited by Andrea Saltelli and Monica Di Fiore and due for publication in 2023Arnald PuyAndrea Saltellihttp://arxiv.org/abs/2210.13016v1Cards Against AI: Predicting Humor in a Fill-in-the-blank Party Game2022-10-24T08:05:21ZHumor is an inherently social phenomenon, with humorous utterances shaped by what is socially and culturally accepted. Understanding humor is an important NLP challenge, with many applications to human-computer interactions. In this work we explore humor in the context of Cards Against Humanity -- a party game where players complete fill-in-the-blank statements using cards that can be offensive or politically incorrect. We introduce a novel dataset of 300,000 online games of Cards Against Humanity, including 785K unique jokes, analyze it and provide insights. We trained machine learning models to predict the winning joke per game, achieving performance twice as good (20\%) as random, even without any user information. On the more difficult task of judging novel cards, we see the models' ability to generalize is moderate. Interestingly, we find that our models are primarily focused on punchline card, with the context having little impact. Analyzing feature importance, we observe that short, crude, juvenile punchlines tend to win.2022-10-24T08:05:21ZConditionally accepted in EMNLP 2022 short findings. 5 pageshttps://aclanthology.org/2022.findings-emnlp.394Dan OferDafna Shahafhttp://arxiv.org/abs/2210.00236v1Software system rationalisation: How to get better outcomes through stronger user engagement2022-10-01T10:07:28ZAs businesses get more sizable and more mature they now, inevitably accrete more and more software systems. This estate expansion leads not only to greater complexity and expense for the enterprise, but also to fragmentation, inconsistency and siloing of business processes. Because platform rationalisation and system decommissioning never happens spontaneously, a perennial problem for the enterprise then becomes how to simplify their corporate software platforms. Recently, Curlew Research personnel were involved in a software rationalisation program within a large global life sciences company and this paper describes an approach to decommissioning which we developed as part of that project, and which we feel could be of use more widely to help with objective more user-centric system rationalisation. The method derives from a model developed by Noriaki Kano et al to help with determining customer satisfaction and loyalty, and the prioritisation of new, additional functionality, features or "products", for example when looking to enhance software applications. Using a blueprint process for rationalisation, the Curlew-Kano method enables each application to be placed efficiently and objectively into one of four categories - Retain; Review; Remove; Research - thus allowing the enterprise to identify and prioritise quickly those systems which warrant further investigation as part of a decommissioning activity. The key difference of the Curlew-Kano method compared to other application rationalisation methodologies is the fundamental involvement of users in the identification of systems more suitable for rationalisation and possible decommissioning. In our view involving users more fully in system rationalisation leads to better outcomes for the enterprise.2022-10-01T10:07:28Z12 pages, 5 figures, 3 tables, 10 referencesRichard ShuteNick Lynchhttp://arxiv.org/abs/2209.10485v1Towards a Standardised Performance Evaluation Protocol for Cooperative MARL2022-09-21T16:40:03ZMulti-agent reinforcement learning (MARL) has emerged as a useful approach to solving decentralised decision-making problems at scale. Research in the field has been growing steadily with many breakthrough algorithms proposed in recent years. In this work, we take a closer look at this rapid development with a focus on evaluation methodologies employed across a large body of research in cooperative MARL. By conducting a detailed meta-analysis of prior work, spanning 75 papers accepted for publication from 2016 to 2022, we bring to light worrying trends that put into question the true rate of progress. We further consider these trends in a wider context and take inspiration from single-agent RL literature on similar issues with recommendations that remain applicable to MARL. Combining these recommendations, with novel insights from our analysis, we propose a standardised performance evaluation protocol for cooperative MARL. We argue that such a standard protocol, if widely adopted, would greatly improve the validity and credibility of future research, make replication and reproducibility easier, as well as improve the ability of the field to accurately gauge the rate of progress over time by being able to make sound comparisons across different works. Finally, we release our meta-analysis data publicly on our project website for future research on evaluation: https://sites.google.com/view/marl-standard-protocol2022-09-21T16:40:03ZPublished at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022). Website: see https://sites.google.com/view/marl-standard-protocol . 43 Pages, 21 Figures, 8 TablesRihab GorsaneOmayma MahjoubRuan de KockRoland DubbSiddarth SinghArnu Pretoriushttp://arxiv.org/abs/2208.04738v2Long-Term Mentoring for Computer Science Researchers2022-09-17T16:46:07ZEarly in the pandemic, we -- leaders in the research areas of programming languages (PL) and computer architecture (CA) -- realized that we had a problem: the only way to form new lasting connections in the community was to already have lasting connections in the community. Both of our academic communities had wonderful short-term mentoring programs to address this problem, but it was clear that we needed long-term mentoring programs.
Those of us in CA approached this scientifically, making an evidence-backed case for community-wide long-term mentoring. In the meantime, one of us in PL had impulsively launched an unofficial long-term mentoring program, founded on chaos and spreadsheets. In January 2021, the latter grew to an official cross-institutional long-term mentoring program called SIGPLAN-M; in January 2022, the former grew to Computer Architecture Long-term Mentoring (CALM).
The impacts have been strong: SIGPLAN-M reaches 328 mentees and 234 mentors across 41 countries, and mentees have described it as "life changing" and "a career saver." And while CALM is in its pilot phase -- with 13 mentors and 21 mentees across 7 countries -- it has received very positive feedback. The leaders of SIGPLAN-M and CALM shared our designs, impacts, and challenges along the way. Now, we wish to share those with you. We hope this will kick-start a larger long-term mentoring effort across all of computer science.2022-08-06T13:24:20ZEmily RuppelSihang LiuElba GarzaSukyoung RyuAlexandra SilvaTalia Ringerhttp://arxiv.org/abs/2209.11197v1An Overview of Phishing Victimization: Human Factors, Training and the Role of Emotions2022-09-13T12:51:20ZPhishing is a form of cybercrime and a threat that allows criminals, phishers, to deceive end users in order to steal their confidential and sensitive information. Attackers usually attempt to manipulate the psychology and emotions of victims. The increasing threat of phishing has made its study worthwhile and much research has been conducted into the issue. This paper explores the emotional factors that have been reported in previous studies to be significant in phishing victimization. In addition, we compare what security organizations and researchers have highlighted in terms of phishing types and categories as well as training in tackling the problem, in a literature review which takes into account all major credible and published sources.2022-09-13T12:51:20ZMousa Jari10.5121/csit.2022.121319http://arxiv.org/abs/2206.10257v14Satoshi Nakamoto and the Origins of Bitcoin -- The Profile of a 1-in-a-Billion Genius2022-09-09T16:04:31ZThe mystery about the ingenious creator of Bitcoin concealing behind the pseudonym Satoshi Nakamoto has been fascinating the global public for more than a decade. Suddenly jumping out of the dark in 2008, this persona hurled the decentralized electronic cash system "Bitcoin", which has reached a peak market capitalization in the region of 1 trillion USD. In a purposely agnostic, and meticulous "lea-ving no stone unturned" approach, this study presents new hard facts, which evidently slipped through Satoshi Nakamoto's elaborate privacy shield, and derives meaningful pointers that are primarily inferred from Bitcoin's whitepaper, its blockchain parameters, and data that were widely up to his discretion. This ample stack of established and novel evidence is systematically categorized, analyzed, and then connected to its related, real-world ambient, like relevant locations and happenings in the past, and at the time. Evidence compounds towards a substantial role of the Benelux cryptography ecosystem, with strong transatlantic links, in the creation of Bitcoin. A consistent biography, a psychogram, and gripping story of an ingenious, multi-talented, autodidactic, reticent, and capricious polymath transpire, which are absolutely unique from a history of science and technology perspective. A cohort of previously fielded and best matches emerging from the investigations are probed against an unprecedently restrictive, multi-stage exclusion filter, which can, with maximum certainty, rule out most "Satoshi Nakamoto" candidates, while some of them remain to be confirmed. With this article, you will be able to decide who is not, or highly unlikely to be Satoshi Nakamoto, be equipped with an ample stack of systematically categorized evidence and efficient methodologies to find suitable candidates, and can possibly unveil the real identity of the creator of Bitcoin - if you want.2022-06-21T11:10:21ZMain text: 84 pages Number of references: 1468 Appendix: 5 pagesJens Ducréehttp://arxiv.org/abs/2204.07612v2Contextualizing Artificially Intelligent Morality: A Meta-Ethnography of Top-Down, Bottom-Up, and Hybrid Models for Theoretical and Applied Ethics in Artificial Intelligence2022-09-08T18:15:35ZIn this meta-ethnography, we explore three different angles of ethical artificial intelligence (AI) design implementation including the philosophical ethical viewpoint, the technical perspective, and framing through a political lens. Our qualitative research includes a literature review that highlights the cross-referencing of these angles by discussing the value and drawbacks of contrastive top-down, bottom-up, and hybrid approaches previously published. The novel contribution to this framework is the political angle, which constitutes ethics in AI either being determined by corporations and governments and imposed through policies or law (coming from the top), or ethics being called for by the people (coming from the bottom), as well as top-down, bottom-up, and hybrid technicalities of how AI is developed within a moral construct and in consideration of its users, with expected and unexpected consequences and long-term impact in the world. There is a focus on reinforcement learning as an example of a bottom-up applied technical approach and AI ethics principles as a practical top-down approach. This investigation includes real-world case studies to impart a global perspective, as well as philosophical debate on the ethics of AI and theoretical future thought experimentation based on historical facts, current world circumstances, and possible ensuing realities.2022-04-15T18:47:49Z22 pages, 4 tables, accepted for publication in the Future of Information and Communication Conference (FICC) 2023 proceedings will be published in Springer series "Lecture Notes in Networks and Systems" and submitted for consideration to Web of Science, SCOPUS, INSPEC, WTI Frankfurt eG, zbMATH and SCImagoJennafer S. RobertsLaura N. Montoyahttp://arxiv.org/abs/2209.02297v1SIND: A Drone Dataset at Signalized Intersection in China2022-09-06T08:49:44ZIntersection is one of the most challenging scenarios for autonomous driving tasks. Due to the complexity and stochasticity, essential applications (e.g., behavior modeling, motion prediction, safety validation, etc.) at intersections rely heavily on data-driven techniques. Thus, there is an intense demand for trajectory datasets of traffic participants (TPs) in intersections. Currently, most intersections in urban areas are equipped with traffic lights. However, there is not yet a large-scale, high-quality, publicly available trajectory dataset for signalized intersections. Therefore, in this paper, a typical two-phase signalized intersection is selected in Tianjin, China. Besides, a pipeline is designed to construct a Signalized INtersection Dataset (SIND), which contains 7 hours of recording including over 13,000 TPs with 7 types. Then, the behaviors of traffic light violations in SIND are recorded. Furthermore, the SIND is also compared with other similar works. The features of the SIND can be summarized as follows: 1) SIND provides more comprehensive information, including traffic light states, motion parameters, High Definition (HD) map, etc. 2) The category of TPs is diverse and characteristic, where the proportion of vulnerable road users (VRUs) is up to 62.6% 3) Multiple traffic light violations of non-motor vehicles are shown. We believe that SIND would be an effective supplement to existing datasets and can promote related research on autonomous driving.The dataset is available online via: https://github.com/SOTIF-AVLab/SinD2022-09-06T08:49:44Z8 pagesYanchao XuWenbo ShaoJun LiKai YangWeida WangHua HuangChen LvHong Wanghttp://arxiv.org/abs/2209.07493v1Decentralized Infrastructure for (Neuro)science2022-09-01T01:46:29ZThe most pressing problems in science are neither empirical nor theoretical, but infrastructural. Scientific practice is defined by coproductive, mutually reinforcing infrastructural deficits and incentive systems that everywhere constrain and contort our art of curiosity in service of profit and prestige. Our infrastructural problems are not unique to science, but reflective of the broader logic of digital enclosure where platformatized control of information production and extraction fuels some of the largest corporations in the world. I have taken lessons learned from decades of intertwined digital cultures within and beyond academia like wikis, pirates, and librarians in order to draft a path towards more liberatory infrastructures for both science and society. Based on a system of peer-to-peer linked data, I sketch interoperable systems for shared data, tools, and knowledge that map onto three domains of platform capture: storage, computation and communication. The challenge of infrastructure is not solely technical, but also social and cultural, and so I attempt to ground a practical development blueprint in an ethics for organizing and maintaining it. I intend this draft as a rallying call for organization, to be revised with the input of collaborators and through the challenges posed by its implementation. I argue that a more liberatory future for science is neither utopian nor impractical -- the truly impractical choice is to continue to organize science as prestige fiefdoms resting on a pyramid scheme of underpaid labor, playing out the clock as every part of our work is swallowed whole by circling information conglomerates. It was arguably scientists looking for a better way to communicate that created something as radical as the internet in the first place, and I believe we can do it again.2022-09-01T01:46:29ZOriginal Web Document: https://jon-e.net/infrastructureJonny L. Saundershttp://arxiv.org/abs/2208.00003v1RangL: A Reinforcement Learning Competition Platform2022-07-28T09:44:21ZThe RangL project hosted by The Alan Turing Institute aims to encourage the wider uptake of reinforcement learning by supporting competitions relating to real-world dynamic decision problems. This article describes the reusable code repository developed by the RangL team and deployed for the 2022 Pathways to Net Zero Challenge, supported by the UK Net Zero Technology Centre. The winning solutions to this particular Challenge seek to optimize the UK's energy transition policy to net zero carbon emissions by 2050. The RangL repository includes an OpenAI Gym reinforcement learning environment and code that supports both submission to, and evaluation in, a remote instance of the open source EvalAI platform as well as all winning learning agent strategies. The repository is an illustrative example of RangL's capability to provide a reusable structure for future challenges.2022-07-28T09:44:21ZDocuments in general and premierly the RangL competition plattform and in particular its 2022's competition "Pathways to Netzero" 10 pages, 2 figures, 1 table, Comments welcome!Viktor ZobernigRichard A. SaldanhaJinke HeErica van der SarJasper van DoornJia-Chen HuaLachlan R. MasonAleksander CzechowskiDrago IndjicTomasz KosmalaAlessandro ZoccaSandjai BhulaiJorge Montalvo ArvizuClaude KlöcklJohn Moriarty