https://arxiv.org/api/6haN9Rxs1OkpbQ49DABXegoQJx8 2026-06-14T04:30:13Z 6065 585 15 http://arxiv.org/abs/2509.20493v1 InsightGUIDE: An Opinionated AI Assistant for Guided Critical Reading of Scientific Literature 2025-09-24T19:10:52Z

The proliferation of scientific literature presents an increasingly significant challenge for researchers. While Large Language Models (LLMs) offer promise, existing tools often provide verbose summaries that risk replacing, rather than assisting, the reading of the source material. This paper introduces InsightGUIDE, a novel AI-powered tool designed to function as a reading assistant, not a replacement. Our system provides concise, structured insights that act as a "map" to a paper's key elements by embedding an expert's reading methodology directly into its core AI logic. We present the system's architecture, its prompt-driven methodology, and a qualitative case study comparing its output to a general-purpose LLM. The results demonstrate that InsightGUIDE produces more structured and actionable guidance, serving as a more effective tool for the modern researcher.

2025-09-24T19:10:52Z Accepted for publication on ICTAI 2025 Paris Koloveas Serafeim Chatzopoulos Thanasis Vergoulis Christos Tryfonopoulos http://arxiv.org/abs/2504.10552v4 LEMUR Neural Network Dataset: Towards Seamless AutoML 2025-09-24T10:29:39Z

Neural networks are the backbone of modern artificial intelligence, but designing, evaluating, and comparing them remains labor-intensive. While numerous datasets exist for training, there are few standardized collections of the models themselves. We introduce LEMUR, an open-source dataset and framework that provides a large collection of PyTorch-based neural networks across tasks such as classification, segmentation, detection, and natural language processing. Each model follows a unified template, with configurations and results stored in a structured database to ensure consistency and reproducibility. LEMUR integrates automated hyperparameter optimization via Optuna, includes statistical analysis and visualization tools, and offers an API for seamless access to performance data. The framework is extensible, allowing researchers to add new models, datasets, or metrics without breaking compatibility. By standardizing implementations and unifying evaluation, LEMUR aims to accelerate AutoML research, enable fair benchmarking, and reduce barriers to large-scale neural network experimentation. To support adoption and collaboration, LEMUR and its plugins are released under the MIT license at: https://github.com/ABrain-One/nn-dataset https://github.com/ABrain-One/nn-plots https://github.com/ABrain-One/nn-vr

2025-04-14T09:08:00Z Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 3291-3300, 2026 Arash Torabi Goodarzi Roman Kochnev Waleed Khalid Hojjat Torabi Goudarzi Furui Qin Tolgay Atinc Uzun Yashkumar Sanjaybhai Dhameliya Yash Kanubhai Kathiriya Zofia Antonina Bentyn Dmitry Ignatov Radu Timofte http://arxiv.org/abs/2408.13430v3 The ICML 2023 Ranking Experiment: Examining Author Self-Assessment in ML/AI Peer Review 2025-09-23T17:56:41Z

We conducted an experiment during the review process of the 2023 International Conference on Machine Learning (ICML), asking authors with multiple submissions to rank their papers based on perceived quality. In total, we received 1,342 rankings, each from a different author, covering 2,592 submissions. In this paper, we present an empirical analysis of how author-provided rankings could be leveraged to improve peer review processes at machine learning conferences. We focus on the Isotonic Mechanism, which calibrates raw review scores using the author-provided rankings. Our analysis shows that these ranking-calibrated scores outperform the raw review scores in estimating the ground truth ``expected review scores'' in terms of both squared and absolute error metrics. Furthermore, we propose several cautious, low-risk applications of the Isotonic Mechanism and author-provided rankings in peer review, including supporting senior area chairs in overseeing area chairs' recommendations, assisting in the selection of paper awards, and guiding the recruitment of emergency reviewers.

2024-08-24T01:51:23Z Minor revision of Section 4; Published in Journal of the American Statistical Association (JASA) as a Discussion Paper Buxin Su Jiayao Zhang Natalie Collina Yuling Yan Didong Li Kyunghyun Cho Jianqing Fan Aaron Roth Weijie Su http://arxiv.org/abs/2509.19206v1 A decentralized future for the open-science databases 2025-09-23T16:28:21Z

Continuous and reliable access to curated biological data repositories is indispensable for accelerating rigorous scientific inquiry and fostering reproducible research. Centralized repositories, though widely used, are vulnerable to single points of failure arising from cyberattacks, technical faults, natural disasters, or funding and political uncertainties. This can lead to widespread data unavailability, data loss, integrity compromises, and substantial delays in critical research, ultimately impeding scientific progress. Centralizing essential scientific resources in a single geopolitical or institutional hub is inherently dangerous, as any disruption can paralyze diverse ongoing research. The rapid acceleration of data generation, combined with an increasingly volatile global landscape, necessitates a critical re-evaluation of the sustainability of centralized models. Implementing federated and decentralized architectures presents a compelling and future-oriented pathway to substantially strengthen the resilience of scientific data infrastructures, thereby mitigating vulnerabilities and ensuring the long-term integrity of data. Here, we examine the structural limitations of centralized repositories, evaluate federated and decentralized models, and propose a hybrid framework for resilient, FAIR, and sustainable scientific data stewardship. Such an approach offers a significant reduction in exposure to governance instability, infrastructural fragility, and funding volatility, and also fosters fairness and global accessibility. The future of open science depends on integrating these complementary approaches to establish a globally distributed, economically sustainable, and institutionally robust infrastructure that safeguards scientific data as a public good, further ensuring continued accessibility, interoperability, and preservation for generations to come.

2025-09-23T16:28:21Z 21 Pages, 2 figures Gaurav Sharma Viorel Munteanu Nika Mansouri Ghiasi Jineta Banerjee Susheel Varma Luca Foschini Kyle Ellrott Onur Mutlu Dumitru Ciorbă Roel A. Ophoff Viorel Bostan Christopher E Mason Jason H. Moore Despoina Sousoni Arunkumar Krishnan Christopher E. Mason Mihai Dimian Gustavo Stolovitzky Fabio G. Liberante Taras K. Oleksyk Serghei Mangul http://arxiv.org/abs/2509.12955v2 Automated Generation of Research Workflows from Academic Papers: A Full-text Mining Framework 2025-09-23T14:57:23Z

The automated generation of research workflows is essential for improving the reproducibility of research and accelerating the paradigm of "AI for Science". However, existing methods typically extract merely fragmented procedural components and thus fail to capture complete research workflows. To address this gap, we propose an end-to-end framework that generates comprehensive, structured research workflows by mining full-text academic papers. As a case study in the Natural Language Processing (NLP) domain, our paragraph-centric approach first employs Positive-Unlabeled (PU) Learning with SciBERT to identify workflow-descriptive paragraphs, achieving an F1-score of 0.9772. Subsequently, we utilize Flan-T5 with prompt learning to generate workflow phrases from these paragraphs, yielding ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.4543, 0.2877, and 0.4427, respectively. These phrases are then systematically categorized into data preparation, data processing, and data analysis stages using ChatGPT with few-shot learning, achieving a classification precision of 0.958. By mapping categorized phrases to their document locations in the documents, we finally generate readable visual flowcharts of the entire research workflows. This approach facilitates the analysis of workflows derived from an NLP corpus and reveals key methodological shifts over the past two decades, including the increasing emphasis on data analysis and the transition from feature engineering to ablation studies. Our work offers a validated technical framework for automated workflow generation, along with a novel, process-oriented perspective for the empirical investigation of evolving scientific paradigms. Source code and data are available at: https://github.com/ZH-heng/research_workflow.

2025-09-16T10:59:23Z Journal of Informetrics, 2025 Heng Zhang Chengzhi Zhang http://arxiv.org/abs/2307.00794v2 Current policies governing editorial conflicts of interest are ineffective 2025-09-23T08:42:30Z

Research-active editors face a potential conflict of interest (COI) when handling submissions from authors who share the same affiliation or those who recently collaborated with the editor. Since perception of COIs arising from such editor-author associations may erode trust in science, some policies recommend, and others demand, recusal in such incidents. However, the effectiveness of such measures is unknown to date. To fill this gap, we analyze half a million papers from six publishers who specify the handling editor of each paper. We find numerous papers with editor-author associations, and demonstrate that such papers tend to be accepted faster. A quasi-experimental design exploiting policy changes at PNAS and PLOS reveals the limited effectiveness of current COI policies. A network neural embedding model reveals that requiring editors with potential COIs to recuse may compromise the suitability of the handling editor. Finally, an online survey experiment demonstrates that such COIs influence trust in the paper's finding, but public disclosure eliminates this effect.

2023-07-03T07:22:19Z Fengyuan Liu Bedoor AlShebli Talal Rahwan http://arxiv.org/abs/2502.01525v3 Archiving and Replaying Current Web Advertisements: Challenges and Opportunities 2025-09-23T02:57:59Z

Although web advertisements represent an inimitable part of digital cultural heritage, serious archiving and replay challenges persist. To explore these challenges, we created a dataset of 279 archived ads. We encountered five problems in archiving and replaying them. For one, prior to August 2023, Internet Archive's Save Page Now service excluded not only well-known ad services' ads, but also URLs with ad related file and directory names. Although after August 2023, Save Page Now still blocked the archiving of ads loaded on a web page, it permitted the archiving of an ad's resources if the user directly archived the URL(s) associated with the ad. Second, Brozzler's incompatibility with Chrome prevented ads from being archived. Third, during crawling and replay sessions, Google's and Amazon's ad scripts generated URLs with different random values. This precluded archived ads' replay. Updating replay systems' fuzzy matching approach should enable the replay of these ads. Fourth, when loading Flashtalking web page ads outside of ad iframes, the ad script requested a non-existent URL. This, prevented the replay of ad resources. But as was the case with Google and Amazon ads, updating replay systems' fuzzy matching approach should enable Flashtalking ads' replay. Finally, successful replay of ads loaded in iframes with the src attribute of "about:blank" depended upon a given browser's service worker implementation. A Chromium bug stopped service workers from accessing resources inside of this type of iframe, which in turn prevented replay. Replacing the "about:blank" value for the iframe's src attribute with a blob URL before an ad was loaded solved this problem. Resolving these replay problems will improve the replay of ads and other dynamically loaded embedded web resources that use random values or "about:blank" iframes.

2025-02-03T17:02:12Z Travis Reid Alex H. Poole Hyung Wook Choi Christopher Rauch Mat Kelly Michael L. Nelson Michele C. Weigle http://arxiv.org/abs/2510.21725v1 Shifting norms in scholarly publications: trends in readability, objectivity, authorship, and AI use 2025-09-22T08:34:20Z

Academic and scientific publishing practices have changed significantly in recent years. This paper presents an analysis of 17 million research papers published since 2000 to explore changes in authorship and content practices. It shows a clear trend towards more authors, more references and longer abstracts. While increased authorship has been reported elsewhere, the present analysis shows that it is pervasive across many major fields of study. We also identify a decline in author productivity which suggests that `gift' authorship (the inclusion of authors who have not contributed significantly to a work) may be a significant factor. We further report on a tendency for authors to use more hyperbole, perhaps exaggerating their contributions to compete for the limited attention of reviewers, and often at the expense of readability. This has been especially acute since 2023, as AI has been increasingly used across many fields of study, but particularly in fields such as Computer Science, Engineering and Business. In summary, many of these changes are causes of significant concern. Increased authorship counts and gift authorship have the potential to distort impact metrics such as field-weighted citation impact andh-index, while increased AI usage may compromise readability and objectivity.

2025-09-22T08:34:20Z 21 pages, 6 figurs Padraig Cunningham Padhraic Smyth Barry Smyth http://arxiv.org/abs/2509.17465v1 Open Political Corpora: Structuring, Searching, and Analyzing Political Text Collections with PoliCorp 2025-09-22T08:00:55Z

In this work, we present PoliCorp (https://demo-pollux.gesis.org/), a web portal designed to facilitate the search and analysis of political text corpora. PoliCorp provides researchers with access to rich textual data, enabling in-depth analysis of parliamentary discourse over time. The platform currently features a collection of transcripts from debates in the German parliament, spanning 76 years of proceedings. With the advanced search functionality, researchers can apply logical operations to combine or exclude search criteria, making it easier to filter through vast amounts of parliamentary debate data. The search can be customised by combining multiple fields and applying logical operators to uncover complex patterns and insights within the data. Additional data processing steps were performed to enable web-based search and incorporate extra features. A key feature that differentiates PoliCorp is its intuitive web-based interface that enables users to query processed political texts without requiring programming skills. The user-friendly platform allows for the creation of custom subcorpora via search parameters, which can be freely downloaded in JSON format for further analysis.

2025-09-22T08:00:55Z Nina Smirnova Muhammad Ahsan Shahid Philipp Mayr http://arxiv.org/abs/2505.18207v2 BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text 2025-09-21T22:30:31Z

In scientific research, ``limitations'' refer to the shortcomings, constraints, or weaknesses of a study. A transparent reporting of such limitations can enhance the quality and reproducibility of research and improve public trust in science. However, authors often underreport limitations in their papers and rely on hedging strategies to meet editorial requirements at the expense of readers' clarity and confidence. This tendency, combined with the surge in scientific publications, has created a pressing need for automated approaches to extract and generate limitations from scholarly papers. To address this need, we present a full architecture for computational analysis of research limitations. Specifically, we (1) create a dataset of limitations from ACL, NeurIPS, and PeerJ papers by extracting them from the text and supplementing them with external reviews; (2) we propose methods to automatically generate limitations using a novel Retrieval Augmented Generation (RAG) technique; (3) we design a fine-grained evaluation framework for generated limitations, along with a meta-evaluation of these techniques.

2025-05-22T06:04:02Z 16 pages, 8 figures, Accepted to the 2025 Conference on Empirical Methods in Natural Language Processing (Findings) Ibrahim Al Azher Miftahul Jannat Mokarrama Zhishuai Guo Sagnik Ray Choudhury Hamed Alhoori http://arxiv.org/abs/2509.16831v1 Survivors, Complainers, and Borderliners: Upward Bias in Online Discussions of Academic Conference Reviews 2025-09-20T23:04:15Z

Online discussion platforms, such as community Q&A sites and forums, have become important hubs where academic conference authors share and seek information about the peer review process and outcomes. However, these discussions involve only a subset of all submissions, raising concerns about the representativeness of the self-reported review scores. In this paper, we conduct a systematic study comparing the review score distributions of self-reported submissions in online discussions (based on data collected from Zhihu and Reddit) with those of all submissions. We reveal a consistent upward bias: the score distribution of self-reported samples is shifted upward relative to the population score distribution, with this difference statistically significant in most cases. Our analysis identifies three distinct contributors to this bias: (1) survivors, authors of accepted papers who are more likely to share good results than those of rejected papers who tend to conceal bad ones; (2) complainers, authors of high-scoring rejected papers who are more likely to voice complaints about the peer review process or outcomes than those of low scores; and (3) borderliners, authors with borderline scores who face greater uncertainty prior to decision announcements and are more likely to seek advice during the rebuttal period. These findings have important implications for how information seekers should interpret online discussions of academic conference reviews.

2025-09-20T23:04:15Z Hangxiao Zhu Yian Yin Yu Zhang http://arxiv.org/abs/2503.09811v2 Exploring the dynamics of external and self-citations and their role in shaping scientific impact 2025-09-19T20:39:49Z

Understanding the mechanisms driving the distribution of scientific citations is a key challenge in assessing the scientific impact of authors. We investigate the influence of the preferential attachment rule (PAR) in this process by analysing individual citation events from the DBLP dataset and two Scopus-based datasets, enabling us to estimate the probability of citations being assigned preferentially. Our findings reveal that, for the aggregated dataset, PAR dominates the citation distribution process, with approximately 70% of citations adhering to this mechanism. However, analysis at the individual level shows significant variability, with some authors experiencing a greater prevalence of preferential citations, particularly in the context of external citations. In contrast, self-citations exhibit notably different behaviour, with only 20% following PAR. We also demonstrate that the prominence of PAR increases with an author's citability (average citations per paper), suggesting that more citable authors are preferentially cited, while less-cited authors experience more random citation patterns. Furthermore, we show that self-citations may influence bibliometric indices, such as the h-index. Our results confirm the distinct dynamics of self-citations compared to external citations, raising questions about the mechanisms driving self-citation patterns. These findings provide new insights into citation behaviours and highlight the limitations of existing approaches.

2025-03-12T20:20:45Z Maciej J. Mrowinski Aleksandra Buczek Agata Fronczak http://arxiv.org/abs/2509.15780v1 Ontology Creation and Management Tools: the Case of Anatomical Connectivity 2025-09-19T09:10:29Z

We are developing infrastructure to support researchers in mapping data related to the peripheral nervous system and other physiological systems, with an emphasis on their relevance to the organs under investigation. The nervous system, a complex network of nerves and ganglia, plays a critical role in coordinating and transmitting signals throughout the body. To aid in this, we have created ApiNATOMY, a framework for the topological and semantic representation of multiscale physiological circuit maps. ApiNATOMY integrates a Knowledge Representation (KR) model and a suite of Knowledge Management (KM) tools. The KR model enables physiology experts to easily capture interactions between anatomical entities, while the KM tools help modelers convert high-level abstractions into detailed models of physiological processes, which can be integrated with external ontologies and knowledge graphs.

2025-09-19T09:10:29Z 14 pages Natallia Kokash Bernard de Bono Tom Gillespie http://arxiv.org/abs/2406.04405v2 Streamlining and standardizing software citations with The Software Citation Station 2025-09-18T17:27:29Z

Software is crucial for the advancement of astronomy especially in the context of rapidly growing datasets that increasingly require algorithm and pipeline development to process the data and produce results. However, software has not always been consistently cited, despite its importance to strengthen support for software development. To encourage, streamline, and standardize the process of citing software in academic work such as publications we introduce 'The Software Citation Station': a publicly available website and tool to quickly find or add software citations

2024-06-06T18:00:04Z Visit the Software citation station at https://www.tomwagg.com/software-citation-station/ which includes a very easy form to add your own software package! comments welcome! Tom Wagg Floor S. Broekgaarden http://arxiv.org/abs/2509.14457v1 Keywords are not always the key: A metadata field analysis for natural language search on open data portals 2025-09-17T22:14:27Z

Open data portals are essential for providing public access to open datasets. However, their search interfaces typically rely on keyword-based mechanisms and a narrow set of metadata fields. This design makes it difficult for users to find datasets using natural language queries. The problem is worsened by metadata that is often incomplete or inconsistent, especially when users lack familiarity with domain-specific terminology. In this paper, we examine how individual metadata fields affect the success of conversational dataset retrieval and whether LLMs can help bridge the gap between natural queries and structured metadata. We conduct a controlled ablation study using simulated natural language queries over real-world datasets to evaluate retrieval performance under various metadata configurations. We also compare existing content of the metadata field 'description' with LLM-generated content, exploring how different prompting strategies influence quality and impact on search outcomes. Our findings suggest that dataset descriptions play a central role in aligning with user intent, and that LLM-generated descriptions can support effective retrieval. These results highlight both the limitations of current metadata practices and the potential of generative models to improve dataset discoverability in open data portals.

2025-09-17T22:14:27Z Accepted to CHIRA 2025 as Full Paper Lisa-Yao Gan Arunav Das Johanna Walker Elena Simperl