An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

17 de julio de 2026

Sequential recommendation is a central task in recommender systems, and recent research has increasingly shifted toward generative recommenders that leverage both sequential patterns and semantic item information. However, these methods are often evaluated on a small set of widely used benchmarks. This raises a natural question: do these benchmarks actually require the advanced modeling capabilities of modern generative recommenders? We conduct a benchmark audit using an intentionally simple graph heuristic: starting from only the last one or two interacted items, it retrieves candidates from a few-hop item-transition graph and ranks them with item-feature similarity. Surprisingly, despite its simplicity, this heuristic matches or outperforms a broad set of modern baselines on a variety of popular sequential recommendation benchmarks. For example, it achieves relative NDCG@10 improvements of 38.10% and 44.18% over the best competing baseline on the widely used Amazon Review Sports and CDs datasets, respectively.

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation Leer más »

Páginas: 1 2 3

When silence is safer: a review and decision-theoretic framework for LLM abstention in healthcare

14 de julio de 2026

Large language models (LLMs) are designed to generate answers to user prompts, which often drives them to respond even when uncertainty is high, information is incomplete, or a refusal would be more appropriate. In healthcare, this tendency can be dangerous: confidently stated but inaccurate medical advice can cause significant harm, making the ability to abstain especially important. In this paper, we review studies investigating LLM abstention behaviors in healthcare. The literature highlights two main motivations: (1) uncertainty-driven abstention, where the model withholds a response when confidence is low, and (2) safety-driven abstention, where the model declines to provide potentially harmful information. Most existing mechanisms are extrinsic and rely on auxiliary tools to determine when to abstain. We find that state-of-the-art LLMs still struggle to refuse inappropriate prompts, and that few benchmarks evaluate abstention in realistic medical scenarios, where performance lags behind other domains. Building on these findings, we introduce a decision-theoretic formalization of abstention that models the trade-off between answering and withholding responses under uncertainty and potential harm. Based on this formulation, we present MedSAFE, a framework for evaluating abstention in clinical dialogs, and demonstrate its operationalization through a proof-of-concept pilot across clinical scenarios derived from the review.

When silence is safer: a review and decision-theoretic framework for LLM abstention in healthcare Leer más »

Páginas: 1 2 3

Advancing regulatory variant effect prediction with AlphaGenome

12 de julio de 2026

Deep learning models that predict functional genomic measurements from DNA sequences are powerful tools for deciphering the genetic regulatory code. Existing methods involve a trade-off between input sequence length and prediction resolution, thereby limiting their modality scope and performance1,2,3,4,5. We present AlphaGenome, a unified DNA sequence model, which takes as input 1 Mb of DNA sequence and predicts thousands of functional genomic tracks up to single-base-pair resolution across diverse modalities. The modalities include gene expression, transcription initiation, chromatin accessibility, histone modifications, transcription factor binding, chromatin contact maps, splice site usage and splice junction coordinates and strength. Trained on human and mouse genomes, AlphaGenome matches or exceeds the strongest available external models in 25 of 26 evaluations of variant effect prediction. The ability of AlphaGenome to simultaneously score variant effects across all modalities accurately recapitulates the mechanisms of clinically relevant variants near the TAL1 oncogene6. To facilitate broader use, we provide tools for making genome track and variant effect predictions from sequence.

Advancing regulatory variant effect prediction with AlphaGenome Leer más »

Páginas: 1 2 3

Sequence Feature Extraction for Malware Family Analysis via Graph Neural Network

25 de septiembre de 2022

Malicious software (malware) causes much harm to our devices and life. We are eager to understand the malware behavior and the threat it made. Most of the record files of malware are variable length and text-based files with time stamps, such as event log data and dynamic analysis profiles. Using the time stamps, we can sort such data into sequence-based data for the following analysis. However, dealing with the text-based sequences with variable lengths is difficult. In addition, unlike natural language text data, most sequential data in information security have specific properties and structure, such as loop, repeated call, noise, etc. To deeply analyze the API call sequences with their structure, we use graphs to represent the sequences, which can further investigate the information and structure, such as the Markov model. Therefore, we design and implement an Attention Aware Graph Neural Network (AWGCN) to analyze the API call sequences. Through AWGCN, we can obtain the sequence embeddings to analyze the behavior of the malware. Moreover, the classification experiment result shows that AWGCN outperforms other classifiers in the call-like datasets, and the embedding can further improve the classic model’s performance.

Sequence Feature Extraction for Malware Family Analysis via Graph Neural Network Leer más »

Páginas: 1 2 3

Auto Quantum Circuits

11 de agosto de 2022

«AutoQML, self-assembling circuits, hyper-parameterized Quantum ML platform, using cirq, tensorflow and tfq. Trillions of possible qubit registries, gate combinations and moment sequences, ready to be adapted into your ML flow. Here I demonstrate climatechange, jameswebbspacetelescope and microbiology vision applications… [Thus far, a circuit with 16-Qubits and a gate sequence of [ YY ] – [ XX ] – [CNOT] has performed the best, per my blend of metrics…].

Auto Quantum Circuits Leer más »

Páginas: 1 2

Federated Learning: Issues in Medical Application

29 de mayo de 2022

In this presentation, the current issues to make federated learning flawlessly useful in the real world will be briefly overviewed. They are related to data/system heterogeneity, client management, traceability, and security. Also, we introduce the modularized federated learning framework, we currently develop, to experiment various techniques and protocols to find solutions for aforementioned issues. The framework will be open to public after development completes.

Federated Learning: Issues in Medical Application Leer más »

Páginas: 1 2 3

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

27 de marzo de 2022

In this survey, we connect several lines of work from the pre-neural and neural era, by showing how hybrid approaches of words and characters as well as subword-based approaches based on learned segmentation have been proposed and evaluated. We conclude that there is and likely will never be a silver bullet singular solution for all applications and that thinking seriously about tokenization remains important for many applications

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP Leer más »

Páginas: 1 2 3

Scientific Visualization: Python + Matplotlib

27 de febrero de 2022

The Python scientific visualisation landscape is huge. It is composed of a myriad of tools, ranging from the most versatile and widely used down to the more specialised and confidential. Some of these tools are community based while others are developed by companies. Some are made specifically for the web, others are for the desktop only, some deal with 3D and large data, while others target flawless 2D rendering.

Scientific Visualization: Python + Matplotlib Leer más »

Páginas: 1 2 3

R0:62a6aa3b4882ad9b194a4ae5c97b4d58-Ethics-based auditing of automated decision-making systems: intervention points and policy implications

Ethics-based auditing of automated decision-making systems: intervention points and policy implications

16 de enero de 2022

Organisations increasingly use automated decision-making systems (ADMS) to inform decisions that affect humans and their environment. While the use of ADMS can improve the accuracy and efficiency of decision-making processes, it is also coupled with ethical challenges. Unfortunately, the governance mechanisms currently used to oversee human decision-making often fail when applied to ADMS.

Ethics-based auditing of automated decision-making systems: intervention points and policy implications Leer más »

Páginas: 1 2 3

Do Vision Transformers See Like Convolutional Neural Networks?

5 de diciembre de 2021

Convolutional neural networks (CNNs) have so far been the de-facto model for visual data. Recent work has shown that (Vision) Transformer models (ViT) can achieve comparable or even superior performance on image classification tasks. This raises a central question: how are Vision Transformers solving these tasks? Are they acting like convolutional networks, or learning entirely different visual representations? Analyzing the internal representation structure of ViTs and CNNs on image classification benchmarks, we find striking differences between the two architectures, such as ViT having more uniform representations across all layers. We explore how these differences arise, finding crucial roles played by self-attention, which enables early aggregation of global information, and ViT residual connections, which strongly propagate features from lower to higher layers.

Do Vision Transformers See Like Convolutional Neural Networks? Leer más »

Páginas: 1 2 3

Research