{"id":3112,"date":"2023-01-17T20:44:50","date_gmt":"2023-01-18T02:44:50","guid":{"rendered":"https:\/\/kermitmurray.com\/msblog\/?page_id=3112"},"modified":"2023-01-17T20:44:50","modified_gmt":"2023-01-18T02:44:50","slug":"biorxiv-bioinformatics","status":"publish","type":"page","link":"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-bioinformatics\/","title":{"rendered":"BioRxiv Bioinformatics"},"content":{"rendered":"\n<div class=\"wp-block-caxton-grid relative\"><div class=\"absolute absolute--fill\"><div class=\"absolute absolute--fill cover bg-center\" style=\"background-color:;background-image:linear-gradient( );\"><\/div><div class=\"absolute absolute--fill\" style=\"background-color:;background-image:linear-gradient( );opacity:1;\"><\/div><\/div><div class=\"relative caxton-columns caxton-grid-block\" style=\"padding-top:0;padding-left:0;padding-bottom:0;padding-right:0;grid-template-columns:repeat(12, 1fr)\" data-tablet-css=\"padding-left:em;padding-right:em;\" data-mobile-css=\"padding-left:em;padding-right:em;\">\n<div class=\"wp-block-caxton-section relative\" style=\"grid-area:span 1\/span 8\"><div class=\"absolute absolute--fill\"><div class=\"absolute absolute--fill cover bg-center\" style=\"background-color:;background-image:linear-gradient( );\"><\/div><div class=\"absolute absolute--fill\" style=\"background-color:;background-image:linear-gradient( );opacity:1;\"><\/div><\/div><div class=\"relative caxton-section-block\" style=\"padding-top:5px;padding-left:5px;padding-bottom:5px;padding-right:5px\" data-mobile-css=\"padding-left:1em;padding-right:1em;\" data-tablet-css=\"padding-left:1em;padding-right:1em;\">\n<p><strong><a href=\"https:\/\/www.biorxiv.org\/alertsrss\" target=\"_blank\" rel=\"noreferrer noopener\">Journal Home<\/a><\/strong><\/p>\n<\/div><\/div>\n\n\n\n<div class=\"wp-block-caxton-section relative\" style=\"grid-area:span 1\/span 4\"><div class=\"absolute absolute--fill\"><div class=\"absolute absolute--fill cover bg-center\" style=\"background-color:;background-image:linear-gradient( );\"><\/div><div class=\"absolute absolute--fill\" style=\"background-color:;background-image:linear-gradient( );opacity:1;\"><\/div><\/div><div class=\"relative caxton-section-block\" style=\"padding-top:5px;padding-left:5px;padding-bottom:5px;padding-right:5px\" data-mobile-css=\"padding-left:1em;padding-right:1em;\" data-tablet-css=\"padding-left:1em;padding-right:1em;\">\n<p><strong><a href=\"http:\/\/connect.biorxiv.org\/biorxiv_xml.php?subject=bioinformatics\" target=\"_blank\" rel=\"noreferrer noopener\">RSS<\/a><\/strong><\/p>\n<\/div><\/div>\n<\/div><\/div>\n\n\n<ul class=\"has-dates has-authors has-excerpts wp-block-rss\"><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723611v1?rss=1'>ClaroAI-Bench: Evaluating Agentic Scientific Reproducibility on Real Biomedical Papers<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by O&#039;Connell, K. A.<\/span><div class=\"wp-block-rss__item-excerpt\">We introduce ClaroAI-Bench, an evaluation suite for measuring AI agents&#039; ability to reproduce computational findings from published biomedical research. The benchmark comprises 35 real NIH-funded papers spanning five modalities (genomics, imaging, clinical\/EHR, epidemiology, wet-lab) scored on a five-dimension rubric: data findability (D1), data accessibility (D2), code availability (D3), environment reconstructability (D4), and results reproducibility (D5). Each task requires an agent to locate data, obtain code, reconstruct the compute environment, execute the analysis, and verify results against published claims, mirroring the [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.07.723601v1?rss=1'>CausalKnowledgeTrace: A Novel Computational Framework for Automated Literature-Based Causal Graph Construction and Evidence-Based Variable Selection in Biomedical Research<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Upadhayaya, R., Pradhan, M. M., Metzger, V. T., Malec, S. A.<\/span><div class=\"wp-block-rss__item-excerpt\">Background: Variable selection for causal inference from observational biomedical data is challenging, as overlooking confounders or conditioning on colliders leads to biased estimates. While vast causal knowledge exists in biomedical literature, manually extracting this information for principled variable selection is impractical at scale. Methods: We developed CausalKnowledgeTrace, a Python-based computational framework with Django web interface that systematically leverages structured causal knowledge from the Semantic MEDLINE Database (SemMedDB) to inform variable selection in causal studies. The system implements a six-stage analysis [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.07.723640v1?rss=1'>A novel vaccine and drug targets for global eradication of bovine tuberculosis: Holistic frameworks for construction of a potent vaccine and identification of drug targets<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Pawar, P., samarasinghe, s., Kulasiri, D.<\/span><div class=\"wp-block-rss__item-excerpt\">Bovine tuberculosis (TB), caused by Mycobacterium bovis, has become a global concern over the last two decades. Bovine TB primarily affects cattle, but other domestic livestock are also affected and it is more common in less developed and developing countries. The significant loss of livestock leads to trade restrictions and economic crises. Zoonotic potential of bovine TB raises health concerns for the public. Currently, no effective treatment is available and animal slaughtering is usually undertaken to reduce the burden of [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.07.723626v1?rss=1'>Engineering a pacemaker-driven human mini-heart guided by spatial multi-omics of sinoatrial node development<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Zhu, J., Zhang, Z., Gregorio, R. D., Chang, K., Dong, X., Banerjee, K., Liu, K., Rea-Moreno, M., Kizilbash, M., Alonso, A., Liu, J., Tsai, S., Chen, Y.-W., Evans, T., Chen, S.<\/span><div class=\"wp-block-rss__item-excerpt\">The human sinoatrial node (SAN) functions as the primary pacemaker of the heart and coordinates the hierarchical electrical activity that drives cardiac contraction. However, experimental systems capable of reconstructing pacemaker driven cardiac organization in human tissues remain limited. Here we integrate spatial multi-omics of the human fetal SAN with stem cell engineering to generate pacemaker organoids (Sinoids) and assemble them into a pacemaker driven human mini-heart composed of sinoatrial, atrial and ventricular cardiac modules. High-resolution spatial transcriptomics and single nucleus [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723719v1?rss=1'>Carbohydrate active enzymes in Pectobacteriaceae: coevolving enzyme sets and host adaptation<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Hobbs, E. E. M., Gloster, T. M., Pritchard, L.<\/span><div class=\"wp-block-rss__item-excerpt\">Many phytopathogenic bacteria have evolved large, diverse arsenals of Carbohydrate Active enZymes (CAZymes) that liberate simple sugars, and thus nutrition and energy, from the complex lignocellulosic matrices of their plant hosts. The CAZyme arsenals of these phytopathogens are expected to be influenced by and adapted to the cell wall composition of their plant hosts. The solutions these organisms have reached for the problem of degrading plant material may help us understand their host ranges and present a rich source of [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.07.723665v1?rss=1'>Dual-view Guided Context-aware Network for Automated Bone Lesion Segmentation and Quantification in Whole-body SPECT<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by chen, w., Yang, X., Lu, J., Miao, M., Huang, Y., Zheng, S., Zhang, C., Xie, L., Zhang, Y.<\/span><div class=\"wp-block-rss__item-excerpt\">Whole-body SPECT bone scintigraphy reflects skeletal metabolic activity throughout the body and plays an indispensable role in the screening, treatment evaluation, and prognostic assessment of bone metastases in tumors. However, the automatic detection and segmentation of hypermetabolic bone lesions remain challenging due to low contrast, limited spatial resolution, and complex lesion distributions. In this study, we proposed Bone-Segnet, a dual-view guided automatic segmentation network for hypermetabolic bone lesions that integrated multi-scale feature modeling, global context modeling, and view-conditioned modulation. Pixel-level [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723701v1?rss=1'>Receptor-Anchored Olfaction Representation through Perception-Consistent Metric Learning<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Tian, C., Wang, J., Hou, J., Liu, W., Luo, Y., Wang, Y., Yang, L., Lin, W.<\/span><div class=\"wp-block-rss__item-excerpt\">Olfactory perception arises from distributed activation across hundreds of olfactory receptors (ORs), yet our understanding of this landscape remains constrained by the scarcity of OR affinity measurements. Here, we present Receptor-Anchored Metric Supervision (RAMS), a transfer learning framework using perceptual consistency as weak supervision to predict OR activation spectra. RAMS fine-tunes a pretrained drug-target affinity model by imposing constraints derived from olfactory perception, where similar odorants are encouraged to exhibit similar OR activations. It transfers protein-ligand interaction knowledge learned from [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723807v1?rss=1'>MicrobeMS &#8211; A MATLAB Toolbox for Microbial Identification Based on Mass Spectrometry<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Lasch, P.<\/span><div class=\"wp-block-rss__item-excerpt\">Over the last two decades, matrix-assisted laser desorption\/ionization time-of-flight mass spectrometry (MALDI-ToF MS) has become the standard method for identifying bacteria and has found a wide range of applications, especially in clinical microbiology. The method&#039;s high taxonomic resolution, minimal sample preparation, and complete, ready-to-use commercial systems, which include instrumentation, experimental protocols, spectral databases, and identification analysis software, were key factors in the success of MALDI-ToF MS as the standard for identifying microorganisms in routine diagnostic laboratories. However, despite the availability [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.12.724499v1?rss=1'>AI-derived Protein Structures Validation: AlphaFold2 Models in the Twilight Zone<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Griffin, P., Deganutti, G., Jadeja, K., Idigbe, C., Pipito&#039;, L., Mejuto, L., Ng, C. P., Peck, S., Greaves, J., Reynolds, C. A.<\/span><div class=\"wp-block-rss__item-excerpt\">In any field, unquestioningly accepting artificial intelligence (AI) results should be considered bad practise. Here, we devised a comparative modelling-based strategy for validating protein structures that exploits the well-known observation that protein folds are far more conserved than protein sequences. We identify proteins with a similar fold to the AlphaFold-generated query protein and determine their structural alignment to the query. The hypothesis is that if the sequence alignment coincides with the structural alignment, then the structure is validated. The strategy [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.11.724344v1?rss=1'>Detecting and quantifying overparametrization in RNA language models with REDIAL<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Teng, D., Qiu, Y., Sakthivel, G., Aranganathan, A., Herron, L., Tiwary, P.<\/span><div class=\"wp-block-rss__item-excerpt\">While RNA language models (LMs) have served as foundation models (FMs) to advanced structural prediction, their evaluation relies heavily on supervised downstream tasks. Such tasks can often mask FM inefficiencies and reflect downstream training set memorization. To address this, here we introduce REDIAL (RNA Embedding perturbation Diagnostics for Language models), a zero-shot, unsupervised framework designed to extract coevolutionary signals directly from the high-dimensional latent spaces of RNA language models. By applying REDIAL, we uncover stark, layer-wise disparities in how popular [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.11.724398v1?rss=1'>On the applicability domain of HADDOCK3 for protein-aptamer docking: documented failure modes from a 5&#215;7 cross-target screening matrix and a 1676 aa receptor case study (P01031)<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Dohi, E.<\/span><div class=\"wp-block-rss__item-excerpt\">We screened a 5 receptor x 7 aptamer = 35-cell cross-target matrix with HADDOCK3 [1] under blind ambiguous-interaction-restraint (AIR) protocols on AlphaFold-modelled receptors. The screen surfaced 12 operationally distinct failure modes (collapsing to about 8 conceptual classes; Section 3.1). The K_D-calibration subset is n = 4 cells with literature K_D records under matched assay conditions; the broader cohort includes &gt;= 6 biological cognate or intended-cognate cells. The principal case study is P01031 (complement C5, 1676 aa, &gt;= 12 structural domains): [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723784v1?rss=1'>Unified sampling framework and experimental benchmarking of sequence- and structure-based protein models<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Spinner, A., Notin, P., Berry, S., Cortade, D., Sisson, Z., Ikonomova, S., Ross, D., Marks, D.<\/span><div class=\"wp-block-rss__item-excerpt\">Generative models are increasingly used for protein design, but the lack of standardized evaluation frameworks limits comparison across model classes and hinders translation to experimental success. Here, we introduce a unified sampling and benchmarking framework that enables controlled sequence generation across alignment, protein language, and structure-based models, and apply it to Tobacco etch virus (TEV) protease. Across hundreds of thousands of designed sequences, different models explore distinct regions of sequence space with no clear computational selection metrics to assess enzymatic [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723796v1?rss=1'>Corpus-wide causality: Algorithm design &amp; application for aggregating gene-disease causal evidence<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Bansal, N., Parsodkar, A. P., Pathak, A., Narayanan, M.<\/span><div class=\"wp-block-rss__item-excerpt\">Identifying causal relationships, rather than mere associations, is essential for applications such as finding genes driving diseases and guiding drug discovery towards disease mechanisms rather than symptom management. Although many studies extract biomedical relations from large literature corpora such as PubMed, fewer focus on causal relations from abstracts, and fewer still summarize corpus-level evidence for causal links. LLMs (Large Language Models) are increasingly used for biomedical summarization and relation extraction, but explicit benchmarks comparing generalized LLMs against specialized, domain-aware methods [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.11.724158v1?rss=1'>The elusive resistome: a global comparison reveals large discrepancies among detection pipelines<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Inda-Diaz, J. S., Adegoke, F., Lo\u0308ber, U., Jarquin-Diaz, V. H., Duan, Y., Bengtsson-Palme, J., Ugarcina Perovic, S., Coelho, L. P.<\/span><div class=\"wp-block-rss__item-excerpt\">Identifying antibiotic resistance genes (ARGs) from metagenomic data is critical for studying antimicrobial resistance across microbial communities and pathogens. However, there is no standardized methodology for ARG annotation. Here, we compare ten commonly used ARG detection pipelines by analysing over 270 million prokaryotic genes from the Global Microbial Gene Catalogue across 13 distinct habitats. We observed up to a 45-fold difference in the number of reported ARGs, with a mean Jaccard index of only 16% between pipelines. Pipeline selection profoundly [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723925v1?rss=1'>Temporal-deviation-driven community detection uncovers early-warning signals for critical transitions in complex diseases<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Wang, L., Xu, M., Yan, H., Zheng, Y., Feng, S., Zhang, Y., Li, C., Qiu, D., Hu, B., Wan, X., Zhang, F.<\/span><div class=\"wp-block-rss__item-excerpt\">Early detection of critical transitions in complex diseases is crucial for timely clinical intervention. However, as patients often provide only a single snapshot, identifying sample-specific early-warning signals (EWS) from a dynamical evolution perspective remains challenging, coupled with high-dimensional noise amplification. Here, we present TD-COM, a framework for detecting personalized EWS of critical transitions via single-sample community detection. By constructing a temporal perturbation map STDN, TD-COM captures latent dynamical perturbations inferred from static individual profiles. Synergizing these temporal-deviation signals with static [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.09.723882v1?rss=1'>PARiS: Probabilistic Assignment and Repartitioning of isomiR Sequences: A data-driven method for denoising isomiR read count data<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Swan, H. K., Baran, A. M., Aparicio-Puerta, E., Halushka, M. K., Jun, S.-H., McCall, M. N.<\/span><div class=\"wp-block-rss__item-excerpt\">MicroRNAs (miRNAs) are non-coding RNAs, approximately 18 &#8211; 24 nucleotides in length, with important gene regulatory functions. In small RNA sequencing (sRNA-seq), observed iso- forms of miRNA, called isomiRs, arise from my biological and technical processes. Alterations in isomiR expression has been linked to a wide variety of human diseases, from cancers to neurological diseases. However, it is difficult to distinguish be- tween technical and biological isomiRs. We present PARiS, an algorithm for the Probabilistic Assignment and Repartitioning of isomiR [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723458v1?rss=1'>SigBridgeR: An Integrative Framework and Toolkit for Comprehensive Screening and Benchmarking of Phenotype-Associated Cell Subpopulations in Single-Cell Transcriptomics<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Yang, Y., Yan, Z., Qian, H., Du, L., Wang, C., Peng, Y., Bu, X., Zhou, J.-G., Wang, S.<\/span><div class=\"wp-block-rss__item-excerpt\">Single-cell RNA sequencing has revolutionized our understanding of cellular heterogeneity, yet linking specific cell subpopulations to clinically relevant phenotypes remains a persistent challenge. Although multiple computational methods have been developed to bridge this gap, they are typically implemented as standalone packages with heterogeneous preprocessing pipelines, incompatible parameter conventions, and divergent output formats, thereby hindering rigorous cross-method benchmarking and reproducible multi-method workflows. Here, we present SigBridgeR, an extensible R framework and comprehensive toolkit that currently unifies eight state-of-the-art phenotype-associated cell screening [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.11.724222v1?rss=1'>A Conditional Random Field approach for de novo reconstruction of bacterial haplotypes from a de Bruijn graph representation<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Steyaert, A., Van Hecke, M., Marchal, K., Fostier, J.<\/span><div class=\"wp-block-rss__item-excerpt\">Background: Detecting distinct bacterial strains in a mixed sample is an important, yet less well-developed aspect of metagenomic research. Several methods exist that successfully retrieve a de novo reconstruction of viral strains. However, the reconstruction of bacterial haplotypes poses its own distinct challenges, and methods that successfully reconstruct full genome-length bacterial strains de novo are scarce. Here, we develop HaploDetox, a method for de novo bacterial haplotype reconstruction from short reads. We use a de Bruijn graph representation of the [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.07.723509v1?rss=1'>Culsma: A Formal Language for Laboratory Protocols<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Chen, Y., Sun, M., Tadepally, L., Wang, J., Barcenilla, H., Gonzalez, L., Brodin, P.<\/span><div class=\"wp-block-rss__item-excerpt\">The application of artificial intelligence to biomedical research increasingly depends on iterative cycles in which AI systems analyze experimental data, propose follow-up conditions, and drive automated execution at scale, a paradigm central to Bio-AI and autonomous laboratory science. For such cycles to operate, laboratory protocols must be expressed in a form that is simultaneously human-readable and machine-executable. Natural-language descriptions, the current standard in laboratory practice, do not satisfy this dual requirement. We present Culsma, a formal language and execution framework [&hellip;]<\/div><\/li><li class='wp-block-rss__item'><div class='wp-block-rss__item-title'><a href='https:\/\/www.biorxiv.org\/content\/10.64898\/2026.05.08.723814v1?rss=1'>Unlocking Open-Access Genomic and Transcriptomic Data: The First Bioinformatic Exploitation of Tunisian Durum Wheat Landraces Chili and Mahmoudi, Pioneering Data-Driven Research in North Africa<\/a><\/div><time datetime=\"2026-05-12T00:00:00-05:00\" class=\"wp-block-rss__item-publish-date\">May 12, 2026<\/time> <span class=\"wp-block-rss__item-author\">by Gdoura-Ben Amor, M., MATHLOUTHI, N. E. H., BELGUITH, I., DEROUICH, R.<\/span><div class=\"wp-block-rss__item-excerpt\">Durum wheat (Triticum turgidum subsp. durum) is a Mediterranean dietary staple threatened by accelerating climate change, yet the genomic basis of adaptation in North African landraces remains poorly characterised. We present the first integrated whole-genome sequencing (WGS) and RNA-seq study of two contrasting Tunisian landraces: humid-adapted Chili and arid-adapted Mahmoudi. From 27,777 high-confidence SNPs, permutation-based FST outlier analysis (1,000 shuffles) identified 46 selection hotspots across six chromosomes, with a peak signal on chromosome 6B (FST = 0.833; p = 0.013). [&hellip;]<\/div><\/li><\/ul>\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-wide\"\/>\n\n\n\n<h4 class=\"wp-block-heading\">Related Journals<\/h4>\n\n\n<ul class=\"su-siblings\"><li class=\"page_item page-item-3099\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-biochemistry\/\">BioRxiv Biochemistry<\/a><\/li>\n<li class=\"page_item page-item-3132\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-biophysics\/\">BioRxiv Biophysics<\/a><\/li>\n<li class=\"page_item page-item-3188\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-cancer-biology\/\">BioRxiv Cancer Biology<\/a><\/li>\n<li class=\"page_item page-item-3190\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-pharmacology-and-toxicology\/\">BioRxiv Pharmacology and Toxicology<\/a><\/li>\n<li class=\"page_item page-item-3114\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-systems-biology\/\">BioRxiv Systems Biology<\/a><\/li>\n<li class=\"page_item page-item-3193\"><a href=\"https:\/\/kermitmurray.com\/msblog\/links\/journal-feeds\/biochemistry-journal-feeds\/biorxiv\/biorxiv-zoology\/\">BioRxiv Zoology<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Related Journals<\/p>\n","protected":false},"author":1,"featured_media":2652,"parent":3087,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":"","_links_to":"","_links_to_target":""},"class_list":["post-3112","page","type-page","status-publish","has-post-thumbnail","hentry","entry"],"_links":{"self":[{"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/pages\/3112","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/comments?post=3112"}],"version-history":[{"count":1,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/pages\/3112\/revisions"}],"predecessor-version":[{"id":3113,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/pages\/3112\/revisions\/3113"}],"up":[{"embeddable":true,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/pages\/3087"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/media\/2652"}],"wp:attachment":[{"href":"https:\/\/kermitmurray.com\/msblog\/wp-json\/wp\/v2\/media?parent=3112"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}