Lidé
Ing. Alikhan Anuarbekov
Všechny publikace
circGPAcorr: an integrative tool for functional annotation of circular RNAs using expression data
- Autoři: Bc. Petr Ryšavý, MSc., Ph.D., Ing. Alikhan Anuarbekov, Dostálová Merkerová, M., doc. Ing. Jiří Kléma, Ph.D.,
- Publikace: BioData Mining. 2025, 18(1), ISSN 1756-0381.
- Rok: 2025
- DOI: 10.1186/s13040-025-00468-3
- Odkaz: https://doi.org/10.1186/s13040-025-00468-3
- Pracoviště: Katedra počítačů, Intelligent Data Analysis
-
Anotace:
Circular RNAs play a crucial role in cell development and serve as biomarkers in many diseases. Nevertheless, the function of many circular RNAs remains unknown. This function can be inferred from sponging and silencing interactions with micro RNAs and messenger RNAs. We recently proposed a network-based circRNA functional annotation tool, circGPA. However, validation data for RNA interactions are often sparse and predicted interactions contain many false positives. To address this issue, we propose an extended algorithm named circGPAcorr, which uses expression data to weight the interactions, resulting in more precise functional annotation. To assess the significance of the results, the p-value is calculated using reduction to circGPA, a generating-polynomial-based method. We show that the problem is #P-hard, and thus computationally difficult. The circGPAcorr algorithm is tested on publicly available myelodysplastic syndromes expression data, providing gene ontology annotations that align with the literature on myelodysplastic syndromes. At the same time, we demonstrate its performance in the circRNA-disease annotation task.
Utilizing RNA-seq Data in Monotone Iterative Generalized Linear Model to Elevate Prior Knowledge Quality of the CircRNA-miRNA-mRNA Regulatory Axis
- Autoři: Ing. Alikhan Anuarbekov, doc. Ing. Jiří Kléma, Ph.D.,
- Publikace: BMC Bioinformatics. 2025, 26(1), 1-35. ISSN 1471-2105.
- Rok: 2025
- DOI: 10.1186/s12859-025-06161-w
- Odkaz: https://doi.org/10.1186/s12859-025-06161-w
- Pracoviště: Katedra počítačů, Intelligent Data Analysis
-
Anotace:
BackgroundCurrent experimental data on RNA interactions remain limited, particularly for non-coding RNAs, many of which have only recently been discovered and operate within complex regulatory networks. Researchers often rely on in-silico interaction detection algorithms, such as TargetScan, which are based on biochemical sequence alignment. However, these algorithms have limited performance. RNA-seq expression data can provide valuable insights into regulatory networks, especially for understudied interactions such as circRNA-miRNA-mRNA. By integrating RNA-seq data with prior interaction networks obtained experimentally or through in-silico predictions, researchers can discover novel interactions, validate existing ones, and improve interaction prediction accuracy.ResultsThis paper introduces Pi-GMIFS, an extension of the generalized monotone incremental forward stagewise (GMIFS) regression algorithm that incorporates prior knowledge. The algorithm first estimates prior response values through a prior-only regression, interpolates between these prior values and the original data, and then applies the GMIFS method. Our experimental results on circRNA-miRNA-mRNA regulatory interaction networks demonstrate that Pi-GMIFS consistently enhances precision and recall in RNA interaction prediction by leveraging implicit information from bulk RNA-seq expression data, outperforming the initial prior knowledge.ConclusionPi-GMIFS is a robust algorithm for inferring acyclic interaction networks when the variable ordering is known. Its effectiveness was confirmed through extensive experimental validation. We proved that RNA-seq data of a representative size help infer previously unknown interactions available in TarBase v9 and improve the quality of circRNA disease annotation.