• 제목/요약/키워드: Gene ontology analysis

검색결과 228건 처리시간 0.028초

Integrative Analysis of Microarray Data with Gene Ontology to Select Perturbed Molecular Functions using Gene Ontology Functional Code

  • Kim, Chang-Sik;Choi, Ji-Won;Yoon, Suk-Joon
    • Genomics & Informatics
    • /
    • 제7권2호
    • /
    • pp.122-130
    • /
    • 2009
  • A systems biology approach for the identification of perturbed molecular functions is required to understand the complex progressive disease such as breast cancer. In this study, we analyze the microarray data with Gene Ontology terms of molecular functions to select perturbed molecular functional modules in breast cancer tissues based on the definition of Gene ontology Functional Code. The Gene Ontology is three structured vocabularies describing genes and its products in terms of their associated biological processes, cellular components and molecular functions. The Gene Ontology is hierarchically classified as a directed acyclic graph. However, it is difficult to visualize Gene Ontology as a directed tree since a Gene Ontology term may have more than one parent by providing multiple paths from the root. Therefore, we applied the definition of Gene Ontology codes by defining one or more GO code(s) to each GO term to visualize the hierarchical classification of GO terms as a network. The selected molecular functions could be considered as perturbed molecular functional modules that putatively contributes to the progression of disease. We evaluated the method by analyzing microarray dataset of breast cancer tissues; i.e., normal and invasive breast cancer tissues. Based on the integration approach, we selected several interesting perturbed molecular functions that are implicated in the progression of breast cancers. Moreover, these selected molecular functions include several known breast cancer-related genes. It is concluded from this study that the present strategy is capable of selecting perturbed molecular functions that putatively play roles in the progression of diseases and provides an improved interpretability of GO terms based on the definition of Gene Ontology codes.

프마이크로어레이 데이터의 유전자 집합 및 대사 경로 분석 (Gene Set and Pathway Analysis of Microarray Data)

  • 김선영
    • 유전체소식지
    • /
    • 제6권1호
    • /
    • pp.29-33
    • /
    • 2006
  • Gene set analysis is a new concept and method. to analyze and interpret microarray gene expression data and tries to extract biological meaning from gene expression data at gene set level rather than at gene level. Compared with methods which select a few tens or hundreds of genes before gene ontology and pathway analysis, gene set analysis identifies important gene ontology terms and pathways more consistently and performs well even in gene expression data sets with minimal or moderate gene expression changes. Moreover, gene set analysis is useful for comparing multiple gene expression data sets dealing with similar biological questions. This review briefly summarizes the rationale behind the gene set analysis and introduces several algorithms and tools now available for gene set analysis.

  • PDF

An Efficient Functional Analysis Method for Micro-array Data Using Gene Ontology

  • Hong, Dong-Wan;Lee, Jong-Keun;Park, Sung-Soo;Hong, Sang-Kyoon;Yoon, Jee-Hee
    • Journal of Information Processing Systems
    • /
    • 제3권1호
    • /
    • pp.38-42
    • /
    • 2007
  • Microarray data includes tens of thousands of gene expressions simultaneously, so it can be effectively used in identifying the phenotypes of diseases. However, the retrieval of functional information from a large corpus of gene expression data is still a time-consuming task. In this paper, we propose an efficient method for identifying functional categories of differentially expressed genes from a micro-array experiment by using Gene Ontology (GO). Our method is as follows: (1) The expression data set is first filtered to include only genes with mean expression values that differ by at least 3-fold between the two groups. (2) The genes are then ranked based on the t-statistics. The 100 most highly ranked genes are selected as informative genes. (3) The t-value of each informative gene is imposed as a score on the associated GO terms. High-scoring GO terms are then listed with their associated genes and represent the functional category information of the micro-array experiment. A system called HMDA (Hallym Micro-array Data analysis) is implemented on publicly available micro-array data sets and validated. Our results were also compared with the original analysis.

유전자군 분석의 방법론과 응용 (A Method for Gene Group Analysis and Its Application)

  • 이태원
    • 응용통계연구
    • /
    • 제25권2호
    • /
    • pp.269-277
    • /
    • 2012
  • 마이크로어레이 분석은 특이 발현하는 개별적인 유전자보다 유전자 온톨로지(Gene Ontology)와 같이 기능적 분류나 생물학적 경로(pathway)와 관련된 유전자군을 찾아내는 것이 그 해석의 용이성 때문에 최근 더욱 많은 연구가 진행되고 있다. 약물 처리에 의한 생물학적 반응을 연구할 때, 한 유전자군에 속하는 유전자들 각각의 특이 발현 여부의 유의성을 나타내는 $p$-value들을 취합하여 그 유전자군의 유의성을 결정하는 통계 검증 방법을 본 논문에서 소개하였다. 본 논문에 제시된 유전자군 분석(Gene group analysis) 방법은 Fisher's exact test나 permutation test와 같은 기존의 대표적인 방법들보다 더 정확하고 적용범위가 넓음을 실재 생물학 실험 자료의 분석을 통해 보였다. 제시된 유전자군 분석 방법은 SAS 프로그램으로 구현되었고 저자의 홈페이지(http://cafe.daum.net/go.analysis)에서 내려 받아 사용할 수 있다.

Discovery of Cellular RhoA Functions by the Integrated Application of Gene Set Enrichment Analysis

  • Chun, Kwang-Hoon
    • Biomolecules & Therapeutics
    • /
    • 제30권1호
    • /
    • pp.98-116
    • /
    • 2022
  • The small GTPase RhoA has been studied extensively for its role in actin dynamics. In this study, multiple bioinformatics tools were applied cooperatively to the microarray dataset GSE64714 to explore previously unidentified functions of RhoA. Comparative gene expression analysis revealed 545 differentially expressed genes in RhoA-null cells versus controls. Gene set enrichment analysis (GSEA) was conducted with three gene set collections: (1) the hallmark, (2) the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, and (3) the Gene Ontology Biological Process. GSEA results showed that RhoA is related strongly to diverse pathways: cell cycle/growth, DNA repair, metabolism, keratinization, response to fungus, and vesicular transport. These functions were verified by heatmap analysis, KEGG pathway diagramming, and direct acyclic graphing. The use of multiple gene set collections restricted the leakage of information extracted. However, gene sets from individual collections are heterogenous in gene element composition, number, and the contextual meaning embraced in names. Indeed, there was a limit to deriving functions with high accuracy and reliability simply from gene set names. The comparison of multiple gene set collections showed that although the gene sets had similar names, the gene elements were extremely heterogeneous. Thus, the type of collection chosen and the analytical context influence the interpretation of GSEA results. Nonetheless, the analyses of multiple collections made it possible to derive robust and consistent function identifications. This study confirmed several well-described roles of RhoA and revealed less explored functions, suggesting future research directions.

Gene Expression Profiling of Human Bronchial Epithelial (BEAS-2B) Cells Treated with Nitrofurantoin, a Pulmonary Toxicant

  • Kim, Youn-Jung;Song, Mee;Ryu, Jae-Chun
    • Molecular & Cellular Toxicology
    • /
    • 제3권4호
    • /
    • pp.222-230
    • /
    • 2007
  • Some drugs may be limited in their clinical application due to their propensity towards their adverse effects. Toxicogenomic technology represents a useful approach for evaluating the toxic properties of new drug candidates early in the drug discovery process. Nitrofurantoin (NF) is clinical chemotherapeutic agent and antimicrobial and used to treatment of urinary tract infections. However, NF has been shown to result in pulmonary toxic effects. In this research, we revealed the changing expression gene profiles in BEAS-2B, human bronchial epithelial cell line, exposed to NF by using human oligonucleotide chip. Through the clustering analysis of gene expression profiles, we identified 136 up-regulated genes and 379 down-regulated genes changed by more than 2-fold by NF. This study identifies several interesting targets and functions in relation to NF-induced toxicity through a gene ontology analysis method including biological process, cellular components, molecular function and KEGG pathway.

Gene Set Analyses of Genome-Wide Association Studies on 49 Quantitative Traits Measured in a Single Genetic Epidemiology Dataset

  • Kim, Jihye;Kwon, Ji-Sun;Kim, Sangsoo
    • Genomics & Informatics
    • /
    • 제11권3호
    • /
    • pp.135-141
    • /
    • 2013
  • Gene set analysis is a powerful tool for interpreting a genome-wide association study result and is gaining popularity these days. Comparison of the gene sets obtained for a variety of traits measured from a single genetic epidemiology dataset may give insights into the biological mechanisms underlying these traits. Based on the previously published single nucleotide polymorphism (SNP) genotype data on 8,842 individuals enrolled in the Korea Association Resource project, we performed a series of systematic genome-wide association analyses for 49 quantitative traits of basic epidemiological, anthropometric, or blood chemistry parameters. Each analysis result was subjected to subsequent gene set analyses based on Gene Ontology (GO) terms using gene set analysis software, GSA-SNP, identifying a set of GO terms significantly associated to each trait ($p_{corr}$ < 0.05). Pairwise comparison of the traits in terms of the semantic similarity in their GO sets revealed surprising cases where phenotypically uncorrelated traits showed high similarity in terms of biological pathways. For example, the pH level was related to 7 other traits that showed low phenotypic correlations with it. A literature survey implies that these traits may be regulated partly by common pathways that involve neuronal or nerve systems.

Fisher Criterion을 이용한 Gene Set Enrichment Analysis 기반 유의 유전자 집합의 검출 방법 연구 (Identifying Statistically Significant Gene-Sets by Gene Set Enrichment Analysis Using Fisher Criterion)

  • 김재영;신미영
    • 전자공학회논문지CI
    • /
    • 제45권4호
    • /
    • pp.19-26
    • /
    • 2008
  • Gene set enrichment analysis (GSEA)는 두 개의 클래스를 가지는 마이크로어레이 실험 데이터 분석을 위해 생물학적 특징을 기반으로 구성된 다양한 유전자-집합 중에서 두 클래스의 발현값들이 통계적으로 중요한 차이를 나타내는 유의한 유전자-집합을 추출하기 위한 분석 방법이다. 특히, 유전자에 대한 다양한 생물학적인 정보를 지닌 유전자 주석 데이터베이스(Cytogenetic Band, KEGG pathway, Gene Ontology 등)를 이용하여 마이크로어레이 실험에 사용된 전체 유전자 중 특정 기능을 가지는 유전자들을 그룹화하여 다양한 유전자-집합을 발굴하고, 각 유전자-집합 내에서 두 클래스간에 발현값의 차이를 참조하여 유의한 유전자들을 결정하여, 이를 기반으로 통계적으로 유의한 유전자-집합들을 최종 검출하는 방법이다. 본 논문에서는 GSEA 분석 과정에서 현재 주로 사용되고 있는 signal-to-noise ratio 기반 유전자 서열화(gene ranking) 방법 대신에, Fisher criterion을 이용한 유전자 서열화 방법을 적용함으로써 기존의 GSEA 방법에서 추출하지 못한 생물학적으로 의미 있는 새로운 유의 유전자-집합을 추출하는 방법을 제안하고자 한다. 또한, 제안한 방법의 성능을 고찰하기 위하여 공개된 Leukemia 관련 마이크로어레이 실험 데이터 분석에 적용하였으며, 기존의 알려진 결과와 비교 분석함으로써 제안한 방법의 유용성을 검증하고자 하였다.

시스템 약리학적 분석에 의한 상산의 암전이 억제 효과 (Systems Pharmacological Analysis of Dichroae Radix in Anti-Tumor Metastasis Activity)

  • 이지예;신아연;김학군;안원근
    • 대한한의학방제학회지
    • /
    • 제31권4호
    • /
    • pp.295-313
    • /
    • 2023
  • Objectives : While treatments for cancer are advancing, the development of effective treatments for cancer metastasis, the main cause of cancer patient death, remains insufficient. Recent studies on Dichroae Radix have revealed that its active ingredients have the potential to inhibit cancer metastasis. This study aimed to investigate the cancer metastasis inhibitory effect of Dichroae Radix using network pharmacological analysis. Methods : The active compounds of Dichroae Radix have been identified using Traditional Chinese Medicine System Pharmacology Database and Analysis Platform. The UniProt database was used to collect each of information of all target proteins associated with the active compounds. To find the bio-metabolic processes associated with each target, the DAVID6.8 Gene Functional classifier tool was used. Compound-Target and Target-Pathway networks were analyzed via Cytoscape 3.40. Results : In total, 25 active compounds and their 62 non-redundant targets were selected through the TCMSP database and analysis platform. The target genes underwent gene ontology and pathway enrichment analysis. The gene list applied to the gene ontology analysis revealed associations with various biological processes, including signal transduction, chemical synaptic transmission, G-protein-coupled receptor signaling pathways, response to xenobiotic stimulus, and response to drugs, among others. A total of eleven genes, including HSP90AB1, CALM1, F2, AR, PAKACA, PTGS2, NOS2, RXRA, ESR1, ESR2, and NCOA1, were found to be associated with biological pathways related to cancer metastasis. Furthermore, nineteen of the active compounds from Dichroae Radix were confirmed to interact with these genes. Conclusions : The results provide valuable insights into the mechanism of action and molecular targets of Dichroae Radix. Notably, Berberine, the main active ingredient of Dichroae Radix, plays a significant role in degrading AR proteins in advanced prostate cancer. Further studies and validations can provide crucial data to advance cancer metastasis prevention and treatment strategies.

BINGO: Biological Interpretation Through Statistically and Graph-theoretically Navigating Gene $Ontology^{TM}$

  • Lee, Sung-Geun;Yang, Jae-Seong;Chung, Il-Kyung;Kim, Yang-Seok
    • Molecular & Cellular Toxicology
    • /
    • 제1권4호
    • /
    • pp.281-283
    • /
    • 2005
  • Extraction of biologically meaningful data and their validation are very important for toxicogenomics study because it deals with huge amount of heterogeneous data. BINGO is an annotation mining tool for biological interpretation of gene groups. Several statistical modeling approaches using Gene Ontology (GO) have been employed in many programs for that purpose. The statistical methodologies are useful in investigating the most significant GO attributes in a gene group, but the coherence of the resultant GO attributes over the entire group is rarely assessed. BINGO complements the statistical methods with graph-theoretic measures using the GO directed acyclic graph (DAG) structure. In addition, BINGO visualizes the consistency of a gene group more intuitively with a group-based GO subgraph. The input group can be any interesting list of genes or gene products regardless of its generation process if the group is built under a functional congruency hypothesis such as gene clusters from DNA microarray analysis.