Skip to content

Commit 6547932

Browse files
committed
update
1 parent adaacbc commit 6547932

File tree

3 files changed

+18
-102
lines changed

3 files changed

+18
-102
lines changed

content/contribution-data-visualization.md

Lines changed: 5 additions & 27 deletions
Original file line numberDiff line numberDiff line change
@@ -29,16 +29,8 @@ To address the fragmentation of the R plotting landscape, we developed a suite o
2929
* **[ggtangle](https://github.com/YuLab-SMU/ggtangle) & [ggflow](https://github.com/YuLab-SMU/ggflow) (Relational & Process Flow):** **ggtangle** reimagines network visualization within the tidy framework, while **ggflow** provides a dedicated grammar for flowcharts and transition processes, bridging the gap between static relationships and dynamic workflows.
3030
* **[ggfun](https://cran.r-project.org/package=ggfun) (UX & Utilities):** Provides foundational utilities that enhance the developer and user experience across the entire ecosystem.
3131

32-
<table style="border:none; font-size: 90%; width:100%;">
33-
<tr style="border:none;">
34-
<td style="border:none;width:25%">
35-
<img src="/images/data-visualization/methodology-foundations.svg" style="width:100%; height:auto;"/>
36-
</td>
37-
<td style="border:none;">
32+
3833
<strong>Logic Unification:</strong> By treating plots as first-class algebraic objects, these tools allow for the "compositional" creation of complex figures that were previously impossible or required significant manual effort.
39-
</td>
40-
</tr>
41-
</table>
4234

4335
## 2. Specialized Academic Domains: Deep Integration
4436

@@ -52,16 +44,9 @@ Beyond general-purpose utilities, we have pioneered visualization standards in s
5244
* **Single-Cell & Fine-Scale Omics:** [**ggsc**](https://github.com/YuLab-SMU/ggsc) and [**ivolcano**](https://github.com/YuLab-SMU/ivolcano) address the unique needs of high-resolution data, providing specialized geometries and interactive exploration for single-cell clusters and differential expression.
5345
* **Glycobiology & Complex Carbohydrates:** [**gglycan**](https://github.com/YuLab-SMU/gglycan) introduces a grammar for visualizing complex glycan structures. By supporting standard symbolic nomenclature (e.g., SNFG), it enables researchers to integrate glycomic data with other biological layers, bridging a critical gap in multi-omics synthesis.
5446

55-
<table style="border:none; font-size: 90%; width:100%;">
56-
<tr style="border:none;">
57-
<td style="border:none;width:25%">
58-
<img src="/images/data-visualization/academic-domains.svg" style="width:100%; height:auto;"/>
59-
</td>
60-
<td style="border:none;">
47+
6148
<strong>Domain Leadership:</strong> These tools are not mere plotting scripts but are **interpretative frameworks** cited in thousands of studies across <em>Nature</em>, <em>Science</em>, and <em>Cell</em>.
62-
</td>
63-
</tr>
64-
</table>
49+
6550

6651
## 3. Semantic Enrichment & Scientific Communication
6752

@@ -73,16 +58,9 @@ To bridge the gap between abstract data and human intuition, we developed tools
7358
* **[emojifont](https://cran.r-project.org/package=emojifont), [shadowtext](https://cran.r-project.org/package=shadowtext), & [meme](https://cran.r-project.org/package=meme):** Enhancing semantic storytelling through advanced typography and cultural icons. These tools improve optical clarity through text halos and allow for creative, engaging data interaction, bridging the gap between formal analysis and impactful communication.
7459
* **[hexSticker](https://cran.r-project.org/package=hexSticker):** Revolutionizing how R developers brand their work. **hexSticker** has established the "Hex Logo" as the universal symbol of professional R package development.
7560

76-
<table style="border:none; font-size: 90%; width:100%;">
77-
<tr style="border:none;">
78-
<td style="border:none;width:25%">
79-
<img src="/images/data-visualization/semantic-enrichment.svg" style="width:100%; height:auto;"/>
80-
</td>
81-
<td style="border:none;">
61+
8262
<strong>Scientific Communication:</strong> These tools empower researchers to communicate complex data with clarity, impact, and professional polish, fostering broader adoption of open-source science.
83-
</td>
84-
</tr>
85-
</table>
63+
8664

8765
## Community Impact & Influence
8866

content/contribution-knowledge-mining.md

Lines changed: 10 additions & 57 deletions
Original file line numberDiff line numberDiff line change
@@ -24,103 +24,56 @@ Translating massive omics datasets into clinical insights requires more than sim
2424

2525
Before knowledge can be mined, it must be quantified. We established the mathematical prerequisites for biological discovery by developing semantic similarity measures that map fuzzy biological concepts onto rigorous mathematical spaces.
2626

27-
<table style="border:none; font-size: 90%; width:100%;">
28-
<tr style="border:none;">
29-
<td style="border:none;width:25%">
30-
<a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/26/7/976"><img src="/images/knowledge-mining/semsim.png" style="width:100%; max-width:1000px;"/></a>
31-
</td>
32-
<td style="border:none;">
27+
3328
<strong>Semantic Quantification:</strong> Through [**GOSemSim**](https://bioconductor.org/packages/GOSemSim), [**DOSE**](https://bioconductor.org/packages/DOSE), and [**meshes**](https://bioconductor.org/packages/meshes), we provided the first comprehensive software suite for measuring functional similarity among gene products across Gene Ontology, Disease Ontology, and MeSH domains. This work, starting from <em>Bioinformatics</em> (2010), enables the prediction of gene functions, disease-gene associations, and drug-repurposing candidates by leveraging the underlying semantic structure of biomedical knowledge.
34-
</td>
35-
</tr>
36-
</table>
29+
3730

3831

3932
## Pillar 2: The Universal Enrichment Framework — A Paradigm Shift in Theme Discovery
4033

4134
A cornerstone of our contribution is the creation of a **universal mechanistic engine** for functional genomics. By establishing [**clusterProfiler**](https://bioconductor.org/packages/clusterProfiler), we shifted the field from static gene-list analysis to dynamic discovery.
4235

43-
<table style="border:none; font-size: 90%; width:100%;">
44-
<tr style="border:none;">
45-
<td style="border:none;width:25%">
46-
<a href="https://doi.org/10.1016/j.xinn.2021.100141"><img src="/images/knowledge-mining/graphic-abstract-The-Innovation-2021.jpg" style="width:100%; max-width:1000px;"/></a>
47-
</td>
48-
<td style="border:none;">
4936
<strong>Comparative Theme Discovery:</strong> We pioneered the concept of **Comparative Biological Theme Analysis** (<em>OMICS</em> 2012), allowing researchers to compare functional profiles across complex experimental designs—multiple time points, genotypes, and drug treatments—simultaneously. This paradigm has been integrated into over 40 bioinformatics tools and cited over 40,000 times, becoming the <em>de facto</em> global standard as detailed in our **<em>Nature Protocols</em>** (2024) feature.
5037
<br><br>
5138
<strong>Universal Extensibility:</strong> **clusterProfiler** provides a generic platform that breaks the model-organism barrier, supporting thousands of species and any custom functional annotation, from KEGG and GO to Reactome and disease ontologies.
52-
</td>
53-
</tr>
54-
</table>
5539

5640

5741
## Pillar 3: Microbial Functional Architecture — Microbiome & Metagenomics
5842

5943
The microbiome represents one of the final frontiers in functional discovery. Our team has developed specialized infrastructures to address the unique challenges of inter-species interactions and metabolic flow in microbial communities.
6044

61-
<table style="border:none; font-size: 90%; width:100%;">
62-
<tr style="border:none;">
63-
<td style="border:none;width:25%">
64-
<a href="https://github.com/YuLab-SMU/MicrobiotaProcess"><img src="/images/knowledge-mining/microbiome.png" style="width:100%; max-width:1000px;"/></a>
65-
</td>
66-
<td style="border:none;">
45+
6746
<strong>Tidy Ecological Mining:</strong> Through [**MicrobiotaProcess**](https://bioconductor.org/packages/MicrobiotaProcess) and [**MicrobiomeProfiler**](https://bioconductor.org/packages/MicrobiomeProfiler), we established a **tidy framework** for microbiome data mining. **MicrobiotaProcess** provides a comprehensive pipeline for ecological analysis (alpha/beta diversity, biomarker discovery), while **MicrobiomeProfiler** enables functional enrichment analysis specifically tailored to the metabolic and genomic context of microbial datasets.
68-
</td>
69-
</tr>
70-
</table>
47+
7148

7249

7350
## Pillar 4: Deciphering Cistromic Regulation — From Genomic Location to Function
7451

7552
Biological function is often encoded in the non-coding genome. We developed the infrastructure to bridge the gap between genomic cis-regulatory positions and biological function.
7653

77-
<table style="border:none; font-size: 90%; width:100%;">
78-
<tr style="border:none;">
79-
<td style="border:none;width:25%">
80-
<a href="https://academic.oup.com/bioinformatics/article/31/14/2382/255379"><img src="/images/knowledge-mining/chipseeker.png" style="width:100%; max-width:1000px;"/></a>
81-
</td>
82-
<td style="border:none;">
54+
8355
<strong>Cistromic Annotation:</strong> [**ChIPseeker**](https://bioconductor.org/packages/ChIPseeker) (<em>Bioinformatics</em> 2015) has become a global cornerstone for epigenomic annotation. It provides a comprehensive framework for annotating genomic peaks (ChIP-seq, ATAC-seq, DNase-seq) with biological context and facilitating cross-dataset comparison and mining within the GEO database.
84-
</td>
85-
</tr>
86-
</table>
56+
8757

8858

8959
## Pillar 5: Knowledge-Driven Discovery in Single-Cell & Spatial Omics
9060

9161
As biological data moves toward higher resolution, the integration of prior knowledge becomes essential for interpretability. Our team has pioneered methods to incorporate biological intelligence into single-cell and spatial discovery.
9262

93-
<table style="border:none; font-size: 90%; width:100%;">
94-
<tr style="border:none;">
95-
<td style="border:none;width:50%">
63+
9664
<strong>Biological Latent Spaces:</strong> Through [**MSGNN**](https://github.com/YuLab-SMU/MSGNN), we proposed integrating biological knowledge priors directly into the graph-based community detection process via graph neural networks. This ensures that single-cell clustering outcomes are not only mathematically robust but also align closely with biological interpretations.
9765
<br><br>
9866
<strong>Spatial Functional States:</strong> With [**SVP**](https://github.com/YuLab-SMU/SVP), we introduced a novel paradigm for characterizing cell functional states and identifying spatially variable features, enabling the direct mapping of biological functions (e.g., pathways) across tissue architectures.
99-
</td>
100-
<td style="border:none;width:23.5%">
101-
<a href="https://github.com/YuLab-SMU/MSGNN/"><img src="/images/knowledge-mining/msgnn.png" style="width:100%; max-width:1000px;"/></a>
102-
</td>
103-
<td style="border:none;width:23.5%">
104-
<a href="https://github.com/YuLab-SMU/SVP/"><img src="/images/knowledge-mining/svp.png" style="width:100%; max-width:1000px;"/></a>
105-
</td>
106-
</tr>
107-
</table>
67+
10868

10969

11070
## Pillar 6: From Abstract Results to Intuitive Insight — Visual Logic
11171

11272
A functional discovery is only complete when it can be interpreted. Our team has pioneered the "Visual Logic" of enrichment, transforming abstract statistical tables into rigorous visual narratives.
11373

114-
<table style="border:none; font-size: 90%; width:100%;">
115-
<tr style="border:none;">
116-
<td style="border:none;width:25%">
117-
<a href="https://guangchuangyu.github.io/software/enrichplot/"><img src="/images/knowledge-mining/enrichplot.png" style="width:100%; max-width:1000px;"/></a>
118-
</td>
119-
<td style="border:none;">
74+
12075
<strong>Interpretive Synthesis:</strong> Through [**enrichplot**](https://bioconductor.org/packages/enrichplot), we established the visualization standards for functional discovery. By implementing sophisticated geometries (e.g., dot plots, gene-concept networks, upset plots), **enrichplot** enables researchers to visually synthesize complex enrichment results, uncovering the hidden relationships between biological themes and their underlying molecular drivers.
121-
</td>
122-
</tr>
123-
</table>
76+
12477

12578

12679
## Feedback from the academic community

content/contribution-tree-data.md

Lines changed: 3 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -62,12 +62,7 @@ Before our work, tree visualization was largely restricted to topological displa
6262

6363
As omics data reached unprecedented scales, our team introduced the **"Data-to-Tree" paradigm** in foundational work published in [*Molecular Biology and Evolution*](https://academic.oup.com/mbe/article-abstract/35/12/3041/5142656) in 2018 (ESI highly cited). This work proposed two comprehensive methods that redefined the integration of heterogeneous data within a unified evolutionary context.
6464

65-
<table style="border:none; font-size: 90%; width:100%;">
66-
<tr style="border:none;">
67-
<td style="border:none;width:25%">
68-
<a href="https://doi.org/10.1093/molbev/msab166"><img src="/images/ggtree/ggtreeExtra-2021.png" style="width:100%; max-width:1000px;"/></a>
69-
</td>
70-
<td style="border:none;">
65+
7166
<strong>Theoretical Foundations & universal derivatives:</strong>
7267
The two methods introduced in 2018 have since evolved from specialized phylogenetic tools into universal visualization standards:
7368
<ul>
@@ -82,24 +77,14 @@ The two methods introduced in 2018 have since evolved from specialized phylogene
8277
</ul>
8378
<br>
8479
<strong>Programmable Reproducibility:</strong> Our work in <em>iMeta</em> (2022) established the **ggtree** object—a programmable structure that ensures analytical reproducibility by encapsulating trees, data, and visualization directives.
85-
</td>
86-
</tr>
87-
</table>
80+
8881

8982
## Pillar 4: Vertical Generalization — From Phylogeny to General Hierarchy
9083

9184
To demonstrate the universal utility of our grammar, we expanded its scope beyond evolutionary biology. By generalizing the framework to encompass all hierarchical structures, we bridged the gap between specialized biological interpretation and general data science.
9285

93-
<table style="border:none; font-size: 90%; width:100%;">
94-
<tr style="border:none;">
95-
<td style="border:none;width:25%">
96-
<a href="https://www.bioconductor.org/packages/ggtreeDendro"><img src="/images/ggtree/ggtreeDendro.png" style="width:100%; max-width:1000px;"/></a>
97-
</td>
98-
<td style="border:none;">
86+
9987
<strong>Universal Scope:</strong> Through [**ggtreeDendro**](https://bioconductor.org/packages/ggtreeDendro), we extended the phylogenetic grammar to hierarchical clustering and classification/regression trees. This enables the same rigorous data integration methods used in phylogenetics to be applied to any sample-level or feature-level hierarchical relationship (e.g., cell clustering), unifying disparate analytical workflows under a single theoretical umbrella.
100-
</td>
101-
</tr>
102-
</table>
10388

10489
----
10590

0 commit comments

Comments
 (0)