Inter-Modular Linkers

Within Bacterial NRP Biosynthetic Pathways


General Workflow

network

How selective IMLs are toward pairs of modules

  • Here we are trying to explore whether linkers tend to bind multiple pairs of modules (universal linkers) or every unique linker binds a specific pair of modules. Clustering of the ~38k extracted inter-modular linkers (IMLs) using the UCLUST clustering algorithm (Edgar RC.), have shown that the data comprise of 3,916 unique linkers (centroids/clusters). 3,616 IMLs were found to be associated with only a single pair of modules, while the remaining 300 linkers tend to bind multiple distinct pairs of modules (ranging between 2 and up to 13 unique pairs) (Fig. 1).


selectivity selectivity

Fig. 1 - Linkers universality: Only 8% of all extracted inter-modular linkers (IMLs) tend to bridge multiple pairs of modules, while the remaining 92% binds specifically to just a single pair of modules. Revealing the high selectivity nature of linkers towards pairs of modules.

  • In order to establish a more rigorous case about our previous analysis, we decided to create a graph with 993 Inter-modular linkers extracted from the species Mycobacterium Abscessus. The nodes of the graph represent the linkers and edges are constructed between nodes, that share at least 80% pairwise sequence similarity. After constructing the graph, we applied Louvain community detection algorithm(Vincent D. Blondel et.al). The algorithm was able to detect 9 distinct communities. Each community consists of linkers that links specifically a single pair of modules. For simplicity purposes, a pair of modules are represented by their activated substrates (e.g. Ser-Linker-Ala --> Ser-Ala) (Fig. 2).


selectivity


selectivity

Fig. 2 - Mycobacterium Abscessus module-specific IMLs: (A) Community network, where nodes refer to linkers, and edges are constructed between two linkers, if they share 80% sequence similarity. The graph depicts nine distinct communities. Each community represents all the linkers that bind a specific pair of modules. For instance, the orange community refers to all linkers that bind the pair of modules that activate phenylalanine and tyrosine, respectively. (B) Metadata of the community network, showing: The size of each community, number of unique and redundant BGCs from which the IMLs were extracted. For instance, the orange community refers to all linkers that bind Phenylalanine and Tyrosine.

Conclusion:

From the aboved mentioned analyses we concluded, that Linkers are highly selective towards pairs of modules and they do not link them randomly.

How conserved are IMLs within and across species:

Our analysis so far indicates that IMLs are very selective towards module pairs. Here, we ask whether pairs of modules tend to be linked by the same linker regardless of the bacterial species they were obtained from. We conducted an all-by-all comparison of module pairs vs. genera (computing the degree of conservation of IMLs linking a specific module pair both within and across genera) and then built a community network to visualize the phylogenetic distributions of IMLs that link the same module pair. We then constructed a community network visualization to illustrate the phylogenetic specificity of IMLs. We took all of the Thr-Val IMLs obtained across all species and created a graph where again the nodes represent IMLs and edges are drawn only between nodes that share at least 80% sequence similarity (Figure 8). After applying the Louvain community detection algorithm to this data, the nodes were colored based on the bacterial species they were obtained from. If IMLs were globally conserved across many bacterial species, we would have expected to obtain a single large community network with multi-colored nodes. If instead IMLs were conserved within a single bacterial species, we would expect multiple distinct communities to be detected, where nodes within each community had the same color. The data indicate that the latter is the case, and that there is some phylogenetic specificity to IMLs (Fig. 3)


conservation
Fig. 3 - (Thr- Val) IMLs community network: A network of all the linkers that bridge pair of modules (Thr-Val). These linkers belong to various distinct communities, despite the fact that they are all linking the same pair of modules. The coloring of the graph refers to the species from which linkers were extracted.

Conclusion:

Linkers of the same pair of modules across all species belong to various communities. These findings indicate that module-specific IMLs tend to be more conserved within bacterial genera, whereas multiple distinct IMLs exist that link the same module pair across different genera


Copyright © mml.unc.edu