Molecular Dynamics Simulation of TDP-43 RRM in the Presence and Absence of RNA

Article Information

David Donald Scott1#, David Mowrey2#, Karthi Nagarajan2#, Anil Nair2*, May Khanna1,3*

1University of Arizona Center for Innovation, Tucson, Arizona

2OmniAb, Inc., Oro Valley, Tucson, Arizona

3Department of Pharmacology and Therapeutics, Gainesville, Florida

*Corresponding author: Dr. May Khanna, Department of Pharmacology and Therapeutics, University of Florida, Gainesville, FL 32610
Dr. Anil Nair, In silico Discovery Center, OmniAb, Inc. 12470 N Rancho Vistoso Blvd, Suite 150 Oro Valley, AZ 85755

Received: 01 July 2024; Accepted: 08 July 2024; Published: 23 August 2024

Citation: David Donald Scott, David Mowrey, Karthi Nagarajan, Anil Nair, May Khanna. Molecular Dynamics Simulation of TDP-43 RRM in the Presence and Absence of RNA. Journal of Bioinformatics and Systems Biology. 7 (2024): 161-168.

Share at Facebook

Abstract

Structural characterization of the prion prone TAR DNA Binding protein (TDP)-43 has been challenging since its intrinsically disordered regions represent 15-30% of the total protein. TDP-43 is a nucleic acid binding protein with an N-terminal domain, two RNA Recognition Motifs (RRM1 and RRM2) and the C-terminal domain. In this study, we seek to define possible new targetable sites on the apo structure of TDP-43 RRM domains. To do so, we used molecular dynamic (MD) simulations on the NMR solved TDP-43RRM1-2 structure bound to RNA to predict the apo structure. Contact analysis of TDP-43 showed that while the integrity of the individual domains was maintained upon RNA removal, a decrease in interdomain contacts was observed. Moreover, we compared apo TDP- 43 structures obtained from MD to AlphaFold 2 (AF2) predicted TDP-43 structures and found differences in loop regions. Sitemap on predicted apo RRM and RNA bound structures revealed fewer pockets in the RNA-free compared to the RNA-bound structures.

Keywords

TDP-43, Molecular Dynamics (MD), RNA.

TDP-43 articles; Molecular Dynamics (MD) articles; RNA articles;

Article Details

Introduction

TAR DNA-binding protein 43 (TDP-43) is a nuclear RNA-binding protein implicated in RNA metabolism, yet reported to be present as cytoplasmic inclusions in neurons and/or glial cells of a range of neurodegenerative diseases including ALS-FTLD and AD [1]. Depletion of cellular TDP-43, as a result of being trapped in inclusions, leads to the loss of neuronal activities p-value on the difference between the means of the two Bernoulli distributions.

For interdomain angle analysis the first 100 ns of simulation (500 frames) were also discarded and the remaining frames of all 15 simulations were concatenated. To measure relative orientations between the two domains we measured both a dihedral twisting angle (τ) and an RRM1 opening angle (α). For the τ angle we chose residues G148 and A230 as the center of each domain and residues C173 and R191 to define the axis of rotation. The full dihedral angle is defined using Cα atoms of the residue list G148 – C173 – R191 – A230. The α angle was measured using Cα atoms of the residue list G148 - C173 - R191 (Figure. 1a).

2.2 AlphaFold 2 prediction of apo TDP-43 RRM domains

Computational approaches -first with AlphaFold then with AlphaFold 2 (AF2) [15], a neural network-based algorithm that integrates multi-sequences alignments and training procedures based on the evolutionary, physical and geometric constraints of protein structures- are now able to predict protein structures with high accuracy. We thus used Colabfold_AF2advanced_v2 [15] without templates to predict apo TDP-43 RRM domains (Figure. S2) and obtained five models (named here AF_1-5). Only the RRM domains (102- 269 amino acids) construct was submitted to structure prediction.

2.3 Sitemap Analysis

Sitemap analysis was done using Schrodinger [16] as previously described [8]. We attempted to converge an average structure from the multiple structures generated by MD simulations and AF2 models, but they resulted in non- physical structures. Therefore, the best representative structure based on minimal energy refinement was used.

Results

To assess the system equilibration the root mean square deviation (RMSD) between the starting structure of the simulation and all succeeding frames was assessed (Figure. S1). Plots of the RMSD over time show that in most cases (13 out of 15 simulations) the trajectories of RNA-bound simulations equilibrate within 50 ns of simulation while the other two simulations show a slight jump later in the trajectory [10]. In contrast, simulations of the RNA-free system showed much greater deviation from the starting structure and required significantly more time to reach equilibrium in this case reaching a steady trajectory after 600 ns of simulation.

3.1 Root-mean-square-fluctuation (RMSF) Analysis

Root-mean-square-fluctuation (RMSF) analysis measures the positional variance of each residue over the course of the trajectory and indicates residue flexibility over the course of the trajectory (Figure. 1a). As expected and consistent with the RMSD analysis, we observe a significant global increase in backbone flexibility in the absence of bound RNA. Particularly large increases in flexibility occur around residues K140 and P225, located in loop regions (Figure. 1b).

fortune-biomass-feedstock

Figure 1: Differences in root mean square fluctuation (RMSF) between RNA-free and RNA-bound simulations of TDP43. (a) Plot of the RMSF values for RNA-free (red) and RNA-bound (green) simulations. Error bars are the standard error among 15 simulations. The black lines represent the 99.9% confidence interval for the difference between means. (b) The structure of TDP43 colored by difference in RMSF between RNA-free and RNA-bound from white (minimal RMSF difference) to red (maximum RMSF difference). The RNA strand is shown as a cyan ribbon for reference. Nearly all residues of RNA-free TDP43 show a significant increase in fluctuations compared to the RNA-bound TDP43, exceptions being the N- and C- termini and a short span between the two domains.

We next performed contact analysis of TDP-43 to assess any global changes or rearrangements in structure upon removal of RNA from the initial NMR structure (Figure. 2). The contact maps show that the structural integrity of individual domains is maintained in the absence of bound RNA (Figure. 2a and b). However, we observe a greater decrease in interdomain contacts as compared to formation of new contacts. Specifically, we observe significant decreases in contact probability (p<0.05) between residue pairs D247-S104, D247-D105, R151-D247, I249-L131, I249-M132, I249-R151, G252-M132, S254-D105, and S254-M132. Two new contacts are predicted to form between residues E261-K192 and F231-P223 (Figure. 2c).

fortune-biomass-feedstock

Figure 2: Contact maps of RNA-free (a) and RNA-bound (b) TDP43. Contact maps were computed from concatenating all 15 trajectories with frames sampled every 0.2 ns over the 1000 ns simulation (75000 frames total). A contact exists between residues i and j if any heavy atom of residue i is within 4.5 Å of any heavy atom of residue j. Contacts are only shown if the interaction is present for >= 39% of the concatenated trajectory (p<0.01 from a one-sided test from zero). Contact maps show that the structural integrity of each individual domain (upper left and lower right quadrants) is maintained between RNA-free and RNA-bound simulations. A major change between the contact maps is decreased interdomain contacts in the RNA-free simulations compared to RNA-bound simulations (lower left and upper right quadrants). (c) Residues showing significant changes in contacts between the two systems are highlighted as sticks on the TDP43 structure and colored by residue type white – nonpolar, green – polar, red – acidic, and blue – basic. The bound RNA is shown as a cyan ribbon. In the absence of RNA, the two domains are held less rigidly together and are freer to adopt other conformations with respect to each other.

3.2 Intra- and Inter-domain Motion Analysis

Cross-correlation matrices were generated to determine changes in correlated motion between RNA-free and RNA-bound simulations (Figure. 3). Motion can be correlated (residues move in the same direction), anti- correlated (residues move in opposite directions), or uncorrelated (the movement of one residue is independent of the other residue). Cross-correlation matrices of RNA-free and RNA-bound simulations show much stronger correlations in RNA-free simulations than in RNA-bound simulations. Most of the increased correlated motion is likely the result of the overall increased motion in RNA-free simulations compared to RNA-bound simulations. In several instances, however, we observe increased interdomain correlated motion that significantly exceeds the increased intradomain correlated motion. The magnitude of the increase in intradomain correlated motion is likely indicative of the magnitude of the effect coming from an overall increase in motion. The fact that the interdomain correlated motion exceeds the intradomain correlated motion highlights its significance.

fortune-biomass-feedstock

Figure 3: Cross-correlation matrices for RNA-free (a), RNA-bound (b), and the difference between RNA-free and RNA-bound (c). Cross-correlation matrices were computed from concatenating all 15 trajectories with frames sampled every 0.2 ns over the 1000 ns simulation (75000 frames total) based on Cα atoms. Significance filters were applied so only correlations passing a 5σ threshold are shown.

To further characterize inter-domain motion, we performed angle analysis examining the relative orientation of the two domains with respect to each other (Figure. 4). We measured both a dihedral twisting angle (τ) and an RRM1 opening angle (α). The analysis shows that in the absence of RNA there is a significant increase in orientational variability from what is observed in RNA-bound simulations. The increases come from variations in both angles; however, several distinct populations can be observed indicating the presence of multiple stable conformations. These results are consistent with increased inter-domain correlated motion in the absence of RNA- binding.

fortune-biomass-feedstock

Figure 4: Interdomain angle analysis. (a) To measure domain rotation along the axis of the protein, we chose residues G148 and A230 as the center of each domain (blue) and residues C173 and R191 (red) defines the axis of rotation (cyan line). The 4 Cα atoms of these residues defined the dihedral angle (τ) reflecting the relative twisting of these two domains with respect to each other. The RRM1 opening angle was measured using the angle G148 - C173 - R191 (α) centered on Cα atoms of each residue. (b) Contour map of the dihedral angle (τ) vs the RRM1 opening angle (α) for RNA-bound (green) and RNA-free (red) simulations. We observe that both angles assume significantly broader distributions in the RNA-free simulations compared with RNA-bound simulations. Furthermore, the RNA-free simulation can be observed in multiple stable conformations.

fortune-biomass-feedstock

Figure 5: Distributions of phi and psi angles for R227 and A228. Two distinct populations of ψ angles were observed in RNA-free (a, b) and RNA-bound (c, d) for residues R227 (a, c) and A228 (b, d) in the simulations of TDP-43. In structure bundles of RNA-free (e) and RNA-bound (f) TDP- 43, loop conformations (residues I222 to A228) are colored grey and magenta corresponding to upper and lower ψ angle clusters of R227, respectively. The larger ψ angle population in RNA-free simulations (lower distribution, magenta) corresponds in a loop conformation shifted towards where RNA would be bound. This movement also corresponds with the shrinking distance between P223 and F231.

As a final analysis of dynamic changes between RNA-free and RNA-bound simulations, we analyzed φ and ψ angle distributions for each residue over the equilibrated time course (the final 900 of 1000 ns) and analyzed the differences between RNA-free and RNA-bound simulations. We used a 2-dimensional Kolmogorov-Smirnov test for two samples to identify significant changes in φ,ψ angle distributions. The only residues showing a significant (p<0.05) difference between RNA-free and RNA-bound simulations were A228 (p=0.006), R227 (p=0.008), E261 (p=0.013), and A260 (p=0.02) (Figure. 5). Residues A260 and E261 lie near the C-terminus, while residues R227 and A228 lie at the end of a loop running from I222 to A228. It is worth noting that most changes cluster in the RRM2 domain of TDP-43. The changes in φ,ψ angle distribution could be related to changes in the conformation of the (222-228) loop. In the RNA-free system, the loop is able to access a conformation that seems to be restricted by the presence of RNA which brings P223 in closer contact with F231, consistent with our distance analysis.

3.3 AlphaFold 2 and MD simulation predicted structures and SiteMap Analysis

We compared the top 5 structures predicted by AlphaFold 2. Not surprisingly, the structures of the individual RRM domains were maintained in all models. Those models were aligned to the RNA-bound published structure of TDP-43 (PDB ID: 4bs26) as well as to the seven structures derived from the MD simulation (named MD_01 through MD_07) and the corresponding RMSD were analyzed (Table S1, Figure. S3). The models AF_2 through AF_5 poorly predicted the 4bs2 structure and any of the MD- derived structures, as the RMSDs ranged from 8.014 to

17.018 Å. However, model AF_1 exhibited the lowest RMSDs (ranging from 2.203 to 5.004 Å) for the RNA-bound RRM domains as well as four different MD derived models, with MD_02 being the closest structure (RMSD = 2.203 Å; Figure. 6a). Interestingly, we found that the AF2 predicted TDP-43 RRM elements close to either the RNA bound structure or the MD simulated apo structure domains exhibit structural (Figure. 6b-f). More specifically, the secondary structures β2 of RRM1 (Y155-S162) and β2 of RRM2 (V217-F221) are more similar between AF2 and MD predictions while loops regions (C198-T203 and K136- G146) are more similar between AF2 prediction, and the RNA bound TDP-43 NMR structure. Finally, both MD and AF2 are predicting a longer β-strand in β5 of RRM2 (Figure. 6b).

We then utilized the structures from AF2 and MD and used Sitemap [17] to predict druggable pockets of apo TDP-43 RRM domains to define how dynamics may affect binding sites. Five druggable sites of the average of the RNA bound structure solved by NMR were identified, while only three pockets were found for both MD and AF2 predicted structures (Figure. S4). Importantly, a druggable pocket in the central region of the RRM domains that anchors a G- nucleotide (G5 in 4BS2) in the RNA-bound structure was also identified in the MD simulations and AF2-predicted structures. A SiteMap calculation begins with an initial search stage that determines one or more regions on or near the protein surface, called sites, that may be suitable for binding of a ligand to the receptor16. From this, two important characteristics of pocket interactions can be addressed using SiteScore and DScore. SiteScore determines the probability of ligand-receptor interactions, while DScore determines the druggability of the pocket [18]. Our analysis shows that MD structures have poorer SiteScore (0.71 vs 0.779) and DScore (0.58 vs 0.76) compared to AF2 (Table S2), indicating AF2 predicted structures may have better pocket formation than MD simulations of apo TDP-43.

fortune-biomass-feedstock

Figure 6: Structural comparison of MD simulated apo TDP-43 with AlphaFold predicted apo TDP-43 and RNA bound known TDP-43 structures. a. AlphaFold predicted structure (TDP-43_AF1) was overlayed to published RNA bound (4bs2) and MD derived structure of TDP-43RRM (TDP-43_MD_02). b. Zoomed in structure of β4 and 5 in RRM2 (aa. G245-I250). c. Zoomed in structure of á2 of RRM1 region (aa Y155-S162). d. Zoomed in structure of β2 in RRM2 (aa. V217-F221). e. Zoomed in structure of loops in RRM2 (aa. C198-T203). f. Zoomed in structure of loop regions in RRM1 (aa. K136-G146).

4. Discussion

Defining targetable pockets of TDP-43 is of therapeutic interest in neurodegenerative diseases. While several targeting approaches have been met with various success, defining which form of TDP-43 will efficiently offer toxicity relief is still being debated. Diversifying the TDP-43 targeting chemical space may bring new opportunities, particularly if small molecules can distinguish RNA-bound and RNA-free form of TDP-43.

Structural determination of the apo RRM domains of TDP-43 has been technically challenging using traditional NMR and X-ray techniques, due to the dynamic nature of the construct. We hence used Molecular Dynamics (MD)

simulations starting from RNA bound structure of the tandem domains solved by solution NMR [6] to sample conformations of the apo TDP-43 structure [19]. From the analysis of the simulations, we generally observed an increase in the dynamics upon removal of the RNA as seen in both RMSD and RMSF analysis. Some of the increased dynamics can be explained by additional flexibility allowed to the loops previously contacting the RNA. Notably, the loop spanning residues 138 to 146 showed a particularly large increase in flexibility after the RNA was removed (Figure. 1). We also noticed changes in the phi/psi distributions for some residues of the loop138-146 were changed, though the change was not as significant as observed for the loop222-228. While loop flexibility contributed to the dynamics, most of the additional motion upon removing the RNA came from interdomain motion between RRM1 and RRM2. Contact map analysis reveals that intra-domain contacts are almost unchanged between RNA-free and RNA-bound simulations (Figure. 2). However, the RNA-free simulations show a significant decrease in interdomain contacts compared with the RNA-bound simulations. In fact, of the 11 residue pairs with a significant difference between RNA-free and RNA- bound simulations only two (E261-K192 and F231-P223) were not at the interdomain interface and correspond to new contacts observed in the RNA-free versus RNA-bound simulations. The nine remaining residue pairs were all at the interface and showed decreased contact in RNA-free versus RNA-bound simulations. The result is consistent with previous observations noting a decrease in inter-domain hydrophobic contacts in much shorter time-scale simulations (100ns) [20]. We also note that the result is consistent with experimental evidence of intrinsic low affinity between RRM1 and RRM2 of TDP43 and the requirement of a linker for interaction [10], [20-23]. Longer coarse-grained simulations reported an apparent contradiction with our result citing an increase in contacts between RRM1 and RRM2 in the absence of RNA [11]. However, the contact probability threshold is significantly lower than what is used here in our study and represents a fundamental difference in scope between the two studies. Here we set a quite stringent probability cutoff for determining the presence of a contact (p<0.01) with the goal of identifying novel stable conformations that can be targeted by small molecules. The goal of the other publication was to identify novel interactions leading to aggregation which could involve more rare occurrences [11].

Looking at the cross-correlation matrix, we detect a significant increase in positive correlation between residue H143 and residues 212 to 217 in the RNA-free vs RNA- bound simulation, as well as significant increases in negative cross-correlation between H143 and residues 129, 152, 153, 194, and 260 in RNA-free vs RNA-bound simulations. Together this is likely due to a more global 'clam-shell' movement between the two domains, which appears when the RNA is removed. This ‘clam-shell’ movement is particularly highlighted by our inter-domain angle analysis. In particular, the increased distribution of the α angle in RNA-free simulations highlights this broadening of the angle between RRM1 and RRM2 in RNA-free compared to RNA-bound simulations (Figure. 4). Indeed, the clamshell movement reported here, particularly the decrease in the α angle is what produced the apparent increase in contacts between RRM domains in the prior coarse-grained MD study [11]. The results presented in Figure 4 further serve to highlight the increased sampling of varying orientations between the two domains in the absence of RNA. However, we observe some conformational preference for some states in this plot as indicated by contour peaks where there was greater sampling of a given conformation with respect to neighboring conformations. While the timescales presented in this paper are not likely sufficient to be an exhaustive sampling of the accessible conformational space, the use of multiple starting structures and multiple random seeds significantly increases the ability to cover the locally accessible conformational space.

Our interest has been to define druggable sites in TDP-43, either in the RRM or the NTD domain [7,8]. We hypothesized that the RNA-free RRM domains of TDP-43 might exhibit pockets distinct from RNA-bound TDP-43, which could lead to new targeting opportunities. As a first step to drug RNA-free RRM domains, we predicted the apo structure using MD simulations and AlphaFold 2. Sitemap on predicted apo RRM and RNA bound structures revealed fewer pockets in the RNA-free compared to the RNA-bound structures. While this might indicate that small molecules may recognize both RNA-bound and RNA-free, there are more opportunities to specifically target RNA-bound TDP-43. A higher resolution of the RNA-free structure may offer more sites. However, based on our results of the increased dynamics of the apo TDP-43, it may be more challenging to obtain a high resolution of the RNA-free structure.

Acknowledgements

Research reported in this publication was supported by the Center for Innovation in Brain Science. David Donald Scott was funded by 1T32AG061897-01. The Khanna laboratory is funded by DOD ALS DOD and NIH 5R01NS136684-02. We would like to thank Dr. Liberty Francois-Moutal for comments on the manuscript.

Conflict of Interest

The authors declare that they have no conflicts of interest with the contents of this study.

References

  1. de Boer EMJ, Orie VK, Williams T, Baker MR, De Oliveira HM, et al. TDP-43 proteinopathies: a new wave of neurodegenerative diseases. J Neurol Neurosurg Psychiatry 92 (2020): 86-95.
  2. Suk TR, Rousseaux MWC. The role of TDP-43 mislocalization in amyotrophic lateral sclerosis. Mol Neurodegener 15 (2020): 45.
  3. Franco A, Arranz R, Fernandez-Rivero N, Velazquez-Campoy A, Martin-Benito J, Segura J, et al. Structural insights into the ability of nucleoplasmin to assemble and chaperone histone octamers for DNA deposition. Sci Rep 9 (2019): 9487.
  4. Afroz T, Hock EM, Ernst P, Foglieni C, Jambeau M, et al. Functional and dynamic polymerization of the ALS-linked protein TDP-43 antagonizes its pathologic aggregation. Nat Commun 8 (2017): 45.
  5. Babinchak WM, Haider R, Dumm BK, Sarkar P, Surewicz K, et al. The role of liquid- liquid phase separation in aggregation of the TDP-43 low-complexity domain. J Biol Chem 294 (2019): 6306-6317.
  6. Lukavsky PJ, Daujotyte D, Tollervey JR, Ule J, Stuani C, et al. Molecular basis of UG-rich RNA recognition by the human splicing factor TDP-43. Nat Struct Mol Biol 20 (2013): 1443- 1449.
  7. Francois-Moutal L, Felemban R, Scott DD, Sayegh MR, Miranda VG, et al. Small Molecule Targeting TDP-43's RNA Recognition Motifs Reduces Locomotor Defects in a Drosophila Model of Amyotrophic Lateral Sclerosis (ALS). ACS Chem Biol 14 (2019): 2006-2013.
  8. Mollasalehi N, Francois-Moutal L, Scott DD, Tello JA, Williams H, et al. An Allosteric Modulator of RNA Binding Targeting the N-Terminal Domain of TDP-43 Yields Neuroprotective Properties. ACS Chem Biol 15 (2020): 2854-2859.
  9. Furukawa Y, Suzuki Y, Fukuoka M, Nagasawa K, Nakagome K, et al. A molecular mechanism realizing sequence-specific recognition of nucleic acids by TDP-43. Sci Rep 6 (2016):
  10. Dang M, Li Y, Song J. Tethering-induced destabilization and ATP-binding for tandem RRM domains of ALS- causing TDP-43 and hnRNPA1. Sci Rep 11 (2017): 1034.
  11. Ingolfsson HI, Rizuan A, Liu X, Mohanty P, Souza PCT, et al. Multiscale simulations reveal TDP-43 molecular-level interactions driving condensation. Biophys J 122 (2020): 4370-4381.
  12. Jumper J, Evans R, Pritzel A, Green T, Figurnov M, et al. Highly accurate protein structure prediction with AlphaFold. Nature 596 (2020): 583-589.
  13. Harder E, Damm W, Maple J, Wu C, Reboul M, et al. OPLS3: A Force Field Providing Broad Coverage of Drug-like Small Molecules and Proteins, J Chem Theory Comput 12 (2016): 281-296.
  14. Aggarwal P, Bhavesh NS. Hinge like domain motion facilitates human RBMS1 protein binding to proto- oncogene c-myc promoter. Nucleic Acids Res 49 (2019): 5943-5955.
  15. Mirdita M, Schütze K, Moriwaki Y, Heo L, Ovchinnikov S, et al. ColabFold - Making protein folding accessible to all. 10 (2015): 456425.
  16. Halgren T. New method for fast and accurate binding-site identification and analysis, Chem Biol Drug Des 69 (2007): 146- 148.
  17. Nayal M, Honig B. On the nature of cavities on protein surfaces: application to the identification of drug- binding sites. Proteins 63 (2006): 892-906.
  18. Halgren TA. Identifying and characterizing binding sites and assessing druggability, J Chem Inf Model 49 (2009): 377-389.
  19. Sponer J, Bussi G,  Krepl M,  Banas P,  Bottaro S, et al. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview. Chem Rev 118 (2018): 4177-4338.
  20. Prakash A, Kumar V, Meena NK, Hassan MI, Lynn AM. Comparative analysis of thermal unfolding simulations of RNA recognition motifs (RRMs) of TAR DNA-binding protein 43 (TDP-43). J Biomol Struct Dyn 37 (2019): 178- 194.
  21. Bhandare VV, Ramaswamy A. The proteinopathy of D169G and K263E mutants at the RNA Recognition Motif (RRM) domain of tar DNA-binding protein (tdp43) causing neurological disorders: A computational study. J Biomol Struct Dynamics 36 (2016): 1075-1093.
  22. Mackness BC, Tran MT, McClain SP, Matthews CR, Zitzewitz JA. Folding of the RNA recognition motif (RRM) domains of the amyotrophic lateral sclerosis (ALS)-linked protein TDP-43 reveals an intermediate state. J Biol Chem 289 (2014): 8264-8276.
  23. Wei L, Chaoqun L, Jiankai S, Yan W, Guangju C. Insights into the Aggregation Mechanism of RRM domains in TDP-43: A Molecular Dynamics Study. Research Square 18 (2021).

Supplementary Information:

Table S1: RMSD for each AlphaFold predicted TDP-43RRM structure compared to 4bs2 and MD derived models.

TDP-43_AF1

4bs2

2.235 (2361 atoms)

TDP43_MD_01

3.378 (1213 atoms)

TDP43_MD_02

2.203 (1160 atoms)

TDP43_MD_03

4.448 (1255 atoms)

TDP43_MD_04

5.004 (1261 atoms)

TDP43_MD_05

9.895 (1303 atoms)

TDP43_MD_06

12.184 (1353 atoms)

TDP43_MD_07

13.904 (1327 atoms)

TDP-43_AF2

4bs2

16.476 (2623 atoms)

TDP43_MD_01

15.283 (1300 atoms)

TDP43_MD_02

17.018 (1336 atoms)

TDP43_MD_03

15.412 (1330 atoms)

TDP43_MD_04

13.606 (1312 atoms)

TDP43_MD_05

14.047 (1361 atoms)

TDP43_MD_06

10.655 (1350 atoms)

TDP43_MD_07

8.014 (1308 atoms)

TDP-43_AF3

4bs2

14.164 (2648 atoms)

TDP43_MD_01

12.604 (1321 atoms)

TDP43_MD_02

14.106 (1325 atoms)

TDP43_MD_03

12.805 (1356 atoms)

TDP43_MD_04

10.655 (1348 atoms)

TDP43_MD_05

12.943 (1318 atoms)

TDP43_MD_06

9.424 (1352 atoms)

TDP43_MD_07

8.490 (1309 atoms)

TDP-43_AF4

4bs2

16.590 (2608 atoms)

TDP43_MD_01

15.094 (1281 atoms)

TDP43_MD_02

16.789 (1324 atoms)

TDP43_MD_03

16.188 (1334 atoms)

TDP43_MD_04

14.224 (1324 atoms)

TDP43_MD_05

13.877 (1356 atoms)

TDP43_MD_06

10.292 (1343 atoms)

TDP43_MD_07

7.687 (1280 atoms)

TDP-43_AF5

4bs2

16.276 (2576 atoms)

TDP43_MD_01

15.088 (1286 atoms)

TDP43_MD_02

16.491 (1315 atoms)

TDP43_MD_03

15.944 (1319 atoms)

TDP43_MD_04

14.323 (1321 atoms)

TDP43_MD_05

13.302 (1351 atoms)

TDP43_MD_06

9.844 (1330 atoms)

TDP43_MD_07

7.235 (1244 atoms)

fortune-biomass-feedstock

Figure S1: RMSD of each MD simulation over time. To evaluate system convergence, RMSD with respect to the reference structure, was computed over the course of the 1-μs simulations. Analysis from simulations starting from each of the 5 sampled NMR structures are segregated by row: (a, b) conformation 1, (c, d) conformation 2, (e, f) conformation 3, (g, h) conformation 5, and (i, j) conformation 7. The RNA-bound (a, c, e, g, i) and RNA-free (b, d, f, h, j) simulations are distinguished by columns. The plot color distinguishes the three random seeds (in blue, green and orange). For most (13 out of 15) RNA-bound simulations the trajectories converge within 50 ns of simulation. Seed1 in panels c and e which show a slight jump later in the trajectory. RNA-free simulations system show much greater deviation from the starting structure and tend to reach equilibrium after 600ns of simulation.

fortune-biomass-feedstock

Figure S2: AlphaFold prediction of TDP-43 RRM domains. A. Multiple sequence alignment of TDP-43 RRM domains. B. Predicted local distance difference test of TDP-43 residues 102-269.

  1. Predicted alignment error of the 5 models obtained. D. Predicted contact maps of the 5 models obtained. E. Predicted distograms of the 5 models obtained.
fortune-biomass-feedstock

Figure S3: Contour map of the Md simulated structures showing the seven structures retrieved from the 15 simulations at various different time.

fortune-biomass-feedstock

Figure S4: Sitemap Results of TDP-43 structure models. Representative image of Sitemap results from all models overlaid onto NMR solved structure PDB 4BS2. Spheres represent contact points within surface pocket density. Data on sites is included in Supplemental Table 1.

Table S2: Sitemap Results for Molecular Dynamics, AlphaFold2, and RNA-bound TDP-43 models. Sitemap predicted pockets organized by SiteScore, Dscore, volume of pocket and hydrophibicity/hydrophilicity of pocket. Ideal scores are labeled at the top as determined by sitemap calibration. Average results for each model are shown under each subgroup for ease of comparison.

Table icon

© 2016-2024, Copyrights Fortune Journals. All Rights Reserved