Abstract
In SARS-CoV-2 replication complex, the Non-structural protein 9 (Nsp9) is an important RNA binding subunit in the RNA-synthesizing machinery. The dimeric forms of coronavirus Nsp9 increase their nucleic acid binding affinity and the N-finger motif appears to play a critical role in dimerization. Here, we present a structural, lipophilic and energetic study about the Nsp9 dimer of SARS-CoV-2 through computational methods that complement hydrophobicity scales of amino acids with molecular dynamics simulations. Additionally, we presented a virtual N-finger mutation to investigate whether this motif contributes to dimer stability. The results reveal for the native dimer that the N-finger contributes favorably through hydrogen bond interactions and two amino acids bellowing to the hydrophobic region, Leu45 and Leu106, are crucial in the formation of the cavity for potential drug binding. On the other hand, Gly100 and Gly104, are responsible for stabilizing the α-helices and making the dimer interface remain stable in both, native and mutant (without N-finger motif) systems. Besides, clustering results for the native dimer showed accessible cavities to drugs. In addition, the energetic and lipophilic analysis reveal that the higher binding energy in the native dimer can be deduced since it is more lipophilic than the mutant one, increasing non-polar interactions, which is in line with the result of MM-GBSA and SIE approaches where the van der Waals energy term has the greatest weight in the stability of the native dimer. Overall, we provide a detailed study on the Nsp9 dimer of SARS-CoV-2 that may aid in the development of new strategies for the treatment and prevention of COVID-19.
Similar content being viewed by others
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is currently a global pandemic, which has spread rapidly throughout the world since December 2019 when it was first reported1,2,3,4,5,6. During the replication process of the virus, polyprotein processing releases RNA polymerase along with several non-structural proteins (Nsps) that facilitate RNA synthesis and may play a key role in the replication process, although they are not included in the viral envelope7,8,9,10,11. All Nsps are considered essential for transcription, replication, and translation of viral RNA, except Nsp1 and Nsp22,3,4,5,6,7,8,9,10,11,12,13,14. Nsp9 along with Nsp7, Nsp8, and Nsp10 are located within the replication complex and thus, are likely to be members of this process12. In addition, Nsps are considered important for viral replication during the human cell infection phase13,14. CoV Nsp9s have diverse forms of dimerization that promote their biological function. SARS-CoV Nsp9 forms a dimer from a conserved region called "GxxxG" α-helical motif, where the interruption of key residues within this region reduces RNA binding and SARS-CoV proliferation14,15. Additionally, it was observed that porcine delta coronavirus (PDCoV) Nsp9 mutant (Nsp9 without the N-finger motif) is monomeric in solution16. Since the dimeric form of Nsp9 is essential for viral replication and infection, studies suggest that dimer disruption may be an effective strategy in combating coronavirus-associated diseases13,14,15,16,17,18.
Although there is an increasing number of proteins determined by structural techniques as NMR, X-ray diffraction and cryogenic electron microscopy19,20 where the formation of protein–protein complexes have evidenced to be essential in biological systems, it is necessary to complement that structural information with a detailed quantitative understanding of the main features that govern the binding mode between the two proteins at an atomic level21,22,23,24,25,26. Accordingly, information about the effect of conformational changes of the two proteins that form dimer species such as lipophilicity, the free energy of binding, movement of the protein's dynamic domain, hot-spot residues in the interaction interface need to be investigated27,28,29.
Experimental evidence has shown through structure and function studies that protein dimerization is controlled by the interaction of hydrophobic surfaces30, however, the dynamic, lipophilic, and energetic analysis of protein–protein interactions (PPI) continues to be a major challenge in theoretical studies31,32,33. From a computational point of view, strategies to face these challenges include studies based on lipophylic scales that consider the local context of proteins34,35, molecular dynamics simulations and binding free energy calculations, which provide crucial information about the dynamics of complex protein structures and detailed energetic information36,37,38,39.
We present in this work a structural, lipophilic and energetic study about the Nsp9 dimer of SARS-CoV-2 through computational methods that complement hydrophobicity scales of amino acids with molecular dynamics. To elucidate contacts between residues that make interactions at the dimer interface we have analyzed the impact of structural movements of the dimer already formed, including after the deletion of the N-finger motif. Clustering results led us to find cavities with high druggability scores, placed near hydrophobic residues and accessible to potential drugs. It may aid to the development of new strategies for the treatment and prevention of COVID-19.
Results and discussion
Dynamics of Nsp9 native and mutant dimers allows identification of possible binding sites for inhibitors
The non-structural proteins (Nsps) of SARS-CoV-2 are not incorporated into virion particles. Due to their degree of sequence conservation, enzymatic roles, and essentiality of each of the NSPs in SARS-CoV-2, it is believed that these proteins mimic the behavior of homologous proteins in coronaviruses17. These Nsps appear to be necessary for viral replication in SARS-CoV and influence pathogenesis14. Although they present a close homology among viruses, the interest in Nsps is because they show conserved functions within the life cycle of SARS-Cov-2 that may be susceptible to inhibition17.
Non-structural protein 9 (Nsp9) has been considered essential for viral replication during infection of human cells14. Several Nsp9 homologs have been identified in many coronaviruses, including SARS-CoV-2. Nsp9 dimerizes via a conserved α-helical motif called “GxxxG”, where disruption of key residues reduces RNA binding and SARS-CoV viral replication14,15.
The crystal structures of SARS-CoV-2 Nsp9 show an unusual fold seen only in coronaviruses15,40. The core of this fold is a small β-barrel enclosed by six β-sheets where a series of extended loops protrude outward. These loops connect to the individual β-sheet of the barrel with an N-terminal β-sheet and a C-terminal α-helix, where the last two elements compose the main regions of the dimer interface (Fig. 1).
Thus, to obtain different conformations of the protein complex and observe how these regions interact in the dimeric interface, 2000 ns of all-atom molecular dynamics simulations were performed. Sampling was obtained for the native and the mutant proteins in order to observe structural, lipophilic, and energetic aspects of these two different systems. It is noteworthy that either the chain A or the chain B in the native Nsp9 dimer present the region called N-finger (NNEL residues) at the N-terminal region which plays a critical role in the dimerization process16. However, the lack of this region in the mutant system imposes a relevant structural difference which can play a crucial role in terms of stability and dynamics.
The evolutions of conformations in the systems were analyzed by determining the mean square deviation (RMSD) of each structure with respect to the reference structure of the equilibrium step, which was calculated after alignment based on the backbone atoms (Fig. 2).
Figure 2 shows the RMSD for both systems, which was significant stable, particularly after 800 ns of MD simulations where the RMSD values of all systems were within a reasonable fluctuation in a range of 1 to 4 Å suggesting that the structural equilibrium was reached. In addition, visualizations of the sampled structures in the trajectories indicate that some regions of the monomers present moderate movements concerning the initial structure. Comparing the mutant structure against the native as the reference structure, the lDDT score (a local superposition-free score for comparing protein structures) is plotted as a function of the residue numbers. Deviation of no more than 0.9 was observed (Fig. S4). Additionally, the flexibility of each system was verified by means of the fluctuations of the backbone atoms for each residue. In consequence, the Root-Mean-Square Fluctuations (RMSF) were calculated to characterize the local movement of residues in the dimeric systems (see Fig. 3).
Overall, the RMSF data show similar trends obtained for both native and mutant systems. Figure 3 indicates that the residues with higher fluctuation values are located in the N and C-terminals regions (blue and green marks in Fig. 3, respectively), they present high flexibility that amounts to 10 Å. Furthermore, the other regions with relevant fluctuations correspond to regions of the β2/β3, β4/β5, and β6/β7 loops that connect the β-sheets inside the barrel (residues highlighted in purple). The N-terminal regions in monomers A show similar fluctuations, whereas the monomer B in the mutant system has a smaller fluctuation.
To better understand the conformational changes of the binding regions at the dimer interface, it is necessary to analyze the movements in more detail. Therefore, the final trajectories of the native system were analyzed through cluster analysis by grouping the poses extracted from the MD simulations. In general, the three most populated clusters present regions with minor differences in their structure (See Fig. S1 in support information). Following the RMSF analysis (Fig. 3), the loop regions, N and C-terminal present major deviations. Cluster1 showed a conformation similar to cluster3 regarding the loop regions, therefore, there are differences in the terminal regions. In cluster1 the N-terminal of chain A (Fig. 4, in cyan) is interacting with chain B (Fig. 4, in green) which limits cavity formation due to steric impediment. Cluster3 has a cavity located in this region since the N-terminal of chain A is interacting with the same chain and promotes cavity formation nearby the α-helices. On the other hand, Cluster2 presents differences in conformation mainly in the β2/β3 and β3/β4 loops of chain A compared to cluster1 and 2. Thus, its cavity is located closer to cluster3. Figure 4 depicts a representative structure of each of the first 3 dominant clusters, which allowed us to accurately model the structural interfaces of the systems. Additionally, we use these structures with the FPocket software to identify the possible pockets with greater affinity for drugs in the dimer interface regions in each of the selected structures46.
Cluster 1 has the highest percentage of structures and is also the one with the highest druggability score, which may help to understand why the cavity formed, where hydrophobic residues Leu45 and Leu106 are present, is the most accessible to possible drugs. These findings are similar to the study by Littler and co-workers who identified the surface of the hydrophobic interface cavity between Nsp9 dimer proteins17. This type of cavity analysis41,42 has been applied to other protein systems43 and has shown promise for screening enzyme inhibitors and may help in the search for molecules with anti-SARS-CoV-2 potential.
Finally, it is worth mentioning that the N-terminal regions are isolated making contact with counterpart monomer residues (Fig. 5A,C). In contrast, Fig. 5B shows the C-terminal portions surrounded by hydrophobic residues, which causes it to create funnel-shaped hydrophobic cavities on either side of the interface helices.
Main interactions at the NSP9 dimer interface
It is well known that hydrogen bonds and hydrophobic interactions play important roles in protein–protein interactions31,44,45,46. The arrangement of monomers within Nsp9 dimers is well conserved at different CoVs and is maintained at SARS-CoV-217. The main region of interaction between the monomers is the conserved "GxxxG" protein-binding motif, which allows van der Waals interactions in the interface regions of the C-terminal between the α-helices47. The main hydrogen interactions observed at the binding interface of monomers A and B are listed in Table 1, along with its occupancy during the last 100 ns of MD simulations. If there is more than one interaction with the same residues only the highest value is reported.
The most stable hydrogen interactions involve residues from the N and C-terminal regions of proteins. Some studies indicate that these regions are important to maintain the structure of the dimer formed19. While the native system has a higher number of interactions mainly with residues from the beginning of the chain. The mutant system has only two interactions, one at the beginning and one at the end of the chain. When observing the structures at the end of the simulation, it is possible to notice that the N-finger region located at the N-terminal of monomer A establishes more interactions with the region where the β6 of monomer B is located.
For these systems, the most frequent interactions are van der Waals located in the α-helices, where the residues that contribute to these interactions are mainly Gly100 and Gly104, responsible for stabilizing the α-helices and making the dimer interface remain stable. Our results strongly suggest that these interactions are mainly responsible for the maintenance of the dimeric form of SARs-CoV-2 Nsp9 since experimental data show that the native enzyme with the presence of the N-finger plays an important role in maintaining the stability of the dimer21.
Energetic analysis of NSP9 dimers
In this analysis, we used the last 100 ns trajectories of the MD simulation of the native and mutant systems for protein–protein binding free energy calculations using the MMGBSA and SIE methods (see Table 2). These calculations use a portion of the trajectory from which snapshots were selected. In the case of this study, we used 10,000 frames with an interval equal to 2, resulting in 5000 frames for the calculation.
Table 2 confirm that the two methods were able to predict a strong binding affinity for the two systems. For the native system, this affinity was − 63.51 kcal/mol, whereas for the mutant system, − 36.99 kcal/mol. Using the SIE method, we were able to describe the same energy trend seen in the MMGBSA method, with values of − 16.50 kcal/mol for the native system and − 11.52 kcal/mol for the mutant system.
To identify hot spots of binding affinity between monomers, analyzes of energy decomposition by residue were performed using the MM-GBSA method. The last 100 ns of the trajectories of the two systems were analyzed allowing the description of the residues that are part of the energetic contribution of the protein that assists in the energetic stability process of the dimer. Thus, Fig. 6 shows the contribution of all residues to the binding free energy.
As noted, the residues for both systems show similar peaks. For the native system, the residues located in the N-finger that contribute favorably are residues that are part of the main hydrogen interactions throughout the simulation (see Table 1). On the other hand, the Asn1 residue contributes unfavorably, with positive values near to 2 kcal/mol in monomer A and 4.5 kcal/mol in monomer B.
In this analysis, it was also observed that the fact that the mutant system does not present the N-finger region, the Ser5 residue shows a behavior comparable to Asn1, with an unfavorable contribution. This happens because this region presents high flexibility, making this residue fluctuate during the simulation, establishing few or no interactions.
Residues located in the 97–104 range in both systems contribute favorably to the binding free energy. These are residues found in the contact region of the interface of the monomers where the conserved "GxxxG" motif of Nsp9 is found. Thus, with the decomposition of energy per residue, it is evident that in both systems the main contributions come from residues that are in the interface between the monomers, keeping the dimer stable. It is worth remarking that despite the structural analyses indicate the native system is the one with the greatest movements during the MD simulations, this system is the one with the greatest affinity between the monomers according to the energetic analysis. These observations on predicted binding affinities may be associated with local conformational changes in the N-finger region and in the F71-F75 region, reinforcing that the N-finger region is essential not only for dimer formation but also for maintaining interactions between an interface.
Analysis of the lipophilicity in NSP9 dimers
Here, we have complemented the structural and energetic studies with a lipophilic analysis in the native and mutant dimeric forms of Nsp9 protein. This is accomplished by using a novel hydrophobicity scale of amino acids based on quatum-mechanical implicit solvation model. The individual lipophilicity of each amino acid that forms the native dimer of Nsp9 protein is shown in Fig. 7 where hydrophobic residues are present in the yellow region whereas hydrophilic amino acids are in the blue one. In the hydrophobic region, it can be noted that two crucial residues involved in the formation of the cavity for potential drug binding are present, Leu45 and Leu106. These findings support the druggability score in representative structures of the clustering results for the native dimer mentioned above. On the other hand, main interactions at the Nsp9 dimer interface found in this work pointed out the importance of hydrophobic interactions (van der Waals interactions) in the GxxxG protein-binding motif which is in agreement with the lipophilic profile for these fragments (see grey rectangles, Fig. 7) where the residues between glycine residues, M101, V102, and L103, belongs to the highly hydrophobic portions in the dimeric Nsp9 protein.
While the previous results showed important structural traits in the native protein, the difference in the cluster-weighted lipophilicities between the native and mutant Nsp9 dimers can provide a counterpart in the energetic analysis of these biomolecules. Figure 8 shows the difference between cluster-weighted residue lipophilicities in the native form regarding the mutant protein where differences higher to 0.5 log units are labeled. Here, a positive difference means that the residue in the mutant concerning the native dimer is more hydrophilic whereas a negative difference implies an increase in the lipophilicity of the amino acid in the mutant. Overall, the hydrophilicity increased by more than 0.5 log units at 17 residues in mutant. Indeed, just in the monomer A (mutant) there is a slightly increase of lipophilicity in some residues (Ala16, Val41, Asp50, Asp78, and Gln113), which is not observed in the monomer B which explains why solvation free energy was the term that most contributes to the stability of the mutant (see Table 2). Accordingly, the higher binding energy in the native dimer can be deduced since it is more lipophilic than the mutant, increasing non-polar interactions, which is in line with the result of MM-GBSA and SIE approaches where the van der Waals energy term has the greatest weight in the stability of the native dimer.
Conclusions
Herein, we have evaluated structural, energetic and lipophilic aspects of Nsp9 dimers from SARS-CoV-2. The modeled mutant dimer without the N-finger motif revealed that the interaction in the GxxxG protein-binding motif was sufficient to maintain the protein–protein complex bound along with the simulation. This virtual mutation was responsible to decrease the bind of Nsp9 by 26.52 and 4.98 kcal/mol according to MM-GBSA and SIE calculations. The main interactions at the Nsp9 dimer interface found in this work pointed out the importance of hydrophobic interactions (van der Waals interactions) in the GxxxG protein-binding motif which is in agreement with the lipophilic profile. Thus, our results can be useful for understanding why the cavity formed is the most accessible to possible drugs. This type of cavity analysis has been applied to other protein systems and has shown promise for screening enzyme inhibitors and may help in the search for molecules with anti-SARS-CoV-2 potential.
Materials and methods
Molecular dynamics simulations of native and mutant systems
Two models were prepared considering both the native and a mutant protein, where the first four residues (NNEL) of N-terminal region are truncated. As a starting point, the crystal structure of Nsp9 RNA binding protein of SARS CoV-2 with 2.95 Å resolution was used (PDB ID 6W4B)48. This structure is a dimer in which the N-terminal monomer A needed to be modeled with the Swiss model49 using monomer B as template. The protonation states in the two systems for all residues were predicted using H++ program at pH 7.050,51. The systems were prepared for molecular dynamics simulations using the AMBER18 package with the force field FF14SB52,53. By using the Leap module of Amber42 the charges were neutralized by the addition of counterions (Na+) and then, the systems were inserted into a cubic box with TIP3P water molecules employing a minimum distance of 12 Å between the protein surface and the side of the box. The models were submitted to four minimization steps before MD simulation. In these four stages, the minimization procedure was applied to the following atoms: First, water molecules and counterions (8000 steps), then, the hydrogen atoms of the protein (5000 steps), next, all hydrogen atoms (8000 steps), and finally, the complete system (10,000 steps).
The models have been submitted to a gradual heating step during 200 ps up to 300 K using a Langevin thermostat at constant volume (NVT ensemble). In the next step of 300 ps the density of the systems was balanced. Then, a total of 500 ps of MD was made with constant pressure to balance the systems before starting MD productions. Finally, the productions were performed by 2000 ns of MD with an NTP ensemble at constant temperature (300 K), using periodic boundary conditions, with a 2 fs integration step using the SHAKE algorithm to restrict bonds involving hydrogen atoms54,55. A 10 Å cutoff was used during all the simulations for unconnected interactions. The final trajectories were analyzed in terms of root mean square deviation (RMSD), formation of hydrogen bonds, and cluster analysis. Cluster analyses were performed using the average-linkage hierarchical agglomerative method45. For this algorithm the RMSD coordinate was used as a distance metric. The algorithm was used on all heavy atoms of the native (1–226 atoms) and mutant (1–218 atoms) protein with a critical distance value of 3 Å and a variable sieve value to ensure an initial passage of 10 frames through the trajectory45. From the cluster analysis in the set of poses, only for the native system was selected a representative structure of each of the three main clusters. In addition, the FPocket software was used to identify possible drug-susceptible pockets in each of the selected structures42,56.
Binding free energy calculations
The protein–protein binding free energy can be expressed according to an MMGBSA approach:57,58
where the terms < Gx > represent the average over the snapshots of a single trajectory of the MD complex and i corresponds to the ith snapshot of the protein complex. AMBER was used to calculate free energy with MMGBSA (Eq. 2) and SIE methods (Eq. 3) for 5000 frames taken from the last 100 ns of MD production31,53,59.
\(\Delta E_{MM}\) is total gas phase energy (sum of \(\Delta E_{internal}\), \(\Delta E_{electrostatic}\), and \(\Delta E_{vdw}\)); \(\Delta G_{sol}\) is sum of polar \(\left( {\Delta G_{GB} } \right)\) and non-polar \(\left( {\Delta G_{SA} } \right)\) contributions to solvation.
\(E_{C}\) and \(E_{vdw}\) are the intermolecular Coulomb and van der Waals interaction energies in the bound state. \(\Delta G_{bind}^{R}\) is the change in the reaction field energy between the bound and free states. The \(\Delta MSA\) term is the change in molecular surface area upon binding. The AMBER van der Waals radii linear scaling coefficient (ρ), the solute interior dielectric constant \(\left( {D_{in} } \right)\), the molecular surface area coefficient (γ), the global proportionality coefficient relating to the loss of configurational entropy upon binding (α), and a constant (C) are parameters calibrated by fitting to absolute binding free energies31.
As our objective is to analyze the contribution of each energy component and the Gibbs absolute energy, the entropy contribution was not included in the calculations due to the difficulty of accurately calculating entropy for a large protein–protein complex60.
In order to identify the main residues responsible for the dimer formation process, free energy decomposition was performed for the contribution of each residue. This contribution was calculated using the decomposition process with MMGBSA in AMBER. All energy components were also calculated for 5000 frames obtained from the last 100 ns of MD production.
Lipophilicity calculations
The structure-based and pH-dependent lipophilicity scale developed by Zamora et al.35 based on the IEFPCM/MST continuum solvation method was employed to determine the lipophilicity of each dimer to pH = 7.4. The lipophilicity of each amino acid was computed using the ProtL scale taking into account its specific structural features in both, native and mutant dimers. Cluster-weighted lipophilicities of amino acids in the native and mutant proteins were used in this work (see Table S1).
References
Gorbalenya, A. E. et al. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 5, 536–544 (2020).
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Hui, D. S. et al. The continuing 2019-nCoV epidemic threat of novel coronaviruses to global health & the latest 2019 novel coronavirus outbreak in Wuhan, China. Int. J. Infect. Dis. 91, 264–266 (2020).
Zhang, C. et al. Structural basis for the multimerization of nonstructural protein nsp9 from SARS-CoV-2. Mol. Biomed. 1, 5 (2020).
Li, X., Zai, J., Wang, X. & Li, Y. Potential of large “first generation” human-to-human transmission of 2019-nCoV. J. Med. Virol. 92, 448–454 (2020).
Gralinski, L. E. & Menachery, V. D. Return of the coronavirus: 2019-nCoV. Viruses 12, 10 (2020).
Thiel, V. et al. Mechanisms and enzymes involved in SARS coronavirus genome expression. J. Gen. Virol. 84, 2305–2315 (2003).
Liu, D. X., Tibbles, K. W., Cavanagh, D., Brown, T. D. K. & Brierley, I. Identification, expression, and processing of an 87-kDa polypeptide encoded by ORF 1a of the coronavirus infectious bronchitis virus. Virology 208, 48–57 (1995).
Lim, K. P., Ng, L. F. P. & Liu, D. X. Identification of a novel cleavage activity of the first papain-like proteinase domain encoded by open reading frame 1a of the coronavirus & avian infectious bronchitis virus and characterization of the cleavage products. J. Virol. 74, 1674 LP – 1685 (2000).
Lai, M. M. C. & Cavanagh, D. The Molecular Biology of Coronaviruses. in (eds. Maramorosch, K., Murphy, F. A. & Shatkin, A. J. B. T.-A. V. R.), vol. 48 1–100 (Academic Press, 1997).
Herold, J., Gorbalenya, A. E., Thiel, V., Schelle, B. & Siddell, S. G. Proteolytic processing at the amino terminus of human coronavirus 229E gene 1-encoded polyproteins: identification of a papain-like proteinase and its substrate. J. Virol. 72, 910–918 (1998).
Bost, A. G., Carnahan, R. H., Lu, X. T. & Denison, M. R. Four proteins processed from the replicase gene polyprotein of mouse hepatitis virus colocalize in the cell periphery and adjacent to sites of virion assembly. J. Virol. 74, 3379 LP – 3387 (2000).
Deming, D. J., Graham, R. L., Denison, M. R. & Baric, R. S. Processing of open reading frame 1a replicase proteins nsp7 to nsp10 in murine hepatitis virus strain A59 replication. J. Virol. 81, 10280 LP – 10291 (2007).
Frieman, M. et al. Molecular determinants of severe acute respiratory syndrome coronavirus pathogenesis and virulence in young and aged mouse models of human disease. J. Virol. 86, 884 LP – 897 (2012).
Sutton, G. et al. The nsp9 replicase protein of SARS-coronavirus, structure and functional insights. Structure 12, 341–353 (2004).
Zeng, Z. et al. Dimerization of coronavirus nsp9 with diverse modes enhances its nucleic acid binding affinity. J. Virol. 92, e00692-e718 (2018).
Littler, D. R., Gully, B. S., Colson, R. N. & Rossjohn, J. Crystal structure of the SARS-CoV-2 non-structural protein 9, Nsp9. iScience 23, 101258 (2020).
Miknis, Z. J. et al. Severe acute respiratory syndrome coronavirus nsp9 dimerization is essential for efficient viral growth. J. Virol. 83, 3007 LP – 3018 (2009).
Papageorgiou, A. C., Poudel, N. & Mattsson, J. Protein structure analysis and validation with X-Ray crystallography. Methods Mol. Biol. 2178, 377–404 (2021).
Assaiya, A., Burada, A. P., Dhingra, S. & Kumar, J. An overview of the recent advances in cryo-electron microscopy for life sciences. Emerg. Top. Life Sci. 5, 151–168 (2021).
Gohlke, H. & Case, D. A. Converging free energy estimates: MM-PB(GB)SA studies on the protein-protein complex Ras-Raf. J. Comput. Chem. 25, 238–250 (2004).
Stites, W. E. Proteinminus signProtein interactions: interface structure, binding thermodynamics, and mutational analysis. Chem. Rev. 97, 1233–1250 (1997).
Jones, S., Marin, A. & Thornton, J. M. Protein domain interfaces: characterization and comparison with oligomeric protein interfaces. Protein Eng. Des. Sel. 13, 77–82 (2000).
Jones, S. & Thornton, J. M. Principles of protein-protein interactions. Proc. Natl. Acad. Sci. 93, 13 LP – 20 (1996).
Davies, D. R. & Cohen, G. H. Interactions of protein antigens with antibodies. Proc. Natl. Acad. Sci. 93, 7 LP – 12 (1996).
Brooijmans, N., Sharp, K. A. & Kuntz, I. D. Stability of macromolecular complexes. Proteins 48, 645–653 (2002).
Gromiha, M. M., Yokota, K. & Fukui, K. Energy based approach for understanding the recognition mechanism in protein-protein complexes. Mol. Biosyst. 5, 1779–1786 (2009).
Ofran, Y. & Rost, B. Analysing six types of protein-protein interfaces. J. Mol. Biol. 325, 377–387 (2003).
Chakrabarti, P. & Janin, J. Dissecting protein–protein recognition sites. Proteins Struct. Funct. Bioinf. 47, 334–343 (2002).
Kumar, A. et al. Surface hydrophobics mediate functional dimerization of CYP121A1 of Mycobacterium tuberculosis. Sci. Rep. 11, 394 (2021).
Cui, Q. et al. Molecular dynamics—solvated interaction energy studies of protein-protein interactions: the MP1–p14 scaffolding complex. J. Mol. Biol. 379, 787–802 (2008).
Kraml, J., Kamenik, A. S., Waibl, F., Schauperl, M. & Liedl, K. R. Solvation free energy as a measure of hydrophobicity: application to serine protease binding interfaces. J. Chem. Theory Comput. 15, 5872–5882 (2019).
Da Costa, K. S. et al. Structural analysis of viral infectivity factor of HIV type 1 and its interaction with A3G, EloC and EloB. PLoS One 9, e89116 (2014).
Patel, A. J. & Garde, S. Efficient method to characterize the context-dependent hydrophobicity of proteins. J. Phys. Chem. B 118, 1564–1573 (2014).
Zamora, W. J., Campanera, J. M. & Luque, F. J. Development of a structure-based, pH-dependent lipophilicity scale of amino acids from continuum solvation calculations. J. Phys. Chem. Lett. 10, 883–889 (2019).
Kortemme, T., Kim, D. E. & Baker, D. Computational alanine scanning of protein-protein interfaces. Sci. STKE 2004, pl2 (2004).
Kortemme, T. & Baker, D. A simple physical model for binding energy hot spots in protein–protein complexes. Proc. Natl. Acad. Sci. 99, 14116 LP – 14121 (2002).
Massova, I. & Kollman, P. A. Computational Alanine scanning to probe protein−protein interactions: a novel approach to evaluate binding free energies. J. Am. Chem. Soc. 121, 8133–8143 (1999).
Huo, S., Massova, I. & Kollman, P. A. Computational alanine scanning of the 1:1 human growth hormone-receptor complex. J. Comput. Chem. 23, 15–27 (2002).
Egloff, M.-P. et al. The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl. Acad. Sci. USA 101, 3792 LP – 3796 (2004).
Kuzmanic, A., Bowman, G. R., Juarez-Jimenez, J., Michel, J. & Gervasio, F. L. Investigating cryptic binding sites by molecular dynamics simulations. Acc. Chem. Res. 53, 654–661 (2020).
Le Guilloux, V., Schmidtke, P. & Tuffery, P. Fpocket: an open source platform for ligand pocket detection. BMC Bioinf. 10, 168 (2009).
Lapaillerie, D. et al. In Silico, In Vitro and in cellulo models for monitoring SARS-CoV-2 Spike/human ACE2 complex, viral entry and cell fusion. Viruses 13, 569 (2021).
Anand, S. & Mohanty, D. Inter-domain movements in polyketide synthases: a molecular dynamics study. Mol. Biosyst. 8, 1157–1171 (2012).
Deriu, M. A. et al. Investigation of the Josephin Domain protein-protein interaction by molecular dynamics. PLoS One 9, e108677 (2014).
Gohlke, H., Kiel, C. & Case, D. A. Insights into protein-protein binding by binding free energy calculation and free energy decomposition for the Ras-Raf and Ras-RalGDS complexes. J. Mol. Biol. 330, 891–913 (2003).
Hu, T. et al. Structural basis for dimerization and RNA binding of avian infectious bronchitis virus nsp9. Protein Sci. 26, 1037–1048 (2017).
Tan, K. et al. No Title. The crystal structure of Nsp9 RNA binding protein of SARS CoV-2. Center for Structural Genomics of Infectious Diseases (2020) https://doi.org/10.2210/pdb6W4B/pdb.
Waterhouse, A. et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296–W303 (2018).
Gordon, J. C. et al. H++: a server for estimating pKas and adding missing hydrogens to macromolecules. Nucleic Acids Res. 33, W368–W371 (2005).
Anandakrishnan, R., Aguilar, B. & Onufriev, A. V. H++ 3.0: automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulations. Nucleic Acids Res. 40, W537–W541 (2012).
Maier, J. A. et al. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 11, 3696–3713 (2015).
Case, D. A. et al. The Amber biomolecular simulation programs. J. Comput. Chem. 26, 1668–1688 (2005).
Elber, R., Ruymgaart, A. P. & Hess, B. SHAKE parallelization. Eur. Phys. J. Spec. Top. 200, 211–223 (2011).
Ryckaert, J.-P., Ciccotti, G. & Berendsen, H. J. C. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J. Comput. Phys. 23, 327–341 (1977).
Schmidtke, P., Le Guilloux, V., Maupetit, J. & Tufféry, P. fpocket: online tools for protein ensemble pocket detection and tracking. Nucleic Acids Res. 38, W582–W589 (2010).
Srinivasan, J., Cheatham, T. E., Cieplak, P., Kollman, P. A. & Case, D. A. Continuum solvent studies of the stability of DNA, RNA, and phosphoramidate−DNA helices. J. Am. Chem. Soc. 120, 9401–9409 (1998).
Kollman, P. A. et al. Calculating structures and free energies of complex molecules: combining molecular mechanics and continuum models. Acc. Chem. Res. 33, 889–97 (2000).
Lill, M. A. & Thompson, J. J. Solvent interaction energy calculations on molecular dynamics trajectories: increasing the efficiency using systematic frame selection. J. Chem. Inf. Model. 51, 2680–2689 (2011).
Yang, Y., Liu, H. & Yao, X. Understanding the molecular basis of MK2-p38α signaling complex assembly: insights into protein-protein interaction by molecular dynamics and free energy studies. Mol. Biosyst. 8, 2106–2118 (2012).
Acknowledgements
The authors are grateful to Conselho Nacional de Desenvolvimento Cientifico e Tecnológico (CNPq) and PAPQ 2021-PROPESP/UFPA for the financial support. We also thank the access of the computational resources of the Supercomputer Santos Dumont (SDumont) provided by the Laboratório de Computação Científica (LNCC). W.J.Z thanks to the Ministerio de Ciencia, Tecnologia y Telecomunicaciones (MICITT), Consejo Nacional para the Investigaciones Cientificas y Tecnoloǵicas (CONICIT;Costa Rica), and University of Costa Rica (UCR).
Author information
Authors and Affiliations
Contributions
A.H.L. and J.O.A. conceived the experiment, A.H.L. and J.O.A. conducted MD simulations experiments, J.L.S and C.N.A. conducted clustering analysis and drugability score, W.J.Z. and S.S.P. conducted lipophilicity experiments. All authors analyzed the results and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
de O. Araújo, J., Pinheiro, S., Zamora, W.J. et al. Structural, energetic and lipophilic analysis of SARS-CoV-2 non-structural protein 9 (NSP9). Sci Rep 11, 23003 (2021). https://doi.org/10.1038/s41598-021-02366-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-02366-0
This article is cited by
-
SARS-CoV-2: analysis of the effects of mutations in non-structural proteins
Archives of Virology (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.