proteinProtein folding is a process in which proteins obtain their functional structure and conformation.Through thisphysical process , protein fromIrregular crimpCollapse into specific functionalitythree-dimensional structure。FrommRNASequence translated into linearPeptide chainProtein exists in the form of unfolded polypeptide or irregular curl.
In 2023,Japanese scientists have discovered a series of previously unknown protein folding.[1]
Structure determines function. Just knowing the genome sequence can not make us fully understand the function of protein, let alone how it works.Proteins can interact in the cell environment (specificPH, temperature, etc.), which is called protein folding.
Protein folding is listed as "the 21st centuryBiophysics”It is an important subject ofmolecular biologyCentral ruleA major unsolved biological problem.Prediction of protein molecular structure from primary sequenceTertiary structureIt is a challenging task to further predict its functions.The study of protein folding, especially the early process of folding, that is, the folding process of new peptide segments, is a fundamental problem to comprehensively clarify the central rule. In this field, recent discoveries have fundamentally revised the traditional concept that new peptide segments can be folded spontaneously.Among them,X-rayCrystal diffractionAnd variousspectrumTechnology andelectron microscopeTechnology has played an extremely important role.The 13th sessionInternational Conference on BiophysicsErnst, Nobel Prize winner, emphasized in his report that,NMROne of the main advantages of studying proteins is that it can study the dynamics of protein molecules in great detail, that is, the relationship between dynamic structure or structural movement and protein molecular function.The current NMR technology has been able topicosecondObservation in time domain ofprotein structureThe movement process of, including the main chain andSide chainAnd protein folding and unfolding under various temperatures and pressures.proteinmacromoleculeOfstructural analysisIt is not only to solve a specific structure, but also to pay more attention to the fluctuation and movement of the structure.For example, enzymes and proteins that transport small molecules usually have two conformationsligandAnd unbound ligand.The structural fluctuation within a conformation is the necessary prelude to the conformational transformation, so it is necessary tospectroscopy,SpectroscopyCombine with X-ray structure analysis to study the balance of structural fluctuations, conformational changes and various forms formed in the process of changeIntermediate stateFor another example, in order to understand how proteins fold, it is necessary to know several basic processes of protein foldingtime scale And mechanisms, includingSecondary structure(Spiral and fold) formation, curling, long-range interactions, and total collapse of unfolded peptides.Multiple technologies are used to study secondary processes, such as fastnuclear magnetic resonance, rapid spectroscopy (fluorescence, far ultraviolet andNear ultravioletCircular dichroism).
Research overview
Announce
edit
In organisms,BioinformationThe flow of can be divided into two parts: the first part is stored inDNA sequenceIngenetic informationIt is the transmission of one-dimensional information into the primary sequence of the protein through transcription and translation, and the triplet code mediates thisTransfer process;The second part isPeptide chainAfter hydrophobic collapse, space curlSide chainAggregation and other folding processes form the natural conformation of protein, and obtainbiological activityTo express life information;Protein is the life informationexpression vector , the specificspace structureIt is the basis of its biological function, that is to say, the transformation process from one-dimensional information to three-dimensional information is necessary to show the vitality of life.
Since the 1960s, Anfinsen has been based on reduced and denatured bovine pancreasRNaseWithout any otherMaterial helpLower, only by removingDenaturantandreducing agentOn the basis of the experimental results of restoring its natural structure, the author proposed thatPeptide chainOfamino acidThe sequence consists ofthermodynamicsAll information necessary for stable natural conformation“Self-assembly theory”Since then, with the extensive development of protein folding research, people have further supplemented and expanded the theory of protein folding.Anfinsen's "Thermodynamic Hypothesis of Self assembly" has gained a lotIn vitro experimentIt is proved that many proteins can undergo reversible denaturation andRenaturation, especially some small molecular weight proteins, but not all proteins are the same.And because of the specialenvironmental factorThe protein folding in the body is far from the same.
Protein folding in the body often requires otherCofactorWithATPHydrolysis of.Therefore, Ellis proposed the "auxiliary assembly theory" of protein folding in 1987.This shows that protein folding is not just athermodynamicsIt is obvious that the process of is also controlled by dynamics.Some scholars based on some similaritiesamino acidThe sequence of proteins has differentFolding structureHowever, some other proteins with different amino acid sequences are similar in structure, suggesting thatmRNASecondary structureMaybe as aGenetic codeThus affectingprotein structureHypothesis.But so far, there is no experimental evidence for this hypothesis, only some pure mathematical arguments [3].So, how does the amino acid sequence of a protein determine its spatial conformation?Around this issueScientific researchersA lot of excellent work has been done, but so far our understanding of protein folding mechanism is still incomplete, and even some aspects still have wrong views.
A typical example of research that has made important contributions in this regard is CB.AmphensonGroup AboutRNase I Denaturation andRenaturationResearch.There are 124 bovine pancreatic ribonucleasesAmino acid residue, consisting of 8Mercapto groupPairing to form 4 pairsDisulfide bond。It can be calculated that there are 105 possible ways in which eight sulfhydryl groups in the enzyme molecule form four disulfide bonds, which provides a quantitative index for estimating refolding recombination.Under mild alkaline conditions, 8moleOf concentrated urea and a large amount ofMercaptoethanolIt can completely reduce four pairs of disulfide bonds, and the whole molecule becomesIrregular crimpAnd denaturation of enzyme molecules.The urea is removed by dialysis. In the presence of oxygen, the disulfide bond is formed again, and the enzyme molecule is completelyRenaturationThe paired sulfhydryl groups in disulfide bonds are the same as those in nature. The refolded molecules can be crystallized and havecrystalThe same X-ray diffraction pattern confirmed that the enzyme molecule could not only refold spontaneously, but only chose one of 105 possible disulfide bond pairing modes during refolding.
In 2023, Japanese scientists discovered a series of previously unknown protein folding.[1]
Japanese scientists have discovered a series of previously unknown protein folding
theoretical model
Announce
edit
Frame model
(Framework Model)
Frame model[4] It is assumed that the local conformation of the protein depends on the localamino acidSequence.stayPolypeptide chainIn the initial stage of the folding process, unstableSecondary structureUnit;It is called "flicking cluster", and then these secondary structures are close to each other to form a stable secondary structure framework;Finally, the secondary structural frames are spliced together,Peptide chainGradually shrink, forming the tertiary structure of protein.This model believes that even aSmall moleculeThe protein can also be folded part by part, and the sub domain formed in it is an important structure of folding intermediates.
Collapse model
(Hydrophobic Collapse Model)
In the hydrophobic collapse model [5], hydrophobicForceIt is considered to be a decisive factor in protein folding.After forming anySecondary structureandTertiary structureBefore that, fast nonspecific hydrophobic collapse occurred first.
Adhesion mechanism
(Diffusion-Collision-Adhesion Model)
This model believes that protein folding starts from stretchingPeptide chainSeveral sites on which unstableSecondary structureUnit or hydrophobic cluster, mainly depending on the process of local sequence orMedium range(3-4Residue)Interaction to maintain.They are nonspecificBrownian motionThe way of diffusion, collision and mutual adhesion leads to the formation of large structures and thus increases the stability.Further collision forms a class with hydrophobic core and secondary structureMolten spheroid stateSpherical structure of intermediate.The spherical intermediate is adjusted to a dense, inactive, highly ordered molten spheroidal structure similar to the natural structure.Finally, the inactive highly ordered molten ball state is transformed into a complete and active natural state.
growth model
(Nuclear-Condensation-Growth Model)
According to this model,Peptide chainAn area in the can form a "foldCrystal nucleus”With them as the core, the whole peptide chain continues to fold to obtain a natural conformation.The so-called "crystal nucleus" is actually composed of some specialAmino acid residueFormed similar to natural state interactionnetwork structureThese residues are not nonspecificHydrophobic actionIt is maintained by the specific interaction that makes these residues form a tight stack.Nucleus formation is the initial stage of foldingSpeed limit steps。
Mosaic model
(Jig-Saw Puzzle Model)
The central idea of this model [9] isPolypeptide chainIt can be folded along many different paths. In the process of folding along each path, there are more and more natural structures, which can eventually form a natural conformation. In addition, the folding speed along each path is faster, which is more than that of a single pathPeptide chainThe speed is fast. On the other hand, small changes or mutations in the external physiological and biochemical environment may have a greater impact on a single folding path. For folding methods with multiple paths, these changes may have an impact on one folding path, but will not affect another, so they will not interfere with the folding of polypeptide chains in general,Unless the changes caused by these factors are too large to fundamentally affect the folding of polypeptide chains.
Lattice model
Lattice pointThe model (also referred to as HP model) was first proposed by Dill et al. in 1989.The lattice model can be divided into2D modelandthree-dimensional modelTwo types.The two-dimensional grid point model is orthogonal in the plane spaceUnit length's grid, eachamino acidMolecules are placed on these grid intersections in sequence. When adjacent amino acid molecules in the sequence are placed in the grid, they must also be adjacent, that is, the distance between adjacent amino acid molecules in the grid model is 1.However, it should be noted that each intersection in the grid can only place one amino acid molecule at most. If an amino acid molecule in the sequence has been placed at this positionPostorderThe amino acid molecule of can no longer be placed on this grid point.If there is no place for the amino acid molecule to be placed during the placement of amino acid molecules, it means that the configuration is unreasonable and needs to be relocated.The 3D grid model is similar to the 2D grid modelthree-dimensional spaceThe solid mesh per unit length generated in.Place in grid pointamino acidThe molecular method is the same as that of two-dimensional, but when placing amino acid molecules in two-dimensional grid model, there are only three directions to choose at most except for the first two amino acid molecules in the sequence, while in three-dimensional grid model, the complexity has been improved a lot, and there are at most five directions to choose when placing amino acid molecules.
Molecular chaperone
Announce
edit
In 1978, LaskeyHistoneandDNAIn vitro physiologyionic strengthIt was found during the experiment that there must benucleusIn the presence of the acidic protein nucleoplasmin, they can be assembled intoNucleosomeOtherwise, sedimentation will occur.So Laskey calls it“Molecular chaperone”。Molecular chaperone refers to the ability to bind and stabilize the unstable conformation of another protein, and to promote newborn through controlled binding and releasePolypeptide chainFoldPOLYMERAssembly or degradation andOrganelleA class of proteins involved in transmembrane transport of proteins [10,11].Molecular chaperone is defined from the function. All proteins with this function are molecular chaperones, and their structures can be completely different.This concept has been extended to many proteins, and the molecular chaperones identified now mainly belong to three types of highly conservativeProtein family[12]:stress 90 family、stress 70 family、stress 60 family。The stress 60 family exists inEukaryoteOfmitochondrion(onmammalCalled Hsp58 in)chloroplast(called cpn60), in theprokaryoteOfcytoplasmIt is called GroEL.
significance
Announce
edit
The elucidation of protein folding mechanism will reveal the second set in lifeGenetic codeThis is its theoretical significance.The narrow definition of protein folding is to study protein specificthree-dimensional spaceThe regularity and stability of structure formationbiological activityRelationship.ConceptuallythermodynamicsThe problem of dynamics;Protein folding in vitro and in cells;Theoretical research andexperimental study Problems.Here's the most fundamentalscientific problems namelyPolypeptide chainHow does its primary structure determine its spatial structure?Since the former determines the latter, there must be a certain relationship between the primary structure and the spatial structurenucleotideVia“Triple password”Decisionamino acidIs there a set of passwords in that order?Some people call the password that the primary structure determines the spatial structure "the second genetic code".
If the "triple code" has been deciphered but has actually become clear code, then decipher the "secondGenetic code”Exactly“protein structurePrediction is the most direct way to solve the problem of protein folding in theory, which is one of the last mysteries of protein research that has not yet been revealed.“Protein structure prediction”TheoreticalthermodynamicsQuestion.It is to predict the specific spatial structure determined by the Anfinsen principle according to the measured protein primary sequence.The amino acid sequence of a protein, especially one that encodes proteinsnucleotide sequenceThe determination of DNA has now become almost routine technology, from complementary DNA(cDNA)The sequence can be based on“Triple password”The deduced amino acid sequence, which has made great breakthrough in the last centuryMolecular Biology Technology, greatly acceleratedPrimary structure of proteinDetermination of.at presentProtein databaseThe primary structure of about 170000 proteins has been stored in, but only about 12000 proteins have been measured for spatial structure, many of which are very similarHomologous proteinThere are only more than 1000 different proteins.along withhuman genome projectThe successful completion of, has interpreted the entire sequence of human DNA, the growth of protein primary structure data will inevitably explode, and the speed of spatial structure measurement is far behind, so there will be a greater distance between the two, which needs moreprotein structureForecast of.
prospect
Announce
edit
At the same time, it also has important potential application prospects, such as the following aspects:
▲DNA recombinationandPolypeptide synthesisThe development of technology enables us to design longPolypeptide chain。However, because we cannot understand what kind of conformation this polypeptide will fold, we cannot design the protein we need with specific functions according to our own wishes.
Pathogenesis
▲ Many diseases, such asAlzheimer's disease (Alzheimer's),Mad cow disease(Mad Cow,BSE), transmissibilitySpongiform encephalopathy(CJD), MuscleAtrophic lateral sclerosis(ALS)And Parkinson's disease, which is caused by the mutation of some important proteins in cellsCoalescence precipitationOr wrong folding.Therefore, an in-depth understanding of the relationship between protein folding and misfolding will be of great help to the elucidation of the pathogenesis of these diseases and the search for treatment methods.
Revealing function
▲ The development of genome sequence has enabled us to obtain a large number of protein sequences, and the acquisition of structural information is very important to reveal their biological functions.Rely on existing means (X-ray crystal diffractionNMRIt takes a long time to determine the protein structure, so the pace of structure analysis has lagged behind the pace of discovering new proteins.Although the method of structural prediction is fast, its reliability is not highprotein structureOnly when the physical and chemical factors driving protein folding are better understood can this method be fundamentally improved.In addition, we are interested in the interaction between proteinsligandInteraction with proteinIsostructureAndFunctional relationshipThe research of protein folding also depends on the elucidation of protein folding mechanism.
Folding disease
Announce
edit
Protein molecularamino acidThe sequence does not change, but its structure or conformation changes can also cause disease, which is called“Conformational disease”, or "folding disease".
Mad cow diseasefromPrionIt is caused by the infection of protein, which can also infect peopleNervous system diseases。In the normal body, Prion is a protein required for normal neural activity, and the disease causing Prion and normal PrionPrimary structureIdentical, but different in spatial structure.
The diseases caused by the abnormal folding of proteins that result in molecular aggregation or even precipitation or can not be transported in place normally aresenile dementia, cystic fibrosis, familial hypercholesterolemia, familial amyloidosis, sometumour、cataractwait.
becauseMolecular chaperoneIt plays an important role in protein folding. Mutation of molecular chaperone itself will obviously cause abnormal protein folding and cause folding disease.With the deepening of protein folding research, scientists will find the true causes of more diseases and more targeted treatment methods, and design more effective drugs.