; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS020331 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS020331
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCST complex subunit STN1
Genome locationscaffold665:1090951..1091457
RNA-Seq ExpressionMS020331
SyntenyMS020331
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0008033 - tRNA processing (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0004674 - protein serine/threonine kinase activity (molecular function)
GO:0004712 - protein serine/threonine/tyrosine kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0106310 - protein serine kinase activity (molecular function)
InterPro domainsIPR004365 - OB-fold nucleic acid binding domain, AA-tRNA synthetase-type
IPR012340 - Nucleic acid-binding, OB-fold
IPR040260 - Replication factor A protein-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571457.1 CST complex subunit STN1, partial [Cucurbita argyrosperma subsp. sororia]3.9e-7183.03Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQ-TSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIAN
        R+F + H H+KLLGFDL SLTQ +SSSSSSDAVSFSR+G+AVSC EIVGVVV RDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFA R  PDVRII N
Subjt:  RSFPMDHRHVKLLGFDLLSLTQ-TSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIAN

Query:  MSTHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP
        M+THFAAQIRVGIV RVRGKV+SYRGVVQITVSDVVVE+DPNA+ILHWLDCMRLALKCYDL P P
Subjt:  MSTHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP

KAG7011223.1 CST complex subunit STN1, partial [Cucurbita argyrosperma subsp. argyrosperma]3.9e-7183.03Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQ-TSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIAN
        R+F + H H+KLLGFDL SLTQ +SSSSSSDAVSFSR+G+AVSC EIVGVVV RDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFA R  PDVRII N
Subjt:  RSFPMDHRHVKLLGFDLLSLTQ-TSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIAN

Query:  MSTHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP
        M+THFAAQIRVGIV RVRGKV+SYRGVVQITVSDVVVE+DPNA+ILHWLDCMRLALKCYDL P P
Subjt:  MSTHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP

XP_022963452.1 LOW QUALITY PROTEIN: CST complex subunit STN1 [Cucurbita moschata]3.5e-7282.32Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM
        R+F + H H+KLLGFDL SLTQ++SSSSSDAVSFSR+G+AVSC EIVGVVV RDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFASR  PDVRII NM
Subjt:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM

Query:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP
        +THFAAQIRVGIV RVRG+V+SYRGVVQ+TVSDVVVE+DPNA+ILHWLDCMRLALKCYDL P P
Subjt:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP

XP_022967373.1 CST complex subunit STN1 [Cucurbita maxima]5.8e-6785.43Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM
        R+F + H H+KLLGFDL SLTQ+SSSSSSDAVSFSR+G AVSCAEIVGVVVSRDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFASR  PDVRII NM
Subjt:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM

Query:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM
        +THFAAQIRVGIV RVRGKV+SYRGVVQITVSDVVVE+DPNA+ILHWLDCM
Subjt:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM

XP_023553655.1 CST complex subunit STN1 [Cucurbita pepo subsp. pepo]1.6e-7282.93Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM
        R+F + H H+KLLGFDL SL Q+SSSSSSDAVSFSR+G+AVSC E+VGVVV RDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFASR  PDVRII NM
Subjt:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM

Query:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP
        +THFAAQIRVGIV RVRGKV+SYRGVVQITVSDVVVE+DPNA+ILHWLDCMRLALKCYDL P P
Subjt:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP

TrEMBL top hitse value%identityAlignment
A0A0A0LNV0 OB domain-containing protein5.8e-6578.66Show/hide
Query:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS
        SF +   HVKLL FDL SL QT   SSSD+VSFSRKG+AVSC EIVGVVV RDLKPNRFLKFSVDDGT CV CILWLN L+S YFASR   DVRI+ +M+
Subjt:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS

Query:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT
        THFAAQIRVGIV RVRGK++SYRG+VQITVSDVVVEDDPNA+ILHWLD MRLA+KCYDL PTPT
Subjt:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT

A0A1S3BLT5 CST complex subunit STN16.9e-6679.27Show/hide
Query:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS
        SF +   HVKLLGFDL SL Q   +SSSD+VSFSRKG+AVSC EIVGVVV RDLKPNRFLKFSVDDGT CV CILWLN L+S YFASR  PDVRI+A+M+
Subjt:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS

Query:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT
        THFAAQIRVGIV RVRGK++SYRG+VQITVSDVVVEDDPNA+ILHWLD MRLA+KCYDL P PT
Subjt:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT

A0A5D3E3H2 CST complex subunit STN16.9e-6679.27Show/hide
Query:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS
        SF +   HVKLLGFDL SL Q   +SSSD+VSFSRKG+AVSC EIVGVVV RDLKPNRFLKFSVDDGT CV CILWLN L+S YFASR  PDVRI+A+M+
Subjt:  SFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMS

Query:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT
        THFAAQIRVGIV RVRGK++SYRG+VQITVSDVVVEDDPNA+ILHWLD MRLA+KCYDL P PT
Subjt:  THFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT

A0A6J1HFA9 LOW QUALITY PROTEIN: CST complex subunit STN11.7e-7282.32Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM
        R+F + H H+KLLGFDL SLTQ++SSSSSDAVSFSR+G+AVSC EIVGVVV RDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFASR  PDVRII NM
Subjt:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM

Query:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP
        +THFAAQIRVGIV RVRG+V+SYRGVVQ+TVSDVVVE+DPNA+ILHWLDCMRLALKCYDL P P
Subjt:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTP

A0A6J1HUX0 CST complex subunit STN12.8e-6785.43Show/hide
Query:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM
        R+F + H H+KLLGFDL SLTQ+SSSSSSDAVSFSR+G AVSCAEIVGVVVSRDLKPNRFL+FSVDDGTGCV+CILWLN L+SPYFASR  PDVRII NM
Subjt:  RSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANM

Query:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM
        +THFAAQIRVGIV RVRGKV+SYRGVVQITVSDVVVE+DPNA+ILHWLDCM
Subjt:  STHFAAQIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM

SwissProt top hitse value%identityAlignment
D2GXY4 CST complex subunit STN14.4e-0931.39Show/hide
Query:  FSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHF---------AAQIRVGIVGRVRGKVTSYR
        F   GH +   EI+G V+ R  K + F  + VDDGTG +NCI W  + S+    S   P + ++ N+++             +I +G + ++RG V +YR
Subjt:  FSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHF---------AAQIRVGIVGRVRGKVTSYR

Query:  GVVQITVSDVVVEDDP--NAQILHWLDCMRLALKCYD
           +I V+     DDP  N QI   L+   +  K YD
Subjt:  GVVQITVSDVVVEDDP--NAQILHWLDCMRLALKCYD

Q8LFJ8 Replication protein A 32 kDa subunit B6.5e-0526.12Show/hide
Query:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR
        L+L Q SS+S++   +FS  G  +    IVG +   + +  + + F VDDGTG V+C+ W +                  A   T     +++G+  R+ 
Subjt:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR

Query:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM
        G +  ++G   + V  V    D N  + H+ +CM
Subjt:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM

Q9LMK5 CST complex subunit STN19.0e-3954.97Show/hide
Query:  HVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHA-VSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQ
        H KL+  D+  LTQ+ + S+    SFS  G A VS  EIVG +VSRDL P +FLKF VDDGTGCV C++WLNQL+S YF+   P  + ++A+ +   AAQ
Subjt:  HVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHA-VSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQ

Query:  IRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCY
        IR+G V RVRG+V SYRGV+QIT +  V E DPNA+ILHWL+C++L   CY
Subjt:  IRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCY

Arabidopsis top hitse value%identityAlignment
AT1G07130.1 Nucleic acid-binding, OB-fold-like protein6.4e-4054.97Show/hide
Query:  HVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHA-VSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQ
        H KL+  D+  LTQ+ + S+    SFS  G A VS  EIVG +VSRDL P +FLKF VDDGTGCV C++WLNQL+S YF+   P  + ++A+ +   AAQ
Subjt:  HVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHA-VSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQ

Query:  IRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCY
        IR+G V RVRG+V SYRGV+QIT +  V E DPNA+ILHWL+C++L   CY
Subjt:  IRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCY

AT2G24490.1 replicon protein A22.5e-0423.2Show/hide
Query:  SSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVRGKVTSYRGV
        SS +       G +++   +VG+V  +D      ++F++DDGTG ++C  W+++     F +R                  +R G   R+ G + +++G 
Subjt:  SSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVRGKVTSYRGV

Query:  VQITVSDVVVEDDPNAQILHWLDCM
         Q+ V  V    D N    H+++C+
Subjt:  VQITVSDVVVEDDPNAQILHWLDCM

AT2G24490.2 replicon protein A22.5e-0423.2Show/hide
Query:  SSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVRGKVTSYRGV
        SS +       G +++   +VG+V  +D      ++F++DDGTG ++C  W+++     F +R                  +R G   R+ G + +++G 
Subjt:  SSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVRGKVTSYRGV

Query:  VQITVSDVVVEDDPNAQILHWLDCM
         Q+ V  V    D N    H+++C+
Subjt:  VQITVSDVVVEDDPNAQILHWLDCM

AT3G02920.1 Replication protein A, subunit RPA324.6e-0626.12Show/hide
Query:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR
        L+L Q SS+S++   +FS  G  +    IVG +   + +  + + F VDDGTG V+C+ W +                  A   T     +++G+  R+ 
Subjt:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR

Query:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM
        G +  ++G   + V  V    D N  + H+ +CM
Subjt:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM

AT3G02920.2 Replication protein A, subunit RPA324.6e-0626.12Show/hide
Query:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR
        L+L Q SS+S++   +FS  G  +    IVG +   + +  + + F VDDGTG V+C+ W +                  A   T     +++G+  R+ 
Subjt:  LSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAAQIRVGIVGRVR

Query:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM
        G +  ++G   + V  V    D N  + H+ +CM
Subjt:  GKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCGGAGAATTTCCGTAGCTTTCCGATGGACCACCGCCACGTTAAGCTCCTGGGTTTTGACCTTCTTTCACTCACTCAAACTTCATCTTCTTCCTCCTCCGATGCCGTCTC
CTTCTCTCGCAAAGGCCATGCCGTCTCCTGTGCAGAGATCGTCGGCGTCGTGGTCTCCCGTGACCTCAAACCCAACAGATTTCTCAAGTTTTCCGTCGACGATGGCACCG
GCTGTGTTAACTGTATTCTATGGCTCAATCAGCTGAGCTCGCCTTACTTTGCAAGCCGGCGCCCACCAGATGTTCGAATAATTGCTAATATGTCGACCCATTTCGCTGCC
CAAATTAGGGTTGGGATTGTTGGTAGAGTCCGGGGGAAGGTCACCAGCTATAGGGGCGTGGTGCAGATCACGGTGTCAGATGTTGTGGTTGAGGATGACCCAAATGCTCA
GATTCTGCATTGGTTGGATTGCATGAGGTTGGCTCTAAAGTGTTATGACCTCTTGCCTACACCCACA
mRNA sequenceShow/hide mRNA sequence
TCGGAGAATTTCCGTAGCTTTCCGATGGACCACCGCCACGTTAAGCTCCTGGGTTTTGACCTTCTTTCACTCACTCAAACTTCATCTTCTTCCTCCTCCGATGCCGTCTC
CTTCTCTCGCAAAGGCCATGCCGTCTCCTGTGCAGAGATCGTCGGCGTCGTGGTCTCCCGTGACCTCAAACCCAACAGATTTCTCAAGTTTTCCGTCGACGATGGCACCG
GCTGTGTTAACTGTATTCTATGGCTCAATCAGCTGAGCTCGCCTTACTTTGCAAGCCGGCGCCCACCAGATGTTCGAATAATTGCTAATATGTCGACCCATTTCGCTGCC
CAAATTAGGGTTGGGATTGTTGGTAGAGTCCGGGGGAAGGTCACCAGCTATAGGGGCGTGGTGCAGATCACGGTGTCAGATGTTGTGGTTGAGGATGACCCAAATGCTCA
GATTCTGCATTGGTTGGATTGCATGAGGTTGGCTCTAAAGTGTTATGACCTCTTGCCTACACCCACA
Protein sequenceShow/hide protein sequence
SENFRSFPMDHRHVKLLGFDLLSLTQTSSSSSSDAVSFSRKGHAVSCAEIVGVVVSRDLKPNRFLKFSVDDGTGCVNCILWLNQLSSPYFASRRPPDVRIIANMSTHFAA
QIRVGIVGRVRGKVTSYRGVVQITVSDVVVEDDPNAQILHWLDCMRLALKCYDLLPTPT