; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002088 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002088
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionnuclear factor Y, subunit B13
Genome locationscaffold30:2299190..2300893
RNA-Seq ExpressionMS002088
SyntenyMS002088
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR003958 - Transcription factor CBF/NF-Y/archaeal histone domain
IPR009072 - Histone-fold
IPR044255 - Protein Dr1 homolog


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011648700.1 protein Dr1 homolog isoform X1 [Cucumis sativus]1.1e-6696.48Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYI EVYAAYEQHR+ETMK LQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQ EPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

XP_022134978.1 protein Dr1 homolog isoform X1 [Momordica charantia]8.0e-70100Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

XP_022921863.1 protein Dr1 homolog isoform X1 [Cucurbita moschata]6.3e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHR ETMK  QQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

XP_022929360.1 protein Dr1 homolog isoform X1 [Cucurbita moschata]2.2e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKE+KRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMK+LQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQ EPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

XP_038879126.1 protein Dr1 homolog isoform X1 [Benincasa hispida]1.5e-6898.59Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMK LQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

TrEMBL top hitse value%identityAlignment
A0A6J1C3I2 protein Dr1 homolog isoform X13.9e-70100Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

A0A6J1E503 protein Dr1 homolog isoform X13.1e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHR ETMK  QQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

A0A6J1EU70 protein Dr1 homolog isoform X11.1e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKE+KRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMK+LQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQ EPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

A0A6J1JII3 protein Dr1 homolog isoform X13.1e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHR ETMK  QQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQPEPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

A0A6J1KL43 protein Dr1 homolog isoform X11.1e-6797.18Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKE+KRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMK+LQQDSLKGGKWSN
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GAEMTEEEALAEQQRMFAEARARMNG+NTAPKQ EPEQSLES
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

SwissProt top hitse value%identityAlignment
P49592 Protein Dr1 homolog4.1e-5377.46Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESN+VC+KE+KRTIAPEHVLKAL+VLGF EYIEEVYAAYEQH+ ETM    QD+ +  KW+ 
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GA+MTEEEA AEQQRMFAEARARMNG  + P+   PE    S
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

Q01658 Protein Dr12.2e-2249.19Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        A + K+IKE L P+VRVA DA++L++ CC EFI+L+SSE+NE+C+K EK+TI+PEHV++ALE LGF  YI EV         E ++  +  +LK  K S+
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAE---MTEEEALAEQQRMFAEAR
          E   + EEE L +QQ +FA+AR
Subjt:  GAE---MTEEEALAEQQRMFAEAR

Q55DJ5 Protein Dr1 homolog2.1e-2544.93Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        AT++K+IKEMLP DV+ + + +DL++ECCVEFI+L+SSE+N++C +E+KRTIA EHV+KAL  LGFS+Y ++V   Y++H+LE    +   S    K+ N
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQ
          + T E+ + EQQ +FA+AR+    +  A +  + +Q
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQ

Q5ZMV3 Protein Dr12.2e-2249.19Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        A + K+IKE L P+VRVA DA++L++ CC EFI+L+SSE+NE+C+K EK+TI+PEHV++ALE LGF  YI EV         E ++  +  +LK  K S+
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAE---MTEEEALAEQQRMFAEAR
          E   + EEE L +QQ +FA+AR
Subjt:  GAE---MTEEEALAEQQRMFAEAR

Q91WV0 Protein Dr12.2e-2249.19Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        A + K+IKE L P+VRVA DA++L++ CC EFI+L+SSE+NE+C+K EK+TI+PEHV++ALE LGF  YI EV         E ++  +  +LK  K S+
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAE---MTEEEALAEQQRMFAEAR
          E   + EEE L +QQ +FA+AR
Subjt:  GAE---MTEEEALAEQQRMFAEAR

Arabidopsis top hitse value%identityAlignment
AT5G08190.1 nuclear factor Y, subunit B124.3e-5381.48Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLP DVRVARDAQDLLIECCVEFINL+SSESNEVC+KE+KRTIAPEHVLKAL+VLGF EY+EEVYAAYEQH+ ETM    QDS +  K ++
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPE
        GAEMTEEEA AEQQRMFAEARARMNG  T P QPE
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPE

AT5G08190.2 nuclear factor Y, subunit B121.6e-5280.74Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLP DVRVARDAQDLLIECCVEFINL+SSESNEVC+KE+KRTIAPEHVLKAL+VLGF EY+EEVYAAYEQH+ ETM     DS +  K ++
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPE
        GAEMTEEEA AEQQRMFAEARARMNG  T P QPE
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPE

AT5G23090.1 nuclear factor Y, subunit B132.9e-5477.46Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESN+VC+KE+KRTIAPEHVLKAL+VLGF EYIEEVYAAYEQH+ ETM    QD+ +  KW+ 
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GA+MTEEEA AEQQRMFAEARARMNG  + P+   PE    S
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

AT5G23090.2 nuclear factor Y, subunit B132.9e-5477.46Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESN+VC+KE+KRTIAPEHVLKAL+VLGF EYIEEVYAAYEQH+ ETM    QD+ +  KW+ 
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GA+MTEEEA AEQQRMFAEARARMNG  + P+   PE    S
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES

AT5G23090.4 nuclear factor Y, subunit B131.1e-5376.76Show/hide
Query:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN
        ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESN+VC+KE+KRTIAPEHVLKAL+VLGF EYIEEVYAAYEQH+ ETM     D+ +  KW+ 
Subjt:  ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSN

Query:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES
        GA+MTEEEA AEQQRMFAEARARMNG  + P+   PE    S
Subjt:  GAEMTEEEALAEQQRMFAEARARMNGNNTAPKQPEPEQSLES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GCGACCATGACCAAAATTATTAAAGAGATGTTGCCCCCTGATGTACGTGTTGCAAGAGATGCGCAAGATCTTCTGATTGAGTGTTGTGTAGAGTTTATAAACCTTGTATC
ATCCGAGTCTAATGAAGTTTGTAGCAAAGAAGAAAAAAGAACAATTGCACCTGAGCACGTGCTCAAGGCTCTCGAGGTGCTTGGTTTTAGTGAGTACATCGAGGAAGTTT
ATGCTGCATATGAACAGCACAGGCTCGAAACTATGAAATCTCTCCAGCAAGACTCCTTGAAAGGTGGAAAGTGGAGCAATGGAGCTGAGATGACCGAGGAAGAAGCTTTG
GCTGAGCAGCAAAGAATGTTTGCAGAGGCTCGTGCAAGAATGAATGGCAACAACACTGCTCCAAAGCAACCGGAGCCCGAGCAAAGTTTAGAGAGC
mRNA sequenceShow/hide mRNA sequence
GCGACCATGACCAAAATTATTAAAGAGATGTTGCCCCCTGATGTACGTGTTGCAAGAGATGCGCAAGATCTTCTGATTGAGTGTTGTGTAGAGTTTATAAACCTTGTATC
ATCCGAGTCTAATGAAGTTTGTAGCAAAGAAGAAAAAAGAACAATTGCACCTGAGCACGTGCTCAAGGCTCTCGAGGTGCTTGGTTTTAGTGAGTACATCGAGGAAGTTT
ATGCTGCATATGAACAGCACAGGCTCGAAACTATGAAATCTCTCCAGCAAGACTCCTTGAAAGGTGGAAAGTGGAGCAATGGAGCTGAGATGACCGAGGAAGAAGCTTTG
GCTGAGCAGCAAAGAATGTTTGCAGAGGCTCGTGCAAGAATGAATGGCAACAACACTGCTCCAAAGCAACCGGAGCCCGAGCAAAGTTTAGAGAGC
Protein sequenceShow/hide protein sequence
ATMTKIIKEMLPPDVRVARDAQDLLIECCVEFINLVSSESNEVCSKEEKRTIAPEHVLKALEVLGFSEYIEEVYAAYEQHRLETMKSLQQDSLKGGKWSNGAEMTEEEAL
AEQQRMFAEARARMNGNNTAPKQPEPEQSLES