CuGenDBv2

MELO3C016367 (gene) of Melon (DHL92) v4 genome

Gene ID: MELO3C016367
Organism: Cucumis melo DHL92 (Melon (DHL92) v4)
Description: DUF4228 domain-containing protein
Genome location: chr07:22152063..22152985
RNA-Seq Expression: MELO3C016367
Synteny: MELO3C016367
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
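
The genome location string above can be split into a chromosome name and a coordinate pair. The following is a minimal Python sketch of that parsing, not part of CuGenDB itself: the function name `parse_location` is hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr07:22152063..22152985")
# Span length, assuming 1-based inclusive coordinates (an assumption).
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 923 bp on chr07.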


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044818.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]; e value 5.5e-118; identity 99.55%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGG DGEEMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

XP_004146558.1 uncharacterized protein LOC101218947 [Cucumis sativus]; e value 2.3e-116; identity 98.65%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDG EMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSR SEKNGGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

XP_008452047.1 PREDICTED: uncharacterized protein LOC103493172 [Cucumis melo]; e value 8.5e-119; identity 100%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

XP_023552343.1 uncharacterized protein LOC111810037 [Cucurbita pepo subsp. pepo]; e value 9.8e-107; identity 91.93%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLL RNRGRNRG  +  EMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKN-GGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQW
        SNSVPEA AA A MAPYRMSFDYQGVLRRSQTEVFSR SEKN GGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSD W
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKN-GGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQW

Query:  SLSTATANATPSVSSKSGGLLEI
        SLS+ TANATPS SSKSGGLLEI
Subjt:  SLSTATANATPSVSSKSGGLLEI

XP_038902281.1 uncharacterized protein LOC120088913 [Benincasa hispida]; e value 1.3e-111; identity 95.5%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRG E+  EMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAA AAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LST TANATPS SSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KUU6 Uncharacterized protein; e value 1.1e-116; identity 98.65%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDG EMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSR SEKNGGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

A0A1S3BSZ8 uncharacterized protein LOC103493172; e value 4.1e-119; identity 100%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

A0A5A7TPJ8 DUF4228 domain-containing protein; e value 2.7e-118; identity 99.55%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGG DGEEMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

A0A5D3D1S5 DUF4228 domain-containing protein; e value 4.1e-119; identity 100%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LSTATANATPSVSSKSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

A0A6J1JCF7 uncharacterized protein LOC111483212; e value 8.1e-107; identity 90.99%
Query:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR
        MGNCLF GG+GEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLL RNRGRNRG  +  EMG+IRAREGHVR
Subjt:  MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVR

Query:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS
        SNSVPEA AA A MA YRMSFDYQGVLRRSQTEVFSR SEKNGGVWKVKLVISP+RLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSD WS
Subjt:  SNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWS

Query:  LSTATANATPSVSSKSGGLLEI
        LS+ TANATPS S+KSGGLLEI
Subjt:  LSTATANATPSVSSKSGGLLEI

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G64700.1 unknown protein; e value 3.4e-33; identity 43.54%
Query:  MGNCLFAG-GAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHV
        MGNCLF G G  E    IKVI S+GG++E  SP+T G ++  F G+ +F + DL W PL H+  L+PG+SYYL P            +E+        HV
Subjt:  MGNCLFAG-GAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHV

Query:  RSNSVPEAAAAMAAMAPYRMSFDY-QGVLRRSQTEVFSRCS----------------EKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGS
        RSNS      +++A+ PYRMS DY   VL+RS T+VFSR S                   G +WKV L+I+   L++IL E+G T ELIESVR VAK G 
Subjt:  RSNSVPEAAAAMAAMAPYRMSFDY-QGVLRRSQTEVFSRCS----------------EKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGS

Query:  TSTSSSFSS
        TS+ +S SS
Subjt:  TSTSSSFSS

AT3G61920.1 unknown protein; e value 4.5e-25; identity 38.21%
Query:  MGNCLFAGGAG------EIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWN--PLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGII
        MGNC+F G  G      +    IKV+T NGG+MEL  PI    I +EFPG+ I  S  L  +  PL H EEL PG  YYLLP +                
Subjt:  MGNCLFAGGAG------EIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWN--PLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGII

Query:  RAREGHVRSNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSS
                + +  +  ++     PYRMSF         +T + +  S    GVWKV+LVISP +L EIL E+  T+ L+ESVRTVAKCG          S
Subjt:  RAREGHVRSNSVPEAAAAMAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSS

Query:  MAFSDQWSLSTA
         A SDQ S++++
Subjt:  MAFSDQWSLSTA
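
The % identity values listed for the hits above appear consistent with counting identical positions on the alignment midline and dividing by the alignment length; for example, one mismatch over 222 aligned positions gives 99.55. The following is a minimal Python sketch of that calculation, assuming the usual BLAST midline convention (a letter marks an identity, `+` a conservative substitution, a space a mismatch or gap). The helper name `percent_identity` is hypothetical, and the input is only the first 12 midline characters of the Benincasa hispida alignment, not a full alignment.

```python
def percent_identity(midline, aligned_length=None):
    """Percentage of aligned columns whose midline character is a letter.

    aligned_length can be passed explicitly when trailing spaces of the
    midline may have been stripped from the text.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)

# First 12 midline characters of the Benincasa hispida alignment.
midline = "MGNCLF GG+GE"
print(percent_identity(midline))
```

To reproduce a value from the tables, the midline segments of one hit would need to be concatenated first, since each alignment is wrapped over several lines.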


Sequences
CDS sequence
ATGGGTAACTGTTTATTCGCCGGAGGGGCAGGGGAGATTCAGGGCAAAATCAAAGTGATTACATCTAATGGTGGGATTATGGAGTTGGGTTCTCCGATTACGGTTGGTTG
TATCGCCGATGAGTTTCCCGGTTACGGAATATTCAAAAGTCACGATCTTTTTTGGAACCCACTTCCTCATAATGAGGAGCTGCTTCCCGGGAAATCATACTACTTGCTTC
CGAGAAACAGGGGAAGAAACAGAGGAGGAGAAGATGGGGAGGAGATGGGGATTATAAGGGCGCGTGAAGGGCACGTGCGGTCGAATAGTGTACCGGAAGCGGCGGCGGCG
ATGGCGGCTATGGCGCCGTATAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGATCACAAACGGAGGTGTTTTCACGGTGTAGTGAGAAGAATGGTGGGGTTTGGAA
AGTGAAATTGGTGATTAGTCCAAGGAGATTGGTGGAGATTTTGGAAGAAGAAGGACATACTCAAGAATTGATTGAAAGTGTAAGAACTGTGGCGAAATGTGGAAGCACAA
GCACGAGCAGTAGCTTCTCGTCGTCAATGGCGTTTTCCGATCAATGGAGTTTGTCGACGGCCACTGCCAATGCTACTCCAAGTGTTTCTTCCAAAAGTGGTGGCTTGTTG
GAGATTTAA
mRNA sequence
ATGGGTAACTGTTTATTCGCCGGAGGGGCAGGGGAGATTCAGGGCAAAATCAAAGTGATTACATCTAATGGTGGGATTATGGAGTTGGGTTCTCCGATTACGGTTGGTTG
TATCGCCGATGAGTTTCCCGGTTACGGAATATTCAAAAGTCACGATCTTTTTTGGAACCCACTTCCTCATAATGAGGAGCTGCTTCCCGGGAAATCATACTACTTGCTTC
CGAGAAACAGGGGAAGAAACAGAGGAGGAGAAGATGGGGAGGAGATGGGGATTATAAGGGCGCGTGAAGGGCACGTGCGGTCGAATAGTGTACCGGAAGCGGCGGCGGCG
ATGGCGGCTATGGCGCCGTATAGAATGTCGTTTGATTATCAGGGGGTTTTGAGGAGATCACAAACGGAGGTGTTTTCACGGTGTAGTGAGAAGAATGGTGGGGTTTGGAA
AGTGAAATTGGTGATTAGTCCAAGGAGATTGGTGGAGATTTTGGAAGAAGAAGGACATACTCAAGAATTGATTGAAAGTGTAAGAACTGTGGCGAAATGTGGAAGCACAA
GCACGAGCAGTAGCTTCTCGTCGTCAATGGCGTTTTCCGATCAATGGAGTTTGTCGACGGCCACTGCCAATGCTACTCCAAGTGTTTCTTCCAAAAGTGGTGGCTTGTTG
GAGATTTAACAGTAAATAATTTTTTCCAAGTTTAATAAGTTTATTTCACTTTAGTTCCTTTAGGGTTTCTTTCATGATATCTATTTCTTTTGTTTTTTCCTTAAGTGATT
ATTTAATAGATGATAGATATAGTGACTTTCAAATAGCTGTATTAATATATTACATGTGGGTTTGGCATATCAATTATTTAATAGAGAATATACATATAGGGACTTTGATA
TATGTATTTTTATGTTAACATGAACTTGTTTGCAAGATATTTG
Protein sequence
MGNCLFAGGAGEIQGKIKVITSNGGIMELGSPITVGCIADEFPGYGIFKSHDLFWNPLPHNEELLPGKSYYLLPRNRGRNRGGEDGEEMGIIRAREGHVRSNSVPEAAAA
MAAMAPYRMSFDYQGVLRRSQTEVFSRCSEKNGGVWKVKLVISPRRLVEILEEEGHTQELIESVRTVAKCGSTSTSSSFSSSMAFSDQWSLSTATANATPSVSSKSGGLL
EI
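
The relationship between the CDS and the protein sequence above can be checked by translating codons with the standard genetic code. The following is a minimal Python sketch under that assumption (this page does not state which translation table was used); the `translate` function and `CODON_TABLE` name are hypothetical, and only the first seven and last three codons of the CDS are used as input.

```python
# Standard genetic code, built in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGGTAACTGTTTATTCGCC"))  # first seven codons of the CDS
print(translate("GAGATTTAA"))              # last three codons of the CDS
```

The two fragments translate to the start (MGNCLFA) and end (EI) of the protein sequence listed above; translating the full CDS requires joining its lines into one string first.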