; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C007790.jh1 (gene) of Melon (Harukei-3) v1.41 genome

Gene IDMELO3C007790.jh1
OrganismCucumis melo var. reticulatus cv. Harukei-3 (Melon (Harukei-3) v1.41)
DescriptionTranslation initiation factor IF-2, putative isoform 1
Genome locationchr08:5293685..5296586
RNA-Seq ExpressionMELO3C007790.jh1
SyntenyMELO3C007790.jh1
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604007.1 hypothetical protein SDJN03_04616, partial [Cucurbita argyrosperma subsp. sororia]1.34e-14187.55Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ
        MARRKAKK+VKKS+PSP+ EAKD +AN            DEDVERHA AIRAIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQ
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ

Query:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI
        GEIEVQWKDTEDEL TNPADGIDIHASLLHRLS AYPNCSAGMRSFNGFEFSSKSVKTNPF VENLQIPNF LEEPSDNMVLGMPD+LQTPGV NQRLSI
Subjt:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGE+LVSIHGSPLGVY+EDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_004143493.2 uncharacterized protein LOC101215210 [Cucumis sativus]1.12e-15396.94Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
        MARRKAKKTVKKS+PSP+LEAKDE ANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED

Query:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
        ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNF LEEPSDN+VLGMPDI QTPGV NQRLSIGMTPKTRRLPKP
Subjt:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP

Query:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_008440665.1 PREDICTED: uncharacterized protein LOC103485008 [Cucumis melo]3.64e-159100Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
        MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED

Query:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
        ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
Subjt:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP

Query:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_023543921.1 uncharacterized protein LOC111803645 [Cucurbita pepo subsp. pepo]2.21e-14087.14Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ
        MARRKAKK+VKKS+PSPV EAKD +AN            DEDVERHA AIRAIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQ
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ

Query:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI
        GEIEVQWK+TEDEL TNPADGIDIHASLLHRLS AYPNCSAGMRSFNGFEFSSKSVKTNPF VENLQIPNF LEEPSDNMVLGMPD+LQTPGV NQRLSI
Subjt:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGE+LVSIHGSPLGVY+E+NMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_038891412.1 uncharacterized protein LOC120080833 [Benincasa hispida]1.47e-13989.74Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETAN------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ
        MARR+AKKTVKKS+PSP  +AKDE AN      DEDVERHAAAIRAIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSISR GEQGEIEVQ
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETAN------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ

Query:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT
        WKDTEDEL TNPADGIDIHASLLH LS AYP CSAGMRSFNGFEFSSKSVKTNPF VENLQIPN  LEEPSDNMVLGMP+ILQTPGV NQRLSIGMTPKT
Subjt:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT

Query:  RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
Subjt:  RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

TrEMBL top hitse value%identityAlignment
A0A0A0KK81 Uncharacterized protein1.05e-15196.07Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
        MARRKAKKTVKKS+PSP+LEAKDE ANDEDVERHAAAIRAIRDVEI RLIT LRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED

Query:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
        ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNF LEEPSDN+VLGMPDI QTPGV NQRLSIGMTPKTRRLPKP
Subjt:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP

Query:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A1S3B1M1 uncharacterized protein LOC1034850081.76e-159100Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
        MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED

Query:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
        ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
Subjt:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP

Query:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A1S4DWB9 uncharacterized protein LOC1034894368.27e-13988.51Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETAN------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ
        MARRKAKKTVKKS+PS    AKDE A+      DEDVERHAAAIRAIRDVEI RLITELRLLRSYFNKEQLQTPLLQFFEEKLP LSIS  G+QGEIEVQ
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETAN------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ

Query:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT
        WKDTEDEL TNPADG+DIHASLLHRLS AYP CSAGMRSFNGFEFSSKSVKTNPF  ENLQIPNF LEEPSDNMVLGMPDILQTPG+ NQRLSIGMTPKT
Subjt:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT

Query:  RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  RRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A5D3CLG2 Uncharacterized protein1.76e-159100Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
        MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTED

Query:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
        ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP
Subjt:  ELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKP

Query:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A6J1ILD3 uncharacterized protein LOC1114784964.86e-13785.83Show/hide
Query:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ
        MARRKAKK+VKKS+PSPV EAKD +AN            DEDVERHA AIRAIRDVEI RLITELRLLRSYFNKEQLQTPLLQFF EKLP LSISR GEQ
Subjt:  MARRKAKKTVKKSNPSPVLEAKDETAN------------DEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQ

Query:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI
        GEIEVQWKDTEDEL TNPADGIDIHASLLHRLS AYPNCSAG+RSFNGFEFSSKSVKTNPF VENLQIPNF LEEPSDNMVLGMPD+LQTPG  NQRLSI
Subjt:  GEIEVQWKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        GMTPKTRRLPKPGE++VSIHGSPLGVY+E NMEAIHESEE
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39630.1 unknown protein7.1e-5050Show/hide
Query:  MARRKAKKTVKKSNPSP------VLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ
        M +RKAK+ VK +           +E K E   DE+VER  AAIRAIRDVEI +++T LRLLRSYF +EQL TP+L FF+E LP LSISR  E GEIE++
Subjt:  MARRKAKKTVKKSNPSP------VLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQ

Query:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT
        W+D   +      +G+D++ S+L RLS+ + +  +   S  G++    +VK N    +N Q+ N   +  S+N +L   D  QTPGV  QRLS GMTPKT
Subjt:  WKDTEDELRTNPADGIDIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKT

Query:  RRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE
         RLPK GEM++S+HGSPLGVYKED NM AI+E
Subjt:  RRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGGCGAAAGGCGAAGAAAACAGTTAAGAAGTCCAACCCTTCACCCGTACTAGAAGCAAAGGATGAAACAGCGAATGACGAGGATGTTGAACGGCATGCTGCTGC
AATCCGTGCCATTCGGGATGTGGAGATCGGGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACAAGGAGCAATTACAAACTCCTTTATTGCAATTCTTCG
AAGAAAAACTTCCTAGGTTGTCTATTTCGAGAACAGGCGAACAAGGAGAAATTGAAGTACAATGGAAGGATACAGAGGATGAGTTACGCACCAATCCAGCTGATGGAATA
GATATACATGCGTCTCTCCTTCACCGCCTGTCGATAGCTTATCCTAACTGCTCTGCCGGAATGCGATCTTTCAATGGATTCGAATTTTCCAGTAAATCAGTGAAAACGAA
TCCTTTCATTGTCGAGAACCTGCAAATTCCGAACTTCGATTTGGAGGAGCCCTCGGATAATATGGTGCTCGGCATGCCAGACATACTCCAAACTCCTGGGGTGATTAACC
AACGATTGTCCATTGGGATGACTCCGAAAACCCGAAGACTGCCAAAACCAGGCGAGATGCTTGTGTCCATCCATGGATCCCCTTTGGGTGTTTACAAGGAAGACAACATG
GAAGCAATCCATGAATCGGAAGAGGGTTGA
mRNA sequenceShow/hide mRNA sequence
AACAGCACTATTATTCAATTTCAAACGACCATCTTCTCCCGCCCAATCTTCTTGCCCACGTCTCAACGTCGGTTCTCCGGCAGAGCTGAAAAAGAGGAAGAGACAGATTT
CGCGGCAGTCACCGGCGTAGATTAATCCATTTATTTCCATCTCCCGCGCCTACATTTCTTCATGCTTTGGAATCCCACTGCGAGCTTCAATTAATTTCACTGCGCACTAC
ATTAGGGTTTGAAAGTCTTTTATACAGTTGAAAATCCAGTGACGAAATACGGATTCATCTCAGAATGCTGCAACCTGAGTAGTTCAGTAAAGTCAAGAAAGATTGGTAGC
TGAGAATGGCAAGGCGAAAGGCGAAGAAAACAGTTAAGAAGTCCAACCCTTCACCCGTACTAGAAGCAAAGGATGAAACAGCGAATGACGAGGATGTTGAACGGCATGCT
GCTGCAATCCGTGCCATTCGGGATGTGGAGATCGGGCGTTTGATTACTGAATTGCGGTTGCTTCGTTCGTATTTCAACAAGGAGCAATTACAAACTCCTTTATTGCAATT
CTTCGAAGAAAAACTTCCTAGGTTGTCTATTTCGAGAACAGGCGAACAAGGAGAAATTGAAGTACAATGGAAGGATACAGAGGATGAGTTACGCACCAATCCAGCTGATG
GAATAGATATACATGCGTCTCTCCTTCACCGCCTGTCGATAGCTTATCCTAACTGCTCTGCCGGAATGCGATCTTTCAATGGATTCGAATTTTCCAGTAAATCAGTGAAA
ACGAATCCTTTCATTGTCGAGAACCTGCAAATTCCGAACTTCGATTTGGAGGAGCCCTCGGATAATATGGTGCTCGGCATGCCAGACATACTCCAAACTCCTGGGGTGAT
TAACCAACGATTGTCCATTGGGATGACTCCGAAAACCCGAAGACTGCCAAAACCAGGCGAGATGCTTGTGTCCATCCATGGATCCCCTTTGGGTGTTTACAAGGAAGACA
ACATGGAAGCAATCCATGAATCGGAAGAGGGTTGATTTGCTGATTCAAGGCAACCTGACTGCTTGTCAATAGAAGTGATACTCCAACAATCATCTCGACCATTGTTTTAC
TCATACCTGATGTACTTTCAACCTTCTTGTCTTGTATCTAACCTTTTGATTACTCGTTAAAAGTTACTCTTATGTGTATAACTATAAACTATTGTGTTTTTAACTACAAG
GAACGAGCATGTACAAACATGCAATTGGTCAACTAGATTCTTGTGCATTGTTATTTGTTGCGAAAGTTGTATTGGATTAAAGGTGAATCACAGGTTACATCAT
Protein sequenceShow/hide protein sequence
MARRKAKKTVKKSNPSPVLEAKDETANDEDVERHAAAIRAIRDVEIGRLITELRLLRSYFNKEQLQTPLLQFFEEKLPRLSISRTGEQGEIEVQWKDTEDELRTNPADGI
DIHASLLHRLSIAYPNCSAGMRSFNGFEFSSKSVKTNPFIVENLQIPNFDLEEPSDNMVLGMPDILQTPGVINQRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNM
EAIHESEEG