; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G10448 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G10448
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionTranslation initiation factor IF-2, putative isoform 1
Genome locationctg1678:376636..380171
RNA-Seq ExpressionCucsat.G10448
SyntenyCucsat.G10448
Gene Ontology termsGO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034696.1 Translation initiation factor IF-2, putative isoform 1 [Cucumis melo var. makuwa]5.97e-14994.04Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARRKAKKTVKKSSPSSGR AKDEA +K+KT SDEDVERHAAAIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRG QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNM LGMPDILQTPG     ISNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

KAG7034170.1 hypothetical protein SDJN02_03897, partial [Cucurbita argyrosperma subsp. argyrosperma]1.78e-14687.8Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTD------SDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQ
        MARRKAKK+VKKSSPS  R+AKD + N+LK++      SDEDVERHA AIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSIS RG+Q
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTD------SDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQ

Query:  GEIEVRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISN
        GEIEV+WKDTEDELHTNPADG+DIHASLLHRLSTAYP CSAGMRSFNGFEFSSKSVKTNPFN ENLQIPNF LEEPSDNM LGMPD+LQTPGVRIFM+SN
Subjt:  GEIEVRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISN

Query:  QRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        QRLSIGMTPKTRRLPKPGE+LVSIHGSPLGVY+EDNMEAIHESEEG
Subjt:  QRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_016900279.1 PREDICTED: uncharacterized protein LOC103489436 [Cucumis melo]2.75e-15294.17Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARRKAKKTVKKSSPSSGR AKDEA +K+KT SDEDVERHAAAIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRG QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNM LGMPDILQTPG     ISNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_031741813.1 uncharacterized protein LOC101204054 [Cucumis sativus]1.36e-16599.17Show/hide
Query:  MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE
        MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE
Subjt:  MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE

Query:  VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLS
        VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSD+MTLGMPDI+QTPGVRIFMISNQRLS
Subjt:  VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_038891412.1 uncharacterized protein LOC120080833 [Benincasa hispida]4.87e-14690.38Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARR+AKKTVKKSSPS GRDAKDEA N+LK+D DEDVERHAAAIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLP+LSIS RG+QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDELHTNPADG+DIHASLLH LSTAYPYCSAGMRSFNGFEFSSKSVKTNPFN ENLQIPN  LEEPSDNM LGMP+ILQTPGV     SNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

TrEMBL top hitse value%identityAlignment
A0A0A0KK81 Uncharacterized protein1.52e-13687.5Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARRKAKKTVKKSSPS   +AKDEA N      DEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLP LSIS  G+QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDEL TNPADG+DIHASLLHRLS AYP CSAGMRSFNGFEFSSKSVKTNPF  ENLQIPNFVLEEPSDN+ LGMPDI QTPGV     SNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A0A0KRH4 Uncharacterized protein6.61e-16699.17Show/hide
Query:  MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE
        MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE
Subjt:  MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIE

Query:  VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLS
        VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSD+MTLGMPDI+QTPGVRIFMISNQRLS
Subjt:  VRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A1S4DWB9 uncharacterized protein LOC1034894361.33e-15294.17Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARRKAKKTVKKSSPSSGR AKDEA +K+KT SDEDVERHAAAIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRG QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNM LGMPDILQTPG     ISNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A5A7SXV4 Translation initiation factor IF-2, putative isoform 12.89e-14994.04Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        MARRKAKKTVKKSSPSSGR AKDEA +K+KT SDEDVERHAAAIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRG QGEIEV+
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNM LGMPDILQTPG     ISNQRLSIG
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
        MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIH

A0A6J1ILD3 uncharacterized protein LOC1114784961.16e-13684.49Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTD------SDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQ
        MARRKAKK+VKKSSPS  R+AKD + N+LK++      SDEDVERHA AIRAIRDVEIERLIT LRLLRSYFNKEQLQTPLLQFF EKLPSLSIS RG+Q
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTD------SDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQ

Query:  GEIEVRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISN
        GEIEV+WKDTEDELHTNPADG+DIHASLLHRLSTAYP CSAG+RSFNGFEFSSKSVKTNPFN ENLQIPNFVLEEPSDNM LGMPD+LQTPG      SN
Subjt:  GEIEVRWKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISN

Query:  QRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        QRLSIGMTPKTRRLPKPGE++VSIHGSPLGVY+E NMEAIHESEE
Subjt:  QRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39630.1 unknown protein4.9e-4948.52Show/hide
Query:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR
        M +RKAK+ VK +      + +     K +   DE+VER  AAIRAIRDVEIE+++T LRLLRSYF +EQL TP+L FF+E LP LSIS   + GEIE++
Subjt:  MARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVR

Query:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG
        W+D   +      +GVD++ S+L RLS  +    +   S  G++    +VK N    +N Q+ N V +  S+N  L   D  QTPGV     + QRLS G
Subjt:  WKDTEDELHTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE
        MTPKT RLPK GEM++S+HGSPLGVYKED NM AI+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATGGCAAGAAGAAAGGCAAAGAAAACAGTTAAGAAGTCCAGCCCTTCATCTGGACGAGACGCAAAAGATGAAGCAGTGAATAAGTTAAAGACTGACTCCGACGA
AGATGTTGAACGGCATGCTGCTGCAATCCGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGTATTGCGGTTGCTTCGTTCGTATTTCAACAAAGAGCAATTGC
AAACTCCTCTATTGCAATTTTTCGAAGAGAAACTTCCAAGCTTGTCCATTTCGATAAGAGGCAAACAAGGTGAAATTGAAGTACGATGGAAGGATACAGAGGATGAATTA
CACACCAATCCAGCGGATGGAGTAGATATACATGCTTCTCTTCTTCATCGCCTTTCCACAGCTTATCCTTACTGCTCTGCCGGAATGCGATCGTTTAATGGATTTGAATT
TTCCAGTAAATCAGTGAAAACAAATCCTTTCAATGCTGAGAACCTGCAAATTCCGAACTTTGTTTTGGAGGAGCCCTCGGATAATATGACGCTTGGCATGCCAGATATTC
TCCAGACTCCTGGCGTTCGTATCTTCATGATAAGTAACCAAAGATTGTCTATTGGGATGACACCGAAAACCCGAAGACTGCCGAAGCCAGGCGAGATGCTTGTGTCTATC
CATGGATCCCCACTTGGTGTTTACAAGGAAGACAACATGGAAGCAATCCATGAATCAGAAGAGGGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGATAATGGCAAGAAGAAAGGCAAAGAAAACAGTTAAGAAGTCCAGCCCTTCATCTGGACGAGACGCAAAAGATGAAGCAGTGAATAAGTTAAAGACTGACTCCGACGA
AGATGTTGAACGGCATGCTGCTGCAATCCGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGTATTGCGGTTGCTTCGTTCGTATTTCAACAAAGAGCAATTGC
AAACTCCTCTATTGCAATTTTTCGAAGAGAAACTTCCAAGCTTGTCCATTTCGATAAGAGGCAAACAAGGTGAAATTGAAGTACGATGGAAGGATACAGAGGATGAATTA
CACACCAATCCAGCGGATGGAGTAGATATACATGCTTCTCTTCTTCATCGCCTTTCCACAGCTTATCCTTACTGCTCTGCCGGAATGCGATCGTTTAATGGATTTGAATT
TTCCAGTAAATCAGTGAAAACAAATCCTTTCAATGCTGAGAACCTGCAAATTCCGAACTTTGTTTTGGAGGAGCCCTCGGATAATATGACGCTTGGCATGCCAGATATTC
TCCAGACTCCTGGCGTTCGTATCTTCATGATAAGTAACCAAAGATTGTCTATTGGGATGACACCGAAAACCCGAAGACTGCCGAAGCCAGGCGAGATGCTTGTGTCTATC
CATGGATCCCCACTTGGTGTTTACAAGGAAGACAACATGGAAGCAATCCATGAATCAGAAGAGGGTTGA
Protein sequenceShow/hide protein sequence
MIMARRKAKKTVKKSSPSSGRDAKDEAVNKLKTDSDEDVERHAAAIRAIRDVEIERLITVLRLLRSYFNKEQLQTPLLQFFEEKLPSLSISIRGKQGEIEVRWKDTEDEL
HTNPADGVDIHASLLHRLSTAYPYCSAGMRSFNGFEFSSKSVKTNPFNAENLQIPNFVLEEPSDNMTLGMPDILQTPGVRIFMISNQRLSIGMTPKTRRLPKPGEMLVSI
HGSPLGVYKEDNMEAIHESEEG