CuGenDBv2

Sed0026038 (gene) of Chayote v1 genome

Gene ID: Sed0026038
Organism: Sechium edule (Chayote v1)
Description: Translation initiation factor IF-2, putative isoform 1
Genome location: LG14:20307352..20311364
RNA-Seq Expression: Sed0026038
Synteny: Sed0026038
Gene Ontology terms:
GO:0006413 - translational initiation (biological process)
GO:0003743 - translation initiation factor activity (molecular function)
InterPro domains: NA
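
The genome location string can be split into its parts programmatically. The sketch below is illustrative only: the `Location` type and `parse_location` function are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (not stated in the record).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'SEQID:START..END'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG14:20307352..20311364")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene region spans 4013 bp on LG14.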


Homology
GenBank top hits | e value | % identity | Alignment
KAG6594850.1 hypothetical protein SDJN03_11403, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.9e-97 | % identity: 82.16
Query:  MGRRKAKRTVNKS----EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL
        M RRKAK+TV  S     EK+KD  ENEL++EEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG GE+
Subjt:  MGRRKAKRTVNKS----EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL

Query:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLS
         +IEVQWK TE EL  N    +D+HASLLHRLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPGV+NQRLS
Subjt:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLS

Query:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        IGMTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  IGMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

XP_022133198.1 uncharacterized protein LOC111005854 [Momordica charantia] | e value: 8.7e-98 | % identity: 82.28
Query:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE
        M RRKAK+T  K         ENE + EEQAPLVS+EDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTP+LQFFEE LPNLSIS  GE  +IE
Subjt:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE

Query:  VQWKHTEGELHT----NLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP
        VQWK   GELHT     +DIHASLLHRLSIAYPNCSAGM+S+NGFEFSSKSVKTNAFNVENLQIPS VLEEPSDS+M GMPD L TPGV NQRLSIGMTP
Subjt:  VQWKHTEGELHT----NLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP

Query:  KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

XP_022962735.1 uncharacterized protein LOC111463139 [Cucurbita moschata] | e value: 1.3e-96 | % identity: 82.43
Query:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD
        M RRKAK+TV KS   EK KD  ENEL++EEQ PLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG GE+ +
Subjt:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD

Query:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG
        IEVQWK TE EL  N    +D+HASLL RLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPGV+NQRLSIG
Subjt:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

XP_023003233.1 uncharacterized protein LOC111496903 [Cucurbita maxima] | e value: 2.8e-96 | % identity: 82.01
Query:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD
        M RRKAK+ V KS   EK KD  ENEL++EEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG  E+ +
Subjt:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD

Query:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG
        IEVQWK TE EL  N    +D+HASLLHRLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPG +NQRLSIG
Subjt:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

XP_023518319.1 uncharacterized protein LOC111781838 [Cucurbita pepo subsp. pepo] | e value: 3.7e-96 | % identity: 82.43
Query:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD
        M RRKAK+TV K+   EK KD  ENEL++EEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG GE+ +
Subjt:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD

Query:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG
        IEVQ K TE EL  N    +D+HASLLHRLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPGV+NQRLSIG
Subjt:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
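
In the alignment blocks above, the middle line of each segment follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of counting these for one segment follows; `segment_stats` is a hypothetical helper, and the per-segment figure it returns is not the whole-alignment % identity reported on each hit line.

```python
def segment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment segment.

    Assumes the BLAST midline convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    """
    length = len(query)
    # Trailing spaces of the midline may have been stripped; pad it back.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "percent_identity": 100.0 * identities / length if length else 0.0,
    }


# Last segment of the Cucurbita pepo hit above.
query = "MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE"
midline = "MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E"
print(segment_stats(query, midline))
```

For this 39-residue segment the function counts 37 identities and 39 positives; summing the counts over all segments of a hit would approximate the alignment-wide figure.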

TrEMBL top hits | e value | % identity | Alignment
A0A1S4DWB9 uncharacterized protein LOC103489436 | e value: 8.5e-91 | % identity: 77.18
Query:  MGRRKAKRTVNKSEEKS----KDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL
        M RRKAK+TV KS   S    KD   ++++T       SDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFFEE LP+LSIS  G+ 
Subjt:  MGRRKAKRTVNKSEEKS----KDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL

Query:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSI
         +IEVQWK TE ELHTN    +DIHASLLHRLS AYP CSAGMRS NGFEFSSKSVKTN FN ENLQIP+ VLEEPSD+++ GMPD L TPG++NQRLSI
Subjt:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A6J1BYE7 uncharacterized protein LOC111005854 | e value: 4.2e-98 | % identity: 82.28
Query:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE
        M RRKAK+T  K         ENE + EEQAPLVS+EDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTP+LQFFEE LPNLSIS  GE  +IE
Subjt:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE

Query:  VQWKHTEGELHT----NLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP
        VQWK   GELHT     +DIHASLLHRLSIAYPNCSAGM+S+NGFEFSSKSVKTNAFNVENLQIPS VLEEPSDS+M GMPD L TPGV NQRLSIGMTP
Subjt:  VQWKHTEGELHT----NLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP

Query:  KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
        KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG
Subjt:  KTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEEG

A0A6J1HHY4 uncharacterized protein LOC111463139 | e value: 6.1e-97 | % identity: 82.43
Query:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD
        M RRKAK+TV KS   EK KD  ENEL++EEQ PLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG GE+ +
Subjt:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD

Query:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG
        IEVQWK TE EL  N    +D+HASLL RLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPGV+NQRLSIG
Subjt:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

A0A6J1ILD3 uncharacterized protein LOC111478496 | e value: 2.0e-92 | % identity: 77.5
Query:  MGRRKAKRTVNKSE----EKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL
        M RRKAK++V KS      ++KD   NEL++E+QA LVSDEDVERHA AIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF E LP+LSIS  GE 
Subjt:  MGRRKAKRTVNKSE----EKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGEL

Query:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSI
         +IEVQWK TE ELHTN    +DIHASLLHRLS AYPNCSAG+RS NGFEFSSKSVKTN FNVENLQIP+ VLEEPSD+++ GMPD L TPG +NQRLSI
Subjt:  RDIEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSI

Query:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        GMTPKTRRLPKPGE++VSIHGSPLGVY+E NMEAIHESEE
Subjt:  GMTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

A0A6J1KR76 uncharacterized protein LOC111496903 | e value: 1.4e-96 | % identity: 82.01
Query:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD
        M RRKAK+ V KS   EK KD  ENEL++EEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKE LQTPLLQFF+E LPNLSISG  E+ +
Subjt:  MGRRKAKRTVNKS--EEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRD

Query:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG
        IEVQWK TE EL  N    +D+HASLLHRLSIAYPN SAGMRSLNGFEFSSKSVKTN FNVE+LQIPSLVLE EPSDS+M GM D L TPG +NQRLSIG
Subjt:  IEVQWKHTEGELHTN----LDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLE-EPSDSVMHGMPDALHTPGVNNQRLSIG

Query:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE
        MTP+TRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHES+E
Subjt:  MTPKTRRLPKPGEMLVSIHGSPLGVYKEDNMEAIHESEE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G39630.1 unknown protein | e value: 7.2e-50 | % identity: 50
Query:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE
        M +RKAK  V  +EE   D  E  +  EE+     DE+VER  AAIRAIRDVEIE+++T LRLLRSYF +E L TP+L FF+ENLP+LSIS   E  +IE
Subjt:  MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIE

Query:  VQWKHTEGEL----HTNLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP
        ++W+   G+        +D++ S+L RLS+ + +  +   SL G++    +VK N    +N Q+ +LV +  S++ M    DA  TPGVN QRLS GMTP
Subjt:  VQWKHTEGEL----HTNLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTP

Query:  KTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE
        KT RLPK GEM++S+HGSPLGVYKED NM AI+E
Subjt:  KTRRLPKPGEMLVSIHGSPLGVYKED-NMEAIHE


Sequences
CDS sequence
ATGGGAAGGCGAAAGGCGAAGAGAACTGTCAACAAATCTGAAGAAAAATCGAAGGATGTAGTGGAAAATGAGTTAGAAACTGAGGAGCAAGCACCTTTGGTGTCCGATGA
GGATGTTGAGCGGCATGCTGCTGCAATTCGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGAATTGCGGCTGCTTCGTTCGTATTTCAACAAAGAGCATTTGC
AAACTCCTCTATTGCAATTTTTTGAGGAAAATCTTCCAAACTTGAGCATCTCAGGTGGAGGCGAACTACGGGACATTGAAGTACAATGGAAACATACCGAGGGTGAATTA
CACACTAACCTGGATATACATGCTTCTCTTCTGCATCGCCTGTCCATAGCTTATCCTAACTGCTCTGCCGGAATGCGATCTTTAAATGGATTTGAATTTTCTAGTAAATC
AGTGAAAACAAATGCTTTCAATGTTGAGAACCTACAAATTCCGAGCTTGGTTCTGGAAGAGCCCTCTGATAGTGTGATGCATGGAATGCCAGATGCTCTACACACTCCTG
GGGTAAATAACCAAAGATTATCCATTGGGATGACACCGAAAACCCGAAGGCTACCGAAGCCCGGTGAGATGCTTGTGTCTATTCATGGATCCCCCCTTGGTGTTTACAAG
GAAGATAACATGGAAGCAATACATGAATCAGAGGAGGGTTGA
mRNA sequence
CCATATTCAATCTACTGATACAGACTCATGTTCTGCGTAGCATTATTCAATTCAAACTAATCTCTTTCCCGCCGCCCAATCGCTCAAATCTAACACTGCTTCTCCGCCGG
AGCTGAATCGGCGAACTCGTAGCATTTTGCCGGCGTAGTGTAATTAATTCATTCTCCTCCCCGGCGGCGACCAATTATTGATCGAAAATCAGAATGTTGCAATATGTGTA
GTCTGGTAAAGTCCAGAGAGATTTGTTGCTGAACATGGGAAGGCGAAAGGCGAAGAGAACTGTCAACAAATCTGAAGAAAAATCGAAGGATGTAGTGGAAAATGAGTTAG
AAACTGAGGAGCAAGCACCTTTGGTGTCCGATGAGGATGTTGAGCGGCATGCTGCTGCAATTCGTGCCATTCGGGATGTGGAGATCGAGCGTTTGATTACTGAATTGCGG
CTGCTTCGTTCGTATTTCAACAAAGAGCATTTGCAAACTCCTCTATTGCAATTTTTTGAGGAAAATCTTCCAAACTTGAGCATCTCAGGTGGAGGCGAACTACGGGACAT
TGAAGTACAATGGAAACATACCGAGGGTGAATTACACACTAACCTGGATATACATGCTTCTCTTCTGCATCGCCTGTCCATAGCTTATCCTAACTGCTCTGCCGGAATGC
GATCTTTAAATGGATTTGAATTTTCTAGTAAATCAGTGAAAACAAATGCTTTCAATGTTGAGAACCTACAAATTCCGAGCTTGGTTCTGGAAGAGCCCTCTGATAGTGTG
ATGCATGGAATGCCAGATGCTCTACACACTCCTGGGGTAAATAACCAAAGATTATCCATTGGGATGACACCGAAAACCCGAAGGCTACCGAAGCCCGGTGAGATGCTTGT
GTCTATTCATGGATCCCCCCTTGGTGTTTACAAGGAAGATAACATGGAAGCAATACATGAATCAGAGGAGGGTTGATGAGCTGATCAAGGCCACTTAACACCTTGTAAAT
GTAAACTTCAAATGGAATTATCAACCTATATGTATCTAACATTTTGATTAGTACTTACTAAGACTAAGAGCTACTCTTATGTGTATAGCTAAGAATTAGTGTAACTTTAC
TTGAAAGAAAGCGAGTATGTACAATAGTTTCACTTAATTATTGTGTTTGATTATTTGTTGTCAAAGTTGATGGTGATATTGATCAAAGGCGAGTCACTGCCTACATCAAC
TTTATTTTATTACATTGAGAATCTGGCATAACAAGTTTCTCTTCATTCTTGATTGCTGTTTGTTTTCAAATATATTTTTTATGATTACT
Protein sequence
MGRRKAKRTVNKSEEKSKDVVENELETEEQAPLVSDEDVERHAAAIRAIRDVEIERLITELRLLRSYFNKEHLQTPLLQFFEENLPNLSISGGGELRDIEVQWKHTEGEL
HTNLDIHASLLHRLSIAYPNCSAGMRSLNGFEFSSKSVKTNAFNVENLQIPSLVLEEPSDSVMHGMPDALHTPGVNNQRLSIGMTPKTRRLPKPGEMLVSIHGSPLGVYK
EDNMEAIHESEEG
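
The protein sequence appears to be the conceptual translation of the CDS sequence listed earlier in this record. As a small consistency check, the sketch below translates a nucleotide string with the standard genetic code; `translate` and `CODON_TABLE` are hypothetical names, and the use of the standard table (rather than an alternative code) is an assumption.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (third base varying fastest). Assumed to apply to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Final line of the CDS sequence in this record (in frame, ends with TGA).
cds_tail = "GAAGATAACATGGAAGCAATACATGAATCAGAGGAGGGTTGA"
print(translate(cds_tail))  # EDNMEAIHESEEG*
```

The output matches the final line of the protein sequence, followed by the stop codon. The same function can be applied to the full CDS after joining its lines into one string.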