; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g07970 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g07970
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr9:6346291..6359065
RNA-Seq ExpressionMoc09g07970
SyntenyMoc09g07970
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]1.1e-3841.73Show/hide
Query:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF
        MT+ NAT+ P AL+YVVG  TSK++W  L K YSS+S++N+VNLKS+LQ I KKS E I  +I+ +KEIKDKLAN+  VV+DEDL+IY LNGL  EYNTF
Subjt:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF

Query:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCND---------AFSELVLNRGIPTSEIWETGAGMEMASPRPVAI-------------PNLPAVNSNS
        RTS+RTRS PV F++LHVLL +EE A+ KQSK +D         A S+ +++     +  +  G G         +                L   N++S
Subjt:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCND---------AFSELVLNRGIPTSEIWETGAGMEMASPRPVAI-------------PNLPAVNSNS

Query:  SWWVD--------SCFN----------------AHVTTDSSQFTNATN--------ATEYSGEDHISVGSGQSLPISH
        S  +          CFN                A V + ++ F +  N        A+ Y GE+ + VGSGQSLPISH
Subjt:  SWWVD--------SCFN----------------AHVTTDSSQFTNATN--------ATEYSGEDHISVGSGQSLPISH

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-3841.73Show/hide
Query:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF
        MT+ NAT+ P AL+YVVG  TSK++W  L K YSS+S++N+VNLKS+LQ I KKS E I  +I+ +KEIKDKLAN+  VV+DEDL+IY LNGL  EYNTF
Subjt:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF

Query:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCND---------AFSELVLNRGIPTSEIWETGAGMEMASPRPVAI-------------PNLPAVNSNS
        RTS+RTRS PV F++LHVLL +EE A+ KQSK +D         A S+ +++     +  +  G G         +                L   N++S
Subjt:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCND---------AFSELVLNRGIPTSEIWETGAGMEMASPRPVAI-------------PNLPAVNSNS

Query:  SWWVD--------SCFN----------------AHVTTDSSQFTNATN--------ATEYSGEDHISVGSGQSLPISH
        S  +          CFN                A V + ++ F +  N        A+ Y GE+ + VGSGQSLPISH
Subjt:  SWWVD--------SCFN----------------AHVTTDSSQFTNATN--------ATEYSGEDHISVGSGQSLPISH

XP_022154561.1 uncharacterized protein LOC111021802 [Momordica charantia]3.9e-3938.87Show/hide
Query:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------
        RE+E+ST +TISFNLF  ++SF R +F +IS LKY R+PVR++T P+RL  LYFND  D++L++FEK+Y AA FEDD+D +                   
Subjt:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------

Query:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGAN------------------------
                                   KTI SL+RGP K SKDG  +KSY+ YGFPW FQVWAY TISSLS R AN                        
Subjt:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGAN------------------------

Query:  --------NGITRSIEEMEAETTFLDSAFEPLEPEDENEGLCESDNAEPSSARAGSEKDDGGQVANIDEGVREDDHVKAEEKR
                 G TR+++E + ET+FL+ +F+P   +D++      DNA PS+ R GS+ DD  + A++ E V +D  ++ EE +
Subjt:  --------NGITRSIEEMEAETTFLDSAFEPLEPEDENEGLCESDNAEPSSARAGSEKDDGGQVANIDEGVREDDHVKAEEKR

XP_022157455.1 uncharacterized protein LOC111024149 [Momordica charantia]1.1e-3863.31Show/hide
Query:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF
        MTL NAT+ P A +YVVG  +SKEIW TL KHYSS+S+TN+VNLKS+LQ+I KK  E I  +++ +KE+KDKL N+ VVVDDEDL+IYTLNGL + YN F
Subjt:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF

Query:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDAFSE
        RTS+RTRS  V FD+LHVL+ SEEVA+++Q K +D FS+
Subjt:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDAFSE

XP_022157998.1 uncharacterized protein LOC111024595 [Momordica charantia]1.2e-4051.27Show/hide
Query:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------
        RE+E+ST DTISFNLF +KVSFGRREFDIIS LKY RSPVRK T P RLR LYFN+S D+LL+E EK+Y +  FEDDFDA+                   
Subjt:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------

Query:  -----------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPLEPEDENE
                         KTIYSL+RG +K+SK+GG +KSY+ +GFPW FQVWAY+TISSLSGR A  G+     E +     +D     +E   ENE
Subjt:  -----------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPLEPEDENE

TrEMBL top hitse value%identityAlignment
A0A6J1D9L6 uncharacterized protein LOC1110188928.7e-3737.42Show/hide
Query:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF
        MTL NAT+   AL+YVV   TSK++W  L KHYSSNS+TN+VNLKS+LQ+I+KK+ E I  +++ +KEIKDK AN+ + ++DE L+IY LNGL  EYNT 
Subjt:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF

Query:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDA-----------------------------------------FSELVLNRGI-PTSEIWETGAGM
         TS+RTR+  V F++LHV + SEE AI KQ K  D                                          F+    N+G   +S  + T    
Subjt:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDA-----------------------------------------FSELVLNRGI-PTSEIWETGAGM

Query:  EMASP--------------------------RPVAIPNLPAVNSNS----------SWWVDSCFNAHVTTDSSQFTNATNATEYSGEDHISVGSGQSLPI
        +  SP                           P  +  + AV +NS          +W  DS  N H+T D S  + A+ A++Y+GE++ISVGSGQS PI
Subjt:  EMASP--------------------------RPVAIPNLPAVNSNS----------SWWVDSCFNAHVTTDSSQFTNATNATEYSGEDHISVGSGQSLPI

Query:  SH
        +H
Subjt:  SH

A0A6J1DP34 uncharacterized protein LOC1110218021.9e-3938.87Show/hide
Query:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------
        RE+E+ST +TISFNLF  ++SF R +F +IS LKY R+PVR++T P+RL  LYFND  D++L++FEK+Y AA FEDD+D +                   
Subjt:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------

Query:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGAN------------------------
                                   KTI SL+RGP K SKDG  +KSY+ YGFPW FQVWAY TISSLS R AN                        
Subjt:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGAN------------------------

Query:  --------NGITRSIEEMEAETTFLDSAFEPLEPEDENEGLCESDNAEPSSARAGSEKDDGGQVANIDEGVREDDHVKAEEKR
                 G TR+++E + ET+FL+ +F+P   +D++      DNA PS+ R GS+ DD  + A++ E V +D  ++ EE +
Subjt:  --------NGITRSIEEMEAETTFLDSAFEPLEPEDENEGLCESDNAEPSSARAGSEKDDGGQVANIDEGVREDDHVKAEEKR

A0A6J1DQC8 uncharacterized protein LOC1110233536.1e-3843.72Show/hide
Query:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------
        RE+EDST +TISFNLFG +VSFGRREFD+IS L YDRSPVRK T  ++LR LYFND  + +L++F K+Y+AALF+DDFD I                   
Subjt:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------

Query:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPL
                                   KTI SL RGPT  +KD G +KSY+ YGFPW FQVW Y+              TR +E  +AET F+   FEP 
Subjt:  ---------------------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPL

Query:  EPEDENEGLCESDNAEPSSARAGSEKDDGGQ
        EPED++       +A PS+ R G++  D G+
Subjt:  EPEDENEGLCESDNAEPSSARAGSEKDDGGQ

A0A6J1DT57 uncharacterized protein LOC1110241495.5e-3963.31Show/hide
Query:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF
        MTL NAT+ P A +YVVG  +SKEIW TL KHYSS+S+TN+VNLKS+LQ+I KK  E I  +++ +KE+KDKL N+ VVVDDEDL+IYTLNGL + YN F
Subjt:  MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTF

Query:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDAFSE
        RTS+RTRS  V FD+LHVL+ SEEVA+++Q K +D FS+
Subjt:  RTSIRTRSLPVLFDDLHVLLVSEEVAINKQSKCNDAFSE

A0A6J1DUW1 uncharacterized protein LOC1110245955.9e-4151.27Show/hide
Query:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------
        RE+E+ST DTISFNLF +KVSFGRREFDIIS LKY RSPVRK T P RLR LYFN+S D+LL+E EK+Y +  FEDDFDA+                   
Subjt:  REIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAI-------------------

Query:  -----------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPLEPEDENE
                         KTIYSL+RG +K+SK+GG +KSY+ +GFPW FQVWAY+TISSLSGR A  G+     E +     +D     +E   ENE
Subjt:  -----------------KTIYSLKRGPTKRSKDGGFKKSYNFYGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPLEPEDENE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.7e-0627.19Show/hide
Query:  TSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTFRTSIRTRSLPVLFDDLHVLL
        TS++IW  +   + +N     + L S L+      + +   + + +K++ D L N+ V V D +L++Y LNGL  +++     I+ R     FDD   +L
Subjt:  TSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTFRTSIRTRSLPVLFDDLHVLL

Query:  VSEEVAINKQSKCN
          EE  + +  K N
Subjt:  VSEEVAINKQSKCN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCTTAACAAATGCAACTGTCCCGCCTATAGCCCTATCATATGTTGTTGGCTGCCATACTTCGAAAGAAATTTGGGCAACTCTCGTAAAGCATTACTCCTCAAATTC
ACAAACAAATATCGTGAATCTCAAATCTAATCTGCAAGCTATAATGAAGAAGTCTATAGAAATTATTCATCAGTTTATTCAGCACGTCAAGGAAATTAAAGATAAGTTGG
CGAACATTTATGTTGTTGTTGATGATGAAGACCTAATCATTTACACCTTGAATGGTCTCCGTGCTGAATACAATACCTTTAGAACTTCTATCCGCACTCGATCTTTACCA
GTTTTGTTTGATGACCTTCACGTTCTTCTTGTATCAGAAGAAGTTGCAATTAACAAGCAGTCTAAATGCAATGATGCCTTTTCGGAACTCGTCCTCAATCGAGGAATCCC
TACCTCCGAAATTTGGGAGACGGGGGCAGGGATGGAGATGGCCTCGCCCCGCCCCGTTGCCATCCCTAACCTTCCTGCTGTCAATTCTAATTCTTCATGGTGGGTAGATT
CTTGCTTCAATGCTCATGTCACTACCGACTCAAGTCAGTTCACCAATGCTACTAATGCTACTGAATATTCTGGGGAAGATCATATTAGTGTAGGGAGTGGTCAATCTCTT
CCAATCTCTCACAGGGAGATTGAGGATAGTACATCTGACACCATTAGCTTCAACTTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGAGAGTTTGACATTATTTCTCGCCT
TAAATATGATAGGAGTCCAGTTAGAAAAGACACATCTCCCTATAGACTTAGGGCTCTCTACTTTAATGATAGCAAAGACATACTGTTGAATGAGTTTGAGAAGATTTATG
TAGCCGCACTGTTCGAGGATGACTTCGACGCCATCAAGACTATATATAGTCTAAAGCGTGGCCCGACGAAGAGGTCGAAGGATGGCGGGTTCAAGAAATCGTACAATTTC
TACGGTTTCCCTTGGGCGTTCCAGGTGTGGGCGTACAAGACTATATCTTCCCTGTCTGGGCGTGGGGCCAATAATGGAATAACTCGGTCAATAGAAGAAATGGAGGCTGA
GACGACCTTCTTAGATAGTGCGTTCGAACCACTCGAGCCCGAAGATGAGAACGAAGGTCTGTGTGAAAGCGATAATGCTGAACCATCGAGTGCACGTGCAGGATCAGAAA
AGGACGATGGAGGACAAGTAGCGAACATCGATGAAGGTGTCAGAGAAGACGACCATGTCAAGGCTGAGGAGAAAAGGAGAATGGTAGTGGGTTTACCTATCGACCCGAAT
GACATAAGAAGAGGTAGCAGCGGAGATGGTACTGGGCGAGGAGATGATCCTAGTGATGCATCCAGTGACAGACCCAGTGGCACCGATGGTGGTCAGAGTGATGGTCCTAA
GGATGGTGATGGTTCTACACCATCTGCTATGGATGGGGAAGTCACAGATGACATGATCATTGATCCCCCGAATGTGAACGTGTACGGGATAGAAGCACTTCATTCCAGTC
CAACTATTGGGGAACAGGCTCAACAAGACCTTCCTGCATTTCAGACACCAGAACGCGAGCATGCACTTGCCTTATCGCGTAAGGAAGACATGGGTACAGAGGACGTGCAT
AAAGAGAGTATGGAAACCGGTTTAAATGCGCATTGCGAGGTAGTCCCTCTCGAGGAGATTCTTGTTCAGAGCACACCGGTTGACCATATTATGATCGATTCACAGTCGTT
AGAGTCATCCATGGACGATGAGGATGAATATGCAGAGGATTTCACGGACTCTGATGCGGAAGGGCCAGGAGAAGTAAAATCTCAGACTACTGAGCATACATCCGAGACAT
TGCATCCAGACCTGGATGAGGCACGTGTGCTATCACAGCTCGTTGAACGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACCTTAACAAATGCAACTGTCCCGCCTATAGCCCTATCATATGTTGTTGGCTGCCATACTTCGAAAGAAATTTGGGCAACTCTCGTAAAGCATTACTCCTCAAATTC
ACAAACAAATATCGTGAATCTCAAATCTAATCTGCAAGCTATAATGAAGAAGTCTATAGAAATTATTCATCAGTTTATTCAGCACGTCAAGGAAATTAAAGATAAGTTGG
CGAACATTTATGTTGTTGTTGATGATGAAGACCTAATCATTTACACCTTGAATGGTCTCCGTGCTGAATACAATACCTTTAGAACTTCTATCCGCACTCGATCTTTACCA
GTTTTGTTTGATGACCTTCACGTTCTTCTTGTATCAGAAGAAGTTGCAATTAACAAGCAGTCTAAATGCAATGATGCCTTTTCGGAACTCGTCCTCAATCGAGGAATCCC
TACCTCCGAAATTTGGGAGACGGGGGCAGGGATGGAGATGGCCTCGCCCCGCCCCGTTGCCATCCCTAACCTTCCTGCTGTCAATTCTAATTCTTCATGGTGGGTAGATT
CTTGCTTCAATGCTCATGTCACTACCGACTCAAGTCAGTTCACCAATGCTACTAATGCTACTGAATATTCTGGGGAAGATCATATTAGTGTAGGGAGTGGTCAATCTCTT
CCAATCTCTCACAGGGAGATTGAGGATAGTACATCTGACACCATTAGCTTCAACTTGTTTGGGAGTAAGGTGTCATTTGGGCGGAGAGAGTTTGACATTATTTCTCGCCT
TAAATATGATAGGAGTCCAGTTAGAAAAGACACATCTCCCTATAGACTTAGGGCTCTCTACTTTAATGATAGCAAAGACATACTGTTGAATGAGTTTGAGAAGATTTATG
TAGCCGCACTGTTCGAGGATGACTTCGACGCCATCAAGACTATATATAGTCTAAAGCGTGGCCCGACGAAGAGGTCGAAGGATGGCGGGTTCAAGAAATCGTACAATTTC
TACGGTTTCCCTTGGGCGTTCCAGGTGTGGGCGTACAAGACTATATCTTCCCTGTCTGGGCGTGGGGCCAATAATGGAATAACTCGGTCAATAGAAGAAATGGAGGCTGA
GACGACCTTCTTAGATAGTGCGTTCGAACCACTCGAGCCCGAAGATGAGAACGAAGGTCTGTGTGAAAGCGATAATGCTGAACCATCGAGTGCACGTGCAGGATCAGAAA
AGGACGATGGAGGACAAGTAGCGAACATCGATGAAGGTGTCAGAGAAGACGACCATGTCAAGGCTGAGGAGAAAAGGAGAATGGTAGTGGGTTTACCTATCGACCCGAAT
GACATAAGAAGAGGTAGCAGCGGAGATGGTACTGGGCGAGGAGATGATCCTAGTGATGCATCCAGTGACAGACCCAGTGGCACCGATGGTGGTCAGAGTGATGGTCCTAA
GGATGGTGATGGTTCTACACCATCTGCTATGGATGGGGAAGTCACAGATGACATGATCATTGATCCCCCGAATGTGAACGTGTACGGGATAGAAGCACTTCATTCCAGTC
CAACTATTGGGGAACAGGCTCAACAAGACCTTCCTGCATTTCAGACACCAGAACGCGAGCATGCACTTGCCTTATCGCGTAAGGAAGACATGGGTACAGAGGACGTGCAT
AAAGAGAGTATGGAAACCGGTTTAAATGCGCATTGCGAGGTAGTCCCTCTCGAGGAGATTCTTGTTCAGAGCACACCGGTTGACCATATTATGATCGATTCACAGTCGTT
AGAGTCATCCATGGACGATGAGGATGAATATGCAGAGGATTTCACGGACTCTGATGCGGAAGGGCCAGGAGAAGTAAAATCTCAGACTACTGAGCATACATCCGAGACAT
TGCATCCAGACCTGGATGAGGCACGTGTGCTATCACAGCTCGTTGAACGCTAA
Protein sequenceShow/hide protein sequence
MTLTNATVPPIALSYVVGCHTSKEIWATLVKHYSSNSQTNIVNLKSNLQAIMKKSIEIIHQFIQHVKEIKDKLANIYVVVDDEDLIIYTLNGLRAEYNTFRTSIRTRSLP
VLFDDLHVLLVSEEVAINKQSKCNDAFSELVLNRGIPTSEIWETGAGMEMASPRPVAIPNLPAVNSNSSWWVDSCFNAHVTTDSSQFTNATNATEYSGEDHISVGSGQSL
PISHREIEDSTSDTISFNLFGSKVSFGRREFDIISRLKYDRSPVRKDTSPYRLRALYFNDSKDILLNEFEKIYVAALFEDDFDAIKTIYSLKRGPTKRSKDGGFKKSYNF
YGFPWAFQVWAYKTISSLSGRGANNGITRSIEEMEAETTFLDSAFEPLEPEDENEGLCESDNAEPSSARAGSEKDDGGQVANIDEGVREDDHVKAEEKRRMVVGLPIDPN
DIRRGSSGDGTGRGDDPSDASSDRPSGTDGGQSDGPKDGDGSTPSAMDGEVTDDMIIDPPNVNVYGIEALHSSPTIGEQAQQDLPAFQTPEREHALALSRKEDMGTEDVH
KESMETGLNAHCEVVPLEEILVQSTPVDHIMIDSQSLESSMDDEDEYAEDFTDSDAEGPGEVKSQTTEHTSETLHPDLDEARVLSQLVER