; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027966 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027966
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPeptidase_S9 domain-containing protein
Genome locationtig00153056:1998099..2009358
RNA-Seq ExpressionSgr027966
SyntenySgr027966
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008236 - serine-type peptidase activity (molecular function)
InterPro domainsIPR001375 - Peptidase S9, prolyl oligopeptidase, catalytic domain
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591741.1 hypothetical protein SDJN03_14087, partial [Cucurbita argyrosperma subsp. sororia]3.1e-16379.09Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ
        M EA+VDA KFR EF RVLRSRRS +   N       +  +PP+ S                    K+M SCPKA FSNLKDLLHEENLHLTTE  EQG+
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ

Query:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
        LPILII MKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY DALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
Subjt:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED

Query:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA
        IDP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EINKEVV+KVWNRIAPGL SQF SIYSVPA
Subjt:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA

Query:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR
        IAPRPLLLLNGADDPRCPI GLDAP+S+ Q AY+K GC ENFKFIAQPGIGHEMTPEMVKE S WFD    HR
Subjt:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR

KAG6608085.1 hypothetical protein SDJN03_01427, partial [Cucurbita argyrosperma subsp. sororia]1.8e-16379.51Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEALVDA KFR EF RVLR RRS  G         K    P      PP L             K+M SCPK + SNLKDLLHEENLHL TE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILII MKDS+QQ++PAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDAL+SSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRYAVVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EINKEVV KVWNRIAPGLDS+FDSIYSVPA+
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCPI GLDA ++R QT Y+  GC ENFKFIAQPGIGH MTPEMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

XP_022153900.1 uncharacterized protein LOC111021276 [Momordica charantia]1.9e-16881.42Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEALVDA KFR+EF RVLRSRRS +             +P S     P + H  + + +  T  K+M SCPK NFSNLKDLLHEENL+LTTE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILII MKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK+KTTYRDALISSWKRGDTMPFIFDT WDLIKLADYLT+RED+
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRYAVVVP+IGVQ FRWA+D+D WQARVESIKPVFEEARI+LG+SEINKE+VEKVWNRIAPGL SQF SIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCPIGGLDAP+SR QTAYRK GC +NFKFIAQPGIGHEMTPEMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

XP_022936567.1 uncharacterized protein LOC111443135 isoform X1 [Cucurbita moschata]4.0e-16379.09Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ
        M EA+VDA KFR EF RVLRSRRS +   N       +  +PP+ S                    K+M SCPKA FSNLKDLLHEENLHLTTE  EQG+
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ

Query:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
        LPILII MKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY DALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
Subjt:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED

Query:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA
        IDP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EI+KEVV+KVWNRIAPGL SQF SIYSVPA
Subjt:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA

Query:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR
        IAPRPLLLLNGADDPRCPI GLDAP+SR Q AY+K GC ENFKFIAQPGIGHEMTPEMVKE S WFD    HR
Subjt:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR

XP_038898085.1 putative esterase YitV [Benincasa hispida]5.1e-16680.05Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEA+VDA KFR EF  VLRSRRS +               P +     P L+  + + +  T  ++M SCPKA FSNLKDLLHEENLHLTTE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        P+LII MKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EINKEVV+KVWNRIAPGLDSQFDSIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGA+DPRCPI GLDAP+SR QTAY+K GC ENFKFI QP IGH+MT EMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

TrEMBL top hitse value%identityAlignment
A0A1S3CRR2 putative esterase YitV2.6e-16378.14Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEA+VDA KFR EF RVLR+RRS +               P +     P L+  + +    T  K+M SCPK    NLKDLLHEENLHLTTE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILI+ MK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KTTYRDALIS+WK+GDTMPFIFDTVWDLIKLADYLT+REDI
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ F WA+DND WQARV+SIKPVFEEARIDLGM+EINKEVV+KVWNRIAPGLDSQFDSIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCP+ GLDAP+SR+QTAY+K GC ENFKFIAQ GIGHEMT EMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

A0A5A7TAI4 Putative esterase YitV2.6e-16378.14Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEA+VDA KFR EF RVLR+RRS +               P +     P L+  + +    T  K+M SCPK    NLKDLLHEENLHLTTE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILI+ MK+SRQQ+RP IVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK KTTYRDALIS+WK+GDTMPFIFDTVWDLIKLADYLT+REDI
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ F WA+DND WQARV+SIKPVFEEARIDLGM+EINKEVV+KVWNRIAPGLDSQFDSIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCP+ GLDAP+SR+QTAY+K GC ENFKFIAQ GIGHEMT EMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

A0A6J1DI56 uncharacterized protein LOC1110212769.1e-16981.42Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        MEEALVDA KFR+EF RVLRSRRS +             +P S     P + H  + + +  T  K+M SCPK NFSNLKDLLHEENL+LTTE  EQGQL
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILII MKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAK+KTTYRDALISSWKRGDTMPFIFDT WDLIKLADYLT+RED+
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRYAVVVP+IGVQ FRWA+D+D WQARVESIKPVFEEARI+LG+SEINKE+VEKVWNRIAPGL SQF SIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCPIGGLDAP+SR QTAYRK GC +NFKFIAQPGIGHEMTPEMVKE SDWFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

A0A6J1F8T3 uncharacterized protein LOC111443135 isoform X12.0e-16379.09Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ
        M EA+VDA KFR EF RVLRSRRS +   N       +  +PP+ S                    K+M SCPKA FSNLKDLLHEENLHLTTE  EQG+
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSAD-GANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQ

Query:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
        LPILII MKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTY DALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED
Subjt:  LPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKRED

Query:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA
        IDP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EI+KEVV+KVWNRIAPGL SQF SIYSVPA
Subjt:  IDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPA

Query:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR
        IAPRPLLLLNGADDPRCPI GLDAP+SR Q AY+K GC ENFKFIAQPGIGHEMTPEMVKE S WFD    HR
Subjt:  IAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHR

A0A6J1IHS0 uncharacterized protein LOC111476405 isoform X15.7e-16378.96Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL
        M+EA+VDA KFR EF RVLRSRRS +               P +    PP  H  + + +  T  K+M SCPKA FSNLKDLLHEENLHLTTE  EQG+L
Subjt:  MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQL

Query:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI
        PILII MKDSRQQ+RPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNK+TY DALISSWK+GDTMPFIFDTVWDLIKLADYLTKREDI
Subjt:  PILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDI

Query:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI
        DP R+GITGESLGGMHAWFAAAADTRY+VVVP+IGVQ FRWA+DND WQARVESIKPVFEEARI+LGM+EIN EVV+KVWNRIAPGL SQF SIYSVPAI
Subjt:  DPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAI

Query:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        APRPLLLLNGADDPRCPI GLDA +S  Q AY++ GC ENFKFIAQPGIGHEMTPEMVKE S WFD
Subjt:  APRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

SwissProt top hitse value%identityAlignment
O34973 Putative hydrolase YtaP3.1e-0924.43Show/hide
Query:  SRGYVAIAIDSRYHGER--AKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQ
        S GY  +AID    G+R        +++ L++       M      ++D +   DY+  R D+ P R+G  G S+GG+ AW+ AA D R  V V +    
Subjt:  SRGYVAIAIDSRYHGER--AKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQ

Query:  GFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWN-------RIAPGLDSQFDSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQT
                                       S+++  V+ K  N          P L   F +      IAPRP L L G  D   P  G+D     L  
Subjt:  GFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWN-------RIAPGLDSQFDSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQT

Query:  AYRKAGCSENFKFIAQPGIGH
         Y   G ++ ++ + +   GH
Subjt:  AYRKAGCSENFKFIAQPGIGH

P29368 Uncharacterized 31.7 kDa protein in traX-finO intergenic region8.0e-0529.41Show/hide
Query:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESL
        + P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  K+E ID  R+G+ G SL
Subjt:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESL

Query:  GGMHAWFAAAADTRYAVVV
        GG H + A A D R   +V
Subjt:  GGMHAWFAAAADTRYAVVV

Q99390 Uncharacterized 31.7 kDa protein in traX-finO intergenic region2.1e-0530.25Show/hide
Query:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESL
        + P I+  H     +  L P    A+   G+  I  D R  GE             S  +RG  +P +     D+I + ++  K+E ID  R+G+ G SL
Subjt:  RRPAIVFLHSTNKCKEWLRP-LLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESL

Query:  GGMHAWFAAAADTRYAVVV
        GG H + AAA D R   +V
Subjt:  GGMHAWFAAAADTRYAVVV

Arabidopsis top hitse value%identityAlignment
AT5G25770.1 alpha/beta-Hydrolases superfamily protein5.7e-13161.56Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA
        ME  +     FR +F R+L SRRS      AD +N  E    +A                   DV   T ++   SCPK N   LKD+L EEN+HL TE 
Subjt:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA

Query:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL
         EQG+LP+LI+ +K+  +++RPAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDTVWDLIKLA+YL
Subjt:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL

Query:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI
        T+R+DIDP ++GITG SLGGMHAWFAAAADTRY+VVVP+IGVQGFRWA++ND W+ARV SIKP+FEEARIDLG + I+KE+VEKVWNRIAPGL S+FDS 
Subjt:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI

Query:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        YS+P IAPRPL +LNGA+DPRCP+GGL+  + R + AY++     NFKF A+ G+GHE T  M+KE SDWFD
Subjt:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

AT5G25770.2 alpha/beta-Hydrolases superfamily protein5.7e-13161.56Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA
        ME  +     FR +F R+L SRRS      AD +N  E    +A                   DV   T ++   SCPK N   LKD+L EEN+HL TE 
Subjt:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA

Query:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL
         EQG+LP+LI+ +K+  +++RPAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDTVWDLIKLA+YL
Subjt:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL

Query:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI
        T+R+DIDP ++GITG SLGGMHAWFAAAADTRY+VVVP+IGVQGFRWA++ND W+ARV SIKP+FEEARIDLG + I+KE+VEKVWNRIAPGL S+FDS 
Subjt:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI

Query:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        YS+P IAPRPL +LNGA+DPRCP+GGL+  + R + AY++     NFKF A+ G+GHE T  M+KE SDWFD
Subjt:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD

AT5G25770.3 alpha/beta-Hydrolases superfamily protein5.7e-13161.56Show/hide
Query:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA
        ME  +     FR +F R+L SRRS      AD +N  E    +A                   DV   T ++   SCPK N   LKD+L EEN+HL TE 
Subjt:  MEEALVDAGKFRTEFFRVLRSRRS------ADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEA

Query:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL
         EQG+LP+LI+ +K+  +++RPAIVF+H TN  KEWLRP LEAYASRGYVAI +DSRYHGERA  KT YRDALISSW+ G+TMPFIFDTVWDLIKLA+YL
Subjt:  REQGQLPILIIRMKDSRQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYL

Query:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI
        T+R+DIDP ++GITG SLGGMHAWFAAAADTRY+VVVP+IGVQGFRWA++ND W+ARV SIKP+FEEARIDLG + I+KE+VEKVWNRIAPGL S+FDS 
Subjt:  TKREDIDPYRLGITGESLGGMHAWFAAAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSI

Query:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD
        YS+P IAPRPL +LNGA+DPRCP+GGL+  + R + AY++     NFKF A+ G+GHE T  M+KE SDWFD
Subjt:  YSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQTAYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAGCTCTCGTTGACGCCGGCAAGTTTCGGACTGAATTCTTTCGAGTTCTGCGTAGCAGACGATCTGCAGATGGTGCGAATTCCGCTGAGTTTGAAGCCCGCAA
AGCCTGTTCACCACCCTCTGATTCAGGAGGCCAGCCCCCCAACCTTCATAGTAGCCTACACGATGTTGATTATTTTACCTGTTTGAAGTCCATGGGATCTTGTCCGAAGG
CAAATTTTAGCAATTTGAAGGACTTGCTTCATGAGGAAAATCTTCATCTGACTACGGAGGCAAGAGAGCAAGGCCAGTTGCCCATATTGATTATAAGAATGAAGGATAGC
AGACAGCAAAGAAGACCTGCAATTGTTTTTCTGCACAGTACAAACAAGTGCAAAGAGTGGTTGAGACCATTGCTTGAGGCTTATGCATCGAGGGGATATGTAGCCATTGC
CATTGATTCTCGTTACCATGGTGAAAGGGCCAAGAACAAAACCACCTACCGTGATGCTCTTATATCTTCATGGAAAAGAGGTGATACCATGCCGTTCATATTTGACACGG
TATGGGACTTGATAAAATTAGCAGATTATCTGACGAAAAGGGAGGACATTGATCCATATAGATTAGGAATTACTGGTGAATCACTTGGAGGAATGCATGCATGGTTTGCT
GCTGCTGCTGACACCCGCTATGCTGTGGTTGTCCCCGTAATTGGTGTCCAGGGTTTTCGATGGGCCATGGATAATGATATGTGGCAGGCACGAGTTGAGAGTATAAAACC
TGTTTTTGAGGAAGCACGAATTGATTTAGGCATGAGTGAGATCAACAAAGAAGTTGTGGAGAAGGTCTGGAATAGGATTGCTCCTGGTTTAGATTCCCAATTTGACTCGA
TTTATTCAGTTCCAGCTATTGCCCCACGTCCTTTGTTGTTACTAAATGGTGCAGATGACCCTCGATGTCCAATTGGCGGTTTGGATGCTCCCATTTCAAGATTACAGACA
GCTTATCGGAAGGCTGGTTGTTCAGAAAATTTTAAGTTCATCGCACAACCTGGGATTGGCCACGAAATGACACCAGAGATGGTAAAAGAAGGTAGCGATTGGTTTGACAG
TGGACGTACCCACCGCCAGACAAGCAGCATAGGAATCGGACCAACGAGGGAAAGAATTGCATTGAGAGAAACCGGGGGAGAGAATGGAAGGGAGCCTAAAAAGAGGAAGA
GGAGTGAAATCAATCTTCACCTGCCACCAGCAGTTGCAGGAGTAGGTTTCACGGGCTTGAACTTTGCCAGGTCAACAAGATCACCAAACAGATTATCCTCAGGCTTTGAA
GGCCTGCTGGGAGGCACATTAGACGAGGCAGAAACCTGCCAGAGAATTGGTTGTACATTTGCGAAGGATATAGAACTGCCATCTGCCAGTCCCTGGGGATATACACCACT
CTGCACATGCGTGAAACGACCTGAGTCACCTTCATTGAATTGGGATACTGATCACCTGTTACTGAACCATTATCATCCACAGGCTGAGCCTCCCAAGGTGGCGGCGGGAA
TGACTCGTTGTTTTGAGAACCTGACCAACCACCCAAACCAACTGAAGCATGGTTTTGATCCCCTGGATTGCCGGACTGGGAATCGGCAGCGTCATTGTTTGGTTGCTATC
ATAATACACAGGATCCTGGAGCATCTTGTTGATAATCAGGATTGTGCAGATTCTGATTATATAATCCTGCGTGTTGCTGTTGCCTTTGTGGAGTATAATATCATCACATT
ATTACTGATCATACCAATAATTCTTGATAAGCAGCATAATATTGTGGATATCGTCCGGTTGGTCCTCCTAAAGCCCCTTGCCAGAGCTACAATATATACACACACAGATA
TGCCGATAGCAAAATATACACAGCAGATGCAAAAGGTGAAATGTGTTGAAGACTTGAAGCCACAAGGCAGTTTTCGAGCAAGCAAACAAACTCCTACAACTGCCGATAAT
CAAACTCGAAAGCGGAACAAAGTATTCCTTCGAAAAATCATAAAAAACACCAACACACGTGAAGAAGTAATGAAACAGCAACCAAGAAAAACTGCAAACAGAGGAGAGGG
AGAGTGTAATCCAGTATATCACGCGTACCCAAGATCTCTGTTCATCATATCGCATATCTCCATGTTCGCGGCCCAGTCAGCACCGATCAGCATATCGCTCGTGGCACGTT
CAACCATCGGATTCACCATGTCGGCGACTACCAAAACGACTACCCGAACAGAAGAGAAAAATTCCAACTTCCGAAAGGCGACAAACCTTAAAATCGTTCTGCAACTGCAA
GCTCCTAAACTATTCGAACCGTCCAACAGTATCAGTCTCGCTCTCTCCAAGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAGCTCTCGTTGACGCCGGCAAGTTTCGGACTGAATTCTTTCGAGTTCTGCGTAGCAGACGATCTGCAGATGGTGCGAATTCCGCTGAGTTTGAAGCCCGCAA
AGCCTGTTCACCACCCTCTGATTCAGGAGGCCAGCCCCCCAACCTTCATAGTAGCCTACACGATGTTGATTATTTTACCTGTTTGAAGTCCATGGGATCTTGTCCGAAGG
CAAATTTTAGCAATTTGAAGGACTTGCTTCATGAGGAAAATCTTCATCTGACTACGGAGGCAAGAGAGCAAGGCCAGTTGCCCATATTGATTATAAGAATGAAGGATAGC
AGACAGCAAAGAAGACCTGCAATTGTTTTTCTGCACAGTACAAACAAGTGCAAAGAGTGGTTGAGACCATTGCTTGAGGCTTATGCATCGAGGGGATATGTAGCCATTGC
CATTGATTCTCGTTACCATGGTGAAAGGGCCAAGAACAAAACCACCTACCGTGATGCTCTTATATCTTCATGGAAAAGAGGTGATACCATGCCGTTCATATTTGACACGG
TATGGGACTTGATAAAATTAGCAGATTATCTGACGAAAAGGGAGGACATTGATCCATATAGATTAGGAATTACTGGTGAATCACTTGGAGGAATGCATGCATGGTTTGCT
GCTGCTGCTGACACCCGCTATGCTGTGGTTGTCCCCGTAATTGGTGTCCAGGGTTTTCGATGGGCCATGGATAATGATATGTGGCAGGCACGAGTTGAGAGTATAAAACC
TGTTTTTGAGGAAGCACGAATTGATTTAGGCATGAGTGAGATCAACAAAGAAGTTGTGGAGAAGGTCTGGAATAGGATTGCTCCTGGTTTAGATTCCCAATTTGACTCGA
TTTATTCAGTTCCAGCTATTGCCCCACGTCCTTTGTTGTTACTAAATGGTGCAGATGACCCTCGATGTCCAATTGGCGGTTTGGATGCTCCCATTTCAAGATTACAGACA
GCTTATCGGAAGGCTGGTTGTTCAGAAAATTTTAAGTTCATCGCACAACCTGGGATTGGCCACGAAATGACACCAGAGATGGTAAAAGAAGGTAGCGATTGGTTTGACAG
TGGACGTACCCACCGCCAGACAAGCAGCATAGGAATCGGACCAACGAGGGAAAGAATTGCATTGAGAGAAACCGGGGGAGAGAATGGAAGGGAGCCTAAAAAGAGGAAGA
GGAGTGAAATCAATCTTCACCTGCCACCAGCAGTTGCAGGAGTAGGTTTCACGGGCTTGAACTTTGCCAGGTCAACAAGATCACCAAACAGATTATCCTCAGGCTTTGAA
GGCCTGCTGGGAGGCACATTAGACGAGGCAGAAACCTGCCAGAGAATTGGTTGTACATTTGCGAAGGATATAGAACTGCCATCTGCCAGTCCCTGGGGATATACACCACT
CTGCACATGCGTGAAACGACCTGAGTCACCTTCATTGAATTGGGATACTGATCACCTGTTACTGAACCATTATCATCCACAGGCTGAGCCTCCCAAGGTGGCGGCGGGAA
TGACTCGTTGTTTTGAGAACCTGACCAACCACCCAAACCAACTGAAGCATGGTTTTGATCCCCTGGATTGCCGGACTGGGAATCGGCAGCGTCATTGTTTGGTTGCTATC
ATAATACACAGGATCCTGGAGCATCTTGTTGATAATCAGGATTGTGCAGATTCTGATTATATAATCCTGCGTGTTGCTGTTGCCTTTGTGGAGTATAATATCATCACATT
ATTACTGATCATACCAATAATTCTTGATAAGCAGCATAATATTGTGGATATCGTCCGGTTGGTCCTCCTAAAGCCCCTTGCCAGAGCTACAATATATACACACACAGATA
TGCCGATAGCAAAATATACACAGCAGATGCAAAAGGTGAAATGTGTTGAAGACTTGAAGCCACAAGGCAGTTTTCGAGCAAGCAAACAAACTCCTACAACTGCCGATAAT
CAAACTCGAAAGCGGAACAAAGTATTCCTTCGAAAAATCATAAAAAACACCAACACACGTGAAGAAGTAATGAAACAGCAACCAAGAAAAACTGCAAACAGAGGAGAGGG
AGAGTGTAATCCAGTATATCACGCGTACCCAAGATCTCTGTTCATCATATCGCATATCTCCATGTTCGCGGCCCAGTCAGCACCGATCAGCATATCGCTCGTGGCACGTT
CAACCATCGGATTCACCATGTCGGCGACTACCAAAACGACTACCCGAACAGAAGAGAAAAATTCCAACTTCCGAAAGGCGACAAACCTTAAAATCGTTCTGCAACTGCAA
GCTCCTAAACTATTCGAACCGTCCAACAGTATCAGTCTCGCTCTCTCCAAGTCCTAG
Protein sequenceShow/hide protein sequence
MEEALVDAGKFRTEFFRVLRSRRSADGANSAEFEARKACSPPSDSGGQPPNLHSSLHDVDYFTCLKSMGSCPKANFSNLKDLLHEENLHLTTEAREQGQLPILIIRMKDS
RQQRRPAIVFLHSTNKCKEWLRPLLEAYASRGYVAIAIDSRYHGERAKNKTTYRDALISSWKRGDTMPFIFDTVWDLIKLADYLTKREDIDPYRLGITGESLGGMHAWFA
AAADTRYAVVVPVIGVQGFRWAMDNDMWQARVESIKPVFEEARIDLGMSEINKEVVEKVWNRIAPGLDSQFDSIYSVPAIAPRPLLLLNGADDPRCPIGGLDAPISRLQT
AYRKAGCSENFKFIAQPGIGHEMTPEMVKEGSDWFDSGRTHRQTSSIGIGPTRERIALRETGGENGREPKKRKRSEINLHLPPAVAGVGFTGLNFARSTRSPNRLSSGFE
GLLGGTLDEAETCQRIGCTFAKDIELPSASPWGYTPLCTCVKRPESPSLNWDTDHLLLNHYHPQAEPPKVAAGMTRCFENLTNHPNQLKHGFDPLDCRTGNRQRHCLVAI
IIHRILEHLVDNQDCADSDYIILRVAVAFVEYNIITLLLIIPIILDKQHNIVDIVRLVLLKPLARATIYTHTDMPIAKYTQQMQKVKCVEDLKPQGSFRASKQTPTTADN
QTRKRNKVFLRKIIKNTNTREEVMKQQPRKTANRGEGECNPVYHAYPRSLFIISHISMFAAQSAPISISLVARSTIGFTMSATTKTTTRTEEKNSNFRKATNLKIVLQLQ
APKLFEPSNSISLALSKS