; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008791 (gene) of Snake gourd v1 genome

Gene IDTan0008791
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function DUF829, transmembrane 53
Genome locationLG01:113975556..113978071
RNA-Seq ExpressionTan0008791
SyntenyTan0008791
Gene Ontology termsGO:0005777 - peroxisome (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR008547 - Protein of unknown function DUF829, TMEM53
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004141248.1 uncharacterized protein LOC101212227 [Cucumis sativus]1.1e-22090.69Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGR+YWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+FLNMFFPDKAASLAFDILK L+EELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHE QQHSSD YQLVRDC+AGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRI SW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRL+DLGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRPNEEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSF++ ALAP DHL FSS +DGFDY    SMRDEHMEGVMRLSN+PS+IPHGVLGQILYD CVPKNVEDWDIGSSSSS  VLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

XP_008452531.1 PREDICTED: uncharacterized protein LOC103493530 [Cucumis melo]2.0e-22291.67Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+FLNMFFPDKAASLAFDILK L+EELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHE QQHSSD YQLVRDCIAGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRIVSW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRL++LGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRPNEEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSF++ ALAP+DHL FSS +DGFDY    SMRDEHMEGVMRLSNSPS+IPHGVLGQILYD CVPKNVEDWDIGSSSSS GVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

XP_022982259.1 uncharacterized protein LOC111481139 isoform X1 [Cucurbita maxima]3.5e-21991.67Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVG+VEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHSEFLNMFFPDKAASLAF+ILK LVEELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHEP+++S D YQ+VRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSW AHSIASGLD LFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLG DVKLIKWNGSPHVGHFLHFPIEYRAAVTELL+KAAGVY QRTRPN EVAAVDKMNCDSCKPTPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSFQEPALAPS+HLFFSSM+DG DY G  SM DE MEGV+RLSNSP +IPHG  GQILYDVCVPKNVEDWDI SSSSSNGVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

XP_023535408.1 uncharacterized protein LOC111796852 [Cucurbita pepo subsp. pepo]6.0e-21990.69Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHSEFLNMFFPDKAASLAFDILK LVEELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYK+LQIIEGYHEP QH SD Y+LVR+CI+GYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRP+EEV A+DKMN DS K TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSFQEPALAPSDHL+FSS+VDGFDY G  SM D+HMEGV++L NS ++IPHGVLGQILYDVCVPKNVEDWDIGSSSSSN VLR  TRR +SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

XP_038898412.1 uncharacterized protein LOC120086059 [Benincasa hispida]8.4e-22191.42Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+F+NMFFPDKAASLAFD+LK LVEEL IKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHEPQQHSSD YQLVRDCIAGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRIVSW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLA YQTIFNFAQRL+DLGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVY QRTRP+EEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSFQ+ ALAPSDHL FSS +DGFDY    SMRDEHMEGVMRLSNSPS+IPHGVLGQILYDVC+PKNVEDWDIGSSSSSNGVL  HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

TrEMBL top hitse value%identityAlignment
A0A0A0KZR1 Uncharacterized protein5.3e-22190.69Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGR+YWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+FLNMFFPDKAASLAFDILK L+EELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHE QQHSSD YQLVRDC+AGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRI SW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRL+DLGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRPNEEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSF++ ALAP DHL FSS +DGFDY    SMRDEHMEGVMRLSN+PS+IPHGVLGQILYD CVPKNVEDWDIGSSSSS  VLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

A0A1S3BTG1 uncharacterized protein LOC1034935309.6e-22391.67Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+FLNMFFPDKAASLAFDILK L+EELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHE QQHSSD YQLVRDCIAGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRIVSW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRL++LGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRPNEEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSF++ ALAP+DHL FSS +DGFDY    SMRDEHMEGVMRLSNSPS+IPHGVLGQILYD CVPKNVEDWDIGSSSSS GVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

A0A5D3D9H9 DUF829 domain-containing protein9.6e-22391.67Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHS+FLNMFFPDKAASLAFDILK L+EELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHE QQHSSD YQLVRDCIAGYIYDSSPVDFTSDLGTRF+LHPTV+KASQPPRIVSW AH+IASGLDALFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRL++LGGDVKLIKWNGSPHVGH+LHFPIEYRAAVTELLSKAAGVYCQRTRPNEEV AVDKMNCDSC  TPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSF++ ALAP+DHL FSS +DGFDY    SMRDEHMEGVMRLSNSPS+IPHGVLGQILYD CVPKNVEDWDIGSSSSS GVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

A0A6J1FKU5 uncharacterized protein LOC111446333 isoform X14.2e-21890.93Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVG+VEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHSEFLNMFFPDKAASLAF+ILK LVEELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHEP+++S D YQ+VRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSW AHSIASGLD LFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLG DVKLIKWNGSPHVGHFLHFPIEYRAAVTELL+KAAGVY QRTRPN EVAAVDKMNCDSC PTPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSFQEPALAPS+HLFFSSM+DGFDY G  S  DE MEG MRLSNSP ++P G  GQILYDVCVPKNVEDWDI SSSSSNGVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

A0A6J1J416 uncharacterized protein LOC111481139 isoform X11.7e-21991.67Show/hide
Query:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC
        MWGFGGRYYWGRRERVG+VEGIVVAFAWMSSQERHLKRYV++YSSLGWNSLVCHSEFLNMFFPDKAASLAF+ILK LVEELKIKRCPIVFASFSGGPKAC
Subjt:  MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKAC

Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQIIEGYHEP+++S D YQ+VRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSW AHSIASGLD LFLNRFESHRAEYWQTLYASVS
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLG DVKLIKWNGSPHVGHFLHFPIEYRAAVTELL+KAAGVY QRTRPN EVAAVDKMNCDSCKPTPDVRK
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI
        AAS SSSFQEPALAPS+HLFFSSM+DG DY G  SM DE MEGV+RLSNSP +IPHG  GQILYDVCVPKNVEDWDI SSSSSNGVLR HTRRH+SFNPI
Subjt:  AASLSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPI

Query:  KLMRRSRL
        KLMRRSRL
Subjt:  KLMRRSRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15695.1 Protein of unknown function DUF829, transmembrane 535.8e-10346.46Show/hide
Query:  GGRYYWGRR-----ERVGKVE-----GIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFS
        GGR YWG++     E    V+     G+VV F W S  E  L  +V+LYSSLGWNSLVC ++FL   +P+ A SLAF +L  LVEELK + CP++F +FS
Subjt:  GGRYYWGRR-----ERVGKVE-----GIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFS

Query:  GGPKACMYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQT
        G PKACMYKVLQ+I    E Q H  D+ QLVR C++G++YDS P+DFTSDL  +F LHPT+ + S P R+VSW A  I+SGLD L+L RFES R+EYWQ 
Subjt:  GGPKACMYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQT

Query:  LYASVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVA----AVDKMNCD
        LY+SV + APYLILCSE D+LAP Q I +F  +L++LGG+VK++KW  SPH GH+ H PI+YRA ++  L KA  V+  + R   E A     + ++ CD
Subjt:  LYASVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVA----AVDKMNCD

Query:  SCKPTPDVRKAASLSSSFQEPALAPSDHLFFSSMV---DGFDYMGTASMRDEHMEGVMRLSNSPSSI-PHGVLGQILYDVCVPKNVEDWDIGSSSSSNG-
          K       A + + S +  A  P DH F  S        +    +S ++E  E        P+SI  H VLGQ L+D CVPKN+E WDI  +   NG 
Subjt:  SCKPTPDVRKAASLSSSFQEPALAPSDHLFFSSMV---DGFDYMGTASMRDEHMEGVMRLSNSPSSI-PHGVLGQILYDVCVPKNVEDWDIGSSSSSNG-

Query:  -VLRTHTRRHSSFNPIKLMRRSRL
            + +R++S+    K   RSRL
Subjt:  -VLRTHTRRHSSFNPIKLMRRSRL

AT2G18245.1 alpha/beta-Hydrolases superfamily protein1.5e-0522.53Show/hide
Query:  GFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILK---ALVEELKIK------RCPIVFASF
        G G    +G  E  GK E  VV   W+ ++ +HL+RYVE Y+S G N++    +  ++   D    L   I +    LV  +  K      +C +VF SF
Subjt:  GFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILK---ALVEELKIK------RCPIVFASF

Query:  SGGPKACMYKVLQIIEGYHEPQQHSSDAYQLVRDCI---------------AGY---IYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASG
        S         +L+   G         D  + ++ CI               AG+   I        T++  +            + P  +     S    
Subjt:  SGGPKACMYKVLQIIEGYHEPQQHSSDAYQLVRDCI---------------AGY---IYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASG

Query:  LDALFLNR--FESHRAEYWQTLYASVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSK
        L  +FLN     +   +  Q LY +     P L L S  D + P  ++    +  + +G  +    +  SPHV H+ +FP  Y + +   L +
Subjt:  LDALFLNR--FESHRAEYWQTLYASVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSK

AT3G19970.1 alpha/beta-Hydrolases superfamily protein1.7e-0922.39Show/hide
Query:  IVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEEL-----KIKRCPIVFASFSGGPKACMYKVLQIIEGYHEPQQ
        +VV   W+ S+++HLK+Y + Y+S G++ ++  +  +N     +    A   +++LV  L     + ++  +VF +FS         +L+      + Q+
Subjt:  IVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEEL-----KIKRCPIVFASFSGGPKACMYKVLQIIEGYHEPQQ

Query:  HSSDAYQLVRDCI--AGYIYDSSPVDFTSDLGTRFLLHPTV--------------LKASQPPRIVSWTA-HSIASGLDALFLNRFESHR--AEYWQTLYA
          S     V+ CI  +  +  + P  + S     FL   +V              +  SQP    + TA   +     A+ LN  + +R  A+   TL +
Subjt:  HSSDAYQLVRDCI--AGYIYDSSPVDFTSDLGTRFLLHPTV--------------LKASQPPRIVSWTA-HSIASGLDALFLNRFESHR--AEYWQTLYA

Query:  SVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLS
        +     P L + S  D + P + + +F       G +V+   +  SPHV HF   P  Y A +   ++
Subjt:  SVSMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLS

AT5G44250.1 Protein of unknown function DUF829, transmembrane 531.5e-12254.72Show/hide
Query:  MWGFGGRYYWGRR-ERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKA
        MWG GG YYW ++    G+ E IVV FAWMSS+ER+LK +V+LYSSL W+SLVCHS+FLNMF PDKAA LA +++  LV+ELK K  P+VFASFSGGP A
Subjt:  MWGFGGRYYWGRR-ERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKA

Query:  CMYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASV
        CMYKVLQI+EG  E   +  D  +LVR+CI+G+IYDS PVDFTSDLG R  +HPT LK S PP+   W A+ IAS LD +FLNRFES RAEYWQTLY+++
Subjt:  CMYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASV

Query:  SMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVR
         M+ PYLILCSE DDLAPYQTI NFA RL++LGG+VKL+KWN SPH GH+ +  ++Y+AAV+E LSKAA VY Q+TR  +  A     + +  +P   + 
Subjt:  SMKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVR

Query:  KAAS-LSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSN---SPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHS
        ++ S L+ SF    L  +DH F  S V  +       ++DEH + ++ LSN   + S  P+GVLGQIL+DV +PKNVEDWDI  S +     R   R   
Subjt:  KAAS-LSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSN---SPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHS

Query:  SFNPIKLMRRSRL
               +RRSRL
Subjt:  SFNPIKLMRRSRL

AT5G44250.2 Protein of unknown function DUF829, transmembrane 531.3e-8151.28Show/hide
Query:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS
        MYKVLQI+EG  E   +  D  +LVR+CI+G+IYDS PVDFTSDLG R  +HPT LK S PP+   W A+ IAS LD +FLNRFES RAEYWQTLY+++ 
Subjt:  MYKVLQIIEGYHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVS

Query:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK
        M+ PYLILCSE DDLAPYQTI NFA RL++LGG+VKL+KWN SPH GH+ +  ++Y+AAV+E LSKAA VY Q+TR  +  A     + +  +P   + +
Subjt:  MKAPYLILCSEEDDLAPYQTIFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRK

Query:  AAS-LSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSN---SPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSS
        + S L+ SF    L  +DH F  S V  +       ++DEH + ++ LSN   + S  P+GVLGQIL+DV +PKNVEDWDI  S +     R   R    
Subjt:  AAS-LSSSFQEPALAPSDHLFFSSMVDGFDYMGTASMRDEHMEGVMRLSN---SPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSS

Query:  FNPIKLMRRSRL
              +RRSRL
Subjt:  FNPIKLMRRSRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGGGATTTGGAGGAAGGTATTATTGGGGAAGAAGAGAGCGAGTCGGGAAAGTAGAAGGGATTGTAGTGGCGTTCGCATGGATGTCTAGTCAGGAGAGGCACTTGAA
AAGATACGTGGAATTGTATTCCTCTCTGGGTTGGAATTCACTCGTTTGCCATTCTGAATTCCTCAATATGTTCTTCCCTGATAAGGCTGCATCTCTGGCATTTGACATTC
TTAAAGCACTAGTCGAGGAGCTAAAGATAAAGAGATGCCCCATAGTATTTGCATCATTTTCTGGTGGGCCTAAAGCATGCATGTATAAGGTTCTCCAGATAATTGAGGGA
TACCATGAACCGCAGCAACATAGTTCGGATGCTTATCAACTGGTCAGGGACTGTATTGCTGGCTATATTTATGATTCCAGTCCAGTTGATTTTACCAGTGACTTGGGAAC
TCGATTTTTGCTTCATCCAACTGTGCTGAAAGCATCCCAACCACCAAGAATAGTATCATGGACAGCACACAGCATTGCTTCTGGTCTTGATGCGCTCTTCCTCAACAGAT
TTGAATCACACCGTGCAGAATATTGGCAAACTCTTTATGCCTCAGTTAGTATGAAAGCTCCTTATCTTATTTTGTGCTCGGAAGAAGATGATCTTGCTCCCTATCAGACA
ATCTTCAATTTTGCTCAACGACTGGAAGATCTTGGGGGAGATGTTAAATTAATCAAATGGAATGGCTCCCCACATGTAGGGCATTTTCTGCATTTTCCAATTGAATATAG
AGCTGCTGTTACAGAGTTGCTAAGTAAGGCCGCTGGAGTTTACTGTCAAAGAACTAGACCTAATGAAGAAGTAGCTGCGGTAGATAAAATGAATTGCGACTCTTGCAAGC
CAACACCTGACGTTAGAAAAGCTGCATCTCTATCAAGTAGTTTTCAGGAGCCTGCTCTTGCTCCAAGCGACCATCTTTTCTTCTCCAGCATGGTGGATGGCTTTGACTAC
ATGGGCACCGCGTCCATGCGTGACGAACACATGGAAGGAGTAATGCGGCTATCTAATTCGCCGAGTTCCATTCCTCATGGAGTTCTCGGTCAAATCCTTTACGATGTATG
TGTTCCGAAGAACGTCGAGGATTGGGACATCGGATCATCAAGTTCTTCAAATGGTGTCTTGCGCACACACACAAGACGACATTCCTCATTCAATCCCATCAAACTGATGC
GTCGCTCGAGACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGGGGATTTGGAGGAAGGTATTATTGGGGAAGAAGAGAGCGAGTCGGGAAAGTAGAAGGGATTGTAGTGGCGTTCGCATGGATGTCTAGTCAGGAGAGGCACTTGAA
AAGATACGTGGAATTGTATTCCTCTCTGGGTTGGAATTCACTCGTTTGCCATTCTGAATTCCTCAATATGTTCTTCCCTGATAAGGCTGCATCTCTGGCATTTGACATTC
TTAAAGCACTAGTCGAGGAGCTAAAGATAAAGAGATGCCCCATAGTATTTGCATCATTTTCTGGTGGGCCTAAAGCATGCATGTATAAGGTTCTCCAGATAATTGAGGGA
TACCATGAACCGCAGCAACATAGTTCGGATGCTTATCAACTGGTCAGGGACTGTATTGCTGGCTATATTTATGATTCCAGTCCAGTTGATTTTACCAGTGACTTGGGAAC
TCGATTTTTGCTTCATCCAACTGTGCTGAAAGCATCCCAACCACCAAGAATAGTATCATGGACAGCACACAGCATTGCTTCTGGTCTTGATGCGCTCTTCCTCAACAGAT
TTGAATCACACCGTGCAGAATATTGGCAAACTCTTTATGCCTCAGTTAGTATGAAAGCTCCTTATCTTATTTTGTGCTCGGAAGAAGATGATCTTGCTCCCTATCAGACA
ATCTTCAATTTTGCTCAACGACTGGAAGATCTTGGGGGAGATGTTAAATTAATCAAATGGAATGGCTCCCCACATGTAGGGCATTTTCTGCATTTTCCAATTGAATATAG
AGCTGCTGTTACAGAGTTGCTAAGTAAGGCCGCTGGAGTTTACTGTCAAAGAACTAGACCTAATGAAGAAGTAGCTGCGGTAGATAAAATGAATTGCGACTCTTGCAAGC
CAACACCTGACGTTAGAAAAGCTGCATCTCTATCAAGTAGTTTTCAGGAGCCTGCTCTTGCTCCAAGCGACCATCTTTTCTTCTCCAGCATGGTGGATGGCTTTGACTAC
ATGGGCACCGCGTCCATGCGTGACGAACACATGGAAGGAGTAATGCGGCTATCTAATTCGCCGAGTTCCATTCCTCATGGAGTTCTCGGTCAAATCCTTTACGATGTATG
TGTTCCGAAGAACGTCGAGGATTGGGACATCGGATCATCAAGTTCTTCAAATGGTGTCTTGCGCACACACACAAGACGACATTCCTCATTCAATCCCATCAAACTGATGC
GTCGCTCGAGACTTTAA
Protein sequenceShow/hide protein sequence
MWGFGGRYYWGRRERVGKVEGIVVAFAWMSSQERHLKRYVELYSSLGWNSLVCHSEFLNMFFPDKAASLAFDILKALVEELKIKRCPIVFASFSGGPKACMYKVLQIIEG
YHEPQQHSSDAYQLVRDCIAGYIYDSSPVDFTSDLGTRFLLHPTVLKASQPPRIVSWTAHSIASGLDALFLNRFESHRAEYWQTLYASVSMKAPYLILCSEEDDLAPYQT
IFNFAQRLEDLGGDVKLIKWNGSPHVGHFLHFPIEYRAAVTELLSKAAGVYCQRTRPNEEVAAVDKMNCDSCKPTPDVRKAASLSSSFQEPALAPSDHLFFSSMVDGFDY
MGTASMRDEHMEGVMRLSNSPSSIPHGVLGQILYDVCVPKNVEDWDIGSSSSSNGVLRTHTRRHSSFNPIKLMRRSRL