; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015841 (gene) of Snake gourd v1 genome

Gene IDTan0015841
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein AF-9
Genome locationLG02:71370638..71373400
RNA-Seq ExpressionTan0015841
SyntenyTan0015841
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030060.1 hypothetical protein SDJN02_08406, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-8859.28Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PNP+ P+SGA IRSLVKHLKTK+                 +++PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR  E K R RKI K SSS ERN +   Q+NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M I LPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS------------------STDQEGSNGT-ERNRSGGGGG-GMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAW
        GLNLNLQDF NLE N+  +  SVS                  +TDQE  N T E N SGGG G G+HVAVGEEEMAEIRSIG+KH +EWSDKMNLVKSAW
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS------------------STDQEGSNGT-ERNRSGGGGG-GMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAW

Query:  WLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        WLRFMK+  +E      GF FGD  DQ ILEFPDWMNNGN+    EQ  + +  + H  HPS      SALPC+DIGEFEG+DGEWLA
Subjt:  WLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

XP_008464887.1 PREDICTED: protein AF-9 [Cucumis melo]1.5e-9159.85Show/hide
Query:  PMSGAYIRSLVKHLKTKDPIHPNSSSSSSSS------SSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM
        P+S  YI +++KH K K PI+P SSSSSSSS      SSSS++  SKMA+   K+Q +QPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM
Subjt:  PMSGAYIRSLVKHLKTKDPIHPNSSSSSSSS------SSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM

Query:  KKAAAGEQNRQSQTPTEP-----PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMNS-----QSVEENN
        KKAAA   N    +P  P     P R QE K + RKIPKSS++ ERN    PQSNF   NNNN         ++NLCY      MMN       SVE NN
Subjt:  KKAAAGEQNRQSQTPTEP-----PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMNS-----QSVEENN

Query:  --MEIVLPEQTLGLNLNLQDFKNLEANMIFSKGSVS---STDQEGSNGTERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM
          ++IVLPEQTLGLNLNLQDFKNL+AN +FS  SVS   ST        E+ R GGGGGGMHVAVGEEEMAE+R+IGEKH +EWSDKM++VKSAWWLRFM
Subjt:  --MEIVLPEQTLGLNLNLQDFKNLEANMIFSKGSVS---STDQEGSNGTERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM

Query:  KM------EGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        KM      E +E Q    G+ +GDP DQ ILEFPDWMNNGN+N   E+    + +    HH HP       SALPCMDIGEFEG+DGEWLA
Subjt:  KM------EGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

XP_011657359.1 vitellogenin-A2 [Cucumis sativus]1.8e-8959.59Show/hide
Query:  PMSGAYIRSLVKHLKTKDPIHPN---SSSSSSSSSSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA
        P+S  YI +L+KH K K P++PN   SSSSSSSSSSSS++  SKMA+   K+Q +QPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA
Subjt:  PMSGAYIRSLVKHLKTKDPIHPN---SSSSSSSSSSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA

Query:  AAGEQNRQS--QTPTE--PPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMN----SQSVEENN--MEI
        AA   +  +  ++P E   P R QE K + RK PKSS++TERN    PQSNF   NNNN         ++N CY      MMN    S S+E NN  ++I
Subjt:  AAGEQNRQS--QTPTE--PPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMN----SQSVEENN--MEI

Query:  VLPEQTLGLNLNLQDFKNLEANMIFSKGSVS-STDQEGSNGT------ERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFMK
        VLPEQTLGLNLNLQDFKNL+AN +FS  S+S S    GS  T      E+   GGGG GMHVAVGEEEMAE+R+IGEKH +EWSDKM++VKSAWWLRFMK
Subjt:  VLPEQTLGLNLNLQDFKNLEANMIFSKGSVS-STDQEGSNGT------ERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFMK

Query:  MEGEEYQRGDQ-------GFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        M GE+ +  DQ       G+ +GDP DQ ILEFPDWMNNGN+N   E+    + +    HH HP       SALPCMDIGEFEG+DGEWLA
Subjt:  MEGEEYQRGDQ-------GFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

XP_022999732.1 uncharacterized protein LOC111493992 isoform X1 [Cucurbita maxima]3.3e-9161.46Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PNPE P+SGA IRSLVKHLKTK+ I                 +PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR QE K R RKI K SSS ERN +    +NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M IVLPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF
        GLNLNLQDF NLE N+  +  SVS              +TDQE  N T E N SGGG GGG+HVAVGEEEMAEIRSIG+KH +EWSDKMNLVKSAWWLRF
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF

Query:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        MK+ G++ + G  GF FGD  DQ ILEFPDWMNNGN+    EQ  + +  + H  HPS      SALPCMDIGEFEG+DGEWLA
Subjt:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

XP_023546423.1 uncharacterized protein LOC111805545 [Cucurbita pepo subsp. pepo]6.2e-9060Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PNP+ P+SGA IRSLVKHLKTK+ I                 +PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR  E K R RKI K SSS +RN +   Q+NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M IVLPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS--------------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKS
        GLNLNLQDF NLE N+  +  SVS                     TDQE  N T E N SGGG GGG+HVAVGEEEMAEIRSIG+KH +EWSDKMNLVKS
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS--------------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKS

Query:  AWWLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        AWWLRFMK+ G++ + G  GF FGD  DQ ILEFPDWMNNGN+    EQ  + +  + H  HPS      SALPCMDIGEFEG+DGEWLA
Subjt:  AWWLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

TrEMBL top hitse value%identityAlignment
A0A0A0KDI3 Uncharacterized protein3.1e-7958.9Show/hide
Query:  PMSGAYIRSLVKHLKTKDPIHPN---SSSSSSSSSSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA
        P+S  YI +L+KH K K P++PN   SSSSSSSSSSSS++  SKMA+   K+Q +QPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA
Subjt:  PMSGAYIRSLVKHLKTKDPIHPN---SSSSSSSSSSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA

Query:  AAGEQNRQS--QTPTE--PPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMN----SQSVEENN--MEI
        AA   +  +  ++P E   P R QE K + RK PKSS++TERN    PQSNF   NNNN         ++N CY      MMN    S S+E NN  ++I
Subjt:  AAGEQNRQS--QTPTE--PPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMN----SQSVEENN--MEI

Query:  VLPEQTLGLNLNLQDFKNLEANMIFSKGSVS-STDQEGSNGT------ERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFMK
        VLPEQTLGLNLNLQDFKNL+AN +FS  S+S S    GS  T      E+   GGGG GMHVAVGEEEMAE+R+IGEKH +EWSDKM++VKSAWWLRFMK
Subjt:  VLPEQTLGLNLNLQDFKNLEANMIFSKGSVS-STDQEGSNGT------ERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFMK

Query:  MEGEEYQRGDQ-------GFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPS
        M GE+ +  DQ       G+ +GDP DQ ILEFPDWMNNGN+N   E+    + +    HH HPS
Subjt:  MEGEEYQRGDQ-------GFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPS

A0A1S3CML6 protein AF-97.2e-9259.85Show/hide
Query:  PMSGAYIRSLVKHLKTKDPIHPNSSSSSSSS------SSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM
        P+S  YI +++KH K K PI+P SSSSSSSS      SSSS++  SKMA+   K+Q +QPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM
Subjt:  PMSGAYIRSLVKHLKTKDPIHPNSSSSSSSS------SSSSNTHPSKMAEN-EKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAM

Query:  KKAAAGEQNRQSQTPTEP-----PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMNS-----QSVEENN
        KKAAA   N    +P  P     P R QE K + RKIPKSS++ ERN    PQSNF   NNNN         ++NLCY      MMN       SVE NN
Subjt:  KKAAAGEQNRQSQTPTEP-----PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYP----WMMNS-----QSVEENN

Query:  --MEIVLPEQTLGLNLNLQDFKNLEANMIFSKGSVS---STDQEGSNGTERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM
          ++IVLPEQTLGLNLNLQDFKNL+AN +FS  SVS   ST        E+ R GGGGGGMHVAVGEEEMAE+R+IGEKH +EWSDKM++VKSAWWLRFM
Subjt:  --MEIVLPEQTLGLNLNLQDFKNLEANMIFSKGSVS---STDQEGSNGTERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM

Query:  KM------EGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        KM      E +E Q    G+ +GDP DQ ILEFPDWMNNGN+N   E+    + +    HH HP       SALPCMDIGEFEG+DGEWLA
Subjt:  KM------EGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ----HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

A0A6J1G4K1 uncharacterized protein LOC111450745 isoform X11.4e-8758.67Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PN + P+SGA IRSLVKHLKTK+                 +++PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR  E K R RKI K SSS ERN +   Q+NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M I LPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS----------------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLV
        GLNLNLQDF NLE N+  +  SVS                      +TD E  N T E N SGGG GGG+HVAVGEEEMAEIRSIG+KH +EWSDKMNLV
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS----------------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLV

Query:  KSAWWLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        KSAWWLRFMK+ G++ + G  GF FGD  DQ ILEFPDWM+NGN+    EQ  + +  + H  HPS      SALPCMDIGEFEG+DGEWLA
Subjt:  KSAWWLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

A0A6J1KBM1 uncharacterized protein LOC111493992 isoform X11.6e-9161.46Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PNPE P+SGA IRSLVKHLKTK+ I                 +PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR QE K R RKI K SSS ERN +    +NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M IVLPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF
        GLNLNLQDF NLE N+  +  SVS              +TDQE  N T E N SGGG GGG+HVAVGEEEMAEIRSIG+KH +EWSDKMNLVKSAWWLRF
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF

Query:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA
        MK+ G++ + G  GF FGD  DQ ILEFPDWMNNGN+    EQ  + +  + H  HPS      SALPCMDIGEFEG+DGEWLA
Subjt:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA

A0A6J1KKK2 uncharacterized protein LOC111493992 isoform X27.4e-8160.56Show/hide
Query:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
        +PNPE P+SGA IRSLVKHLKTK+ I                 +PSKMAE    Q  +PHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK
Subjt:  DPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMK

Query:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL
        KA   EQ  Q QTP +P PAR QE K R RKI K SSS ERN +    +NFN N+NN  N  N +N+    N  +P W +NSQS+E +N M IVLPEQTL
Subjt:  KAAAGEQNRQSQTPTEP-PARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNH--MRNLCYP-WMMNSQSVE-ENNMEIVLPEQTL

Query:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF
        GLNLNLQDF NLE N+  +  SVS              +TDQE  N T E N SGGG GGG+HVAVGEEEMAEIRSIG+KH +EWSDKMNLVKSAWWLRF
Subjt:  GLNLNLQDFKNLEANMIFSKGSVS--------------STDQEGSNGT-ERNRSGGG-GGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRF

Query:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPA
        MK+ G++ + G  GF FGD  DQ ILEFPDWMNNGN+    EQ  + +  + H  HPS A
Subjt:  MKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ--HHFFHHHHHHHPSPA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G21280.1 hydroxyproline-rich glycoprotein family protein3.0e-1831.88Show/hide
Query:  EAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKAAA
        E  +S  YIRSLVK             + S++ ++++ T  +  +   K Q +Q HKKQVRRRLHTSRPYQERLLNMAEARREIVTALK HRA+M++A  
Subjt:  EAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKAAA

Query:  GEQNRQSQTPTEPPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYPWMMNSQSVEENNMEIVLPEQTLGLNLNLQDF
                 P +PP   Q +   +   P             P   F++ N + N                              +LP Q LGLNLN QDF
Subjt:  GEQNRQSQTPTEPPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYPWMMNSQSVEENNMEIVLPEQTLGLNLNLQDF

Query:  KN-LEANMIFSKGSVSSTDQEGSNGTERN---RSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM--KMEGEEYQRGDQGFEFGD
         + ++ +   S  S SST    S+    N    S             +   ++ S         + + N+V SAWW   M   +E E     ++     D
Subjt:  KN-LEANMIFSKGSVSSTDQEGSNGTERN---RSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFM--KMEGEEYQRGDQGFEFGD

Query:  ---PLDQIILEFPDWMNNGNDNSSFEQHHFFHHHHHHHPSPASASASALPCMDIGEFEGLDG-EWLA
           P    ++EFP W+N   +    E  H ++   H+  SP +     L CM+IGE EG+DG +WLA
Subjt:  ---PLDQIILEFPDWMNNGNDNSSFEQHHFFHHHHHHHPSPASASASALPCMDIGEFEGLDG-EWLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTCGATAAGCTTAAGGAAGATCCAAATCCAGAGGCTCCTATGTCCGGCGCTTACATCCGTAGCCTCGTCAAACATTTGAAAACCAAAGATCCCATTCATCCCAA
TTCTTCTTCTTCTTCCTCTTCCTCTTCTTCCTCTTCCAACACCCATCCTTCAAAAATGGCAGAAAATGAGAAGGCTCAAGCAAGTCAGCCGCACAAAAAGCAAGTTCGAA
GAAGGCTCCATACAAGTCGACCGTACCAAGAGCGGCTGTTGAATATGGCTGAAGCGAGACGGGAGATTGTGACTGCGCTTAAGTATCACCGGGCGGCGATGAAGAAGGCC
GCCGCAGGCGAGCAGAATCGGCAATCGCAGACGCCGACGGAGCCCCCTGCCAGATGCCAAGAAGTGAAGACAAGGGCGAGGAAAATCCCCAAATCGTCGAGTAGTACAGA
GAGAAATAATTTAATTTACCCCCAAAGCAATTTCAACTATAATAATAATAATAATAATAATAATAATAACAACAACAATCATATGAGGAATTTATGTTATCCATGGATGA
TGAATTCGCAATCGGTGGAGGAGAATAATATGGAGATTGTATTACCAGAGCAAACTCTAGGGCTGAATCTGAACTTGCAGGATTTCAAGAATTTGGAGGCGAATATGATA
TTCTCAAAGGGAAGCGTTTCATCGACAGATCAGGAGGGTTCGAATGGAACAGAGAGGAATAGGAGCGGCGGCGGCGGAGGAGGGATGCACGTGGCGGTGGGGGAGGAGGA
GATGGCGGAGATAAGATCGATAGGGGAGAAGCACTCGGTGGAGTGGAGTGATAAGATGAATTTGGTGAAATCGGCGTGGTGGTTGAGGTTCATGAAAATGGAAGGGGAAG
AATATCAAAGAGGAGATCAAGGATTTGAGTTTGGGGATCCATTAGATCAAATAATATTGGAGTTTCCAGATTGGATGAACAACGGCAACGACAACTCATCTTTTGAACAA
CACCACTTCTTCCATCATCATCATCATCATCATCCTTCTCCTGCTTCTGCTTCTGCTTCTGCTTTGCCTTGCATGGACATTGGAGAGTTTGAAGGCTTGGATGGGGAGTG
GTTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTCGATAAGCTTAAGGAAGATCCAAATCCAGAGGCTCCTATGTCCGGCGCTTACATCCGTAGCCTCGTCAAACATTTGAAAACCAAAGATCCCATTCATCCCAA
TTCTTCTTCTTCTTCCTCTTCCTCTTCTTCCTCTTCCAACACCCATCCTTCAAAAATGGCAGAAAATGAGAAGGCTCAAGCAAGTCAGCCGCACAAAAAGCAAGTTCGAA
GAAGGCTCCATACAAGTCGACCGTACCAAGAGCGGCTGTTGAATATGGCTGAAGCGAGACGGGAGATTGTGACTGCGCTTAAGTATCACCGGGCGGCGATGAAGAAGGCC
GCCGCAGGCGAGCAGAATCGGCAATCGCAGACGCCGACGGAGCCCCCTGCCAGATGCCAAGAAGTGAAGACAAGGGCGAGGAAAATCCCCAAATCGTCGAGTAGTACAGA
GAGAAATAATTTAATTTACCCCCAAAGCAATTTCAACTATAATAATAATAATAATAATAATAATAATAACAACAACAATCATATGAGGAATTTATGTTATCCATGGATGA
TGAATTCGCAATCGGTGGAGGAGAATAATATGGAGATTGTATTACCAGAGCAAACTCTAGGGCTGAATCTGAACTTGCAGGATTTCAAGAATTTGGAGGCGAATATGATA
TTCTCAAAGGGAAGCGTTTCATCGACAGATCAGGAGGGTTCGAATGGAACAGAGAGGAATAGGAGCGGCGGCGGCGGAGGAGGGATGCACGTGGCGGTGGGGGAGGAGGA
GATGGCGGAGATAAGATCGATAGGGGAGAAGCACTCGGTGGAGTGGAGTGATAAGATGAATTTGGTGAAATCGGCGTGGTGGTTGAGGTTCATGAAAATGGAAGGGGAAG
AATATCAAAGAGGAGATCAAGGATTTGAGTTTGGGGATCCATTAGATCAAATAATATTGGAGTTTCCAGATTGGATGAACAACGGCAACGACAACTCATCTTTTGAACAA
CACCACTTCTTCCATCATCATCATCATCATCATCCTTCTCCTGCTTCTGCTTCTGCTTCTGCTTTGCCTTGCATGGACATTGGAGAGTTTGAAGGCTTGGATGGGGAGTG
GTTAGCTTGA
Protein sequenceShow/hide protein sequence
MEVDKLKEDPNPEAPMSGAYIRSLVKHLKTKDPIHPNSSSSSSSSSSSSNTHPSKMAENEKAQASQPHKKQVRRRLHTSRPYQERLLNMAEARREIVTALKYHRAAMKKA
AAGEQNRQSQTPTEPPARCQEVKTRARKIPKSSSSTERNNLIYPQSNFNYNNNNNNNNNNNNNHMRNLCYPWMMNSQSVEENNMEIVLPEQTLGLNLNLQDFKNLEANMI
FSKGSVSSTDQEGSNGTERNRSGGGGGGMHVAVGEEEMAEIRSIGEKHSVEWSDKMNLVKSAWWLRFMKMEGEEYQRGDQGFEFGDPLDQIILEFPDWMNNGNDNSSFEQ
HHFFHHHHHHHPSPASASASALPCMDIGEFEGLDGEWLA