; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016937 (gene) of Snake gourd v1 genome

Gene IDTan0016937
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG01:25037906..25040798
RNA-Seq ExpressionTan0016937
SyntenyTan0016937
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]1.2e-10152.93Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLM-----STNNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------Q
        TKEGDIIDCVDINKQPALDHP LK+HKVQT P+ Y     K+ S   +N +      + N  + CP GFVPIRRTLKKDLIRL+SL S           Q
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLM-----STNNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------Q

Query:  TIDIKDSTIGAVTFPHDKNVVSLGLKKGI-TYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTG
             D    +V FP+ +NVVS  L+KG   YYG  +++SVYN+S++ DQSSS N+WIVGGP  +L V++ GW VNP VNGD +TR F    ADGG  TG
Subjt:  TIDIKDSTIGAVTFPHDKNVVSLGLKKGI-TYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTG

Query:  CYNMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKP
        CYNM+C+GFVQV+       PL P+STY G Q DYQF + Q    GNWW+L+ +  + +GYWPKEL   L +GA+Q+AWGGIA+PS  G+SP LG+ +KP
Subjt:  CYNMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKP

Query:  --SGKYNEACYFRNIGFITDKRQSQ--MPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCRGNI
          +G YNE CY RNI  I+    +   +P + NT++Y SN++CY L N N+NCG D ++YCFTFGGPGG NC   I
Subjt:  --SGKYNEACYFRNIGFITDKRQSQ--MPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCRGNI

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]5.9e-9351.5Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMST---NNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTI
        TKEGDIIDCVDINKQPALDHP LK+HKVQT P+ +     K  S S+ N+ + T   NN + CPIGFVPIRRTLK+DLIRL+SL S           Q  
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMST---NNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTI

Query:  DIKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVA-QDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY
          KD +  AV FP+D+NVVS  L K   Y+G  A ++VYN+S++ ++QSSSAN+W+VGGP  +LNV++       AV+GDSL R F     D G  TGCY
Subjt:  DIKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVA-QDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY

Query:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPS-PSGMSPPLGNSYKPS
        NM C+GFV V+  I     + P+S Y+G Q DYQF + Q +  G+WW+ + D ++ +GYWP ELF  L  GAEQVAWGG A+PS  S  SPPLG+ +KP+
Subjt:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPS-PSGMSPPLGNSYKPS

Query:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC
        G+ +EAC+ RNI +I       +P   NT+ YVS+S+CY L +N  NC  D  +YCFTFGGPGG +C
Subjt:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]5.6e-10454.73Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP
        T+EGDIIDCVDINKQPALDHPSLK HK+QTRP+ YP G  K  S S+  S ++ NNR +CP G+VPIRRT+KKDLIR+RSL SK+   IK S  G V FP
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP

Query:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF
        ++++VVS+ +KKGI YYG +  VSVYNLSVAQDQSSS+N+WI+GGPP A NVIL GWQVNP +NGDSLTR+                             
Subjt:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF

Query:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGFITDK
                        FV + D  +GNWWL + +    IGYWPKELFG+L +G EQVAWGGIAKPSP+GMSPPLGN +KP+  KY++ACYFR + ++ + 
Subjt:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGFITDK

Query:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC
         + Q P  +NT  Y+SN++CY L+N    CG +   YC TFGGPGGNNC
Subjt:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC

XP_024038072.1 uncharacterized protein LOC18039972 [Citrus clementina]1.7e-8748.72Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP
        T+EGD IDCVDINKQPALDHP LK+HKVQTRP  +P G  K+ S+SK + +   + R+SCP G VPIRRT K+DLI+  S+P +    I   T    +FP
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP

Query:  --HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFCRGFVQVHQ
          H  + V++G+KK + Y+G++  + V+NL+VA+DQ S  N+WI  GPP  LNVIL GW V+PA+NGD LTRLF    ADG   TGCYN  C GFV  + 
Subjt:  --HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFCRGFVQVHQ

Query:  GITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPSGKYNEACYFRNIG
         ITP++PL  +STY   Q D + ++YQD  +GNWWL+L+D  + +GYWPKELF +L  GAE VAWGGIA    +G+SPP+G+    +  +   CY RNI 
Subjt:  GITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPSGKYNEACYFRNIG

Query:  FITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGG
        ++  + + + P       ++  S CYGL +   NCG   + YC  FGG GG
Subjt:  FITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGG

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]1.9e-9650.81Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNR--QSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTID
        TKEGDIIDCVDINKQPALDHP LK+HKVQT P+ +     K  S   +N ++++NN   + CP+GFVPIRRTLK+DLIRL+SL S           +  D
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNR--QSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTID

Query:  IK-DSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVAQD-QSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY
        +  DS   AV FP+ +NVVS  L K   Y+G  A ++V+N+S++ + QSSSAN+W++GG   +LNV++ GWQVNPAVNGD+L R F     D GV TGCY
Subjt:  IK-DSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVAQD-QSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY

Query:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG-MSPPLGNSYKPS
        NM C+GFV V+  I     + P+S Y+G Q DYQF + Q +  G+WW+ + D ++ +GYWP ELF  L  GA+QVAWGG A+P+  G  SPPLG+ +KP+
Subjt:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG-MSPPLGNSYKPS

Query:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCRGN
        GK +EA + RNI +I       +P   NT+ YVSNS+CY L +N  NC  D  +YCFTFGGPGG+ C  +
Subjt:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCRGN

TrEMBL top hitse value%identityAlignment
A0A059CIH2 Uncharacterized protein3.0e-7944.94Show/hide
Query:  WLTKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRS---LPSKQTIDIKDSTIG
        +LT+EGDIIDC+DI+KQPA+DHP LK+HK+Q +P      + K  S +K+   +   +R+ CPIG VPI+R  K+DLIR RS   +PS   ++I+D+   
Subjt:  WLTKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRS---LPSKQTIDIKDSTIG

Query:  A---VTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF---ADGGVKTGCYNMFCRGF
            V   HD           +  YG + ++SVYN+S A DQ SS N+WI  GPP  +++I+ GW+V+P +N D LTRLF      G + GCYN FC+GF
Subjt:  A---VTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF---ADGGVKTGCYNMFCRGF

Query:  VQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPSGKYNEACY
        VQV + ITP++PL+P STY GP  + +  V QD+ +GNWWL + D  I +GYWPKELF  L+NG+   AWGG+AK   +G  PP+GN + P    ++A Y
Subjt:  VQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPSGKYNEACY

Query:  FRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGG
        FR + ++    +S  P Y+N    V   +CY L   NL    D   Y FTFGGPGG
Subjt:  FRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGG

A0A5D3CJM0 Neprosin 22.8e-9351.5Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMST---NNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTI
        TKEGDIIDCVDINKQPALDHP LK+HKVQT P+ +     K  S S+ N+ + T   NN + CPIGFVPIRRTLK+DLIRL+SL S           Q  
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMST---NNRQSCPIGFVPIRRTLKKDLIRLRSLPSK----------QTI

Query:  DIKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVA-QDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY
          KD +  AV FP+D+NVVS  L K   Y+G  A ++VYN+S++ ++QSSSAN+W+VGGP  +LNV++       AV+GDSL R F     D G  TGCY
Subjt:  DIKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSVA-QDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY

Query:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPS-PSGMSPPLGNSYKPS
        NM C+GFV V+  I     + P+S Y+G Q DYQF + Q +  G+WW+ + D ++ +GYWP ELF  L  GAEQVAWGG A+PS  S  SPPLG+ +KP+
Subjt:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPS-PSGMSPPLGNSYKPS

Query:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC
        G+ +EAC+ RNI +I       +P   NT+ YVS+S+CY L +N  NC  D  +YCFTFGGPGG +C
Subjt:  GKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC

A0A6J1CVJ6 uncharacterized protein LOC1110147772.7e-10454.73Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP
        T+EGDIIDCVDINKQPALDHPSLK HK+QTRP+ YP G  K  S S+  S ++ NNR +CP G+VPIRRT+KKDLIR+RSL SK+   IK S  G V FP
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP

Query:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF
        ++++VVS+ +KKGI YYG +  VSVYNLSVAQDQSSS+N+WI+GGPP A NVIL GWQVNP +NGDSLTR+                             
Subjt:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF

Query:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGFITDK
                        FV + D  +GNWWL + +    IGYWPKELFG+L +G EQVAWGGIAKPSP+GMSPPLGN +KP+  KY++ACYFR + ++ + 
Subjt:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGFITDK

Query:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC
         + Q P  +NT  Y+SN++CY L+N    CG +   YC TFGGPGGNNC
Subjt:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNC

A0A6J1CVW9 uncharacterized protein LOC1110147741.5e-7846.57Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP
        TKEG+IIDCVDI+KQPALDHPSLK+HKVQ RP+ YP G  K  + SK+                                                    
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP

Query:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF
           NVVSLGLK+GI YYG+  + SVYNLSVAQDQSSS+N+WIVGGPP  LN       VNP +NGDSLTR+                             
Subjt:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDF

Query:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKP-SGKYNEACYFRNIGFITDK
                        FV + D  +G+WWL ++D +  IGYWPKELFG+L +GAEQVAWGGIAKPSP+GMSPPLGN +KP +GKYNEACYF++I +I   
Subjt:  PLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKP-SGKYNEACYFRNIGFITDK

Query:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCR
             P Y+N V++VSNS+CYGL +    C  D + +CFTFGGPGGNNCR
Subjt:  RQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGNNCR

A0A6J1CW60 uncharacterized protein LOC1110147754.2e-8148.9Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP
        T+EGDIIDCVDINKQPALDHP LK+HKVQ                                                                       
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFP

Query:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFCRGFVQVHQGI
            V S+ +K+G  YYG    VSVYNLSVAQDQSSS+N+WI+GGPP A NVIL GWQVNP +NGDS TR+F    ADGG+ TG YNMFCR F+Q +   
Subjt:  HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFCRGFVQVHQGI

Query:  TPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGF
         P+ PL PSSTY+G Q DY F V+QD  +G+WWL ++D +  IGYWPKELFG+L +GAEQVAWGGIAKPSP+GMSPPLGN +KP+ GK+++ACYFR + +
Subjt:  TPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPS-GKYNEACYFRNIGF

Query:  ITDKRQSQMPFYQNTVAYV
        I +  +S++   +NT +Y+
Subjt:  ITDKRQSQMPFYQNTVAYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)5.3e-5231.88Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL-------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSL-----PSKQTID
        + +GD+IDCV I+KQPA DHP LKDHK+Q +PN +P+G          + +E + +     +    C  G +P+RRT + D++R  S+       ++++ 
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL-------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSL-----PSKQTID

Query:  IKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY
        +  S    +      +  ++   +G  YYG  A ++V+   +  Q++ S + +W++GG     LN I  GWQV+P + GD+ TRLF    +D    TGCY
Subjt:  IKDSTIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCY

Query:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG---MSPPLGNSYK
        N+ C GF+Q++  I     +SP S YR  Q D   ++++D + G+WW+   +  + +GYWP  LF YL   A  + WGG    S S     S  +G+   
Subjt:  NMFCRGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG---MSPPLGNSYK

Query:  PSGKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN
        P   +++A YFRNI  +      + P  +    +   SNCY +   +    +D   + F +GGPG N
Subjt:  PSGKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN

AT2G44220.1 Protein of Unknown Function (DUF239)3.2e-4933.89Show/hide
Query:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYP---DGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIR---LRSLPSKQTIDIKDSTI
        +++GD+IDCV I  QPA DH  LK+H +Q RP+ YP     + KR++++        +    CP   VPIRRT K+DL+R   +RS   K    I  +T 
Subjt:  TKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYP---DGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIR---LRSLPSKQTIDIKDSTI

Query:  GAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFA----DGGVKTGCYNMFCRGF
           T  H      +G++ G  +YG    ++++   V    + S A  W+V G   +LN I  GWQV P +  D+  R F     DG  KTGCYN+ C GF
Subjt:  GAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFA----DGGVKTGCYNMFCRGF

Query:  VQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG---MSPPLGNSYKPSGKYNE
        VQ     T    ++  S YRG Q D   ++++D ++GNWWL + ++ + IGYWP  LF  L   A +V WGG    S +G    +  +G+ +     + +
Subjt:  VQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSG---MSPPLGNSYKPSGKYNE

Query:  ACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN
        A YFRN+  +      + P  Q    +    NCY +   N   G       F +GGPG N
Subjt:  ACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN

AT3G13510.1 Protein of Unknown Function (DUF239)5.9e-5132.79Show/hide
Query:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQ------SCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGA
        +GDIIDC+ I+KQPA DHP LKDHK+Q RP+ +P+G       S       T+  Q       C  G +P+RRT + D++R  S+        +   I  
Subjt:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQ------SCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGA

Query:  VTFP----HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQ-DQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC
           P     + +  ++   +G  YYG  A ++V+   +   ++ S + +W++GG     LN I  GWQV+P + GD+ TRLF    +D    TGCYN+ C
Subjt:  VTFP----HDKNVVSLGLKKGITYYGINAHVSVYNLSVAQ-DQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC

Query:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPS---GMSPPLGNSYKPSGK
         GF+Q++  I     +SP S YR  Q D   ++++D + G+WW+   +  + +GYWP  LF YL   A  + WGG    S S        +G+ + P   
Subjt:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPS---GMSPPLGNSYKPSGK

Query:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NC
        +++A YFRNI  +      + P  +    +   SNCY +   +    +D   + F +GGPG N NC
Subjt:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NC

AT5G56530.1 Protein of Unknown Function (DUF239)1.7e-5033.79Show/hide
Query:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLP---SKQTIDIK-DS
        +GDIIDCV I+KQPA DHP LKDHK+Q  P+  P+         ++  ES +      +    C  G +P+RRT K+D++R  S+     K+ + +    
Subjt:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLP---SKQTIDIK-DS

Query:  TIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC
        +          +  ++   +G  +YG  A ++V+   V + ++ S + LWI+GG     LN I  GWQV+P + GD+ TRLF    +D    TGCYN+ C
Subjt:  TIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC

Query:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGG-IAKPSPSG--MSPPLGNSYKPSGK
         GF+Q++  I     +SP S +  PQ D    +++D + G+WW+   D  + +GYWP  LF YL + A  V WGG +      G   +  +G+   P   
Subjt:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGG-IAKPSPSG--MSPPLGNSYKPSGK

Query:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NCR
        + +A YFRNI  +      + P   NT  +   SNCY +    +   DD   Y F +GGPG N NC+
Subjt:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NCR

AT5G56530.2 Protein of Unknown Function (DUF239)1.7e-5033.79Show/hide
Query:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLP---SKQTIDIK-DS
        +GDIIDCV I+KQPA DHP LKDHK+Q  P+  P+         ++  ES +      +    C  G +P+RRT K+D++R  S+     K+ + +    
Subjt:  EGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWL------KRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLP---SKQTIDIK-DS

Query:  TIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC
        +          +  ++   +G  +YG  A ++V+   V + ++ S + LWI+GG     LN I  GWQV+P + GD+ TRLF    +D    TGCYN+ C
Subjt:  TIGAVTFPHDKNVVSLGLKKGITYYGINAHVSVYNLSV-AQDQSSSANLWIVGGP-PYALNVILVGWQVNPAVNGDSLTRLF----ADGGVKTGCYNMFC

Query:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGG-IAKPSPSG--MSPPLGNSYKPSGK
         GF+Q++  I     +SP S +  PQ D    +++D + G+WW+   D  + +GYWP  LF YL + A  V WGG +      G   +  +G+   P   
Subjt:  RGFVQVHQGITPDFPLSPSSTYRGPQNDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGG-IAKPSPSG--MSPPLGNSYKPSGK

Query:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NCR
        + +A YFRNI  +      + P   NT  +   SNCY +    +   DD   Y F +GGPG N NC+
Subjt:  YNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGLNNNNLNCGDDRLQYCFTFGGPGGN-NCR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGAAAGCAATGTGGTTAACAAAAGAGGGAGATATCATTGATTGTGTTGACATCAATAAACAACCAGCTCTCGATCATCCTTCACTAAAAGATCACAAAGTTCA
GACTCGGCCAAATAGATATCCAGACGGTTGGCTAAAACGGCAATCTGAATCAAAACATAACTCACTCATGAGTACCAACAACAGACAATCTTGTCCTATTGGATTTGTTC
CAATCCGAAGAACTCTAAAGAAAGATTTAATTAGGTTAAGGTCTCTACCATCAAAGCAAACAATCGATATAAAAGACAGTACTATTGGTGCAGTCACGTTTCCTCACGAT
AAAAATGTGGTTTCTCTTGGTTTGAAGAAAGGTATAACATATTATGGAATTAATGCACACGTATCGGTATACAATTTAAGTGTGGCTCAAGATCAGTCTTCTTCTGCAAA
CCTATGGATAGTTGGTGGTCCTCCTTATGCTCTTAATGTAATACTTGTCGGCTGGCAGGTAAATCCAGCCGTAAATGGTGATAGTCTCACAAGGTTGTTTGCTGATGGGG
GTGTTAAAACGGGATGTTACAATATGTTTTGTCGAGGTTTTGTACAAGTACATCAGGGTATTACTCCAGATTTTCCTCTTTCTCCATCCTCCACCTATCGAGGGCCACAA
AATGATTATCAATTTGTAGTTTATCAGGATGTAGAATCAGGAAATTGGTGGCTTCTTTTAACAGATCGTAGAATAGCCATTGGATATTGGCCAAAGGAGTTGTTTGGATA
TCTCAAAAATGGAGCCGAACAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAGTGGAATGAGCCCTCCCTTAGGGAATTCCTATAAGCCAAGTGGTAAATACAATG
AGGCTTGCTACTTTAGAAACATTGGTTTTATAACAGATAAACGTCAAAGTCAAATGCCTTTTTATCAAAATACAGTGGCTTATGTGAGTAACTCCAACTGTTATGGTTTG
AACAATAATAATCTCAATTGTGGTGACGATCGGTTACAGTATTGCTTTACCTTTGGAGGACCAGGTGGAAATAATTGTAGAGGCAACATATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGAAAGCAATGTGGTTAACAAAAGAGGGAGATATCATTGATTGTGTTGACATCAATAAACAACCAGCTCTCGATCATCCTTCACTAAAAGATCACAAAGTTCA
GACTCGGCCAAATAGATATCCAGACGGTTGGCTAAAACGGCAATCTGAATCAAAACATAACTCACTCATGAGTACCAACAACAGACAATCTTGTCCTATTGGATTTGTTC
CAATCCGAAGAACTCTAAAGAAAGATTTAATTAGGTTAAGGTCTCTACCATCAAAGCAAACAATCGATATAAAAGACAGTACTATTGGTGCAGTCACGTTTCCTCACGAT
AAAAATGTGGTTTCTCTTGGTTTGAAGAAAGGTATAACATATTATGGAATTAATGCACACGTATCGGTATACAATTTAAGTGTGGCTCAAGATCAGTCTTCTTCTGCAAA
CCTATGGATAGTTGGTGGTCCTCCTTATGCTCTTAATGTAATACTTGTCGGCTGGCAGGTAAATCCAGCCGTAAATGGTGATAGTCTCACAAGGTTGTTTGCTGATGGGG
GTGTTAAAACGGGATGTTACAATATGTTTTGTCGAGGTTTTGTACAAGTACATCAGGGTATTACTCCAGATTTTCCTCTTTCTCCATCCTCCACCTATCGAGGGCCACAA
AATGATTATCAATTTGTAGTTTATCAGGATGTAGAATCAGGAAATTGGTGGCTTCTTTTAACAGATCGTAGAATAGCCATTGGATATTGGCCAAAGGAGTTGTTTGGATA
TCTCAAAAATGGAGCCGAACAAGTGGCATGGGGAGGCATTGCAAAGCCTTCACCAAGTGGAATGAGCCCTCCCTTAGGGAATTCCTATAAGCCAAGTGGTAAATACAATG
AGGCTTGCTACTTTAGAAACATTGGTTTTATAACAGATAAACGTCAAAGTCAAATGCCTTTTTATCAAAATACAGTGGCTTATGTGAGTAACTCCAACTGTTATGGTTTG
AACAATAATAATCTCAATTGTGGTGACGATCGGTTACAGTATTGCTTTACCTTTGGAGGACCAGGTGGAAATAATTGTAGAGGCAACATATAA
Protein sequenceShow/hide protein sequence
MASKAMWLTKEGDIIDCVDINKQPALDHPSLKDHKVQTRPNRYPDGWLKRQSESKHNSLMSTNNRQSCPIGFVPIRRTLKKDLIRLRSLPSKQTIDIKDSTIGAVTFPHD
KNVVSLGLKKGITYYGINAHVSVYNLSVAQDQSSSANLWIVGGPPYALNVILVGWQVNPAVNGDSLTRLFADGGVKTGCYNMFCRGFVQVHQGITPDFPLSPSSTYRGPQ
NDYQFVVYQDVESGNWWLLLTDRRIAIGYWPKELFGYLKNGAEQVAWGGIAKPSPSGMSPPLGNSYKPSGKYNEACYFRNIGFITDKRQSQMPFYQNTVAYVSNSNCYGL
NNNNLNCGDDRLQYCFTFGGPGGNNCRGNI