; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019759 (gene) of Snake gourd v1 genome

Gene IDTan0019759
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationLG01:25570470..25574744
RNA-Seq ExpressionTan0019759
SyntenyTan0019759
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]5.6e-11955.3Show/hide
Query:  KYNQASDLNLSRKEEFEFETQLKLLNKPFVKT------------FETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLF-KDSYQSNN--ETL
        K ++AS+  LSR+EE E E  LKLLNKP +KT            ++TKEGDIIDCVDINKQPALD+PLLKN KVQT PS Y+S LF KDS Q+NN   TL
Subjt:  KYNQASDLNLSRKEEFEFETQLKLLNKPFVKT------------FETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLF-KDSYQSNN--ETL

Query:  AININ--EESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSS
          N N  EE CP GFVPIRRTLKKDLIRL+SLSS  +  +S S N  +   D+FF D+++FPY +NV+SHS+ KG  KY+G  +  +VY++S +  DQSS
Subjt:  AININ--EESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSS

Query:  SSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVD
        S+NIWIVGGP  D+L V++ GW VNP +NGD   R FV    DGG  TGCYNM CQGFVQV+ S H G+PL PTSTY+GQQY+          F ++ ++
Subjt:  SSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVD

Query:  KSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSN
            WWV +    E   +GYWP EL  NL++GADQI WGGIA+P S++G+SP+LG+GH P  NG YNEGCYI  I  +         +PTW NTLSY SN
Subjt:  KSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSN

Query:  SDCYGLIANQNCGNDFLKYCFTFGGPGGYQCGTT
        + CY L  N NCG D ++YCFTFGGPGG  C  T
Subjt:  SDCYGLIANQNCGNDFLKYCFTFGGPGGYQCGTT

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]4.2e-12254.12Show/hide
Query:  VLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDS--YQSNNETLAININ
        + FVC +CK+N AS+ NLSR+EE E E QLKLLNKPF+KT++TKEGDIIDCVDINKQPALD+PLLKN KVQT PS +IS LFK+    QSNN  L  N N
Subjt:  VLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDS--YQSNNETLAININ

Query:  E-ESCPVGFVPIRRTLKKDLIRLRSLSS--KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIW
          E CP+GFVPIRRTLK+DLIRL+SLSS  KKQES  +  ++      +F  D +RFPYD+NV+SHS++K   Y GA A  AVY++S  + +QSSS+NIW
Subjt:  E-ESCPVGFVPIRRTLKKDLIRLRSLSS--KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIW

Query:  IVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKW
        +VGGP +++LNV++A      +++GDS PR FV    D G  TGCYNMLCQGFV V+     GS + P S YQG+QY+    I + I            W
Subjt:  IVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKW

Query:  WVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIAN
        WV   + ++   +GYWP+ELFPNL+ GA+Q+ WGG A+PS  +  SP LG+GH PNG+ +E C++  I Y+  +Y + +PT  NT++YVS+S CY LI+N
Subjt:  WVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIAN

Query:  QNCGNDFLKYCFTFGGPGGYQCGTT
        +NC  D  KYCFTFGGPGG  C  T
Subjt:  QNCGNDFLKYCFTFGGPGGYQCGTT

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]2.5e-9547.76Show/hide
Query:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN
        MASKA  WL+IVL + ++CK + A D NLSR+EE E E QLKLLN+PF+ TF+T+EGDIIDCVDINKQPALD+P LK+ K+QTRPS Y   L KDS  S 
Subjt:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN

Query:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS
        +++  IN N  +CP G+VPIRRT+KKDLIR+RSLSSK          EP  G        + FPY+++V+S +M KG KY+GAS   +VY+LS V  DQS
Subjt:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS

Query:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFVDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSD
        SSSNIWI+GGP   A NVILAGWQVNP INGDS  R+F                   V+ +  P                                  + 
Subjt:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFVDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSD

Query:  KWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGL
         WW  + + E  + +GYWP ELF +L +G +Q+ WGGIAKPS  NG+SP LGNGH PN  +Y++ CY   + Y+ ++ + Q P  +NT +Y+SN+ CY L
Subjt:  KWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGL

Query:  IANQNCGNDFLKYCFTFGGPGGYQC
           + CG +F  YC TFGGPGG  C
Subjt:  IANQNCGNDFLKYCFTFGGPGGYQC

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]8.2e-8645.06Show/hide
Query:  NQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEESCPVGFVPIR
        ++AS+  LSR+EE E E  LKLLNKP +KT++TKEGDIIDCVDINKQPALD+PLLKN KVQ                                       
Subjt:  NQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEESCPVGFVPIR

Query:  RTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVIL
                                                       V+SHS+ KG  KY+G  +  +VY++S +  DQSSS+NIWIVGGP  D+L V++
Subjt:  RTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVIL

Query:  AGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMG
         GW VNP +NGD   R FV    DGG  TGCYNM CQGFVQV+ S H G+PL PTSTY+GQQY+          F ++ ++    WWV +    E   +G
Subjt:  AGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMG

Query:  YWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKY
        YWP EL  NL++GADQI WGGIA+P S++G+SP+LG+GH P  NG YNEGCYI  I  +         +PTW NTLSY SN+ CY L  N NCG D ++Y
Subjt:  YWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKY

Query:  CFTFGGPGGYQCGTT
        CFTFGGPGG  C  T
Subjt:  CFTFGGPGGYQCGTT

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]1.2e-13256.67Show/hide
Query:  SKATWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFK-DSYQSNNE
        SKAT LVIV FVC +CK+N AS+ NLSR+E+ E E QLKLLNKPF+KT++TKEGDIIDCVDINKQPALD+PLLKN KVQT PS ++S LFK DS QSNN 
Subjt:  SKATWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFK-DSYQSNNE

Query:  TLAININE-ESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSS
         L  N N  E CPVGFVPIRRTLK+DLIRL+SLSS  +  +S  + E +    + F D +RFPY +NV+SHS++K + Y GA A  AV+++S  +  QSS
Subjt:  TLAININE-ESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSS

Query:  SSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLF----VDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVD
        S+NIW++GG S+D+LNV++AGWQVNP++NGD+ PR F    VD GV TGCYNMLCQGFV V+ + H GS + P S YQGQQY+    I + I        
Subjt:  SSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLF----VDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVD

Query:  KSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCY
            WWV   + +    +GYWP ELFPNL+ GADQ+ WGG A+P+     SP LG+GH PNG+ +E  ++  I Y+  +Y + +PT  NT++YVSNS CY
Subjt:  KSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCY

Query:  GLIANQNCGNDFLKYCFTFGGPGGYQC
         LI+N+NC  D  KYCFTFGGPGG+ C
Subjt:  GLIANQNCGNDFLKYCFTFGGPGGYQC

TrEMBL top hitse value%identityAlignment
A0A0A0L400 Neprosin domain-containing protein1.7e-7651.47Show/hide
Query:  FLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLF----VDGGVATGCYNMLC
        F D +RFPY +NV+SHS++K + Y GA A  AV+++S  +  QSSS+NIW++GG S+D+LNV++AGWQVNP++NGD+ PR F    VD GV TGCYNMLC
Subjt:  FLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLF----VDGGVATGCYNMLC

Query:  QGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLG
        QGFV V+ + H GS + P S YQGQQY+    I + I            WWV   + +    +GYWP ELFPNL+ GADQ+ WGG A+P+     SP LG
Subjt:  QGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLG

Query:  NGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPGGYQC
        +GH PNG+ +E  ++  I Y+  +Y + +PT  NT++YVSNS CY LI+N+NC  D  KYCFTFGGPGG+ C
Subjt:  NGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPGGYQC

A0A5A7UEV4 Uncharacterized protein6.4e-7652.84Show/hide
Query:  DEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCY
        D+FF D++++P ++NV+SHS+ KG  KY+G  +  +VY++S +   QSSSSNIWIVGGP+N +L V++ GW VNP +NGD   R FV    DGG  TGCY
Subjt:  DEFFLDTIRFPYDKNVISHSMVKG-RKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCY

Query:  NMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLS
        NM CQGFVQV+ S H G+PL PTSTYQGQQY+          F ++ +  +  WWV +    E   +GYWP EL  NL++GA+QI WGGIAKP S++G+S
Subjt:  NMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLS

Query:  PLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPGGYQCGT
        P+LG+GH P  NG YNEGCYI  I  +        ++PTW NTLSY SN+ CY L  N NCG+D ++YCFTFGGPGG  C T
Subjt:  PLLGNGHMP--NGQYNEGCYIGRIAYMYD--DYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPGGYQCGT

A0A5D3CJM0 Neprosin 22.0e-12254.12Show/hide
Query:  VLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDS--YQSNNETLAININ
        + FVC +CK+N AS+ NLSR+EE E E QLKLLNKPF+KT++TKEGDIIDCVDINKQPALD+PLLKN KVQT PS +IS LFK+    QSNN  L  N N
Subjt:  VLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDS--YQSNNETLAININ

Query:  E-ESCPVGFVPIRRTLKKDLIRLRSLSS--KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIW
          E CP+GFVPIRRTLK+DLIRL+SLSS  KKQES  +  ++      +F  D +RFPYD+NV+SHS++K   Y GA A  AVY++S  + +QSSS+NIW
Subjt:  E-ESCPVGFVPIRRTLKKDLIRLRSLSS--KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIW

Query:  IVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKW
        +VGGP +++LNV++A      +++GDS PR FV    D G  TGCYNMLCQGFV V+     GS + P S YQG+QY+    I + I            W
Subjt:  IVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKW

Query:  WVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIAN
        WV   + ++   +GYWP+ELFPNL+ GA+Q+ WGG A+PS  +  SP LG+GH PNG+ +E C++  I Y+  +Y + +PT  NT++YVS+S CY LI+N
Subjt:  WVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIAN

Query:  QNCGNDFLKYCFTFGGPGGYQCGTT
        +NC  D  KYCFTFGGPGG  C  T
Subjt:  QNCGNDFLKYCFTFGGPGGYQCGTT

A0A6J1CVJ6 uncharacterized protein LOC1110147771.2e-9547.76Show/hide
Query:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN
        MASKA  WL+IVL + ++CK + A D NLSR+EE E E QLKLLN+PF+ TF+T+EGDIIDCVDINKQPALD+P LK+ K+QTRPS Y   L KDS  S 
Subjt:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN

Query:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS
        +++  IN N  +CP G+VPIRRT+KKDLIR+RSLSSK          EP  G        + FPY+++V+S +M KG KY+GAS   +VY+LS V  DQS
Subjt:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS

Query:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFVDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSD
        SSSNIWI+GGP   A NVILAGWQVNP INGDS  R+F                   V+ +  P                                  + 
Subjt:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFVDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSD

Query:  KWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGL
         WW  + + E  + +GYWP ELF +L +G +Q+ WGGIAKPS  NG+SP LGNGH PN  +Y++ CY   + Y+ ++ + Q P  +NT +Y+SN+ CY L
Subjt:  KWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGL

Query:  IANQNCGNDFLKYCFTFGGPGGYQC
           + CG +F  YC TFGGPGG  C
Subjt:  IANQNCGNDFLKYCFTFGGPGGYQC

A0A6J1CW60 uncharacterized protein LOC1110147752.0e-7442.93Show/hide
Query:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN
        MASKA  WL+IVL + ++CK + A D NLS +EE EFE QLKLLNKP + TF+T+EGDIIDCVDINKQPALD+PLLKN KVQ                  
Subjt:  MASKA-TWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSN

Query:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS
                                                                            V S +M +G+KY+G +   +VY+LS V  DQS
Subjt:  NETLAININEESCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQS

Query:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNV
        SSSNIWI+GGP  +A NVIL GWQVNP INGDSR R+FV    DGG+ TG YNM C+ F+Q + ST P  PL P+STYQG+QY+          F V   
Subjt:  SSSNIWIVGGPSNDALNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNV

Query:  DKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYV
          +  WW  + +++    +GYWP ELF +L +GA+Q+ WGGIAKPS  NG+SP LGNGH PN G++++ CY   + Y+ ++ + +V   +NT SY+
Subjt:  DKSDKWWVEILINEETEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNGLSPLLGNGHMPN-GQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)2.1e-5533.09Show/hide
Query:  KEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEE------------SCPVGFVP
        K++FE +  L  LNKP VK+ ++ +GD+IDCV I+KQPA D+P LK+ K+Q +P+ +   LF D     N+  A   NE+             C  G +P
Subjt:  KEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEE------------SCPVGFVP

Query:  IRRTLKKDLIRLRSL----SSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDA
        +RRT + D++R  S+      K++      S EP         D I     ++ I++  V+G KY+GA A   V++    + ++ S S IW++GG     
Subjt:  IRRTLKKDLIRLRSL----SSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDA

Query:  LNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEE
        LN I AGWQV+P + GD+  RLF     D   ATGCYN+LC GF+Q++     G+ + P S Y+  QY++ I I K          K   WW++      
Subjt:  LNVILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEE

Query:  TEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDF
           +GYWP+ LF  L   A  I WGG    S  +G   S  +G+G  P   +++  Y   I  +     ++ P    T  +   S+CY +   Q   ND 
Subjt:  TEAMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDF

Query:  LKYCFTFGGPGGYQ
          + F +GGPG  Q
Subjt:  LKYCFTFGGPGGYQ

AT2G44240.1 Protein of Unknown Function (DUF239)2.4e-5133.25Show/hide
Query:  EFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEE--SCPVGFVPIRRTLKKDLIRL
        E + +  LK LNKP +K+ ++++GDIIDCV I  QPA D+PLLKN  +Q +PS +I     DS  +  ET A  + ++   CP   +PIRRT K++++R 
Subjt:  EFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEE--SCPVGFVPIRRTLKKDLIRL

Query:  RSLSS---KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPS
        +SL S   K  +   + ++ P    +  F+                V+  K++G  A   V+        + S S  WIV G    + N I AGWQV P 
Subjt:  RSLSS---KKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILAGWQVNPS

Query:  INGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFP
        + G++ PRLFV    DG   TGCYN++C GFVQ       G      S Y G Q         V+N  +    K+  WW++I    + + +GYWP  LF 
Subjt:  INGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFP

Query:  NLMNGADQIGWGG-IAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPG
        +L +GA ++ WGG I  P+S    +  +G+GH       +  Y+  I  +     ++ P  Q   SY  N +CY ++   N G  F  + F +GGPG
Subjt:  NLMNGADQIGWGG-IAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPG

AT3G13510.1 Protein of Unknown Function (DUF239)2.1e-5533.5Show/hide
Query:  SRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQS----NNETLAINI--NEESCPVGFVPIRRT
        S +++FE +  L  LNKP VKT ++ +GDIIDC+ I+KQPA D+P LK+ K+Q RPS +   LF D+  S      ET    +      C  G +P+RRT
Subjt:  SRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQS----NNETLAINI--NEESCPVGFVPIRRT

Query:  LKKDLIRLRSL----SSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSM--VKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN
         + D++R  S+      K +      S EP++              ++N   H++  V+G KY+GA A   V++      ++ S S IW++GG     LN
Subjt:  LKKDLIRLRSL----SSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSM--VKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN

Query:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE
         I AGWQV+P + GD+  RLF     D   ATGCYN+LC GF+Q++     G+ + P S Y+  QY++ I I K          K   WW++        
Subjt:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE

Query:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK
         +GYWP+ LF  L   A  I WGG    S   G      +G+GH P   +++  Y   I  +     ++ P    T  +   S+CY +   Q   ND   
Subjt:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK

Query:  YCFTFGGPG
        + F +GGPG
Subjt:  YCFTFGGPG

AT5G56530.1 Protein of Unknown Function (DUF239)3.4e-5333.25Show/hide
Query:  ASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAIN------INEESCPVGF
        A  L++SR + FE    L  LNKP VK+ ++ +GDIIDCV I+KQPA D+P LK+ K+Q  PS    +LF +S  S     ++N           C  G 
Subjt:  ASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAIN------INEESCPVGF

Query:  VPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN
        +P+RRT K+D+  LR+ S K+   K   S       D    D I     ++ I++  V+G K++GA A   V++      ++ S S +WI+GG     LN
Subjt:  VPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN

Query:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE
         I AGWQV+P + GD+  RLF     D   ATGCYN+LC GF+Q++     G+ + P S +   QY++ I I K          K   WW++     +  
Subjt:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE

Query:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK
         +GYWP+ LF  L + A  + WGG       +G   +  +G+G  P+  + +  Y   I  +     ++ P   NT  +   S+CY +   +   ND   
Subjt:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK

Query:  YCFTFGGPG
        + F +GGPG
Subjt:  YCFTFGGPG

AT5G56530.2 Protein of Unknown Function (DUF239)3.4e-5333.25Show/hide
Query:  ASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAIN------INEESCPVGF
        A  L++SR + FE    L  LNKP VK+ ++ +GDIIDCV I+KQPA D+P LK+ K+Q  PS    +LF +S  S     ++N           C  G 
Subjt:  ASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAIN------INEESCPVGF

Query:  VPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN
        +P+RRT K+D+  LR+ S K+   K   S       D    D I     ++ I++  V+G K++GA A   V++      ++ S S +WI+GG     LN
Subjt:  VPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALN

Query:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE
         I AGWQV+P + GD+  RLF     D   ATGCYN+LC GF+Q++     G+ + P S +   QY++ I I K          K   WW++     +  
Subjt:  VILAGWQVNPSINGDSRPRLFV----DGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETE

Query:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK
         +GYWP+ LF  L + A  + WGG       +G   +  +G+G  P+  + +  Y   I  +     ++ P   NT  +   S+CY +   +   ND   
Subjt:  AMGYWPAELFPNLMNGADQIGWGGIAKPSSVNG--LSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLK

Query:  YCFTFGGPG
        + F +GGPG
Subjt:  YCFTFGGPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTAAAGCAACATGGTTGGTGATTGTGTTGTTTGTTTGCATCCATTGCAAATACAACCAAGCCTCTGATTTAAACCTCTCAAGAAAAGAAGAATTTGAGTTCGA
AACACAACTCAAACTTCTCAACAAACCCTTCGTCAAAACATTTGAGACCAAAGAGGGAGATATCATTGATTGTGTCGACATCAATAAACAACCAGCCCTTGATTATCCTT
TATTGAAAAATCAGAAAGTTCAGACTCGACCAAGTAGATATATATCTAATTTGTTCAAAGATTCATATCAATCAAATAATGAGACACTCGCAATTAACATCAATGAAGAA
TCTTGTCCTGTTGGATTTGTTCCCATCCGAAGAACACTAAAGAAAGATCTAATTAGGTTAAGATCTCTATCCTCAAAGAAGCAAGAATCAAAATCTCAATCATCAAATGA
GCCAGAAATTGGTTTTGATGAGTTTTTTCTTGATACTATCAGGTTCCCTTATGATAAAAATGTGATTTCTCATTCTATGGTAAAAGGAAGGAAATATTTTGGAGCTTCTG
CATGTTTTGCTGTATACGATTTGAGTGACGTAGAAGGGGATCAATCATCTTCTTCTAACATATGGATAGTTGGTGGCCCTTCTAACGATGCTCTTAATGTAATACTTGCA
GGCTGGCAGGTGAATCCATCAATAAATGGTGATAGTCGCCCTAGATTGTTTGTTGATGGGGGTGTTGCAACGGGATGTTACAATATGCTTTGTCAAGGTTTTGTACAAGT
ACATCAAAGTACTCATCCAGGCTCTCCTCTTGAACCAACCTCCACTTATCAAGGACAACAATATGAGGTTCCAATCTTTATCAATAAGGTAATAAATTTCAACGTTTTAA
ATGTAGATAAATCAGACAAATGGTGGGTTGAAATACTAATAAATGAGGAAACAGAGGCAATGGGATATTGGCCAGCTGAGTTATTTCCAAATCTAATGAATGGGGCAGAT
CAAATTGGATGGGGAGGAATTGCAAAACCTTCATCAGTAAATGGATTGAGCCCTCTATTGGGAAATGGTCATATGCCAAATGGTCAATATAATGAAGGTTGCTACATTGG
ACGCATAGCCTATATGTACGATGACTATCAAGTTCAAGTACCCACTTGGCAAAATACTCTCAGCTATGTAAGCAATTCTGATTGTTATGGTTTGATTGCCAATCAAAATT
GTGGTAATGATTTCTTGAAATATTGCTTCACCTTTGGAGGACCTGGTGGATATCAATGTGGAACCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTAAAGCAACATGGTTGGTGATTGTGTTGTTTGTTTGCATCCATTGCAAATACAACCAAGCCTCTGATTTAAACCTCTCAAGAAAAGAAGAATTTGAGTTCGA
AACACAACTCAAACTTCTCAACAAACCCTTCGTCAAAACATTTGAGACCAAAGAGGGAGATATCATTGATTGTGTCGACATCAATAAACAACCAGCCCTTGATTATCCTT
TATTGAAAAATCAGAAAGTTCAGACTCGACCAAGTAGATATATATCTAATTTGTTCAAAGATTCATATCAATCAAATAATGAGACACTCGCAATTAACATCAATGAAGAA
TCTTGTCCTGTTGGATTTGTTCCCATCCGAAGAACACTAAAGAAAGATCTAATTAGGTTAAGATCTCTATCCTCAAAGAAGCAAGAATCAAAATCTCAATCATCAAATGA
GCCAGAAATTGGTTTTGATGAGTTTTTTCTTGATACTATCAGGTTCCCTTATGATAAAAATGTGATTTCTCATTCTATGGTAAAAGGAAGGAAATATTTTGGAGCTTCTG
CATGTTTTGCTGTATACGATTTGAGTGACGTAGAAGGGGATCAATCATCTTCTTCTAACATATGGATAGTTGGTGGCCCTTCTAACGATGCTCTTAATGTAATACTTGCA
GGCTGGCAGGTGAATCCATCAATAAATGGTGATAGTCGCCCTAGATTGTTTGTTGATGGGGGTGTTGCAACGGGATGTTACAATATGCTTTGTCAAGGTTTTGTACAAGT
ACATCAAAGTACTCATCCAGGCTCTCCTCTTGAACCAACCTCCACTTATCAAGGACAACAATATGAGGTTCCAATCTTTATCAATAAGGTAATAAATTTCAACGTTTTAA
ATGTAGATAAATCAGACAAATGGTGGGTTGAAATACTAATAAATGAGGAAACAGAGGCAATGGGATATTGGCCAGCTGAGTTATTTCCAAATCTAATGAATGGGGCAGAT
CAAATTGGATGGGGAGGAATTGCAAAACCTTCATCAGTAAATGGATTGAGCCCTCTATTGGGAAATGGTCATATGCCAAATGGTCAATATAATGAAGGTTGCTACATTGG
ACGCATAGCCTATATGTACGATGACTATCAAGTTCAAGTACCCACTTGGCAAAATACTCTCAGCTATGTAAGCAATTCTGATTGTTATGGTTTGATTGCCAATCAAAATT
GTGGTAATGATTTCTTGAAATATTGCTTCACCTTTGGAGGACCTGGTGGATATCAATGTGGAACCACTTAA
Protein sequenceShow/hide protein sequence
MASKATWLVIVLFVCIHCKYNQASDLNLSRKEEFEFETQLKLLNKPFVKTFETKEGDIIDCVDINKQPALDYPLLKNQKVQTRPSRYISNLFKDSYQSNNETLAININEE
SCPVGFVPIRRTLKKDLIRLRSLSSKKQESKSQSSNEPEIGFDEFFLDTIRFPYDKNVISHSMVKGRKYFGASACFAVYDLSDVEGDQSSSSNIWIVGGPSNDALNVILA
GWQVNPSINGDSRPRLFVDGGVATGCYNMLCQGFVQVHQSTHPGSPLEPTSTYQGQQYEVPIFINKVINFNVLNVDKSDKWWVEILINEETEAMGYWPAELFPNLMNGAD
QIGWGGIAKPSSVNGLSPLLGNGHMPNGQYNEGCYIGRIAYMYDDYQVQVPTWQNTLSYVSNSDCYGLIANQNCGNDFLKYCFTFGGPGGYQCGTT