; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003034 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003034
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationChr11:16554374..16566040
RNA-Seq ExpressionHG10003034
SyntenyHG10003034
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]4.5e-13166.49Show/hide
Query:  KCNEASNSNLSREEELELEGQLKLLNKPYIKT------------FQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDI--L
        K +EASNS LSREEELE+E  LKLLNKP IKT            +QTK+GDIIDCV+INKQPALDHPLLKNHKVQT PS Y+SKLFK +SSQ+NN I  L
Subjt:  KCNEASNSNLSREEELELEGQLKLLNKPYIKT------------FQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDI--L

Query:  TSNNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKG-PKYYGANARIVVYNLSLSSEQ
         SNNNN  EGCP GFVPIRRTLK+DLIRL+SLSSNNK +    QSS  PQDD SDDF  D+VKFPY QNVVSHSL KG  KYYG  + + VYN+SLS +Q
Subjt:  TSNNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKG-PKYYGANARIVVYNLSLSSEQ

Query:  SSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLV
        SSS NIWIVGG  +SL VLM GW VNP +N + ++R FVYWTADGG  TGCYNM CQ  V VN +  +G+ LLP STY+GQQYDYQFTIIQ  GNWWVLV
Subjt:  SSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLV

Query:  GENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIASPNLDNT
        GEN +G+GYWPKEL+ +L DGAD++AWGGIA+PS D  SP LG+GHKP  NG++ E CYIRNI  I  A  +    P  DNT
Subjt:  GENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIASPNLDNT

TYK11502.1 neprosin 2 [Cucumis melo var. makuwa]3.4e-13967.99Show/hide
Query:  VLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNS-SQSNNDILTSNNN
        + FVCFN K N ASN NLSREEELE+E QLKLLNKP+IKT++TK+GDIIDCV+INKQPALDHPLLKNHKVQT PS ++SKLFK++S SQSNN ILTS NN
Subjt:  VLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNS-SQSNNDILTSNNN

Query:  NNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSAN
        NNGEGCP GFVPIRRTLKEDLIRL+SLSSN K++    +SS KPQDD S DF +D V+FPY+QNVVSHSL+K   Y+GA ARI VYN+SLS E QSSSAN
Subjt:  NNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSAN

Query:  IWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQV
        IW+VGG D SLNVLMA      A++ +SL R FVYWT D GA TGCYNM+CQ  V VN ++ +GS++LPAS Y+G+QYDYQF+I+QA G+WWV VG++QV
Subjt:  IWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQV

Query:  GVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV
        G+GYWP EL P+L  GA++VAWGG A+PS   +ESPPLG+GHKPNG   EAC++RNI YIA+++ ++ P LDNT  +V
Subjt:  GVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]2.9e-9049.2Show/hide
Query:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS
        WL+IVL +  N K + A +SNLSREEELELE QLKLLN+P+I TFQT++GDIIDCV+INKQPALDHP LK+HK+QTRPS Y   L KD+SS  +   +  
Subjt:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS

Query:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS
          NNN   CPAG+VPIRRT+K+DLIR+RSLSS       TS              +   V FPYNQ+VVS ++ KG KYYGA+  + VYNLS++ +QSSS
Subjt:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS

Query:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGEN
        +NIWI+GG   + NV++AGWQVNP +N +SL+RMFVYWT                                                +  GNWW+ VGE+
Subjt:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGEN

Query:  QVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT
           +GYWPKEL   L+DG ++VAWGGIAKPS +  SPPLGNGHKPN  ++ +ACY R ++Y+  +++   P  +NT
Subjt:  QVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT

XP_031738648.1 uncharacterized protein LOC105435061 [Cucumis sativus]4.0e-8750.82Show/hide
Query:  NEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEGCPAGFV
        +EASNS LSREEELE+E  LKLLNKP IKT++TK+GDIIDCV+INKQPALDHPLLKNHKVQ                                       
Subjt:  NEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEGCPAGFV

Query:  PIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKG-PKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSL
                                                             VVSHSL KG  KYYG  + + VYN+SLS +QSSS NIWIVGG  +SL
Subjt:  PIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKG-PKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSL

Query:  NVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVP
         VLM GW VNP +N + ++R FVYWTADGG  TGCYNM CQ  V VN +  +G+ LLP STY+GQQYDYQFTIIQ  GNWWVLVGEN +G+GYWPKEL+ 
Subjt:  NVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVP

Query:  SLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIASPNLDNT
        +L DGAD++AWGGIA+PS D  SP LG+GHKP  NG++ E CYIRNI  I  A  +    P  DNT
Subjt:  SLSDGADRVAWGGIAKPSKDEESPPLGNGHKP--NGEHKEACYIRNIHYI--ANDHRIASPNLDNT

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]6.9e-14869.25Show/hide
Query:  YSKATWLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNN
        YSKAT LVIV FVCFN K N ASN NLSREE+LE+E QLKLLNKP+IKT++TK+GDIIDCV+INKQPALDHPLLKNHKVQT PS ++SKLFK++SSQSNN
Subjt:  YSKATWLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNN

Query:  DILTSNNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQ-DDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLS
         ILTS NNNNGEGCP GFVPIRRTLKEDLIRL+SLSSN+K +    QSS  P+ DDLS D  YD V+FPY QNVVSHSL+K   Y+GA ARI V+N+SLS
Subjt:  DILTSNNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQ-DDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLS

Query:  SE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNW
           QSSSANIW++GG+D+SLNVLMAGWQVNPA+N ++L R FVYWT D G  TGCYNM+CQ  V VN N+ +GS++LPAS Y+GQQYDYQF+I+QA G+W
Subjt:  SE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNW

Query:  WVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV
        WV VG+NQVG+GYWP EL P+L  GAD+VAWGG A+P+   +ESPPLG+GHKPNG+  EA ++RNI YIA ++ ++ P L+NT  +V
Subjt:  WVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV

TrEMBL top hitse value%identityAlignment
A0A0A0L400 Neprosin domain-containing protein5.4e-8263.95Show/hide
Query:  DDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTG
        DDLS D  YD V+FPY QNVVSHSL+K   Y+GA ARI V+N+SLS   QSSSANIW++GG+D+SLNVLMAGWQVNPA+N ++L R FVYWT D G  TG
Subjt:  DDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTG

Query:  CYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPN
        CYNM+CQ  V VN N+ +GS++LPAS Y+GQQYDYQF+I+QA G+WWV VG+NQVG+GYWP EL P+L  GAD+VAWGG A+P+   +ESPPLG+GHKPN
Subjt:  CYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPN

Query:  GEHKEACYIRNIHYIANDHRIASPNLDNTQFFV
        G+  EA ++RNI YIA ++ ++ P L+NT  +V
Subjt:  GEHKEACYIRNIHYIANDHRIASPNLDNTQFFV

A0A5A7UEV4 Uncharacterized protein5.6e-7965.38Show/hide
Query:  PQDDLSDDFLYDTVKFPYNQNVVSHSLMKGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAK
        PQDD SDDF  D+VK+P NQNVVSHSL KGP KYYG  + + VYN+SLS  QSSS+NIWIVGG  NSL VLM GW VNP +N + ++R FVYWTADGGA 
Subjt:  PQDDLSDDFLYDTVKFPYNQNVVSHSLMKGP-KYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAK

Query:  TGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP
        TGCYNM CQ  V VN +  +G+ L P STY+GQQYDYQFTIIQ  GNWWVLVGEN +G+GYWPKELV +L DGA+++AWGGIAKPS D  SP LG+GHKP
Subjt:  TGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKP

Query:  --NGEHKEACYIRNIHYI--ANDHRIASPNLDNT
          NG++ E CYIRNI  I  A  +    P  DNT
Subjt:  --NGEHKEACYIRNIHYI--ANDHRIASPNLDNT

A0A5D3CJM0 Neprosin 21.7e-13967.99Show/hide
Query:  VLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNS-SQSNNDILTSNNN
        + FVCFN K N ASN NLSREEELE+E QLKLLNKP+IKT++TK+GDIIDCV+INKQPALDHPLLKNHKVQT PS ++SKLFK++S SQSNN ILTS NN
Subjt:  VLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNS-SQSNNDILTSNNN

Query:  NNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSAN
        NNGEGCP GFVPIRRTLKEDLIRL+SLSSN K++    +SS KPQDD S DF +D V+FPY+QNVVSHSL+K   Y+GA ARI VYN+SLS E QSSSAN
Subjt:  NNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSE-QSSSAN

Query:  IWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQV
        IW+VGG D SLNVLMA      A++ +SL R FVYWT D GA TGCYNM+CQ  V VN ++ +GS++LPAS Y+G+QYDYQF+I+QA G+WWV VG++QV
Subjt:  IWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQV

Query:  GVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV
        G+GYWP EL P+L  GA++VAWGG A+PS   +ESPPLG+GHKPNG   EAC++RNI YIA+++ ++ P LDNT  +V
Subjt:  GVGYWPKELVPSLSDGADRVAWGGIAKPS-KDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFV

A0A6J1CVJ6 uncharacterized protein LOC1110147771.4e-9049.2Show/hide
Query:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS
        WL+IVL +  N K + A +SNLSREEELELE QLKLLN+P+I TFQT++GDIIDCV+INKQPALDHP LK+HK+QTRPS Y   L KD+SS  +   +  
Subjt:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS

Query:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS
          NNN   CPAG+VPIRRT+K+DLIR+RSLSS       TS              +   V FPYNQ+VVS ++ KG KYYGA+  + VYNLS++ +QSSS
Subjt:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS

Query:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGEN
        +NIWI+GG   + NV++AGWQVNP +N +SL+RMFVYWT                                                +  GNWW+ VGE+
Subjt:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGEN

Query:  QVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT
           +GYWPKEL   L+DG ++VAWGGIAKPS +  SPPLGNGHKPN  ++ +ACY R ++Y+  +++   P  +NT
Subjt:  QVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNT

A0A6J1CW60 uncharacterized protein LOC1110147751.7e-8043.46Show/hide
Query:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS
        WL+IVL +  N K + A +SNLS EEELE E QLKLLNKP I TFQT++GDIIDCV+INKQPALDHPLLKNHKVQ                         
Subjt:  WLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTS

Query:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS
                                                                           V S ++ +G KYYG    + VYNLS++ +QSSS
Subjt:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSS

Query:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVG
        +NIWI+GG   + NV++ GWQVNP +N +S +RMFVYWTADGG  TG YNM C+  +  N +      L P+STY+G+QYDY FT+ Q    G+WW+ V 
Subjt:  ANIWIVGGTDNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVG

Query:  ENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNTQFFV
        ++Q  +GYWPKEL   L+DGA++VAWGGIAKPS +  SPPLGNGHKPN G+H +ACY R ++YI  ++      ++NT  ++
Subjt:  ENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEESPPLGNGHKPN-GEHKEACYIRNIHYIANDHRIASPNLDNTQFFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)1.6e-5733.25Show/hide
Query:  FKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEG---
        F  + A+ S +S+ ++ E++  L  LNKP +K+ Q+ DGD+IDCV I+KQPA DHP LK+HK+Q +P+ +   LF DN        +++  +N  EG   
Subjt:  FKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEG---

Query:  --------CPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSLSSE-Q
                C  G +P+RRT ++D++R  S+    K++  +    +  + DL             NQ+   H++  ++G KYYGA A I V+   +  + +
Subjt:  --------CPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSLSSE-Q

Query:  SSSANIWIVGGT-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWW
         S + IW++GG+    LN + AGWQV+P L  ++ +R+F YWT+D    TGCYN++C   + +NS++ +G+++ P S Y+  QYD    I +    G+WW
Subjt:  SSSANIWIVGGT-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWW

Query:  VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDE---ESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP
        +  G   V +GYWP  L   L++ A  + WGG    S+ +    S  +G+G  P     +A Y RNI  +   + + +P
Subjt:  VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDE---ESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP

AT2G44240.1 Protein of Unknown Function (DUF239)1.1e-5535.93Show/hide
Query:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEGCPAGFVPI
        A         E++++  LK LNKP +K+ +++DGDIIDCV I  QPA DHPLLKNH +Q +PS ++ +   D++        T      GE CP   +PI
Subjt:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNNNNGEGCPAGFVPI

Query:  RRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSS-EQSSSANIWIVGGTDNSLNV
        RRT KE+++R +SL S  K+          P+D  S ++ ++           +   ++  K+YG  A I V+   +++  + S +  WIV G   S N 
Subjt:  RRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSS-EQSSSANIWIVGGTDNSLNV

Query:  LMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVGYWPKELVP
        + AGWQV P +  N+  R+FVYWT+DG  KTGCYN+VC   V   +   +G + + AS Y G Q      I +    GNWW+ + +N V +GYWP  L  
Subjt:  LMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVGYWPKELVP

Query:  SLSDGADRVAWGG-IAKPSKDEE-SPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP
        SL DGA +V WGG I  P+ D   +  +G+GH      K+A Y++NI  +   + +  P
Subjt:  SLSDGADRVAWGG-IAKPSKDEE-SPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP

AT3G13510.1 Protein of Unknown Function (DUF239)1.1e-5834.04Show/hide
Query:  IVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDN---SSQSNNDILTS
        + L+V  +  C  AS    S  ++ E++  L  LNKP +KT Q+ DGDIIDC+ I+KQPA DHP LK+HK+Q RPS +   LF DN   +     +    
Subjt:  IVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDN---SSQSNNDILTS

Query:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQ
           +    C  G +P+RRT ++D++R  S+    K++  +    +  + DL             NQN   H++  ++G KYYGA A + V+   + ++ +
Subjt:  NNNNNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQ

Query:  SSSANIWIVGGT-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWW
         S + IW++GG+    LN + AGWQV+P L  ++ +R+F YWT+D    TGCYN++C   + +NS++ +G+++ P S Y+  QYD    I +    G+WW
Subjt:  SSSANIWIVGGT-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWW

Query:  VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEES---PPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP
        +  G   V +GYWP  L   L++ A  + WGG    S+ E       +G+GH P     +A Y RNI  +   + + +P
Subjt:  VLVGENQVGVGYWPKELVPSLSDGADRVAWGGIAKPSKDEES---PPLGNGHKPNGEHKEACYIRNIHYIANDHRIASP

AT5G56530.1 Protein of Unknown Function (DUF239)1.7e-5636.19Show/hide
Query:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLF-----KDNSSQSNNDILTSNNNNNGEGCPA
        A   ++SR +  E+   L  LNKP +K+ Q+ DGDIIDCV+I+KQPA DHP LK+HK+Q  PS     LF      +   +S N I T   + NG  C  
Subjt:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLF-----KDNSSQSNNDILTSNNNNNGEGCPA

Query:  GFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQSSSANIWIVGG
        G +P+RRT KED++R  S+    K++  +    R    DL             NQ+   H++  ++G K+YGA A I V+   + SS + S + +WI+GG
Subjt:  GFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQSSSANIWIVGG

Query:  T-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVG
        +    LN + AGWQV+P L  ++ +R+F YWT+D    TGCYN++C   + +NS + +G+++ P S +   QYD   TI +    G+WW+  G+  V +G
Subjt:  T-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVG

Query:  YWPKELVPSLSDGADRVAWGG-IAKPSKD--EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNT
        YWP  L   L+D A  V WGG +    +D    +  +G+G  P+    +A Y RNI  + + + +  P   NT
Subjt:  YWPKELVPSLSDGADRVAWGG-IAKPSKD--EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNT

AT5G56530.2 Protein of Unknown Function (DUF239)1.7e-5636.19Show/hide
Query:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLF-----KDNSSQSNNDILTSNNNNNGEGCPA
        A   ++SR +  E+   L  LNKP +K+ Q+ DGDIIDCV+I+KQPA DHP LK+HK+Q  PS     LF      +   +S N I T   + NG  C  
Subjt:  ASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLF-----KDNSSQSNNDILTSNNNNNGEGCPA

Query:  GFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQSSSANIWIVGG
        G +P+RRT KED++R  S+    K++  +    R    DL             NQ+   H++  ++G K+YGA A I V+   + SS + S + +WI+GG
Subjt:  GFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSL--MKGPKYYGANARIVVYNLSL-SSEQSSSANIWIVGG

Query:  T-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVG
        +    LN + AGWQV+P L  ++ +R+F YWT+D    TGCYN++C   + +NS + +G+++ P S +   QYD   TI +    G+WW+  G+  V +G
Subjt:  T-DNSLNVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQ--AGGNWWVLVGENQVGVG

Query:  YWPKELVPSLSDGADRVAWGG-IAKPSKD--EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNT
        YWP  L   L+D A  V WGG +    +D    +  +G+G  P+    +A Y RNI  + + + +  P   NT
Subjt:  YWPKELVPSLSDGADRVAWGG-IAKPSKD--EESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTATTCCAAAGCAACATGGTTGGTGATAGTGCTTTTTGTATGCTTCAATTTCAAATGCAATGAAGCCTCTAACTCAAATCTTTCAAGAGAAGAAGAATTGGAGTT
AGAAGGACAGCTCAAACTTCTCAACAAGCCCTACATCAAAACATTTCAGACCAAAGATGGAGATATCATTGATTGTGTCAATATCAATAAACAACCAGCTCTCGATCATC
CTTTACTGAAAAATCACAAAGTTCAGACTCGTCCAAGTAGATATCTATCTAAATTGTTCAAAGACAATTCATCTCAATCAAACAATGACATACTCACAAGTAACAACAAC
AATAATGGAGAAGGTTGTCCAGCTGGATTTGTTCCCATTCGAAGAACATTAAAAGAAGATCTAATTAGGTTAAGATCTCTATCATCCAACAACAAGCAAGAATCATCAAC
CTCTCAATCATCAAGAAAGCCACAAGATGATCTATCTGATGATTTTCTTTATGACACTGTCAAATTCCCTTACAATCAAAATGTTGTTTCTCATTCTTTGATGAAAGGTC
CGAAATATTATGGAGCTAATGCACGCATTGTTGTGTACAATTTGAGTTTGAGTTCAGAGCAATCTTCTTCTGCTAACATATGGATAGTTGGTGGCACTGATAACTCTCTT
AATGTTCTTATGGCAGGCTGGCAGGTGAATCCAGCACTAAATGATAATAGTCTATCTAGAATGTTTGTGTATTGGACGGCTGACGGAGGTGCTAAAACGGGATGTTACAA
TATGGTTTGTCAAAGACTCGTACATGTAAATTCAAATGTTGTTTTAGGCTCTACTCTTCTTCCGGCCTCCACTTATAAAGGACAACAATATGACTATCAATTCACTATCA
TTCAGGCTGGAGGGAATTGGTGGGTTCTAGTAGGTGAAAATCAAGTGGGAGTAGGATATTGGCCAAAAGAGTTGGTTCCAAGTTTAAGTGATGGAGCAGATCGAGTTGCA
TGGGGAGGCATTGCAAAGCCTTCAAAAGATGAAGAAAGCCCTCCATTGGGAAATGGCCACAAACCAAATGGTGAACACAAGGAAGCTTGTTACATTAGAAACATACATTA
CATAGCAAATGACCATAGAATTGCATCACCCAATTTGGATAACACCCAATTCTTTGTGTTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGATTATTCCAAAGCAACATGGTTGGTGATAGTGCTTTTTGTATGCTTCAATTTCAAATGCAATGAAGCCTCTAACTCAAATCTTTCAAGAGAAGAAGAATTGGAGTT
AGAAGGACAGCTCAAACTTCTCAACAAGCCCTACATCAAAACATTTCAGACCAAAGATGGAGATATCATTGATTGTGTCAATATCAATAAACAACCAGCTCTCGATCATC
CTTTACTGAAAAATCACAAAGTTCAGACTCGTCCAAGTAGATATCTATCTAAATTGTTCAAAGACAATTCATCTCAATCAAACAATGACATACTCACAAGTAACAACAAC
AATAATGGAGAAGGTTGTCCAGCTGGATTTGTTCCCATTCGAAGAACATTAAAAGAAGATCTAATTAGGTTAAGATCTCTATCATCCAACAACAAGCAAGAATCATCAAC
CTCTCAATCATCAAGAAAGCCACAAGATGATCTATCTGATGATTTTCTTTATGACACTGTCAAATTCCCTTACAATCAAAATGTTGTTTCTCATTCTTTGATGAAAGGTC
CGAAATATTATGGAGCTAATGCACGCATTGTTGTGTACAATTTGAGTTTGAGTTCAGAGCAATCTTCTTCTGCTAACATATGGATAGTTGGTGGCACTGATAACTCTCTT
AATGTTCTTATGGCAGGCTGGCAGGTGAATCCAGCACTAAATGATAATAGTCTATCTAGAATGTTTGTGTATTGGACGGCTGACGGAGGTGCTAAAACGGGATGTTACAA
TATGGTTTGTCAAAGACTCGTACATGTAAATTCAAATGTTGTTTTAGGCTCTACTCTTCTTCCGGCCTCCACTTATAAAGGACAACAATATGACTATCAATTCACTATCA
TTCAGGCTGGAGGGAATTGGTGGGTTCTAGTAGGTGAAAATCAAGTGGGAGTAGGATATTGGCCAAAAGAGTTGGTTCCAAGTTTAAGTGATGGAGCAGATCGAGTTGCA
TGGGGAGGCATTGCAAAGCCTTCAAAAGATGAAGAAAGCCCTCCATTGGGAAATGGCCACAAACCAAATGGTGAACACAAGGAAGCTTGTTACATTAGAAACATACATTA
CATAGCAAATGACCATAGAATTGCATCACCCAATTTGGATAACACCCAATTCTTTGTGTTATGA
Protein sequenceShow/hide protein sequence
MDYSKATWLVIVLFVCFNFKCNEASNSNLSREEELELEGQLKLLNKPYIKTFQTKDGDIIDCVNINKQPALDHPLLKNHKVQTRPSRYLSKLFKDNSSQSNNDILTSNNN
NNGEGCPAGFVPIRRTLKEDLIRLRSLSSNNKQESSTSQSSRKPQDDLSDDFLYDTVKFPYNQNVVSHSLMKGPKYYGANARIVVYNLSLSSEQSSSANIWIVGGTDNSL
NVLMAGWQVNPALNDNSLSRMFVYWTADGGAKTGCYNMVCQRLVHVNSNVVLGSTLLPASTYKGQQYDYQFTIIQAGGNWWVLVGENQVGVGYWPKELVPSLSDGADRVA
WGGIAKPSKDEESPPLGNGHKPNGEHKEACYIRNIHYIANDHRIASPNLDNTQFFVL