; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017324 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017324
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of Unknown Function (DUF239)
Genome locationchr5:2146896..2149295
RNA-Seq ExpressionLag0017324
SyntenyLag0017324
Gene Ontology termsNA
InterPro domainsIPR004314 - Neprosin
IPR025521 - Neprosin activation peptide


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8650029.1 hypothetical protein Csa_011504 [Cucumis sativus]3.1e-8143.34Show/hide
Query:  KLSKAAIPTLSPQQDLELENELKFLNKPPILT------------HQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSK--LSIL
        K+S+A+   LS +++LE+E  LK LNKP I T            +QT+EGD+IDC+DINKQPALDHPLLKNHK+QT PS +     K+ SS++   +S L
Subjt:  KLSKAAIPTLSPQQDLELENELKFLNKPPILT------------HQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSK--LSIL

Query:  EHN----RKSCPSGSVPIRRTQKEDLIRMKSFSN------SPLRP------------ANFDLNYIVVSLSMKEG-QTYYGVTGTMAGYNLSVAHDQASST
          N     + CP+G VPIRRT K+DLIR+KS S+      S + P              F     VVS S+++G + YYG    M+ YN+S++ DQ+SST
Subjt:  EHN----RKSCPSGSVPIRRTQKEDLIRMKSFSN------SPLRP------------ANFDLNYIVVSLSMKEG-QTYYGVTGTMAGYNLSVAHDQASST

Query:  NLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTD---HIVYP----------------------GNWWVELTEAG
        N+W+ GGP   L +++ GW VNP VN D +TR F+YWTADG   TGCYN+ C+GFVQ +   H+  P                      GNWWV L    
Subjt:  NLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTD---HIVYP----------------------GNWWVELTEAG

Query:  RAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKP--DGNFRVGCYIRGITYRNN--QNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFK
          +GY+PKEL+ NL +GA+Q+AWGGIA+   DG+ P LGSG+KP  +G++  GCYIR I   +    N  V P   +   +  + +CY L  +  C G+ 
Subjt:  RAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKP--DGNFRVGCYIRGITYRNN--QNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFK

Query:  QLYYCFTYGGPGG
         + YCFT+GGPGG
Subjt:  QLYYCFTYGGPGG

XP_022145288.1 uncharacterized protein LOC111014777 [Momordica charantia]1.6e-9048.67Show/hide
Query:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK
        M  K  +W+M+V +L+  N K S A    LS +++LELE +LK LN+P I T QTEEGD+IDC+DINKQPALDHP LK+HKIQT+PST+P   +K+SSS 
Subjt:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK

Query:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN--------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS
           S + +N ++CP+G VPIRRT K+DLIR++S S+        S     +F  N  VVS++MK+G  YYG +G+++ YNLSVA DQ+SS+N+W+ GGP 
Subjt:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN--------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS

Query:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGM
        Q  N+ILAGWQVNP +N D LTRMF+YWT    R T                   GNWW+ + E+ + IGY+PKEL  +L++G  QVAWGGIAK   +GM
Subjt:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGM

Query:  KPPLGSGNKPD-GNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGG
         PPLG+G+KP+   +   CY R + Y +  N    P   +   ++ + +CY L N ETCGG +  YYC T+GGPGG
Subjt:  KPPLGSGNKPD-GNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGG

XP_024038072.1 uncharacterized protein LOC18039972 [Citrus clementina]1.8e-9246.53Show/hide
Query:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK
        M++K+ + ++++  ++  + K    A  +LS + DL+LE +LK LNKP I T +T+EGD IDC+DINKQPALDHPLLKNHK+QT+P TFP    K+ SSK
Subjt:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK

Query:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSNSPLR--------PANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS
        SK S + H R+SCP G+VPIRRT KEDLI+  S     +         PA     +  V++ MK+   Y+GV G +  +NL+VA DQ S TN+W+  GP 
Subjt:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSNSPLR--------PANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS

Query:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP--------------------------GNWWVELTEAGRAIGYFPK
         QLN+ILAGW V+PA+N D LTR+F +WTADG+R TGCYN LC GFV T+  + P                          GNWW+ L++    +GY+PK
Subjt:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP--------------------------GNWWVELTEAGRAIGYFPK

Query:  ELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
        EL  +LS GA  VAWGGIA  G++G+ PP+GSG   + +FR  CYIR I Y + QN   +P  + L+QH+    CYGL + + CG  K++YYC  +GG G
Subjt:  ELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG

Query:  GKCG
        G+CG
Subjt:  GKCG

XP_031738649.1 uncharacterized protein LOC116402744 [Cucumis sativus]1.1e-8343.98Show/hide
Query:  MMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSIL---
        +++   VC N K + A+ P LS ++DLE+E +LK LNKP I T++T+EGD+IDC+DINKQPALDHPLLKNHK+QT PS F     KE SS+S   IL   
Subjt:  MMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSIL---

Query:  EHNRKSCPSGSVPIRRTQKEDLIRMKSFSN------SPLRPANFDLN-------------YIVVSLSMKEGQTYYGVTGTMAGYNLSVA-HDQASSTNLW
         +N + CP G VPIRRT KEDLIR+KS S+      S + P + DL+               VVS S+ + Q Y+G    +A +N+S++ + Q+SS N+W
Subjt:  EHNRKSCPSGSVPIRRTQKEDLIRMKSFSN------SPLRPANFDLN-------------YIVVSLSMKEGQTYYGVTGTMAGYNLSVA-HDQASSTNLW

Query:  VTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTD---HI--------VYP--------------GNWWVELTEAGRAI
        V GG    LN+++AGWQVNPAVN D L R F+YWT D    TGCYN+LC+GFV  +   H+        +Y               G+WWV + +    +
Subjt:  VTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTD---HI--------VYP--------------GNWWVELTEAGRAI

Query:  GYFPKELLPNLSNGANQVAWGGIAKGGRDG-MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCF
        GY+P EL PNL  GA+QVAWGG A+    G   PPLGSG+KP+G      ++R I Y     +   P  ++   ++ + +CY L+++E C  F    YCF
Subjt:  GYFPKELLPNLSNGANQVAWGGIAKGGRDG-MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCF

Query:  TYGGPGG
        T+GGPGG
Subjt:  TYGGPGG

XP_039166851.1 uncharacterized protein LOC104443193 [Eucalyptus grandis]7.2e-7845.28Show/hide
Query:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIR
        +S++    +S   D++LE +LK LNKPPI T  TEEGD+IDCIDI+KQPA+DHPLLKNHKIQ KP   P S   ++SS +K  I   +RK CP G+VPI+
Subjt:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIR

Query:  RTQKEDLIRMKSFSNSPLRPANFDLNYIVVSLSMKEGQ---------TY---YGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVN
        R +KEDLIR +S    P       +N + +  +   GQ         TY   YG +G ++ YN+S A DQ SS N+W+  GP   ++MI+AGW+V+P +N
Subjt:  RTQKEDLIRMKSFSNSPLRPANFDLNYIVVSLSMKEGQ---------TY---YGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVN

Query:  RDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVG
         DGLTR+F YWT DG R  GCYN  C+GFVQ    +  GNWW+ + +    +GY+PKEL  NL NG+   AWGG+AK G +G  PP+G+G+ PD      
Subjt:  RDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVG

Query:  CYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKC
         Y R + + N       P K +L + +  P+CY LLN +         Y FT+GGPGG C
Subjt:  CYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKC

TrEMBL top hitse value%identityAlignment
A0A059CIH2 Uncharacterized protein1.3e-7742.64Show/hide
Query:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIR
        +S++    +S   D++LE +LK LNKPPI T  TEEGD+IDCIDI+KQPA+DHPLLKNHKIQ KP   P S   ++SS +K  I   +RK CP G+VPI+
Subjt:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIR

Query:  RTQKEDLIRMKSFSNSPLRPANFDLNYIVVSLSMKEGQ---------TY---YGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVN
        R +KEDLIR +S    P       +N + +  +   GQ         TY   YG +G ++ YN+S A DQ SS N+W+  GP   ++MI+AGW+V+P +N
Subjt:  RTQKEDLIRMKSFSNSPLRPANFDLNYIVVSLSMKEGQ---------TY---YGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVN

Query:  RDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP---------------------------GNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWG
         DGLTR+F YWT DG R  GCYN  C+GFVQ D ++ P                           GNWW+ + +    +GY+PKEL  NL NG+   AWG
Subjt:  RDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP---------------------------GNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWG

Query:  GIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKC
        G+AK G +G  PP+G+G+ PD       Y R + + N       P K +L + +  P+CY LLN +         Y FT+GGPGG C
Subjt:  GIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKC

A0A1R3H4A8 Uncharacterized protein2.2e-7243.57Show/hide
Query:  MEMKLALWVMMVTMLVCS--NSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESS
        M +KL   ++++ +LV S   S +S  ++ +LS ++DLELE +LK LNKPPI T QT  GD+IDCIDINKQPALDHPLLKNHKIQTKP+T P+   K SS
Subjt:  MEMKLALWVMMVTMLVCS--NSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESS

Query:  SKSKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSNSPLRPAN-----FDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGP--
        +++  + L   R +CP+G+VPIRRT KEDLIRMKS    PL  +      F  N+ VV   MKEG  YYGV G ++ YNLSV ++Q SS N+W+  GP  
Subjt:  SKSKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSNSPLRPAN-----FDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGP--

Query:  -SQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP---------------------------GNWWVELTEAGRAIGY
         S QLN+ILAGW  N                      T CYN  C GFV  +  + P                           GNWW  + +    IGY
Subjt:  -SQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYP---------------------------GNWWVELTEAGRAIGY

Query:  FPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVD-VSPQKSDLQQHIGDPNCYG
        +PKELLP LSNGAN VAWGGIA   + G  PP+GSG+ P+ +     Y + I + N      ++P   D+     +  CYG
Subjt:  FPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVD-VSPQKSDLQQHIGDPNCYG

A0A5D3CJM0 Neprosin 23.8e-7742.29Show/hide
Query:  VCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESS-SKSKLSIL---EHNRK
        VC N K + A+ P LS +++LE+E +LK LNKP I T++T+EGD+IDC+DINKQPALDHPLLKNHK+QT PS F     KE S S+S   IL    +N +
Subjt:  VCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESS-SKSKLSIL---EHNRK

Query:  SCPSGSVPIRRTQKEDLIRMKSFSN------SPLRP------------ANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVA-HDQASSTNLWVTGGPS
         CP G VPIRRT KEDLIR+KS S+      S ++P              F  +  VVS S+ +   Y+G    +A YN+S++  +Q+SS N+WV GGP 
Subjt:  SCPSGSVPIRRTQKEDLIRMKSFSN------SPLRP------------ANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVA-HDQASSTNLWVTGGPS

Query:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFV-------------------------QTDHIVYPGNWWVELTEAGRAIGYFPKE
        + LN+++A      AV+ D L R F+YWT D    TGCYN+LC+GFV                         Q   +   G+WWV + +    +GY+P E
Subjt:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFV-------------------------QTDHIVYPGNWWVELTEAGRAIGYFPKE

Query:  LLPNLSNGANQVAWGGIAKGG-RDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
        L PNL  GA QVAWGG A+        PPLGSG+KP+G     C++R I Y  +  +   P   +   ++   +CY L+++E C  F    YCFT+GGPG
Subjt:  LLPNLSNGANQVAWGGIAKGG-RDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG

Query:  GK
        G+
Subjt:  GK

A0A6J1CVJ6 uncharacterized protein LOC1110147778.0e-9148.67Show/hide
Query:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK
        M  K  +W+M+V +L+  N K S A    LS +++LELE +LK LN+P I T QTEEGD+IDC+DINKQPALDHP LK+HKIQT+PST+P   +K+SSS 
Subjt:  MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSK

Query:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN--------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS
           S + +N ++CP+G VPIRRT K+DLIR++S S+        S     +F  N  VVS++MK+G  YYG +G+++ YNLSVA DQ+SS+N+W+ GGP 
Subjt:  SKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN--------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPS

Query:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGM
        Q  N+ILAGWQVNP +N D LTRMF+YWT    R T                   GNWW+ + E+ + IGY+PKEL  +L++G  QVAWGGIAK   +GM
Subjt:  QQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGM

Query:  KPPLGSGNKPD-GNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGG
         PPLG+G+KP+   +   CY R + Y +  N    P   +   ++ + +CY L N ETCGG +  YYC T+GGPGG
Subjt:  KPPLGSGNKPD-GNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGG

V4SWW2 Uncharacterized protein1.1e-7141.58Show/hide
Query:  TLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIRRTQKEDL
        +LS + DL+LE +LK LNKP I T +T+EGD IDC+DINKQPALDHPLLKNHK+QT                                            
Subjt:  TLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNRKSCPSGSVPIRRTQKEDL

Query:  IRMKSFSNSPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGT
                              V++ MK+   Y+GV G +  +NL+VA DQ S TN+W+  GP  QLN+ILAGW V+PA+N D LTR+F +WTADG+R T
Subjt:  IRMKSFSNSPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGT

Query:  GCYNLLCRGFVQTDHIVYP--------------------------GNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKP
        GCYN LC GFV T+  + P                          GNWW+ L++    +GY+PKEL  +LS GA  VAWGGIA  G++G+ PP+GSG   
Subjt:  GCYNLLCRGFVQTDHIVYP--------------------------GNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKP

Query:  DGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKCG
        + +FR  CYIR I Y + QN   +P  + L+QH+    CYGL + + CG  K++YYC  +GG GG+CG
Subjt:  DGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKCG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G55360.1 Protein of Unknown Function (DUF239)2.2e-4832.49Show/hide
Query:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQ--------SETKESSSKSKLSILEHNRKSC
        LS AA   +S +Q  E++  L  LNKP + + Q+ +GDVIDC+ I+KQPA DHP LK+HKIQ KP+  P+        S  K +  +  +  L H    C
Subjt:  LSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQ--------SETKESSSKSKLSILEHNRKSC

Query:  PSGSVPIRRTQKEDLIRMKSFS-------NSPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMIL
          G++P+RRT+++D++R  S          S   P + + + I  S     ++  EG  YYG   T+  +   +   ++ S + +W+ GG   Q LN I 
Subjt:  PSGSVPIRRTQKEDLIRMKSFS-------NSPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMIL

Query:  AGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNL
        AGWQV+P +  D  TR+F YWT+D  + TGCYNLLC GF+Q +  +                             G+WW++    G  +GY+P  L   L
Subjt:  AGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNL

Query:  SNGANQVAWGG-IAKGGRDGM--KPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
        +  A+ + WGG +     DG      +GSG  P+  F    Y R I   +  N   +P+   L       NCY    D   G      + F YGGPG
Subjt:  SNGANQVAWGG-IAKGGRDGM--KPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG

AT2G44220.1 Protein of Unknown Function (DUF239)2.3e-5035.54Show/hide
Query:  QDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSK-LSILEHNRKSCPSGSVPIRRTQKEDLIRMK
        +++++   LK  NKP + + ++E+GDVIDC+ I  QPA DH LLKNH IQ +PS +P S++  +  ++K ++ + H    CP  +VPIRRT+KEDL+R K
Subjt:  QDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSK-LSILEHNRKSCPSGSVPIRRTQKEDLIRMK

Query:  SFSN-------SPLRPANFD--LNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQA----SSTNLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLY
        S  +       S  R   FD  L +    + ++ G+ Y    GT    NL   + Q     S    WV  G    LN I AGWQV P +  D   R F+Y
Subjt:  SFSN-------SPLRPANFD--LNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQA----SSTNLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLY

Query:  WTADGSRGTGCYNLLCRGFVQTDH-----------------------IVY----PGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG---IAKGGR
        WT DG R TGCYNLLC GFVQT +                       +++     GNWW+ + E    IGY+P  L  +L   A +V WGG    +K G 
Subjt:  WTADGSRGTGCYNLLCRGFVQTDH-----------------------IVY----PGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG---IAKGGR

Query:  DGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
              +GSG+  D  F+   Y R +   +  N    PQ   L       NCY   N +T  G       F YGGPG
Subjt:  DGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG

AT3G13510.1 Protein of Unknown Function (DUF239)1.5e-4931.48Show/hide
Query:  LALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQS-------ETKES
        + LWVM+          LS AA    S +Q  E++  L  LNKPP+ T Q+ +GD+IDCI I+KQPA DHP LK+HKIQ +PS  P+          +  
Subjt:  LALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQS-------ETKES

Query:  SSKSKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN------------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAH-DQASSTN
          ++ +  L H    C  G++P+RRT+++D++R  S                   P   + N    +++  EG  YYG   T+  +   + + ++ S + 
Subjt:  SSKSKLSILEHNRKSCPSGSVPIRRTQKEDLIRMKSFSN------------SPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAH-DQASSTN

Query:  LWVTGGP-SQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTE
        +W+ GG   Q LN I AGWQV+P +  D  TR+F YWT+D  + TGCYNLLC GF+Q +  +                             G+WW++   
Subjt:  LWVTGGP-SQQLNMILAGWQVNPAVNRDGLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTE

Query:  AGRAIGYFPKELLPNLSNGANQVAWGG-IAKGGRDGMK--PPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGF
         G  +GY+P  L   L+  A+ + WGG +     +G      +GSG+ P+  F    Y R I   +  N   +P+   L       NCY    D   G  
Subjt:  AGRAIGYFPKELLPNLSNGANQVAWGG-IAKGGRDGMK--PPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGF

Query:  KQLYYCFTYGGPG
            + F YGGPG
Subjt:  KQLYYCFTYGGPG

AT5G56530.1 Protein of Unknown Function (DUF239)5.2e-5034.81Show/hide
Query:  QQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKES--SSKSKLSI-----LEHNRKSCPSGSVPIRRTQK
        +Q+ E+   L  LNKP + + Q+ +GD+IDC+ I+KQPA DHP LK+HKIQ  PS  P+S   ES  S K K S+     L H    C  G++P+RRT+K
Subjt:  QQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKES--SSKSKLSI-----LEHNRKSCPSGSVPIRRTQK

Query:  EDLIRMKSFSN-------SPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMILAGWQVNPAVNRD
        ED++R  S          S   P + D + I  S     ++  EG  +YG   T+  +   V + ++ S + LW+ GG   Q LN I AGWQV+P +  D
Subjt:  EDLIRMKSFSN-------SPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMILAGWQVNPAVNRD

Query:  GLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG-
          TR+F YWT+D  + TGCYNLLC GF+Q +  +                             G+WW++  + G  +GY+P  L   L++ A+ V WGG 
Subjt:  GLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG-

Query:  IAKGGRDG--MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
        +     DG      +GSG  PD  F    Y R I   ++ N    P+   L       NCY    D   G      + F YGGPG
Subjt:  IAKGGRDG--MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG

AT5G56530.2 Protein of Unknown Function (DUF239)5.2e-5034.81Show/hide
Query:  QQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKES--SSKSKLSI-----LEHNRKSCPSGSVPIRRTQK
        +Q+ E+   L  LNKP + + Q+ +GD+IDC+ I+KQPA DHP LK+HKIQ  PS  P+S   ES  S K K S+     L H    C  G++P+RRT+K
Subjt:  QQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKES--SSKSKLSI-----LEHNRKSCPSGSVPIRRTQK

Query:  EDLIRMKSFSN-------SPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMILAGWQVNPAVNRD
        ED++R  S          S   P + D + I  S     ++  EG  +YG   T+  +   V + ++ S + LW+ GG   Q LN I AGWQV+P +  D
Subjt:  EDLIRMKSFSN-------SPLRPANFDLNYIVVS-----LSMKEGQTYYGVTGTMAGYNLSV-AHDQASSTNLWVTGGP-SQQLNMILAGWQVNPAVNRD

Query:  GLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG-
          TR+F YWT+D  + TGCYNLLC GF+Q +  +                             G+WW++  + G  +GY+P  L   L++ A+ V WGG 
Subjt:  GLTRMFLYWTADGSRGTGCYNLLCRGFVQTDHIV---------------------------YPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGG-

Query:  IAKGGRDG--MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG
        +     DG      +GSG  PD  F    Y R I   ++ N    P+   L       NCY    D   G      + F YGGPG
Subjt:  IAKGGRDG--MKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKSDLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATGAAGTTAGCTTTGTGGGTGATGATGGTGACAATGTTAGTTTGTTCAAACAGCAAACTCAGCAAGGCAGCCATTCCAACCCTTTCTCCACAACAAGACTTGGA
ATTGGAAAATGAACTCAAATTCCTCAATAAGCCTCCCATTCTAACACATCAAACCGAGGAAGGTGACGTCATTGATTGTATCGACATCAATAAACAACCTGCTCTCGATC
ATCCATTGCTAAAAAATCACAAGATTCAGACCAAACCCTCCACATTTCCTCAAAGTGAGACCAAGGAGAGTTCATCAAAATCAAAGCTTTCAATTCTTGAGCACAACAGA
AAGTCTTGTCCAAGTGGATCAGTTCCCATTAGAAGAACTCAAAAGGAAGATTTGATAAGAATGAAGTCTTTTTCAAATTCGCCATTGCGTCCGGCCAACTTTGACTTGAA
CTACATTGTGGTGTCTCTTTCAATGAAAGAAGGTCAGACATACTATGGAGTAACGGGAACCATGGCGGGTTACAATTTAAGTGTTGCTCATGACCAAGCTTCCTCTACAA
ACTTGTGGGTCACTGGTGGCCCTTCACAACAGCTTAATATGATTCTAGCTGGGTGGCAGGTGAATCCAGCGGTGAATCGTGATGGTCTCACAAGAATGTTCTTGTATTGG
ACGGCCGATGGGTCTCGAGGAACAGGGTGTTACAATTTGCTTTGTCGAGGCTTTGTCCAAACTGATCACATAGTATATCCAGGAAATTGGTGGGTTGAGCTTACAGAAGC
AGGGAGAGCGATTGGATATTTTCCAAAAGAGTTGTTGCCAAATCTAAGTAATGGGGCAAATCAAGTAGCTTGGGGAGGCATTGCAAAGGGAGGGAGAGATGGAATGAAGC
CTCCATTGGGGAGTGGTAATAAACCAGATGGCAACTTTAGGGTTGGATGTTACATAAGAGGGATTACATATAGGAATAATCAAAATGTGGATGTGTCTCCACAAAAATCT
GACCTACAACAACACATTGGAGACCCTAACTGTTATGGTTTGCTAAATGATGAAACTTGTGGGGGTTTTAAACAATTGTATTATTGCTTCACTTATGGTGGACCAGGTGG
CAAATGTGGGGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATGAAGTTAGCTTTGTGGGTGATGATGGTGACAATGTTAGTTTGTTCAAACAGCAAACTCAGCAAGGCAGCCATTCCAACCCTTTCTCCACAACAAGACTTGGA
ATTGGAAAATGAACTCAAATTCCTCAATAAGCCTCCCATTCTAACACATCAAACCGAGGAAGGTGACGTCATTGATTGTATCGACATCAATAAACAACCTGCTCTCGATC
ATCCATTGCTAAAAAATCACAAGATTCAGACCAAACCCTCCACATTTCCTCAAAGTGAGACCAAGGAGAGTTCATCAAAATCAAAGCTTTCAATTCTTGAGCACAACAGA
AAGTCTTGTCCAAGTGGATCAGTTCCCATTAGAAGAACTCAAAAGGAAGATTTGATAAGAATGAAGTCTTTTTCAAATTCGCCATTGCGTCCGGCCAACTTTGACTTGAA
CTACATTGTGGTGTCTCTTTCAATGAAAGAAGGTCAGACATACTATGGAGTAACGGGAACCATGGCGGGTTACAATTTAAGTGTTGCTCATGACCAAGCTTCCTCTACAA
ACTTGTGGGTCACTGGTGGCCCTTCACAACAGCTTAATATGATTCTAGCTGGGTGGCAGGTGAATCCAGCGGTGAATCGTGATGGTCTCACAAGAATGTTCTTGTATTGG
ACGGCCGATGGGTCTCGAGGAACAGGGTGTTACAATTTGCTTTGTCGAGGCTTTGTCCAAACTGATCACATAGTATATCCAGGAAATTGGTGGGTTGAGCTTACAGAAGC
AGGGAGAGCGATTGGATATTTTCCAAAAGAGTTGTTGCCAAATCTAAGTAATGGGGCAAATCAAGTAGCTTGGGGAGGCATTGCAAAGGGAGGGAGAGATGGAATGAAGC
CTCCATTGGGGAGTGGTAATAAACCAGATGGCAACTTTAGGGTTGGATGTTACATAAGAGGGATTACATATAGGAATAATCAAAATGTGGATGTGTCTCCACAAAAATCT
GACCTACAACAACACATTGGAGACCCTAACTGTTATGGTTTGCTAAATGATGAAACTTGTGGGGGTTTTAAACAATTGTATTATTGCTTCACTTATGGTGGACCAGGTGG
CAAATGTGGGGTTTAA
Protein sequenceShow/hide protein sequence
MEMKLALWVMMVTMLVCSNSKLSKAAIPTLSPQQDLELENELKFLNKPPILTHQTEEGDVIDCIDINKQPALDHPLLKNHKIQTKPSTFPQSETKESSSKSKLSILEHNR
KSCPSGSVPIRRTQKEDLIRMKSFSNSPLRPANFDLNYIVVSLSMKEGQTYYGVTGTMAGYNLSVAHDQASSTNLWVTGGPSQQLNMILAGWQVNPAVNRDGLTRMFLYW
TADGSRGTGCYNLLCRGFVQTDHIVYPGNWWVELTEAGRAIGYFPKELLPNLSNGANQVAWGGIAKGGRDGMKPPLGSGNKPDGNFRVGCYIRGITYRNNQNVDVSPQKS
DLQQHIGDPNCYGLLNDETCGGFKQLYYCFTYGGPGGKCGV