; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0018251 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0018251
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr5:20514233..20515038
RNA-Seq ExpressionLag0018251
SyntenyLag0018251
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TXG51459.1 hypothetical protein EZV62_023983 [Acer yangbiense]9.8e-3033.16Show/hide
Query:  SDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDKFLLVLEMVD-AL
        +DE+    ++ +    D + ++  CL GK+LS + +  E  +  +   W     +++E +G N+F+F F   V+RNR+++ GPW+FDK L+VLE  +  +
Subjt:  SDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDKFLLVLEMVD-AL

Query:  VDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLR
        +D    +  CWG  ++V+++I I+KPL+R + + +D       + +KYERL +FC  CG +GHA K+C    + + A    + +YGSW+R
Subjt:  VDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLR

XP_022132681.1 uncharacterized protein LOC111005481 [Momordica charantia]1.3e-4238.8Show/hide
Query:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGP
        M A  LLE  +  +LTS+E+ +AVD+D  A+E T   +   L  KLLS R I C +L+ TL I W++D     V+ +G N+FLF+F +  DRNR+ R GP
Subjt:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGP

Query:  WFFDKFLLVLEMVDAL------------------------------------------VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCW
        W FD+ L++++   +L                                          V+ NA   CWG  LRVR+R  + KPL RGI +N+DGP+GGCW
Subjt:  WFFDKFLLVLEMVDAL------------------------------------------VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCW

Query:  IPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDG
        IP++YERL +F   CG L H +KDC     D   + S N +YG WLR  G
Subjt:  IPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDG

XP_022156185.1 uncharacterized protein LOC111023135 [Momordica charantia]3.2e-4940.82Show/hide
Query:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPW
        MD E LL   ++ +LTS+E+++A+DVD  A++  +  +   L GKLL+ R I  ++L R L + W+V+  L VE +GKN+FLF F +  D NRV ++GPW
Subjt:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPW

Query:  FFDKFLLVLE--------------------------------------------MVDALVDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGC
        FFDK L+VL+                                             VD  VDCN  G  WG SLR+R+ I ITKPLRRGI INIDGP+GGC
Subjt:  FFDKFLLVLE--------------------------------------------MVDALVDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGC

Query:  WIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGSGR
        WIP++YERL +FC  CGV+GH+  DC            + +EYG WLR  G    A +G K     R
Subjt:  WIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGSGR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.0e-3935.48Show/hide
Query:  LLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDK
        LLE  +  +LTS+EE+ A+DVD  A   T  ++   L GKL   RPI C +++ T+   W+++ +  +V+ LG N+FLFSF + +DRN++++SGPW FD+
Subjt:  LLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDK

Query:  FLLVLEMVDALV------------------------------------------DCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKY
         L+++    AL+                                          DC+     WG +LRVR+ + I+KPLRRGI +N+DGPIGG WIP++Y
Subjt:  FLLVLEMVDALV------------------------------------------DCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKY

Query:  ERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVK
        ERL +FC  CG+                ++    ++YGSWLR  G VK
Subjt:  ERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVK

XP_028122006.1 uncharacterized protein LOC114319195 [Camellia sinensis]9.8e-3032.71Show/hide
Query:  AEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQM---RCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGP
        A++LL+  R L LTS+E+ V     ++  E T + M     CL GKLL+ RP   E ++ TL   W+   G+QV  +G N+F+F F  +VD+ RV   GP
Subjt:  AEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQM---RCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGP

Query:  WFFDKFLLVLEMVDALVD---------------CNA---------------------------AGGCWGISLRVRIRIYITKPLRRGININIDG--PIGG
        W FDK LL+L  +D  V                CN                             G  WG ++R+R+ + + KPLRRG+ + +    PI  
Subjt:  WFFDKFLLVLEMVDALVD---------------CNA---------------------------AGGCWGISLRVRIRIYITKPLRRGININIDG--PIGG

Query:  CWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGS
         W+  KYERL  +C  CG LGH+ ++C   LS    +   + +YG+WLR+D    +  +G +  GS
Subjt:  CWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGS

TrEMBL top hitse value%identityAlignment
A0A5C7H421 CCHC-type domain-containing protein4.8e-3033.16Show/hide
Query:  SDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDKFLLVLEMVD-AL
        +DE+    ++ +    D + ++  CL GK+LS + +  E  +  +   W     +++E +G N+F+F F   V+RNR+++ GPW+FDK L+VLE  +  +
Subjt:  SDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDKFLLVLEMVD-AL

Query:  VDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLR
        +D    +  CWG  ++V+++I I+KPL+R + + +D       + +KYERL +FC  CG +GHA K+C    + + A    + +YGSW+R
Subjt:  VDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLR

A0A5C7IV71 CCHC-type domain-containing protein6.9e-2931.6Show/hide
Query:  DAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWF
        +A  L   C  L +  DE+ V   + +      +  +  CL GK+LS + +  +  +  +   W     +++E +G+NVFLF F    DRNRV+  GPW 
Subjt:  DAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWF

Query:  FDKFLLVL---------EMVDALVDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGA
        FDK LLVL         E ++ +++    +  CWG  LRV++RI I+KPL+R + +++D       + +KYERL EFC  CG +GH + +C  + +   A
Subjt:  FDKFLLVL---------EMVDALVDC-NAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGA

Query:  TPSSNNEYGSWLRVDGLVKVADEGIKALGSG
           +   +GSW+R   + K  D+    +  G
Subjt:  TPSSNNEYGSWLRVDGLVKVADEGIKALGSG

A0A6J1BSZ1 uncharacterized protein LOC1110054816.4e-4338.8Show/hide
Query:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGP
        M A  LLE  +  +LTS+E+ +AVD+D  A+E T   +   L  KLLS R I C +L+ TL I W++D     V+ +G N+FLF+F +  DRNR+ R GP
Subjt:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGP

Query:  WFFDKFLLVLEMVDAL------------------------------------------VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCW
        W FD+ L++++   +L                                          V+ NA   CWG  LRVR+R  + KPL RGI +N+DGP+GGCW
Subjt:  WFFDKFLLVLEMVDAL------------------------------------------VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCW

Query:  IPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDG
        IP++YERL +F   CG L H +KDC     D   + S N +YG WLR  G
Subjt:  IPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDG

A0A6J1DU55 uncharacterized protein LOC1110231351.6e-4940.82Show/hide
Query:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPW
        MD E LL   ++ +LTS+E+++A+DVD  A++  +  +   L GKLL+ R I  ++L R L + W+V+  L VE +GKN+FLF F +  D NRV ++GPW
Subjt:  MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPW

Query:  FFDKFLLVLE--------------------------------------------MVDALVDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGC
        FFDK L+VL+                                             VD  VDCN  G  WG SLR+R+ I ITKPLRRGI INIDGP+GGC
Subjt:  FFDKFLLVLE--------------------------------------------MVDALVDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGC

Query:  WIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGSGR
        WIP++YERL +FC  CGV+GH+  DC            + +EYG WLR  G    A +G K     R
Subjt:  WIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALGSGR

A0A6J1DX30 uncharacterized protein LOC1110248741.5e-3935.48Show/hide
Query:  LLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDK
        LLE  +  +LTS+EE+ A+DVD  A   T  ++   L GKL   RPI C +++ T+   W+++ +  +V+ LG N+FLFSF + +DRN++++SGPW FD+
Subjt:  LLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVD-DGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDK

Query:  FLLVLEMVDALV------------------------------------------DCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKY
         L+++    AL+                                          DC+     WG +LRVR+ + I+KPLRRGI +N+DGPIGG WIP++Y
Subjt:  FLLVLEMVDALV------------------------------------------DCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKY

Query:  ERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVK
        ERL +FC  CG+                ++    ++YGSWLR  G VK
Subjt:  ERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding8.9e-0535.29Show/hide
Query:  VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCP
        VD N      G   RV I + + KPL+  + IN D         + YE L + CS CG+ GH +  CP
Subjt:  VDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCP

AT3G42140.1 zinc ion binding;nucleic acid binding3.4e-0428.87Show/hide
Query:  VFRSGPWFFDKFLLVLEMVDAL-VDCNAAG-GCW----GISLR-VRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCP
        + R GPW F+ ++ V++    L  D        W    GI LR +  RI  +   R G+ +  +       +  +YE+L  FC+ CG+L H   +CP
Subjt:  VFRSGPWFFDKFLLVLEMVDAL-VDCNAAG-GCW----GISLR-VRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGCGGAGGCTTTGCTGGAGGGTTGTCGCCGGCTACGTCTGACGTCTGATGAGGAAGACGTAGCTGTGGACGTGGATAAGGTTGCGATTGAAGACACGAAGGTTCA
GATGCGATGTTGTCTGGCCGGGAAACTCCTTAGTCCGCGACCGATTGGTTGTGAGATTTTGCGGCGAACGTTGTCCATCACTTGGCGTGTTGATGATGGTTTGCAGGTTG
AGCGCTTGGGTAAGAATGTTTTTCTGTTTTCTTTTGTTAAGATGGTCGATCGTAATCGTGTCTTCCGCTCTGGCCCATGGTTCTTCGACAAATTTCTCTTAGTGTTGGAA
ATGGTGGATGCTCTGGTGGATTGTAATGCTGCCGGTGGTTGCTGGGGTATCAGTCTGCGCGTTCGTATTAGAATTTACATCACTAAACCTCTCCGTAGGGGTATTAATAT
TAATATTGATGGACCTATTGGGGGTTGCTGGATTCCCATGAAATATGAAAGGCTCCTGGAGTTCTGTTCTCGATGTGGTGTATTGGGTCATGCCATGAAGGACTGTCCTT
TGCTTTTGTCTGATGTGGGTGCTACGCCGAGTAGCAATAATGAATATGGTTCATGGCTCAGAGTTGATGGTCTTGTGAAGGTTGCTGATGAAGGGATAAAAGCCCTTGGC
AGCGGAAGACATCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATGCGGAGGCTTTGCTGGAGGGTTGTCGCCGGCTACGTCTGACGTCTGATGAGGAAGACGTAGCTGTGGACGTGGATAAGGTTGCGATTGAAGACACGAAGGTTCA
GATGCGATGTTGTCTGGCCGGGAAACTCCTTAGTCCGCGACCGATTGGTTGTGAGATTTTGCGGCGAACGTTGTCCATCACTTGGCGTGTTGATGATGGTTTGCAGGTTG
AGCGCTTGGGTAAGAATGTTTTTCTGTTTTCTTTTGTTAAGATGGTCGATCGTAATCGTGTCTTCCGCTCTGGCCCATGGTTCTTCGACAAATTTCTCTTAGTGTTGGAA
ATGGTGGATGCTCTGGTGGATTGTAATGCTGCCGGTGGTTGCTGGGGTATCAGTCTGCGCGTTCGTATTAGAATTTACATCACTAAACCTCTCCGTAGGGGTATTAATAT
TAATATTGATGGACCTATTGGGGGTTGCTGGATTCCCATGAAATATGAAAGGCTCCTGGAGTTCTGTTCTCGATGTGGTGTATTGGGTCATGCCATGAAGGACTGTCCTT
TGCTTTTGTCTGATGTGGGTGCTACGCCGAGTAGCAATAATGAATATGGTTCATGGCTCAGAGTTGATGGTCTTGTGAAGGTTGCTGATGAAGGGATAAAAGCCCTTGGC
AGCGGAAGACATCAGTGA
Protein sequenceShow/hide protein sequence
MDAEALLEGCRRLRLTSDEEDVAVDVDKVAIEDTKVQMRCCLAGKLLSPRPIGCEILRRTLSITWRVDDGLQVERLGKNVFLFSFVKMVDRNRVFRSGPWFFDKFLLVLE
MVDALVDCNAAGGCWGISLRVRIRIYITKPLRRGININIDGPIGGCWIPMKYERLLEFCSRCGVLGHAMKDCPLLLSDVGATPSSNNEYGSWLRVDGLVKVADEGIKALG
SGRHQ