CuGenDBv2

Lag0017733 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0017733
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: CCHC-type domain-containing protein
Genome location: chr5:8026822..8028063
RNA-Seq Expression: Lag0017733
Synteny: Lag0017733
Gene Ontology terms:
GO:0003824 - catalytic activity (molecular function)
GO:0005488 - binding (molecular function)
InterPro domains: NA
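
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"Unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("chr5:8026822..8028063")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)  # chr5 8026822 8028063 1242
```

Under that assumption the gene spans 1242 bp on chr5.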


Homology
GenBank top hits (e value, % identity, alignment)
KAF4364765.1 hypothetical protein F8388_018441 [Cannabis sativa] (e value: 1.1e-11; % identity: 30.84)
Query:  RGQKEGQVRESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCW-------
        RG K G    S   +  K     SGGL+LLW ++ +VS+ S+S GHIDA +Q  G   WRFTG YGNP      E+WRL+ RL +  D+   W       
Subjt:  RGQKEGQVRESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCW-------

Query:  -VVTLVRLHMGMRKSVFKVHHL-------ALVATDHRPLLAEWKEEPPDINHNWSRRHYEGSI-RGAINHK---------EVEIQHLLRLGDDL-RDNEL
         V+++     G  +S+  +          +L     +     W      IN      H +  + R   N +         + ++  LL +   L R  E+
Subjt:  -VVTLVRLHMGMRKSVFKVHHL-------ALVATDHRPLLAEWKEEPPDINHNWSRRHYEGSI-RGAINHK---------EVEIQHLLRLGDDL-RDNEL

Query:  KEKENELENLLEDDEIYWKQRTREDWL
        K  E +L +LL  +E YWK R+R DWL
Subjt:  KEKENELENLLEDDEIYWKQRTREDWL

KAF4381998.1 hypothetical protein G4B88_006630 [Cannabis sativa] (e value: 4.8e-10; % identity: 45.83)
Query:  KHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDI
        ++  E SGGL+LLW ++ +VS+ S++ GHIDA ++  G   WRFTG YGNP      E+WRL+ RL +  D+
Subjt:  KHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDI

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] (e value: 2.5e-14; % identity: 31.73)
Query:  GGLMLLWRNETKVSIHSYSKGHIDATI-QEGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRG-CW-----VVTLVRLHMGMRKSVFKVHHLALVA
        GGL LLW  +  + + SYSK HIDA I  E G SWR T +YG+P       TW L+ RLA  S +   C+     +  L     G  ++  +VH      
Subjt:  GGLMLLWRNETKVSIHSYSKGHIDATI-QEGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRG-CW-----VVTLVRLHMGMRKSVFKVHHLALVA

Query:  TDHRPLLAEWKEEPPDINHNWSRRHYEGSI-----------------RGAINHKEVEIQHLLRLGDDLR--------DNELKEKENELENLLEDDEIYWK
         D R L    K  P      WS R  E  I                 +     ++ +++ L      +R         +ELK+ EN+++N+L+D+EI+WK
Subjt:  TDHRPLLAEWKEEPPDINHNWSRRHYEGSI-----------------RGAINHKEVEIQHLLRLGDDLR--------DNELKEKENELENLLEDDEIYWK

Query:  QRTREDWL
        QR+R DWL
Subjt:  QRTREDWL

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia] (e value: 9.4e-14; % identity: 36.55)
Query:  TEESGGLMLLWRNETKVSIHSYSKGHIDATIQEGGWSWRFTGIYGNPVRGLHHETWRLMTRLA----------------------------EQSDIRGCW
        T +SGGLMLLW +++ V I S S GHID+ I +   SWRFTG YGNP       +W+L+ RLA                             +S +RGC 
Subjt:  TEESGGLMLLWRNETKVSIHSYSKGHIDATIQEGGWSWRFTGIYGNPVRGLHHETWRLMTRLA----------------------------EQSDIRGCW

Query:  VVT-----LVRLHMGMRKSVFKVHHLALVATDHRPLLAEWKEEPP
        +       L+   M  +    KV HL L+++DHRP+LA W  E P
Subjt:  VVT-----LVRLHMGMRKSVFKVHHLALVATDHRPLLAEWKEEPP

XP_028113864.1 uncharacterized protein LOC114311894 [Camellia sinensis] (e value: 1.4e-14; % identity: 30.39)
Query:  TKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATI--QEGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCWVVTLVRLHMGMRKSVFKVHHLA
        T  + E SGGL LLW  E ++ I SYSKGH+D+ I  + G  SW+FTG YGNP   L  ++W L+ RL +Q ++   WV        G    +   H  +
Subjt:  TKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATI--QEGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCWVVTLVRLHMGMRKSVFKVHHLA

Query:  LVATDHRPLLAEWKE-----EPPDINHNWSRRHYE-----------GSIRGAINHKEVEIQHL--LRLGDDLRDNELKEKENELENLLEDDEIYWKQRTR
         +A   +  +  +++     +  D++ +    H             G+++ ++N K   IQ L  L +  D    EL+  + E++ LLE + + W QR R
Subjt:  LVATDHRPLLAEWKE-----EPPDINHNWSRRHYE-----------GSIRGAINHKEVEIQHL--LRLGDDLRDNELKEKENELENLLEDDEIYWKQRTR

Query:  EDWL
         +WL
Subjt:  EDWL

TrEMBL top hits (e value, % identity, alignment)
A0A2N9ER53 Uncharacterized protein (e value: 2.8e-11; % identity: 30.7)
Query:  GGLMLLWRNETKVSIHSYSKGHIDA-TIQEGGWSWRFTGIYGNPVRGLHHETWRLM--------------------TRLAEQSDIRGCWVVTLVRLHMGM
        GGL LLW +   V I SYS  HIDA  + E G  WR TG YG+P RGL   +W L+                    T L EQ       +  +  L +  
Subjt:  GGLMLLWRNETKVSIHSYSKGHIDA-TIQEGGWSWRFTGIYGNPVRGLHHETWRLM--------------------TRLAEQSDIRGCWVVTLVRLHMGM

Query:  RK--SVF---KVHHLALVATDHRPLLAEWKEEPPDINHNWSRR---HYE-----GSIRGA---INHKEVEIQHLLRLG-DDLRDNELKEKENELENLLED
         +  S+F   +VHH+ + A D   LL     +P    +N  ++   H+E      ++R     I  K+  ++ L  L  D+ + NE+ +   E+  L E 
Subjt:  RK--SVF---KVHHLALVATDHRPLLAEWKEEPPDINHNWSRR---HYE-----GSIRGA---INHKEVEIQHLLRLG-DDLRDNELKEKENELENLLED

Query:  DEIYWKQRTREDWLT
        +EI+W+QR+R  WL+
Subjt:  DEIYWKQRTREDWLT

A0A6J1DUG8 uncharacterized protein LOC111024135 (e value: 4.5e-14; % identity: 36.55)
Query:  TEESGGLMLLWRNETKVSIHSYSKGHIDATIQEGGWSWRFTGIYGNPVRGLHHETWRLMTRLA----------------------------EQSDIRGCW
        T +SGGLMLLW +++ V I S S GHID+ I +   SWRFTG YGNP       +W+L+ RLA                             +S +RGC 
Subjt:  TEESGGLMLLWRNETKVSIHSYSKGHIDATIQEGGWSWRFTGIYGNPVRGLHHETWRLMTRLA----------------------------EQSDIRGCW

Query:  VVT-----LVRLHMGMRKSVFKVHHLALVATDHRPLLAEWKEEPP
        +       L+   M  +    KV HL L+++DHRP+LA W  E P
Subjt:  VVT-----LVRLHMGMRKSVFKVHHLALVATDHRPLLAEWKEEPP

A0A7J6F293 Uncharacterized protein (e value: 5.6e-12; % identity: 30.84)
Query:  RGQKEGQVRESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCW-------
        RG K G    S   +  K     SGGL+LLW ++ +VS+ S+S GHIDA +Q  G   WRFTG YGNP      E+WRL+ RL +  D+   W       
Subjt:  RGQKEGQVRESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQ-EGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCW-------

Query:  -VVTLVRLHMGMRKSVFKVHHL-------ALVATDHRPLLAEWKEEPPDINHNWSRRHYEGSI-RGAINHK---------EVEIQHLLRLGDDL-RDNEL
         V+++     G  +S+  +          +L     +     W      IN      H +  + R   N +         + ++  LL +   L R  E+
Subjt:  -VVTLVRLHMGMRKSVFKVHHL-------ALVATDHRPLLAEWKEEPPDINHNWSRRHYEGSI-RGAINHK---------EVEIQHLLRLGDDL-RDNEL

Query:  KEKENELENLLEDDEIYWKQRTREDWL
        K  E +L +LL  +E YWK R+R DWL
Subjt:  KEKENELENLLEDDEIYWKQRTREDWL

A0A803NVA0 Uncharacterized protein (e value: 3.6e-11; % identity: 53.62)
Query:  SGGLMLLWRNETKVSIHSYSKGHIDATIQEGGW-SWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGC
        SGGL LLW+ E +V+I  +S  HIDA IQ+ G  SWRFTGIYG P R L  +TW L   L E +D+  C
Subjt:  SGGLMLLWRNETKVSIHSYSKGHIDATIQEGGW-SWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGC

A0A803P941 Uncharacterized protein (e value: 1.4e-10; % identity: 31.51)
Query:  ESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQEG-GWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCWVVTLVRLHMGMRKSVF
        +S  +V  K K   SGGL LLW++  +V+I S++  HIDA ++ G G+SWRFTG YG+P  G    TW LM RL  ++ ++G W+       + + K   
Subjt:  ESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQEG-GWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCWVVTLVRLHMGMRKSVF

Query:  KVHHLALVATDHRPLLAEWKEEPPDINHN-WSRRHYEGSI---RGAIN-----------HKEVEI--------QHLLRLGDDLRDNELKEK---ENELEN
                +  H       +EE   I  N W  R + GS+   RG IN            K+ E+        + L RL   L + + +++   E +L  
Subjt:  KVHHLALVATDHRPLLAEWKEEPPDINHN-WSRRHYEGSI---RGAIN-----------HKEVEI--------QHLLRLGDDLRDNELKEK---ENELEN

Query:  LLEDDEIYWKQRTREDWLT
          + +E+ WKQR+R  WLT
Subjt:  LLEDDEIYWKQRTREDWLT

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGATGGATCGAGGCCAGAAAGAAGGTCAAGTGCGTGAATCCATTACGTTAGTCGGTACTAAGCATAAGACAGAGGAGAGTGGGGGATTGATGTTACTATGGAGGAATGA
GACTAAGGTTAGTATTCACTCATATTCGAAGGGGCATATAGATGCGACTATCCAGGAAGGAGGTTGGTCATGGCGGTTTACAGGCATTTATGGAAACCCAGTTAGAGGTT
TGCATCATGAAACGTGGAGGTTAATGACTAGACTAGCTGAGCAGTCAGATATTCGTGGGTGCTGGGTGGTGACTTTAGTGAGATTACATATGGGTATGAGAAAAAGTGTA
TTCAAAGTACACCATTTGGCTCTGGTAGCAACTGACCATAGACCTCTCCTAGCGGAATGGAAAGAAGAACCACCTGATATAAATCATAATTGGAGTAGGCGACACTATGA
AGGATCCATTAGAGGAGCTATTAATCATAAAGAAGTGGAAATTCAACACCTTCTCAGGCTTGGAGATGATCTTAGGGACAATGAGCTTAAGGAGAAGGAAAATGAGCTTG
AAAATCTATTAGAGGATGATGAGATCTATTGGAAGCAGAGAACCCGGGAGGATTGGTTAACTTGGGTGACCGCAATACTAAATGGTTCCATCTGA
mRNA sequence
ATGATGGATCGAGGCCAGAAAGAAGGTCAAGTGCGTGAATCCATTACGTTAGTCGGTACTAAGCATAAGACAGAGGAGAGTGGGGGATTGATGTTACTATGGAGGAATGA
GACTAAGGTTAGTATTCACTCATATTCGAAGGGGCATATAGATGCGACTATCCAGGAAGGAGGTTGGTCATGGCGGTTTACAGGCATTTATGGAAACCCAGTTAGAGGTT
TGCATCATGAAACGTGGAGGTTAATGACTAGACTAGCTGAGCAGTCAGATATTCGTGGGTGCTGGGTGGTGACTTTAGTGAGATTACATATGGGTATGAGAAAAAGTGTA
TTCAAAGTACACCATTTGGCTCTGGTAGCAACTGACCATAGACCTCTCCTAGCGGAATGGAAAGAAGAACCACCTGATATAAATCATAATTGGAGTAGGCGACACTATGA
AGGATCCATTAGAGGAGCTATTAATCATAAAGAAGTGGAAATTCAACACCTTCTCAGGCTTGGAGATGATCTTAGGGACAATGAGCTTAAGGAGAAGGAAAATGAGCTTG
AAAATCTATTAGAGGATGATGAGATCTATTGGAAGCAGAGAACCCGGGAGGATTGGTTAACTTGGGTGACCGCAATACTAAATGGTTCCATCTGA
Protein sequence
MMDRGQKEGQVRESITLVGTKHKTEESGGLMLLWRNETKVSIHSYSKGHIDATIQEGGWSWRFTGIYGNPVRGLHHETWRLMTRLAEQSDIRGCWVVTLVRLHMGMRKSV
FKVHHLALVATDHRPLLAEWKEEPPDINHNWSRRHYEGSIRGAINHKEVEIQHLLRLGDDLRDNELKEKENELENLLEDDEIYWKQRTREDWLTWVTAILNGSI
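
The protein sequence above can be cross-checked against the CDS by translating it. The sketch below is one way to do that, assuming the standard genetic code applies (the record does not say which translation table was used). The helper name `translate` is hypothetical, and only the first and last 15 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 nt of the CDS; the listed protein starts with MMDRG.
print(translate("ATGATGGATCGAGGC"))  # MMDRG
# Last 15 nt of the CDS (in frame); the listed protein ends with NGSI.
print(translate("AATGGTTCCATCTGA"))  # NGSI
```

To check the whole record, join the wrapped CDS lines into a single string before passing it to `translate`, then compare the result with the joined protein lines.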