; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006241 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006241
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr6:39871889..39872521
RNA-Seq ExpressionLag0006241
SyntenyLag0006241
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0044267 - cellular protein metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8521602.1 hypothetical protein F0562_012275 [Nyssa sinensis]5.0e-5260.67Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK+E Q  GN K C+RCGKLGH+KR+CR KVVC+RCGKS HI+ N RVNL    ANV  E+ + EQL WEQCLSIE  DQ + L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TA+NSLHPVV+EG  NVK D  NV GV L++VY VP LKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

KAA8540328.1 hypothetical protein F0562_024753 [Nyssa sinensis]1.4e-5965.17Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK EGQS+GN + C+RCGKLGH+KRDCR KVVC+RCGKSGHI+ NCRVNL    ANV  E+++ EQL WEQCLSIE  DQP+ L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TADNSLHPVV+EG  NVK D  N  GV L+DVYHVPGLKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

KAA8541518.1 hypothetical protein F0562_022670 [Nyssa sinensis]5.0e-6065.73Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK EGQS+GN + C+RCGKLGH+KRDCR KVVC+RCGKSGHI+ NCRVNL    ANV  E+++ EQL WEQCLSIE  DQP+ L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TADNSLHPVV+EG  NVK D  NV GV L+DVYHVPGLKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

KAA8549858.1 hypothetical protein F0562_001542 [Nyssa sinensis]7.0e-5457.36Show/hide
Query:  KKKSVATLSTTFPKQGLICSKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQP
        K  S +  S+   KQ    SK +GQS+GN K  +RCGKLGH+KRDC  KVVC+RC KS HI+ NCRVNL    ANV  +++K EQL WEQCLSIE  DQP
Subjt:  KKKSVATLSTTFPKQGLICSKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQP

Query:  -----------INLHANVSIEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
                   +  +AN SI+Y +DWI+D GCSHHA GN +LLS+V  H GKR I TADNSLHP+V+EG  NVK D  NV+GV L+DVYHVP LKK+
Subjt:  -----------INLHANVSIEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

RWR74934.1 Integrase, catalytic core [Cinnamomum micranthum f. kanehirae]1.5e-4860.23Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINLHA----NVSIEYDEDW
        S  EG+ +GN KGCFRCG+LGHIKRDC A+VVC+RCGKSGHI+ NCRV L EA ANV QE ++ EQ TWE  LSI      I   A    N SI+Y++ W
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINLHA----NVSIEYDEDW

Query:  IIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+DSGCSHHATGN  LLSDV  H GK+ I TADNSL+PV +EG  + + D  N  GV L +VYHV GLKK+
Subjt:  IIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

TrEMBL top hitse value%identityAlignment
A0A443N8T5 Integrase, catalytic core7.3e-4960.23Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINLHA----NVSIEYDEDW
        S  EG+ +GN KGCFRCG+LGHIKRDC A+VVC+RCGKSGHI+ NCRV L EA ANV QE ++ EQ TWE  LSI      I   A    N SI+Y++ W
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINLHA----NVSIEYDEDW

Query:  IIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+DSGCSHHATGN  LLSDV  H GK+ I TADNSL+PV +EG  + + D  N  GV L +VYHV GLKK+
Subjt:  IIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

A0A5J4ZW51 CCHC-type domain-containing protein2.4e-5260.67Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK+E Q  GN K C+RCGKLGH+KR+CR KVVC+RCGKS HI+ N RVNL    ANV  E+ + EQL WEQCLSIE  DQ + L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TA+NSLHPVV+EG  NVK D  NV GV L++VY VP LKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

A0A5J5BCB3 Uncharacterized protein7.0e-6065.17Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK EGQS+GN + C+RCGKLGH+KRDCR KVVC+RCGKSGHI+ NCRVNL    ANV  E+++ EQL WEQCLSIE  DQP+ L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TADNSLHPVV+EG  NVK D  N  GV L+DVYHVPGLKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

A0A5J5BFR6 Uncharacterized protein2.4e-6065.73Show/hide
Query:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS
        SK EGQS+GN + C+RCGKLGH+KRDCR KVVC+RCGKSGHI+ NCRVNL    ANV  E+++ EQL WEQCLSIE  DQP+ L           +AN S
Subjt:  SKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL-----------HANVS

Query:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
        I+Y +DWI+DSGCSHHATGN  LLS+V  H GKR I TADNSLHPVV+EG  NVK D  NV GV L+DVYHVPGLKK+
Subjt:  IEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

A0A5J5C3K7 Uncharacterized protein3.4e-5457.36Show/hide
Query:  KKKSVATLSTTFPKQGLICSKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQP
        K  S +  S+   KQ    SK +GQS+GN K  +RCGKLGH+KRDC  KVVC+RC KS HI+ NCRVNL    ANV  +++K EQL WEQCLSIE  DQP
Subjt:  KKKSVATLSTTFPKQGLICSKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQP

Query:  -----------INLHANVSIEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH
                   +  +AN SI+Y +DWI+D GCSHHA GN +LLS+V  H GKR I TADNSLHP+V+EG  NVK D  NV+GV L+DVYHVP LKK+
Subjt:  -----------INLHANVSIEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKH

SwissProt top hitse value%identityAlignment
P03352 Gag polyprotein1.7e-0742.11Show/hide
Query:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR
        K  L+   +  Q K   KG    C+ CGK GH+ R CR  ++CH CGK GH++ +CR
Subjt:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR

P23424 Gag polyprotein1.7e-0742.11Show/hide
Query:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR
        K  L+   +  Q K   KG    C+ CGK GH+ R CR  ++CH CGK GH++ +CR
Subjt:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR

P23425 Gag polyprotein1.7e-0742.11Show/hide
Query:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR
        K  L+   +  Q K   KG    C+ CGK GH+ R CR  ++CH CGK GH++ +CR
Subjt:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR

P35955 Gag polyprotein1.3e-0742.11Show/hide
Query:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR
        K  L+   +  Q K   KG    C+ CGK GH+ R CR  ++CH CGK GH++ +CR
Subjt:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR

P35956 Gag-Pol polyprotein1.3e-0742.11Show/hide
Query:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR
        K  L+   +  Q K   KG    C+ CGK GH+ R CR  ++CH CGK GH++ +CR
Subjt:  KQGLICSKVEGQSKGNLKG----CFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCR

Arabidopsis top hitse value%identityAlignment
AT1G75560.1 zinc knuckle (CCHC-type) family protein9.2e-0448.65Show/hide
Query:  NLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNC
        N K C  C   GHI RDCR   VC+ C  SGH+  +C
Subjt:  NLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNC

AT1G75560.2 zinc knuckle (CCHC-type) family protein9.2e-0448.65Show/hide
Query:  NLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNC
        N K C  C   GHI RDCR   VC+ C  SGH+  +C
Subjt:  NLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNC

AT4G36020.1 cold shock domain protein 13.2e-0438.18Show/hide
Query:  GQSKGNLKGCFRCGKLGHIKRDCRAKV-------------VCHRCGKSGHIRPNC
        G  KG   GC+ CG +GH  RDC  KV              C+ CG  GHI  +C
Subjt:  GQSKGNLKGCFRCGKLGHIKRDCRAKV-------------VCHRCGKSGHIRPNC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTTCCAATCTTTAAAAAAAAAAAAAGCGTCGCAACGCTATCGACAACTTTTCCAAAGCAAGGCCTAATTTGTTCCAAGGTTGAAGGGCAGTCCAAAGGCAATTT
AAAAGGATGTTTTAGGTGCGGCAAGCTAGGACACATCAAACGTGATTGTCGAGCGAAGGTGGTGTGTCATCGTTGTGGGAAGTCGGGTCATATTAGGCCAAATTGTCGGG
TGAATCTCAAAGAAGCAGAAGCAAATGTTGTACAAGAAAGTAACAAACCTGAACAACTTACTTGGGAACAATGCTTGTCAATTGAAACTTTTGATCAACCCATTAATTTG
CATGCTAATGTTTCTATAGAATATGATGAGGATTGGATTATTGATTCTGGTTGTTCTCATCATGCTACTGGAAATGTTTTTCTTCTCTCTGATGTCCATACCCATCAGGG
AAAAAGAGTTATTGCAACGGCCGATAATTCCTTACATCCTGTTGTTGAAGAAGGGTGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTTTCTTGAAGATG
TTTATCATGTTCCAGGCCTAAAGAAGCACTACAACAAATTTGGGCTTAGATGTCAGTTCTGCACCGTTATTAAACCCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTTCCAATCTTTAAAAAAAAAAAAAGCGTCGCAACGCTATCGACAACTTTTCCAAAGCAAGGCCTAATTTGTTCCAAGGTTGAAGGGCAGTCCAAAGGCAATTT
AAAAGGATGTTTTAGGTGCGGCAAGCTAGGACACATCAAACGTGATTGTCGAGCGAAGGTGGTGTGTCATCGTTGTGGGAAGTCGGGTCATATTAGGCCAAATTGTCGGG
TGAATCTCAAAGAAGCAGAAGCAAATGTTGTACAAGAAAGTAACAAACCTGAACAACTTACTTGGGAACAATGCTTGTCAATTGAAACTTTTGATCAACCCATTAATTTG
CATGCTAATGTTTCTATAGAATATGATGAGGATTGGATTATTGATTCTGGTTGTTCTCATCATGCTACTGGAAATGTTTTTCTTCTCTCTGATGTCCATACCCATCAGGG
AAAAAGAGTTATTGCAACGGCCGATAATTCCTTACATCCTGTTGTTGAAGAAGGGTGTGTTAATGTTAAGGATGATGCACCAAATGTTGCTGGTGTTTTTCTTGAAGATG
TTTATCATGTTCCAGGCCTAAAGAAGCACTACAACAAATTTGGGCTTAGATGTCAGTTCTGCACCGTTATTAAACCCCTATGA
Protein sequenceShow/hide protein sequence
MELPIFKKKKSVATLSTTFPKQGLICSKVEGQSKGNLKGCFRCGKLGHIKRDCRAKVVCHRCGKSGHIRPNCRVNLKEAEANVVQESNKPEQLTWEQCLSIETFDQPINL
HANVSIEYDEDWIIDSGCSHHATGNVFLLSDVHTHQGKRVIATADNSLHPVVEEGCVNVKDDAPNVAGVFLEDVYHVPGLKKHYNKFGLRCQFCTVIKPL