; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10022552 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10022552
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionYqaJ domain-containing protein
Genome locationChr05:25432311..25432889
RNA-Seq ExpressionHG10022552
SyntenyHG10022552
Gene Ontology termsGO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0004527 - exonuclease activity (molecular function)
InterPro domainsIPR011335 - Restriction endonuclease type II-like
IPR011604 - Exonuclease, phage-type/RecB, C-terminal
IPR019080 - YqaJ viral recombinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607651.1 hypothetical protein SDJN03_00993, partial [Cucurbita argyrosperma subsp. sororia]5.7e-10392.63Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGK NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM +ASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTPKGSSLFRLYRD EYW+VLKIALSDFWWKHVQPAREMCSKY+ITNPL+ELKSLRPSPKHELCSYIVCESKRVV+NS+LLLREF+GRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

XP_011659114.2 uncharacterized protein LOC101215512 [Cucumis sativus]1.0e-10493.16Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKM+YGLPS+GVLEIKCPFF+GDM+ ASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTP GSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKY +TNPLIELKSLRPSP+HELCSYIVCES+RVVNNSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

XP_022981568.1 uncharacterized protein LOC111480647 [Cucurbita maxima]2.6e-10393.16Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGK NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM+KASPWSRVPLYCIPQ QGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTPKGSSLFRLYRD EYW+VLKIALSDFWWKHVQPAREMCSKY+ITNPLIELKSLRPSPKHELCSYIVCESKRVV+NS+LLLREF+GRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

XP_031742491.1 uncharacterized protein LOC101207616 [Cucumis sativus]3.3e-10392.63Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GD++ A PWSRVP YCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTP GSSLFRLYRD EYWDVLKIALSDFWWKHVQPAREMCSKY ITNPL+ELKSLRPSP+HELCSYIVCESKRVVNNSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

XP_038900094.1 uncharacterized protein LOC120087241 [Benincasa hispida]8.5e-10795.79Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKM+YGLPSRGVLEIKCPFFDGDM+KASPWSR+PLYCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAI NPLIELKSLRPSPKHELCSYIVCESKRVV+NSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

TrEMBL top hitse value%identityAlignment
A0A0A0K4T5 YqaJ domain-containing protein5.9e-10694.74Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTP GSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKY +TNPLIELKSLRPSP+HELCSYIVCESKRVVNNSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

A0A0A0KG47 YqaJ domain-containing protein1.6e-10392.63Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GD++ A PWSRVP YCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTP GSSLFRLYRD EYWDVLKIALSDFWWKHVQPAREMCSKY ITNPL+ELKSLRPSP+HELCSYIVCESKRVVNNSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

A0A1S3C4M6 uncharacterized protein LOC1034964276.1e-10392.11Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGKANS DDWLAASPDG IDKMVYGLPSRGVLEIKCPFF+GDM+ ASPWS+VP YCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTP GSSLFRLYRD EYWDVLKIALSDFWWKHVQPARE+CSKY ITNPLIELKS RPSP+HELCSYIVCES+RVVNNSKLLLREFDGRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

A0A6J1EKA9 uncharacterized protein LOC1114333776.1e-10393.16Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVY K NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM KASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTPKGSSLFRLYRD EYW+VLKIALSDFWWKHVQPAREMCSKY+ITNPLIELKSLRPSPKHELCSYIVCESKRVV+NS+LLLREF+GRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

A0A6J1J2F9 uncharacterized protein LOC1114806471.2e-10393.16Show/hide
Query:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY
        MKEEEALERYKLITGNSVLFP+FQVYGK NS  DWLAASPDG IDKMVYGLPSRGVLEIKCPFFDGDM+KASPWSRVPLYCIPQ QGLMEIMDRDWMDFY
Subjt:  MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFY

Query:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        VWTPKGSSLFRLYRD EYW+VLKIALSDFWWKHVQPAREMCSKY+ITNPLIELKSLRPSPKHELCSYIVCESKRVV+NS+LLLREF+GRL
Subjt:  VWTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G13810.1 Restriction endonuclease, type II-like superfamily protein2.9e-5248.68Show/hide
Query:  EEEALERYKLITGNSVLFPKFQVYGKANSGDD-WLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYV
        E EALERY  +TGN +L P+F VY    S ++ WL ASPDG I+ +  G+ S GVLE+KCPF + D  K  PW +VP  C+PQ QGLMEI+D DW+D Y 
Subjt:  EEEALERYKLITGNSVLFPKFQVYGKANSGDD-WLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYV

Query:  WTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL
        WT  GSSLFR++RD  +W+ +K AL DFW  HV PARE+ + + I +P ++L+  +P   HE C  I+  ++R+  N+  L  E DG L
Subjt:  WTPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRL

AT1G67660.1 Restriction endonuclease, type II-like superfamily protein5.4e-3539.34Show/hide
Query:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW
        E  A+ERYK I G  V    F ++  +N    WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG MEIMDR+W++ Y W
Subjt:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW

Query:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE
        T  GS++FR+ RD  YW ++   L +FWW+ V PARE      +     E+K   P+  H+     + +S  +   SKL+ RE
Subjt:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE

AT1G67660.2 Restriction endonuclease, type II-like superfamily protein5.4e-3539.34Show/hide
Query:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW
        E  A+ERYK I G  V    F ++  +N    WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG MEIMDR+W++ Y W
Subjt:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW

Query:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE
        T  GS++FR+ RD  YW ++   L +FWW+ V PARE      +     E+K   P+  H+     + +S  +   SKL+ RE
Subjt:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE

AT1G67660.3 Restriction endonuclease, type II-like superfamily protein5.4e-3539.34Show/hide
Query:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW
        E  A+ERYK I G  V    F ++  +N    WL ASPDG +D         G+LE+KCP+  G  +   PW +VP Y +PQ QG MEIMDR+W++ Y W
Subjt:  EEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVW

Query:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE
        T  GS++FR+ RD  YW ++   L +FWW+ V PARE      +     E+K   P+  H+     + +S  +   SKL+ RE
Subjt:  TPKGSSLFRLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAAGAAGAAGCACTTGAAAGATACAAGCTTATTACTGGAAACTCTGTTTTGTTTCCTAAATTTCAAGTCTATGGTAAAGCAAACTCTGGAGATGATTGGTTGGC
TGCTTCACCCGATGGTACAATTGATAAGATGGTTTATGGATTGCCTTCACGAGGTGTATTGGAGATTAAGTGCCCATTTTTTGATGGTGATATGAAAAAGGCTTCACCAT
GGTCACGAGTTCCTCTTTACTGTATTCCTCAGGCTCAAGGTTTGATGGAAATAATGGATAGAGATTGGATGGACTTTTATGTTTGGACTCCTAAAGGTAGTAGTTTGTTT
AGATTGTATCGAGATGTCGAATATTGGGATGTCTTGAAAATCGCTTTGTCGGATTTTTGGTGGAAGCATGTTCAACCAGCAAGGGAGATGTGTAGTAAATATGCCATTAC
AAATCCCCTCATTGAGCTTAAGTCTCTTAGGCCATCACCCAAGCATGAACTATGCAGTTATATAGTTTGTGAGAGCAAACGGGTTGTTAATAATTCTAAGTTGCTCTTGC
GTGAATTTGATGGGAGACTTCATAACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAAGAAGAAGCACTTGAAAGATACAAGCTTATTACTGGAAACTCTGTTTTGTTTCCTAAATTTCAAGTCTATGGTAAAGCAAACTCTGGAGATGATTGGTTGGC
TGCTTCACCCGATGGTACAATTGATAAGATGGTTTATGGATTGCCTTCACGAGGTGTATTGGAGATTAAGTGCCCATTTTTTGATGGTGATATGAAAAAGGCTTCACCAT
GGTCACGAGTTCCTCTTTACTGTATTCCTCAGGCTCAAGGTTTGATGGAAATAATGGATAGAGATTGGATGGACTTTTATGTTTGGACTCCTAAAGGTAGTAGTTTGTTT
AGATTGTATCGAGATGTCGAATATTGGGATGTCTTGAAAATCGCTTTGTCGGATTTTTGGTGGAAGCATGTTCAACCAGCAAGGGAGATGTGTAGTAAATATGCCATTAC
AAATCCCCTCATTGAGCTTAAGTCTCTTAGGCCATCACCCAAGCATGAACTATGCAGTTATATAGTTTGTGAGAGCAAACGGGTTGTTAATAATTCTAAGTTGCTCTTGC
GTGAATTTGATGGGAGACTTCATAACTAA
Protein sequenceShow/hide protein sequence
MKEEEALERYKLITGNSVLFPKFQVYGKANSGDDWLAASPDGTIDKMVYGLPSRGVLEIKCPFFDGDMKKASPWSRVPLYCIPQAQGLMEIMDRDWMDFYVWTPKGSSLF
RLYRDVEYWDVLKIALSDFWWKHVQPAREMCSKYAITNPLIELKSLRPSPKHELCSYIVCESKRVVNNSKLLLREFDGRLHN