; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10008821 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10008821
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr06:73496..74659
RNA-Seq ExpressionHG10008821
SyntenyHG10008821
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5336140.1 unnamed protein product [Arabidopsis thaliana]6.0e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

CAD5336141.1 unnamed protein product [Arabidopsis thaliana]6.0e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

CAD5336145.1 unnamed protein product [Arabidopsis thaliana]6.0e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

KAF3606029.1 hypothetical protein DY000_02045118 [Brassica cretica]8.6e-8078.79Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR--------------------------------PFEILRRVALWRAQYDESCKLCSGGSYCL
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                                PFEILRRVALWRAQYDESCKLCSGGS CL
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR--------------------------------PFEILRRVALWRAQYDESCKLCSGGSYCL

Query:  SLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETR
        SLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIY FSFMDVDKI PFSSTLGWHSLKVK EVQT+KGLRWIPRHPETR
Subjt:  SLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETR

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]6.2e-8688.71Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR--PFEILR------RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQR
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR  P    R      RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQR
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR--PFEILR------RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQR

Query:  FESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        FESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  FESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

TrEMBL top hitse value%identityAlignment
A0A200Q5G5 Uncharacterized protein ycf686.2e-7669.6Show/hide
Query:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------------------------PFEILRRVALWR
        VVGVGGS RVPSSGIPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                                   PFEILRRVALWR
Subjt:  VVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------------------------PFEILRRVALWR

Query:  AQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQ
        AQ                      S GGLRGGGLPCGGCQRFESAYLQLVNLADTK+YDST FFRFGSSIYD SFMDVDKIL FSSTLGWHSLKV GEVQ
Subjt:  AQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQ

Query:  TKKGLRWIPRHPETRKGVVSDEMLRVV
        T+KGLRWIPRHPETRKGV SDEMLR V
Subjt:  TKKGLRWIPRHPETRKGVVSDEMLRVV

A0A5N6MLP8 Uncharacterized protein ycf682.9e-7375Show/hide
Query:  RTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRG
        RTWTVVGVGGS RVP SGIPGEEDQVGPCEQLDA  PFNPLSE+RQKEGKSMDRPH    +             + +L  G      L       G LRG
Subjt:  RTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRPFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRG

Query:  GGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SIYD SFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

A0A7G2FJL3 Uncharacterized protein ycf682.9e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

A0A7G2FKR6 Uncharacterized protein ycf682.9e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

A0A7G2FMH4 Uncharacterized protein ycf682.9e-8986.67Show/hide
Query:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG
        IPGEEDQVGPCEQLDA  PFNPLSEMRQKEGKSMDRPHR                 PFEILRRVALWRAQYDESCKLCSGGS CLSLASMVESV GL GG
Subjt:  IPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHR-----------------PFEILRRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGG

Query:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV
        GLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDFSFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLR V
Subjt:  GLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRVV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTACCTCAACCAGCTGGGCAAGGGACTACTATTGAAAGAACTCGTTGGGTTGCGTTAGACCGAACTCTAAAAGTAAACTTCAACGGGATCCGTACATGGACGGTAGT
TGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCG
AAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGT
TCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCT
CCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGATAAGATCCTTCCAT
TTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGC
GACGAAATGCTTCGGGTAGTCGGTGGTCTAATCCCGGATTTGCGCATACATCTCACCGGTTTGCATTCGAACATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTACCTCAACCAGCTGGGCAAGGGACTACTATTGAAAGAACTCGTTGGGTTGCGTTAGACCGAACTCTAAAAGTAAACTTCAACGGGATCCGTACATGGACGGTAGT
TGGAGTCGGCGGCTCTCTTAGGGTTCCCTCATCTGGGATCCCTGGGGAAGAGGATCAAGTTGGCCCTTGCGAACAGCTTGATGCACGAGATCCCTTCAACCCTTTGAGCG
AAATGCGGCAAAAGGAAGGAAAATCCATGGACCGACCCCATCGACCTTTTGAGATTTTGAGAAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGT
TCGGGGGGGAGTTATTGTCTATCGTTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCT
CCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTCTCATTCATGGACGTTGATAAGATCCTTCCAT
TTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGC
GACGAAATGCTTCGGGTAGTCGGTGGTCTAATCCCGGATTTGCGCATACATCTCACCGGTTTGCATTCGAACATCTGA
Protein sequenceShow/hide protein sequence
MVPQPAGQGTTIERTRWVALDRTLKVNFNGIRTWTVVGVGGSLRVPSSGIPGEEDQVGPCEQLDARDPFNPLSEMRQKEGKSMDRPHRPFEILRRVALWRAQYDESCKLC
SGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDFSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVS
DEMLRVVGGLIPDLRIHLTGLHSNI