CuGenDBv2

HG10017419 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017419
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: Integrase catalytic domain-containing protein
Genome location: Chr03:14111547..14116592
RNA-Seq Expression: HG10017419
Synteny: HG10017419
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR022546 - Uncharacterised protein family Ycf68


Homology
GenBank top hits | e value | % identity | Alignment
GEX73483.1 hypothetical protein BVC80_4285g1 [Tanacetum cinerariifolium] | e value: 2.0e-56 | % identity: 84.96
Query:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        RFESAYLQLVNLADTKLYDST FFRFG SIY LSFMDVDKI PFSSTLGWHSLK+KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVE K++ GD R+G+
Subjt:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

Query:  PFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP
        P +LL  P AGKR PGEL+HLSSQRKRKQKRFP
Subjt:  PFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP

KAF7117209.1 hypothetical protein RHSIM_RhsimPtG0005300 [Rhododendron simsii] | e value: 3.6e-74 | % identity: 95.24
Query:  GGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG
        G LRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFG SIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRG
Subjt:  GGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRG

Query:  VENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP
        VENK RSGDSRIGQPF+LLLNPWA KRQPGELKHLSSQRKRKQKRFP
Subjt:  VENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP

KAG5222336.1 Vacuolar protein sorting-associated protein [Salix suchowensis] | e value: 8.5e-76 | % identity: 94.12
Query:  SSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVS
        S VESVGG RGGGLPCGGCQRFESAYLQLV+L  TKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVS
Subjt:  SSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVS

Query:  DEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP
        DEMLRGVENKRRSGDSRIGQPFKLL NPWAGKRQPGELKHLS+QRKRKQKRFP
Subjt:  DEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica] | e value: 1.5e-56 | % identity: 93.33
Query:  SSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVS
        S VESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVS
Subjt:  SSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVS

Query:  DEMLRGVENKRRSGDSRIGQ
        DEMLRGVENKRRSGDSRIG+
Subjt:  DEMLRGVENKRRSGDSRIGQ

OVA05688.1 hypothetical protein BVC80_4285g1 [Macleaya cordata] | e value: 1.2e-77 | % identity: 54.66
Query:  GQEGFLTPSFFFSSELFHKDLPWCDDLLHGRGLWFKSRMAQLRQG---KNKRIEEASDSFMQAPLGSGGYSSVE--------------------------
        GQEG LTPSFFFSSELFHKDLPW      G     +S + +   G     KRIEEASDSFM APLGSGGYSSV                           
Subjt:  GQEGFLTPSFFFSSELFHKDLPWCDDLLHGRGLWFKSRMAQLRQG---KNKRIEEASDSFMQAPLGSGGYSSVE--------------------------

Query:  ------------------------------------------------------------------------------------SVGGLRGGGLPCGGCQ
                                                                                            S GGLRGGGLPCGGCQ
Subjt:  ------------------------------------------------------------------------------------SVGGLRGGGLPCGGCQ

Query:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        RFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIGQ
Subjt:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

Query:  PFKLLLNPWAGKRQPGELKHLS
        PF+LLLNPWAGKRQPGELKHLS
Subjt:  PFKLLLNPWAGKRQPGELKHLS

TrEMBL top hits | e value | % identity | Alignment
A0A200Q5G5 Uncharacterized protein ycf68 | e value: 5.7e-78 | % identity: 54.66
Query:  GQEGFLTPSFFFSSELFHKDLPWCDDLLHGRGLWFKSRMAQLRQG---KNKRIEEASDSFMQAPLGSGGYSSVE--------------------------
        GQEG LTPSFFFSSELFHKDLPW      G     +S + +   G     KRIEEASDSFM APLGSGGYSSV                           
Subjt:  GQEGFLTPSFFFSSELFHKDLPWCDDLLHGRGLWFKSRMAQLRQG---KNKRIEEASDSFMQAPLGSGGYSSVE--------------------------

Query:  ------------------------------------------------------------------------------------SVGGLRGGGLPCGGCQ
                                                                                            S GGLRGGGLPCGGCQ
Subjt:  ------------------------------------------------------------------------------------SVGGLRGGGLPCGGCQ

Query:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        RFESAYLQLVNLADTK+YDST FFRFGSSIYDLSFMDVDKIL FSSTLGWHSLKV GEVQT+KGLRWIPRHPETRKGV SDEMLRGVENK RSGDSRIGQ
Subjt:  RFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

Query:  PFKLLLNPWAGKRQPGELKHLS
        PF+LLLNPWAGKRQPGELKHLS
Subjt:  PFKLLLNPWAGKRQPGELKHLS

A0A2N9HP93 Uncharacterized protein ycf68 | e value: 1.4e-71 | % identity: 34.3
Query:  AMQCSWTLMSIRMNHPFHGGKSLPARQEDSKLQILSR---------------------------------------------------------------
        AMQCSWTLMSIRMNH F GGKSLPARQEDSKLQILSR                                                               
Subjt:  AMQCSWTLMSIRMNHPFHGGKSLPARQEDSKLQILSR---------------------------------------------------------------

Query:  -----------------------------------------------------------------------------TLPGLDMPRILLKERGAFGNADT
                                                                                     TLPGLDMPRILLKERGAFGNADT
Subjt:  -----------------------------------------------------------------------------TLPGLDMPRILLKERGAFGNADT

Query:  GGAWLSSARATAGDKPEEGEDDVKSSCPLCPG-----------------------DTRATMAGTKGRD--------------------------------
        GGAWLSSARATAGDKPEEGEDDVKSSCPLCPG                         R TM     R                                 
Subjt:  GGAWLSSARATAGDKPEEGEDDVKSSCPLCPG-----------------------DTRATMAGTKGRD--------------------------------

Query:  ---PAR------------------------GPGDLKKDLRVSRLGQ--------------------EGFLTPSFFF-------------SSELFHKDLP-
           P R                        GPG       +S + +                       + P+F+              SS L +  +P 
Subjt:  ---PAR------------------------GPGDLKKDLRVSRLGQ--------------------EGFLTPSFFF-------------SSELFHKDLP-

Query:  -WCDDLLHGRGL-----------------------------------------W---------------------------------------FKSRMAQ
         + D  L   GL                                         W                                          R+A 
Subjt:  -WCDDLLHGRGL-----------------------------------------W---------------------------------------FKSRMAQ

Query:  LRQ--------------------GKNKRIEEASDSFMQAPLG----------------SGG------YSSVESVGGLRGGGLPCGGCQRFESAYLQLVNL
        +R+                    GK++ + + +   +   +G                SGG       S VESVGGLRGGGLPCGGCQRFESAYLQLVNL
Subjt:  LRQ--------------------GKNKRIEEASDSFMQAPLG----------------SGG------YSSVESVGGLRGGGLPCGGCQRFESAYLQLVNL

Query:  ADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        ADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG+
Subjt:  ADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

A0A2N9HUL9 Uncharacterized protein ycf68 | e value: 3.0e-79 | % identity: 98.01
Query:  VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDE
        VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDE
Subjt:  VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDE

Query:  MLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP
        MLRGVENKRRSGDSRIGQPF+LLLNPWAGKRQPGELKHLSSQRKRKQKRFP
Subjt:  MLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP

A0A2N9I678 Uncharacterized protein ycf68 | e value: 1.7e-77 | % identity: 47.03
Query:  TLPGLDMPRILLKERGAFGNADTGGAWLSSARATAGDKPEEGEDDVKSSCPLCPGD---TRATMAGTKGRDPARGPGDLKKDLRV---------------
        TLPGLDMPRILLKERGAFGNADTGGAWLSSARATAGDKPEEGEDDVKSSCPLCPG          G++ R+ A G    + +L +               
Subjt:  TLPGLDMPRILLKERGAFGNADTGGAWLSSARATAGDKPEEGEDDVKSSCPLCPGD---TRATMAGTKGRDPARGPGDLKKDLRV---------------

Query:  --------------------SRLGQEG-------FL-----------TPSFFFSSELFH--------KDLPWCDD----LLHGRGLW-------------
                             R G  G       FL            P    S E  H            W  D      H +  W             
Subjt:  --------------------SRLGQEG-------FL-----------TPSFFFSSELFH--------KDLPWCDD----LLHGRGLW-------------

Query:  --------------------------FKSRMAQLRQ--------------------GKNKRIEEASDSFMQAPLG----------------SGG------
                                     R+A +R+                    GK++ + + +   +   +G                SGG      
Subjt:  --------------------------FKSRMAQLRQ--------------------GKNKRIEEASDSFMQAPLG----------------SGG------

Query:  YSSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVV
         S VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVV
Subjt:  YSSVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVV

Query:  SDEMLRGVENKRRSGDSRIGQ
        SDEMLRGVENKRRSGDSRIG+
Subjt:  SDEMLRGVENKRRSGDSRIGQ

A0A2P2JY74 Uncharacterized protein ycf68 | e value: 1.7e-74 | % identity: 94.12
Query:  VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS--LKVKGEVQTKKGLRWIPRHPETRKGVVS
        VESVGG RGGGLP GGCQRFESAYLQLVNL  TKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHS  LKVKGEVQT+KGLRWIPRHPETRKGVVS
Subjt:  VESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS--LKVKGEVQTKKGLRWIPRHPETRKGVVS

Query:  DEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP
        DEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLS+QRKRKQKRFP
Subjt:  DEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFP

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAGAGAAGCAAGGAGGTCAACCTCTTTCAAATATACAACATGGATTCTGGCAATGCAATGTAGTTGGACTCTCATGTCGATCCGAATGAATCATCCTTTCCACGGAGG
TAAATCTTTGCCTGCTAGGCAAGAGGATAGCAAGTTACAAATTCTGTCGCGAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGA
ACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGCGAC
ACACGTGCTACAATGGCCGGGACAAAGGGTCGTGATCCCGCGAGGGGACCAGGAGATTTGAAAAAGGATCTTAGAGTGTCTAGGTTGGGCCAGGAGGGTTTCTTAACGCC
TTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGCCATGGTGCGATGATTTACTTCACGGGCGAGGTCTCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAG
GAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTA
CCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTA
TGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGA
TACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACTG
CTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCGTAG
mRNA sequence
ATGAGAGAAGCAAGGAGGTCAACCTCTTTCAAATATACAACATGGATTCTGGCAATGCAATGTAGTTGGACTCTCATGTCGATCCGAATGAATCATCCTTTCCACGGAGG
TAAATCTTTGCCTGCTAGGCAAGAGGATAGCAAGTTACAAATTCTGTCGCGAACCTTACCAGGGCTTGACATGCCGCGAATCCTCTTGAAAGAGAGGGGTGCCTTCGGGA
ACGCGGACACAGGTGGTGCATGGCTGTCGTCAGCTCGTGCCACTGCCGGTGATAAGCCGGAGGAAGGTGAGGATGACGTCAAGTCATCATGCCCCTTATGCCCTGGCGAC
ACACGTGCTACAATGGCCGGGACAAAGGGTCGTGATCCCGCGAGGGGACCAGGAGATTTGAAAAAGGATCTTAGAGTGTCTAGGTTGGGCCAGGAGGGTTTCTTAACGCC
TTCTTTTTTCTTCTCATCGGAGTTATTTCACAAAGACTTGCCATGGTGCGATGATTTACTTCACGGGCGAGGTCTCTGGTTCAAGTCCAGGATGGCCCAGCTACGCCAAG
GAAAGAATAAAAGAATAGAAGAAGCATCTGACTCCTTCATGCAGGCCCCACTTGGCTCGGGGGGATATAGCTCAGTTGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTA
CCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATACAAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTA
TGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATAGCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGA
TACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAAAATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACTG
CTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGCGAACTGAAACATCTTAGTAGCCAGAGGAAAAGAAAGCAAAAGCGATTCCCGTAG
Protein sequence
MREARRSTSFKYTTWILAMQCSWTLMSIRMNHPFHGGKSLPARQEDSKLQILSRTLPGLDMPRILLKERGAFGNADTGGAWLSSARATAGDKPEEGEDDVKSSCPLCPGD
TRATMAGTKGRDPARGPGDLKKDLRVSRLGQEGFLTPSFFFSSELFHKDLPWCDDLLHGRGLWFKSRMAQLRQGKNKRIEEASDSFMQAPLGSGGYSSVESVGGLRGGGL
PCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKL
LLNPWAGKRQPGELKHLSSQRKRKQKRFP