; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018742 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018742
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationChr04:7859439..7862575
RNA-Seq ExpressionHG10018742
SyntenyHG10018742
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
InterPro domainsIPR022546 - Uncharacterised protein family Ycf68


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAD5336141.1 unnamed protein product [Arabidopsis thaliana]9.2e-7395.17Show/hide
Query:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK
        RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLK
Subjt:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK

Query:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
        VKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
Subjt:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG

CAD5336145.1 unnamed protein product [Arabidopsis thaliana]9.2e-7395.17Show/hide
Query:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK
        RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLK
Subjt:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK

Query:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
        VKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
Subjt:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG

KAF7117209.1 hypothetical protein RHSIM_RhsimPtG0005300 [Rhododendron simsii]7.5e-7569.96Show/hide
Query:  MDSSMCSALPDPEMWGSSKAHSMAYFSCLNRGLKPNFSSEDRWGFRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQL
        MDSSMCS+ PDPEMW             + +G                 WR        + +G              G LRGGGLPCGGCQRFESAYLQL
Subjt:  MDSSMCSALPDPEMWGSSKAHSMAYFSCLNRGLKPNFSSEDRWGFRVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQL

Query:  VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPW
        VNLADTKLYDST FFRFG SIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENK RSGDSRIGQPF+LLLNPW
Subjt:  VNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPW

Query:  AGKRQPGELKHLSSQRKRKQKRF
        A KRQPGELKHLSSQRKRKQKRF
Subjt:  AGKRQPGELKHLSSQRKRKQKRF

KAG5222336.1 Vacuolar protein sorting-associated protein [Salix suchowensis]5.9e-8894.22Show/hide
Query:  RAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEV
        +AQYDESCKLCSGGSYCLS ASMVESVGG RGGGLPCGGCQRFESAYLQLV+L  TKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEV
Subjt:  RAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEV

Query:  QTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF
        QT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLL NPWAGKRQPGELKHLS+QRKRKQKRF
Subjt:  QTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF

KAG7528872.1 hypothetical protein ISN44_Un153g000040 [Arabidopsis suecica]5.4e-7394.52Show/hide
Query:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK
        RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLK
Subjt:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK

Query:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        VKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG+
Subjt:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

TrEMBL top hitse value%identityAlignment
A0A2N9HP93 Uncharacterized protein ycf681.5e-7697.93Show/hide
Query:  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV
        VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV
Subjt:  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV

Query:  KGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG+
Subjt:  KGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

A0A2N9HP93 Uncharacterized protein ycf683.0e-0556.06Show/hide
Query:  TAGDKREEG---LQLAC--MKPESLVIAGQPYGGEFV---PGLVHTARHTMGAGHARSRYLNCKEG
        TAGDK EEG   ++ +C           G+  G   V   PGLVHTARHTMGAGHARSRYLN KEG
Subjt:  TAGDKREEG---LQLAC--MKPESLVIAGQPYGGEFV---PGLVHTARHTMGAGHARSRYLNCKEG

A0A2N9HP93 Uncharacterized protein ycf682.4e-7494.12Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS--LKVKGEVQTKKGLRWIPRHPETRKGVV
        MVESVGG RGGGLP GGCQRFESAYLQLVNL  TKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHS  LKVKGEVQT+KGLRWIPRHPETRKGVV
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHS--LKVKGEVQTKKGLRWIPRHPETRKGVV

Query:  SDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF
        SDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLS+QRKRKQKRF
Subjt:  SDEMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF

A0A2N9HUL9 Uncharacterized protein ycf684.1e-7998.01Show/hide
Query:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD
        MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQT+KGLRWIPRHPETRKGVVSD
Subjt:  MVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSD

Query:  EMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF
        EMLRGVENKRRSGDSRIGQPF+LLLNPWAGKRQPGELKHLSSQRKRKQKRF
Subjt:  EMLRGVENKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRF

A0A2N9I678 Uncharacterized protein ycf681.5e-7697.93Show/hide
Query:  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV
        VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV
Subjt:  VALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKV

Query:  KGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ
        KGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG+
Subjt:  KGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIGQ

A0A7G2FKR6 Uncharacterized protein ycf684.4e-7395.17Show/hide
Query:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK
        RVALWRAQYDESCKLCSGGS CLSLASMVESV GL GGGLPCGGCQRFESAYLQLVNLADTKLYDST FFRFGSSIYD SFMDVDKI PFSSTLGWHSLK
Subjt:  RVALWRAQYDESCKLCSGGSYCLSLASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLK

Query:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
        VKGEVQT+KGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG
Subjt:  VKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVENKRRSGDSRIG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAAAGCGAAGAACCTTACCAGGGCTTGACATGCCGCGAACCTCTTGA
AAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGTGTGTCAGCTCGTGCCTCCCGCAACGAGCGCAACCCTCGTGTTAGGTTGCCAACCGTTGAGGTTGG
AACCTGAGAAGACTGCCGGTGATAAGCGGGAGGAAGGCTTGCAACTCGCCTGCATGAAGCCGGAATCGCTAGTAATCGCCGGTCAGCCATACGGTGGTGAATTCGTTCCG
GGCCTTGTACACACCGCCCGTCACACTATGGGAGCTGGCCATGCCCGAAGTCGTTACCTTAACTGCAAGGAGGGGATGCCGAAGGCAGGGCTAGTGACTGGAGTGAAGTC
GGCTCTCAGCCACATGGATAGTTCAATGTGCTCAGCGCTGCCTGACCCTGAGATGTGGGGATCATCCAAGGCACATAGCATGGCGTACTTCTCCTGTTTGAACCGGGGTT
TGAAACCAAACTTCTCCTCAGAGGATAGATGGGGGTTCAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCG
TTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATAC
AAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATA
GCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAA
AATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACTGCTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGTGAACTGAAACATCTTAGTAGCCAGAG
GAAAAGAAAGCAAAAGCGATTCTCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACTCAAAGGAATTGACGGGGGCCCGCACAAGCGGTGGAGCATGTGGTTTAATTCGATGCAAAGCGAAGAACCTTACCAGGGCTTGACATGCCGCGAACCTCTTGA
AAGAGAGGGGTGCCTTCGGGAACGCGGACACAGGTGGTGCATGGTGTGTCAGCTCGTGCCTCCCGCAACGAGCGCAACCCTCGTGTTAGGTTGCCAACCGTTGAGGTTGG
AACCTGAGAAGACTGCCGGTGATAAGCGGGAGGAAGGCTTGCAACTCGCCTGCATGAAGCCGGAATCGCTAGTAATCGCCGGTCAGCCATACGGTGGTGAATTCGTTCCG
GGCCTTGTACACACCGCCCGTCACACTATGGGAGCTGGCCATGCCCGAAGTCGTTACCTTAACTGCAAGGAGGGGATGCCGAAGGCAGGGCTAGTGACTGGAGTGAAGTC
GGCTCTCAGCCACATGGATAGTTCAATGTGCTCAGCGCTGCCTGACCCTGAGATGTGGGGATCATCCAAGGCACATAGCATGGCGTACTTCTCCTGTTTGAACCGGGGTT
TGAAACCAAACTTCTCCTCAGAGGATAGATGGGGGTTCAGAGTTGCTCTTTGGAGAGCACAGTACGATGAAAGTTGTAAGCTGTGTTCGGGGGGGAGTTATTGTCTATCG
TTGGCCTCTATGGTAGAATCAGTCGGGGGCCTGAGAGGCGGTGGTTTACCCTGTGGCGGATGTCAGCGGTTCGAGTCCGCTTATCTCCAACTCGTGAACTTAGCCGATAC
AAAGCTATATGATAGCACTCCATTTTTCCGATTCGGCAGTTCGATCTATGATTTATCATTCATGGACGTTGATAAGATCCTTCCATTTAGCAGCACCTTAGGATGGCATA
GCCTTAAAGTTAAGGGCGAGGTTCAAACGAAGAAAGGCTTACGGTGGATACCTAGGCACCCAGAGACGAGGAAGGGCGTAGTAAGCGACGAAATGCTTCGGGGAGTTGAA
AATAAGCGTAGATCCGGAGATTCCCGAATAGGTCAACCTTTCAAACTGCTGCTGAATCCATGGGCAGGCAAGAGACAACCTGGTGAACTGAAACATCTTAGTAGCCAGAG
GAAAAGAAAGCAAAAGCGATTCTCGTAG
Protein sequenceShow/hide protein sequence
MKLKGIDGGPHKRWSMWFNSMQSEEPYQGLTCREPLEREGCLRERGHRWCMVCQLVPPATSATLVLGCQPLRLEPEKTAGDKREEGLQLACMKPESLVIAGQPYGGEFVP
GLVHTARHTMGAGHARSRYLNCKEGMPKAGLVTGVKSALSHMDSSMCSALPDPEMWGSSKAHSMAYFSCLNRGLKPNFSSEDRWGFRVALWRAQYDESCKLCSGGSYCLS
LASMVESVGGLRGGGLPCGGCQRFESAYLQLVNLADTKLYDSTPFFRFGSSIYDLSFMDVDKILPFSSTLGWHSLKVKGEVQTKKGLRWIPRHPETRKGVVSDEMLRGVE
NKRRSGDSRIGQPFKLLLNPWAGKRQPGELKHLSSQRKRKQKRFS