; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014524 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014524
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr02:13566222..13568732
RNA-Seq ExpressionHG10014524
SyntenyHG10014524
Gene Ontology termsGO:0009987 - cellular process (biological process)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600811.1 hypothetical protein SDJN03_06044, partial [Cucurbita argyrosperma subsp. sororia]6.1e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

KAG7031448.1 hypothetical protein SDJN02_05488 [Cucurbita argyrosperma subsp. argyrosperma]6.1e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

QWT43305.1 kinesin-related protein KIN7C [Citrullus lanatus subsp. vulgaris]8.8e-0552.27Show/hide
Query:  ESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL
        +++V SYG WIK++NL ID+WS +TFK IG+  GG  + +KK L
Subjt:  ESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL

XP_022942054.1 uncharacterized protein LOC111447242 [Cucurbita moschata]6.1e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

XP_022989850.1 uncharacterized protein LOC111486913 [Cucurbita maxima]6.1e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

TrEMBL top hitse value%identityAlignment
A0A5A7V878 DUF4283 domain-containing protein3.6e-0447.06Show/hide
Query:  YKSIVPAESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL
        +KSI+     V  YGGWI +KNLP+D WSI+ +K IG  +GG   I+ K +
Subjt:  YKSIVPAESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL

A0A5D3BKT8 LINE-1 retrotransposable element ORF2 protein6.2e-0433.85Show/hide
Query:  YCVEIAKLESSGRCYKSIVPAESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL
        Y V+    +S+   + S++P      SYGGW++ + +P+  W+  TF+ IG A GG  D+AK+ +
Subjt:  YCVEIAKLESSGRCYKSIVPAESKVSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL

A0A5D3C8A5 Fanconi anemia group M protein isoform X64.7e-0451.22Show/hide
Query:  VSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL
        V  +GGW+++KNLP+D W  +TF+ IG  +GGL DIA + L
Subjt:  VSSYGGWIKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKAL

A0A6J1FQ78 uncharacterized protein LOC1114472423.0e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

A0A6J1JNI2 uncharacterized protein LOC1114869133.0e-0681.82Show/hide
Query:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV
        GCLL+D  GN+QEI IHDFLRFLVHPNENL+FV
Subjt:  GCLLDDHDGNRQEINIHDFLRFLVHPNENLIFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACAGGTTGTTTGCTTGATGACCATGATGGAAATAGACAAGAAATTAACATTCATGACTTTCTGAGATTTTTGGTTCATCCTAATGAAAATTTAATATTTGTGCA
CACTACTTTGGCTTTTGTCGACGATCTAACAACTTTTTTACTCACCCTTCAACCAAAAGTGTTATTCAAACAGGAACACGTCGATAGTCATGTCTTTTGGGTTGAAAAAC
TACTGAATCGTCGCGGCTATTGTGTCGAGATTGCTAAATTGGAAAGTAGTGGGAGATGCTACAAATCGATTGTCCCTGCAGAGTCAAAAGTTTCTTCTTATGGAGGTTGG
ATCAAGGTAAAGAATCTTCCTATTGACAGATGGAGTATTGAAACTTTTAAGTTCATCGGACAAGCCTATGGTGGCCTATGTGATATTGCCAAAAAAGCTCTCTTGAATGG
ACATGATGGAGGTGTGTTCGAAGGCATTCATGGAAATAAGTCCAAGAAGCCCATGTGTCAATCTGTCCTATTTGATACTAGTTTGCCTCCCTATGTCACGCGCGAGATTC
AGTTTAAGCCTCTACCCACCTTTATCCATTCACAGTCCATATCGTACCCCAAAGCATCATCCCCCTCTTTACTTAGCGGACAAAAAGTCATCTCTTCCTCTACATATGAT
CCCGAGACCTCAACCGGGCCAAATATTATTGGCCAAACCAACCAATCACCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTACAGGTTGTTTGCTTGATGACCATGATGGAAATAGACAAGAAATTAACATTCATGACTTTCTGAGATTTTTGGTTCATCCTAATGAAAATTTAATATTTGTGCA
CACTACTTTGGCTTTTGTCGACGATCTAACAACTTTTTTACTCACCCTTCAACCAAAAGTGTTATTCAAACAGGAACACGTCGATAGTCATGTCTTTTGGGTTGAAAAAC
TACTGAATCGTCGCGGCTATTGTGTCGAGATTGCTAAATTGGAAAGTAGTGGGAGATGCTACAAATCGATTGTCCCTGCAGAGTCAAAAGTTTCTTCTTATGGAGGTTGG
ATCAAGGTAAAGAATCTTCCTATTGACAGATGGAGTATTGAAACTTTTAAGTTCATCGGACAAGCCTATGGTGGCCTATGTGATATTGCCAAAAAAGCTCTCTTGAATGG
ACATGATGGAGGTGTGTTCGAAGGCATTCATGGAAATAAGTCCAAGAAGCCCATGTGTCAATCTGTCCTATTTGATACTAGTTTGCCTCCCTATGTCACGCGCGAGATTC
AGTTTAAGCCTCTACCCACCTTTATCCATTCACAGTCCATATCGTACCCCAAAGCATCATCCCCCTCTTTACTTAGCGGACAAAAAGTCATCTCTTCCTCTACATATGAT
CCCGAGACCTCAACCGGGCCAAATATTATTGGCCAAACCAACCAATCACCATAA
Protein sequenceShow/hide protein sequence
MSTGCLLDDHDGNRQEINIHDFLRFLVHPNENLIFVHTTLAFVDDLTTFLLTLQPKVLFKQEHVDSHVFWVEKLLNRRGYCVEIAKLESSGRCYKSIVPAESKVSSYGGW
IKVKNLPIDRWSIETFKFIGQAYGGLCDIAKKALLNGHDGGVFEGIHGNKSKKPMCQSVLFDTSLPPYVTREIQFKPLPTFIHSQSISYPKASSPSLLSGQKVISSSTYD
PETSTGPNIIGQTNQSP