; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014366 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014366
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationChr02:10036910..10037904
RNA-Seq ExpressionHG10014366
SyntenyHG10014366
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030477908.1 uncharacterized protein LOC115694945 [Cannabis sativa]3.6e-1831.84Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL------VIEP--------------------------Q
        G+ L KS NEA +I++ IA+NN  W +T   PT++ VA + +VD + ++  Q+ ++ N L NL       I+P                           
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL------VIEP--------------------------Q

Query:  KSVCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTA-PPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AA
        +SVCY+G Q        + N YN+ W++HPNF+W   G+S++T     R A PPGF   PR    AQ   P      ++  L++DYMA+ND       A+
Subjt:  KSVCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTA-PPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AA

Query:  LRNLEMQ-------------------------NGNEQCKAVTLRS
        LRNLE+Q                         +G EQCK++ LRS
Subjt:  LRNLEMQ-------------------------NGNEQCKAVTLRS

XP_030478190.1 uncharacterized protein LOC115695250 [Cannabis sativa]1.1e-1934.43Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEP--------------------------QKS
        G  L KS NEA +I++ IA+NN  W  T  TPT++ VA + +VD L ++  Q+ +I N L N+     ++P                            S
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEP--------------------------QKS

Query:  VCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPG-SSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALR
        VCYVG Q      + Y N YN  W+ HPNF+W   G SS+  + Q  +  PPGF   PR Q   QP   P   + ++ +L++DYMA+ND       A+LR
Subjt:  VCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPG-SSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALR

Query:  NLEMQ-------------------------NGNEQCKAVTLRSE
        NLE+Q                         +G E CKAVTLRSE
Subjt:  NLEMQ-------------------------NGNEQCKAVTLRSE

XP_030494874.1 uncharacterized protein LOC115710657 [Cannabis sativa]4.7e-1830.15Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK------SVCYVGYQKAQDYWDTYFNIYNRGW
        G+ L KS NE  +I++ IA+NN  W +T   PT++ V  + +VD + ++  Q+ ++ N L NL    +K      SVCY+G Q        + N YN+ W
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK------SVCYVGYQKAQDYWDTYFNIYNRGW

Query:  RSHPNFAWAKPGSSTNTKHQSTRTA-PPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALRNLEMQ-------------------
        ++HPN +W   G+S++T     R A PPGF   PR    AQ   P      ++  L++DYMA+ND       A+LRNLE+Q                   
Subjt:  RSHPNFAWAKPGSSTNTKHQSTRTA-PPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALRNLEMQ-------------------

Query:  ------NGNEQCKAVTLRSELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTSVEKKKVVDEDKKKDDVEPI
              +G EQC A+ LRS     G  + N++++        +  + + TS  ++KK      ++  D  P+
Subjt:  ------NGNEQCKAVTLRSELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTSVEKKKVVDEDKKKDDVEPI

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]1.9e-1931.25Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEPQKS--------VCYVGYQKAQDYWDTYFN
        G+ L KS NEA +I++ IA+NN  W  T   PT++ VA + +VD L ++  Q+ ++ N L N+     ++P +         +CYVG Q      + Y N
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEPQKS--------VCYVGYQKAQDYWDTYFN

Query:  IYNRGWRSHPNFAWAKPG-SSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALRNLEMQ-------------
         YN  W+ HPNF+W   G SS+  + Q  ++ PPGF   PR Q   QP  P   ++ ++  L++DYMA+ND       A+LRNLE+Q             
Subjt:  IYNRGWRSHPNFAWAKPG-SSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AALRNLEMQ-------------

Query:  ------------NGNEQCKAVTLR------SELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTSVEKKKVV
                    +G E CKAVTLR      S +  +G + P++  +  E      ++A +I        K++
Subjt:  ------------NGNEQCKAVTLR------SELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTSVEKKKVV

XP_030509259.1 uncharacterized protein LOC115723937 [Cannabis sativa]1.6e-1832.79Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEP--------------------------QKS
        G+ L KS NEA +I++ IA+NN  W  T   PT++ VA + +VD L ++  Q+ ++ N L N+     ++P                            S
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNL----VIEP--------------------------QKS

Query:  VCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN--TKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AAL
        VCYVG Q      + Y N YN  W+ HPNF+W   G+S++   + Q  ++ PPGF   PR Q   QP  P   ++ ++  L++DYMA+ND       A+L
Subjt:  VCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN--TKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARND-------AAL

Query:  RNLEMQ-------------------------NGNEQCKAVTLRS
        RNLE+Q                         +G E CKAVTLRS
Subjt:  RNLEMQ-------------------------NGNEQCKAVTLRS

TrEMBL top hitse value%identityAlignment
A0A6J1DAE9 uncharacterized protein LOC1110185141.3e-1027.21Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTK--TVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQKS-------------------VC-------
        G++ KK+ NE   I++ +A++N+ W      P  K    A +  +D   SM +++  +   L  L +E + +                   VC       
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTK--TVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQKS-------------------VC-------

Query:  -------------YVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN-TKHQSTRTAPPGFPTV----------PRQ--QPRAQPPVPPPEKSPTM
                     YVG+   + + + Y N YN G R HPNF+W   GSS+  T+ Q+ +   P  P+           P+Q  Q +  P  P    + ++
Subjt:  -------------YVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN-TKHQSTRTAPPGFPTV----------PRQ--QPRAQPPVPPPEKSPTM

Query:  VDLLKDYMARND--------------AALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMP
         ++ K+YMARND              A +RNLE+Q                         +G EQCKAVTLRS L YEGP+MP
Subjt:  VDLLKDYMARND--------------AALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMP

A0A6J1DY39 uncharacterized protein LOC1110256532.5e-1226.49Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTV--AEIQDVDTLASMNQQLMAIGNTLHNL------------------VIEPQKSVC--------
        G +  KS NE  +I+D ++ +N  W   +    +K    A +  +D + SM +Q+  I   L N+                  V +  +S C        
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTV--AEIQDVDTLASMNQQLMAIGNTLHNL------------------VIEPQKSVC--------

Query:  ------------YVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKH-QSTRTA--PPGFPT------VPRQQPRAQPPVPPPEKSPTMVDLL-
                    YVG Q  Q  ++ Y N YN GW+ HPNF+W+  GSS  T H Q  + A  PPGFP        P Q  + +  V P +++ + +++L 
Subjt:  ------------YVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKH-QSTRTA--PPGFPT------VPRQQPRAQPPVPPPEKSPTMVDLL-

Query:  ----------------------------KDYMARNDAALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMPNAQDQCP
                                    KDYM RND  +R LEMQ                          G E C ++  RS L+YEGPRMP+     P
Subjt:  ----------------------------KDYMARNDAALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMPNAQDQCP

Query:  EK
         +
Subjt:  EK

A0A6J1DYG0 uncharacterized protein LOC1110257641.1e-0925.91Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGD--TEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLV----------IEPQK-------------------
        G++ KK+ NE   I++ +A++N+ W    +   P  +  A +  +D  +SM ++ + +   L  +V          I+P +                   
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGD--TEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLV----------IEPQK-------------------

Query:  ---------------SVCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN-TKHQSTRTAPPGFPTV------PRQQPRAQPPVPPPE-KSPTM
                       SV YVG+   +++ + Y N YN GWR HPNF+W   G S    + QS +   P  P        P+Q+   +   PP +  +  +
Subjt:  ---------------SVCYVGYQKAQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTN-TKHQSTRTAPPGFPTV------PRQQPRAQPPVPPPE-KSPTM

Query:  VDLLKDYMARND-------AALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTS
         +++K+YMAR D       A++RN E Q                          G EQCKAVTLRS L Y+ P MP A  Q P   ST  +       T+
Subjt:  VDLLKDYMARND-------AALRNLEMQ-------------------------NGNEQCKAVTLRSELEYEGPRMPNAQDQCPEKWSTSVSNAKDITSTS

Query:  VEKKKVVDEDKKKDDVEPIVQEKENNAV
         EK  +   ++    V P + EK   A+
Subjt:  VEKKKVVDEDKKKDDVEPIVQEKENNAV

A0A6J1EEI2 uncharacterized protein LOC1114333942.3e-1030.97Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK-------------------------------
        G+ L K+ NEA +I++ IA+NN  W D    P  KT   + +VD L+S+N QL ++ N L NL +                                   
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK-------------------------------

Query:  ------SVCYVGYQKAQ--DYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTAPPGFP-----TVPRQQPRAQPPVPPPEK---SPTMVDLLKDY
              S+ YVG Q +Q     + + N YN GWR+HPNF+W   G S N +       PPGF          QQ   Q    P  +     ++  L+K+Y
Subjt:  ------SVCYVGYQKAQ--DYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTAPPGFP-----TVPRQQPRAQPPVPPPEK---SPTMVDLLKDY

Query:  MARND-------AALRNLEMQNGNEQ
        MA+ND       A+LRNLE+Q    Q
Subjt:  MARND-------AALRNLEMQNGNEQ

A0A6J1EQ90 uncharacterized protein LOC1114364115.1e-1030.84Show/hide
Query:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK-------------------------------
        G+ L K+ NEA +I++ IA+NN  W D    P  KT   + +VD L+S+N QL ++ N L NL +                                   
Subjt:  GSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQK-------------------------------

Query:  ------SVCYVGYQKAQDYW--DTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMV---------DLLKD
              S+ YVG Q +Q     + + N YN GWR+HPNF+W K  S  N +       P GF  +  Q   +   V    K  T            L+K+
Subjt:  ------SVCYVGYQKAQDYW--DTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMV---------DLLKD

Query:  YMARND-------AALRNLEMQNGNEQ
        YMA+ND       A+LRNLE+Q G E+
Subjt:  YMARND-------AALRNLEMQNGNEQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCACATCACGGACTCCCAGCTTGCATTGTGGTGGAACAGTTTTACAATGGATTAAATCAAGCCTCGAAAACAATGGTGAATGCGTCCGCAGAGGGTCATGGCTTAA
GAAATCCGTGAATGAAGCACAAAAGATTATGGATGCTATTGCTAACAACAATCAGAATTGGGGAGATACAGAGTACACTCCAACCACAAAGACGGTGGCTGAAATACAAG
ACGTTGATACACTCGCGTCCATGAATCAACAGCTGATGGCCATAGGAAATACGTTACATAATTTGGTTATAGAACCACAAAAATCCGTGTGTTATGTAGGTTACCAAAAG
GCGCAAGATTATTGGGATACCTATTTTAATATATACAATCGGGGATGGAGATCGCATCCAAATTTCGCTTGGGCAAAACCAGGTAGCAGTACCAACACAAAACATCAGAG
CACAAGGACAGCACCACCAGGATTCCCTACAGTACCACGTCAGCAGCCGCGAGCACAACCACCAGTGCCTCCCCCTGAGAAGAGCCCTACAATGGTTGATCTGTTGAAAG
ATTACATGGCCAGGAACGATGCAGCTCTGCGTAACTTGGAGATGCAGAATGGAAATGAGCAATGCAAGGCAGTCACATTAAGGAGCGAACTAGAGTACGAAGGACCAAGA
ATGCCTAATGCTCAAGATCAATGTCCAGAGAAATGGTCAACAAGCGTCAGCAATGCGAAAGACATCACATCAACATCTGTGGAGAAGAAAAAGGTTGTTGACGAAGACAA
AAAGAAGGATGATGTTGAACCAATAGTTCAGGAAAAAGAAAATAATGCAGTATATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCACATCACGGACTCCCAGCTTGCATTGTGGTGGAACAGTTTTACAATGGATTAAATCAAGCCTCGAAAACAATGGTGAATGCGTCCGCAGAGGGTCATGGCTTAA
GAAATCCGTGAATGAAGCACAAAAGATTATGGATGCTATTGCTAACAACAATCAGAATTGGGGAGATACAGAGTACACTCCAACCACAAAGACGGTGGCTGAAATACAAG
ACGTTGATACACTCGCGTCCATGAATCAACAGCTGATGGCCATAGGAAATACGTTACATAATTTGGTTATAGAACCACAAAAATCCGTGTGTTATGTAGGTTACCAAAAG
GCGCAAGATTATTGGGATACCTATTTTAATATATACAATCGGGGATGGAGATCGCATCCAAATTTCGCTTGGGCAAAACCAGGTAGCAGTACCAACACAAAACATCAGAG
CACAAGGACAGCACCACCAGGATTCCCTACAGTACCACGTCAGCAGCCGCGAGCACAACCACCAGTGCCTCCCCCTGAGAAGAGCCCTACAATGGTTGATCTGTTGAAAG
ATTACATGGCCAGGAACGATGCAGCTCTGCGTAACTTGGAGATGCAGAATGGAAATGAGCAATGCAAGGCAGTCACATTAAGGAGCGAACTAGAGTACGAAGGACCAAGA
ATGCCTAATGCTCAAGATCAATGTCCAGAGAAATGGTCAACAAGCGTCAGCAATGCGAAAGACATCACATCAACATCTGTGGAGAAGAAAAAGGTTGTTGACGAAGACAA
AAAGAAGGATGATGTTGAACCAATAGTTCAGGAAAAAGAAAATAATGCAGTATATTGA
Protein sequenceShow/hide protein sequence
MPTSRTPSLHCGGTVLQWIKSSLENNGECVRRGSWLKKSVNEAQKIMDAIANNNQNWGDTEYTPTTKTVAEIQDVDTLASMNQQLMAIGNTLHNLVIEPQKSVCYVGYQK
AQDYWDTYFNIYNRGWRSHPNFAWAKPGSSTNTKHQSTRTAPPGFPTVPRQQPRAQPPVPPPEKSPTMVDLLKDYMARNDAALRNLEMQNGNEQCKAVTLRSELEYEGPR
MPNAQDQCPEKWSTSVSNAKDITSTSVEKKKVVDEDKKKDDVEPIVQEKENNAVY