; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021967 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021967
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCACTA en-spm transposon protein
Genome locationChr05:19100652..19102799
RNA-Seq ExpressionHG10021967
SyntenyHG10021967
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022143616.1 uncharacterized protein LOC111013476 [Momordica charantia]2.0e-2742.16Show/hide
Query:  LLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK----------------------------
        LL +F VDI+QPH+ RYI YEIG RFKDYR  LY+HY+K  +     + P+K     DWNILCD+ ES +WK                            
Subjt:  LLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK----------------------------

Query:  ---ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYERKFSQ
           E M+ L+         KT+EEIM  VLGKRS+Y+ G+ YGPKP R+K +SS YS EYVESLE RL K EE             L+ Q Q + RK  +
Subjt:  ---ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYERKFSQ

Query:  MNEI
        M ++
Subjt:  MNEI

XP_022159083.1 uncharacterized protein LOC111025525 [Momordica charantia]7.7e-2749.61Show/hide
Query:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-
        +G+IK+ WT  QG+PVG  +  FN EIG L R +I  K  K+K+I     + + + LL +F VDI+QPH+ RYI YEIG RFKDYR  L++HY+K  +P 
Subjt:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-

Query:  ---RNPHKYTTANDWNILCDRWESSSWKE
           + P+K      WNILCDRWES +WKE
Subjt:  ---RNPHKYTTANDWNILCDRWESSSWKE

XP_038895319.1 uncharacterized protein LOC120083572 isoform X1 [Benincasa hispida]7.1e-7362.7Show/hide
Query:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-
        MGRIKVTWTPTQGKP+G+MA LFNGEIGVLVRKFIPLKYEKQKDIPNELYDILT+QLLNQFDVDI+QPHIKRYI YEIGNRFKDYRWTLYKHYQKYA+P 
Subjt:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-

Query:  ---RNPHKYTTANDWNILCDRWESSSWK-----------------------------------------------------------------ENMLILR
           RNP+KYTT +DWNILCDRWESSSWK                                                                 ENML+LR
Subjt:  ---RNPHKYTTANDWNILCDRWESSSWK-----------------------------------------------------------------ENMLILR

Query:  QVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS
        +VEK     KTDEEI+VMVLGKRSSYM G  YGPKPPR KEASS
Subjt:  QVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS

XP_038895320.1 uncharacterized protein LOC120083572 isoform X2 [Benincasa hispida]9.7e-6289.15Show/hide
Query:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-
        MGRIKVTWTPTQGKP+G+MA LFNGEIGVLVRKFIPLKYEKQKDIPNELYDILT+QLLNQFDVDI+QPHIKRYI YEIGNRFKDYRWTLYKHYQKYA+P 
Subjt:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-

Query:  ---RNPHKYTTANDWNILCDRWESSSWKE
           RNP+KYTT +DWNILCDRWESSSWKE
Subjt:  ---RNPHKYTTANDWNILCDRWESSSWKE

XP_038895321.1 uncharacterized protein LOC120083572 isoform X3 [Benincasa hispida]9.7e-6289.15Show/hide
Query:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-
        MGRIKVTWTPTQGKP+G+MA LFNGEIGVLVRKFIPLKYEKQKDIPNELYDILT+QLLNQFDVDI+QPHIKRYI YEIGNRFKDYRWTLYKHYQKYA+P 
Subjt:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-

Query:  ---RNPHKYTTANDWNILCDRWESSSWKE
           RNP+KYTT +DWNILCDRWESSSWKE
Subjt:  ---RNPHKYTTANDWNILCDRWESSSWKE

TrEMBL top hitse value%identityAlignment
A0A438G7G3 Uncharacterized protein8.4e-1928.29Show/hide
Query:  PTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKY
        P    P G  +     EIG +VR + PL  EK  DI       + ++L  +F +D TQ H+K+ I  ++  R++D+R   +KH++KY       +NP+K+
Subjt:  PTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKY

Query:  TT-ANDWNILCDRWES-----------------------------------SSWKENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPR
         +    W+ LCDR+ S                                   +S+++ + + +Q   +      + EI V VLG RS Y+KGL +GP+PP 
Subjt:  TT-ANDWNILCDRWES-----------------------------------SSWKENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPR

Query:  SKEASSYSQEYVESLETRLAKTEELLKDQRQGYERKFSQMNEILKKLSEGR
        S   S++       LE  L  T ELL+ Q    + + SQ+ ++   +SE R
Subjt:  SKEASSYSQEYVESLETRLAKTEELLKDQRQGYERKFSQMNEILKKLSEGR

A0A438I7V1 Uncharacterized protein1.0e-2433.18Show/hide
Query:  IKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPN-ELYDILTK-------QLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQK
        + +T  P+ GK  G+     + EIG+ VR+  P++ EK K +P  E+  +L +        +  +F +D+TQ H+K+ +  ++ +RF+++R  L+KH++K
Subjt:  IKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPN-ELYDILTK-------QLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQK

Query:  YAN----PRNPHK-YTTANDWNILCDRWESSSWKENMLIL-RQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASSYSQEYVESLETRLA
        +       RNPH+  +   DW+ LCDR+ S  +KE ML L RQ   + +   T+ EI   VLG++S Y+KGL +GPKP    ++   S E    LE RL 
Subjt:  YAN----PRNPHK-YTTANDWNILCDRWESSSWKENMLIL-RQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASSYSQEYVESLETRLA

Query:  KTEELLKDQRQGYERKFSQMNEI
        +T+ L++ Q+Q  E +  +++++
Subjt:  KTEELLKDQRQGYERKFSQMNEI

A0A6J1CQT5 uncharacterized protein LOC1110134769.9e-2842.16Show/hide
Query:  LLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK----------------------------
        LL +F VDI+QPH+ RYI YEIG RFKDYR  LY+HY+K  +     + P+K     DWNILCD+ ES +WK                            
Subjt:  LLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK----------------------------

Query:  ---ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYERKFSQ
           E M+ L+         KT+EEIM  VLGKRS+Y+ G+ YGPKP R+K +SS YS EYVESLE RL K EE             L+ Q Q + RK  +
Subjt:  ---ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYERKFSQ

Query:  MNEI
        M ++
Subjt:  MNEI

A0A6J1D6S9 uncharacterized protein LOC1110174612.7e-2539.71Show/hide
Query:  ILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK-----------------------
        +LT   + +   DI+Q H+ RYI YEIG RFKDYR  LY+HY+K  +     + P+K     DWNILCD+ ES +WK                       
Subjt:  ILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYAN----PRNPHKYTTANDWNILCDRWESSSWK-----------------------

Query:  --------ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYE
                E M+ L+         KT+EEIM  VLGKRS+Y+ G+ YGPKP R+K +SS YS EYVESLE RL K EE             L+ Q Q + 
Subjt:  --------ENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASS-YSQEYVESLETRLAKTEE------------LLKDQRQGYE

Query:  RKFSQMNEI
        RK  +M ++
Subjt:  RKFSQMNEI

A0A6J1DXU5 uncharacterized protein LOC1110255253.7e-2749.61Show/hide
Query:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-
        +G+IK+ WT  QG+PVG  +  FN EIG L R +I  K  K+K+I     + + + LL +F VDI+QPH+ RYI YEIG RFKDYR  L++HY+K  +P 
Subjt:  MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANP-

Query:  ---RNPHKYTTANDWNILCDRWESSSWKE
           + P+K      WNILCDRWES +WKE
Subjt:  ---RNPHKYTTANDWNILCDRWESSSWKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGGATAAAAGTAACATGGACTCCCACACAAGGCAAGCCAGTTGGAGAAATGGCAATTCTATTTAATGGAGAAATAGGAGTTTTGGTGAGAAAATTCATCCCTTT
AAAATATGAGAAACAAAAAGACATTCCAAATGAGCTTTATGATATTTTAACAAAACAATTGTTGAATCAATTTGATGTTGACATCACTCAGCCACATATTAAGAGATACA
TCAATTACGAAATTGGTAATCGATTTAAAGATTATAGATGGACGTTGTATAAACACTACCAGAAATATGCTAATCCACGAAATCCGCATAAATATACTACTGCTAATGAT
TGGAATATTTTGTGCGATAGATGGGAGTCTTCTTCATGGAAGGAAAATATGTTGATATTAAGGCAAGTTGAAAAAGATTCAAGTGCACCAAAGACTGATGAAGAAATTAT
GGTTATGGTTCTTGGAAAGAGATCATCATACATGAAAGGATTAAGATATGGACCAAAACCACCACGAAGTAAAGAAGCATCGTCTTACTCACAAGAGTATGTCGAATCTC
TAGAGACTCGTCTTGCAAAGACTGAAGAATTATTAAAGGATCAACGCCAAGGGTATGAAAGAAAGTTTAGCCAAATGAATGAAATTCTGAAGAAGTTAAGTGAAGGAAGA
GATGGCAAGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGGATAAAAGTAACATGGACTCCCACACAAGGCAAGCCAGTTGGAGAAATGGCAATTCTATTTAATGGAGAAATAGGAGTTTTGGTGAGAAAATTCATCCCTTT
AAAATATGAGAAACAAAAAGACATTCCAAATGAGCTTTATGATATTTTAACAAAACAATTGTTGAATCAATTTGATGTTGACATCACTCAGCCACATATTAAGAGATACA
TCAATTACGAAATTGGTAATCGATTTAAAGATTATAGATGGACGTTGTATAAACACTACCAGAAATATGCTAATCCACGAAATCCGCATAAATATACTACTGCTAATGAT
TGGAATATTTTGTGCGATAGATGGGAGTCTTCTTCATGGAAGGAAAATATGTTGATATTAAGGCAAGTTGAAAAAGATTCAAGTGCACCAAAGACTGATGAAGAAATTAT
GGTTATGGTTCTTGGAAAGAGATCATCATACATGAAAGGATTAAGATATGGACCAAAACCACCACGAAGTAAAGAAGCATCGTCTTACTCACAAGAGTATGTCGAATCTC
TAGAGACTCGTCTTGCAAAGACTGAAGAATTATTAAAGGATCAACGCCAAGGGTATGAAAGAAAGTTTAGCCAAATGAATGAAATTCTGAAGAAGTTAAGTGAAGGAAGA
GATGGCAAGAACTGA
Protein sequenceShow/hide protein sequence
MGRIKVTWTPTQGKPVGEMAILFNGEIGVLVRKFIPLKYEKQKDIPNELYDILTKQLLNQFDVDITQPHIKRYINYEIGNRFKDYRWTLYKHYQKYANPRNPHKYTTAND
WNILCDRWESSSWKENMLILRQVEKDSSAPKTDEEIMVMVLGKRSSYMKGLRYGPKPPRSKEASSYSQEYVESLETRLAKTEELLKDQRQGYERKFSQMNEILKKLSEGR
DGKN