CuGenDBv2

MC03g0208 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC03g0208
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: CACTA en-spm transposon protein
Genome location: MC03:4190163..4191068
RNA-Seq Expression: MC03g0208
Synteny: MC03g0208
Gene Ontology terms: NA
InterPro domains: IPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia] (e-value 3.24e-47, identity 43.13%)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGPL
        +++ +++Q +FK++R+DLH YY +   P  AR  PP+R+T+ EDW  LCDRWETPEWK     NKK+R+KLP+NHRAGSKSF +L  ELK+K+G +IGP+
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGPL

Query:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALE-----EQLALT
         LF E+ Y+   G  N  AE+ Y+ +  L + P  EG EP T+P+  R + G R  +V G+ +G   QP   KRG SS   S      E     E + + 
Subjt:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALE-----EQLALT

Query:  RAELTSNNARM
          E+ + N R+
Subjt:  RAELTSNNARM

XP_022157107.1 uncharacterized protein LOC111023906 isoform X2 [Momordica charantia] (e-value 1.70e-45, identity 62.6%)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKNK-----KSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        ++Y++  L+TTFKEFRA+LHA+YK+   P VAR KP  RIT L+DW KLCDRWETP+WK K     +SR+KLPYNHRAGSKSFGRL  ELK ++GVEIGP
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKNK-----KSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        +++FKETR HP KGW +   E TY   VR+K
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLK

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e-value 4.93e-41, identity 37.61%)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK     NKKSRSK+PY HR GSKSF ++  E+K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e-value 4.77e-41, identity 37.61%)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK     NKKSRSK+PY HR GSKSF ++  E+K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e-value 2.85e-41, identity 37.61%)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK     NKKSRSK+PY HR GSKSF ++  E+K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T4P0 CACTA en-spm transposon protein (e-value 1.67e-28, identity 33.96%)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        R++  ++ TTFKEFRAD H ++KK + PE AR  PP  +    EDW  LCD + +  ++     NK +R K PYNH +GSKSF +   EL  ++G  +  
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELT
        ++LF+ET  H   G + + AAE+ +++++ L+ +P+ EG +P +E ++   + G R GY  G+ WG   + +R  S+ + S   +Q+ ++++ L +A+L 
Subjt:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELT

Query:  SNNARMDIMHEN
            R+++   N
Subjt:  SNNARMDIMHEN

A0A5D3C6Z8 CACTA en-spm transposon protein (e-value 1.64e-28, identity 33.96%)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        R++  ++ TTFKEFRAD H ++KK + PE AR  PP  +    EDW  LCD + +  ++     NK +R K PYNH +GSKSF +   EL  ++G  +  
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELT
        ++LF+ET  H   G + + AAE+ +++++ L+ +P+ EG +P +E ++   + G R GY  G+ WG   + +R  S+ + S   +Q+ ++++ L +A+L 
Subjt:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELT

Query:  SNNARMDIMHEN
            R+++   N
Subjt:  SNNARMDIMHEN

A0A5D3D3W5 CACTA en-spm transposon protein (e-value 1.03e-28, identity 36.13%)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        R++  ++ TTFKEFRAD H ++KK + PE AR  PP  +    EDW  LCD + +  ++     NK +R K PYNH +GSKSF +   EL  ++G  +  
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQL
        ++LF+ET  H   G + + AAE+ +++++ L+ +P  EG +P +E ++   + G R GY  G+ WG   + +R  S+    +  QAL  Q+
Subjt:  LQLFKETRYHPTKG-WRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQL

A0A6J1DTP1 uncharacterized protein LOC111023906 isoform X2 (e-value 8.23e-46, identity 62.6%)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKNK-----KSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP
        ++Y++  L+TTFKEFRA+LHA+YK+   P VAR KP  RIT L+DW KLCDRWETP+WK K     +SR+KLPYNHRAGSKSFGRL  ELK ++GVEIGP
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKNK-----KSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        +++FKETR HP KGW +   E TY   VR+K
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLK

A0A6J1DUH3 uncharacterized protein LOC111023212 (e-value 1.57e-47, identity 43.13%)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGPL
        +++ +++Q +FK++R+DLH YY +   P  AR  PP+R+T+ EDW  LCDRWETPEWK     NKK+R+KLP+NHRAGSKSF +L  ELK+K+G +IGP+
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-----NKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGPL

Query:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALE-----EQLALT
         LF E+ Y+   G  N  AE+ Y+ +  L + P  EG EP T+P+  R + G R  +V G+ +G   QP   KRG SS   S      E     E + + 
Subjt:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALE-----EQLALT

Query:  RAELTSNNARM
          E+ + N R+
Subjt:  RAELTSNNARM

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
AAAAGATATTTGTACAAGGAGTTACAAACTACGTTTAAAGAATTTAGGGCGGACCTACATGCCTACTACAAGAAGCGAGCAAGTCCTGAGGTAGCTCGTCTAAAACCACC
TCAACGTATTACGAGTTTAGAGGACTGGGCTAAGTTATGCGATAGATGGGAGACTCCTGAGTGGAAGAACAAGAAAAGTAGGTCAAAGCTTCCTTATAACCATCGAGCTG
GGTCAAAATCATTTGGTCGCTTACTACAAGAATTGAAAGTGAAACAAGGCGTTGAAATTGGTCCACTACAATTATTCAAAGAGACGAGATATCATCCGACTAAGGGTTGG
AGAAATCCTGCTGCTGAAGAAACATATGACAAGTTGGTCAGGTTGAAAGAAGAACCAGTAGCTGAAGGAGAGGAACCGCCCACAGAACCACAAGTGGCCAGAACCATATT
TGGACATCGATCAGGATACGTCACGGGCATGAGGTGGGGTGTCACGCTACAACCCAAGAGAGGAGGCTCATCGCAAGCTTTTTCTCAACGAACACAAGCATTAGAAGAAC
AATTAGCCTTGACAAGAGCTGAATTAACAAGTAATAATGCTCGAATGGATATTATGCATGAGAACGAACTTAAGACACGTGCAGAATTA
mRNA sequence
AAAAGATATTTGTACAAGGAGTTACAAACTACGTTTAAAGAATTTAGGGCGGACCTACATGCCTACTACAAGAAGCGAGCAAGTCCTGAGGTAGCTCGTCTAAAACCACC
TCAACGTATTACGAGTTTAGAGGACTGGGCTAAGTTATGCGATAGATGGGAGACTCCTGAGTGGAAGAACAAGAAAAGTAGGTCAAAGCTTCCTTATAACCATCGAGCTG
GGTCAAAATCATTTGGTCGCTTACTACAAGAATTGAAAGTGAAACAAGGCGTTGAAATTGGTCCACTACAATTATTCAAAGAGACGAGATATCATCCGACTAAGGGTTGG
AGAAATCCTGCTGCTGAAGAAACATATGACAAGTTGGTCAGGTTGAAAGAAGAACCAGTAGCTGAAGGAGAGGAACCGCCCACAGAACCACAAGTGGCCAGAACCATATT
TGGACATCGATCAGGATACGTCACGGGCATGAGGTGGGGTGTCACGCTACAACCCAAGAGAGGAGGCTCATCGCAAGCTTTTTCTCAACGAACACAAGCATTAGAAGAAC
AATTAGCCTTGACAAGAGCTGAATTAACAAGTAATAATGCTCGAATGGATATTATGCATGAGAACGAACTTAAGACACGTGCAGAATTA
Protein sequence
KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKNKKSRSKLPYNHRAGSKSFGRLLQELKVKQGVEIGPLQLFKETRYHPTKGW
RNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSNNARMDIMHENELKTRAEL