CuGenDBv2

Moc03g05670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g05670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: CACTA en-spm transposon protein
Genome location: chr3:4198914..4199948
RNA-Seq Expression: Moc03g05670
Synteny: Moc03g05670
Gene Ontology terms: NA
InterPro domains: NA
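
The genome location string can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the CuGenDB record: the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    print(parse_location("chr3:4198914..4199948"))
    print(span_length("chr3:4198914..4199948"))
```

Under that coordinate assumption the locus spans 1035 positions on chr3.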


Homology
GenBank top hits (e value, % identity, alignment)
XP_022156286.1 uncharacterized protein LOC111023212 [Momordica charantia] (e value: 2.8e-24, % identity: 32.61)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGPL
        +++ +++Q +FK++R+DLH YY +   P  AR  PP+R+T+ EDW  LCDRWETPEWK                               K+K+G +IGP+
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGPL

Query:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALEEQLALTRAELT
         LF E+ Y+   G  N  AE+ Y+ +  L + P  EG EP T+P+  R + G R  +V G+ +G   QP   KRG SS                      
Subjt:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALEEQLALTRAELT

Query:  SNNARMDIMHENELKTRAELESIGERELKT
          N     ++E EL+ + E   +  RE+KT
Subjt:  SNNARMDIMHENELKTRAELESIGERELKT

XP_022157107.1 uncharacterized protein LOC111023906 isoform X2 [Momordica charantia] (e value: 3.8e-21, % identity: 45.99)
Query:  MSDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQ
        MS    ++Y++  L+TTFKEFRA+LHA+YK+   P VAR KP  RIT L+DW KLCDRWETP+WK                               K ++
Subjt:  MSDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQ

Query:  GVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        GVEIGP+++FKETR HP KGW +   E TY   VR+K
Subjt:  GVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK

XP_038887408.1 poly [ADP-ribose] polymerase 1-like isoform X1 [Benincasa hispida] (e value: 9.1e-23, % identity: 29.82)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK                               K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

XP_038887409.1 poly [ADP-ribose] polymerase 1-like isoform X2 [Benincasa hispida] (e value: 9.1e-23, % identity: 29.82)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK                               K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

XP_038887413.1 uncharacterized protein LOC120077557 isoform X5 [Benincasa hispida] (e value: 9.1e-23, % identity: 29.82)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP
        K+Y+ + +Q TFKE+R+DL+ +Y++   P+ AR  PP+RIT   DW  LC+RWETPEWK                               K+K+G ++  
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGP

Query:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN
        + LF+++ +    GW N  A++ Y ++ RL E    E   P +  +V + + GHRSGY+ G+  G+  +P    S  ++ Q  + LE+++     E+   
Subjt:  LQLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSN

Query:  NARMDIMHENELKTRAEL
         A  + M E  +   ++L
Subjt:  NARMDIMHENELKTRAEL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TFG0 Transposon protein, putative, CACTA, En/Spm sub-class (e value: 8.0e-17, % identity: 36.23)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKKVKQGVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        ++YL +++Q  F+EFRADLH YY +      AR  PP RIT  EDW  +CDRWET  WK  K+G ++  +++F ET +   +GW N  A++ Y ++ R+ 
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKKVKQGVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK

Query:  EEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQ
         E    G +  +  +    + G RS      R G +L+
Subjt:  EEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQ

A0A5A7US78 Uncharacterized protein (e value: 5.0e-19, % identity: 34.39)
Query:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKKVKQGVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        ++YL +++Q TF+EFRA+LH YY +      AR  PP RIT  EDW  +CDRWET  WKK K+G ++  +++F ET +   +GW N  A++ Y ++ R+ 
Subjt:  KRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKKVKQGVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK

Query:  EEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEE
         E    G +  +  +  + + G RS      R G +L+     S +        L+E
Subjt:  EEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEE

A0A5A7V6R4 CACTA en-spm transposon protein (e value: 6.1e-17, % identity: 29.35)
Query:  SDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK---KVKQGVEIGPLQLFKETRYHPTKG-WRNPAAE
        +D    R++  ++ TTFKEFRAD H ++KK + PE AR  PP  +    EDW  LCD + +  ++     ++G  +  ++LF+ET  H   G + + AAE
Subjt:  SDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSL-EDWAKLCDRWETPEWK---KVKQGVEIGPLQLFKETRYHPTKG-WRNPAAE

Query:  ETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELTSNNARMDIMHENELKTRAELES
        + +++++ L+ +P+ EG +P +E ++   + G R GY  G+ WG   + +R  S+ + S   +Q+ ++++ L +A+L     R+++   N     +++E+
Subjt:  ETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQR-TQALEEQLALTRAELTSNNARMDIMHENELKTRAELES

Query:  I
        +
Subjt:  I

A0A6J1DTP1 uncharacterized protein LOC111023906 isoform X2 (e value: 1.8e-21, % identity: 45.99)
Query:  MSDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQ
        MS    ++Y++  L+TTFKEFRA+LHA+YK+   P VAR KP  RIT L+DW KLCDRWETP+WK                               K ++
Subjt:  MSDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQ

Query:  GVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK
        GVEIGP+++FKETR HP KGW +   E TY   VR+K
Subjt:  GVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRLK

A0A6J1DUH3 uncharacterized protein LOC111023212 (e value: 1.4e-24, % identity: 32.61)
Query:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGPL
        +++ +++Q +FK++R+DLH YY +   P  AR  PP+R+T+ EDW  LCDRWETPEWK                               K+K+G +IGP+
Subjt:  RYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWK-------------------------------KVKQGVEIGPL

Query:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALEEQLALTRAELT
         LF E+ Y+   G  N  AE+ Y+ +  L + P  EG EP T+P+  R + G R  +V G+ +G   QP   KRG SS                      
Subjt:  QLFKETRYHPTKGWRNPAAEETYDKLVRLKEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQP---KRGGSSQAFSQRTQALEEQLALTRAELT

Query:  SNNARMDIMHENELKTRAELESIGERELKT
          N     ++E EL+ + E   +  RE+KT
Subjt:  SNNARMDIMHENELKTRAELESIGERELKT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCTGATCATGACGATAAAAGATATTTGTACAAGGAGTTACAAACTACGTTTAAAGAATTTAGGGCGGACCTACATGCCTACTACAAGAAGCGAGCAAGTCCT
GAGGTAGCTCGTCTAAAACCACCTCAACGTATTACGAGTTTAGAGGACTGGGCTAAGTTATGCGATAGATGGGAGACTCCTGAGTGGAAGAAAGTGAAACAAGGC
GTTGAAATTGGTCCACTACAATTATTCAAAGAGACGAGATATCATCCGACTAAGGGTTGGAGAAATCCTGCTGCTGAAGAAACATATGACAAGTTGGTCAGGTTG
AAAGAAGAACCAGTAGCTGAAGGAGAGGAACCGCCCACAGAACCACAAGTGGCCAGAACCATATTTGGACATCGATCAGGATACGTCACGGGCATGAGGTGGGGT
GTCACGCTACAACCCAAGAGAGGAGGCTCATCGCAAGCTTTTTCTCAACGAACACAAGCATTAGAAGAACAATTAGCCTTGACAAGAGCTGAATTAACAAGTAAT
AATGCTCGAATGGATATTATGCATGAGAACGAACTTAAGACACGTGCAGAATTAGAAAGTATAGGGGAGAGGGAACTTAAGACGAGTGCAGAATTAGAAAATTTT
CGACGCGTGTTTGAAAATTACGTTAGTAGTCAACGGAGCCCAAGATCATCTTCTGAATAG
mRNA sequence
ATGTCTGATCATGACGATAAAAGATATTTGTACAAGGAGTTACAAACTACGTTTAAAGAATTTAGGGCGGACCTACATGCCTACTACAAGAAGCGAGCAAGTCCT
GAGGTAGCTCGTCTAAAACCACCTCAACGTATTACGAGTTTAGAGGACTGGGCTAAGTTATGCGATAGATGGGAGACTCCTGAGTGGAAGAAAGTGAAACAAGGC
GTTGAAATTGGTCCACTACAATTATTCAAAGAGACGAGATATCATCCGACTAAGGGTTGGAGAAATCCTGCTGCTGAAGAAACATATGACAAGTTGGTCAGGTTG
AAAGAAGAACCAGTAGCTGAAGGAGAGGAACCGCCCACAGAACCACAAGTGGCCAGAACCATATTTGGACATCGATCAGGATACGTCACGGGCATGAGGTGGGGT
GTCACGCTACAACCCAAGAGAGGAGGCTCATCGCAAGCTTTTTCTCAACGAACACAAGCATTAGAAGAACAATTAGCCTTGACAAGAGCTGAATTAACAAGTAAT
AATGCTCGAATGGATATTATGCATGAGAACGAACTTAAGACACGTGCAGAATTAGAAAGTATAGGGGAGAGGGAACTTAAGACGAGTGCAGAATTAGAAAATTTT
CGACGCGTGTTTGAAAATTACGTTAGTAGTCAACGGAGCCCAAGATCATCTTCTGAATAG
Protein sequence
MSDHDDKRYLYKELQTTFKEFRADLHAYYKKRASPEVARLKPPQRITSLEDWAKLCDRWETPEWKKVKQGVEIGPLQLFKETRYHPTKGWRNPAAEETYDKLVRL
KEEPVAEGEEPPTEPQVARTIFGHRSGYVTGMRWGVTLQPKRGGSSQAFSQRTQALEEQLALTRAELTSNNARMDIMHENELKTRAELESIGERELKTSAELENF
RRVFENYVSSQRSPRSSSE
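
One way to check that the CDS and protein records agree is to translate the CDS and compare the result with the listed protein. The sketch below is an illustration rather than part of the CuGenDB record: it assumes the standard genetic code, the helper name `translate` is hypothetical, and only the opening codons and the final CDS line are used as inputs to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First nine codons of the CDS listed above.
    print(translate("ATGTCTGATCATGACGATAAAAGATAT"))
    # Final line of the CDS listed above, ending in the TAG stop codon.
    print(translate("CGACGCGTGTTTGAAAATTACGTTAGTAGTCAACGGAGCCCAAGATCATCTTCTGAATAG"))
```

The two fragments translate to `MSDHDDKRY` and `RRVFENYVSSQRSPRSSSE`, which match the start and end of the protein sequence given above.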