CuGenDBv2

Moc09g11800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g11800
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:10041437..10043956
RNA-Seq Expression: Moc09g11800
Synteny: Moc09g11800
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domains: NA
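
The genome location field follows a `chromosome:start..end` pattern. The following is a minimal sketch of how such a string could be split into its parts. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state explicitly.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'chrom:start..end' location."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not
        # something the record itself confirms).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'chr9:10041437..10043956'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr9:10041437..10043956")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span of this gene comes to 2520 bp.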


Homology
GenBank top hits
XP_015389226.1 uncharacterized protein LOC107178482 [Citrus sinensis] (e value: 6.2e-30; % identity: 62.07)
Query:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERI
        +AIR YA       H  IV  E+EA  FEL  +MFQMLQT+GQF G P+EDPHLHL+ FLE+ETFYN L+ +TRL+VDASANGALLSKSY EA +ILERI
Subjt:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERI

Query:  SANNYHWSDSRAVTER
        + NNY W  +R  T +
Subjt:  SANNYHWSDSRAVTER

XP_022153147.1 probable serine/threonine-protein kinase PBL11 [Momordica charantia] (e value: 1.2e-49; % identity: 83.46)
Query:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE
        G +AIRAYAA  LH FH VI G EIEAERFEL  VMFQMLQTVG+FFGNP ED HLHLRYFLEI+ FYNGLSEATRLVVDASAN ALLSKSYVEA+DILE
Subjt:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE

Query:  RISANNYHWSDSRAVTERNNHGVNDNE
        RISANNYHWSDSRA  +R+NHGVNDNE
Subjt:  RISANNYHWSDSRAVTERNNHGVNDNE

XP_022158490.1 uncharacterized protein LOC111024970 [Momordica charantia] (e value: 1.0e-56; % identity: 43.65)
Query:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNG----------------LSEATRLVVDASAN
        G + IRAYAAPA+HGFH VI G  IEAERFEL S+MFQMLQTVGQFFGNPSEDPHLHLRYFLE+   +N                 LS+ TR  +++   
Subjt:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNG----------------LSEATRLVVDASAN

Query:  GALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM-KRCTDPEPSCNA----KKFRGPQSENT--------------
         ++ S + +      E  S      SDS+AV ERNNH  NDNEAMA L DQIANL NM K       S N+    +K R P ++NT              
Subjt:  GALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM-KRCTDPEPSCNA----KKFRGPQSENT--------------

Query:  ----------------------EEDATENMPIDDDKAD---------GAGASQEQKLHQGQTVHQKVERQPELE-------------INRGKRAAEVEPA
                               E  T N  +   +A          G  A++ +   + QT   +   +P+LE              ++ KRAAE EPA
Subjt:  ----------------------EEDATENMPIDDDKAD---------GAGASQEQKLHQGQTVHQKVERQPELE-------------INRGKRAAEVEPA

Query:  EEVSIETPLQKKVKVLVEYRPPPPYPQRLQKKTQDLQFDRFLE--------------------------DILAKKRRLEEFETVALTKECSAIL
        +EVSIET + KKVKV VEYR PP YPQRLQKKTQDLQFDRFLE                          DILAKKRRL EFE VALTKEC+AIL
Subjt:  EEVSIETPLQKKVKVLVEYRPPPPYPQRLQKKTQDLQFDRFLE--------------------------DILAKKRRLEEFETVALTKECSAIL

XP_022158768.1 uncharacterized protein LOC111025234 [Momordica charantia] (e value: 5.5e-26; % identity: 32.07)
Query:  IRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERISA
        IRAYAAPA   F+ VIV   IEA+RFEL   MFQMLQ    F G  SEDP+ HL+YF+++       S+    V+DAS+N ALL K Y EA DILE IS 
Subjt:  IRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERISA

Query:  NNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANL--INMKRCTDPEPSCNAKKFRGPQS--------ENTEEDATENMP--------------------
        N +  S SRAV+   + G+ +++ +A L  +I+ L  I MK  +  +   +  K    Q+        E+  ED   N P                    
Subjt:  NNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANL--INMKRCTDPEPSCNAKKFRGPQS--------ENTEEDATENMP--------------------

Query:  ------IDDDKADGAGASQEQKLHQGQT--------------------------VHQKVERQPELEINR------------------GKRAAEVEPAEEV
                +D      A  E  + +  T                          +H  +  + E E++R                        VEP    
Subjt:  ------IDDDKADGAGASQEQKLHQGQT--------------------------VHQKVERQPELEINR------------------GKRAAEVEPAEEV

Query:  SIE-TPLQKKVK----VLVEYRPPPPYPQRLQKKTQDLQFD--------------------------RFLEDILAKKRRLEEFETVALTKECSAIL
        S E  P+ + V     V  EY P PPYP+RLQK+ QD QF                           RFL+DIL KK RL+EF+ V LTKECS IL
Subjt:  SIE-TPLQKKVK----VLVEYRPPPPYPQRLQKKTQDLQFD--------------------------RFLEDILAKKRRLEEFETVALTKECSAIL

XP_030505151.1 uncharacterized protein LOC115720131 [Cannabis sativa] (e value: 3.4e-28; % identity: 44.97)
Query:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI-----------------------------ETFYNGLSE
        +AIR Y AP  +  +  IV  +I+A +FEL  VMFQM+QTVG F G PSEDPHLHL  FLE+                             ETFYNGL+ 
Subjt:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI-----------------------------ETFYNGLSE

Query:  ATRLVVDASANGALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM
        A+++V+DAS NGA+ SKSY +A +ILE I++NNY WS++RA T R   GV + +A+ TL  Q+ ++ N+
Subjt:  ATRLVVDASANGALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM

TrEMBL top hits
A0A6J1DI54 probable serine/threonine-protein kinase PBL11 (e value: 5.9e-50; % identity: 83.46)
Query:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE
        G +AIRAYAA  LH FH VI G EIEAERFEL  VMFQMLQTVG+FFGNP ED HLHLRYFLEI+ FYNGLSEATRLVVDASAN ALLSKSYVEA+DILE
Subjt:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE

Query:  RISANNYHWSDSRAVTERNNHGVNDNE
        RISANNYHWSDSRA  +R+NHGVNDNE
Subjt:  RISANNYHWSDSRAVTERNNHGVNDNE

A0A6J1DVZ9 uncharacterized protein LOC111024970 (e value: 5.0e-57; % identity: 43.65)
Query:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNG----------------LSEATRLVVDASAN
        G + IRAYAAPA+HGFH VI G  IEAERFEL S+MFQMLQTVGQFFGNPSEDPHLHLRYFLE+   +N                 LS+ TR  +++   
Subjt:  GLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNG----------------LSEATRLVVDASAN

Query:  GALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM-KRCTDPEPSCNA----KKFRGPQSENT--------------
         ++ S + +      E  S      SDS+AV ERNNH  NDNEAMA L DQIANL NM K       S N+    +K R P ++NT              
Subjt:  GALLSKSYVEAVDILERISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM-KRCTDPEPSCNA----KKFRGPQSENT--------------

Query:  ----------------------EEDATENMPIDDDKAD---------GAGASQEQKLHQGQTVHQKVERQPELE-------------INRGKRAAEVEPA
                               E  T N  +   +A          G  A++ +   + QT   +   +P+LE              ++ KRAAE EPA
Subjt:  ----------------------EEDATENMPIDDDKAD---------GAGASQEQKLHQGQTVHQKVERQPELE-------------INRGKRAAEVEPA

Query:  EEVSIETPLQKKVKVLVEYRPPPPYPQRLQKKTQDLQFDRFLE--------------------------DILAKKRRLEEFETVALTKECSAIL
        +EVSIET + KKVKV VEYR PP YPQRLQKKTQDLQFDRFLE                          DILAKKRRL EFE VALTKEC+AIL
Subjt:  EEVSIETPLQKKVKVLVEYRPPPPYPQRLQKKTQDLQFDRFLE--------------------------DILAKKRRLEEFETVALTKECSAIL

A0A6J1DX14 uncharacterized protein LOC111025234 (e value: 2.7e-26; % identity: 32.07)
Query:  IRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERISA
        IRAYAAPA   F+ VIV   IEA+RFEL   MFQMLQ    F G  SEDP+ HL+YF+++       S+    V+DAS+N ALL K Y EA DILE IS 
Subjt:  IRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERISA

Query:  NNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANL--INMKRCTDPEPSCNAKKFRGPQS--------ENTEEDATENMP--------------------
        N +  S SRAV+   + G+ +++ +A L  +I+ L  I MK  +  +   +  K    Q+        E+  ED   N P                    
Subjt:  NNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANL--INMKRCTDPEPSCNAKKFRGPQS--------ENTEEDATENMP--------------------

Query:  ------IDDDKADGAGASQEQKLHQGQT--------------------------VHQKVERQPELEINR------------------GKRAAEVEPAEEV
                +D      A  E  + +  T                          +H  +  + E E++R                        VEP    
Subjt:  ------IDDDKADGAGASQEQKLHQGQT--------------------------VHQKVERQPELEINR------------------GKRAAEVEPAEEV

Query:  SIE-TPLQKKVK----VLVEYRPPPPYPQRLQKKTQDLQFD--------------------------RFLEDILAKKRRLEEFETVALTKECSAIL
        S E  P+ + V     V  EY P PPYP+RLQK+ QD QF                           RFL+DIL KK RL+EF+ V LTKECS IL
Subjt:  SIE-TPLQKKVK----VLVEYRPPPPYPQRLQKKTQDLQFD--------------------------RFLEDILAKKRRLEEFETVALTKECSAIL

A0A6J1EEI2 uncharacterized protein LOC111433394 (e value: 1.2e-21; % identity: 32.23)
Query:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFL----------------------------------------
        +AIRAYA PA+   +  I+  E++A  FEL  VMFQMLQT+GQF G PSEDPHLHL+ FL                                        
Subjt:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFL----------------------------------------

Query:  --------------------------------------------------------------EIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE
                                                                      ++ETFYNGL+ AT+ VVDASANGA+LSK+Y EA +ILE
Subjt:  --------------------------------------------------------------EIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE

Query:  RISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM
        RI++NN  W+D R+   R   GV + +A++++  Q+A++ N+
Subjt:  RISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM

U5CUI2 Retrotrans_gag domain-containing protein (e value: 5.5e-24; % identity: 34.71)
Query:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI--------------------------------------
        +AIR YAAP  +  +  IV  EI+A +FEL  VMFQMLQTVGQF G P+EDPHLHLR FLE+                                      
Subjt:  KAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEI--------------------------------------

Query:  ----------------------------------------------------------------ETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE
                                                                        ETFYNGL+ A+R+V+DASANGA+LSKSY EA +ILE
Subjt:  ----------------------------------------------------------------ETFYNGLSEATRLVVDASANGALLSKSYVEAVDILE

Query:  RISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM
         I++NNY WS++RA T R   GV + +A+  L  Q+A++ N+
Subjt:  RISANNYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINM

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
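
The % identity values listed with the hits can be related to the alignment blocks, where the middle line shows a letter for an identical residue, `+` for a similar residue, and a space otherwise. Below is a rough sketch of that calculation, assuming percent identity is the number of identical columns divided by the total number of aligned columns (gap columns included). The function names are hypothetical, and the database may compute its values differently.

```python
from typing import Iterable, Tuple


def count_block(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical_columns, aligned_columns) for one alignment block.

    `query` is the sequence part of a 'Query:' line and `midline` is the
    line directly beneath it. Letters in the midline mark identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    aligned = len(query)  # gap characters ('-') count as aligned columns
    return identical, aligned


def percent_identity(blocks: Iterable[Tuple[str, str]]) -> float:
    """Combine several (query, midline) blocks of one hit into a percentage."""
    identical = aligned = 0
    for query, midline in blocks:
        i, a = count_block(query, midline)
        identical += i
        aligned += a
    return 100.0 * identical / aligned


# Second block of the first GenBank hit (XP_015389226.1) as an example.
block = ("SANNYHWSDSRAVTER", "+ NNY W  +R  T +")
print(count_block(*block))
print(percent_identity([block]))
```

Counting both blocks of that hit in the same way appears to give 72 identical columns out of 116, which is consistent with the listed 62.07.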

Sequences
CDS sequence
ATGTCTGATCTGCAAGGGCTAAAGGCCATCAGGGCCTATGCTGCACCAGCACTTCATGGGTTTCATCTAGTTATAGTAGGTTCAGAGATAGAAGCTGAGAGGTTTGAATT
AAATTCAGTTATGTTCCAGATGCTCCAAACAGTGGGGCAATTCTTTGGAAATCCATCTGAGGACCCTCATCTGCACTTGAGGTATTTTCTGGAAATAGAGACCTTCTACA
ATGGGCTGAGTGAAGCAACACGTCTGGTAGTTGATGCATCGGCTAATGGAGCATTGTTGTCTAAGTCGTACGTAGAAGCAGTTGATATATTGGAAAGAATTTCTGCTAAT
AACTACCACTGGTCAGATTCCAGAGCAGTAACTGAGAGGAACAATCATGGAGTTAATGATAATGAGGCAATGGCTACGCTGAAAGATCAAATTGCCAATTTAATTAACAT
GAAACGATGCACTGATCCAGAGCCAAGCTGCAACGCTAAGAAATTTAGAGGACCACAATCAGAAAATACAGAAGAGGATGCTACAGAGAATATGCCAATTGATGATGATA
AAGCTGACGGTGCTGGAGCATCACAGGAACAGAAGCTACACCAAGGACAAACAGTTCATCAGAAGGTAGAGAGGCAACCTGAGCTAGAGATAAACAGAGGGAAAAGAGCT
GCAGAAGTAGAGCCAGCAGAAGAGGTATCCATAGAAACTCCGTTGCAAAAAAAGGTAAAAGTGCTTGTGGAATACAGACCACCACCTCCATACCCTCAGAGGCTCCAAAA
GAAAACTCAAGACCTGCAATTTGACCGGTTTTTAGAGGATATTCTGGCCAAGAAGAGGAGGTTAGAGGAGTTTGAAACAGTAGCTCTAACAAAGGAATGCAGTGCCATCT
TAATAGAAGAATGTTCTTTGTTGAGAATAGCAGATGATTTGCTGATGGAGGAGATGCAAACTGAGGAGCTATTAGACTAG
mRNA sequence
ATGTCTGATCTGCAAGGGCTAAAGGCCATCAGGGCCTATGCTGCACCAGCACTTCATGGGTTTCATCTAGTTATAGTAGGTTCAGAGATAGAAGCTGAGAGGTTTGAATT
AAATTCAGTTATGTTCCAGATGCTCCAAACAGTGGGGCAATTCTTTGGAAATCCATCTGAGGACCCTCATCTGCACTTGAGGTATTTTCTGGAAATAGAGACCTTCTACA
ATGGGCTGAGTGAAGCAACACGTCTGGTAGTTGATGCATCGGCTAATGGAGCATTGTTGTCTAAGTCGTACGTAGAAGCAGTTGATATATTGGAAAGAATTTCTGCTAAT
AACTACCACTGGTCAGATTCCAGAGCAGTAACTGAGAGGAACAATCATGGAGTTAATGATAATGAGGCAATGGCTACGCTGAAAGATCAAATTGCCAATTTAATTAACAT
GAAACGATGCACTGATCCAGAGCCAAGCTGCAACGCTAAGAAATTTAGAGGACCACAATCAGAAAATACAGAAGAGGATGCTACAGAGAATATGCCAATTGATGATGATA
AAGCTGACGGTGCTGGAGCATCACAGGAACAGAAGCTACACCAAGGACAAACAGTTCATCAGAAGGTAGAGAGGCAACCTGAGCTAGAGATAAACAGAGGGAAAAGAGCT
GCAGAAGTAGAGCCAGCAGAAGAGGTATCCATAGAAACTCCGTTGCAAAAAAAGGTAAAAGTGCTTGTGGAATACAGACCACCACCTCCATACCCTCAGAGGCTCCAAAA
GAAAACTCAAGACCTGCAATTTGACCGGTTTTTAGAGGATATTCTGGCCAAGAAGAGGAGGTTAGAGGAGTTTGAAACAGTAGCTCTAACAAAGGAATGCAGTGCCATCT
TAATAGAAGAATGTTCTTTGTTGAGAATAGCAGATGATTTGCTGATGGAGGAGATGCAAACTGAGGAGCTATTAGACTAG
Protein sequence
MSDLQGLKAIRAYAAPALHGFHLVIVGSEIEAERFELNSVMFQMLQTVGQFFGNPSEDPHLHLRYFLEIETFYNGLSEATRLVVDASANGALLSKSYVEAVDILERISAN
NYHWSDSRAVTERNNHGVNDNEAMATLKDQIANLINMKRCTDPEPSCNAKKFRGPQSENTEEDATENMPIDDDKADGAGASQEQKLHQGQTVHQKVERQPELEINRGKRA
AEVEPAEEVSIETPLQKKVKVLVEYRPPPPYPQRLQKKTQDLQFDRFLEDILAKKRRLEEFETVALTKECSAILIEECSLLRIADDLLMEEMQTEELLD
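
The protein sequence corresponds to the CDS read codon by codon. As an illustration, here is a minimal translation sketch that assumes the standard genetic code. The `translate` helper and the table construction are hypothetical and not part of the database. Only the first nine codons of the CDS are used in the example.

```python
from itertools import product

# Standard genetic code, laid out in T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped
    over several lines can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS sequence shown in this record.
print(translate("ATGTCTGATCTGCAAGGGCTAAAGGCC"))  # MSDLQGLKA
```

The printed residues match the start of the protein sequence listed above (MSDLQGLKA...).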