; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18760
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr2:13976840..13980666
RNA-Seq ExpressionMoc02g18760
SyntenyMoc02g18760
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia]1.1e-3539.72Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESL---------------TESW-
        M V +SFK E  SK+ MRLKLFP+SLRD    WL+SLP ESITSW+DLAE FLM+YFPP+KNA+ R++INNFQQ   ES                  SW 
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESL---------------TESW-

Query:  -------ERIKGLLEKCPYHGIPRFMKNAI---------ASEAGAS--KAKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQN----------QQTTQ
               +  K L+E   Y  +   ++N            S AGAS     VN +Q   C + E +H + NCPGNP S    G +          QQ  Q
Subjt:  -------ERIKGLLEKCPYHGIPRFMKNAI---------ASEAGAS--KAKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQN----------QQTTQ

Query:  KPPETLSLEDMFKAYMTKNDANVQRQAALLKKRS-------------------------KRDGKEQFKTLTLRSGRALPPAY
              SLE + K YM  NDA V+RQ + L+                            KRDGKEQ K LTL SG+ALPP +
Subjt:  KPPETLSLEDMFKAYMTKNDANVQRQAALLKKRS-------------------------KRDGKEQFKTLTLRSGRALPPAY

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]8.7e-3844.88Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        M V +SFK E  S E +RLKLFPYSLRD    WL+SLP+ESITSW+DLAE FLM+YFPPSKNA+ RS+INNFQQ   ES++ESWE  K LL+ CP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

Query:  RFM--------------------------KNAIASE---------------------------AGASKAKVNAMQSGFCPYCESEHQFENCPGNPASIFY
        R +                          K  + SE                           A    A VN +Q   C +CE +H + NCPGNP S++Y
Subjt:  RFM--------------------------KNAIASE---------------------------AGASKAKVNAMQSGFCPYCESEHQFENCPGNPASIFY

Query:  LGQNQ
        LG  Q
Subjt:  LGQNQ

XP_030494694.1 uncharacterized protein LOC115710474 [Cannabis sativa]4.0e-3532.44Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        ++V+DSFK++  S+E +RLKLFP+SLRD   AWL++LP +S+T+WNDLAE FL +YFPP++NA+ RS+I +FQQL  E+ +++WER K LL KCP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

Query:  ---------------RFMKNAIASEAGASKAKVNAMQ---------------------------------SGFCPYCESEHQFENCPGNPASIFYLGQ--
                       R + +A  + A  SK+   A +                                    C YC   H FENCP NPAS+ Y+G   
Subjt:  ---------------RFMKNAIASEAGASKAKVNAMQ---------------------------------SGFCPYCESEHQFENCPGNPASIFYLGQ--

Query:  ----------------------------------------------NQQTTQKPP------ETLSLEDMFKAYMTKNDANVQRQAALLK-----------
                                                      +QQ   + P      +T SLE + + YMTKNDA +Q QAA L+           
Subjt:  ----------------------------------------------NQQTTQKPP------ETLSLEDMFKAYMTKNDANVQRQAALLK-----------

Query:  --------------KRSKRDGKEQFKTLTLRSGRAL
                      +  +RD KE  K +TLRSG+ +
Subjt:  --------------KRSKRDGKEQFKTLTLRSGRAL

XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]5.8e-3430.95Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        ++V+DSFK++  S+E +R KLFP+SLRD   AWL++LP +S+T+WNDLAE FL +YFPP++NA+ RS+I +FQQ   E+ +++WER K +L KCP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

Query:  ----------------RFMKNAIASEAGASKA---------------------------------KVNA-----------------MQSG----------
                        R + +A A+ A  SK+                                 +V+A                 M  G          
Subjt:  ----------------RFMKNAIASEAGASKA---------------------------------KVNA-----------------MQSG----------

Query:  ----FCPYCESEHQFENCPGNPASIFYLG-------QNQQTTQKPP-----------------ETLSLEDMFKAYMTKNDANVQRQAALLK---------
             C YC   H FENCP NPAS+ Y+G       Q Q     PP                 +T SLE + + YM KNDA +Q QAA L+         
Subjt:  ----FCPYCESEHQFENCPGNPASIFYLG-------QNQQTTQKPP-----------------ETLSLEDMFKAYMTKNDANVQRQAALLK---------

Query:  ----------------KRSKRDGKEQFKTLTLRSGRALPPAYQQEGDSEGEIARPHISEESEQATRSVDSASG-DLRENAKQLGDKKDSGT
                        +  +RDGKE  K +TLRSG+ +       G  E    +    ++ + A  +V+   G D+  +A + G  K + T
Subjt:  ----------------KRSKRDGKEQFKTLTLRSGRALPPAYQQEGDSEGEIARPHISEESEQATRSVDSASG-DLRENAKQLGDKKDSGT

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]1.4e-3229.98Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        ++V+DSFK++  S+E +RLKLFP+SLRD   AWL++LP +S+T+WNDLAENFL +YFPP++NA+ RS+I +FQQL  E+ +++WER K LL KCP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

Query:  ----------------RFMKNAIASEAGASKA---------------------------------KVNA-----------------MQSG----------
                        R + +A A+ A  SK+                                 +V+A                 M  G          
Subjt:  ----------------RFMKNAIASEAGASKA---------------------------------KVNA-----------------MQSG----------

Query:  ----FCPYCESEHQFENCPGNPASIFYLG-------------------------------------QNQQTTQKPP------------ETLSLEDMFKAY
             C YC   H FENCP NPAS+ Y+G                                     Q Q     PP            +T SLE + + Y
Subjt:  ----FCPYCESEHQFENCPGNPASIFYLG-------------------------------------QNQQTTQKPP------------ETLSLEDMFKAY

Query:  MTKNDANVQRQAALLK-------------------------KRSKRDGKEQFKTLTLRSGRALPPAYQQEGDSEGEIARPHISEESEQATRSVDSASGDL
        M KNDA +Q QAA L+                         +  +RD KE  K +TLRSG+ +             + R  I E        + S  G++
Subjt:  MTKNDANVQRQAALLK-------------------------KRSKRDGKEQFKTLTLRSGRALPPAYQQEGDSEGEIARPHISEESEQATRSVDSASGDL

Query:  RENAKQL
        ++N +QL
Subjt:  RENAKQL

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189101.1e-2566.3Show/hide
Query:  EIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIPR
        E ++KE +RLKLF +SLRD    WL SLP ESITSW+DLAE FLM+YFPPSKNA+ RS INNFQQ   ES+TESWE  K L++KC +H IPR
Subjt:  EIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIPR

A0A6J1DTD1 uncharacterized protein LOC1110241365.1e-3639.72Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESL---------------TESW-
        M V +SFK E  SK+ MRLKLFP+SLRD    WL+SLP ESITSW+DLAE FLM+YFPP+KNA+ R++INNFQQ   ES                  SW 
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESL---------------TESW-

Query:  -------ERIKGLLEKCPYHGIPRFMKNAI---------ASEAGAS--KAKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQN----------QQTTQ
               +  K L+E   Y  +   ++N            S AGAS     VN +Q   C + E +H + NCPGNP S    G +          QQ  Q
Subjt:  -------ERIKGLLEKCPYHGIPRFMKNAI---------ASEAGAS--KAKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQN----------QQTTQ

Query:  KPPETLSLEDMFKAYMTKNDANVQRQAALLKKRS-------------------------KRDGKEQFKTLTLRSGRALPPAY
              SLE + K YM  NDA V+RQ + L+                            KRDGKEQ K LTL SG+ALPP +
Subjt:  KPPETLSLEDMFKAYMTKNDANVQRQAALLKKRS-------------------------KRDGKEQFKTLTLRSGRALPPAY

A0A6J1E1F3 uncharacterized protein LOC1110250654.2e-3844.88Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        M V +SFK E  S E +RLKLFPYSLRD    WL+SLP+ESITSW+DLAE FLM+YFPPSKNA+ RS+INNFQQ   ES++ESWE  K LL+ CP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

Query:  RFM--------------------------KNAIASE---------------------------AGASKAKVNAMQSGFCPYCESEHQFENCPGNPASIFY
        R +                          K  + SE                           A    A VN +Q   C +CE +H + NCPGNP S++Y
Subjt:  RFM--------------------------KNAIASE---------------------------AGASKAKVNAMQSGFCPYCESEHQFENCPGNPASIFY

Query:  LGQNQ
        LG  Q
Subjt:  LGQNQ

A0A6J1EEI2 uncharacterized protein LOC1114333944.5e-2431.28Show/hide
Query:  VADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIPRF
        V+DSF+ +   K+ +RL LFPYSLRD   +WL++L + +I SWN L E FL++YFPP++NA  R++I  FQQ   ++L+E+WER K +L KCP+HG+P  
Subjt:  VADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIPRF

Query:  MK----------------NAIASEAGASK-----------------------------------------------------------------------
        ++                +A A+ A  SK                                                                       
Subjt:  MK----------------NAIASEAGASK-----------------------------------------------------------------------

Query:  AKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQNQQTTQKP
        A +N   +  C YC  EH F+ CP NPASIFY+G NQ +   P
Subjt:  AKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQNQQTTQKP

U5CUI2 Retrotrans_gag domain-containing protein1.4e-2859Show/hide
Query:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP
        ++V+DSFK++  S+E +RLKLFP+SLRD   +WL++LP +S+T+WNDLAE FL +YFPP++NA+ RS+I +FQQL  ES +++WER K LL KCP+HGIP
Subjt:  MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTGGCTGATTCATTTAAAGTAGAAATATTCAGTAAGGAGGATATGCGCCTGAAGCTATTCCCTTACTCTTTGAGGGATACTGTTGGAGCATGGTTGGATTCCCT
ACCTGTTGAATCGATCACCTCGTGGAATGATTTAGCAGAAAATTTTTTGATGCAGTATTTCCCACCATCCAAGAATGCAGAACTTCGGAGTAAAATTAACAACTTTCAAC
AGCTTCCTAGAGAATCCTTAACTGAATCATGGGAAAGAATCAAGGGACTGCTTGAGAAGTGTCCTTACCATGGGATACCGCGCTTCATGAAAAATGCAATTGCAAGTGAG
GCTGGAGCCTCCAAAGCTAAGGTAAATGCTATGCAGAGTGGTTTCTGTCCATATTGTGAAAGCGAACACCAGTTTGAGAATTGTCCAGGCAATCCTGCCTCAATCTTTTA
CTTGGGACAGAATCAACAAACCACTCAAAAGCCTCCAGAGACGTTGAGCTTGGAAGATATGTTCAAGGCCTACATGACGAAGAATGATGCTAATGTACAGAGGCAGGCCG
CGTTGCTTAAGAAGCGGTCAAAAAGGGATGGCAAGGAGCAATTCAAGACTTTAACACTACGAAGTGGTCGAGCTTTACCTCCAGCATATCAACAAGAAGGAGACAGTGAA
GGAGAGATAGCCAGACCACATATTTCCGAGGAAAGCGAGCAAGCCACACGGTCCGTTGATTCAGCAAGCGGTGATTTGAGAGAGAATGCAAAGCAATTGGGAGATAAAAA
GGACAGTGGAACATAG
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTGGCTGATTCATTTAAAGTAGAAATATTCAGTAAGGAGGATATGCGCCTGAAGCTATTCCCTTACTCTTTGAGGGATACTGTTGGAGCATGGTTGGATTCCCT
ACCTGTTGAATCGATCACCTCGTGGAATGATTTAGCAGAAAATTTTTTGATGCAGTATTTCCCACCATCCAAGAATGCAGAACTTCGGAGTAAAATTAACAACTTTCAAC
AGCTTCCTAGAGAATCCTTAACTGAATCATGGGAAAGAATCAAGGGACTGCTTGAGAAGTGTCCTTACCATGGGATACCGCGCTTCATGAAAAATGCAATTGCAAGTGAG
GCTGGAGCCTCCAAAGCTAAGGTAAATGCTATGCAGAGTGGTTTCTGTCCATATTGTGAAAGCGAACACCAGTTTGAGAATTGTCCAGGCAATCCTGCCTCAATCTTTTA
CTTGGGACAGAATCAACAAACCACTCAAAAGCCTCCAGAGACGTTGAGCTTGGAAGATATGTTCAAGGCCTACATGACGAAGAATGATGCTAATGTACAGAGGCAGGCCG
CGTTGCTTAAGAAGCGGTCAAAAAGGGATGGCAAGGAGCAATTCAAGACTTTAACACTACGAAGTGGTCGAGCTTTACCTCCAGCATATCAACAAGAAGGAGACAGTGAA
GGAGAGATAGCCAGACCACATATTTCCGAGGAAAGCGAGCAAGCCACACGGTCCGTTGATTCAGCAAGCGGTGATTTGAGAGAGAATGCAAAGCAATTGGGAGATAAAAA
GGACAGTGGAACATAG
Protein sequenceShow/hide protein sequence
MQVADSFKVEIFSKEDMRLKLFPYSLRDTVGAWLDSLPVESITSWNDLAENFLMQYFPPSKNAELRSKINNFQQLPRESLTESWERIKGLLEKCPYHGIPRFMKNAIASE
AGASKAKVNAMQSGFCPYCESEHQFENCPGNPASIFYLGQNQQTTQKPPETLSLEDMFKAYMTKNDANVQRQAALLKKRSKRDGKEQFKTLTLRSGRALPPAYQQEGDSE
GEIARPHISEESEQATRSVDSASGDLRENAKQLGDKKDSGT