; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g26510 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g26510
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr5:18994633..18995913
RNA-Seq ExpressionMoc05g26510
SyntenyMoc05g26510
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036141.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]1.8e-3040.18Show/hide
Query:  VIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI-----QGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNNS
        ++G+  A+DLW AIQ+L GV+S+ EED++R     TR   S++     E+Y  ++ T      Q K  ISW +MQ++LL+FEKRLE QN +K     +  
Subjt:  VIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI-----QGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNNS

Query:  VSVNMANSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIAKNLVSMFKLAKDNNVYLEFHA
         +   +NS++N             AF  T N+N F+  PETV+DSNWY D+GA+NHVTADY+++  P +Y G+E V +          A+DNNVYLEFH 
Subjt:  VSVNMANSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIAKNLVSMFKLAKDNNVYLEFHA

Query:  DSCLGKDIRSGEVVLKGDL
        D C   +  +G  +++G L
Subjt:  DSCLGKDIRSGEVVLKGDL

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.9e-3031.98Show/hide
Query:  VIGYENARDLWAAIQELLGVKSQAEEDYIR--------VR----------------------PVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQ
        ++G+ NA+DLW A Q+L GV+S+AEED++R        VR                      PVP R+ ISQ LLGL+E YNP++A IQGK  ISW +MQ
Subjt:  VIGYENARDLWAAIQELLGVKSQAEEDYIR--------VR----------------------PVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQ

Query:  AKLLVFEKRLELQNFHKNTV-LFNNSVSV----------NMANSSRNGVNNNN---------------PGQGT--------SYAFTATQNNNHF------
        ++LL FEKRLE Q+  KNT  +  N V++            +N   +G N NN                G+G          Y  +A    N F      
Subjt:  AKLLVFEKRLELQNFHKNTV-LFNNSVSV----------NMANSSRNGVNNNN---------------PGQGT--------SYAFTATQNNNHF------

Query:  ------------------------------LANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERV-----------------------------
                                       A  +TV++ NWY DSGA+NH+T +Y+++  P+EY G+E++                             
Subjt:  ------------------------------LANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERV-----------------------------

Query:  -----TIAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVL
              I KNLVS+ KLA+DNNVY+EFH   C  KD  +G  +L
Subjt:  -----TIAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVL

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]2.1e-3133.72Show/hide
Query:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG
        +SSSI    +VNPL+E WV                      +G+ N  DLW A Q+  GV+S+AEED++R     TR        GL+E YN ++  IQG
Subjt:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG

Query:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVN------------------NNNPG----------------------
        K  ISW +MQ+KLL+FEKRL+ QN   KNT     S ++NMA     N  RN  N                  NN P                       
Subjt:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVN------------------NNNPG----------------------

Query:  -----------------QGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVT-------------------------
                               F +TQN   F A P+TVVD NWY DSGA+NHVT + ++M  PTEY G+E+VT                         
Subjt:  -----------------QGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVT-------------------------

Query:  ---------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE
                 IAKNL+S+ KLA+DN++Y+EFH   C  KD  +G+
Subjt:  ---------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE

XP_016902204.1 PREDICTED: uncharacterized protein LOC107991581 isoform X4 [Cucumis melo]3.3e-2936.59Show/hide
Query:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG
        +SSSI    +VNPL+E WV                      +G+ N  DLW A Q+  GV+S+AEED++R     TR        GL+E YN ++  IQG
Subjt:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG

Query:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTA
        K  ISW +MQ+KLL+FEKRL+ QN   KNT     S ++NMA     N  RN  N    G    + F+  + N   L N  T        DSGA+NHVT 
Subjt:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTA

Query:  DYNSMVQPTEYRGMERVT----------------------------------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE
        + ++M  PTEY G+E+VT                                  IAKNL+S+ KLA+DN++Y+EFH   C  KD  +G+
Subjt:  DYNSMVQPTEYRGMERVT----------------------------------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]1.0e-3351.63Show/hide
Query:  MFVQQSIRNMETSETITFAPSSSSIAMEVVVNPLYESW----------------------VIGYENARDLWAAIQELLGVKSQAEEDYIRV---------
        MFVQQSI NMETS+T   APSSSSIA E  +NPLYESW                      V+GYENA DLWAAIQEL GV+SQAEEDY+R          
Subjt:  MFVQQSIRNMETSETITFAPSSSSIAMEVVVNPLYESW----------------------VIGYENARDLWAAIQELLGVKSQAEEDYIRV---------

Query:  ---------------------RPVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNN
                              PVPTRSLISQVLLGL+EEYNP+VATIQGKRGISWPEMQA+        + QN +      NN
Subjt:  ---------------------RPVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNN

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X11.0e-3133.72Show/hide
Query:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG
        +SSSI    +VNPL+E WV                      +G+ N  DLW A Q+  GV+S+AEED++R     TR        GL+E YN ++  IQG
Subjt:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG

Query:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVN------------------NNNPG----------------------
        K  ISW +MQ+KLL+FEKRL+ QN   KNT     S ++NMA     N  RN  N                  NN P                       
Subjt:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVN------------------NNNPG----------------------

Query:  -----------------QGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVT-------------------------
                               F +TQN   F A P+TVVD NWY DSGA+NHVT + ++M  PTEY G+E+VT                         
Subjt:  -----------------QGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVT-------------------------

Query:  ---------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE
                 IAKNL+S+ KLA+DN++Y+EFH   C  KD  +G+
Subjt:  ---------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE

A0A1S4E1U9 uncharacterized protein LOC107991581 isoform X41.6e-2936.59Show/hide
Query:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG
        +SSSI    +VNPL+E WV                      +G+ N  DLW A Q+  GV+S+AEED++R     TR        GL+E YN ++  IQG
Subjt:  SSSSIAMEVVVNPLYESWV----------------------IGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQG

Query:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTA
        K  ISW +MQ+KLL+FEKRL+ QN   KNT     S ++NMA     N  RN  N    G    + F+  + N   L N  T        DSGA+NHVT 
Subjt:  KRGISWPEMQAKLLVFEKRLELQNFH-KNTVLFNNSVSVNMA-----NSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTA

Query:  DYNSMVQPTEYRGMERVT----------------------------------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE
        + ++M  PTEY G+E+VT                                  IAKNL+S+ KLA+DN++Y+EFH   C  KD  +G+
Subjt:  DYNSMVQPTEYRGMERVT----------------------------------IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGE

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-941.9e-3031.98Show/hide
Query:  VIGYENARDLWAAIQELLGVKSQAEEDYIR--------VR----------------------PVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQ
        ++G+ NA+DLW A Q+L GV+S+AEED++R        VR                      PVP R+ ISQ LLGL+E YNP++A IQGK  ISW +MQ
Subjt:  VIGYENARDLWAAIQELLGVKSQAEEDYIR--------VR----------------------PVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQ

Query:  AKLLVFEKRLELQNFHKNTV-LFNNSVSV----------NMANSSRNGVNNNN---------------PGQGT--------SYAFTATQNNNHF------
        ++LL FEKRLE Q+  KNT  +  N V++            +N   +G N NN                G+G          Y  +A    N F      
Subjt:  AKLLVFEKRLELQNFHKNTV-LFNNSVSV----------NMANSSRNGVNNNN---------------PGQGT--------SYAFTATQNNNHF------

Query:  ------------------------------LANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERV-----------------------------
                                       A  +TV++ NWY DSGA+NH+T +Y+++  P+EY G+E++                             
Subjt:  ------------------------------LANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERV-----------------------------

Query:  -----TIAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVL
              I KNLVS+ KLA+DNNVY+EFH   C  KD  +G  +L
Subjt:  -----TIAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVL

A0A5D3CPY2 Retrotransposon protein, putative, Ty1-copia subclass8.6e-3140.18Show/hide
Query:  VIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI-----QGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNNS
        ++G+  A+DLW AIQ+L GV+S+ EED++R     TR   S++     E+Y  ++ T      Q K  ISW +MQ++LL+FEKRLE QN +K     +  
Subjt:  VIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI-----QGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNNS

Query:  VSVNMANSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIAKNLVSMFKLAKDNNVYLEFHA
         +   +NS++N             AF  T N+N F+  PETV+DSNWY D+GA+NHVTADY+++  P +Y G+E V +          A+DNNVYLEFH 
Subjt:  VSVNMANSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIAKNLVSMFKLAKDNNVYLEFHA

Query:  DSCLGKDIRSGEVVLKGDL
        D C   +  +G  +++G L
Subjt:  DSCLGKDIRSGEVVLKGDL

A0A6J1D5J0 uncharacterized protein LOC1110175014.9e-3451.63Show/hide
Query:  MFVQQSIRNMETSETITFAPSSSSIAMEVVVNPLYESW----------------------VIGYENARDLWAAIQELLGVKSQAEEDYIRV---------
        MFVQQSI NMETS+T   APSSSSIA E  +NPLYESW                      V+GYENA DLWAAIQEL GV+SQAEEDY+R          
Subjt:  MFVQQSIRNMETSETITFAPSSSSIAMEVVVNPLYESW----------------------VIGYENARDLWAAIQELLGVKSQAEEDYIRV---------

Query:  ---------------------RPVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNN
                              PVPTRSLISQVLLGL+EEYNP+VATIQGKRGISWPEMQA+        + QN +      NN
Subjt:  ---------------------RPVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWPEMQAKLLVFEKRLELQNFHKNTVLFNN

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.6e-0520.98Show/hide
Query:  MFVQQSIRNMETS----ETITFAPSSSSIAMEVVVNPLYESWVIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI
        M VQ ++    T+    ET+    ++ S      +    + W  G +   D       + G+ ++ ++  +  +P+     + +VL  L EEY P++  I
Subjt:  MFVQQSIRNMETS----ETITFAPSSSSIAMEVVVNPLYESWVIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATI

Query:  QGK-RGISWPEMQAKLLVFEKRLELQN------------FHKNTVLFNNSVSVNMAN--SSRNGVNNNNPGQGTSYAFTATQNNN---------------
          K    +  E+  +LL  E ++   +             H+NT   NN+ + N  N   +RN  NN+ P Q +S  F    N +               
Subjt:  QGK-RGISWPEMQAKLLVFEKRLELQN------------FHKNTVLFNNSVSVNMAN--SSRNGVNNNNPGQGTSYAFTATQNNN---------------

Query:  ---------HFLANPET---------------------VVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIA------------------------
                 HFL++  +                        +NW  DSGA++H+T+D+N++     Y G + V +A                        
Subjt:  ---------HFLANPET---------------------VVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVTIA------------------------

Query:  ----------KNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVLKG
                  KNL+S+++L   N V +EF   S   KD+ +G  +L+G
Subjt:  ----------KNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVLKG

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGTGCAACAGTCGATTAGGAATATGGAAACAAGCGAAACGATCACATTTGCACCGTCGAGTTCGTCTATAGCAATGGAAGTAGTTGTCAATCCACTATAT
GAGTCATGGGTGATAGGGTATGAAAATGCTCGTGATTTATGGGCTGCCATACAAGAACTCCTTGGAGTAAAGTCTCAGGCGGAAGAAGATTATATCCGGGTCCGT
CCCGTACCCACTCGATCCTTGATTTCTCAAGTTCTGTTGGGATTAAATGAAGAGTATAATCCGATGGTAGCAACGATCCAAGGAAAAAGAGGCATTTCATGGCCT
GAAATGCAAGCCAAATTGTTAGTATTTGAGAAGAGGTTAGAACTTCAGAATTTTCATAAAAATACAGTGTTGTTTAACAACTCTGTTTCTGTGAATATGGCTAAT
AGTAGTAGAAACGGGGTTAACAACAACAACCCTGGACAAGGTACATCTTATGCCTTCACAGCAACCCAGAATAATAATCATTTTTTGGCCAATCCAGAAACGGTG
GTAGACTCGAATTGGTATGCGGATAGTGGTGCTTCGAATCACGTCACCGCGGACTACAACAGTATGGTTCAGCCTACTGAATATAGAGGTATGGAAAGAGTTACA
ATAGCTAAAAATCTAGTTAGCATGTTCAAACTCGCTAAAGACAATAACGTTTACCTTGAATTTCATGCTGATTCTTGTCTTGGAAAGGATATACGTTCGGGTGAG
GTGGTGCTGAAAGGGGATCTAGATATGGACTTTAATGCTGCAATACAGTTGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGTGCAACAGTCGATTAGGAATATGGAAACAAGCGAAACGATCACATTTGCACCGTCGAGTTCGTCTATAGCAATGGAAGTAGTTGTCAATCCACTATAT
GAGTCATGGGTGATAGGGTATGAAAATGCTCGTGATTTATGGGCTGCCATACAAGAACTCCTTGGAGTAAAGTCTCAGGCGGAAGAAGATTATATCCGGGTCCGT
CCCGTACCCACTCGATCCTTGATTTCTCAAGTTCTGTTGGGATTAAATGAAGAGTATAATCCGATGGTAGCAACGATCCAAGGAAAAAGAGGCATTTCATGGCCT
GAAATGCAAGCCAAATTGTTAGTATTTGAGAAGAGGTTAGAACTTCAGAATTTTCATAAAAATACAGTGTTGTTTAACAACTCTGTTTCTGTGAATATGGCTAAT
AGTAGTAGAAACGGGGTTAACAACAACAACCCTGGACAAGGTACATCTTATGCCTTCACAGCAACCCAGAATAATAATCATTTTTTGGCCAATCCAGAAACGGTG
GTAGACTCGAATTGGTATGCGGATAGTGGTGCTTCGAATCACGTCACCGCGGACTACAACAGTATGGTTCAGCCTACTGAATATAGAGGTATGGAAAGAGTTACA
ATAGCTAAAAATCTAGTTAGCATGTTCAAACTCGCTAAAGACAATAACGTTTACCTTGAATTTCATGCTGATTCTTGTCTTGGAAAGGATATACGTTCGGGTGAG
GTGGTGCTGAAAGGGGATCTAGATATGGACTTTAATGCTGCAATACAGTTGGAGTAG
Protein sequenceShow/hide protein sequence
MFVQQSIRNMETSETITFAPSSSSIAMEVVVNPLYESWVIGYENARDLWAAIQELLGVKSQAEEDYIRVRPVPTRSLISQVLLGLNEEYNPMVATIQGKRGISWP
EMQAKLLVFEKRLELQNFHKNTVLFNNSVSVNMANSSRNGVNNNNPGQGTSYAFTATQNNNHFLANPETVVDSNWYADSGASNHVTADYNSMVQPTEYRGMERVT
IAKNLVSMFKLAKDNNVYLEFHADSCLGKDIRSGEVVLKGDLDMDFNAAIQLE