; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g36660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g36660
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr8:27250046..27251490
RNA-Seq ExpressionMoc08g36660
SyntenyMoc08g36660
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036141.1 retrotransposon protein, putative, Ty1-copia subclass [Cucumis melo var. makuwa]1.1e-3543.7Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATI-----QGKRGISWSEMQDELLVFEKMLEL
        MT EVA Q+MG+  A+DLW AIQ+LF VQS+VEE +L +  Q      TR   S++     E+Y  ++ T      Q K  ISW +MQ ELL+FEK LE 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATI-----QGKRGISWSEMQDELLVFEKMLEL

Query:  QNSHKNTVVNMDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAKNLVSVSKLAKDNNV
        QNS+K        S  +    +N  Q  + AF  T N+NSF+  PETV+D NWYVD+GA+NHVT DY+++  P +Y G+E V V          A+DNNV
Subjt:  QNSHKNTVVNMDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAKNLVSVSKLAKDNNV

Query:  YLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGVV
        YL+FH D   V +  +G+ +++G L+D LY L  V V+
Subjt:  YLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGVV

KAA0057475.1 uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa]1.0e-3340.73Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR
        M  +VA Q+MG+  A+DLW AIQ LF ++S+ EEY+L                          NLGQA S VP R LISQVLLGLDE YN V A IQGK 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR

Query:  GISWSEMQDELLVFEKMLE--LQNSHKNTVVN----MDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPT
         ISW +MQ ELL+FE ++E  L      T++     ++  +R    N N +Q    AF  TQ ++S LA PETVVD N YVDSGA+NHVT+D++++    
Subjt:  GISWSEMQDELLVFEKMLE--LQNSHKNTVVN----MDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPT

Query:  EYGGMERVAVAK------NLVSVSKLAKDNN------VYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGV
        +Y G E V V        + V  + L    N      +         L KD  +G+V+LKG L D LY L  V +
Subjt:  EYGGMERVAVAK------NLVSVSKLAKDNN------VYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGV

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-3432.87Show/hide
Query:  VATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISW
        +A Q+MG+ NA+DLW A Q+LF VQS+ EE +L                           LGQA S VP R+ ISQ LLGLDE YN V+A IQGK  ISW
Subjt:  VATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISW

Query:  SEMQDELLVFEKMLELQNSHKNT------VVNM------------DNSSRNGVNNNN---------------------------PRQGTSYAFTATQNNN
         +MQ ELL FEK LE Q++ KNT      VVN+             N   +G N NN                            + G S      + N 
Subjt:  SEMQDELLVFEKMLELQNSHKNT------VVNM------------DNSSRNGVNNNN---------------------------PRQGTSYAFTATQNNN

Query:  SFL--------------------------------ANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV-----------------------
         FL                                A  +TV++ NWY+DSGA+NH+T +Y+++  P+EY G+E++ V                       
Subjt:  SFL--------------------------------ANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV-----------------------

Query:  -----------AKNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRV
                    KNLVSVSKLA+DNNVY++FH     +KD  +G+ +L   ++D LY L+ +
Subjt:  -----------AKNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRV

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]5.3e-3033.75Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISWSEMQDELLVFEKMLELQNSHK
        MT +VA Q+MG+ N  DLW A Q+ F VQS+ EE +L            R ++     GLDE YN V+  IQGK  ISW +MQ +LL+FEK L+ QN+ K
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISWSEMQDELLVFEKMLELQNSHK

Query:  NTVVNMD-------------NSSRNGVN------------------NNNP------RQGTS---------------------------------YAFTAT
            N+              N  RN  N                  NN P      + G S                                   F +T
Subjt:  NTVVNMD-------------NSSRNGVN------------------NNNP------RQGTS---------------------------------YAFTAT

Query:  QNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV----------------------------------AKNLVSVSKLAKDNNVY
        QN   F A P+TVVDPNWY+DSGA+NHVT + ++M  PTEY G+E+V V                                  AKNL+SVSKLA+DN++Y
Subjt:  QNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV----------------------------------AKNLVSVSKLAKDNNVY

Query:  LKFHADSYLVKDIRSGK
        ++FH     +KD  +GK
Subjt:  LKFHADSYLVKDIRSGK

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia]8.7e-3334.6Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR
        M  +VA QVMG+  +R+LW A+QELF VQS+ E  YL                          NL  A S V  R L+SQVL GLDEEYN +V  +QGK 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR

Query:  GISWSEMQDELLVFEKMLELQNSHKNTV-------------------VNMDNSSRNGVNNNNPRQGTSYA--------------------FT--------
         +SWSEM  ELL +EK LE QNS K+ +                    N   ++ N  + +N  +G  Y                     FT        
Subjt:  GISWSEMQDELLVFEKMLELQNSHKNTV-------------------VNMDNSSRNGVNNNNPRQGTSYA--------------------FT--------

Query:  -ATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAK-NLVSVSKLAKDNNVYLKFHADSYLVKDI------------RSGKV
         A  + ++ +  PETV+DP+WY DSGA++HVT + N++ Q  +Y G E V VA  N +S+S +   N   +     S  +KD+             SG+ 
Subjt:  -ATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAK-NLVSVSKLAKDNNVYLKFHADSYLVKDI------------RSGKV

Query:  VLKGALRDRLYRLNR
        +LKG L+D LYRL+R
Subjt:  VLKGALRDRLYRLNR

TrEMBL top hitse value%identityAlignment
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X12.6e-3033.75Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISWSEMQDELLVFEKMLELQNSHK
        MT +VA Q+MG+ N  DLW A Q+ F VQS+ EE +L            R ++     GLDE YN V+  IQGK  ISW +MQ +LL+FEK L+ QN+ K
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISWSEMQDELLVFEKMLELQNSHK

Query:  NTVVNMD-------------NSSRNGVN------------------NNNP------RQGTS---------------------------------YAFTAT
            N+              N  RN  N                  NN P      + G S                                   F +T
Subjt:  NTVVNMD-------------NSSRNGVN------------------NNNP------RQGTS---------------------------------YAFTAT

Query:  QNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV----------------------------------AKNLVSVSKLAKDNNVY
        QN   F A P+TVVDPNWY+DSGA+NHVT + ++M  PTEY G+E+V V                                  AKNL+SVSKLA+DN++Y
Subjt:  QNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV----------------------------------AKNLVSVSKLAKDNNVY

Query:  LKFHADSYLVKDIRSGK
        ++FH     +KD  +GK
Subjt:  LKFHADSYLVKDIRSGK

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-947.7e-3532.87Show/hide
Query:  VATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISW
        +A Q+MG+ NA+DLW A Q+LF VQS+ EE +L                           LGQA S VP R+ ISQ LLGLDE YN V+A IQGK  ISW
Subjt:  VATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISW

Query:  SEMQDELLVFEKMLELQNSHKNT------VVNM------------DNSSRNGVNNNN---------------------------PRQGTSYAFTATQNNN
         +MQ ELL FEK LE Q++ KNT      VVN+             N   +G N NN                            + G S      + N 
Subjt:  SEMQDELLVFEKMLELQNSHKNT------VVNM------------DNSSRNGVNNNN---------------------------PRQGTSYAFTATQNNN

Query:  SFL--------------------------------ANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV-----------------------
         FL                                A  +TV++ NWY+DSGA+NH+T +Y+++  P+EY G+E++ V                       
Subjt:  SFL--------------------------------ANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAV-----------------------

Query:  -----------AKNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRV
                    KNLVSVSKLA+DNNVY++FH     +KD  +G+ +L   ++D LY L+ +
Subjt:  -----------AKNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRV

A0A5D3CPY2 Retrotransposon protein, putative, Ty1-copia subclass5.3e-3643.7Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATI-----QGKRGISWSEMQDELLVFEKMLEL
        MT EVA Q+MG+  A+DLW AIQ+LF VQS+VEE +L +  Q      TR   S++     E+Y  ++ T      Q K  ISW +MQ ELL+FEK LE 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATI-----QGKRGISWSEMQDELLVFEKMLEL

Query:  QNSHKNTVVNMDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAKNLVSVSKLAKDNNV
        QNS+K        S  +    +N  Q  + AF  T N+NSF+  PETV+D NWYVD+GA+NHVT DY+++  P +Y G+E V V          A+DNNV
Subjt:  QNSHKNTVVNMDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAKNLVSVSKLAKDNNV

Query:  YLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGVV
        YL+FH D   V +  +G+ +++G L+D LY L  V V+
Subjt:  YLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGVV

A0A5D3E3L7 Uncharacterized protein5.0e-3440.73Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR
        M  +VA Q+MG+  A+DLW AIQ LF ++S+ EEY+L                          NLGQA S VP R LISQVLLGLDE YN V A IQGK 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR

Query:  GISWSEMQDELLVFEKMLE--LQNSHKNTVVN----MDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPT
         ISW +MQ ELL+FE ++E  L      T++     ++  +R    N N +Q    AF  TQ ++S LA PETVVD N YVDSGA+NHVT+D++++    
Subjt:  GISWSEMQDELLVFEKMLE--LQNSHKNTVVN----MDNSSRNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPT

Query:  EYGGMERVAVAK------NLVSVSKLAKDNN------VYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGV
        +Y G E V V        + V  + L    N      +         L KD  +G+V+LKG L D LY L  V +
Subjt:  EYGGMERVAVAK------NLVSVSKLAKDNN------VYLKFHADSYLVKDIRSGKVVLKGALRDRLYRLNRVGV

A0A6J1DCW4 uncharacterized protein LOC1110195984.2e-3334.6Show/hide
Query:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR
        M  +VA QVMG+  +R+LW A+QELF VQS+ E  YL                          NL  A S V  R L+SQVL GLDEEYN +V  +QGK 
Subjt:  MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLY-------------------------NLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKR

Query:  GISWSEMQDELLVFEKMLELQNSHKNTV-------------------VNMDNSSRNGVNNNNPRQGTSYA--------------------FT--------
         +SWSEM  ELL +EK LE QNS K+ +                    N   ++ N  + +N  +G  Y                     FT        
Subjt:  GISWSEMQDELLVFEKMLELQNSHKNTV-------------------VNMDNSSRNGVNNNNPRQGTSYA--------------------FT--------

Query:  -ATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAK-NLVSVSKLAKDNNVYLKFHADSYLVKDI------------RSGKV
         A  + ++ +  PETV+DP+WY DSGA++HVT + N++ Q  +Y G E V VA  N +S+S +   N   +     S  +KD+             SG+ 
Subjt:  -ATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAK-NLVSVSKLAKDNNVYLKFHADSYLVKDI------------RSGKV

Query:  VLKGALRDRLYRLNR
        +LKG L+D LYRL+R
Subjt:  VLKGALRDRLYRLNR

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-0730.09Show/hide
Query:  NWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVA----------------------------------KNLVSVSKLAKDNNVYLKFHADSYLVKDIRSG
        NW +DSGA++H+T+D+N++     Y G + V VA                                  KNL+SV +L   N V ++F   S+ VKD+ +G
Subjt:  NWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVA----------------------------------KNLVSVSKLAKDNNVYLKFHADSYLVKDIRSG

Query:  KVVLKGALRDRLY
          +L+G  +D LY
Subjt:  KVVLKGALRDRLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.2e-0628.28Show/hide
Query:  NNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVA----------------------------------
        +  N +Q TS  FT  Q   +   N     + NW +DSGA++H+T+D+N++     Y G + V +A                                  
Subjt:  NNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVA----------------------------------

Query:  KNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLY
        KNL+SV +L   N V ++F   S+ VKD+ +G  +L+G  +D LY
Subjt:  KNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGALRDRLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTCTAGAAGTTGCAACACAGGTGATGGGGTATGAAAATGCTCGTGATTTATGGGCTGCCATACAAGAACTCTTTGAAGTACAGTCTCAGGTGGAAGAATATTATCT
CTACAATTTGGGTCAAGCTAGAAGTCTCGTACCCACTCGATCTTTGATTTCTCAAGTTTTACTAGGATTAGATGAAGAGTATAATCATGTGGTAGCAACGATCCAAGGAA
AAAGAGGCATTTCGTGGTCTGAAATGCAAGACGAATTATTGGTATTCGAGAAGATGTTAGAACTTCAGAATTCTCATAAAAATACAGTAGTGAATATGGATAATAGTAGC
AGAAATGGGGTTAACAACAACAACCCTAGACAAGGTACATCTTATGCGTTCACAGCAACCCAAAATAACAATTCTTTTTTGGCCAATCCAGAAACAGTGGTAGACCCGAA
TTGGTATGTGGATAGTGGTGCTTCAAATCACGTCACCACGGACTACAACAGTATGGTTCAACCTACTGAATATGGAGGTATGGAAAGAGTTGCAGTAGCTAAAAATCTAG
TTAGCGTGTCCAAACTCGCTAAAGACAATAACGTATACCTTAAATTTCATGCTGATTCTTATCTTGTGAAGGATATACGTTCGGGCAAGGTAGTGCTAAAAGGGGCTCTT
AGAGATAGACTTTACCGCCTCAATAGAGTTGGAGTAGTCACTGGGAGTACTTCGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTC
TCTTCCCAAATCATGTAGTAGTATAAATGTTGTGGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGACTCTAGAAGTTGCAACACAGGTGATGGGGTATGAAAATGCTCGTGATTTATGGGCTGCCATACAAGAACTCTTTGAAGTACAGTCTCAGGTGGAAGAATATTATCT
CTACAATTTGGGTCAAGCTAGAAGTCTCGTACCCACTCGATCTTTGATTTCTCAAGTTTTACTAGGATTAGATGAAGAGTATAATCATGTGGTAGCAACGATCCAAGGAA
AAAGAGGCATTTCGTGGTCTGAAATGCAAGACGAATTATTGGTATTCGAGAAGATGTTAGAACTTCAGAATTCTCATAAAAATACAGTAGTGAATATGGATAATAGTAGC
AGAAATGGGGTTAACAACAACAACCCTAGACAAGGTACATCTTATGCGTTCACAGCAACCCAAAATAACAATTCTTTTTTGGCCAATCCAGAAACAGTGGTAGACCCGAA
TTGGTATGTGGATAGTGGTGCTTCAAATCACGTCACCACGGACTACAACAGTATGGTTCAACCTACTGAATATGGAGGTATGGAAAGAGTTGCAGTAGCTAAAAATCTAG
TTAGCGTGTCCAAACTCGCTAAAGACAATAACGTATACCTTAAATTTCATGCTGATTCTTATCTTGTGAAGGATATACGTTCGGGCAAGGTAGTGCTAAAAGGGGCTCTT
AGAGATAGACTTTACCGCCTCAATAGAGTTGGAGTAGTCACTGGGAGTACTTCGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTC
TCTTCCCAAATCATGTAGTAGTATAAATGTTGTGGTATAG
Protein sequenceShow/hide protein sequence
MTLEVATQVMGYENARDLWAAIQELFEVQSQVEEYYLYNLGQARSLVPTRSLISQVLLGLDEEYNHVVATIQGKRGISWSEMQDELLVFEKMLELQNSHKNTVVNMDNSS
RNGVNNNNPRQGTSYAFTATQNNNSFLANPETVVDPNWYVDSGASNHVTTDYNSMVQPTEYGGMERVAVAKNLVSVSKLAKDNNVYLKFHADSYLVKDIRSGKVVLKGAL
RDRLYRLNRVGVVTGSTSTPVDCGLELAANKTICSVSLPKSCSSINVVV