CuGenDBv2

Moc09g01000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g01000
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr9:908661..916871
RNA-Seq Expression: Moc09g01000
Synteny: Moc09g01000
Gene Ontology terms: NA
InterPro domains: NA
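
The genome location field uses a `chrom:start..end` layout. A minimal sketch of how it could be parsed follows; the helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr9:908661..916871")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.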


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]; e value: 6.2e-45; % identity: 39.09
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK-----------------------
        +SS  T   VN L+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK                       
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK-----------------------

Query:  ----------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNN---SVSVNMA----------NSSRSVSGGNQRQNQNS
                        DE YN V+  IQGK  ISW +MQ++LL+FEK L+ QN+ K      N   S ++NMA          +S++   G N++     
Subjt:  ----------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNN---SVSVNMA----------NSSRSVSGGNQRQNQNS

Query:  RPPFNN---------------------NRGVVEIEVEDGTSYA-----------FTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYG
        R   NN                     N+      V+D   ++           F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY 
Subjt:  RPPFNN---------------------NRGVVEIEVEDGTSYA-----------FTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYG

Query:  GMERVTIGKVVLKGALKDGLYRLNTVGVVI
        G      G+ +L+G L+DG Y+L  VGV I
Subjt:  GMERVTIGKVVLKGALKDGLYRLNTVGVVI

XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo]; e value: 4.1e-49; % identity: 47.21
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA     N  R+ S              QR N N+ P                  FN   +  +V+  
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE

Query:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIG
         E  ++         F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY G+E+VT+G
Subjt:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIG

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo]; e value: 1.2e-51; % identity: 46.39
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA     N  R+ S              QR N N+ P                  FN   +  +V+  
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE

Query:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI
         E  ++         F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY G      G+ +L+G L+DG Y+L  VGV I
Subjt:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI

XP_016902205.1 PREDICTED: uncharacterized protein LOC107991581 isoform X5 [Cucumis melo]; e value: 2.0e-43; % identity: 47.39
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGVVEIEVEDGTSYAFTATQNNNPFLANPETVIDPNWYVDSG
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA         N ++NQ+++  +  NR              F+  + N   L N  T        DSG
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGVVEIEVEDGTSYAFTATQNNNPFLANPETVIDPNWYVDSG

Query:  ASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI
        A+NHVT + ++M  PTEY G      G+ +L+G L+DG Y+L  VGV I
Subjt:  ASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI

XP_022148963.1 uncharacterized protein LOC111017501 [Momordica charantia]; e value: 6.5e-63; % identity: 66.36
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--
        MFVQQSIGNMETSQTNISAPSSSSIATEAA+NPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK  
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--

Query:  -------------------------------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG
                                             DEEYNPVVATIQGKRGISWPEMQAE                                RSVSGG
Subjt:  -------------------------------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG

Query:  NQRQNQNSRPPFNNNRG
        NQRQNQNS+PPFNNNRG
Subjt:  NQRQNQNSRPPFNNNRG

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1; e value: 2.0e-49; % identity: 47.21
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA     N  R+ S              QR N N+ P                  FN   +  +V+  
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE

Query:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIG
         E  ++         F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY G+E+VT+G
Subjt:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIG

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X3; e value: 5.6e-52; % identity: 46.39
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA     N  R+ S              QR N N+ P                  FN   +  +V+  
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMA-----NSSRSVSG-----------GNQRQNQNSRPP-----------------FNN--NRGVVEIE

Query:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI
         E  ++         F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY G      G+ +L+G L+DG Y+L  VGV I
Subjt:  VEDGTS-------YAFTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI

A0A1S4E2K3 uncharacterized protein LOC107991581 isoform X5; e value: 9.6e-44; % identity: 47.39
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE
        +SS  T   VNPL+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK  DE YN V+  IQGK  ISW +
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--DEEYNPVVATIQGKRGISWPE

Query:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGVVEIEVEDGTSYAFTATQNNNPFLANPETVIDPNWYVDSG
        MQ++LL+FEKRL+ QN+  KNT +   S ++NMA         N ++NQ+++  +  NR              F+  + N   L N  T        DSG
Subjt:  MQAELLVFEKRLELQNSH-KNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGVVEIEVEDGTSYAFTATQNNNPFLANPETVIDPNWYVDSG

Query:  ASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI
        A+NHVT + ++M  PTEY G      G+ +L+G L+DG Y+L  VGV I
Subjt:  ASNHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVI

A0A5A7SIT7 Uncharacterized protein; e value: 3.0e-45; % identity: 39.09
Query:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK-----------------------
        +SS  T   VN L+E WVTTD LLLGWLYNSMTP+VA Q+MG+ N  DLW A Q+ FGVQS+AEED+LRQ+ Q TRK                       
Subjt:  SSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK-----------------------

Query:  ----------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNN---SVSVNMA----------NSSRSVSGGNQRQNQNS
                        DE YN V+  IQGK  ISW +MQ++LL+FEK L+ QN+ K      N   S ++NMA          +S++   G N++     
Subjt:  ----------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNN---SVSVNMA----------NSSRSVSGGNQRQNQNS

Query:  RPPFNN---------------------NRGVVEIEVEDGTSYA-----------FTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYG
        R   NN                     N+      V+D   ++           F +TQN  PF A P+TV+DPNWY+DSGA+NHVT + ++M  PTEY 
Subjt:  RPPFNN---------------------NRGVVEIEVEDGTSYA-----------FTATQNNNPFLANPETVIDPNWYVDSGASNHVTADYNSMVPPTEYG

Query:  GMERVTIGKVVLKGALKDGLYRLNTVGVVI
        G      G+ +L+G L+DG Y+L  VGV I
Subjt:  GMERVTIGKVVLKGALKDGLYRLNTVGVVI

A0A6J1D5J0 uncharacterized protein LOC111017501; e value: 3.2e-63; % identity: 66.36
Query:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--
        MFVQQSIGNMETSQTNISAPSSSSIATEAA+NPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK  
Subjt:  MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRK--

Query:  -------------------------------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG
                                             DEEYNPVVATIQGKRGISWPEMQAE                                RSVSGG
Subjt:  -------------------------------------DEEYNPVVATIQGKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGG

Query:  NQRQNQNSRPPFNNNRG
        NQRQNQNS+PPFNNNRG
Subjt:  NQRQNQNSRPPFNNNRG

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCGTCAATCCACTATATGAGTC
ATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTATGGGCTGCCA
TACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAGGTATTTCAACAAACTCGAAAAGATGAAGAGTATAATCCTGTGGTAGCAACGATCCAA
GGAAAACGAGGCATTTCGTGGCCTGAAATGCAAGCCGAATTGTTGGTATTTGAGAAGAGGTTAGAACTTCAGAATTCTCATAAAAATACAGTATCTTTTAACAACTCTGT
TTCTGTGAATATGGCTAATAGTAGCAGAAGTGTAAGTGGTGGAAACCAACGTCAAAATCAAAACTCTCGGCCACCATTCAACAACAATCGGGGGGTGGTCGAAATCGAGG
TAGAGGACGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCCAATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCTTCA
AATCATGTCACCGCCGACTACAATAGTATGGTTCCACCTACTGAATATGGAGGTATGGAAAGAGTTACAATAGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACT
TTACCGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTTGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAAT
CATCCAGTAGTATAAATGTTGTGATAAGCACCCATGATCCATCTTCTCCACCGATGGCCAAAGCCTTTTTGCTCCATAACCGCACCAAGGAAGCAAGAATAATTATCCTC
AGAGGATTACCGGAAGTTTTTCTACCTGTTAGCTCAGATGCTGGAATTGCCACCACGCAGCCTAATCTAGAAGCTCTTCACTCGAACATAGACAGCTTTGCCAATGTCTT
TATCGCCGAGATTCCATGGTAG
mRNA sequence
ATGTTTGTGCAACAGTCGATTGGTAATATGGAAACAAGCCAAACGAACATATCTGCACCATCGAGTTCCTCTATAGCAACAGAAGCAGCCGTCAATCCACTATATGAGTC
ATGGGTAACTACCGACCAGCTACTTCTTGGTTGGTTGTACAACTCTATGACTCCAGAAGTTGCAACACAGGTGATGGGGTACGAAAATGCTTGTGATTTATGGGCTGCCA
TACAAGAACTCTTTGGAGTACAGTCTCAGGCGGAAGAAGATTATCTCCGTCAGGTATTTCAACAAACTCGAAAAGATGAAGAGTATAATCCTGTGGTAGCAACGATCCAA
GGAAAACGAGGCATTTCGTGGCCTGAAATGCAAGCCGAATTGTTGGTATTTGAGAAGAGGTTAGAACTTCAGAATTCTCATAAAAATACAGTATCTTTTAACAACTCTGT
TTCTGTGAATATGGCTAATAGTAGCAGAAGTGTAAGTGGTGGAAACCAACGTCAAAATCAAAACTCTCGGCCACCATTCAACAACAATCGGGGGGTGGTCGAAATCGAGG
TAGAGGACGGTACATCTTATGCCTTCACAGCAACCCAAAATAACAATCCTTTTTTGGCCAATCCAGAAACAGTGATAGACCCGAATTGGTATGTGGATAGTGGTGCTTCA
AATCATGTCACCGCCGACTACAATAGTATGGTTCCACCTACTGAATATGGAGGTATGGAAAGAGTTACAATAGGCAAGGTGGTGCTGAAAGGGGCTCTTAAGGATGGACT
TTACCGCCTCAATACTGTTGGAGTAGTCATTGGGAGTACTTTGACTCCAGTTGACTGTGGCTTGGAGTTGGCTGCTAATAAAACTATTTGTTCTGTGTCTCTTCCCAAAT
CATCCAGTAGTATAAATGTTGTGATAAGCACCCATGATCCATCTTCTCCACCGATGGCCAAAGCCTTTTTGCTCCATAACCGCACCAAGGAAGCAAGAATAATTATCCTC
AGAGGATTACCGGAAGTTTTTCTACCTGTTAGCTCAGATGCTGGAATTGCCACCACGCAGCCTAATCTAGAAGCTCTTCACTCGAACATAGACAGCTTTGCCAATGTCTT
TATCGCCGAGATTCCATGGTAG
Protein sequence
MFVQQSIGNMETSQTNISAPSSSSIATEAAVNPLYESWVTTDQLLLGWLYNSMTPEVATQVMGYENACDLWAAIQELFGVQSQAEEDYLRQVFQQTRKDEEYNPVVATIQ
GKRGISWPEMQAELLVFEKRLELQNSHKNTVSFNNSVSVNMANSSRSVSGGNQRQNQNSRPPFNNNRGVVEIEVEDGTSYAFTATQNNNPFLANPETVIDPNWYVDSGAS
NHVTADYNSMVPPTEYGGMERVTIGKVVLKGALKDGLYRLNTVGVVIGSTLTPVDCGLELAANKTICSVSLPKSSSSINVVISTHDPSSPPMAKAFLLHNRTKEARIIIL
RGLPEVFLPVSSDAGIATTQPNLEALHSNIDSFANVFIAEIPW
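
As a sanity check on the sequences above, one could translate the CDS and compare the result with the listed protein. The sketch below assumes the standard genetic code (NCBI translation table 1) and reading frame 0; the function name `translate` is illustrative and not part of CuGenDB.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code, with codons enumerated in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 24 nt of the CDS listed above; matches the start of the protein sequence.
print(translate("ATGTTTGTGCAACAGTCGATTGGT"))  # MFVQQSIG
```

Passing the full CDS (line breaks included) and comparing the output with the protein sequence would flag any mismatch between the two records.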