; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g15930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g15930
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr3:10588969..10594936
RNA-Seq ExpressionMoc03g15930
SyntenyMoc03g15930
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]2.9e-4354.22Show/hide
Query:  QQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL---------------------------
        Q+ R       D   A AVVT T+LVLSMPAYALFDSGSSHSFIASTFV+HADLELESLGFLLSVSTPSGSVL                           
Subjt:  QQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL---------------------------

Query:  -DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKARAYLASIVDARKV----------VPSIEAVRVVNEFTDVFPEDLPG--------LPP
         DFDVILGMDWLAAN A+I+CSKKEV+FRLPSGQNFTFKGVKA        D + +          +     + +V ++ D      PG        L  
Subjt:  -DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKARAYLASIVDARKV----------VPSIEAVRVVNEFTDVFPEDLPG--------LPP

Query:  RAASVASLVAVCSPIHSELERLEVE
        +AASVASLVA CSP+H+ELERLEVE
Subjt:  RAASVASLVAVCSPIHSELERLEVE

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]1.0e-5150.86Show/hide
Query:  TTTRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--------
        T T+  +   RV AL R             D   A AVVT T+L+LS+PAYALFDSGSSHSFIASTFVRHADLELES GF LSVSTPSGSVL        
Subjt:  TTTRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--------

Query:  --------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKVVPSIEA
                            DFDVILGMDWLAANRA+I+CSKKEVSF L SGQNFTFKGVKA                    AYLAS+VDARKVVPSIE 
Subjt:  --------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKVVPSIEA

Query:  VRVVNEFTDVFPEDLPGLPP--RAASVASLVAVCSPIH--------SELERLEVELTVDDVSALLARLSVEPSLR---QRIIVAQKEDPSL
        VRVVNEFTDVFPEDLPGLPP         L+   +PI         +EL+ L+++L       LL R  + PS+      ++  +K+D S+
Subjt:  VRVVNEFTDVFPEDLPGLPP--RAASVASLVAVCSPIH--------SELERLEVELTVDDVSALLARLSVEPSLR---QRIIVAQKEDPSL

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]5.7e-5562.83Show/hide
Query:  QRWPTTT--RRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--
        QR P TT  +  +   RV AL R             D A A AVV  TVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL  
Subjt:  QRWPTTT--RRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--

Query:  --------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKV
                                  DFDVILGMDWLAAN+A+IDCSKKE SFRLPS QNFTFKGVKAR                   AYLAS+VDARKV
Subjt:  --------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKV

Query:  VPSIEAVRVVNEFTDVFPEDLPGLPP
        VPSIEAVRVVNEFTDVFPEDLPGLPP
Subjt:  VPSIEAVRVVNEFTDVFPEDLPGLPP

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]5.5e-5061.54Show/hide
Query:  RWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL------------
        R    P   A     Q+ R       D   A AVVT T+LV+SMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL            
Subjt:  RWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL------------

Query:  ----------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVK-------------------ARAYLASIVDARKVVPSIEAVRVV
                        DFDVILGMDWLAANRA+I+CSKKEVSFRLPSGQNFTFK VK                   A AYLAS+VDARKVVPSIEAVRVV
Subjt:  ----------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVK-------------------ARAYLASIVDARKVVPSIEAVRVV

Query:  NEFTDVFP
        NEFTDVFP
Subjt:  NEFTDVFP

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]3.8e-4356.82Show/hide
Query:  PLSSAATRAFPDEVQRWPTT--TRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF
        P++ + T+A     QR P T   +  +   RV AL R             D   A AVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADLELESLGF
Subjt:  PLSSAATRAFPDEVQRWPTT--TRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF

Query:  LLSVSTPSGSVL----------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-----------------
        LLSVSTPSGSVL                            DFDVILGMDWLAANRA+IDCSKK+VSFRLPSGQNFTFKGVKA                  
Subjt:  LLSVSTPSGSVL----------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-----------------

Query:  --AYLASIVDARKVVPSIEA
          AYLAS+VDARKVVPSIEA
Subjt:  --AYLASIVDARKVVPSIEA

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase4.8e-5250.86Show/hide
Query:  TTTRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--------
        T T+  +   RV AL R             D   A AVVT T+L+LS+PAYALFDSGSSHSFIASTFVRHADLELES GF LSVSTPSGSVL        
Subjt:  TTTRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--------

Query:  --------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKVVPSIEA
                            DFDVILGMDWLAANRA+I+CSKKEVSF L SGQNFTFKGVKA                    AYLAS+VDARKVVPSIE 
Subjt:  --------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKVVPSIEA

Query:  VRVVNEFTDVFPEDLPGLPP--RAASVASLVAVCSPIH--------SELERLEVELTVDDVSALLARLSVEPSLR---QRIIVAQKEDPSL
        VRVVNEFTDVFPEDLPGLPP         L+   +PI         +EL+ L+++L       LL R  + PS+      ++  +K+D S+
Subjt:  VRVVNEFTDVFPEDLPGLPP--RAASVASLVAVCSPIH--------SELERLEVELTVDDVSALLARLSVEPSLR---QRIIVAQKEDPSL

A0A6J1DR22 uncharacterized protein LOC1110230351.4e-4354.22Show/hide
Query:  QQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL---------------------------
        Q+ R       D   A AVVT T+LVLSMPAYALFDSGSSHSFIASTFV+HADLELESLGFLLSVSTPSGSVL                           
Subjt:  QQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL---------------------------

Query:  -DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKARAYLASIVDARKV----------VPSIEAVRVVNEFTDVFPEDLPG--------LPP
         DFDVILGMDWLAAN A+I+CSKKEV+FRLPSGQNFTFKGVKA        D + +          +     + +V ++ D      PG        L  
Subjt:  -DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKARAYLASIVDARKV----------VPSIEAVRVVNEFTDVFPEDLPG--------LPP

Query:  RAASVASLVAVCSPIHSELERLEVE
        +AASVASLVA CSP+H+ELERLEVE
Subjt:  RAASVASLVAVCSPIHSELERLEVE

A0A6J1DTA8 uncharacterized protein LOC1110241142.7e-5061.54Show/hide
Query:  RWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL------------
        R    P   A     Q+ R       D   A AVVT T+LV+SMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL            
Subjt:  RWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL------------

Query:  ----------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVK-------------------ARAYLASIVDARKVVPSIEAVRVV
                        DFDVILGMDWLAANRA+I+CSKKEVSFRLPSGQNFTFK VK                   A AYLAS+VDARKVVPSIEAVRVV
Subjt:  ----------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVK-------------------ARAYLASIVDARKVVPSIEAVRVV

Query:  NEFTDVFP
        NEFTDVFP
Subjt:  NEFTDVFP

A0A6J1DTE5 uncharacterized protein LOC1110238212.7e-5562.83Show/hide
Query:  QRWPTTT--RRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--
        QR P TT  +  +   RV AL R             D A A AVV  TVLVLSMPAYALFDS SSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL  
Subjt:  QRWPTTT--RRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVL--

Query:  --------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKV
                                  DFDVILGMDWLAAN+A+IDCSKKE SFRLPS QNFTFKGVKAR                   AYLAS+VDARKV
Subjt:  --------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-------------------AYLASIVDARKV

Query:  VPSIEAVRVVNEFTDVFPEDLPGLPP
        VPSIEAVRVVNEFTDVFPEDLPGLPP
Subjt:  VPSIEAVRVVNEFTDVFPEDLPGLPP

A0A6J1DWP4 uncharacterized protein LOC1110252151.8e-4356.82Show/hide
Query:  PLSSAATRAFPDEVQRWPTT--TRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF
        P++ + T+A     QR P T   +  +   RV AL R             D   A AVVT TVLVLSMPAYALFDSGSSHSFIASTFV HADLELESLGF
Subjt:  PLSSAATRAFPDEVQRWPTT--TRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFDSGSSHSFIASTFVRHADLELESLGF

Query:  LLSVSTPSGSVL----------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-----------------
        LLSVSTPSGSVL                            DFDVILGMDWLAANRA+IDCSKK+VSFRLPSGQNFTFKGVKA                  
Subjt:  LLSVSTPSGSVL----------------------------DFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKAR-----------------

Query:  --AYLASIVDARKVVPSIEA
          AYLAS+VDARKVVPSIEA
Subjt:  --AYLASIVDARKVVPSIEA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATGAAAGAACCCAAGCTGGTATGTACTGTAACGCCTCGAGTCCCTCAGGGGATGGACTTCTTTTCTCTTGGGCTTGAACTTGTTGGGCCTCCATTGAGT
TCGGCAGCCACAAGAGCGTTTCCCGACGAGGTGCAGCGGTGGCCGACGACGACAAGACGGTGGAGTAGCAACCCACGAGTGGTGGCGCTTCTTCGGATCGAGCAG
CAGCAGCGGCGGCACTGCGGCGCTTCTTCGGATCGAGCAGCAGCTGCGGCGGTGGTCACAAGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGAC
TCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCACACCATCAGGATCT
GTGTTGGATTTCGATGTAATACTAGGCATGGATTGGTTGGCGGCTAACCGGGCTAGTATTGATTGCTCGAAGAAGGAAGTTAGCTTCCGCTTGCCCTCCGGACAA
AACTTTACCTTTAAAGGAGTCAAGGCCAGGGCCTATTTGGCTAGCATTGTGGATGCAAGGAAGGTTGTGCCGAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTC
ACTGACGTGTTCCCTGAGGACCTCCCCGGCTTGCCTCCAAGGGCGGCCAGTGTGGCATCTCTTGTAGCGGTCTGCAGTCCGATACACAGCGAGTTGGAACGCTTG
GAGGTGGAGCTGACGGTGGATGATGTCTCCGCGTTGTTGGCTCGACTCTCAGTGGAACCTAGCTTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGC
TTGGCCAAAGGCTTTAGTATAGTGGGCCATGGAGATTTCACTCTCTTGGAGGAGTCCTTGTGCTACGAAGAGGTACCCGTCAAAATTTTGGCAAAAGAAACCAAG
TTGTTGAGGAACCGGACGATTCGCTTGGTTAAGGTTTTGTGGAGAAACCACCAAGTGGAACAAGCTACTTGGGAGCGGGAGGACGATATCAAGGCGAGATACCCT
GAACTGTTGGAACAGTCAACTTTCGGGGACGAAAGGATGGACTTCTTTTCTCTTGTGCTTGGACTTGTTGGGCCTCCATTGAGCCGTCCACCACCCCTTCCTTCT
TGTTGTAAAGTTCGGCAGCCACAAGAGCGTTTCCCGGCGAGGTGCAGTGGTGCCCGACGACGACACGAAGGTGGAGTAGCAACCCACGAGCAGCGACGCTTCTTC
AGATCGAGCAGCAGCAGCGGCGGTGCTGCGGCGCTTCTTCGGATCGGACAGCAGCGGCAGCGGCGCTCCTTCGGATCGAGCAGCGGCAGCAGCGGCGCGACTTCC
CTGACTTCCCGGCGGTCTGCAGCAGCGGCGACGCGTTCCCCGACGGTCTGCAACGAACCAGCACCGCTCCTCCTCGCGGCGGCACACGGCGACGGGCCAAATCCG
CGGCGGCGCACGGCGAACGGGCGGTACAGCGGTGAACCGGCGGCGACCTACTGCTACGATCCGACGAACGACAACTCCAGCGGCGGGATTTGTGCGTGTGAGGTC
GACGCAAAACTTAAGGGCGCGGTTGTGAGTGAAGGACTCGTGGGCGACCATGGTGATGGGACGGAGGAGACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTATGAAAGAACCCAAGCTGGTATGTACTGTAACGCCTCGAGTCCCTCAGGGGATGGACTTCTTTTCTCTTGGGCTTGAACTTGTTGGGCCTCCATTGAGT
TCGGCAGCCACAAGAGCGTTTCCCGACGAGGTGCAGCGGTGGCCGACGACGACAAGACGGTGGAGTAGCAACCCACGAGTGGTGGCGCTTCTTCGGATCGAGCAG
CAGCAGCGGCGGCACTGCGGCGCTTCTTCGGATCGAGCAGCAGCTGCGGCGGTGGTCACAAGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGAC
TCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGCGGACCTAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCACACCATCAGGATCT
GTGTTGGATTTCGATGTAATACTAGGCATGGATTGGTTGGCGGCTAACCGGGCTAGTATTGATTGCTCGAAGAAGGAAGTTAGCTTCCGCTTGCCCTCCGGACAA
AACTTTACCTTTAAAGGAGTCAAGGCCAGGGCCTATTTGGCTAGCATTGTGGATGCAAGGAAGGTTGTGCCGAGCATTGAGGCGGTTCGTGTGGTTAATGAGTTC
ACTGACGTGTTCCCTGAGGACCTCCCCGGCTTGCCTCCAAGGGCGGCCAGTGTGGCATCTCTTGTAGCGGTCTGCAGTCCGATACACAGCGAGTTGGAACGCTTG
GAGGTGGAGCTGACGGTGGATGATGTCTCCGCGTTGTTGGCTCGACTCTCAGTGGAACCTAGCTTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGC
TTGGCCAAAGGCTTTAGTATAGTGGGCCATGGAGATTTCACTCTCTTGGAGGAGTCCTTGTGCTACGAAGAGGTACCCGTCAAAATTTTGGCAAAAGAAACCAAG
TTGTTGAGGAACCGGACGATTCGCTTGGTTAAGGTTTTGTGGAGAAACCACCAAGTGGAACAAGCTACTTGGGAGCGGGAGGACGATATCAAGGCGAGATACCCT
GAACTGTTGGAACAGTCAACTTTCGGGGACGAAAGGATGGACTTCTTTTCTCTTGTGCTTGGACTTGTTGGGCCTCCATTGAGCCGTCCACCACCCCTTCCTTCT
TGTTGTAAAGTTCGGCAGCCACAAGAGCGTTTCCCGGCGAGGTGCAGTGGTGCCCGACGACGACACGAAGGTGGAGTAGCAACCCACGAGCAGCGACGCTTCTTC
AGATCGAGCAGCAGCAGCGGCGGTGCTGCGGCGCTTCTTCGGATCGGACAGCAGCGGCAGCGGCGCTCCTTCGGATCGAGCAGCGGCAGCAGCGGCGCGACTTCC
CTGACTTCCCGGCGGTCTGCAGCAGCGGCGACGCGTTCCCCGACGGTCTGCAACGAACCAGCACCGCTCCTCCTCGCGGCGGCACACGGCGACGGGCCAAATCCG
CGGCGGCGCACGGCGAACGGGCGGTACAGCGGTGAACCGGCGGCGACCTACTGCTACGATCCGACGAACGACAACTCCAGCGGCGGGATTTGTGCGTGTGAGGTC
GACGCAAAACTTAAGGGCGCGGTTGTGAGTGAAGGACTCGTGGGCGACCATGGTGATGGGACGGAGGAGACATGA
Protein sequenceShow/hide protein sequence
MAMKEPKLVCTVTPRVPQGMDFFSLGLELVGPPLSSAATRAFPDEVQRWPTTTRRWSSNPRVVALLRIEQQQRRHCGASSDRAAAAAVVTRTVLVLSMPAYALFD
SGSSHSFIASTFVRHADLELESLGFLLSVSTPSGSVLDFDVILGMDWLAANRASIDCSKKEVSFRLPSGQNFTFKGVKARAYLASIVDARKVVPSIEAVRVVNEF
TDVFPEDLPGLPPRAASVASLVAVCSPIHSELERLEVELTVDDVSALLARLSVEPSLRQRIIVAQKEDPSLAKGFSIVGHGDFTLLEESLCYEEVPVKILAKETK
LLRNRTIRLVKVLWRNHQVEQATWEREDDIKARYPELLEQSTFGDERMDFFSLVLGLVGPPLSRPPPLPSCCKVRQPQERFPARCSGARRRHEGGVATHEQRRFF
RSSSSSGGAAALLRIGQQRQRRSFGSSSGSSGATSLTSRRSAAAATRSPTVCNEPAPLLLAAAHGDGPNPRRRTANGRYSGEPAATYCYDPTNDNSSGGICACEV
DAKLKGAVVSEGLVGDHGDGTEET