; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc10g20110 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc10g20110
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr10:14851099..14856803
RNA-Seq ExpressionMoc10g20110
SyntenyMoc10g20110
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022146110.1 uncharacterized protein LOC111015405 [Momordica charantia]6.7e-4560.84Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        ++ASQ VRA +LS D+Q L  RLIQLDMQ+ DVI+ MDWLAT+QANINCS+REV FQLPSG++F FKG+ G VPR VSALKAR LLQ G WGYLA+VV+ 
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDEEEDQV
        SK  P+IDS+HVV EFPDVF  DLP L PV+ +L FC  +     P   + Y+   AEL E + Q+
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDEEEDQV

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]6.0e-6285.33Show/hide
Query:  MRTQMRTMEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNP
        MRTQM TME+MY++MVQAAG  SRSEN+V R D+ EQRG HLGPV++ HPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSP+CSHRNSNQQAESSYNP
Subjt:  MRTQMRTMEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNP

Query:  VTPEGVITREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE
        +TPEGVITREEFDQLKSKFDAQVE LKAKCEKKES  DDGDLGES FTS+
Subjt:  VTPEGVITREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE

XP_022154299.1 uncharacterized protein LOC111021593 [Momordica charantia]1.8e-4259.01Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        M++SQMV+   LS D Q L  RLIQLD+++ DVI+ MDWLAT+QA+INCSK+EV FQLP G SFMFKG+ GGVPR+VSAL+ARHLLQ G WG+LASVV+T
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE
          + P+IDS+HVVNEF DVF ++L  L PV+ +L FC  +     P   + Y+   AEL E
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE

XP_022159042.1 uncharacterized protein LOC111025482 [Momordica charantia]4.0e-4260.25Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        ++ASQ VRA +LS D+Q L  RLIQLDMQ+ D+I+ MDWLAT+QANINCS+REV FQL SG++F FK + G VPR VSALKA+ LLQ G WGYLA+VV+ 
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE
        SK  P+IDS+HVV EFPDVF  DLP L PV+ +L FC  I     P   + Y+   AEL E
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]1.6e-7869.55Show/hide
Query:  DRVRASDRRAQAANDGHQREVGAEVVEGQVHEGLETEPLRKSARITTPVLPPAHPKPSKANRGRGGASKRTTRGPASALIRENFDALQKQMEAMRTQMRT
        D     DRRA  ANDGHQREVGAEVVEGQ+HEGL TEP  +SARITTP L PAHPKP KANRGRGGAS+RTT G A A  RENFDALQK+MEAMRTQM T
Subjt:  DRVRASDRRAQAANDGHQREVGAEVVEGQVHEGLETEPLRKSARITTPVLPPAHPKPSKANRGRGGASKRTTRGPASALIRENFDALQKQMEAMRTQMRT

Query:  MEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNPVTPEGVI
        MEEMYN+MVQA G GSRSE++  R                             +RGDLR+HL+RKRSSSLRKG+SP+CSH+NSNQQAESSYNPV PEGVI
Subjt:  MEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNPVTPEGVI

Query:  TREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE
        TREEFDQLKSKFDAQVE LKA+CE K S  DDGDLGES FTS+
Subjt:  TREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE

TrEMBL top hitse value%identityAlignment
A0A6J1CWD0 uncharacterized protein LOC1110154053.2e-4560.84Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        ++ASQ VRA +LS D+Q L  RLIQLDMQ+ DVI+ MDWLAT+QANINCS+REV FQLPSG++F FKG+ G VPR VSALKAR LLQ G WGYLA+VV+ 
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDEEEDQV
        SK  P+IDS+HVV EFPDVF  DLP L PV+ +L FC  +     P   + Y+   AEL E + Q+
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDEEEDQV

A0A6J1DDW5 uncharacterized protein LOC1110196342.9e-6285.33Show/hide
Query:  MRTQMRTMEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNP
        MRTQM TME+MY++MVQAAG  SRSEN+V R D+ EQRG HLGPV++ HPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSP+CSHRNSNQQAESSYNP
Subjt:  MRTQMRTMEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNP

Query:  VTPEGVITREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE
        +TPEGVITREEFDQLKSKFDAQVE LKAKCEKKES  DDGDLGES FTS+
Subjt:  VTPEGVITREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE

A0A6J1DLN2 uncharacterized protein LOC1110215938.8e-4359.01Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        M++SQMV+   LS D Q L  RLIQLD+++ DVI+ MDWLAT+QA+INCSK+EV FQLP G SFMFKG+ GGVPR+VSAL+ARHLLQ G WG+LASVV+T
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE
          + P+IDS+HVVNEF DVF ++L  L PV+ +L FC  +     P   + Y+   AEL E
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE

A0A6J1DYR1 uncharacterized protein LOC1110254822.0e-4260.25Show/hide
Query:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET
        ++ASQ VRA +LS D+Q L  RLIQLDMQ+ D+I+ MDWLAT+QANINCS+REV FQL SG++F FK + G VPR VSALKA+ LLQ G WGYLA+VV+ 
Subjt:  MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVET

Query:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE
        SK  P+IDS+HVV EFPDVF  DLP L PV+ +L FC  I     P   + Y+   AEL E
Subjt:  SKIAPNIDSIHVVNEFPDVFLEDLPRLSPVQLDLRFCCGI---NVPSLSSLYKAYEAELDE

A0A6J1DZJ1 uncharacterized protein LOC1110257387.6e-7969.55Show/hide
Query:  DRVRASDRRAQAANDGHQREVGAEVVEGQVHEGLETEPLRKSARITTPVLPPAHPKPSKANRGRGGASKRTTRGPASALIRENFDALQKQMEAMRTQMRT
        D     DRRA  ANDGHQREVGAEVVEGQ+HEGL TEP  +SARITTP L PAHPKP KANRGRGGAS+RTT G A A  RENFDALQK+MEAMRTQM T
Subjt:  DRVRASDRRAQAANDGHQREVGAEVVEGQVHEGLETEPLRKSARITTPVLPPAHPKPSKANRGRGGASKRTTRGPASALIRENFDALQKQMEAMRTQMRT

Query:  MEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNPVTPEGVI
        MEEMYN+MVQA G GSRSE++  R                             +RGDLR+HL+RKRSSSLRKG+SP+CSH+NSNQQAESSYNPV PEGVI
Subjt:  MEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSLRKGQSPACSHRNSNQQAESSYNPVTPEGVI

Query:  TREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE
        TREEFDQLKSKFDAQVE LKA+CE K S  DDGDLGES FTS+
Subjt:  TREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGCTAGTCAAATGGTGAGAGCAGACAAGCTATCCTTAGACGATCAGGCTTTGGCGGTAAGATTGATTCAACTGGACATGCAGAATTTGGATGTTATAGTGTGCAT
GGATTGGCTAGCTACCCATCAAGCCAACATTAATTGTTCGAAAAGGGAAGTCTTCTTCCAACTGCCTTCTGGTCAGAGTTTCATGTTTAAAGGAATCATGGGTGGTGTTC
CGAGGGTAGTTTCAGCTCTAAAGGCCAGACACCTCTTACAGTGTGGTGTTTGGGGTTACTTGGCAAGTGTCGTCGAGACTAGTAAGATTGCACCCAACATTGACTCCATC
CACGTGGTTAATGAGTTCCCGGACGTGTTCCTTGAAGACCTTCCAAGGCTATCCCCTGTTCAACTTGATCTCCGTTTCTGCTGCGGGATTAACGTCCCATCACTGTCTTC
TCTCTACAAGGCCTATGAGGCTGAGCTTGATGAAGAGGAAGATCAGGTCGGAGCTTGTAGTGCTTCAGCTCGCTTTTGTACTAGGAGGTCAAGGTTAAGCAGAAGAGCTT
CATCATTTGCTCGCTGGTCAAAATTTTCAACACTCAATGTTGGCAAGCCGATCTTCACAAGGATGACAGCCTCCAAGCCGAACATTAAATTGAAGGGGTTTTCGCTTGTT
GGTCCTGTTGGATCGGTCCACCTTGAACCGCCTAGAAGACCGGTTGCAGCCATCAAAAAGGCTAAGTATGAAAGCCGAGATACGACATGTTCGGGGTCCGACCTACTGGG
AGACCCGACAGGTCCACTCTCGTGTTCAGGTCGGACCAGAGACCGGGTTCGAGCTAGCGACCGAAGAGCTCAAGCGGCCAATGATGGCCACCAGAGGGAGGTCGGAGCGG
AGGTGGTAGAAGGGCAAGTTCATGAAGGCCTGGAGACAGAACCTCTCCGCAAGTCGGCACGTATCACCACGCCCGTTCTACCACCAGCACATCCAAAGCCATCCAAGGCC
AATCGTGGCCGAGGTGGAGCTTCAAAGAGAACCACCCGAGGACCAGCCTCAGCTCTAATAAGGGAGAATTTTGACGCACTCCAAAAACAAATGGAGGCAATGCGCACTCA
GATGCGCACCATGGAAGAAATGTACAACAAAATGGTGCAAGCTGCGGGCACAGGATCTCGGTCTGAAAACCAGGTGACGCGCGTTGACGTGCGCGAGCAGAGGGGTCCCC
ACCTCGGCCCAGTCGAGGAAGACCATCCCGAAGGAGGTGAGGACGAGGAGTACACTCACCAGAGGGGTGATCTCCGCGAACATCTTAACAGAAAGAGGAGCTCGTCCCTC
CGAAAGGGACAGTCTCCGGCCTGCTCGCACAGGAACTCCAACCAGCAGGCTGAATCCTCCTACAACCCGGTAACCCCCGAAGGGGTGATCACGAGGGAAGAGTTCGACCA
GCTAAAGAGCAAGTTTGATGCTCAGGTTGAGGTCTTGAAGGCCAAGTGCGAGAAGAAAGAAAGTCCACTCGATGATGGTGACCTGGGAGAATCGTCATTCACCTCGGAAT
TTTGGAGGCACCAATCCCTCCGAATTTCAAAACTCCCACCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGCTAGTCAAATGGTGAGAGCAGACAAGCTATCCTTAGACGATCAGGCTTTGGCGGTAAGATTGATTCAACTGGACATGCAGAATTTGGATGTTATAGTGTGCAT
GGATTGGCTAGCTACCCATCAAGCCAACATTAATTGTTCGAAAAGGGAAGTCTTCTTCCAACTGCCTTCTGGTCAGAGTTTCATGTTTAAAGGAATCATGGGTGGTGTTC
CGAGGGTAGTTTCAGCTCTAAAGGCCAGACACCTCTTACAGTGTGGTGTTTGGGGTTACTTGGCAAGTGTCGTCGAGACTAGTAAGATTGCACCCAACATTGACTCCATC
CACGTGGTTAATGAGTTCCCGGACGTGTTCCTTGAAGACCTTCCAAGGCTATCCCCTGTTCAACTTGATCTCCGTTTCTGCTGCGGGATTAACGTCCCATCACTGTCTTC
TCTCTACAAGGCCTATGAGGCTGAGCTTGATGAAGAGGAAGATCAGGTCGGAGCTTGTAGTGCTTCAGCTCGCTTTTGTACTAGGAGGTCAAGGTTAAGCAGAAGAGCTT
CATCATTTGCTCGCTGGTCAAAATTTTCAACACTCAATGTTGGCAAGCCGATCTTCACAAGGATGACAGCCTCCAAGCCGAACATTAAATTGAAGGGGTTTTCGCTTGTT
GGTCCTGTTGGATCGGTCCACCTTGAACCGCCTAGAAGACCGGTTGCAGCCATCAAAAAGGCTAAGTATGAAAGCCGAGATACGACATGTTCGGGGTCCGACCTACTGGG
AGACCCGACAGGTCCACTCTCGTGTTCAGGTCGGACCAGAGACCGGGTTCGAGCTAGCGACCGAAGAGCTCAAGCGGCCAATGATGGCCACCAGAGGGAGGTCGGAGCGG
AGGTGGTAGAAGGGCAAGTTCATGAAGGCCTGGAGACAGAACCTCTCCGCAAGTCGGCACGTATCACCACGCCCGTTCTACCACCAGCACATCCAAAGCCATCCAAGGCC
AATCGTGGCCGAGGTGGAGCTTCAAAGAGAACCACCCGAGGACCAGCCTCAGCTCTAATAAGGGAGAATTTTGACGCACTCCAAAAACAAATGGAGGCAATGCGCACTCA
GATGCGCACCATGGAAGAAATGTACAACAAAATGGTGCAAGCTGCGGGCACAGGATCTCGGTCTGAAAACCAGGTGACGCGCGTTGACGTGCGCGAGCAGAGGGGTCCCC
ACCTCGGCCCAGTCGAGGAAGACCATCCCGAAGGAGGTGAGGACGAGGAGTACACTCACCAGAGGGGTGATCTCCGCGAACATCTTAACAGAAAGAGGAGCTCGTCCCTC
CGAAAGGGACAGTCTCCGGCCTGCTCGCACAGGAACTCCAACCAGCAGGCTGAATCCTCCTACAACCCGGTAACCCCCGAAGGGGTGATCACGAGGGAAGAGTTCGACCA
GCTAAAGAGCAAGTTTGATGCTCAGGTTGAGGTCTTGAAGGCCAAGTGCGAGAAGAAAGAAAGTCCACTCGATGATGGTGACCTGGGAGAATCGTCATTCACCTCGGAAT
TTTGGAGGCACCAATCCCTCCGAATTTCAAAACTCCCACCGTAA
Protein sequenceShow/hide protein sequence
MVASQMVRADKLSLDDQALAVRLIQLDMQNLDVIVCMDWLATHQANINCSKREVFFQLPSGQSFMFKGIMGGVPRVVSALKARHLLQCGVWGYLASVVETSKIAPNIDSI
HVVNEFPDVFLEDLPRLSPVQLDLRFCCGINVPSLSSLYKAYEAELDEEEDQVGACSASARFCTRRSRLSRRASSFARWSKFSTLNVGKPIFTRMTASKPNIKLKGFSLV
GPVGSVHLEPPRRPVAAIKKAKYESRDTTCSGSDLLGDPTGPLSCSGRTRDRVRASDRRAQAANDGHQREVGAEVVEGQVHEGLETEPLRKSARITTPVLPPAHPKPSKA
NRGRGGASKRTTRGPASALIRENFDALQKQMEAMRTQMRTMEEMYNKMVQAAGTGSRSENQVTRVDVREQRGPHLGPVEEDHPEGGEDEEYTHQRGDLREHLNRKRSSSL
RKGQSPACSHRNSNQQAESSYNPVTPEGVITREEFDQLKSKFDAQVEVLKAKCEKKESPLDDGDLGESSFTSEFWRHQSLRISKLPP