; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15630
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr4:11721820..11725903
RNA-Seq ExpressionMoc04g15630
SyntenyMoc04g15630
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001969 - Aspartic peptidase, active site
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156067.1 uncharacterized protein LOC111023035 [Momordica charantia]4.5e-7088.27Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K    AAA+GGT RARVFAL+RGDVEHAEAVVTGT+LVLSMPAYALFDSGSSHSFIASTFV+H DLELESLGFLLSVS PSGSVLVTSQVVKGGQLSFD
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVV
        GQTLEVKLIQLDMQDFDVILGMDWLAAN A+I+ SKKEV+FRLP GQNFTFKGVKA VPRVV
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVV

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]3.3e-7371.3Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K     A +GGT  ARVFAL+RGDVEHAEAVVTGT+L+LS+PAYALFDSGSSHSFIASTFVRH DLELES GF LSVS PSGSVLVTSQVVKGGQLSF 
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE
        GQTLEV LIQL+MQDFDVILGMDWLAANRA+I+ SKKEVSF L  GQNFTFKGVKA VPRVVS                           SIE VRVVNE
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE

Query:  FTDVFPEDLLGLPPSLEDRLSVE
        FTDVFPEDL GLPP  E    +E
Subjt:  FTDVFPEDLLGLPPSLEDRLSVE

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]8.9e-1088.37Show/hide
Query:  RLSVERSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSGIVKIL
        RLSVE SLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSG   +L
Subjt:  RLSVERSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSGIVKIL

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.0e-7190.12Show/hide
Query:  TAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTLE
        TAAA+GGTHRARVFAL+RGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV H DLELESLGFLLSVS PSGSVLVTSQVVKGGQLSFDGQTLE
Subjt:  TAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTLE

Query:  VKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVSSIEA
        VKLIQLDMQDFDVILGMDWLAANRA+ID SKK+VSFRLP GQNFTFKGVKA VPRVV +++A
Subjt:  VKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVSSIEA

XP_022156992.1 uncharacterized protein LOC111023821 [Momordica charantia]6.2e-8077.63Show/hide
Query:  VTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTL
        VT AA+GGTHRARVFAL+RGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRH DLELESLGFLLSVS PSGSVLVTSQ+VKGGQLSFDGQTL
Subjt:  VTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTL

Query:  EVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNEFTDV
        EVKLIQLDMQDFDVILGMDWLAAN+A+ID SKKE SFRLP  QNFTFKGVKARVPRVVS                           SIEAVRVVNEFTDV
Subjt:  EVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNEFTDV

Query:  FPEDLLGLPPSLEDRLSVE
        FPEDL GLPPS E    +E
Subjt:  FPEDLLGLPPSLEDRLSVE

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]5.1e-7476.7Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K    AAA+GGT RARVFAL+RGDVEHAEAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRH DLELESLGFLLSVS PSGSVLV SQVVKGGQLSFD
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE
        GQT EVKLIQLDMQDFDVILGMDWLAANRA+I+ SKKEVSFRLP GQNFTFK VK  VPRVVS                           SIEAVRVVNE
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE

Query:  FTDVFP
        FTDVFP
Subjt:  FTDVFP

TrEMBL top hitse value%identityAlignment
A0A6J1DQB9 Reverse transcriptase1.6e-7371.3Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K     A +GGT  ARVFAL+RGDVEHAEAVVTGT+L+LS+PAYALFDSGSSHSFIASTFVRH DLELES GF LSVS PSGSVLVTSQVVKGGQLSF 
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE
        GQTLEV LIQL+MQDFDVILGMDWLAANRA+I+ SKKEVSF L  GQNFTFKGVKA VPRVVS                           SIE VRVVNE
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE

Query:  FTDVFPEDLLGLPPSLEDRLSVE
        FTDVFPEDL GLPP  E    +E
Subjt:  FTDVFPEDLLGLPPSLEDRLSVE

A0A6J1DQB9 Reverse transcriptase4.3e-1088.37Show/hide
Query:  RLSVERSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSGIVKIL
        RLSVE SLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSG   +L
Subjt:  RLSVERSLRQRIIVAQKEDPSLAKGFSMVGHGDFTLSGIVKIL

A0A6J1DQB9 Reverse transcriptase2.0e-7190.12Show/hide
Query:  TAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTLE
        TAAA+GGTHRARVFAL+RGDVE+AEAVVT TVLVLSMPAYALFDSGSSHSFIASTFV H DLELESLGFLLSVS PSGSVLVTSQVVKGGQLSFDGQTLE
Subjt:  TAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTLE

Query:  VKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVSSIEA
        VKLIQLDMQDFDVILGMDWLAANRA+ID SKK+VSFRLP GQNFTFKGVKA VPRVV +++A
Subjt:  VKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVSSIEA

A0A6J1DR22 uncharacterized protein LOC1110230352.2e-7088.27Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K    AAA+GGT RARVFAL+RGDVEHAEAVVTGT+LVLSMPAYALFDSGSSHSFIASTFV+H DLELESLGFLLSVS PSGSVLVTSQVVKGGQLSFD
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVV
        GQTLEVKLIQLDMQDFDVILGMDWLAAN A+I+ SKKEV+FRLP GQNFTFKGVKA VPRVV
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVV

A0A6J1DTA8 uncharacterized protein LOC1110241142.5e-7476.7Show/hide
Query:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD
        +K    AAA+GGT RARVFAL+RGDVEHAEAVVTGT+LV+SMPAYALFDSGSSHSFIASTFVRH DLELESLGFLLSVS PSGSVLV SQVVKGGQLSFD
Subjt:  EKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFD

Query:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE
        GQT EVKLIQLDMQDFDVILGMDWLAANRA+I+ SKKEVSFRLP GQNFTFK VK  VPRVVS                           SIEAVRVVNE
Subjt:  GQTLEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNE

Query:  FTDVFP
        FTDVFP
Subjt:  FTDVFP

A0A6J1DTE5 uncharacterized protein LOC1110238213.0e-8077.63Show/hide
Query:  VTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTL
        VT AA+GGTHRARVFAL+RGDV HAEAVV GTVLVLSMPAYALFDS SSHSFIASTFVRH DLELESLGFLLSVS PSGSVLVTSQ+VKGGQLSFDGQTL
Subjt:  VTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQTL

Query:  EVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNEFTDV
        EVKLIQLDMQDFDVILGMDWLAAN+A+ID SKKE SFRLP  QNFTFKGVKARVPRVVS                           SIEAVRVVNEFTDV
Subjt:  EVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVS---------------------------SIEAVRVVNEFTDV

Query:  FPEDLLGLPPSLEDRLSVE
        FPEDL GLPPS E    +E
Subjt:  FPEDLLGLPPSLEDRLSVE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGGCCATGTTGGGCGGGAAAAAGAATATGTTACAGCGGCAGCTGAAGGTGGGACCCATAGGGCGCGCGTCTTCGCTCTCTCCAGGGGGGATGTTGAACATGCCGA
GGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGTGGACC
TAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCATACCGTCAGGATCTGTATTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCTTTCGATGGTCAGACC
TTGGAAGTAAAGTTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTGGCGGCTAACCGAGCTAGTATTGATAGCTCGAAGAAGGAAGTAAG
CTTCCGCTTGCCCTTCGGACAAAACTTTACCTTTAAAGGAGTCAAGGCCAGGGTTCCGAGGGTTGTGTCGAGCATTGAGGCAGTTCGTGTGGTTAATGAGTTCACTGACG
TGTTCCCTGAGGACCTCCTCGGCTTGCCTCCGTCTCTTGAAGATCGACTCTCAGTGGAACGTAGCCTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGCTTG
GCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTCTCGGGTATCGTCAAGATTTTGGCAAATGAAACCAAGTTGTTGAGGAACTGGACGATTCGCTTGGTTAA
GCCGCCCACCACCCCCCTCCTTCTCGTTGTAAAGTTCGGCAGCCACAAGAACATTTCCCGGCGAACTGCAGCGGTGGCCGACGACGACATGACGGTGGAGCAGCAACCCA
CGAGCGACGGCGCTTCTTCGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGGCCATGTTGGGCGGGAAAAAGAATATGTTACAGCGGCAGCTGAAGGTGGGACCCATAGGGCGCGCGTCTTCGCTCTCTCCAGGGGGGATGTTGAACATGCCGA
GGCGGTGGTCACAGGGACTGTTTTAGTGCTTAGTATGCCTGCTTACGCATTATTTGACTCTGGATCTAGTCATTCTTTCATTGCTTCTACCTTTGTTCGACATGTGGACC
TAGAGCTAGAATCGTTAGGCTTTTTGTTGTCGGTATCCATACCGTCAGGATCTGTATTGGTCACTAGTCAAGTGGTGAAAGGAGGCCAACTCTCTTTCGATGGTCAGACC
TTGGAAGTAAAGTTAATCCAACTGGATATGCAGGATTTCGATGTGATACTAGGCATGGATTGGTTGGCGGCTAACCGAGCTAGTATTGATAGCTCGAAGAAGGAAGTAAG
CTTCCGCTTGCCCTTCGGACAAAACTTTACCTTTAAAGGAGTCAAGGCCAGGGTTCCGAGGGTTGTGTCGAGCATTGAGGCAGTTCGTGTGGTTAATGAGTTCACTGACG
TGTTCCCTGAGGACCTCCTCGGCTTGCCTCCGTCTCTTGAAGATCGACTCTCAGTGGAACGTAGCCTGAGACAGAGGATCATTGTTGCCCAAAAGGAAGACCCTAGCTTG
GCCAAAGGCTTTAGTATGGTGGGCCATGGGGATTTCACTCTCTCGGGTATCGTCAAGATTTTGGCAAATGAAACCAAGTTGTTGAGGAACTGGACGATTCGCTTGGTTAA
GCCGCCCACCACCCCCCTCCTTCTCGTTGTAAAGTTCGGCAGCCACAAGAACATTTCCCGGCGAACTGCAGCGGTGGCCGACGACGACATGACGGTGGAGCAGCAACCCA
CGAGCGACGGCGCTTCTTCGGATTGA
Protein sequenceShow/hide protein sequence
MLGHVGREKEYVTAAAEGGTHRARVFALSRGDVEHAEAVVTGTVLVLSMPAYALFDSGSSHSFIASTFVRHVDLELESLGFLLSVSIPSGSVLVTSQVVKGGQLSFDGQT
LEVKLIQLDMQDFDVILGMDWLAANRASIDSSKKEVSFRLPFGQNFTFKGVKARVPRVVSSIEAVRVVNEFTDVFPEDLLGLPPSLEDRLSVERSLRQRIIVAQKEDPSL
AKGFSMVGHGDFTLSGIVKILANETKLLRNWTIRLVKPPTTPLLLVVKFGSHKNISRRTAAVADDDMTVEQQPTSDGASSD