; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04210
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr1:2745287..2749058
RNA-Seq ExpressionMoc01g04210
SyntenyMoc01g04210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155000.1 uncharacterized protein LOC111022144 [Momordica charantia]2.9e-5958.04Show/hide
Query:  HIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL---------
        H+ +SEA FIKDFKRYGPPTFDGESERA  AEEW+RELEA YAYLGCEDQFKVKGAVFMLRGEALN WDSIA AEDHANV +PWARFKDLL         
Subjt:  HIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL---------

Query:  -------------------------------------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKA
                                                         L++ IRGPVDLQRPA+YAEAVRGALIMD DVS+K   L EVGSSSGVKRK 
Subjt:  -------------------------------------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKA

Query:  HLTFTDQPFRARQRQTQQQSMLPV
        H T+ D   RA Q Q Q + M PV
Subjt:  HLTFTDQPFRARQRQTQQQSMLPV

XP_022156326.1 uncharacterized protein LOC111023247 [Momordica charantia]1.7e-6752.05Show/hide
Query:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE
        MPPR SMRL  D DPA                                       GVGG QA PP+H H PQSEA+FIKDFKRYGPPTFDGESERA   E
Subjt:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE

Query:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------
        EW+RELEALYAYLGCEDQFKVKGAVFMLRGEALN WDS+A AED+ANVP+PWARFK+LL                                         
Subjt:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------

Query:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQQSMLPV
                         L + IRGPVDLQRP TYAEAVRGAL+MD DVS+K  PL EVGSSSGVKRK   T+ D   RA QRQ Q Q M PV
Subjt:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQQSMLPV

XP_022156546.1 uncharacterized protein LOC111023424 [Momordica charantia]3.3e-8762.59Show/hide
Query:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE
        MPPRRSMRL  DVDPA   +NVADPPPP  GD+AG VPP P  AAQ  ALINNT GVGGAQ QPPRH H PQSEAQFIKDFKRYGPPTF G SERA  AE
Subjt:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE

Query:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------
        EWVRELEALYAYLGCEDQFKVKGAVFMLR EALN WDS+A  EDHANVPVPWARFK+LL                                         
Subjt:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------

Query:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQ
                         L + IRG VDLQRP TYAEAVRG LIMD DVS++ QPL+EVGSS GVKRK   T+ DQPFRA QR  QQ
Subjt:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQ

XP_022158302.1 uncharacterized protein LOC111024816 [Momordica charantia]1.1e-5360.4Show/hide
Query:  QAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVP---------
        QA PP HFH PQ EAQFIKDFK YGPPTFDG SE+A  +EEWVRELEA Y YLGC DQFKVKGAVFMLRGEALN WDSIA AED ANVP+          
Subjt:  QAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVP---------

Query:  ------WARFKDLL--------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQ
                +F +L                     L + IR PVDLQ PA+YAEAVRGALIMD DV+SK QPLLEV SSSGVKR     + DQPFR  Q Q
Subjt:  ------WARFKDLL--------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQ

Query:  TQ
         Q
Subjt:  TQ

XP_022158637.1 uncharacterized protein LOC111025088 [Momordica charantia]8.1e-5474.15Show/hide
Query:  PPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWD
        P + L+A   +ALINNT GVGGAQA PPRHFH PQSEAQFIKDFKRYGPPTFDG SERA  AE WVRELEALYAYLGCEDQFKVKG VFMLRGEALN WD
Subjt:  PPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWD

Query:  SIAVAEDHANVPVPWARFKDLLLAQRIRGPVDLQRPATYAEAVRGAL
        SIAVAEDHANVPVPWARFKDLL        V   + A +    +G L
Subjt:  SIAVAEDHANVPVPWARFKDLLLAQRIRGPVDLQRPATYAEAVRGAL

TrEMBL top hitse value%identityAlignment
A0A6J1DL73 uncharacterized protein LOC1110221441.4e-5958.04Show/hide
Query:  HIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL---------
        H+ +SEA FIKDFKRYGPPTFDGESERA  AEEW+RELEA YAYLGCEDQFKVKGAVFMLRGEALN WDSIA AEDHANV +PWARFKDLL         
Subjt:  HIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL---------

Query:  -------------------------------------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKA
                                                         L++ IRGPVDLQRPA+YAEAVRGALIMD DVS+K   L EVGSSSGVKRK 
Subjt:  -------------------------------------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKA

Query:  HLTFTDQPFRARQRQTQQQSMLPV
        H T+ D   RA Q Q Q + M PV
Subjt:  HLTFTDQPFRARQRQTQQQSMLPV

A0A6J1DUM2 uncharacterized protein LOC1110232478.2e-6852.05Show/hide
Query:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE
        MPPR SMRL  D DPA                                       GVGG QA PP+H H PQSEA+FIKDFKRYGPPTFDGESERA   E
Subjt:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE

Query:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------
        EW+RELEALYAYLGCEDQFKVKGAVFMLRGEALN WDS+A AED+ANVP+PWARFK+LL                                         
Subjt:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------

Query:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQQSMLPV
                         L + IRGPVDLQRP TYAEAVRGAL+MD DVS+K  PL EVGSSSGVKRK   T+ D   RA QRQ Q Q M PV
Subjt:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQQSMLPV

A0A6J1DVA0 uncharacterized protein LOC1110234241.6e-8762.59Show/hide
Query:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE
        MPPRRSMRL  DVDPA   +NVADPPPP  GD+AG VPP P  AAQ  ALINNT GVGGAQ QPPRH H PQSEAQFIKDFKRYGPPTF G SERA  AE
Subjt:  MPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAE

Query:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------
        EWVRELEALYAYLGCEDQFKVKGAVFMLR EALN WDS+A  EDHANVPVPWARFK+LL                                         
Subjt:  EWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLL-----------------------------------------

Query:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQ
                         L + IRG VDLQRP TYAEAVRG LIMD DVS++ QPL+EVGSS GVKRK   T+ DQPFRA QR  QQ
Subjt:  -----------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQ

A0A6J1DWW5 uncharacterized protein LOC1110248165.1e-5460.4Show/hide
Query:  QAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVP---------
        QA PP HFH PQ EAQFIKDFK YGPPTFDG SE+A  +EEWVRELEA Y YLGC DQFKVKGAVFMLRGEALN WDSIA AED ANVP+          
Subjt:  QAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVP---------

Query:  ------WARFKDLL--------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQ
                +F +L                     L + IR PVDLQ PA+YAEAVRGALIMD DV+SK QPLLEV SSSGVKR     + DQPFR  Q Q
Subjt:  ------WARFKDLL--------------------LAQRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQ

Query:  TQ
         Q
Subjt:  TQ

A0A6J1DXQ7 uncharacterized protein LOC1110250883.9e-5474.15Show/hide
Query:  PPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWD
        P + L+A   +ALINNT GVGGAQA PPRHFH PQSEAQFIKDFKRYGPPTFDG SERA  AE WVRELEALYAYLGCEDQFKVKG VFMLRGEALN WD
Subjt:  PPIPLVAAQGRALINNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWD

Query:  SIAVAEDHANVPVPWARFKDLLLAQRIRGPVDLQRPATYAEAVRGAL
        SIAVAEDHANVPVPWARFKDLL        V   + A +    +G L
Subjt:  SIAVAEDHANVPVPWARFKDLLLAQRIRGPVDLQRPATYAEAVRGAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAGAACCAATACACCGGCAGTCATCGGTACCTTGGGGATAAAGGACAAGGCCGAACAACAAGTTACTATGGATGAGTATTGGGATCTTGGGGATAAATATCAAGA
ACCAATGCGCCAAGTAGTCATCGGTGCGAACAGGGCCCCCGACCGTGGAGATGACGTGGAGGAGACAATGCCACCCCGTCGTAGTATGAGGTTGTGTGTAGACGTCGACC
CAGCACTCAGATGCAAGAATGTGGCGGACCCACCGCCCCCTTCTACTGGCGATAAGGCGGGGGCAGTTCCTCCAATTCCTCTAGTAGCGGCTCAGGGGCGGGCGCTGATC
AATAACACAGTAGGGGTTGGCGGTGCACAAGCTCAACCACCTCGACATTTTCATATTCCCCAGAGCGAGGCCCAATTCATCAAGGATTTCAAGCGTTATGGACCCCCTAC
CTTTGACGGGGAAAGTGAGAGAGCGATAACAGCAGAAGAGTGGGTCAGAGAGTTGGAAGCCCTTTACGCGTACCTAGGTTGTGAGGACCAATTCAAGGTTAAGGGTGCGG
TTTTTATGTTGAGGGGCGAGGCCCTGAATTTGTGGGACTCAATAGCAGTGGCAGAAGATCATGCTAATGTGCCAGTTCCGTGGGCAAGATTTAAGGACTTGTTGCTTGCG
CAGAGGATCAGAGGACCAGTGGACCTTCAACGACCCGCCACCTATGCTGAGGCAGTTAGGGGCGCTTTGATTATGGATTGGGATGTCTCTAGCAAGACCCAACCTCTGCT
AGAAGTCGGTTCGTCTTCAGGTGTAAAGAGGAAAGCCCATCTGACTTTTACCGACCAGCCATTTAGAGCACGACAGCGCCAGACTCAGCAACAGAGCATGCTGCCAGTAG
GGAGGTTAATCGAGGTCATCCTAGCCGATCGAACTCGCAGAACCATCCCTCCTTCAAATGGGACGACGGTCAAGAGGGAGGTAGCCAACACTCAGACTCATGAGGGTCCA
AAGGACCTTCGGGACAAGCTCAACAAAAACCGATCTAGACGCATAATGACCATCTCCAACATAGAGTCTCAAACCAGCACCGAGACATCTAGTGGATCGAGTAGGTCCCA
AGCCGAGATCGGCTCGGAAGAGTTAGATCAAGTGAAGCAGGATACAATGACGCCGAGCAATACCGAGGCTTCTCGGGAACCCTCATCAGATCGGTCACCGAGATACAAGA
AATATACCCCTACACAAATCCTGGTGCACCAAATATTGGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAGAACCAATACACCGGCAGTCATCGGTACCTTGGGGATAAAGGACAAGGCCGAACAACAAGTTACTATGGATGAGTATTGGGATCTTGGGGATAAATATCAAGA
ACCAATGCGCCAAGTAGTCATCGGTGCGAACAGGGCCCCCGACCGTGGAGATGACGTGGAGGAGACAATGCCACCCCGTCGTAGTATGAGGTTGTGTGTAGACGTCGACC
CAGCACTCAGATGCAAGAATGTGGCGGACCCACCGCCCCCTTCTACTGGCGATAAGGCGGGGGCAGTTCCTCCAATTCCTCTAGTAGCGGCTCAGGGGCGGGCGCTGATC
AATAACACAGTAGGGGTTGGCGGTGCACAAGCTCAACCACCTCGACATTTTCATATTCCCCAGAGCGAGGCCCAATTCATCAAGGATTTCAAGCGTTATGGACCCCCTAC
CTTTGACGGGGAAAGTGAGAGAGCGATAACAGCAGAAGAGTGGGTCAGAGAGTTGGAAGCCCTTTACGCGTACCTAGGTTGTGAGGACCAATTCAAGGTTAAGGGTGCGG
TTTTTATGTTGAGGGGCGAGGCCCTGAATTTGTGGGACTCAATAGCAGTGGCAGAAGATCATGCTAATGTGCCAGTTCCGTGGGCAAGATTTAAGGACTTGTTGCTTGCG
CAGAGGATCAGAGGACCAGTGGACCTTCAACGACCCGCCACCTATGCTGAGGCAGTTAGGGGCGCTTTGATTATGGATTGGGATGTCTCTAGCAAGACCCAACCTCTGCT
AGAAGTCGGTTCGTCTTCAGGTGTAAAGAGGAAAGCCCATCTGACTTTTACCGACCAGCCATTTAGAGCACGACAGCGCCAGACTCAGCAACAGAGCATGCTGCCAGTAG
GGAGGTTAATCGAGGTCATCCTAGCCGATCGAACTCGCAGAACCATCCCTCCTTCAAATGGGACGACGGTCAAGAGGGAGGTAGCCAACACTCAGACTCATGAGGGTCCA
AAGGACCTTCGGGACAAGCTCAACAAAAACCGATCTAGACGCATAATGACCATCTCCAACATAGAGTCTCAAACCAGCACCGAGACATCTAGTGGATCGAGTAGGTCCCA
AGCCGAGATCGGCTCGGAAGAGTTAGATCAAGTGAAGCAGGATACAATGACGCCGAGCAATACCGAGGCTTCTCGGGAACCCTCATCAGATCGGTCACCGAGATACAAGA
AATATACCCCTACACAAATCCTGGTGCACCAAATATTGGTGTAG
Protein sequenceShow/hide protein sequence
MSRTNTPAVIGTLGIKDKAEQQVTMDEYWDLGDKYQEPMRQVVIGANRAPDRGDDVEETMPPRRSMRLCVDVDPALRCKNVADPPPPSTGDKAGAVPPIPLVAAQGRALI
NNTVGVGGAQAQPPRHFHIPQSEAQFIKDFKRYGPPTFDGESERAITAEEWVRELEALYAYLGCEDQFKVKGAVFMLRGEALNLWDSIAVAEDHANVPVPWARFKDLLLA
QRIRGPVDLQRPATYAEAVRGALIMDWDVSSKTQPLLEVGSSSGVKRKAHLTFTDQPFRARQRQTQQQSMLPVGRLIEVILADRTRRTIPPSNGTTVKREVANTQTHEGP
KDLRDKLNKNRSRRIMTISNIESQTSTETSSGSSRSQAEIGSEELDQVKQDTMTPSNTEASREPSSDRSPRYKKYTPTQILVHQILV