; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g04550 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g04550
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:2998370..3000536
RNA-Seq ExpressionMoc01g04550
SyntenyMoc01g04550
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152352.1 uncharacterized protein LOC111020095 [Momordica charantia]1.9e-4692.38Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTSI+ALLAAQRLNG+NYKQWKSN+N ILVIDDL+FVLQEDCPQA APNATVAVR  YDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

XP_022157449.1 uncharacterized protein LOC111024145 [Momordica charantia]5.3e-3655.63Show/hide
Query:  VLRRVSVAPVRVSAIGLHREVSYMTCVSSWSDHPYGGFIDYWGGPLSMSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATV
        +++ VS  PVR+ A   H +  Y  C      + Y           S STSI+ALLAA++ N +NY QWK+N+N ILV+DDLRF+L E+CPQAP PNA  
Subjt:  VLRRVSVAPVRVSAIGLHREVSYMTCVSSWSDHPYGGFIDYWGGPLSMSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATV

Query:  AVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQSMFGQ
        A R+ YDRWIKANDKA VYIL SISDVL+KKHE  +TA+EIMDSLQ +F Q
Subjt:  AVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQSMFGQ

XP_022158062.1 uncharacterized protein LOC111024637 [Momordica charantia]9.0e-4478.45Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTSI+ LL AQ+LN +NYKQWKSN+N IL+IDDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKANDKAKV ILASISDVLAKKHE+++  KEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQPPHRLDMKPLSS
        MFGQP  +   + L+S
Subjt:  MFGQPPHRLDMKPLSS

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]3.6e-4587.62Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MS SI+ALLAAQ+LNG+NY+QWKSN+N ILVIDDLRFVLQEDCPQAP  NATVAVRN YDRWIK+NDKAKVYILASISDVLAKKHEDT+T KEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

XP_022158202.1 uncharacterized protein LOC111024739 [Momordica charantia]1.2e-3572.12Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTS++ALLA ++LNGKNY QWK+N+N ILV+DDLRFVL E+C Q P PNA  A R+ YDRWIKANDKAKVYI ASISDVLAKKH+  +T +EIMDSL+ 
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

TrEMBL top hitse value%identityAlignment
A0A6J1DFZ2 uncharacterized protein LOC1110200959.4e-4792.38Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTSI+ALLAAQRLNG+NYKQWKSN+N ILVIDDL+FVLQEDCPQA APNATVAVR  YDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

A0A6J1DW68 uncharacterized protein LOC1110246374.3e-4478.45Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTSI+ LL AQ+LN +NYKQWKSN+N IL+IDDLRFVLQEDCPQAPAPNATVAVRN+YDRWIKANDKAKV ILASISDVLAKKHE+++  KEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQPPHRLDMKPLSS
        MFGQP  +   + L+S
Subjt:  MFGQPPHRLDMKPLSS

A0A6J1DWI4 uncharacterized protein LOC1110241452.6e-3655.63Show/hide
Query:  VLRRVSVAPVRVSAIGLHREVSYMTCVSSWSDHPYGGFIDYWGGPLSMSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATV
        +++ VS  PVR+ A   H +  Y  C      + Y           S STSI+ALLAA++ N +NY QWK+N+N ILV+DDLRF+L E+CPQAP PNA  
Subjt:  VLRRVSVAPVRVSAIGLHREVSYMTCVSSWSDHPYGGFIDYWGGPLSMSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATV

Query:  AVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQSMFGQ
        A R+ YDRWIKANDKA VYIL SISDVL+KKHE  +TA+EIMDSLQ +F Q
Subjt:  AVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQSMFGQ

A0A6J1DWL0 uncharacterized protein LOC1110247341.8e-4587.62Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MS SI+ALLAAQ+LNG+NY+QWKSN+N ILVIDDLRFVLQEDCPQAP  NATVAVRN YDRWIK+NDKAKVYILASISDVLAKKHEDT+T KEIMDSLQS
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQP
        MFGQP
Subjt:  MFGQP

A0A6J1DWL4 uncharacterized protein LOC1110247395.7e-3672.12Show/hide
Query:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS
        MSTS++ALLA ++LNGKNY QWK+N+N ILV+DDLRFVL E+C Q P PNA  A R+ YDRWIKANDKAKVYI ASISDVLAKKH+  +T +EIMDSL+ 
Subjt:  MSTSILALLAAQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQS

Query:  MFGQ
        MFGQ
Subjt:  MFGQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCAACGCCATGGCGTTGCGGGGACAGCACACAGCGCCACGACGCTGCACTGTAGCGCCGCGGCGCTGTGCAGCGCCATGGCGCCATGCCAGGGCGCCGCGGCGCT
GCTGCTGCAGCATTTTGCTGCCTTTAGGCACCGAGGCGCTGTCCCTGGTGTTCTTCGGCGCGTTTCCGTGGCTCCGGTTCGCGTCTCCGCCATTGGTTTGCACCGTGAGG
TTTCATACATGACCTGCGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGACCTCTGAGCATGTCTACTTCTATTCTTGCACTCTTAGCC
GCACAAAGACTTAATGGCAAAAATTACAAACAATGGAAGTCAAACGTAAACATTATTCTCGTGATAGATGATCTTAGATTCGTCTTGCAAGAGGATTGTCCTCAAGCTCC
TGCGCCTAATGCCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAAGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGA
AGCACGAGGACACGATCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCACCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCTC
GCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCAACGCCATGGCGTTGCGGGGACAGCACACAGCGCCACGACGCTGCACTGTAGCGCCGCGGCGCTGTGCAGCGCCATGGCGCCATGCCAGGGCGCCGCGGCGCT
GCTGCTGCAGCATTTTGCTGCCTTTAGGCACCGAGGCGCTGTCCCTGGTGTTCTTCGGCGCGTTTCCGTGGCTCCGGTTCGCGTCTCCGCCATTGGTTTGCACCGTGAGG
TTTCATACATGACCTGCGTGTCGTCCTGGAGCGACCATCCCTACGGAGGGTTCATTGATTATTGGGGTGGACCTCTGAGCATGTCTACTTCTATTCTTGCACTCTTAGCC
GCACAAAGACTTAATGGCAAAAATTACAAACAATGGAAGTCAAACGTAAACATTATTCTCGTGATAGATGATCTTAGATTCGTCTTGCAAGAGGATTGTCCTCAAGCTCC
TGCGCCTAATGCCACTGTGGCGGTGCGCAACGTCTATGACAGGTGGATCAAGGCCAATGACAAGGCCAAAGTCTACATCTTGGCGAGCATATCTGATGTGCTTGCTAAGA
AGCACGAGGACACGATCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCACCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCTC
GCATGA
Protein sequenceShow/hide protein sequence
MQQRHGVAGTAHSATTLHCSAAALCSAMAPCQGAAALLLQHFAAFRHRGAVPGVLRRVSVAPVRVSAIGLHREVSYMTCVSSWSDHPYGGFIDYWGGPLSMSTSILALLA
AQRLNGKNYKQWKSNVNIILVIDDLRFVLQEDCPQAPAPNATVAVRNVYDRWIKANDKAKVYILASISDVLAKKHEDTITAKEIMDSLQSMFGQPPHRLDMKPLSSFTTL
A