; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr4:9873592..9875609
RNA-Seq ExpressionMoc04g12830
SyntenyMoc04g12830
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]4.5e-6456.46Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MVRE+ FR NATEDPNN+LT+FLDVC TVKMN V D+ IRLRLFP SLQDK                 EMV+ FLT FFPP KT QLRTEI SFR+YDYE
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI------------
        QLFE WE                               R ILDAAA G LLS+TPEN YILL+DMA +SFQWPSE SNAK+V+G+YEI            
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI------------

Query:  ---------GPGISHSNELVAAANEYSYYEESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS
                 GPG SHSNELVAA + YSYY E TIEQAQ        FTS PA+KKSS EDLLGAFINE RS
Subjt:  ---------GPGISHSNELVAAANEYSYYEESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]9.8e-5140.33Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +L MFL++CDT+KMN V ++ IRLRLFPFSL+DKAR WL+SLQP S+ SWQ+M + FL KFFPP KTAQLR+EIG FRQ D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------G
         L+EAWE                               R I+DAA+ G L+SKT E    LLE+MA++++QWP+E + AK+V+GI+E+ P          
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------G

Query:  ISH------------SNELVAAANEYSYYEESTIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFI
        +SH              E VAA++      E++ EQ Q                                  NVLQPPP F SQP++KK S ED + +F+
Subjt:  ISH------------SNELVAAANEYSYYEESTIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFI

Query:  NESRS
         E+++
Subjt:  NESRS

XP_023881727.1 uncharacterized protein LOC111994101 [Quercus suber]4.0e-5243.8Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +L MFL++CDTVKMN V ++ IRLRLFPFSL+DKAR WL+SLQP S+ SWQ+M +  L KFFP  KTAQLR+EIG FRQ D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWERIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------GISH------------SNELVAAANEYSYYEE
         L++AWER I+DAA+ G L+SKT E    LLE+MA++++QWP+E + AK+V+GI+E+ P          +SH            S E VAA++      E
Subjt:  QLFEAWERIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------GISH------------SNELVAAANEYSYYEE

Query:  STIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS
        ++ EQ Q                                  NVLQP   F SQP++KK S ED + +F+ E+++
Subjt:  STIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]2.6e-5140.98Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +L MFL++CDTVKMN V ++ IRLRLFPFSL+DKAR WL+SLQP S+ SWQ+M + FL KFFPP KTAQLR+EIG FRQ D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------G
         L+EAWE                               R I+DAA+ G L+SKT E    LLE+MA++++QWP+E + AK+V+GI+E+ P          
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP---------G

Query:  ISH------------SNELVAAANEYSYYEESTIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFI
        +SH            S E VAA++      E++ EQ Q                                  NVLQPPP F SQP++KK S ED + +F+
Subjt:  ISH------------SNELVAAANEYSYYEESTIEQAQ----------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAFI

Query:  NESRS
         E+++
Subjt:  NESRS

XP_030923419.1 uncharacterized protein LOC115950351 [Quercus lobata]4.7e-5346.28Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +L MFL++CDTVKMN V ++ IRLRLFPFSL+DKAR WL+SLQP S+ SWQEM + FL KFFPP KTAQLR+EIG F+Q D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWERIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELVAAANEYSY-----------------------Y
        Q      R I++AA+   L+SKT E G  LLE+MA++++QWP+E + AK+V+GI+E+ P  + S ++ + +++  Y                       +
Subjt:  QLFEAWERIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELVAAANEYSY-----------------------Y

Query:  EESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS
        E  +    +NVLQPPP F SQP++KK S ED + +F+ E+++
Subjt:  EESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS

TrEMBL top hitse value%identityAlignment
A0A2I4F4C8 uncharacterized protein LOC1089953732.0e-4142.52Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +L MFL +CDTVK+N V  + IRLRLFPFSL+DKAR WL+SLQ  S+ SWQ+M + FL KFFPP KT QLR+EI  F+Q D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPEN-GYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELV
         L+EAWE                               R I+DAAA G L+SKT E     LLE+M ++++QWP+E + AK+V GI+ I    +   + V
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPEN-GYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELV

Query:  AAANEYSYYEESTIEQAQ---NVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS
        AA +      E++ EQ Q   N       F SQ +KK  S ED + +F+ E+ +
Subjt:  AAANEYSYYEESTIEQAQ---NVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS

A0A2I4G4Q3 uncharacterized protein LOC1090047129.6e-4437.46Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  F  +  +DPN +LTMFL++CDTVK+N V ++ IRLRLFPFSL+D+AR WL+SLQP S+ SWQ+M + F  KFFPP KT QLR+EIG F+Q D+E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP----------
         L+EAWE                               R I+D  + G L+ KT E    LLE+MA++++QWP E + AK+V+ I+E+ P          
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGP----------

Query:  -----------GISHSNELVAAAN-------------------EYSY---------------YEESTIEQAQNVL--QPPPSFTSQPAKKKSSHEDLLGA
                    I  S E V A +                    Y+Y               +E  + E  +NVL  QPPP F SQ ++KK S ED + +
Subjt:  -----------GISHSNELVAAAN-------------------EYSY---------------YEESTIEQAQNVL--QPPPSFTSQPAKKKSSHEDLLGA

Query:  FINESRS
        FI E+ +
Subjt:  FINESRS

A0A6J1DU19 uncharacterized protein LOC1110243612.2e-6456.46Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MVRE+ FR NATEDPNN+LT+FLDVC TVKMN V D+ IRLRLFP SLQDK                 EMV+ FLT FFPP KT QLRTEI SFR+YDYE
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI------------
        QLFE WE                               R ILDAAA G LLS+TPEN YILL+DMA +SFQWPSE SNAK+V+G+YEI            
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI------------

Query:  ---------GPGISHSNELVAAANEYSYYEESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS
                 GPG SHSNELVAA + YSYY E TIEQAQ        FTS PA+KKSS EDLLGAFINE RS
Subjt:  ---------GPGISHSNELVAAANEYSYYEESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS

A0A6P6XAQ1 Reverse transcriptase1.8e-3743.09Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV++  +  NATEDPN++L+ FL++CDT+K N V+++ I+LRLFPFSL+DKA+ WL+S  P++  +W E+ K FL KFFPP KTA+LR +I SF Q + E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWER-------------------------------IILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI
         L+EAWER                                 +DAAA G L+ KT E    L+E+MAA+++QW +E  N++R +G+ E+
Subjt:  QLFEAWER-------------------------------IILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEI

A0A803PT47 Uncharacterized protein8.4e-4035.41Show/hide
Query:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE
        MV+ + F   ATEDPN +L +FL+VC  VKMN V D+ IRLRLFP SL+D+ R WL+S+QP S+++W EM + F+ KFFPP+K+AQLR+EIG FR  D E
Subjt:  MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYE

Query:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELVA
          +EAWE                               R ++DAA  G LLSK       LLE+MA +S+ WP+E +  K+++G++E+ P  + + ++ A
Subjt:  QLFEAWE-------------------------------RIILDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELVA

Query:  AANE--------------------YSYYEESTIEQAQ------------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAF
         +N+                     S   E +IEQAQ                                    NVLQ P  F +Q  + K   ED+LG F
Subjt:  AANE--------------------YSYYEESTIEQAQ------------------------------------NVLQPPPSFTSQPAKKKSSHEDLLGAF

Query:  INESR
        + ES+
Subjt:  INESR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGAAGATCCATTTAGGGCCAATGCTACAGAAGATCCAAACAACTATTTAACAATGTTTTTAGATGTTTGCGATACTGTTAAGATGAATTGGGTAAATGACAA
TGTTATTCGCTTACGCCTTTTCCCTTTTTCTTTGCAGGATAAGGCGAGAGATTGGTTGAAATCTTTGCAACCGGACAGTGTTAATTCTTGGCAGGAGATGGTTAAGACGT
TTCTCACAAAATTTTTCCCACCTACCAAGACAGCTCAACTTAGAACAGAGATTGGGTCATTCCGGCAATATGATTATGAGCAATTATTTGAGGCTTGGGAGAGAATTATA
CTGGATGCTGCAGCTAGAGGCATGTTACTATCCAAAACACCGGAGAATGGCTATATCTTACTAGAGGATATGGCAGCCAGTAGCTTTCAATGGCCTAGTGAGATATCAAA
TGCCAAAAGAGTTTCTGGAATCTATGAAATTGGACCAGGAATTTCTCATTCAAACGAGTTGGTGGCAGCAGCAAATGAGTATTCTTATTATGAGGAGTCAACGATCGAGC
AAGCTCAGAATGTTCTGCAACCTCCACCGAGCTTTACATCTCAGCCAGCTAAAAAGAAATCATCTCATGAGGATTTACTTGGGGCTTTCATCAATGAGTCTAGGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGAAGATCCATTTAGGGCCAATGCTACAGAAGATCCAAACAACTATTTAACAATGTTTTTAGATGTTTGCGATACTGTTAAGATGAATTGGGTAAATGACAA
TGTTATTCGCTTACGCCTTTTCCCTTTTTCTTTGCAGGATAAGGCGAGAGATTGGTTGAAATCTTTGCAACCGGACAGTGTTAATTCTTGGCAGGAGATGGTTAAGACGT
TTCTCACAAAATTTTTCCCACCTACCAAGACAGCTCAACTTAGAACAGAGATTGGGTCATTCCGGCAATATGATTATGAGCAATTATTTGAGGCTTGGGAGAGAATTATA
CTGGATGCTGCAGCTAGAGGCATGTTACTATCCAAAACACCGGAGAATGGCTATATCTTACTAGAGGATATGGCAGCCAGTAGCTTTCAATGGCCTAGTGAGATATCAAA
TGCCAAAAGAGTTTCTGGAATCTATGAAATTGGACCAGGAATTTCTCATTCAAACGAGTTGGTGGCAGCAGCAAATGAGTATTCTTATTATGAGGAGTCAACGATCGAGC
AAGCTCAGAATGTTCTGCAACCTCCACCGAGCTTTACATCTCAGCCAGCTAAAAAGAAATCATCTCATGAGGATTTACTTGGGGCTTTCATCAATGAGTCTAGGAGTTGA
Protein sequenceShow/hide protein sequence
MVREDPFRANATEDPNNYLTMFLDVCDTVKMNWVNDNVIRLRLFPFSLQDKARDWLKSLQPDSVNSWQEMVKTFLTKFFPPTKTAQLRTEIGSFRQYDYEQLFEAWERII
LDAAARGMLLSKTPENGYILLEDMAASSFQWPSEISNAKRVSGIYEIGPGISHSNELVAAANEYSYYEESTIEQAQNVLQPPPSFTSQPAKKKSSHEDLLGAFINESRS