; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:9969517..9970445
RNA-Seq ExpressionMoc03g14800
SyntenyMoc03g14800
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia]4.9e-6550.93Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        M V +SFK E +S E +RLKLFPY LRD A  WL+ +P ESITSW+DLAEKFL++YFPPSKNA+ RS+INNFQQ   +S+SESWE FK LL   PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQL-----VIDASANGELLSKSYIEAFDICERIS----RNKHQWSKI-------EVNSVHNVSCPYYEGEHHFEDYLGNPASMFN
        RCIQIE YY  LN+AT+L     V   S+ G + S+SY       E ++    R+  Q S +        VN +  +SC + EG+HH+ +  GNP S++ 
Subjt:  RCIQIEIYYDGLNEATQL-----VIDASANGELLSKSYIEAFDICERIS----RNKHQWSKI-------EVNSVHNVSCPYYEGEHHFEDYLGNPASMFN

Query:  LGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQ
        LG+ + NR N+Y NTYN  WRNHPNF W G+Q G N   AG S+AP +Q K  YP GF  Q Q + ++Q
Subjt:  LGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQ

XP_030497851.1 uncharacterized protein LOC115713509 [Cannabis sativa]1.8e-5943.37Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        ++V+DSFK++ VS+EA+RLKLFP+ LRD A AWL+ +P +S+T+WNDLAEKFL +YFPP++NA+ RS+I +FQQL  ++ S++WERFK LL + PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------
         CIQ+E +Y+GLN A+++V+DASANG +LSKSY EAF+I ERI+ N +QWS             +EV+ +                              
Subjt:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------

Query:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQP--
           +SC Y    H FE+   NPAS+  +G+Q   R  N Y N+YN +W++HPNF WG        G   +S+    Q K  +PLGFS Q +     QP  
Subjt:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQP--

Query:  -KAMTLEDM
         +A +LE +
Subjt:  -KAMTLEDM

XP_030507648.1 uncharacterized protein LOC115722545 [Cannabis sativa]2.6e-5843.46Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        ++V+DSFK++ VS+EA+RLKLFP+ LRD A AWL+ +P +S+T+WNDLAEKFL +YFPP++NA+ RS+I +FQQL  ++ S++WERFK LL + PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------
         CIQ+E +Y+GLN  T++V+DASANG +LSKSY EAF+I ERI+ N +QWS             +EV+++                              
Subjt:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------

Query:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQPKA
           +SC Y    H FE+   N AS+  +G+Q  NR  N Y N+YN +W++HPNF WG    GQ   ++GA  A   Q K  +P GFS Q ++   +  + 
Subjt:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQPKA

Query:  MTLEDM
         +LE +
Subjt:  MTLEDM

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]1.1e-5944.15Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        ++V+DSFK++ VS+EA+RLKLFP+ LRD A AWL+ +P +S+T+WNDLAEKFL +YFPP++NA+ RS+I +FQQL  ++ S++WERFK LL + PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------
         CIQ+E +Y+GLN A ++V+DA ANG +LSKSY EAF+I ERI+ N +QWS             +EV+++                              
Subjt:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------

Query:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQPK
           +SC Y    H FE++  NPAS+  +G+Q  NR  N Y N+YN +W++HPNF WG    GQ   ++GA  A   Q K  +  GFS Q +     QP+
Subjt:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQPK

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa]2.8e-6045.33Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        ++V+DSFK++ VS+EA+RLKLFP+ LRD A AWL+ +P++S+T+WNDLAE FL +YFPP++NA+ RS+I +FQQL  ++ S++WERFK LL + PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------
         CIQ+E +Y+GLN A+++V+DASANG +LSKSY EAF+I ERI+ N +QWS             +EV+++                              
Subjt:  RCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSK------------IEVNSV------------------------------

Query:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQ
           +SC Y    H FE+   NPAS+  +G+Q  NR  N Y N+YN +W++HPNF WGG Q   ++GA G       Q K  +P GFS Q
Subjt:  -HNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNR-INVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQ

TrEMBL top hitse value%identityAlignment
A0A6J1DRG1 uncharacterized protein LOC1110236695.0e-5548.37Show/hide
Query:  IQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWS--
        ++YFPP KNA+ RS+I NFQQ+ R+S++ESWERFK LL + PHHGI RCIQIE YY GL++AT+LVIDAS NG LL K Y EAF+I ERIS N H WS  
Subjt:  IQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRCIQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWS--

Query:  ----------------------------------------------KIEVNSVHNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNRINVYGNTYNLSWR
                                                      K  V+ +  +SC + EGEHH+ +Y  NP S++ LG+ + N  N Y NTYN  WR
Subjt:  ----------------------------------------------KIEVNSVHNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNRINVYGNTYNLSWR

Query:  NHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQ
        NHPNF W GNQ G N   AG SNAP YQQK  YP  FS Q Q  VQ
Subjt:  NHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQ

A0A6J1E1F3 uncharacterized protein LOC1110250652.4e-6550.93Show/hide
Query:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ
        M V +SFK E +S E +RLKLFPY LRD A  WL+ +P ESITSW+DLAEKFL++YFPPSKNA+ RS+INNFQQ   +S+SESWE FK LL   PHHGI 
Subjt:  MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQ

Query:  RCIQIEIYYDGLNEATQL-----VIDASANGELLSKSYIEAFDICERIS----RNKHQWSKI-------EVNSVHNVSCPYYEGEHHFEDYLGNPASMFN
        RCIQIE YY  LN+AT+L     V   S+ G + S+SY       E ++    R+  Q S +        VN +  +SC + EG+HH+ +  GNP S++ 
Subjt:  RCIQIEIYYDGLNEATQL-----VIDASANGELLSKSYIEAFDICERIS----RNKHQWSKI-------EVNSVHNVSCPYYEGEHHFEDYLGNPASMFN

Query:  LGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQ
        LG+ + NR N+Y NTYN  WRNHPNF W G+Q G N   AG S+AP +Q K  YP GF  Q Q + ++Q
Subjt:  LGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQ

A0A6J1EEI2 uncharacterized protein LOC1114333943.6e-5341.12Show/hide
Query:  VADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRC
        V+DSF+ +RV K+ +RL LFPY LRD A +WL+ +   +I SWN L EKFLI+YFPP++NA  R++I  FQQ    +LSE+WERFK +L + PHHG+  C
Subjt:  VADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRC

Query:  IQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN-----------------------------------------SVHNV
        IQ+E +Y+GLN AT+ V+DASANG +LSK+Y EA++I ERI+ N  QW+ +  N                                          VH V
Subjt:  IQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN-----------------------------------------SVHNV

Query:  ---------SCPYYEGEHHFEDYLGNPASMFNLGDQRPN---RINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTM
                 SC Y   EH F+    NPAS+F +G+Q      + N + NTYN  WRNHPNF W G       G+      P    K+ YP GF  QNQ  
Subjt:  ---------SCPYYEGEHHFEDYLGNPASMFNLGDQRPN---RINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTM

Query:  VQKQ
           Q
Subjt:  VQKQ

A0A6J1EQ90 uncharacterized protein LOC1114364118.2e-5039.93Show/hide
Query:  ADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRCI
        +DSF+ + V K+ +RL LFPY LRD A +WL+ +   +I SWN LAE FLI+YFPP++NA  +++I  FQQ   ++LSE+ ERFK +L + PHHG+  CI
Subjt:  ADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRCI

Query:  QIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN----------------------SVHNV--------------------
        Q+E +Y+GLN  T+ V+DASANG +LSK+Y EA++I ERI+ N  QW+ +  N                      SV N+                    
Subjt:  QIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN----------------------SVHNV--------------------

Query:  --------SCPYYEGEHHFEDYLGNPASMFNLGDQRPN---RINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMV
                SC Y   EH F+    NPAS+F +G+Q      + N + NTYN  WRNHPNF W G QS  N              K+ YP GF  QNQ   
Subjt:  --------SCPYYEGEHHFEDYLGNPASMFNLGDQRPN---RINVYGNTYNLSWRNHPNFGWGGNQSGQNTGAAGASNAPTYQQKSQYPLGFSGQNQTMV

Query:  QKQ
          Q
Subjt:  QKQ

A0A6J1H7E4 uncharacterized protein LOC1114611682.2e-5042.54Show/hide
Query:  VADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRC
        V+DSF+ + V K+ +RL LFPY LRD A +WL+ +   +I SWN LAEKFLI+YFPP++NA  R++I  FQQ   ++LSE+WERFK +L + PHHG+  C
Subjt:  VADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRC

Query:  IQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN----------------------SVHNV-------------------
        IQ+E +Y+GLN AT+ V+DASANG +LSK+Y EA++I ERI+ N  QW+ +  N                      SV N+                   
Subjt:  IQIEIYYDGLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVN----------------------SVHNV-------------------

Query:  ---------SCPYYEGEHHFEDYLGNPASMF---NLGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQS
                 SC Y   EH F+    NPAS+    N   Q   + N   NTYN  WRNHPNF W G  S
Subjt:  ---------SCPYYEGEHHFEDYLGNPASMF---NLGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTGGCTGACTCATTTAAGATAGAAAGAGTAAGCAAAGAAGCAATGCGATTGAAGCTATTCCCTTATTTCCTAAGGGATAGTGCTGGGGCATGGTTAGATTTAGT
GCCCGCTGAGTCCATCACTTCATGGAATGATTTAGCAGAGAAGTTCCTAATACAATACTTCCCACCATCGAAAAATGCGGAGCTTAGGAGCAAAATAAATAATTTTCAAC
AACTCCCAAGAAAATCTTTGAGTGAATCTTGGGAAAGATTCAAAGGATTGCTCCATAGGTGGCCACATCACGGTATACAACGTTGTATTCAGATTGAAATATATTATGAC
GGTTTGAATGAGGCGACGCAGTTAGTAATAGATGCCTCTGCAAATGGAGAATTATTATCAAAATCATATATTGAAGCCTTTGATATCTGTGAGAGAATTTCACGCAATAA
GCATCAGTGGTCAAAAATCGAGGTAAATAGTGTGCACAATGTTTCATGCCCTTATTACGAAGGTGAGCATCATTTTGAGGATTATCTAGGCAACCCAGCTTCAATGTTTA
ATTTGGGTGATCAACGACCAAATAGGATAAATGTTTATGGAAACACTTATAATCTGAGTTGGAGAAATCACCCTAATTTTGGCTGGGGCGGAAATCAATCAGGGCAGAAT
ACTGGAGCAGCTGGAGCTAGCAATGCTCCAACATATCAACAGAAAAGTCAGTATCCACTTGGATTTTCAGGGCAAAATCAAACGATGGTTCAAAAGCAACCTAAGGCTAT
GACCCTAGAGGATATGTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTGGCTGACTCATTTAAGATAGAAAGAGTAAGCAAAGAAGCAATGCGATTGAAGCTATTCCCTTATTTCCTAAGGGATAGTGCTGGGGCATGGTTAGATTTAGT
GCCCGCTGAGTCCATCACTTCATGGAATGATTTAGCAGAGAAGTTCCTAATACAATACTTCCCACCATCGAAAAATGCGGAGCTTAGGAGCAAAATAAATAATTTTCAAC
AACTCCCAAGAAAATCTTTGAGTGAATCTTGGGAAAGATTCAAAGGATTGCTCCATAGGTGGCCACATCACGGTATACAACGTTGTATTCAGATTGAAATATATTATGAC
GGTTTGAATGAGGCGACGCAGTTAGTAATAGATGCCTCTGCAAATGGAGAATTATTATCAAAATCATATATTGAAGCCTTTGATATCTGTGAGAGAATTTCACGCAATAA
GCATCAGTGGTCAAAAATCGAGGTAAATAGTGTGCACAATGTTTCATGCCCTTATTACGAAGGTGAGCATCATTTTGAGGATTATCTAGGCAACCCAGCTTCAATGTTTA
ATTTGGGTGATCAACGACCAAATAGGATAAATGTTTATGGAAACACTTATAATCTGAGTTGGAGAAATCACCCTAATTTTGGCTGGGGCGGAAATCAATCAGGGCAGAAT
ACTGGAGCAGCTGGAGCTAGCAATGCTCCAACATATCAACAGAAAAGTCAGTATCCACTTGGATTTTCAGGGCAAAATCAAACGATGGTTCAAAAGCAACCTAAGGCTAT
GACCCTAGAGGATATGTTTTAA
Protein sequenceShow/hide protein sequence
MQVADSFKIERVSKEAMRLKLFPYFLRDSAGAWLDLVPAESITSWNDLAEKFLIQYFPPSKNAELRSKINNFQQLPRKSLSESWERFKGLLHRWPHHGIQRCIQIEIYYD
GLNEATQLVIDASANGELLSKSYIEAFDICERISRNKHQWSKIEVNSVHNVSCPYYEGEHHFEDYLGNPASMFNLGDQRPNRINVYGNTYNLSWRNHPNFGWGGNQSGQN
TGAAGASNAPTYQQKSQYPLGFSGQNQTMVQKQPKAMTLEDMF