; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g05470 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g05470
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr7:4573279..4574429
RNA-Seq ExpressionMoc07g05470
SyntenyMoc07g05470
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]1.1e-1327.65Show/hide
Query:  LVPFDPEIEKTCKRNRKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSF---------
        ++P DPEIE+T +  R+ K     + D  +L                 RT++DY +P    N+  I+  PINANNFELKP LI +V++  F         
Subjt:  LVPFDPEIEKTCKRNRKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSF---------

Query:  --------------------------------KDNARDWLKSLQPGSVNSW-------------------------------------------------
                                        +D AR WL+SLQPGS+ SW                                                 
Subjt:  --------------------------------KDNARDWLKSLQPGSVNSW-------------------------------------------------

Query:  ---------QIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK
                 Q+Q+FY   NG+TRT +D A GGTL+SKT E    LLE+M +N++Q P    + K
Subjt:  ---------QIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]1.5e-1326.52Show/hide
Query:  LVPFDPEIEKTCKRN-RKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFP-NHVGIINLPINANNFELKPGLIQIVRENSFK-------
        L+P DPEI++T +RN R    +TTE                  + +   + IRDY QP  P +  GI+N+PIN NNFELKPGLIQ+ RE +F+       
Subjt:  LVPFDPEIEKTCKRN-RKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFP-NHVGIINLPINANNFELKPGLIQIVRENSFK-------

Query:  ----------------------------------DNARDWLKSLQPGSVNSW------------------------------------------------
                                          D A+DWL+++ P S+ +W                                                
Subjt:  ----------------------------------DNARDWLKSLQPGSVNSW------------------------------------------------

Query:  ----------QIQIFYNG---RTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMP
                  QIQ+FYNG    T++ LD   GG++ SK  +  Y +LED+ T S+  P     P
Subjt:  ----------QIQIFYNG---RTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMP

XP_022157708.1 uncharacterized protein LOC111024361 [Momordica charantia]6.4e-4937.44Show/hide
Query:  PPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLPINANNFELKPGLIQIVRENSFKDNARD-------------------------------
        PP  VPIV  +E+PQL+QNNQ TIRDYCQPNFPNHVGIINLPINANN ELKPGLIQ+VREN+F+ NA +                               
Subjt:  PPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLPINANNFELKPGLIQIVRENSFKDNARD-------------------------------

Query:  --------------------------------------------W-------LKSLQPGSVNSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYIL
                                                    W        K  Q G++   QIQ+FY   NG+TRT LD A GGTLLS+T EN YIL
Subjt:  --------------------------------------------W-------LKSLQPGSVNSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYIL

Query:  LEDMTTNSFQCPMRDRMPK--------DLLASMKS-----------------------------------------------------------------
        L+DM  NSFQ P      K        D L+S+K+                                                                 
Subjt:  LEDMTTNSFQCPMRDRMPK--------DLLASMKS-----------------------------------------------------------------

Query:  ----------------------------------------------TSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLA
                                                       SD EV PREHCKA+TL+SGKELQEPEKKK+EEPVITTEEREN+E+VVKE T A
Subjt:  ----------------------------------------------TSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLA

Query:  LQVNKPTSSITSSPPNSSPYPQ
        LQ +KPTSSI SSPPNS PYPQ
Subjt:  LQVNKPTSSITSSPPNSSPYPQ

XP_022159119.1 uncharacterized protein LOC111025551 [Momordica charantia]2.3e-2275Show/hide
Query:  DLLASMKSTSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLALQVNKPTSSITSSPPNSSPYPQ
        +++   K  SDTEVNPREHCKA+TL+SGKELQEPEKKK+EEPVITTEEREN+E+VVKE T  LQ +KPTSSI SS PNS PYPQ
Subjt:  DLLASMKSTSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLALQVNKPTSSITSSPPNSSPYPQ

XP_022863493.1 uncharacterized protein LOC111383609 [Olea europaea var. sylvestris]5.1e-1435.37Show/hide
Query:  KQNNQRTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSFKDNARDWLKSLQPGSVNSW---------------------------------
        + N QR IRDY +P    N+ GI+   I ANNFELKPG           D A+ W +SL  GS+  W                                 
Subjt:  KQNNQRTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSFKDNARDWLKSLQPGSVNSW---------------------------------

Query:  ---------QIQIFYNGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPKDLL
                 QI+IFYNG+TRT LD A GG L++KT E  Y LL+D+ T+S+Q P      K++L
Subjt:  ---------QIQIFYNGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPKDLL

TrEMBL top hitse value%identityAlignment
A0A2I4E1Q5 uncharacterized protein LOC1089854725.2e-1233.96Show/hide
Query:  RTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSF-----------------------------------------KDNARDWLKSLQPGSV
        RT++DY +P    N+ GI    INANNFELKP LI +V++  F                                         +D ARD ++      +
Subjt:  RTIRDYCQPNF-PNHVGIINLPINANNFELKPGLIQIVRENSF-----------------------------------------KDNARDWLKSLQPGSV

Query:  NSW-QIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK
          W Q+Q+FY   NG+TRT +D   GGTL+SKT E    LLE+MT+N++Q P+   + K
Subjt:  NSW-QIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK

A0A2I4F4G9 uncharacterized protein LOC1089954237.5e-1134.81Show/hide
Query:  RTIRDYCQPNFPNHVGIINL-PINANNFELKPGLIQIVRENSF-----------------------------------------KDNARDWLKSLQPGSV
        RT++DY +P   ++   I    INANNFELK  LI +V++  F                                         ++NARDWL        
Subjt:  RTIRDYCQPNFPNHVGIINL-PINANNFELKPGLIQIVRENSF-----------------------------------------KDNARDWLKSLQPGSV

Query:  NSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK
           Q+Q+FY   NG+TRT +DVA GGTL+SKT+E    LLE+MT N++Q P +  M K
Subjt:  NSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPK

A0A3S3N117 Retrotrans_gag domain-containing protein1.2e-1125.56Show/hide
Query:  MCKDKDAVLVPFDPEIEKTCKRNRKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLP-INANNFELKPGLIQIVREN---
        M ++++  LVP DPEIE+T +R +KEKK+ +E              E  ++K+   R++ DY  P        I  P I ANNFE+KP +IQ+V      
Subjt:  MCKDKDAVLVPFDPEIEKTCKRNRKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLP-INANNFELKPGLIQIVREN---

Query:  ---------------------------------------SFKDNARDWLKSLQPGSVNSW----------------------------------------
                                               S +D A+ WL SL   ++ +W                                        
Subjt:  ---------------------------------------SFKDNARDWLKSLQPGSVNSW----------------------------------------

Query:  ------------------QIQIFYNG---RTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCP
                          Q+Q FYNG    TRT++D A GGTL+ K+ E  Y L+E+M TN++Q P
Subjt:  ------------------QIQIFYNG---RTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCP

A0A6J1DU19 uncharacterized protein LOC1110243613.1e-4937.44Show/hide
Query:  PPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLPINANNFELKPGLIQIVRENSFKDNARD-------------------------------
        PP  VPIV  +E+PQL+QNNQ TIRDYCQPNFPNHVGIINLPINANN ELKPGLIQ+VREN+F+ NA +                               
Subjt:  PPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLPINANNFELKPGLIQIVRENSFKDNARD-------------------------------

Query:  --------------------------------------------W-------LKSLQPGSVNSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYIL
                                                    W        K  Q G++   QIQ+FY   NG+TRT LD A GGTLLS+T EN YIL
Subjt:  --------------------------------------------W-------LKSLQPGSVNSWQIQIFY---NGRTRTTLDVAHGGTLLSKTLENTYIL

Query:  LEDMTTNSFQCPMRDRMPK--------DLLASMKS-----------------------------------------------------------------
        L+DM  NSFQ P      K        D L+S+K+                                                                 
Subjt:  LEDMTTNSFQCPMRDRMPK--------DLLASMKS-----------------------------------------------------------------

Query:  ----------------------------------------------TSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLA
                                                       SD EV PREHCKA+TL+SGKELQEPEKKK+EEPVITTEEREN+E+VVKE T A
Subjt:  ----------------------------------------------TSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLA

Query:  LQVNKPTSSITSSPPNSSPYPQ
        LQ +KPTSSI SSPPNS PYPQ
Subjt:  LQVNKPTSSITSSPPNSSPYPQ

A0A6J1E1F6 uncharacterized protein LOC1110255511.1e-2275Show/hide
Query:  DLLASMKSTSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLALQVNKPTSSITSSPPNSSPYPQ
        +++   K  SDTEVNPREHCKA+TL+SGKELQEPEKKK+EEPVITTEEREN+E+VVKE T  LQ +KPTSSI SS PNS PYPQ
Subjt:  DLLASMKSTSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITTEERENEEKVVKEPTLALQVNKPTSSITSSPPNSSPYPQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTGATTTGTATGTGCAAGGATAAAGATGCAGTTTTAGTTCCTTTTGATCCTGAAATTGAGAAAACCTGTAAAAGAAATCGAAAGGAGAAAAAGGAGACGACTGA
AGATATGGATCCACCACTACTTGTACCAATTGTTCCACTTGTTGAACAACCCCAGCTGAAACAGAATAACCAAAGAACCATTAGAGATTATTGCCAGCCAAACTTTCCTA
ATCATGTTGGAATAATCAATTTGCCTATTAATGCCAACAATTTTGAGTTGAAACCCGGCCTTATCCAGATCGTTCGAGAAAATTCATTCAAGGATAATGCACGGGATTGG
TTGAAATCCTTGCAACCAGGCAGTGTTAATTCTTGGCAGATTCAAATATTTTACAATGGACGAACAAGAACTACACTAGATGTTGCACATGGAGGCACATTACTATCTAA
AACACTGGAGAATACTTACATCTTACTGGAGGACATGACAACCAATAGCTTTCAATGCCCCATGAGAGATCGAATGCCAAAAGACTTGCTGGCATCTATGAAATCGACGA
GTGATACTGAAGTTAACCCACGAGAACATTGCAAAGCCATCACTTTGAAAAGTGGAAAGGAACTCCAGGAGCCTGAAAAGAAAAAAGTTGAAGAACCAGTCATCACAACT
GAGGAACGGGAAAATGAGGAGAAAGTTGTAAAGGAGCCCACTCTTGCTCTACAGGTTAACAAACCCACTAGTTCTATTACTTCTAGTCCTCCTAACTCTTCACCTTATCC
TCAACGTTTCCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTGATTTGTATGTGCAAGGATAAAGATGCAGTTTTAGTTCCTTTTGATCCTGAAATTGAGAAAACCTGTAAAAGAAATCGAAAGGAGAAAAAGGAGACGACTGA
AGATATGGATCCACCACTACTTGTACCAATTGTTCCACTTGTTGAACAACCCCAGCTGAAACAGAATAACCAAAGAACCATTAGAGATTATTGCCAGCCAAACTTTCCTA
ATCATGTTGGAATAATCAATTTGCCTATTAATGCCAACAATTTTGAGTTGAAACCCGGCCTTATCCAGATCGTTCGAGAAAATTCATTCAAGGATAATGCACGGGATTGG
TTGAAATCCTTGCAACCAGGCAGTGTTAATTCTTGGCAGATTCAAATATTTTACAATGGACGAACAAGAACTACACTAGATGTTGCACATGGAGGCACATTACTATCTAA
AACACTGGAGAATACTTACATCTTACTGGAGGACATGACAACCAATAGCTTTCAATGCCCCATGAGAGATCGAATGCCAAAAGACTTGCTGGCATCTATGAAATCGACGA
GTGATACTGAAGTTAACCCACGAGAACATTGCAAAGCCATCACTTTGAAAAGTGGAAAGGAACTCCAGGAGCCTGAAAAGAAAAAAGTTGAAGAACCAGTCATCACAACT
GAGGAACGGGAAAATGAGGAGAAAGTTGTAAAGGAGCCCACTCTTGCTCTACAGGTTAACAAACCCACTAGTTCTATTACTTCTAGTCCTCCTAACTCTTCACCTTATCC
TCAACGTTTCCAATAG
Protein sequenceShow/hide protein sequence
MELICMCKDKDAVLVPFDPEIEKTCKRNRKEKKETTEDMDPPLLVPIVPLVEQPQLKQNNQRTIRDYCQPNFPNHVGIINLPINANNFELKPGLIQIVRENSFKDNARDW
LKSLQPGSVNSWQIQIFYNGRTRTTLDVAHGGTLLSKTLENTYILLEDMTTNSFQCPMRDRMPKDLLASMKSTSDTEVNPREHCKAITLKSGKELQEPEKKKVEEPVITT
EERENEEKVVKEPTLALQVNKPTSSITSSPPNSSPYPQRFQ