; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g18480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g18480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr6:14453741..14454433
RNA-Seq ExpressionMoc06g18480
SyntenyMoc06g18480
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059143.1 uncharacterized protein E6C27_scaffold430G00550 [Cucumis melo var. makuwa]2.5e-5048.71Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  D+LVEIEEQ+LYL EV DS+R LE+RVDE SEK   IDAV  R++GLPI+++  RV+ LE                            
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDFKVT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

TYK31632.1 uncharacterized protein E5676_scaffold340G00230 [Cucumis melo var. makuwa]3.3e-5048.71Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  D+LVEIEEQ+LYL EV DS+R LE+RVDE SEK   IDAV  R++GLPIQ++  RV+ LE                            
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+VT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]7.7e-9281.74Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------
        MSTTKQLSKSHVD+LVEIEEQLLYLREV D LRLLEARVDEFSEKF EIDAVNARIDGLPIQDIAMRVETLESKATRP                      
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]9.7e-8777.83Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVD+LVEIEEQLLYLREV DSLRLLEARVDEFSEKF EIDAVNAR+DGLPIQDIAMRVET ESKATRP                      
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS

XP_031745591.1 uncharacterized protein LOC116406038 isoform X1 [Cucumis sativus]1.9e-5046.98Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATR-----------------------
        M T K+  K+H ++LV IEEQ+L+L+EVSDS+R +E R+++ +++   +DAV+ R+DGLPI D+  RV+TLES+  +                       
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATR-----------------------

Query:  -----------------PEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+ T+D +R E+ E++T+VNLTMRA+ NQAP    +   K+K+PEPKPF G RDAK LENF+FD+E+YFKAT T +EE K
Subjt:  -----------------PEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRC I++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

TrEMBL top hitse value%identityAlignment
A0A5A7UT87 Retrotrans_gag domain-containing protein1.2e-5048.71Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  D+LVEIEEQ+LYL EV DS+R LE+RVDE SEK   IDAV  R++GLPI+++  RV+ LE                            
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDFKVT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

A0A5D3D8P6 Retrotrans_gag domain-containing protein1.6e-5048.71Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  D+LVEIEEQ+LYL EV DS+R LE+RVDE SEK   IDAV  R++GLPIQ++  RV+ LE                            
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+VT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

A0A5D3E6S8 Retrotrans_gag domain-containing protein1.6e-5048.71Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------
        MS++    K+  D+LVEIEEQ+LYL EV DS+R LE+RVDE SEK   IDAV  R++GLPIQ++  RV+ LE                            
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKAT------------------------

Query:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
                          EDF+VT+D +R E+ +++ R++LTMRA+ NQAP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  ----------------RPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINS

A0A6J1D906 Reverse transcriptase3.7e-9281.74Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------
        MSTTKQLSKSHVD+LVEIEEQLLYLREV D LRLLEARVDEFSEKF EIDAVNARIDGLPIQDIAMRVETLESKATRP                      
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS

A0A6J1DK29 uncharacterized protein LOC1110218294.7e-8777.83Show/hide
Query:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------
        MS TKQLSKSHVD+LVEIEEQLLYLREV DSLRLLEARVDEFSEKF EIDAVNAR+DGLPIQDIAMRVET ESKATRP                      
Subjt:  MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRP----------------------

Query:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
                        EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  ----------------EDFKVTIDTLRAEMTEISTRVNLTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS
        LATMHLTDDAKLWWRSKVNDIQNGRCTINS
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCAACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCTCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTAGATGAATTCTCCGAGAAGTTTAGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCAATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCAAACATGGGGTTCAACAAGTTAAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGACGACAAAACAACTGAGCAAGTCGCACGTCGACCAACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCTCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTAGATGAATTCTCCGAGAAGTTTAGAGAAATAGACGCAGTGAATGCCCGTATAGACGGGTTGCCAATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGAAGACTTCAAAGTAACCATCGACACCCTCCGAGCTGAGATGACTGAAATAAGCACTAGAGTGAACCTAACCATGCGAGCCGTGGGAAATCAGGCT
CCCAACCAAGCAAACATGGGGTTCAACAAGTTAAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGAGACGCAAAAGATCTCGAGAACTTCCTGTTCGACGTAGAACA
GTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAAAGCTGTGGTGGAGATCTAAAGTCAACGACATTC
AGAATGGTCGATGCACGATCAATAGCTAG
Protein sequenceShow/hide protein sequence
MSTTKQLSKSHVDQLVEIEEQLLYLREVSDSLRLLEARVDEFSEKFREIDAVNARIDGLPIQDIAMRVETLESKATRPEDFKVTIDTLRAEMTEISTRVNLTMRAVGNQA
PNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINS