CuGenDBv2

Moc06g05100 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g05100
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr6:3710703..3711592
RNA-Seq Expression: Moc06g05100
Synteny: Moc06g05100
Gene Ontology terms:
  GO:0006259 - DNA metabolic process (biological process)
  GO:0016020 - membrane (cellular component)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR005162 - Retrotransposon gag domain
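
The genome location is given as a `chr:start..end` string. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for records like this, not something the page states), the span of the gene can be computed as follows. `parse_location` and `span_length` are hypothetical helper names.

```python
import re


def parse_location(loc):
    """Split a string such as 'chr6:3710703..3711592' into (chromosome, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(loc):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    _, start, end = parse_location(loc)
    return end - start + 1


print(span_length("chr6:3710703..3711592"))
```

Under that assumption the gene spans 890 bp on chr6.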


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0057594.1 uncharacterized protein E6C27_scaffold497G00710 [Cucumis melo var. makuwa]; e value 8.8e-86; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        M ++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N+M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFPDNVE +ARRKLR+LRHTG IW+YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

TYK19839.1 uncharacterized protein E5676_scaffold811G00460 [Cucumis melo var. makuwa]; e value 3.4e-85; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
        GEL+N+   ++++ N M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFP+NVE +ARRKLR+LRHTG I +YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

TYK21000.1 uncharacterized protein E5676_scaffold328G00270 [Cucumis melo var. makuwa]; e value 8.8e-86; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        M ++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N+M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFPDNVE +ARRKLR+LRHTG IW+YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]; e value 2.8e-140; identity 88.47%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFK                           APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTI DYVKQFS VM+DIRDMSEKDKVFVFI+GLK
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]; e value 3.5e-135; identity 86.78%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFK                           APNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTI DYVKQFS VMLDIRDMSEKDKVFVFIEGLK
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7UR61 Retrotrans_gag domain-containing protein; e value 4.3e-86; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        M ++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N+M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFPDNVE +ARRKLR+LRHTG IW+YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

A0A5D3D8P6 Retrotrans_gag domain-containing protein; e value 1.6e-85; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        MS++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
        GEL+N+   ++++ N M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFP+NVE +ARRKLR+LRHTG I +YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

A0A5D3DC04 Retrotrans_gag domain-containing protein; e value 4.3e-86; identity 56.71%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM
        M ++    K+  DRLVEIEEQ+LYL EVPDS+R LE+RVDE SEK   IDAV  RV+GLPIQ++  RV+ LE  + A R  ++ERG+SS+     +E R+
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLE--SKATRPGSFERGDSSTNPNTQIEVRM

Query:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK
         EL+N+   ++++ N+M+EDF+                           AP    +  +K+KVPEPKPF G RDAK LEN++FD+EQYFKAT T +EE K
Subjt:  GELNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMK

Query:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
        VTLATMHL++DAKLWWRS+  DIQ GRCT+++WD LK+ELR QFFPDNVE +ARRKLR+LRHTG IW+YVKQF+ +MLDIRDMSEKDKVF F+EGLKP
Subjt:  VTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP

A0A6J1D906 Reverse transcriptase; e value 1.0e-140; identity 88.81%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MS TKQLSKSHVDRLVEIEEQLLYLREVPD LRLLEARVDEFSEKFGEIDAVNAR+DGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHSAMMQLFNEMTEDFK                           APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
Subjt:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTI DYVKQFS VM+DIRDMSEKDKVFVFIEGLK
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK

A0A6J1DK29 uncharacterized protein LOC111021829; e value 1.7e-135; identity 86.78%
Query:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE
        MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVET ESKATRPGSFERGDSSTNPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGE

Query:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT
        LNNSHS MMQLFNEMTEDFK                           APNQ NMGFNKLKVPEPKPFNGNR  KDLENF FDVEQYFK TGT SE MKVT
Subjt:  LNNSHSAMMQLFNEMTEDFK---------------------------APNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVT

Query:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK
        LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFF DNVEFMARRKLRELRHTGTI DYVKQFS VMLDIRDMSEKDKVFVFIEGLK
Subjt:  LATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNVEFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLK

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGTCGGCGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCATTCGGCAATGATGCAA
TTGTTTAACGAAATGACAGAAGACTTCAAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCAAA
AGATCTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAA
AGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTC
GAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCTGGGACTACGTGAAACAATTCTCTGTCGTGATGCTGGATATTCGCGACATGTCAGAGAA
AGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGA
mRNA sequence
ATGTCGGCGACAAAACAACTGAGCAAGTCGCACGTCGACCGACTGGTAGAGATAGAAGAACAACTTCTCTACCTAAGAGAAGTCCCAGATTCCCTCCGTCTGCTGGAGGC
GCGAGTTGATGAATTCTCCGAGAAGTTTGGAGAAATAGACGCAGTGAATGCCCGTGTAGACGGGTTGCCGATACAAGATATAGCCATGAGGGTTGAGACCCTAGAAAGCA
AAGCTACGCGTCCTGGTAGCTTCGAACGTGGAGATAGTTCCACGAACCCAAACACACAGATAGAAGTACGTATGGGAGAGTTAAACAACTCGCATTCGGCAATGATGCAA
TTGTTTAACGAAATGACAGAAGACTTCAAAGCTCCCAACCAAGCGAACATGGGGTTCAACAAGTTGAAGGTCCCAGAGCCCAAACCATTCAATGGCAATAGAGACGCAAA
AGATCTCGAGAACTTCCTGTTCGACGTAGAACAGTACTTCAAGGCTACGGGGACAACGTCAGAAGAGATGAAAGTGACTTTGGCCACCATGCATCTTACTGATGATGCAA
AGCTGTGGTGGAGATCTAAAGTCAACGACATTCAGAATGGTCGATGCACGATCAATAGCTGGGATGATCTGAAGAAAGAATTGAGGGGTCAGTTCTTCCCCGACAATGTC
GAGTTCATGGCTAGAAGGAAGCTACGTGAACTCCGACACACTGGAACAATCTGGGACTACGTGAAACAATTCTCTGTCGTGATGCTGGATATTCGCGACATGTCAGAGAA
AGACAAGGTGTTCGTCTTTATCGAAGGATTGAAACCGTGA
Protein sequence
MSATKQLSKSHVDRLVEIEEQLLYLREVPDSLRLLEARVDEFSEKFGEIDAVNARVDGLPIQDIAMRVETLESKATRPGSFERGDSSTNPNTQIEVRMGELNNSHSAMMQ
LFNEMTEDFKAPNQANMGFNKLKVPEPKPFNGNRDAKDLENFLFDVEQYFKATGTTSEEMKVTLATMHLTDDAKLWWRSKVNDIQNGRCTINSWDDLKKELRGQFFPDNV
EFMARRKLRELRHTGTIWDYVKQFSVVMLDIRDMSEKDKVFVFIEGLKP
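
The protein sequence appears to be the conceptual translation of the CDS sequence listed above. The following is a minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1). `translate` is a hypothetical helper, not part of CuGenDB.

```python
# Standard genetic code (assumed: NCBI translation table 1),
# laid out with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases of the CDS sequence listed above.
print(translate("ATGTCGGCGACAAAACAACTGAGCAAGTCGCACGTCGAC"))
```

This prints `MSATKQLSKSHVD`, which matches the first residues of the listed protein. Passing the full CDS (with line breaks left in) should work the same way, since whitespace is removed before translation.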