; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g15540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g15540
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr4:11681114..11682010
RNA-Seq ExpressionMoc04g15540
SyntenyMoc04g15540
Gene Ontology termsGO:0006259 - DNA metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065526.1 uncharacterized protein E6C27_scaffold638G00290 [Cucumis melo var. makuwa]3.8e-9058.17Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A++S R++ TMRA+ +QAP    +  +K+KVPEPKPF G RYAK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

TYK19839.1 uncharacterized protein E5676_scaffold811G00460 [Cucumis melo var. makuwa]3.3e-8957.84Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A+++ R++ TMRA+ NQAP    +  +K+KVPEPKPF G R AK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

TYK29292.1 uncharacterized protein E5676_scaffold1212G00600 [Cucumis melo var. makuwa]4.2e-8957.84Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A+++ R++ TMRA+ NQAP    +  +K+KVPEPKPF G R AK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]1.3e-14185.86Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE
        MS TKQLSKSHVD+LVEIEEQLLYL+EVPD LRLLE RVDEFS+KFGEIDAVNAR+DGLPIQ +A RVE L+SKATRPGSF+R D+  NPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE

Query:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT
        LNNS SAMMQLFNEMTEDFKVTID LRAEM E+STRVN TMRAVGNQAPNQANMGFNKLKVPEPKPFNGNR AKDLEN++FDVEQYFKATGTTSEEMKVT
Subjt:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT

Query:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART
        LATMH T+DAKLWWRSKVNDIQNG CTIN+WDDLKKELRG FFP NVEFMARRKLR      TIRDYVKQ S VM+DIRDMSEKDKVFVF++GLK WART
Subjt:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART

Query:  KLYE
        KLYE
Subjt:  KLYE

XP_022154605.1 uncharacterized protein LOC111021829 [Momordica charantia]1.9e-13784.54Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE
        MSATKQLSKSHVD+LVEIEEQLLYL+EVPD+LRLLE RVDEFS+KFGEIDAVNARVDGLPIQ +A RVE  +SKATRPGSF+R D+  NPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE

Query:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT
        LNNS S MMQLFNEMTEDFKVTID LRAEM E+STRVN TMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLEN+ FDVEQYFK TGT SE MKVT
Subjt:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT

Query:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART
        LATMH T+DAKLWWRSKVNDIQNG CTIN+WDDLKKELRG FF  NVEFMARRKLR      TIRDYVKQ S VMLDIRDMSEKDKVFVF+EGLK WART
Subjt:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART

Query:  KLYE
        KLYE
Subjt:  KLYE

TrEMBL top hitse value%identityAlignment
A0A5A7VGS4 Retrotrans_gag domain-containing protein1.9e-9058.17Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A++S R++ TMRA+ +QAP    +  +K+KVPEPKPF G RYAK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

A0A5D3D8P6 Retrotrans_gag domain-containing protein1.6e-8957.84Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A+++ R++ TMRA+ NQAP    +  +K+KVPEPKPF G R AK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

A0A5D3E078 Retrotrans_gag domain-containing protein2.1e-8957.84Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM
        MS++    K+  D+LVEIEEQ+LYL EVPD++R LE RVDE S+K   IDAV  RV+GLPIQ L  RV+AL+  + A R  +++R ++       +E R+
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALK--SKATRPGSFKREDNPMNPNTQIEVRM

Query:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK
        GEL+N+Q  ++++ N M+EDF+VT+D +R E+A+++ R++ TMRA+ NQAP    +  +K+KVPEPKPF G R AK LENY+FD+EQYFKAT T +EE K
Subjt:  GELNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMK

Query:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA
        VTLATMH + DAKLWWRS+  DIQ G CT++ WD LK+ELR  FFP+NVE +ARRKLR       IR+YVKQ + +MLDIRDMSEKDKVF FVEGLKPWA
Subjt:  VTLATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWA

Query:  RTKLYE
        R KLYE
Subjt:  RTKLYE

A0A6J1D906 Reverse transcriptase4.7e-14286.18Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE
        MS TKQLSKSHVD+LVEIEEQLLYL+EVPD LRLLE RVDEFS+KFGEIDAVNAR+DGLPIQ +A RVE L+SKATRPGSF+R D+  NPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE

Query:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT
        LNNS SAMMQLFNEMTEDFKVTID LRAEM E+STRVN TMRAVGNQAPNQANMGFNKLKVPEPKPFNGNR AKDLEN++FDVEQYFKATGTTSEEMKVT
Subjt:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT

Query:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART
        LATMH T+DAKLWWRSKVNDIQNG CTIN+WDDLKKELRG FFP NVEFMARRKLR      TIRDYVKQ S VM+DIRDMSEKDKVFVF+EGLK WART
Subjt:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART

Query:  KLYE
        KLYE
Subjt:  KLYE

A0A6J1DK29 uncharacterized protein LOC1110218299.1e-13884.54Show/hide
Query:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE
        MSATKQLSKSHVD+LVEIEEQLLYL+EVPD+LRLLE RVDEFS+KFGEIDAVNARVDGLPIQ +A RVE  +SKATRPGSF+R D+  NPNTQIEVRMGE
Subjt:  MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGE

Query:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT
        LNNS S MMQLFNEMTEDFKVTID LRAEM E+STRVN TMRAVGNQAPNQ NMGFNKLKVPEPKPFNGNR  KDLEN+ FDVEQYFK TGT SE MKVT
Subjt:  LNNSQSAMMQLFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVT

Query:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART
        LATMH T+DAKLWWRSKVNDIQNG CTIN+WDDLKKELRG FF  NVEFMARRKLR      TIRDYVKQ S VMLDIRDMSEKDKVFVF+EGLK WART
Subjt:  LATMHHTNDAKLWWRSKVNDIQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLR------TIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWART

Query:  KLYE
        KLYE
Subjt:  KLYE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGGCGACAAAACAATTGAGCAAGTCGCACGTCGACCAACTAGTCGAGATAGAAGAACAACTTCTCTATCTGAAAGAAGTCCCAGATACCCTCCGTCTTCTGGAGAT
GCGAGTTGATGAGTTCTCCGATAAGTTTGGAGAAATAGACGCTGTGAATGCCCGAGTGGACGGGTTACCGATACAAGGTCTAGCTACAAGGGTTGAGGCCCTAAAAAGCA
AAGCTACGCGTCCTGGTAGCTTCAAACGTGAAGACAATCCCATGAACCCAAACACCCAGATAGAAGTACGTATGGGTGAGTTAAACAATTCACAGTCAGCAATGATGCAA
TTGTTTAACGAAATGACGGAGGACTTCAAGGTAACCATAGACAATCTCCGAGCTGAGATGGCTGAATTAAGCACTCGAGTTAACCAGACCATGCGAGCCGTGGGAAATCA
AGCTCCCAACCAAGCGAACATGGGGTTCAACAAGCTCAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGATATGCGAAAGATCTCGAGAACTACATGTTCGATGTAG
AACAATACTTCAAGGCTACTGGGACAACGTCAGAAGAAATGAAAGTGACTTTAGCCACCATGCATCATACTAATGATGCAAAGCTATGGTGGAGGTCTAAAGTCAACGAC
ATTCAGAATGGACTATGCACGATCAACAATTGGGATGATCTTAAGAAGGAATTGAGGGGTTGGTTCTTCCCCAAAAATGTCGAGTTCATGGCTAGAAGGAAGCTACGAAC
AATTCGAGACTATGTGAAACAATGCTCTACTGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAAGTGTTCGTCTTTGTCGAAGGGTTGAAACCGTGGGCCAGAA
CAAAGCTCTATGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGGCGACAAAACAATTGAGCAAGTCGCACGTCGACCAACTAGTCGAGATAGAAGAACAACTTCTCTATCTGAAAGAAGTCCCAGATACCCTCCGTCTTCTGGAGAT
GCGAGTTGATGAGTTCTCCGATAAGTTTGGAGAAATAGACGCTGTGAATGCCCGAGTGGACGGGTTACCGATACAAGGTCTAGCTACAAGGGTTGAGGCCCTAAAAAGCA
AAGCTACGCGTCCTGGTAGCTTCAAACGTGAAGACAATCCCATGAACCCAAACACCCAGATAGAAGTACGTATGGGTGAGTTAAACAATTCACAGTCAGCAATGATGCAA
TTGTTTAACGAAATGACGGAGGACTTCAAGGTAACCATAGACAATCTCCGAGCTGAGATGGCTGAATTAAGCACTCGAGTTAACCAGACCATGCGAGCCGTGGGAAATCA
AGCTCCCAACCAAGCGAACATGGGGTTCAACAAGCTCAAGGTCCCAGAGCCCAAACCATTTAATGGCAATAGATATGCGAAAGATCTCGAGAACTACATGTTCGATGTAG
AACAATACTTCAAGGCTACTGGGACAACGTCAGAAGAAATGAAAGTGACTTTAGCCACCATGCATCATACTAATGATGCAAAGCTATGGTGGAGGTCTAAAGTCAACGAC
ATTCAGAATGGACTATGCACGATCAACAATTGGGATGATCTTAAGAAGGAATTGAGGGGTTGGTTCTTCCCCAAAAATGTCGAGTTCATGGCTAGAAGGAAGCTACGAAC
AATTCGAGACTATGTGAAACAATGCTCTACTGTGATGCTGGATATTCGCGACATGTCAGAGAAAGACAAAGTGTTCGTCTTTGTCGAAGGGTTGAAACCGTGGGCCAGAA
CAAAGCTCTATGAATAG
Protein sequenceShow/hide protein sequence
MSATKQLSKSHVDQLVEIEEQLLYLKEVPDTLRLLEMRVDEFSDKFGEIDAVNARVDGLPIQGLATRVEALKSKATRPGSFKREDNPMNPNTQIEVRMGELNNSQSAMMQ
LFNEMTEDFKVTIDNLRAEMAELSTRVNQTMRAVGNQAPNQANMGFNKLKVPEPKPFNGNRYAKDLENYMFDVEQYFKATGTTSEEMKVTLATMHHTNDAKLWWRSKVND
IQNGLCTINNWDDLKKELRGWFFPKNVEFMARRKLRTIRDYVKQCSTVMLDIRDMSEKDKVFVFVEGLKPWARTKLYE