; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g17800 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g17800
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr3:11813714..11815913
RNA-Seq ExpressionMoc03g17800
SyntenyMoc03g17800
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131660.1 uncharacterized protein LOC111004785 [Momordica charantia]1.6e-13084.43Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQDEEWIRELEALYVY
        MAFRRNTRA NYE PNPRGEEA DPNVP  VP GVAPP PQAAPQGV    PQV LLAE LQVLL+NANGAGGAQ QQPRRAQI Q+E  ++ELEALYVY
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQDEEWIRELEALYVY

Query:  LGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQYIPTKQLKID
        LGCSD+FKVRGAVFML G+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTEL RF MQYIPT+QLKID
Subjt:  LGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQYIPTKQLKID

Query:  KFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVG
        KF+DGLRREIKGLLVLKEPTTYAAAVRC LVM+KCLEEP +QQV GSS GVK+KFASFSS+Q SRGHQ  +QRQTA PVCP+CK+SH G
Subjt:  KFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVG

XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]7.8e-13077.6Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGVPQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-----------------
        MAFRRNT+AHNYE PN RGE A D NVPPVVPG               +VVLLAEALQVLLDNANGAGGAQVQQP R QI Q+                 
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGVPQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-----------------

Query:  ---------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYE
                 EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL VAQYE
Subjt:  ---------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYE

Query:  RKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPP
        RKFTEL RFGMQYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GSS GVK+KFASFSSSQPSRGHQ++ QRQT PP
Subjt:  RKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPP

Query:  VCPSCKKSHVGPCWLGK
         CPSCKK+H GPCW+GK
Subjt:  VCPSCKKSHVGPCWLGK

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]6.3e-14081Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------
        MAFRRNTRAHNYE PN RGE A DPNV PVVPGGV PP PQAAPQGV    PQV LLAEALQVLL NANGAGGAQVQQPRRAQIPQD             
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------

Query:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
                     EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ
        AQYERKFTEL RFG QY+PT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GS+ GVK+KFASFS+SQ SRGHQ+H QRQ
Subjt:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ

Query:  TAPPVCPSCKKSHVGPCWLGK
        TAPPVCPSCKK+H  PCWLGK
Subjt:  TAPPVCPSCKKSHVGPCWLGK

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]6.1e-14382.24Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------
        MAFRRNTRAHNY+ PNPRGE A DPNVP +VPG VAPP PQAAPQGV    PQV LLAEALQVLLDNANGAGGAQVQQPRRAQIPQD             
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------

Query:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
                     EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ
        AQYERKFTEL RFGMQYIPT+QLKIDKFIDGLR EIKGLLV+KEPTTYAAA+RCALVM+KCLEEPQ+QQVMGSS GVK+KFA FSSSQ SRGHQ+H+QRQ
Subjt:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ

Query:  TAPPVCPSCKKSHVGPCWLGK
        TAPPVCPSCKK+H GPCWLGK
Subjt:  TAPPVCPSCKKSHVGPCWLGK

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]3.5e-14687.83Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD---------EEWI
        MAFRRNTRAHNYE PNPRGE A DPNVPP VPGGVAPP PQAA QGV    PQV LLAEALQVLLDNANGAGGAQVQQPR AQIPQ+         EEW+
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD---------EEWI

Query:  RELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQY
        RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTEL RFGMQY
Subjt:  RELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQY

Query:  IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVGPC
        IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GSS GVK+KFASFSSSQPSR HQ+H+QRQTAPPVCPSCKKSH GPC
Subjt:  IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVGPC

Query:  WLGK
        W+GK
Subjt:  WLGK

TrEMBL top hitse value%identityAlignment
A0A6J1BQB2 uncharacterized protein LOC1110047857.6e-13184.43Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQDEEWIRELEALYVY
        MAFRRNTRA NYE PNPRGEEA DPNVP  VP GVAPP PQAAPQGV    PQV LLAE LQVLL+NANGAGGAQ QQPRRAQI Q+E  ++ELEALYVY
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQDEEWIRELEALYVY

Query:  LGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQYIPTKQLKID
        LGCSD+FKVRGAVFML G+AVNWWESVAAAEDHAN PVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTEL RF MQYIPT+QLKID
Subjt:  LGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQYIPTKQLKID

Query:  KFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVG
        KF+DGLRREIKGLLVLKEPTTYAAAVRC LVM+KCLEEP +QQV GSS GVK+KFASFSS+Q SRGHQ  +QRQTA PVCP+CK+SH G
Subjt:  KFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVG

A0A6J1DNV8 uncharacterized protein LOC1110229253.8e-13077.6Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGVPQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-----------------
        MAFRRNT+AHNYE PN RGE A D NVPPVVPG               +VVLLAEALQVLLDNANGAGGAQVQQP R QI Q+                 
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGVPQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-----------------

Query:  ---------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYE
                 EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SL VAQYE
Subjt:  ---------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYE

Query:  RKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPP
        RKFTEL RFGMQYIPT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GSS GVK+KFASFSSSQPSRGHQ++ QRQT PP
Subjt:  RKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPP

Query:  VCPSCKKSHVGPCWLGK
         CPSCKK+H GPCW+GK
Subjt:  VCPSCKKSHVGPCWLGK

A0A6J1DQB9 Reverse transcriptase3.1e-14081Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------
        MAFRRNTRAHNYE PN RGE A DPNV PVVPGGV PP PQAAPQGV    PQV LLAEALQVLL NANGAGGAQVQQPRRAQIPQD             
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------

Query:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
                     EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQGSLTV
Subjt:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ
        AQYERKFTEL RFG QY+PT+QLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GS+ GVK+KFASFS+SQ SRGHQ+H QRQ
Subjt:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ

Query:  TAPPVCPSCKKSHVGPCWLGK
        TAPPVCPSCKK+H  PCWLGK
Subjt:  TAPPVCPSCKKSHVGPCWLGK

A0A6J1DTA8 uncharacterized protein LOC1110241143.0e-14382.24Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------
        MAFRRNTRAHNY+ PNPRGE A DPNVP +VPG VAPP PQAAPQGV    PQV LLAEALQVLLDNANGAGGAQVQQPRRAQIPQD             
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD-------------

Query:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
                     EEW+RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDH NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV
Subjt:  -------------EEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTV

Query:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ
        AQYERKFTEL RFGMQYIPT+QLKIDKFIDGLR EIKGLLV+KEPTTYAAA+RCALVM+KCLEEPQ+QQVMGSS GVK+KFA FSSSQ SRGHQ+H+QRQ
Subjt:  AQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQ

Query:  TAPPVCPSCKKSHVGPCWLGK
        TAPPVCPSCKK+H GPCWLGK
Subjt:  TAPPVCPSCKKSHVGPCWLGK

A0A6J1DWP4 uncharacterized protein LOC1110252151.7e-14687.83Show/hide
Query:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD---------EEWI
        MAFRRNTRAHNYE PNPRGE A DPNVPP VPGGVAPP PQAA QGV    PQV LLAEALQVLLDNANGAGGAQVQQPR AQIPQ+         EEW+
Subjt:  MAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAPQGV----PQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQD---------EEWI

Query:  RELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQY
        RELEALYVYLGCSDDFKVRGAVFML GEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQGSLTVA+YERKFTEL RFGMQY
Subjt:  RELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQGSLTVAQYERKFTELFRFGMQY

Query:  IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVGPC
        IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVM+KCLEEPQ+QQV+GSS GVK+KFASFSSSQPSR HQ+H+QRQTAPPVCPSCKKSH GPC
Subjt:  IPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQNHMQRQTAPPVCPSCKKSHVGPC

Query:  WLGK
        W+GK
Subjt:  WLGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGAGTGTCAGGGCCTCGTGTATAAATGGTCGGGGGCTGATACGTCACTAATGGAGTATCGAGGCCTCGGGTATAAATGGTCGAGGGTCGATGTGAGGAGTCTTTT
GGAAGGAAAACTATTGGGGCCTTGGACTGTCCTAGAATCTAGGTCGATAGTGTATGGTCCTTGTCAGACTCCTCGTCATCACCAGACAATGGCTTTCCGACGGAACACGA
GAGCTCACAACTATGAGGGTCCGAATCCTAGGGGTGAGGAAGCAGTGGATCCAAATGTTCCCCCGGTAGTTCCTGGAGGGGTAGCACCCCCGGCCCCTCAAGCAGCTCCT
CAGGGAGTTCCCCAGGTGGTGTTGCTAGCTGAGGCATTGCAAGTACTGCTGGATAATGCAAATGGGGCTGGTGGAGCTCAAGTGCAGCAGCCTCGCCGGGCACAAATTCC
ACAAGACGAGGAATGGATTAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTTCAAGGTTCGGGGAGCAGTTTTTATGCTTCTGGGAGAAGCAGTAA
ATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGGAAT
GAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTTTCGTTTTGGAATGCAATATATTCCTACTAA
GCAATTAAAGATTGACAAGTTCATTGATGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACATACGCAGCGGCAGTCAGGTGTGCGTTGGTTA
TGAACAAATGTCTCGAGGAGCCTCAGACTCAGCAGGTAATGGGCTCCAGCTTGGGGGTCAAGAAGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCAAGAGGACACCAG
AACCATATGCAAAGGCAGACTGCTCCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGTTGGGCCATGTTGGTTGGGAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTGAGTGTCAGGGCCTCGTGTATAAATGGTCGGGGGCTGATACGTCACTAATGGAGTATCGAGGCCTCGGGTATAAATGGTCGAGGGTCGATGTGAGGAGTCTTTT
GGAAGGAAAACTATTGGGGCCTTGGACTGTCCTAGAATCTAGGTCGATAGTGTATGGTCCTTGTCAGACTCCTCGTCATCACCAGACAATGGCTTTCCGACGGAACACGA
GAGCTCACAACTATGAGGGTCCGAATCCTAGGGGTGAGGAAGCAGTGGATCCAAATGTTCCCCCGGTAGTTCCTGGAGGGGTAGCACCCCCGGCCCCTCAAGCAGCTCCT
CAGGGAGTTCCCCAGGTGGTGTTGCTAGCTGAGGCATTGCAAGTACTGCTGGATAATGCAAATGGGGCTGGTGGAGCTCAAGTGCAGCAGCCTCGCCGGGCACAAATTCC
ACAAGACGAGGAATGGATTAGGGAGTTGGAAGCCCTTTATGTGTATTTGGGATGCTCCGACGATTTCAAGGTTCGGGGAGCAGTTTTTATGCTTCTGGGAGAAGCAGTAA
ATTGGTGGGAGTCGGTGGCGGCAGCGGAGGATCACGCCAACGTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGGAAT
GAAAAACGGGCAGAGTTTCTCCGTCTCACTCAAGGGAGCCTAACTGTGGCCCAATACGAGAGGAAGTTCACTGAGCTGTTTCGTTTTGGAATGCAATATATTCCTACTAA
GCAATTAAAGATTGACAAGTTCATTGATGGTTTGCGTAGGGAGATCAAGGGGCTACTTGTTCTCAAGGAACCAACTACATACGCAGCGGCAGTCAGGTGTGCGTTGGTTA
TGAACAAATGTCTCGAGGAGCCTCAGACTCAGCAGGTAATGGGCTCCAGCTTGGGGGTCAAGAAGAAATTTGCATCGTTCTCCTCCAGTCAACCCTCAAGAGGACACCAG
AACCATATGCAAAGGCAGACTGCTCCTCCGGTGTGCCCCTCTTGTAAGAAGAGCCATGTTGGGCCATGTTGGTTGGGAAAATGA
Protein sequenceShow/hide protein sequence
MGECQGLVYKWSGADTSLMEYRGLGYKWSRVDVRSLLEGKLLGPWTVLESRSIVYGPCQTPRHHQTMAFRRNTRAHNYEGPNPRGEEAVDPNVPPVVPGGVAPPAPQAAP
QGVPQVVLLAEALQVLLDNANGAGGAQVQQPRRAQIPQDEEWIRELEALYVYLGCSDDFKVRGAVFMLLGEAVNWWESVAAAEDHANVPVTWARFKDLLYEYYFPVTVRN
EKRAEFLRLTQGSLTVAQYERKFTELFRFGMQYIPTKQLKIDKFIDGLRREIKGLLVLKEPTTYAAAVRCALVMNKCLEEPQTQQVMGSSLGVKKKFASFSSSQPSRGHQ
NHMQRQTAPPVCPSCKKSHVGPCWLGK