; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g17060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g17060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionReverse transcriptase
Genome locationchr6:13383072..13386031
RNA-Seq ExpressionMoc06g17060
SyntenyMoc06g17060
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022155925.1 uncharacterized protein LOC111022925 [Momordica charantia]9.0e-13380.72Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP-----VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE
        MAFRRNT+AHNYEDPN  GEGAAD NVPP     VVLLAEALQVLLDNAN AGGAQVQQP R QI Q+EVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP-----VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE

Query:  LEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL---------
        LEALYVYLGCSD+FKV GAVFML+GEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESL VA Y+RK TEL         
Subjt:  LEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL---------

Query:  ---------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPCWL
                       EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF SSQ SRGHQH+ QRQTT P CPSCKKNHAGPCW+
Subjt:  ---------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPCWL

Query:  GKRICW
        GKRIC+
Subjt:  GKRICW

XP_022156328.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111023249 [Momordica charantia]4.2e-13076Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN  GE AADPNV PVV                        LLAEALQVLL NAN AGGAQVQQPRRAQIPQDEVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF +SQSSRGHQHH QRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKKNHA PCWLGK+IC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

XP_022157398.1 uncharacterized protein LOC111024105 [Momordica charantia]1.1e-11170.13Show/hide
Query:  QTMAFRRNTRAHNYEDPNPNGEGAADPNVPPVVLLAEALQVLLDNANRAGGAQVQQPRRA-----QIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
        +TMAFRRNTRAHNYEDPNP GE AADPNV   V                GG     P+ A     QI Q+EVQFIRDFKRFGPPVFNGVSERPT AEEWV
Subjt:  QTMAFRRNTRAHNYEDPNPNGEGAADPNVPPVVLLAEALQVLLDNANRAGGAQVQQPRRA-----QIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWV

Query:  RELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL-------
        RELEALYVYLGCSD FKV GAVFMLR EAVNWWE VAAAE+HAN P+TWARFK+LLYEYYFPVT+RNEKRAEFLRLTQ SLTVA ++RK T+L       
Subjt:  RELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL-------

Query:  -----------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPC
                         EIKGL+VLKEP TYA AVRC LV+DKCLEEPQSQQV+GS SGVKRKFASF SSQ SRGHQ  VQRQT  PVCPSCKK+HAGPC
Subjt:  -----------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPC

Query:  WLGKRICW
        W GKRIC+
Subjt:  WLGKRICW

XP_022157413.1 uncharacterized protein LOC111024114 [Momordica charantia]2.1e-13476.92Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNP GEGAADPNVP +V                        LLAEALQVLLDNAN AGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+H NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+V+KEP TYA A+RC LVMDKCLEEPQSQQV+GS SGVKRKFA F SSQSSRGHQHHVQRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKKNHAGPCWLGKRIC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

XP_022158750.1 uncharacterized protein LOC111025215 [Momordica charantia]2.5e-11972Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP------------------------VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNP GEGAADPNVPP                        V LLAEALQVLLDNAN AGGAQVQQPR AQIPQ+E            
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP------------------------VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF SSQ SR HQHHVQRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKK+HAGPCW+GKRIC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

TrEMBL top hitse value%identityAlignment
A0A6J1DNV8 uncharacterized protein LOC1110229254.3e-13380.72Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP-----VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE
        MAFRRNT+AHNYEDPN  GEGAAD NVPP     VVLLAEALQVLLDNAN AGGAQVQQP R QI Q+EVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP-----VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRE

Query:  LEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL---------
        LEALYVYLGCSD+FKV GAVFML+GEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESL VA Y+RK TEL         
Subjt:  LEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL---------

Query:  ---------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPCWL
                       EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF SSQ SRGHQH+ QRQTT P CPSCKKNHAGPCW+
Subjt:  ---------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPCWL

Query:  GKRICW
        GKRIC+
Subjt:  GKRICW

A0A6J1DQB9 Reverse transcriptase2.0e-13076Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPN  GE AADPNV PVV                        LLAEALQVLL NAN AGGAQVQQPRRAQIPQDEVQFIRDFK FGP
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
        PVFNGVSERPTAAEEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPV  RNEKR EFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF +SQSSRGHQHH QRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKKNHA PCWLGK+IC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

A0A6J1DT93 uncharacterized protein LOC1110241055.5e-11270.13Show/hide
Query:  QTMAFRRNTRAHNYEDPNPNGEGAADPNVPPVVLLAEALQVLLDNANRAGGAQVQQPRRA-----QIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWV
        +TMAFRRNTRAHNYEDPNP GE AADPNV   V                GG     P+ A     QI Q+EVQFIRDFKRFGPPVFNGVSERPT AEEWV
Subjt:  QTMAFRRNTRAHNYEDPNPNGEGAADPNVPPVVLLAEALQVLLDNANRAGGAQVQQPRRA-----QIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWV

Query:  RELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL-------
        RELEALYVYLGCSD FKV GAVFMLR EAVNWWE VAAAE+HAN P+TWARFK+LLYEYYFPVT+RNEKRAEFLRLTQ SLTVA ++RK T+L       
Subjt:  RELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTEL-------

Query:  -----------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPC
                         EIKGL+VLKEP TYA AVRC LV+DKCLEEPQSQQV+GS SGVKRKFASF SSQ SRGHQ  VQRQT  PVCPSCKK+HAGPC
Subjt:  -----------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQTTQPVCPSCKKNHAGPC

Query:  WLGKRICW
        W GKRIC+
Subjt:  WLGKRICW

A0A6J1DTA8 uncharacterized protein LOC1110241141.0e-13476.92Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNY+DPNP GEGAADPNVP +V                        LLAEALQVLLDNAN AGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPPVV------------------------LLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
        PVFNGVSERPTA EEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+H NVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+V+KEP TYA A+RC LVMDKCLEEPQSQQV+GS SGVKRKFA F SSQSSRGHQHHVQRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKKNHAGPCWLGKRIC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

A0A6J1DWP4 uncharacterized protein LOC1110252151.2e-11972Show/hide
Query:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP------------------------VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP
        MAFRRNTRAHNYEDPNP GEGAADPNVPP                        V LLAEALQVLLDNAN AGGAQVQQPR AQIPQ+E            
Subjt:  MAFRRNTRAHNYEDPNPNGEGAADPNVPP------------------------VVLLAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGP

Query:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV
             VSERPTAAEEWVRELEALYVYLGCSD+FKV GAVFMLRGEAVNWWESVAAAE+HANVPVTWARFKDLLYEYYFPVTVRNEKR EFLRLTQ SLTV
Subjt:  PVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHANVPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTV

Query:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ
        A Y+RK TEL                        EIKGL+VLKEP TYA AVRC LVMDKCLEEPQSQQVIGS SGVKRKFASF SSQ SR HQHHVQRQ
Subjt:  AHYKRKLTEL------------------------EIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSSRGHQHHVQRQ

Query:  TTQPVCPSCKKNHAGPCWLGKRICW
        T  PVCPSCKK+HAGPCW+GKRIC+
Subjt:  TTQPVCPSCKKNHAGPCWLGKRICW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGGGTCGATGCGAGGAGTATTTTGGAAGGAAAACTATTGGGGCCTTGGGTAAAAATGGTCAAGGGCCGATAGATGGTGAAGACATCGGGGCCTTGGGT
AAAAAATGGTCGAGGGTCGATGCAATAGTGTCGGGGCTCTGGGCAAAAATGACTGTCCTAGAATCTAGGTCGATATTGTATGGTCCTTGTCAGTCTCCTCGTCAT
CACCAGACAATGGCTTTCCGGCGGAACACGAGAGCTCACAACTATGAGGATCCGAATCCTAACGGTGAGGGAGCAGCGGACCCAAATGTTCCTCCGGTGGTGTTG
CTAGCTGAGGCATTGCAAGTGCTACTGGATAATGCGAATAGAGCCGGTGGAGCTCAAGTGCAGCAGCCTCGCCGGGCACAAATTCCGCAGGATGAGGTTCAGTTT
ATCAGGGATTTCAAACGCTTTGGGCCACCCGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTTAGGGAGTTGGAAGCCCTTTATGTGTAT
TTGGGATGCTCCGACAATTTCAAGGTCTGGGGAGCAGTTTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCAGAGGAGCACGCCAAC
GTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGAAATGAAAAACGGGCAGAGTTCCTCCGTCTCACTCAAGAG
AGCCTGACCGTGGCCCATTACAAAAGGAAGCTCACTGAGCTGGAGATCAAAGGGCTTATTGTTCTCAAGGAACCAGCTACATACGCAGTGGCAGTCAGGTGTCCG
TTGGTTATGGACAAATGTCTTGAGGAGCCTCAGTCTCAGCAGGTAATAGGCTCCATCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTTCTCCAGTCAATCCTCA
AGAGGACACCAGCACCATGTGCAAAGACAGACTACTCAACCGGTGTGCCCCTCTTGTAAGAAGAACCATGCTGGGCCATGTTGGTTGGGAAAAAGAATATGCTGG
ACCGAGTTCGAGCTTGACCAAGCCAACCGACGAGGTCTGAATGGCAATAACAGCCTCCAGCCCAAGGTCGAGGCTAAGGTGGTGGAGGACCAGGCCGAGGAGGTT
AAGAAGTTGATCCACCTCGAAGGTCGGTCTGCAATGCGAATCCGACCTTACCTCCTACTCAACCGACCCCTGCCAAGACAAACCGAGGTCGAGGTGGAGTCTTCA
AGAAAACCACACGAAGGACTGTACCAGCCACAGATCCCGATACTCTCACTACTCTCCAGTGAGAGCTTGATGACATGCACAATCGGGATCCTAGGCGAAGCTGAT
GACTTGCATTATGACGGTGATGATCAACATCGACCTCCTCCTCTTGACGACGACGCAAGCGTCGCCGAGAATGGAAGAGTTGATTACAGCCACCAAGAGGACGAT
CTAAGAAAAGTCCTCGCTGAAAAGAAGAAGAAGAATTCCACCCTACTAGGCGCGGATGTCTCTCCTTCGTACTCGCAGAAGAACTCAAACTTCAAAGTCCAATCT
CGGTACAACCCCTTGACGTTAGAAAGGATGATTACAAGGGAAGAGTTCGATCTGATGAGAAATAAGTTTGACGGACAAGTTGAAGCCCTTAAGACGAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGGGTCGATGCGAGGAGTATTTTGGAAGGAAAACTATTGGGGCCTTGGGTAAAAATGGTCAAGGGCCGATAGATGGTGAAGACATCGGGGCCTTGGGT
AAAAAATGGTCGAGGGTCGATGCAATAGTGTCGGGGCTCTGGGCAAAAATGACTGTCCTAGAATCTAGGTCGATATTGTATGGTCCTTGTCAGTCTCCTCGTCAT
CACCAGACAATGGCTTTCCGGCGGAACACGAGAGCTCACAACTATGAGGATCCGAATCCTAACGGTGAGGGAGCAGCGGACCCAAATGTTCCTCCGGTGGTGTTG
CTAGCTGAGGCATTGCAAGTGCTACTGGATAATGCGAATAGAGCCGGTGGAGCTCAAGTGCAGCAGCCTCGCCGGGCACAAATTCCGCAGGATGAGGTTCAGTTT
ATCAGGGATTTCAAACGCTTTGGGCCACCCGTTTTCAACGGGGTGAGTGAGAGGCCTACTGCGGCCGAGGAATGGGTTAGGGAGTTGGAAGCCCTTTATGTGTAT
TTGGGATGCTCCGACAATTTCAAGGTCTGGGGAGCAGTTTTTATGCTTCGGGGAGAAGCAGTAAATTGGTGGGAGTCGGTGGCGGCAGCAGAGGAGCACGCCAAC
GTACCCGTCACGTGGGCAAGATTTAAGGACCTACTTTATGAGTACTATTTCCCCGTGACTGTCAGAAATGAAAAACGGGCAGAGTTCCTCCGTCTCACTCAAGAG
AGCCTGACCGTGGCCCATTACAAAAGGAAGCTCACTGAGCTGGAGATCAAAGGGCTTATTGTTCTCAAGGAACCAGCTACATACGCAGTGGCAGTCAGGTGTCCG
TTGGTTATGGACAAATGTCTTGAGGAGCCTCAGTCTCAGCAGGTAATAGGCTCCATCTCGGGGGTCAAGAGGAAATTTGCATCGTTCTTCTCCAGTCAATCCTCA
AGAGGACACCAGCACCATGTGCAAAGACAGACTACTCAACCGGTGTGCCCCTCTTGTAAGAAGAACCATGCTGGGCCATGTTGGTTGGGAAAAAGAATATGCTGG
ACCGAGTTCGAGCTTGACCAAGCCAACCGACGAGGTCTGAATGGCAATAACAGCCTCCAGCCCAAGGTCGAGGCTAAGGTGGTGGAGGACCAGGCCGAGGAGGTT
AAGAAGTTGATCCACCTCGAAGGTCGGTCTGCAATGCGAATCCGACCTTACCTCCTACTCAACCGACCCCTGCCAAGACAAACCGAGGTCGAGGTGGAGTCTTCA
AGAAAACCACACGAAGGACTGTACCAGCCACAGATCCCGATACTCTCACTACTCTCCAGTGAGAGCTTGATGACATGCACAATCGGGATCCTAGGCGAAGCTGAT
GACTTGCATTATGACGGTGATGATCAACATCGACCTCCTCCTCTTGACGACGACGCAAGCGTCGCCGAGAATGGAAGAGTTGATTACAGCCACCAAGAGGACGAT
CTAAGAAAAGTCCTCGCTGAAAAGAAGAAGAAGAATTCCACCCTACTAGGCGCGGATGTCTCTCCTTCGTACTCGCAGAAGAACTCAAACTTCAAAGTCCAATCT
CGGTACAACCCCTTGACGTTAGAAAGGATGATTACAAGGGAAGAGTTCGATCTGATGAGAAATAAGTTTGACGGACAAGTTGAAGCCCTTAAGACGAAGTAG
Protein sequenceShow/hide protein sequence
MVEGRCEEYFGRKTIGALGKNGQGPIDGEDIGALGKKWSRVDAIVSGLWAKMTVLESRSILYGPCQSPRHHQTMAFRRNTRAHNYEDPNPNGEGAADPNVPPVVL
LAEALQVLLDNANRAGGAQVQQPRRAQIPQDEVQFIRDFKRFGPPVFNGVSERPTAAEEWVRELEALYVYLGCSDNFKVWGAVFMLRGEAVNWWESVAAAEEHAN
VPVTWARFKDLLYEYYFPVTVRNEKRAEFLRLTQESLTVAHYKRKLTELEIKGLIVLKEPATYAVAVRCPLVMDKCLEEPQSQQVIGSISGVKRKFASFFSSQSS
RGHQHHVQRQTTQPVCPSCKKNHAGPCWLGKRICWTEFELDQANRRGLNGNNSLQPKVEAKVVEDQAEEVKKLIHLEGRSAMRIRPYLLLNRPLPRQTEVEVESS
RKPHEGLYQPQIPILSLLSSESLMTCTIGILGEADDLHYDGDDQHRPPPLDDDASVAENGRVDYSHQEDDLRKVLAEKKKKNSTLLGADVSPSYSQKNSNFKVQS
RYNPLTLERMITREEFDLMRNKFDGQVEALKTK