CuGenDBv2

Moc02g19630 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19630
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Reverse transcriptase
Genome location: chr2:14578650..14581292
RNA-Seq Expression: Moc02g19630
Synteny: Moc02g19630
Gene Ontology terms: NA
InterPro domains: NA
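
The genome location field can be split into its parts when the record is processed by a script. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr2:14578650..14581292")
print(chrom, start, end, span)
```

Under that assumption the gene spans more bases than the CDS listed under Sequences contains, which would be consistent with the gene model including non-coding sequence.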


Homology
GenBank top hits | e value | % identity
XP_022143608.1 uncharacterized protein LOC111013464 [Momordica charantia] | 3.2e-113 | 97.57
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPLL
        VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPL+
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPLL

Query:  ANELKN
         N   N
Subjt:  ANELKN

XP_022150317.1 uncharacterized protein LOC111018514 [Momordica charantia] | 9.6e-94 | 61.79
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQR + APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P QP+Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPSQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YV HG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PP  QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPSQQQY

Query:  NQRTKTP------------------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTM
        NQ  +TP                                                  AN+LK +PQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+ P M
Subjt:  NQRTKTP------------------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTM

Query:  P
        P
Subjt:  P

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia] | 6.6e-119 | 69.94
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQP
        IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA  IQP
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQP

Query:  VQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPS
        VQ DYCT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPP 
Subjt:  VQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPS

Query:  QQQYNQRTKTP---------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDV
        QQQYNQRT+TP                                        LANELKN+PQGSFPGHTELP+REGKEQCKAVTLRSGL YD PTMPTTDV
Subjt:  QQQYNQRTKTP---------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDV

Query:  QIPFAEPIVKIPENPTTPGKENIRKRIEDTLSVPPQ
        QIP  +P VKIPENPTTP KENIRK  +DT SVPPQ
Subjt:  QIPFAEPIVKIPENPTTPGKENIRKRIEDTLSVPPQ

XP_022159235.1 uncharacterized protein LOC111025653 [Momordica charantia] | 1.1e-76 | 39.09
Query:  LNIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPI
        + IEHFFRG D  TKMMLN AANG FT K+FNEIV+IL+ L+ HN  WCS++ R   K+ DPAGVLALD  TSMQK++ T+ Q LK M     N  A   
Subjt:  LNIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPI

Query:  QPVQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQ
                 P+PV Q+ +  C +C + H  +NCP NP+S++YV   N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P   
Subjt:  QPVQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQ

Query:  YIPPSQQQYNQR---------------------------------TKTPL----------------------------LANELKNKPQGSFPGHTELPKR
          PP+  QYNQ+                                 T+T +                            L NE++ +PQGS P  TE P+R
Subjt:  YIPPSQQQYNQR---------------------------------TKTPL----------------------------LANELKNKPQGSFPGHTELPKR

Query:  EGKEQCKAVTLRSGLAYDRPTMP------------TTDVQIPFAEPIVKIP-------ENPTTPGKENIRKRIEDT-------------LSVP-----PQ
         GKE C ++  RSGL Y+ P MP            T  V     EP V +P         P  P  + + ++ +D              +++P      Q
Subjt:  EGKEQCKAVTLRSGLAYDRPTMP------------TTDVQIPFAEPIVKIP-------ENPTTPGKENIRKRIEDT-------------LSVP-----PQ

Query:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGRPLPMKCNHPGSFTIPCSIGGKNLG-DFEECSAITNLNPV-MFDEF
        MP YAKF+KDI++RKKK+GE+E VA+T+CSS      +P K   PGSFTIPC IGGK++G    +  A  NL P+ +F +F
Subjt:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGRPLPMKCNHPGSFTIPCSIGGKNLG-DFEECSAITNLNPV-MFDEF

XP_022159345.1 uncharacterized protein LOC111025764 [Momordica charantia] | 4.4e-139 | 79.01
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTP--
        VNDLICSFCSENHIYD CPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPP QQ+YNQRT+TP  
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTP--

Query:  -------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDVQIPFAEPIVKIPE
                                              LANELKN+PQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIP   P VKIPE
Subjt:  -------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDVQIPFAEPIVKIPE

Query:  NPTTPGKENIRKRIEDTLSVPPQM
        NPTTP K NIRK  EDT SVPPQ+
Subjt:  NPTTPGKENIRKRIEDTLSVPPQM

TrEMBL top hits | e value | % identity
A0A6J1CR45 uncharacterized protein LOC111013464 | 1.5e-113 | 97.57
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPLL
        VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPL+
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPLL

Query:  ANELKN
         N   N
Subjt:  ANELKN

A0A6J1DAE9 uncharacterized protein LOC111018514 | 4.6e-94 | 61.79
Query:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCT
        LDHPTKMMLNNAANGAFTKKTFNEIVDIL DLASHNELWCSQR + APKKQD AGVLALDIATSMQKEM TMNQ LKE+AL  K   + P  P QP+Y  
Subjt:  LDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCT

Query:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPSQQQY
         +PVCQ+N+++CS+CS+NH+Y+NCPHNPAS +YV HG NR FNPYSNTYNPG RHHPNFSWGGQG SSG  QGQ+QQ KQPYVP T    Q +PP  QQY
Subjt:  PAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPT---QQYIPPSQQQY

Query:  NQRTKTP------------------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTM
        NQ  +TP                                                  AN+LK +PQGSFPGHTE+ KR+G EQCKAVTLRSGL+Y+ P M
Subjt:  NQRTKTP------------------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTM

Query:  P
        P
Subjt:  P

A0A6J1DW02 uncharacterized protein LOC111024897 | 3.2e-119 | 69.94
Query:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQP
        IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLA  IQP
Subjt:  IEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQP

Query:  VQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPS
        VQ DYCT APVCQVNDLIC                                        WRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQ+IPP 
Subjt:  VQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPS

Query:  QQQYNQRTKTP---------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDV
        QQQYNQRT+TP                                        LANELKN+PQGSFPGHTELP+REGKEQCKAVTLRSGL YD PTMPTTDV
Subjt:  QQQYNQRTKTP---------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDV

Query:  QIPFAEPIVKIPENPTTPGKENIRKRIEDTLSVPPQ
        QIP  +P VKIPENPTTP KENIRK  +DT SVPPQ
Subjt:  QIPFAEPIVKIPENPTTPGKENIRKRIEDTLSVPPQ

A0A6J1DY39 uncharacterized protein LOC111025653 | 5.1e-77 | 39.09
Query:  LNIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPI
        + IEHFFRG D  TKMMLN AANG FT K+FNEIV+IL+ L+ HN  WCS++ R   K+ DPAGVLALD  TSMQK++ T+ Q LK M     N  A   
Subjt:  LNIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPI

Query:  QPVQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQ
                 P+PV Q+ +  C +C + H  +NCP NP+S++YV   N + FNPYSNTYNPGW+ HPNFSW GQG S+    G +QQ K+ Y P   P   
Subjt:  QPVQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVP---PTQQ

Query:  YIPPSQQQYNQR---------------------------------TKTPL----------------------------LANELKNKPQGSFPGHTELPKR
          PP+  QYNQ+                                 T+T +                            L NE++ +PQGS P  TE P+R
Subjt:  YIPPSQQQYNQR---------------------------------TKTPL----------------------------LANELKNKPQGSFPGHTELPKR

Query:  EGKEQCKAVTLRSGLAYDRPTMP------------TTDVQIPFAEPIVKIP-------ENPTTPGKENIRKRIEDT-------------LSVP-----PQ
         GKE C ++  RSGL Y+ P MP            T  V     EP V +P         P  P  + + ++ +D              +++P      Q
Subjt:  EGKEQCKAVTLRSGLAYDRPTMP------------TTDVQIPFAEPIVKIP-------ENPTTPGKENIRKRIEDT-------------LSVP-----PQ

Query:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGRPLPMKCNHPGSFTIPCSIGGKNLG-DFEECSAITNLNPV-MFDEF
        MP YAKF+KDI++RKKK+GE+E VA+T+CSS      +P K   PGSFTIPC IGGK++G    +  A  NL P+ +F +F
Subjt:  MPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGRPLPMKCNHPGSFTIPCSIGGKNLG-DFEECSAITNLNPV-MFDEF

A0A6J1DYG0 uncharacterized protein LOC111025764 | 2.1e-139 | 79.01
Query:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ
        MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQR RAAPKKQDPAGVLALDIA+SMQKE VTMNQRLKEM LG+KNPLA PIQPVQ DYCTPAPVCQ
Subjt:  MMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLALDIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQ

Query:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTP--
        VNDLICSFCSENHIYD CPHNPASVFYV HGNNRNFNPYSNTYNPGWRHHPNFSW GQGGS GFNQGQSQQNKQPYVPPTQQYIPP QQ+YNQRT+TP  
Subjt:  VNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSGFNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTP--

Query:  -------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDVQIPFAEPIVKIPE
                                              LANELKN+PQGSFP HTELPKREGKEQCKAVTLRSGLAYD PTMPT DVQIP   P VKIPE
Subjt:  -------------------------------------LLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDVQIPFAEPIVKIPE

Query:  NPTTPGKENIRKRIEDTLSVPPQM
        NPTTP K NIRK  EDT SVPPQ+
Subjt:  NPTTPGKENIRKRIEDTLSVPPQM

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGGGTATGGCTCCGGTCACCACTGCACTCATTGGAGAAAAAGAGATGAGGGAGGGTGATTTCCCTTGTTTACAATGGAAAAATTCGAGAGAGGTATCTGGTGCCTTGCC
CAGTTCTGAGAATGCCCTTAATATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCA
ACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTGTGGTGTTCACAAAGATATAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTG
GACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCATGCCGATACAACCTGTGCAGCC
GGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTT
ATGTAGAACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGT
TTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCATCGCAACAGCAGTACAATCAGAGAACAAAGACTCCACTACT
CGCCAATGAATTGAAGAATAAACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGG
CGTATGATAGACCAACAATGCCAACAACAGATGTACAGATTCCGTTCGCTGAACCAATTGTAAAGATACCAGAGAATCCAACAACACCAGGAAAGGAAAATATTAGAAAA
CGTATTGAGGACACCCTGAGTGTTCCTCCACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAGATAGGAGAGCATGAACTGGTAGCCAT
GACAAAATGTAGTAGTGAAGCTGTAGGCAGGCCGCTACCCATGAAATGTAACCATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGGAAAAACTTAGGAGACTTTG
AAGAGTGCTCTGCTATAACTAACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAAAGCTTGATAAGATAGCAGAAGGACCGGAA
GAAGTGACTAATCCTGTTGAAAAAATACAAAAAGAAGAATTCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCATAAGCCATTGCCGTCGCATTTGAA
ATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGACAACAGATATCCGAGGGATAA
mRNA sequence
ATGGGTATGGCTCCGGTCACCACTGCACTCATTGGAGAAAAAGAGATGAGGGAGGGTGATTTCCCTTGTTTACAATGGAAAAATTCGAGAGAGGTATCTGGTGCCTTGCC
CAGTTCTGAGAATGCCCTTAATATAGAACATTTCTTTAGAGGTTTAGATCATCCTACTAAGATGATGCTAAACAATGCTGCCAACGGAGCCTTTACAAAGAAGACATTCA
ACGAGATAGTCGACATCCTAAATGACTTAGCTTCACACAACGAACTGTGGTGTTCACAAAGATATAGGGCAGCACCAAAGAAGCAAGATCCAGCTGGAGTTTTGGCTCTG
GACATTGCGACCTCGATGCAAAAAGAGATGGTTACAATGAACCAGAGGCTGAAAGAGATGGCGTTGGGAATAAAAAATCCATTAGCCATGCCGATACAACCTGTGCAGCC
GGATTATTGCACTCCTGCCCCTGTTTGCCAAGTCAACGATCTCATTTGTTCATTTTGCAGTGAAAACCATATTTATGATAATTGTCCACATAACCCTGCTTCCGTTTTTT
ATGTAGAACATGGGAACAATAGGAACTTTAACCCATATTCGAACACCTACAACCCAGGTTGGAGGCACCACCCTAATTTCTCATGGGGAGGTCAAGGAGGTTCGAGCGGT
TTTAATCAAGGGCAGAGCCAGCAGAACAAGCAGCCCTATGTTCCACCTACACAGCAATACATCCCGCCATCGCAACAGCAGTACAATCAGAGAACAAAGACTCCACTACT
CGCCAATGAATTGAAGAATAAACCACAAGGTTCTTTTCCAGGCCATACTGAATTACCAAAACGAGAAGGGAAAGAACAGTGCAAAGCTGTCACCCTTAGGAGTGGACTGG
CGTATGATAGACCAACAATGCCAACAACAGATGTACAGATTCCGTTCGCTGAACCAATTGTAAAGATACCAGAGAATCCAACAACACCAGGAAAGGAAAATATTAGAAAA
CGTATTGAGGACACCCTGAGTGTTCCTCCACAGATGCCAAATTATGCTAAGTTTTTGAAAGATATAGTTTCTAGGAAGAAAAAGATAGGAGAGCATGAACTGGTAGCCAT
GACAAAATGTAGTAGTGAAGCTGTAGGCAGGCCGCTACCCATGAAATGTAACCATCCTGGCAGTTTTACCATTCCATGTTCCATAGGAGGGAAAAACTTAGGAGACTTTG
AAGAGTGCTCTGCTATAACTAACTTGAATCCTGTTATGTTTGATGAGTTTTATGACTTATTAGTTACAGAGATTGAAGAAAAGCTTGATAAGATAGCAGAAGGACCGGAA
GAAGTGACTAATCCTGTTGAAAAAATACAAAAAGAAGAATTCAAGTCGTTACTTCCGTCCATAGTGGAACCACCCACTTTGGAGCATAAGCCATTGCCGTCGCATTTGAA
ATATGCATTTTGGAGAAGCACAAAAAGGCCATTGGATGGACGACAACAGATATCCGAGGGATAA
Protein sequence
MGMAPVTTALIGEKEMREGDFPCLQWKNSREVSGALPSSENALNIEHFFRGLDHPTKMMLNNAANGAFTKKTFNEIVDILNDLASHNELWCSQRYRAAPKKQDPAGVLAL
DIATSMQKEMVTMNQRLKEMALGIKNPLAMPIQPVQPDYCTPAPVCQVNDLICSFCSENHIYDNCPHNPASVFYVEHGNNRNFNPYSNTYNPGWRHHPNFSWGGQGGSSG
FNQGQSQQNKQPYVPPTQQYIPPSQQQYNQRTKTPLLANELKNKPQGSFPGHTELPKREGKEQCKAVTLRSGLAYDRPTMPTTDVQIPFAEPIVKIPENPTTPGKENIRK
RIEDTLSVPPQMPNYAKFLKDIVSRKKKIGEHELVAMTKCSSEAVGRPLPMKCNHPGSFTIPCSIGGKNLGDFEECSAITNLNPVMFDEFYDLLVTEIEEKLDKIAEGPE
EVTNPVEKIQKEEFKSLLPSIVEPPTLEHKPLPSHLKYAFWRSTKRPLDGRQQISEG