CuGenDBv2

Lsi05G000110 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi05G000110
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: heavy metal-associated isoprenylated plant protein 31
Genome location: chr05:150959..153684
RNA-Seq Expression: Lsi05G000110
Synteny: Lsi05G000110
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
                  IPR036163 - Heavy metal-associated domain superfamily
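
The genome location in this record follows a chromosome:start..end pattern. A minimal Python sketch for splitting it into its parts is given here. The function name `parse_location` is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re

def parse_location(text):
    """Split 'chrom:start..end' into (chrom, start, end).

    Assumption: coordinates are 1-based and inclusive.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr05:150959..153684")
# Span length under the inclusive-coordinate assumption.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene region covers 2726 bp.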


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571797.1 Heavy metal-associated isoprenylated plant protein 31, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.8e-62; identity: 83.87%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCA KLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYDTY  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

XP_022931790.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita moschata] (e value: 1.4e-62; identity: 83.87%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

XP_022971858.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita maxima] (e value: 3.6e-63; identity: 84.52%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                 TT    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

XP_023554193.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 1.4e-62; identity: 83.87%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

XP_038888453.1 heavy metal-associated isoprenylated plant protein 31 [Benincasa hispida] (e value: 5.7e-69; identity: 88.48%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSMVEVRVPNLDCEGCASKLR+ALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHY SFYKYPSYIVNHYYDT Y+STN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  ------NNNKQQQLIT---TTAGGSPT-SPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
              NNNKQ  LI+   TTAGGSPT S HTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
Subjt:  ------NNNKQQQLIT---TTAGGSPT-SPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LHJ7 HMA domain-containing protein (e value: 2.7e-61; identity: 73.99%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSMVEVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAE WPFPGYSSHY SFYKYPSYI NHYYDTY    +
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGS------------------PTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
        N+N      TTT   +                   +  HTFFQTPSLYSLA+SSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGS------------------PTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

A0A6J1EUM7 heavy metal-associated isoprenylated plant protein 31 isoform X2 (e value: 7.2e-62; identity: 83.66%)
Query:  MVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTNNN
        M+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN  
Subjt:  MVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTNNN

Query:  NKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                T    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

A0A6J1F0F2 heavy metal-associated isoprenylated plant protein 31 isoform X1 (e value: 6.5e-63; identity: 83.87%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

A0A6J1I865 heavy metal-associated isoprenylated plant protein 31 isoform X1 (e value: 1.7e-63; identity: 84.52%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        MSM+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                 TT    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

A0A6J1I9R6 heavy metal-associated isoprenylated plant protein 31 isoform X2 (e value: 1.9e-62; identity: 84.31%)
Query:  MVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTNNN
        M+EVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERK+VKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYI  HYYD Y  +TN  
Subjt:  MVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTNNN

Query:  NKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
               TT    SPTS HTFFQTPSLYS AVSSDH IASLFSDDNPHACSIM
Subjt:  NKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

SwissProt top hits (e value, % identity, alignment)
B3H6D0 Heavy metal-associated isoprenylated plant protein 45 (e value: 5.6e-19; identity: 35.48%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        +S+VE+ V ++DC+GC  K+R+A+ KL GV+ VE++++ QK+TV GY ++  +++K +KR G+ AE WPFP Y+ +Y  +Y YPS  +       Y + +
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
         + K           +  S    +   S   +  + D     LFSDDN HAC+IM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

F4IC29 Heavy metal-associated isoprenylated plant protein 28 (e value: 1.5e-11; identity: 30.06%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVN--------HYY
        +  +E+RV ++DC GC S+++ AL K++GV+ VE+++  QK+TV GY  +++K++K +++ G+ AE W  P    H         Y  N        ++ 
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVN--------HYY

Query:  DTYYTSTNNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
            TS+ N  K          G  ++ ++ ++   +++   S  H   S FSD+NP+ACSIM
Subjt:  DTYYTSTNNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

F4IQG4 Heavy metal-associated isoprenylated plant protein 30 (e value: 4.1e-14; identity: 32.19%)
Query:  CEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYAS---FYKYPSYIVNHYYDTYYTSTNNNNKQQQLI
        C GC   ++ A++KL+GV+ VEV +EM+++TV GY +E +K++KA++RAGK AE WP+P    ++ S   ++K  +      Y+ Y    N +++   + 
Subjt:  CEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYAS---FYKYPSYIVNHYYDTYYTSTNNNNKQQQLI

Query:  TTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
         T  G                      D  +++ F+DDN HACS+M
Subjt:  TTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

O81464 Heavy metal-associated isoprenylated plant protein 24 (e value: 6.6e-12; identity: 30.32%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        M  V +RV  +DCEGC  K++  L  +KGV+ V+V++++QK+TV GY ++ +K+++A K   K  E WP+  Y           + + N Y    Y    
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                        P        T S+    V   +TI  +FSD+NP++C+IM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

Q84K70 Heavy metal-associated isoprenylated plant protein 31 (e value: 3.5e-45; identity: 62.18%)
Query:  MSM-VEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTST
        MSM VE+RVPNLDCEGCASKLRK L KLKGVEEVEVE+E QK+T RGY LEE+K++KA++RAGKAAE WP+   +SH+ASFYKYPSY+ NHYY   +   
Subjt:  MSM-VEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTST

Query:  NNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T   GG     HTFF TP+ YS+AV+ D   AS+FSDDNPHAC+IM
Subjt:  NNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

Arabidopsis top hits (e value, % identity, alignment)
AT1G06330.1 Heavy metal transport/detoxification superfamily protein (e value: 1.0e-12; identity: 30.06%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVN--------HYY
        +  +E+RV ++DC GC S+++ AL K++GV+ VE+++  QK+TV GY  +++K++K +++ G+ AE W  P    H         Y  N        ++ 
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVN--------HYY

Query:  DTYYTSTNNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
            TS+ N  K          G  ++ ++ ++   +++   S  H   S FSD+NP+ACSIM
Subjt:  DTYYTSTNNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

AT2G18196.1 Heavy metal transport/detoxification superfamily protein (e value: 2.9e-15; identity: 32.19%)
Query:  CEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYAS---FYKYPSYIVNHYYDTYYTSTNNNNKQQQLI
        C GC   ++ A++KL+GV+ VEV +EM+++TV GY +E +K++KA++RAGK AE WP+P    ++ S   ++K  +      Y+ Y    N +++   + 
Subjt:  CEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYAS---FYKYPSYIVNHYYDTYYTSTNNNNKQQQLI

Query:  TTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
         T  G                      D  +++ F+DDN HACS+M
Subjt:  TTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

AT3G48970.1 Heavy metal transport/detoxification superfamily protein (e value: 2.5e-46; identity: 62.18%)
Query:  MSM-VEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTST
        MSM VE+RVPNLDCEGCASKLRK L KLKGVEEVEVE+E QK+T RGY LEE+K++KA++RAGKAAE WP+   +SH+ASFYKYPSY+ NHYY   +   
Subjt:  MSM-VEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTST

Query:  NNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                  T   GG     HTFF TP+ YS+AV+ D   AS+FSDDNPHAC+IM
Subjt:  NNNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

AT3G56891.1 Heavy metal transport/detoxification superfamily protein (e value: 4.0e-20; identity: 35.48%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        +S+VE+ V ++DC+GC  K+R+A+ KL GV+ VE++++ QK+TV GY ++  +++K +KR G+ AE WPFP Y+ +Y  +Y YPS  +       Y + +
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
         + K           +  S    +   S   +  + D     LFSDDN HAC+IM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM

AT4G08570.1 Heavy metal transport/detoxification superfamily protein (e value: 4.7e-13; identity: 30.32%)
Query:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN
        M  V +RV  +DCEGC  K++  L  +KGV+ V+V++++QK+TV GY ++ +K+++A K   K  E WP+  Y           + + N Y    Y    
Subjt:  MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTN

Query:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
                        P        T S+    V   +TI  +FSD+NP++C+IM
Subjt:  NNNKQQQLITTTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM
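
The % identity values listed with the hits look consistent with the usual definition: identical alignment columns divided by alignment length. For instance, 130 identical columns out of 155 would give 83.87. The page does not state how the figure is computed, so the Python sketch below is an illustration under that assumption. The function name `percent_identity` and the short example strings are hypothetical.

```python
def percent_identity(query, midline):
    """Percent of alignment columns where the midline repeats the query residue.

    Assumption: '+' (similar residue) and blank (mismatch or gap) columns
    in the midline do not count as identical.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(
        1 for q, m in zip(query, midline) if q == m and q.isalpha()
    )
    return 100.0 * identical / len(query)

# Hypothetical six-column alignment: five identical columns, one '+' column.
toy = round(percent_identity("MSMVEV", "MSM+EV"), 2)
# Ratio mentioned in the text: 130 identical columns out of 155.
ratio = round(100.0 * 130 / 155, 2)
print(toy, ratio)
```

Gap characters in the query line fail the `isalpha` check, so gapped columns count toward the alignment length but never toward the identical total.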


Sequences
CDS sequence
ATGTCTATGGTTGAGGTTAGAGTTCCGAATCTCGACTGCGAAGGATGCGCTTCAAAGCTGAGAAAAGCTCTTTTCAAGCTCAAAGGAGTGGAAGAAGTGGAAGTGGAAAT
AGAGATGCAGAAAATAACAGTGAGAGGGTACGGATTGGAAGAAAGAAAAATAGTGAAGGCAATAAAAAGAGCTGGAAAAGCAGCAGAAGCATGGCCATTCCCAGGATATT
CTTCTCATTATGCTTCCTTTTACAAATACCCTTCTTATATAGTCAACCATTATTATGACACTTATTACACTTCCACTAATAATAATAATAAGCAACAACAACTCATCACT
ACTACTGCTGGAGGATCTCCCACTTCTCCTCACACTTTCTTTCAAACTCCTTCCCTTTACTCTCTTGCTGTCTCCTCTGACCATACCATTGCCTCCCTTTTCAGTGATGA
CAATCCCCATGCTTGTTCCATCATGTGA
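
The CDS can be checked against the protein sequence of this record by translating it codon by codon. The Python sketch below assumes the standard genetic code (NCBI translation table 1), which is the usual choice for a plant nuclear gene but is not stated on this page. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 16 codons of the CDS listed in this record.
prefix = "ATGTCTATGGTTGAGGTTAGAGTTCCGAATCTCGACTGCGAAGGATGC"
print(translate(prefix))
```

Passing the full CDS, with its line breaks, to `translate` gives a string that can be compared directly with the protein sequence of this record.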
mRNA sequence
CTTAGCTTTATGCATGCAACGAATTAGGCCTAATTAACAACTCTTTCACTCTCTCTATTTTCTTCATACAATAACAATTACAAGCCATCCACAATGTCTATGGTTGAGGT
TAGAGTTCCGAATCTCGACTGCGAAGGATGCGCTTCAAAGCTGAGAAAAGCTCTTTTCAAGCTCAAAGGAGTGGAAGAAGTGGAAGTGGAAATAGAGATGCAGAAAATAA
CAGTGAGAGGGTACGGATTGGAAGAAAGAAAAATAGTGAAGGCAATAAAAAGAGCTGGAAAAGCAGCAGAAGCATGGCCATTCCCAGGATATTCTTCTCATTATGCTTCC
TTTTACAAATACCCTTCTTATATAGTCAACCATTATTATGACACTTATTACACTTCCACTAATAATAATAATAAGCAACAACAACTCATCACTACTACTGCTGGAGGATC
TCCCACTTCTCCTCACACTTTCTTTCAAACTCCTTCCCTTTACTCTCTTGCTGTCTCCTCTGACCATACCATTGCCTCCCTTTTCAGTGATGACAATCCCCATGCTTGTT
CCATCATGTGATTTCTTTTGTCTCTCATCTTTTTCCTTTCTTTCCCTTTCTTTTTAAATACCATTACGATGCCTTATCCTTCTTTTCTTTCAATTTCTAGGGTTTTATTA
TTCTATTTTACTTTGTGAATTATCTTTCCAAAGCTCCCCATTACCAATTTAGTCTATTTGGTTTTTCTTTCATTGTCTACCTCGAGTATTTAAATTAGCTTTTTTTTTAA
GCTTTCAATATTATTCCTCGATATTCTTAATTGTCTAATGGGTTCTTGACTTATTTCATATTTCTTTTTTTAAGAAATTCAAAGTTGTTAAGGTTTCATTTGACATAAAA
TTCAAATATATATACATATCGATTAATTAATTTTT
Protein sequence
MSMVEVRVPNLDCEGCASKLRKALFKLKGVEEVEVEIEMQKITVRGYGLEERKIVKAIKRAGKAAEAWPFPGYSSHYASFYKYPSYIVNHYYDTYYTSTNNNNKQQQLIT
TTAGGSPTSPHTFFQTPSLYSLAVSSDHTIASLFSDDNPHACSIM