CuGenDBv2

Tan0001518 (gene) of Snake gourd v1 genome

Gene ID: Tan0001518
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heavy metal-associated isoprenylated plant protein 31
Genome location: LG07:7037399..7039973
RNA-Seq Expression: Tan0001518
Synteny: Tan0001518
Gene Ontology terms: GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR006121 - Heavy metal-associated domain, HMA
    IPR036163 - Heavy metal-associated domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
XP_004148352.2 heavy metal-associated isoprenylated plant protein 31 [Cucumis sativus] (e value: 1.6e-60; % identity: 75.72)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTY---ST
        MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAE WPFPGYSSHY SFYKYPSYI N+YYDTY   ++
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTY---ST

Query:  NHHYNNNKSATTATA----------------------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        N + N+N S TT ++                      LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
Subjt:  NHHYNNNKSATTATA----------------------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

XP_022931790.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita moschata] (e value: 2.1e-60; % identity: 83.11)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH
        MSM+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH

Query:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
                + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

XP_022971858.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita maxima] (e value: 1.2e-60; % identity: 83.11)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH
        MSM+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH

Query:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
                + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

XP_023554193.1 heavy metal-associated isoprenylated plant protein 31 isoform X1 [Cucurbita pepo subsp. pepo] (e value: 2.1e-60; % identity: 83.11)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH
        MSM+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH

Query:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
                + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

XP_038888453.1 heavy metal-associated isoprenylated plant protein 31 [Benincasa hispida] (e value: 3.4e-63; % identity: 81.71)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYST---
        MSMVEVRVPNLDCEGCASKL++ALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHY SFYKYPSYIVN+YYDTYS+   
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYST---

Query:  -NHHYNNNK-----SATTATA-------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
         N H+NNNK     S+T  TA       LHTFFQTPSLYSLA+SSDH IASLFSDDNPHACSIM
Subjt:  -NHHYNNNK-----SATTATA-------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LHJ7 HMA domain-containing protein (e value: 7.6e-61; % identity: 75.72)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTY---ST
        MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAE WPFPGYSSHY SFYKYPSYI N+YYDTY   ++
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTY---ST

Query:  NHHYNNNKSATTATA----------------------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        N + N+N S TT ++                      LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
Subjt:  NHHYNNNKSATTATA----------------------LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

A0A6J1EUM7 heavy metal-associated isoprenylated plant protein 31 isoform X2 (e value: 1.1e-59; % identity: 82.88)
Query:  MVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYN
        M+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +  
Subjt:  MVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYN

Query:  NNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
              + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  NNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

A0A6J1F0F2 heavy metal-associated isoprenylated plant protein 31 isoform X1 (e value: 1.0e-60; % identity: 83.11)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH
        MSM+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH

Query:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
                + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

A0A6J1I865 heavy metal-associated isoprenylated plant protein 31 isoform X1 (e value: 5.8e-61; % identity: 83.11)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH
        MSM+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHH

Query:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
                + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  YNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

A0A6J1I9R6 heavy metal-associated isoprenylated plant protein 31 isoform X2 (e value: 6.5e-60; % identity: 82.88)
Query:  MVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYN
        M+EVRVPNLDCEGCASKL+KALFKLKGVEEVEVEIEMQKITVRGYGLEERKV+KAIKR GKAAEAWPFPGYSSHYASFYKYPSYI  +YYD YS   +  
Subjt:  MVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYN

Query:  NNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
              + T+LHTFFQTPSLYS A+SSDHAIASLFSDDNPHACSIM
Subjt:  NNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

SwissProt top hits (e value, % identity, alignment)
B3H6D0 Heavy metal-associated isoprenylated plant protein 45 (e value: 6.3e-20; % identity: 38.51)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPS----------YIVNN
        +S+VE+ V ++DC+GC  K+++A+ KL GV+ VE++++ QK+TV GY ++  +VLK +KRTG+ AE WPFP Y+ +Y  +Y YPS          Y   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPS----------YIVNN

Query:  Y---YDTYSTNHHYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        Y   YD Y  +   N N S          +   S   +  + D     LFSDDN HAC+IM
Subjt:  Y---YDTYSTNHHYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

F4IC29 Heavy metal-associated isoprenylated plant protein 28 (e value: 3.0e-14; % identity: 34.84)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH-
        +  +E+RV ++DC GC S++K AL K++GV+ VE+++  QK+TV GY  +++KVLK +++TG+ AE W  P    H         Y  N        NH 
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH-

Query:  ------HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
               YN  K    +    ++   P   S+     H   S FSD+NP+ACSIM
Subjt:  ------HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

F4IQG4 Heavy metal-associated isoprenylated plant protein 30 (e value: 4.0e-14; % identity: 34.56)
Query:  CEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNNNKSATTATA
        C GC   +K A++KL+GV+ VEV +EM+++TV GY +E +KVLKA++R GK AE WP+P    ++ S   Y       + ++Y  N++ +    +     
Subjt:  CEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNNNKSATTATA

Query:  LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        +H          +    D  +++ F+DDN HACS+M
Subjt:  LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

O65657 Heavy metal-associated isoprenylated plant protein 23 (e value: 2.0e-13; % identity: 38.62)
Query:  VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNN
        VE++V  +DC+GC  K+K +L  LKGV+ VE+  + QK+TV GY  +  KVLK  K TGK AE WP+  Y+           YI   Y       +    
Subjt:  VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNN

Query:  NKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        + + TT T +  ++  PS             SLFSDDNP+ACSIM
Subjt:  NKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

Q84K70 Heavy metal-associated isoprenylated plant protein 31 (e value: 9.6e-45; % identity: 62.42)
Query:  MSM-VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH
        MSM VE+RVPNLDCEGCASKL+K L KLKGVEEVEVE+E QK+T RGY LEE+KVLKA++R GKAAE WP+   +SH+ASFYKYPSY+         TNH
Subjt:  MSM-VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH

Query:  HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        +Y++         +HTFF TP+ YS+A++ D   AS+FSDDNPHAC+IM
Subjt:  HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

Arabidopsis top hits (e value, % identity, alignment)
AT1G06330.1 Heavy metal transport/detoxification superfamily protein (e value: 2.1e-15; % identity: 34.84)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH-
        +  +E+RV ++DC GC S++K AL K++GV+ VE+++  QK+TV GY  +++KVLK +++TG+ AE W  P    H         Y  N        NH 
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH-

Query:  ------HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
               YN  K    +    ++   P   S+     H   S FSD+NP+ACSIM
Subjt:  ------HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

AT2G18196.1 Heavy metal transport/detoxification superfamily protein (e value: 2.8e-15; % identity: 34.56)
Query:  CEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNNNKSATTATA
        C GC   +K A++KL+GV+ VEV +EM+++TV GY +E +KVLKA++R GK AE WP+P    ++ S   Y       + ++Y  N++ +    +     
Subjt:  CEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNNNKSATTATA

Query:  LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        +H          +    D  +++ F+DDN HACS+M
Subjt:  LHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

AT3G48970.1 Heavy metal transport/detoxification superfamily protein (e value: 6.9e-46; % identity: 62.42)
Query:  MSM-VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH
        MSM VE+RVPNLDCEGCASKL+K L KLKGVEEVEVE+E QK+T RGY LEE+KVLKA++R GKAAE WP+   +SH+ASFYKYPSY+         TNH
Subjt:  MSM-VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNH

Query:  HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        +Y++         +HTFF TP+ YS+A++ D   AS+FSDDNPHAC+IM
Subjt:  HYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

AT3G56891.1 Heavy metal transport/detoxification superfamily protein (e value: 4.5e-21; % identity: 38.51)
Query:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPS----------YIVNN
        +S+VE+ V ++DC+GC  K+++A+ KL GV+ VE++++ QK+TV GY ++  +VLK +KRTG+ AE WPFP Y+ +Y  +Y YPS          Y   +
Subjt:  MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPS----------YIVNN

Query:  Y---YDTYSTNHHYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        Y   YD Y  +   N N S          +   S   +  + D     LFSDDN HAC+IM
Subjt:  Y---YDTYSTNHHYNNNKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM

AT4G39700.1 Heavy metal transport/detoxification superfamily protein (e value: 1.4e-14; % identity: 38.62)
Query:  VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNN
        VE++V  +DC+GC  K+K +L  LKGV+ VE+  + QK+TV GY  +  KVLK  K TGK AE WP+  Y+           YI   Y       +    
Subjt:  VEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNN

Query:  NKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
        + + TT T +  ++  PS             SLFSDDNP+ACSIM
Subjt:  NKSATTATALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM


Sequences
CDS sequence
ATGTCTATGGTGGAGGTTAGAGTTCCCAACCTCGACTGCGAGGGATGCGCTTCAAAGTTGAAGAAAGCTCTTTTCAAGCTCAAAGGAGTGGAAGAAGTGGAGGTAGAAAT
AGAGATGCAGAAGATAACAGTGAGAGGGTACGGATTGGAAGAGAGAAAAGTGTTGAAGGCAATAAAGAGAACTGGAAAAGCAGCAGAGGCTTGGCCATTCCCAGGATACT
CTTCCCATTATGCTTCTTTTTACAAATACCCTTCTTACATTGTTAACAATTATTATGACACTTACAGTACCAACCATCATTACAACAACAATAAATCAGCCACTACAGCC
ACTGCTCTTCACACTTTCTTTCAAACCCCTTCTCTTTATTCTCTCGCTCTCTCCTCCGACCACGCCATTGCTTCACTTTTCAGCGACGACAATCCCCATGCTTGCTCTAT
CATGTAA
mRNA sequence
AAAAAAAAAAAAAAAAAAAAAAACATGTTGGATGATGAGAGTGAGAGCCTAATACAACTCTAGCTATAAACACTAGACTTGCCTCCCCCTTTCTCCCACACACTCTGCTT
CTCTCTCTCTATGCATGCAACAAAATAACTCTCTCTCAGTTTCTCTATTTTGTTCACAAGTCAGCTATGTCTATGGTGGAGGTTAGAGTTCCCAACCTCGACTGCGAGGG
ATGCGCTTCAAAGTTGAAGAAAGCTCTTTTCAAGCTCAAAGGAGTGGAAGAAGTGGAGGTAGAAATAGAGATGCAGAAGATAACAGTGAGAGGGTACGGATTGGAAGAGA
GAAAAGTGTTGAAGGCAATAAAGAGAACTGGAAAAGCAGCAGAGGCTTGGCCATTCCCAGGATACTCTTCCCATTATGCTTCTTTTTACAAATACCCTTCTTACATTGTT
AACAATTATTATGACACTTACAGTACCAACCATCATTACAACAACAATAAATCAGCCACTACAGCCACTGCTCTTCACACTTTCTTTCAAACCCCTTCTCTTTATTCTCT
CGCTCTCTCCTCCGACCACGCCATTGCTTCACTTTTCAGCGACGACAATCCCCATGCTTGCTCTATCATGTAAAAGTAAAACCCCTTGTTAATTCACTCTTCTTTGCTTT
TAACAACATTATTATTATTGTTGCTTTCTACTTACTTATTACTTATACCATCACCCTGCCTTAATCTGGTTTCATTCCGGTTTTACTCCTTCAACTTTTTTATTTTTTCT
ATTTTTTGTACGAATTAGGCCGAATACAAAATACATCATCTTTCCTTCTACACGTTTAAGCCAAACAAGTTAGTTAGTGAATCACAACGATAAAAAAAAAAAAAAAAAAA
AAGAGATAGAGACAAGAGAACGACAACAAAAAAAAAAAAAACTATTTTGATTTGCCCTAAACTTAGACTAAGTTCAGTTTCTATATAATGAAAAAATTACAAAAACACTC
ACGATTATCAATCGGACGTTAAATAGTCCAACAAAAGCTACTTGACTTTTAAGAACTATAAACTAGTAGCATAGGGTAACGCGACAATATTCAACCAAATTTTCAGCTGC
ACAAAAGAAAAGCTTATTGACATTATTGAGGTAAGCCCTCTATGAAATATTCC
Protein sequence
MSMVEVRVPNLDCEGCASKLKKALFKLKGVEEVEVEIEMQKITVRGYGLEERKVLKAIKRTGKAAEAWPFPGYSSHYASFYKYPSYIVNNYYDTYSTNHHYNNNKSATTA
TALHTFFQTPSLYSLALSSDHAIASLFSDDNPHACSIM
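
The CDS and protein sequences listed for Tan0001518 can be cross-checked against each other by translating the CDS. The following is a minimal sketch of such a check, not part of the database record: it assumes the standard genetic code (NCBI translation table 1), and the names `translate`, `cds_prefix` and `protein_prefix` are hypothetical helpers introduced here for illustration. Only the first 30 nucleotides of the CDS and the first 10 residues of the protein are used; the full sequences can be pasted in the same way, since whitespace and line breaks are ignored.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the 64-entry codon -> amino acid lookup table.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace (e.g. line wraps from the page) is removed first.
    Codons containing unexpected characters are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt of the CDS and first 10 residues of the protein listed above.
cds_prefix = "ATGTCTATGGTGGAGGTTAGAGTTCCCAAC"
protein_prefix = "MSMVEVRVPN"

print(translate(cds_prefix) == protein_prefix)
```

A table-driven lookup is used here so the sketch has no external dependencies; a sequence-analysis library could be substituted if one is already in use.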