; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018650 (gene) of Snake gourd v1 genome

Gene IDTan0018650
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionmembrane protein of ER body-like protein
Genome locationLG06:13958903..13977446
RNA-Seq ExpressionTan0018650
SyntenyTan0018650
Gene Ontology termsGO:0030026 - cellular manganese ion homeostasis (biological process)
GO:0016020 - membrane (cellular component)
GO:0005384 - manganese ion transmembrane transporter activity (molecular function)
InterPro domainsIPR008217 - Ccc1 family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023548916.1 uncharacterized protein LOC111807425 isoform X1 [Cucurbita pepo subsp. pepo]2.2e-3841.07Show/hide
Query:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------
        F +I EE E  + LY+SIKSY+++C +C  C+N VIQ ALK++  E+   +  Y    ++K+  + CY V++ FLN   + CP C  Y+           
Subjt:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------

Query:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK
                                                    IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL+
Subjt:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK

Query:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        K   I   E   + YK +++ KM+ LL F IA+LSF FFGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

XP_023548918.1 uncharacterized protein LOC111807425 isoform X3 [Cucurbita pepo subsp. pepo]2.2e-3841.07Show/hide
Query:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------
        F +I EE E  + LY+SIKSY+++C +C  C+N VIQ ALK++  E+   +  Y    ++K+  + CY V++ FLN   + CP C  Y+           
Subjt:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------

Query:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK
                                                    IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL+
Subjt:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK

Query:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        K   I   E   + YK +++ KM+ LL F IA+LSF FFGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

XP_023548919.1 uncharacterized protein LOC111807425 isoform X4 [Cucurbita pepo subsp. pepo]2.2e-3841.07Show/hide
Query:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------
        F +I EE E  + LY+SIKSY+++C +C  C+N VIQ ALK++  E+   +  Y    ++K+  + CY V++ FLN   + CP C  Y+           
Subjt:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------

Query:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK
                                                    IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL+
Subjt:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK

Query:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        K   I   E   + YK +++ KM+ LL F IA+LSF FFGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

XP_023548920.1 uncharacterized protein LOC111807425 isoform X5 [Cucurbita pepo subsp. pepo]2.2e-3841.07Show/hide
Query:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------
        F +I EE E  + LY+SIKSY+++C +C  C+N VIQ ALK++  E+   +  Y    ++K+  + CY V++ FLN   + CP C  Y+           
Subjt:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------

Query:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK
                                                    IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL+
Subjt:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK

Query:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        K   I   E   + YK +++ KM+ LL F IA+LSF FFGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

XP_023548921.1 uncharacterized protein LOC111807425 isoform X6 [Cucurbita pepo subsp. pepo]2.2e-3841.07Show/hide
Query:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------
        F +I EE E  + LY+SIKSY+++C +C  C+N VIQ ALK++  E+   +  Y    ++K+  + CY V++ FLN   + CP C  Y+           
Subjt:  FGDIGEEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI-----------

Query:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK
                                                    IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL+
Subjt:  --------------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK

Query:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        K   I   E   + YK +++ KM+ LL F IA+LSF FFGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  KTEGIE--EEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

TrEMBL top hitse value%identityAlignment
A0A5A7SK09 Membrane protein of ER body-like protein isoform X42.0e-2150.39Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK------KTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSS
        ESITSL IVTSAA  +   GNIV L+L NLISG FI+ H+L  LK        E  +++    Y+VV+ ++ + +LHF +A  SF  FGLVPP VY FS 
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLK------KTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSS

Query:  LKTDNKDLKILATAGASLSCTTVLAIQKS
         K+++KDLK+ A AGASL C T+LA+ K+
Subjt:  LKTDNKDLKILATAGASLSCTTVLAIQKS

A0A6J1GQA4 uncharacterized protein LOC111456533 isoform X14.5e-3740.43Show/hide
Query:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------
        E + + + L +SIKSYN+YC +C  C+N +IQ  LKL+  E+   +  Y    ++K+  + CY V++ FLN  +  CP C  Y+                
Subjt:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------

Query:  ----------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KK
                                                IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL   +K
Subjt:  ----------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KK

Query:  TEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
           IE +G + YK V+++KM+ LL F IA+LSF  FGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  TEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

A0A6J1GQF7 uncharacterized protein LOC111456533 isoform X34.5e-3740.43Show/hide
Query:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------
        E + + + L +SIKSYN+YC +C  C+N +IQ  LKL+  E+   +  Y    ++K+  + CY V++ FLN  +  CP C  Y+                
Subjt:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------

Query:  ----------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KK
                                                IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL   +K
Subjt:  ----------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KK

Query:  TEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
           IE +G + YK V+++KM+ LL F IA+LSF  FGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  TEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

A0A6J1GQF8 uncharacterized protein LOC111456533 isoform X23.5e-3740.58Show/hide
Query:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------
        E + + + L +SIKSYN+YC +C  C+N +IQ  LKL+  E+   +  Y    ++K+  + CY V++ FLN  +  CP C  Y+                
Subjt:  EEWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLD--EDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYI----------------

Query:  ---------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KKT
                                               IRR       +ESITSLAIVT+A    +  GNI+ALALTNLI+G FIIRH + RL   +K 
Subjt:  ---------------------------------------IRR-------VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KKT

Query:  EGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
          IE +G + YK V+++KM+ LL F IA+LSF  FGLVPP VYA SSLK  NK+LKI+A AGASLSCT +LA+QK+
Subjt:  EGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

A0A6J1JQG4 membrane protein of ER body 2-like1.1e-2760.63Show/hide
Query:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLK
        +ESITSLAIVT+A    +  GNIVALALTNL++G F+IRH + RL   +K   IE +G + YK V+++KM+ LL F IA+LSF FFGLVPP VYA SSLK
Subjt:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL---KKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLK

Query:  TDNKDLKILATAGASLSCTTVLAIQKS
          NK+LKI+  AGASLSCT +LA+QK+
Subjt:  TDNKDLKILATAGASLSCTTVLAIQKS

SwissProt top hitse value%identityAlignment
F4KFS7 Membrane protein of ER body 24.6e-1033.9Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK
        E+ITSL +V+SA+       NI+ALA+ NL  G  ++  +   L+ +   E++    Y+ ++  +    +H  +A +S+ FFGL+PP VYAFS  +T  K
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK

Query:  DLKILATAGASLSCTTVL
        + K+++    SL C  +L
Subjt:  DLKILATAGASLSCTTVL

Q8LPT3 Membrane protein of ER body-like protein1.4e-1438.52Show/hide
Query:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGI-----------EEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPF
        +E+ITSL +++SAA       NI+ L L NL+ G  +I H+L  L++ E I            EE    YK ++  + +  LH T+A LSF   G++PP 
Subjt:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGI-----------EEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPF

Query:  VYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        VY FS  +  NKD K+ +  GASL C  +LAI K+
Subjt:  VYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

Q8W4P8 Membrane protein of ER body 12.4e-1138.73Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV
        ESITSL  VTSAA       N++AL + NL SG  +  HSL  L          T+   EEG      Y+ V+  + +  +H  IA  SF  FGL+PP V
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV

Query:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ
        Y FS  K   K  + K+LA    SL C  +L+I K+  SK +
Subjt:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ

Arabidopsis top hitse value%identityAlignment
AT4G27860.1 vacuolar iron transporter (VIT) family protein1.7e-1238.73Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV
        ESITSL  VTSAA       N++AL + NL SG  +  HSL  L          T+   EEG      Y+ V+  + +  +H  IA  SF  FGL+PP V
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV

Query:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ
        Y FS  K   K  + K+LA    SL C  +L+I K+  SK +
Subjt:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ

AT4G27860.2 vacuolar iron transporter (VIT) family protein1.7e-1238.73Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV
        ESITSL  VTSAA       N++AL + NL SG  +  HSL  L          T+   EEG      Y+ V+  + +  +H  IA  SF  FGL+PP V
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRL--------KKTEGIEEEGTSP---YKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFV

Query:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ
        Y FS  K   K  + K+LA    SL C  +L+I K+  SK +
Subjt:  YAFSSLKTDNK--DLKILATAGASLSCTTVLAIQKSSHSKSQ

AT4G27870.1 Vacuolar iron transporter (VIT) family protein9.7e-1638.52Show/hide
Query:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGI-----------EEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPF
        +E+ITSL +++SAA       NI+ L L NL+ G  +I H+L  L++ E I            EE    YK ++  + +  LH T+A LSF   G++PP 
Subjt:  VESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGI-----------EEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPF

Query:  VYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS
        VY FS  +  NKD K+ +  GASL C  +LAI K+
Subjt:  VYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKS

AT5G24290.1 Vacuolar iron transporter (VIT) family protein3.2e-1133.9Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK
        E+ITSL +V+SA+       NI+ALA+ NL  G  ++  +   L+ +   E++    Y+ ++  +    +H  +A +S+ FFGL+PP VYAFS  +T  K
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK

Query:  DLKILATAGASLSCTTVL
        + K+++    SL C  +L
Subjt:  DLKILATAGASLSCTTVL

AT5G24290.2 Vacuolar iron transporter (VIT) family protein3.2e-1133.9Show/hide
Query:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK
        E+ITSL +V+SA+       NI+ALA+ NL  G  ++  +   L+ +   E++    Y+ ++  +    +H  +A +S+ FFGL+PP VYAFS  +T  K
Subjt:  ESITSLAIVTSAARVDLFYGNIVALALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNK

Query:  DLKILATAGASLSCTTVL
        + K+++    SL C  +L
Subjt:  DLKILATAGASLSCTTVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGGAAGAGGTTGGTTTGGCAGATGAATCAGACAATTACAAACGAATTTCCCTCAAACCTGACCTCTATTCCCTCATCATTTTGGAGACTCGTTCTTATACACAA
AGATGTAAAAATGCTTGTGGAAAATCTTTGGGGTTGTCCTGCTACTTGTCTGCCAATTCTATACTTAGGAATATCGCTCGGTGGTAAACCTTCTACTTATTCATTTTGGG
AGCCTACTGTTGAGAAAATACATAACAATCACATACAATTCTCATATTTATCTAAAGGTGGTCATCTCACTTTATCGAGGCATCTTCATCTTCTTCTCCTTCTTCCCTCA
GACGCACGACTGCCGCCCAACCCAACCGAGCTTGTTGCTGACGACGTCTTCAGCCACTACTCGCCAACATCGTCTTCTTCTCATCACTTGAATTTAGAAAAGGAATATTC
CGAAGGAGAAAGTCATGATTTTATGAAATGCAGTTTTAGGTCCCACTACAGAAAGTTTTATGATTTTAAAAGGCATACCAGGCGAGTGATAGAGATGCAGATATTAATAA
AGAAGAGAGAGAAAGAGAGAGCTAGATTGCAGATAATATTAGTAGAAATAACTTTTATGGTGATTTTTCATTTTCCAGCTTTAGAAGGATTCTTTGGTGATATTGGAGAA
GAATGGGAATATCTAAGGTGGTTATATAATTCAATAAAGTCATATAACTTGTACTGTTGGGAGTGCAACATTTGTATGAACAAGGTGATTCAATGTGCTTTGAAATTGGA
TGAAGACGATGGAAGAGAATTAGATTATTGTCAGATATCAAATGACAAAGAAGAAGAAGAAAATTGTTATCGAGTGGTGCATATGTTTTTAAATTCACCTAGTCTCATCT
GTCCAAATTGCAAGATTTATATTATTAGAAGGGTCGAATCAATCACAAGTTTAGCCATTGTAACCTCTGCAGCAAGGGTCGATCTTTTTTATGGGAATATAGTAGCGTTG
GCATTGACAAACTTGATTTCTGGATTCTTTATAATTAGGCATAGTTTACCAAGACTCAAGAAGACTGAGGGAATTGAAGAGGAAGGGACTAGTCCCTACAAAGTAGTAGT
GAGAGATAAAATGCACGTTCTTCTTCATTTCACCATTGCTTTCTTATCTTTCACATTTTTTGGTTTAGTGCCTCCATTTGTTTATGCATTTTCATCTCTCAAGACTGATA
ACAAGGATCTCAAGATTCTAGCTACTGCAGGAGCTTCTCTTTCATGCACAACAGTTCTTGCTATTCAGAAAAGCTCACACTCAAAATCCCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGGAAGAGGTTGGTTTGGCAGATGAATCAGACAATTACAAACGAATTTCCCTCAAACCTGACCTCTATTCCCTCATCATTTTGGAGACTCGTTCTTATACACAA
AGATGTAAAAATGCTTGTGGAAAATCTTTGGGGTTGTCCTGCTACTTGTCTGCCAATTCTATACTTAGGAATATCGCTCGGTGGTAAACCTTCTACTTATTCATTTTGGG
AGCCTACTGTTGAGAAAATACATAACAATCACATACAATTCTCATATTTATCTAAAGGTGGTCATCTCACTTTATCGAGGCATCTTCATCTTCTTCTCCTTCTTCCCTCA
GACGCACGACTGCCGCCCAACCCAACCGAGCTTGTTGCTGACGACGTCTTCAGCCACTACTCGCCAACATCGTCTTCTTCTCATCACTTGAATTTAGAAAAGGAATATTC
CGAAGGAGAAAGTCATGATTTTATGAAATGCAGTTTTAGGTCCCACTACAGAAAGTTTTATGATTTTAAAAGGCATACCAGGCGAGTGATAGAGATGCAGATATTAATAA
AGAAGAGAGAGAAAGAGAGAGCTAGATTGCAGATAATATTAGTAGAAATAACTTTTATGGTGATTTTTCATTTTCCAGCTTTAGAAGGATTCTTTGGTGATATTGGAGAA
GAATGGGAATATCTAAGGTGGTTATATAATTCAATAAAGTCATATAACTTGTACTGTTGGGAGTGCAACATTTGTATGAACAAGGTGATTCAATGTGCTTTGAAATTGGA
TGAAGACGATGGAAGAGAATTAGATTATTGTCAGATATCAAATGACAAAGAAGAAGAAGAAAATTGTTATCGAGTGGTGCATATGTTTTTAAATTCACCTAGTCTCATCT
GTCCAAATTGCAAGATTTATATTATTAGAAGGGTCGAATCAATCACAAGTTTAGCCATTGTAACCTCTGCAGCAAGGGTCGATCTTTTTTATGGGAATATAGTAGCGTTG
GCATTGACAAACTTGATTTCTGGATTCTTTATAATTAGGCATAGTTTACCAAGACTCAAGAAGACTGAGGGAATTGAAGAGGAAGGGACTAGTCCCTACAAAGTAGTAGT
GAGAGATAAAATGCACGTTCTTCTTCATTTCACCATTGCTTTCTTATCTTTCACATTTTTTGGTTTAGTGCCTCCATTTGTTTATGCATTTTCATCTCTCAAGACTGATA
ACAAGGATCTCAAGATTCTAGCTACTGCAGGAGCTTCTCTTTCATGCACAACAGTTCTTGCTATTCAGAAAAGCTCACACTCAAAATCCCAATAA
Protein sequenceShow/hide protein sequence
MPRKRLVWQMNQTITNEFPSNLTSIPSSFWRLVLIHKDVKMLVENLWGCPATCLPILYLGISLGGKPSTYSFWEPTVEKIHNNHIQFSYLSKGGHLTLSRHLHLLLLLPS
DARLPPNPTELVADDVFSHYSPTSSSSHHLNLEKEYSEGESHDFMKCSFRSHYRKFYDFKRHTRRVIEMQILIKKREKERARLQIILVEITFMVIFHFPALEGFFGDIGE
EWEYLRWLYNSIKSYNLYCWECNICMNKVIQCALKLDEDDGRELDYCQISNDKEEEENCYRVVHMFLNSPSLICPNCKIYIIRRVESITSLAIVTSAARVDLFYGNIVAL
ALTNLISGFFIIRHSLPRLKKTEGIEEEGTSPYKVVVRDKMHVLLHFTIAFLSFTFFGLVPPFVYAFSSLKTDNKDLKILATAGASLSCTTVLAIQKSSHSKSQ