; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C04G078400 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C04G078400
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Descriptionprotein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE
Genome locationCla97Chr04:25938749..25945963
RNA-Seq ExpressionCla97C04G078400
SyntenyCla97C04G078400
Gene Ontology termsGO:0009786 - regulation of asymmetric cell division (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYJ98956.1 putative Adenine nucleotide alpha hydrolases-like superfamily protein [Cucumis melo var. makuwa]5.1e-9087.44Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----G
        MACFLLCSK QQ+NPISDSVENVQK+ELLG GVSGE DFSSDN KAL QNGNRVMVVVDWSVEAKEALEWTLSHAVQ NDTIVLVHVLKSLKLQ     G
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----G

Query:  FEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQN
        FEFGNKV YIKA+KLLFSMR+MCLK +P+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRP+LRRL NRWA RRSRRR  KKKTCRATA+YCIQN
Subjt:  FEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQN

Query:  SSCMTIA
        SSCMTIA
Subjt:  SSCMTIA

XP_004137450.1 uncharacterized protein LOC101207475 [Cucumis sativus]6.0e-9188.41Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSK-ALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----
        MACFLLCSK QQ+ PISDSVENVQK+ELLG  VSGEEDFSSDNSK AL QNGNRVMVVVDWSVEAKEALEWTLSHAVQ NDTIVLVHVLKSLKLQ     
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSK-ALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----

Query:  GFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQN
        GFEFGNKV YIKA+KLLFSMR+MCLKT+P+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRP+LRRLLNRWA RRSRRR KKKTCRATA+YCIQN
Subjt:  GFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQN

Query:  SSCMTIA
        SSCMTIA
Subjt:  SSCMTIA

XP_023001141.1 uncharacterized protein LOC111495368 [Cucurbita maxima]5.5e-8484.16Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN
        MACF +CS+PQ K PISDSVENVQKKELLG  VSGEE+FSSD+SKA S+NGNRVMVVVDWSVEA+ ALEWTLSHAV+++DTIVLV+VLKSLK +GFEFGN
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN

Query:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT
        KV   K AYKLLFSMRNMCLK RP+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLL RWAT R RRR KKK+CRATA+YCIQNSSCMT
Subjt:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT

Query:  IA
        IA
Subjt:  IA

XP_023520317.1 uncharacterized protein LOC111783631 [Cucurbita pepo subsp. pepo]4.2e-8484.16Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN
        MACF +CS+PQ+K PISDSVENVQKKELLG  VSGEE+FSSD+SK  S NGNRVMVVVDWSVEA+ ALEWTLSHAV+++DTIVLVHVLKSLK QGFEFGN
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN

Query:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT
        KV   K AYKLLFSMRNMCLK RP+V VE+ALLEGKERGPIIVEEA+KHKLSLLVLGQRKRPILRRLL RWAT R RRR KKKTCRATA+YCIQNSSCMT
Subjt:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT

Query:  IA
        IA
Subjt:  IA

XP_038893566.1 uncharacterized protein LOC120082458 [Benincasa hispida]1.7e-9890.91Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN
        MACFLLCSKPQQKNPISDSVENVQKKELLG GV+GEEDFSSDNSKALSQNGNRVMVVVDWSVEAK+ALEWTLSHAVQNNDTIVLVHVLKSLKLQ FEFGN
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN

Query:  KVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTI
        KV YIKA+KLLFSMRNMCLKTRP+VQVE+ALLEGKERGPIIV+EAKKHKLSLLVLGQRKRPILRRLLNRWATRR+RRR KKKTCRATA+YCIQNSSCMTI
Subjt:  KVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTI

Query:  AGYKVNTMV
        A  K +  +
Subjt:  AGYKVNTMV

TrEMBL top hitse value%identityAlignment
A0A0A0LQN9 Usp domain-containing protein2.9e-9188.41Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSK-ALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----
        MACFLLCSK QQ+ PISDSVENVQK+ELLG  VSGEEDFSSDNSK AL QNGNRVMVVVDWSVEAKEALEWTLSHAVQ NDTIVLVHVLKSLKLQ     
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSK-ALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----

Query:  GFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQN
        GFEFGNKV YIKA+KLLFSMR+MCLKT+P+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRP+LRRLLNRWA RRSRRR KKKTCRATA+YCIQN
Subjt:  GFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQN

Query:  SSCMTIA
        SSCMTIA
Subjt:  SSCMTIA

A0A1S3BY66 uncharacterized protein LOC1034944673.1e-7786.41Show/hide
Query:  QKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----GFEFGNKVKYIKAYKLLFSMRNMC
        +K+ELLG GVSGE DFSSDN KAL QNGNRVMVVVDWSVEAKEALEWTLSHAVQ NDTIVLVHVLKSLKLQ     GFEFGNKV YIKA+KLLFSMR+MC
Subjt:  QKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----GFEFGNKVKYIKAYKLLFSMRNMC

Query:  LKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQNSSCMTIA
        LK +P+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRP+LRRL NRWA RRSRRR  KKKTCRATA+YCIQNSSCMTIA
Subjt:  LKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQNSSCMTIA

A0A5D3BIQ2 Putative Adenine nucleotide alpha hydrolases-like superfamily protein2.5e-9087.44Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----G
        MACFLLCSK QQ+NPISDSVENVQK+ELLG GVSGE DFSSDN KAL QNGNRVMVVVDWSVEAKEALEWTLSHAVQ NDTIVLVHVLKSLKLQ     G
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQ-----G

Query:  FEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQN
        FEFGNKV YIKA+KLLFSMR+MCLK +P+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRP+LRRL NRWA RRSRRR  KKKTCRATA+YCIQN
Subjt:  FEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRR-GKKKTCRATAKYCIQN

Query:  SSCMTIA
        SSCMTIA
Subjt:  SSCMTIA

A0A6J1E832 uncharacterized protein LOC1114316503.5e-8484.65Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN
        MACF +CS+PQ K PISDSVENVQKKELL   VSGEE+FSSD+SKA S+NGNRVMVVVDWSVEA+ ALEWTLSHAV+++DTIVLVHVLKSLK QGFEFGN
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN

Query:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT
        KV   K AYKLLFSMRNMCLK RP+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKR ILRRLL RWAT R RRR KKKTCRATA+YCIQNSSCMT
Subjt:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT

Query:  IA
        IA
Subjt:  IA

A0A6J1KHT2 uncharacterized protein LOC1114953682.6e-8484.16Show/hide
Query:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN
        MACF +CS+PQ K PISDSVENVQKKELLG  VSGEE+FSSD+SKA S+NGNRVMVVVDWSVEA+ ALEWTLSHAV+++DTIVLV+VLKSLK +GFEFGN
Subjt:  MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGN

Query:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT
        KV   K AYKLLFSMRNMCLK RP+VQVE+ALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLL RWAT R RRR KKK+CRATA+YCIQNSSCMT
Subjt:  KVKYIK-AYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMT

Query:  IA
        IA
Subjt:  IA

SwissProt top hitse value%identityAlignment
Q5BPF3 Protein BREAKING OF ASYMMETRY IN THE STOMATAL LINEAGE8.1e-1432.52Show/hide
Query:  KSIASKGKENSKKMSRQRKHSSP----------QAKDVVETKKATAADDSSWPQ---FEDEDYIVFCF-KENGAFDVIKNGNNSETSHSIDLVSTSSRPV
        K I  K K   KK S ++   SP          ++  V  T   +     SWPQ    E+  +IVFCF +E+G FDV+K G   E   +      S R V
Subjt:  KSIASKGKENSKKMSRQRKHSSP----------QAKDVVETKKATAADDSSWPQ---FEDEDYIVFCF-KENGAFDVIKNGNNSETSHSIDLVSTSSRPV

Query:  NRKLNYSEDDKAAKRYNNGGHIRSAEQEDDGEEIENIYIDKEENRMVNHNKVIDDQPIVAVPTESSDSNHSDVSNGSFAFPELGWEWSGSPVQMPKSKGL
        NRKL Y +        NN    +  EQ+ +    +N      ++   +  +   ++  +    +SS S+HSD   GSFAFP LG EW GSP +MP+S  L
Subjt:  NRKLNYSEDDKAAKRYNNGGHIRSAEQEDDGEEIENIYIDKEENRMVNHNKVIDDQPIVAVPTESSDSNHSDVSNGSFAFPELGWEWSGSPVQMPKSKGL

Query:  QLRKHK
          +K K
Subjt:  QLRKHK

Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.4e-2340.37Show/hide
Query:  GNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVK----------YIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPI
        G R++VVVD   EAK AL WTLSH  Q  D+I+L+H LK+   Q  +  NK +            +A K + +++ MC   RP+V+ E+  ++G E+GP 
Subjt:  GNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVK----------YIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPI

Query:  IVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA
        IV+EA++ + SLLVLGQ+K+    RLL  WA+     + +  T     +YCI NS CM IA
Subjt:  IVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein1.5e-1839.07Show/hide
Query:  GNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKL
        G R++VVVD   EAK AL WTLSH  Q  D+I+L+H LK+   Q  +  NK    +  +     +    +   +V+ E+  ++G E+GP IV+EA++ + 
Subjt:  GNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKL

Query:  SLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA
        SLLVLGQ+K+    RLL  WA+     + +  T     +YCI NS CM IA
Subjt:  SLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.0e-2038.51Show/hide
Query:  MVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLE-GKERGPIIVEEAKKHKLSLL
        MVVVD + + K AL+W L+H VQ+ D I L+HV ++   Q  +   + +  +A++L+  ++N C   +P V+ EI ++E  +E+G  IVEE+KK    +L
Subjt:  MVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLE-GKERGPIIVEEAKKHKLSLL

Query:  VLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA
        VLGQRKR    R++ +W T+     G         +YCI NS CM IA
Subjt:  VLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-2340.76Show/hide
Query:  LSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEG--KERGPIIVEE
        +++ GNRVMVVVD  + +  ALEW L H +Q+ D + L++  K  + +G +  N+   +K  +L+ +++ +C   RP ++VEI  L+G  KE+G  IVEE
Subjt:  LSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEG--KERGPIIVEE

Query:  AKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA
        AK+ ++SLLV+G+ K+P + RLL RW  ++ R R        T KYC++ +SCMTIA
Subjt:  AKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.2e-2635.98Show/hide
Query:  QQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKAL---------------------SQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLK
        ++ N  S+ VE+ +++E + + V  EE+   D++K+                      ++ GNRVMVVVD ++ +  ALEW ++H +Q  DT+ L++  K
Subjt:  QQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKAL---------------------SQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLK

Query:  SLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEG--KERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRAT
          +    +  N+ + +K  +L+ +++ +C   RP ++VEI  LEG  K++G  IVEE+KK ++SLLV+GQ K+P + RLL RWA +  RRRG +      
Subjt:  SLKLQGFEFGNKVKYIKAYKLLFSMRNMCLKTRPQVQVEIALLEG--KERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRAT

Query:  AKYCIQNSSCMTIA
         KYC++N+SCMTIA
Subjt:  AKYCIQNSSCMTIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTGTTTCTTGCTTTGTTCTAAGCCACAACAGAAAAACCCCATCTCAGATTCTGTTGAAAATGTTCAAAAAAAGGAGCTTTTGGGGATTGGTGTTAGTGGG
GAAGAAGATTTCAGCTCTGATAACTCTAAAGCTTTGTCTCAAAATGGAAACAGAGTAATGGTGGTTGTTGATTGGAGTGTTGAAGCTAAAGAGGCTTTGGAATGG
ACACTCTCTCATGCTGTTCAGAACAATGACACCATTGTTCTTGTTCATGTTCTCAAAAGTTTGAAGCTTCAAGGTTTTGAGTTTGGTAATAAGGTGAAGTACATA
AAGGCCTATAAGCTTCTCTTTTCCATGAGAAATATGTGCCTAAAGACAAGGCCTCAGGTGCAAGTAGAGATAGCATTATTGGAAGGAAAAGAAAGAGGTCCAATA
ATTGTGGAGGAAGCAAAGAAGCATAAACTTTCACTTTTGGTACTTGGTCAAAGAAAGAGACCGATTCTTCGGCGTCTATTGAACAGATGGGCGACCAGACGCAGT
CGGAGGAGGGGAAAGAAGAAGACTTGTCGAGCTACGGCAAAGTATTGCATTCAGAACTCATCTTGTATGACCATTGCAGGTTACAAAGTGAATACAATGGTTAGA
CCTTCAAGCTTGGACCTGAAGTGGTTGGATGTTGCTACATTTATAGGAAAGTTCGGATTTTCTCATAGGAATCAACAACAAAAGGTACTGATGAAAGTTAGTCAG
CGCTCCTTCGTACAGAACGACTACCCCTGCAACACCAACAAGCATATTGGCTGTGACTCCTCGAAAAAGAGCAGATATGCCCTCATGGCGAATGATCTCAGAAAA
GGCATGCAAACCACTACGGTACTTCAGGGTTTGTCCGGATGTAAGCATCATTCTTCGTCGCAGTGTGTCGAAAGGATAAGCACATACCCCAGAAAAGGTTTGAAT
AAAGAATACTTTTTCAGTTGTTGCTTATTTCACAGATGTGGCCAAATGTTAAGCTATAGGTTGAAAATAGATCCTTACCTCATACTGTCCAACCAGAACAAGAGG
CTTCAAAGTGTCACCAACAATTCCATCACTCGACAATGTTTTTCTGTAGACATCCAATATCCCTTTAAACTGGCGCTGACTGTTACTGCCAACATCCTTGGCATC
GGTGCCAAGTCGAGTTCGTGCATAATCCAAATGATACAGAAACAAAGACATGATAGTAACATGTTAATAAATTCATTCCATTTCGGCGGTTGTTCCGTTCGTGTG
TTCCTCTCACGGGCTCGATATTACGAACCAAATAAATACTTCACTTCAACACCACCACCTCCAATGCCAATAACAAATTTCGTTTTCGACACGAAGAGTATTGCC
TCAAAAGGTAAAGAGAATAGCAAGAAGATGTCGAGACAGAGAAAGCACTCCTCGCCACAAGCTAAAGATGTTGTCGAGACGAAGAAGGCGACGGCGGCGGATGAT
TCGAGCTGGCCACAGTTTGAAGATGAAGACTACATTGTCTTCTGTTTCAAAGAAAATGGAGCATTTGATGTTATAAAGAATGGGAATAATTCAGAGACTTCCCAT
TCCATTGATTTGGTTTCAACAAGTTCAAGACCAGTTAATAGGAAGCTTAATTATAGTGAAGATGATAAAGCAGCCAAAAGATACAACAATGGAGGTCACATTAGA
TCAGCTGAACAGGAAGATGATGGGGAAGAAATAGAGAATATTTACATAGATAAAGAAGAGAACAGAATGGTAAATCACAACAAGGTGATTGATGACCAACCGATC
GTGGCAGTGCCTACCGAATCAAGTGACTCGAATCATTCAGATGTCAGCAATGGATCCTTTGCGTTTCCTGAGTTGGGATGGGAGTGGAGTGGAAGTCCTGTGCAA
ATGCCAAAATCAAAAGGTTTGCAGCTGAGAAAGCACAAGATAATCTATAACCTGCATCAGAAAATGAGAGATGTGTTCAATAATTATAGAACAAAACGTCCCTCT
GGCGCACAACCAACCGTGGGGCATTTCCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTGTTTCTTGCTTTGTTCTAAGCCACAACAGAAAAACCCCATCTCAGATTCTGTTGAAAATGTTCAAAAAAAGGAGCTTTTGGGGATTGGTGTTAGTGGG
GAAGAAGATTTCAGCTCTGATAACTCTAAAGCTTTGTCTCAAAATGGAAACAGAGTAATGGTGGTTGTTGATTGGAGTGTTGAAGCTAAAGAGGCTTTGGAATGG
ACACTCTCTCATGCTGTTCAGAACAATGACACCATTGTTCTTGTTCATGTTCTCAAAAGTTTGAAGCTTCAAGGTTTTGAGTTTGGTAATAAGGTGAAGTACATA
AAGGCCTATAAGCTTCTCTTTTCCATGAGAAATATGTGCCTAAAGACAAGGCCTCAGGTGCAAGTAGAGATAGCATTATTGGAAGGAAAAGAAAGAGGTCCAATA
ATTGTGGAGGAAGCAAAGAAGCATAAACTTTCACTTTTGGTACTTGGTCAAAGAAAGAGACCGATTCTTCGGCGTCTATTGAACAGATGGGCGACCAGACGCAGT
CGGAGGAGGGGAAAGAAGAAGACTTGTCGAGCTACGGCAAAGTATTGCATTCAGAACTCATCTTGTATGACCATTGCAGGTTACAAAGTGAATACAATGGTTAGA
CCTTCAAGCTTGGACCTGAAGTGGTTGGATGTTGCTACATTTATAGGAAAGTTCGGATTTTCTCATAGGAATCAACAACAAAAGGTACTGATGAAAGTTAGTCAG
CGCTCCTTCGTACAGAACGACTACCCCTGCAACACCAACAAGCATATTGGCTGTGACTCCTCGAAAAAGAGCAGATATGCCCTCATGGCGAATGATCTCAGAAAA
GGCATGCAAACCACTACGGTACTTCAGGGTTTGTCCGGATGTAAGCATCATTCTTCGTCGCAGTGTGTCGAAAGGATAAGCACATACCCCAGAAAAGGTTTGAAT
AAAGAATACTTTTTCAGTTGTTGCTTATTTCACAGATGTGGCCAAATGTTAAGCTATAGGTTGAAAATAGATCCTTACCTCATACTGTCCAACCAGAACAAGAGG
CTTCAAAGTGTCACCAACAATTCCATCACTCGACAATGTTTTTCTGTAGACATCCAATATCCCTTTAAACTGGCGCTGACTGTTACTGCCAACATCCTTGGCATC
GGTGCCAAGTCGAGTTCGTGCATAATCCAAATGATACAGAAACAAAGACATGATAGTAACATGTTAATAAATTCATTCCATTTCGGCGGTTGTTCCGTTCGTGTG
TTCCTCTCACGGGCTCGATATTACGAACCAAATAAATACTTCACTTCAACACCACCACCTCCAATGCCAATAACAAATTTCGTTTTCGACACGAAGAGTATTGCC
TCAAAAGGTAAAGAGAATAGCAAGAAGATGTCGAGACAGAGAAAGCACTCCTCGCCACAAGCTAAAGATGTTGTCGAGACGAAGAAGGCGACGGCGGCGGATGAT
TCGAGCTGGCCACAGTTTGAAGATGAAGACTACATTGTCTTCTGTTTCAAAGAAAATGGAGCATTTGATGTTATAAAGAATGGGAATAATTCAGAGACTTCCCAT
TCCATTGATTTGGTTTCAACAAGTTCAAGACCAGTTAATAGGAAGCTTAATTATAGTGAAGATGATAAAGCAGCCAAAAGATACAACAATGGAGGTCACATTAGA
TCAGCTGAACAGGAAGATGATGGGGAAGAAATAGAGAATATTTACATAGATAAAGAAGAGAACAGAATGGTAAATCACAACAAGGTGATTGATGACCAACCGATC
GTGGCAGTGCCTACCGAATCAAGTGACTCGAATCATTCAGATGTCAGCAATGGATCCTTTGCGTTTCCTGAGTTGGGATGGGAGTGGAGTGGAAGTCCTGTGCAA
ATGCCAAAATCAAAAGGTTTGCAGCTGAGAAAGCACAAGATAATCTATAACCTGCATCAGAAAATGAGAGATGTGTTCAATAATTATAGAACAAAACGTCCCTCT
GGCGCACAACCAACCGTGGGGCATTTCCATTAA
Protein sequenceShow/hide protein sequence
MACFLLCSKPQQKNPISDSVENVQKKELLGIGVSGEEDFSSDNSKALSQNGNRVMVVVDWSVEAKEALEWTLSHAVQNNDTIVLVHVLKSLKLQGFEFGNKVKYI
KAYKLLFSMRNMCLKTRPQVQVEIALLEGKERGPIIVEEAKKHKLSLLVLGQRKRPILRRLLNRWATRRSRRRGKKKTCRATAKYCIQNSSCMTIAGYKVNTMVR
PSSLDLKWLDVATFIGKFGFSHRNQQQKVLMKVSQRSFVQNDYPCNTNKHIGCDSSKKSRYALMANDLRKGMQTTTVLQGLSGCKHHSSSQCVERISTYPRKGLN
KEYFFSCCLFHRCGQMLSYRLKIDPYLILSNQNKRLQSVTNNSITRQCFSVDIQYPFKLALTVTANILGIGAKSSSCIIQMIQKQRHDSNMLINSFHFGGCSVRV
FLSRARYYEPNKYFTSTPPPPMPITNFVFDTKSIASKGKENSKKMSRQRKHSSPQAKDVVETKKATAADDSSWPQFEDEDYIVFCFKENGAFDVIKNGNNSETSH
SIDLVSTSSRPVNRKLNYSEDDKAAKRYNNGGHIRSAEQEDDGEEIENIYIDKEENRMVNHNKVIDDQPIVAVPTESSDSNHSDVSNGSFAFPELGWEWSGSPVQ
MPKSKGLQLRKHKIIYNLHQKMRDVFNNYRTKRPSGAQPTVGHFH