CuGenDBv2

Lag0033184 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033184
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: S-protein homolog
Genome location: chr11:41536601..41543969
RNA-Seq Expression: Lag0033184
Synteny: Lag0033184
Gene Ontology terms: GO:0110165 - cellular anatomical structure (cellular component)
InterPro domains: IPR010264 - Plant self-incompatibility S1
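The genome location field can be split into chromosome, start and end for downstream use. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1


chrom, start, end = parse_location("chr11:41536601..41543969")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the locus spans 7369 bp, which is longer than the 2289-nt CDS listed under Sequences.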


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAB2620635.1 hypothetical protein D8674_043062 [Pyrus ussuriensis x Pyrus communis] | e value: 5.7e-29 | % identity: 35.19
Query:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYILNSPITKFEF----------------IH
        +G   I  G  + W F+   SG+TLYWCD HN   HASF+VFW E+    L+ RC N   C W   D+     N+  T F+F                 H
Subjt:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYILNSPITKFEF----------------IH

Query:  PWEPEPSNIQ-----PLTPMKVTIMNHQ-TNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR
        P    P+N       PL    + ++N    +  L AHC S DDDLG + I  G  + W F+ N SG+TL+WCD HN   H SF+V             C 
Subjt:  PWEPEPSNIQ-----PLTPMKVTIMNHQ-TNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR

Query:  INSNCIWSADDKGFYILNSPINQLEFIHPWESG
            C W A D G Y+   P ++ EF   WE G
Subjt:  INSNCIWSADDKGFYILNSPINQLEFIHPWESG

KAB2636489.1 hypothetical protein D8674_027023 [Pyrus ussuriensis x Pyrus communis] | e value: 1.1e-37 | % identity: 35.92
Query:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYI-----------------LNSPITKFEFI
        +G   I  G  + W F+ N SG+TLYWCD HN   HASF+VFWPE    WL+ RC N   C W A D G Y+                 +  P       
Subjt:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYI-----------------LNSPITKFEFI

Query:  HPWEPEPSNIQPLTPMKV-----------------TIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWP
        H    +   +   TP                     + N      L+AHC SK+DD+G + I  G    W FK NF GTTL+WC       H +F V+W 
Subjt:  HPWEPEPSNIQPLTPMKV-----------------TIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWP

Query:  ESTNEWLEERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESG
        ES + WL  RC     C W A D GFYI   P  + E IH WE G
Subjt:  ESTNEWLEERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESG

KAE8646229.1 hypothetical protein Csa_016318 [Cucumis sativus] | e value: 3.2e-48 | % identity: 41.6
Query:  HVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPE--TKRWLQDRCGNISNCIWAADDKGFYILNSPITKFEFIHPW----------------
        H++  G  Y W FK NF GTTL+WC      A+ SF+ FWPE  +  WL+DRCG    CIW A D G Y+ N+P  + EF+H W                
Subjt:  HVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPE--TKRWLQDRCGNISNCIWAADDKGFYILNSPITKFEFIHPW----------------

Query:  -------EPEPSN--IQPLTPMK--VTIMNHQTNDNLWAHCHSKDDDLG-QQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLE
               E + SN  +  L P++  + + N  TN ++ AHC SKDDDLG Q ++  G  + W FK NF  TTLFWC      A+VSF VFWPE  + WL 
Subjt:  -------EPEPSN--IQPLTPMK--VTIMNHQTNDNLWAHCHSKDDDLG-QQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLE

Query:  ERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESGR
        +RC     CIWSA D G Y+ N P    E +H W S R
Subjt:  ERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESGR

XP_011656368.1 S-protein homolog 1 [Cucumis sativus] | e value: 2.2e-28 | % identity: 53.91
Query:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        PLT   VTI+N+Q N +L  HC SKDDDLG  VI   G+ Y W FK N+  TT +WCDF +K+ H SF+VFWPE    W  +RC  NSNC+W A   GF 
Subjt:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Query:  ILNSPINQLEFIHPW
        +LN+P   LEF HPW
Subjt:  ILNSPINQLEFIHPW

XP_038907112.1 S-protein homolog 74-like [Benincasa hispida] | e value: 7.4e-29 | % identity: 56.52
Query:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        PLT  +VTI+N+Q N  L  HC SKDDDLG  VI   G+ Y W FK NF  TT FWC+F +++ H SF+VFWPES   WL +RC  +SNC+W AD+KGF 
Subjt:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Query:  ILNSPINQLEFIHPW
        +LN P   LEF HPW
Subjt:  ILNSPINQLEFIHPW

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KA40 S-protein homolog | e value: 1.0e-28 | % identity: 53.91
Query:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        PLT   VTI+N+Q N +L  HC SKDDDLG  VI   G+ Y W FK N+  TT +WCDF +K+ H SF+VFWPE    W  +RC  NSNC+W A   GF 
Subjt:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Query:  ILNSPINQLEFIHPW
        +LN+P   LEF HPW
Subjt:  ILNSPINQLEFIHPW

A0A1S3C8Z6 S-protein homolog | e value: 1.0e-28 | % identity: 54.78
Query:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        PLT  +VTI+N+Q N +L  HC SKDDDLG  VI   G+ Y W FK N+  TT FWC+F +++ H SF+VFWPE T  WL +RC  NSNC+W A + GF 
Subjt:  PLTPMKVTIMNHQTNDNLWAHCHSKDDDLGQQVI-GIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Query:  ILNSPINQLEFIHPW
        +LN P   LEF HPW
Subjt:  ILNSPINQLEFIHPW

A0A5N5GYD4 S-protein homolog | e value: 2.7e-29 | % identity: 35.19
Query:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYILNSPITKFEF----------------IH
        +G   I  G  + W F+   SG+TLYWCD HN   HASF+VFW E+    L+ RC N   C W   D+     N+  T F+F                 H
Subjt:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYILNSPITKFEF----------------IH

Query:  PWEPEPSNIQ-----PLTPMKVTIMNHQ-TNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR
        P    P+N       PL    + ++N    +  L AHC S DDDLG + I  G  + W F+ N SG+TL+WCD HN   H SF+V             C 
Subjt:  PWEPEPSNIQ-----PLTPMKVTIMNHQ-TNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR

Query:  INSNCIWSADDKGFYILNSPINQLEFIHPWESG
            C W A D G Y+   P ++ EF   WE G
Subjt:  INSNCIWSADDKGFYILNSPINQLEFIHPWESG

A0A5N5I9W3 S-protein homolog | e value: 5.5e-38 | % identity: 35.92
Query:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYI-----------------LNSPITKFEFI
        +G   I  G  + W F+ N SG+TLYWCD HN   HASF+VFWPE    WL+ RC N   C W A D G Y+                 +  P       
Subjt:  VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKR-WLQDRCGNISNCIWAADDKGFYI-----------------LNSPITKFEFI

Query:  HPWEPEPSNIQPLTPMKV-----------------TIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWP
        H    +   +   TP                     + N      L+AHC SK+DD+G + I  G    W FK NF GTTL+WC       H +F V+W 
Subjt:  HPWEPEPSNIQPLTPMKV-----------------TIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWP

Query:  ESTNEWLEERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESG
        ES + WL  RC     C W A D GFYI   P  + E IH WE G
Subjt:  ESTNEWLEERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESG

A0A6J1CQI8 S-protein homolog | e value: 9.8e-27 | % identity: 51.79
Query:  VTIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR--INSNCIWSADDKGFYILNSP
        V ++N+  N+ L  HC SKDDDLG Q +  G  + W FKVNF GTTLFWC+ H   A+V+F+VFWPES N WL  RC      +CIW+A D G Y+ N P
Subjt:  VTIMNHQTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCR--INSNCIWSADDKGFYILNSP

Query:  INQLEFIHPWES
         N  E IH W S
Subjt:  INQLEFIHPWES

SwissProt top hits (columns: e value, % identity, alignment)
F4JLQ5 S-protein homolog 2 | e value: 8.8e-09 | % identity: 29.37
Query:  PWEPEPSN-IQPLTPMKVTIMNHQTND-NLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSN
        P +P  +N + P +   V I N   N   L  HC SKDDDLG + +  G+S+ + F   F G TL++C F       SF ++  +  +   + +C  +  
Subjt:  PWEPEPSN-IQPLTPMKVTIMNHQTND-NLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSN

Query:  CIWSADDKGFYILNSPINQLEFIHPW
        C+W     G    N    Q +  +PW
Subjt:  CIWSADDKGFYILNSPINQLEFIHPW

F4JLS0 S-protein homolog 1 | e value: 2.7e-13 | % identity: 32.5
Query:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG
        +  ++  +VT++N   T + L+ HC SK+DDLG+  +     + W F  N   +T FWC  +    H++  VFW +     L  RC    NCIW+A   G
Subjt:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG

Query:  FYILNSPINQLEFIHPWESG
         Y+ NS   +      WE G
Subjt:  FYILNSPINQLEFIHPWESG

P0DN92 S-protein homolog 24 | e value: 2.0e-08 | % identity: 36.46
Query:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDF-HNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        +VTI N   ND L   HC S+DDDLG  ++  G+ +GW+F VNF  +TL++C F   ++    F+++   +  ++   RC   +NC W A+  G Y
Subjt:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDF-HNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Q2HQ46 S-protein homolog 74 | e value: 2.7e-13 | % identity: 32.5
Query:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG
        +  ++  +VT+ N   T + L+ HC SK++DLG   +     + W F  N   +TLFWC       H++ +VFW +     L  RC    NC+W+A + G
Subjt:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG

Query:  FYILNSPINQLEFIHPWESG
         Y+ NS I +      W+SG
Subjt:  FYILNSPINQLEFIHPWESG

Q9FI83 S-protein homolog 28 | e value: 8.0e-10 | % identity: 37.89
Query:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        +VTI N+  ND L A HC S+DDDLG  ++  G+ +GW+F VNF  +TL +C F  +  +    + +  S + +   RC   +NC W A+  GF+
Subjt:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G16195.1 Plant self-incompatibility protein S1 family | e value: 6.3e-10 | % identity: 29.37
Query:  PWEPEPSN-IQPLTPMKVTIMNHQTND-NLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSN
        P +P  +N + P +   V I N   N   L  HC SKDDDLG + +  G+S+ + F   F G TL++C F       SF ++  +  +   + +C  +  
Subjt:  PWEPEPSN-IQPLTPMKVTIMNHQTND-NLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSN

Query:  CIWSADDKGFYILNSPINQLEFIHPW
        C+W     G    N    Q +  +PW
Subjt:  CIWSADDKGFYILNSPINQLEFIHPW

AT4G16295.1 S-protein homologue 1 | e value: 1.9e-14 | % identity: 32.5
Query:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG
        +  ++  +VT++N   T + L+ HC SK+DDLG+  +     + W F  N   +T FWC  +    H++  VFW +     L  RC    NCIW+A   G
Subjt:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG

Query:  FYILNSPINQLEFIHPWESG
         Y+ NS   +      WE G
Subjt:  FYILNSPINQLEFIHPWESG

AT4G29035.1 Plant self-incompatibility protein S1 family | e value: 1.9e-14 | % identity: 32.5
Query:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG
        +  ++  +VT+ N   T + L+ HC SK++DLG   +     + W F  N   +TLFWC       H++ +VFW +     L  RC    NC+W+A + G
Subjt:  IQPLTPMKVTIMNH-QTNDNLWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKG

Query:  FYILNSPINQLEFIHPWESG
         Y+ NS I +      W+SG
Subjt:  FYILNSPINQLEFIHPWESG

AT5G06020.1 Plant self-incompatibility protein S1 family | e value: 6.9e-09 | % identity: 37.5
Query:  ITKFE-FIHPWEPEPSNIQPLTPMKVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVS-FQVFWPESTNEWL
        +T +E F    EP P    PLT  ++T+ N+  ND L   HC SKDDDLG  +   G+ YGW+F VNF  +TL++C F     +   F +   E      
Subjt:  ITKFE-FIHPWEPEPSNIQPLTPMKVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVS-FQVFWPESTNEWL

Query:  EERCRINSNCIWSADDKGFY
          RCR   NC W+A     Y
Subjt:  EERCRINSNCIWSADDKGFY

AT5G06030.1 Plant self-incompatibility protein S1 family | e value: 5.7e-11 | % identity: 37.89
Query:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
        +VTI N+  ND L A HC S+DDDLG  ++  G+ +GW+F VNF  +TL +C F  +  +    + +  S + +   RC   +NC W A+  GF+
Subjt:  KVTIMNHQTNDNLWA-HCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFY
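The % identity column can be related to the middle line of each alignment. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function names are hypothetical. The sample string is only the final 15 columns of the XP_011656368.1 alignment, so its figure is not expected to match the value reported for the full alignment.

```python
def alignment_stats(midline):
    """Return (identities, positives, length) for a BLAST-style middle line."""
    length = len(midline)
    # Letters mark identical residues (assumed convention).
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, length


def percent(part, whole):
    return round(100.0 * part / whole, 2)


# Middle line for the query fragment ILNSPINQLEFIHPW (last block of that hit).
midline = "+LN+P   LEF HPW"
identities, positives, length = alignment_stats(midline)
print(identities, positives, length, percent(identities, length))
```

To reproduce a reported value, the middle lines of all blocks of one hit would need to be concatenated first, with any padding added by the page layout removed.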


Sequences
CDS sequence
ATGTTTTTTCTTCATTCAAAGTGTGAAAGTTTAATTAATAGCCTTACAGTTTCGCCATTAAAGTTCAGTATTCAGAAGTGTGCACAAGGCAACCGACTCAAGCATCTTTA
CACCGGGCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACATCCTCTGGCTTGAGCATCTTTACA
CCGGGCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACC
GGGCGAGCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGG
GCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGGGC
GAGCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGGGCGA
GCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACATCCTCTGGCTTGAGCATCTTTACACCGGGCGAGC
CTGGACTGCCGAAGTCTACCTCGCCTCATCCAAAAAGCCAGGACCGAACACCTCTTGCCAAAGCCGAGCACCTCTTGCCAAGGCCGAGCACCTCTTGCCGAGGCCGAGCA
CAAACTTTAAAGTACGGTGGGGGTTCGGGGCAACGCCCCTAAGCAAAATTACAAAATGTGATGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGGCAAGCAGAAAGGGA
GATAGCTGTGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGCACCACTTTTGTAATCTCTCACCCTTTTTCATTTTCATTTCATCTAACCCCTATTCAGGGAGTGTCTA
CTTCTATCATACTTTGCAGGATGACAACAAAGCTACACGACAATCGGGGGGAAATCGGGCTGGAAACCGGGCCAAGGAGGCGGAGCCGGTAAGCGGGACGGGCCAAGGCC
GAAGGGGTCGGGTTTTTGACCCGACCCCATGCTCGGCCTCGGCCCATGGCCGAGGCCGAGCACATGGTCGGCCTCGCCATGGGCCGAGGCCGACCCTCGGCCCGCTCGTG
CGGGCCGAGCTCGCTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCCAGGAGGACGAA
CAGGTATTTATATCCCTCTTCGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCACGCTCTA
CCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCCTCAACTACAAATTTACC
GTTGGTGAGCATGTCATCGGAATCGGGAGGAACTATTGGTGGGAATTCAAGGTGAACTTTTCGGGAACGACGTTGTATTGGTGCGACTTTCACAACAAAGTAGCACATGC
TTCGTTTCAAGTTTTTTGGCCAGAAACAAAACGGTGGCTTCAAGATCGTTGCGGCAACATTTCAAACTGTATTTGGGCTGCTGATGACAAGGGTTTTTACATTTTAAATT
CTCCTATAACCAAATTTGAGTTTATTCATCCATGGGAACCTGAACCGTCCAACATACAGCCATTGACGCCAATGAAAGTGACGATTATGAACCATCAAACAAATGATAAT
CTGTGGGCTCATTGCCATTCCAAAGACGATGATTTAGGTCAGCAGGTCATCGGAATCGGGAAGAGCTACGGGTGGCAATTCAAGGTGAACTTTTCAGGAACGACGTTGTT
TTGGTGCGACTTTCACAACAAAGTAGCACATGTTTCGTTTCAAGTTTTTTGGCCAGAATCAACAAATGAGTGGCTTGAAGAACGATGCCGCATCAATTCAAACTGTATTT
GGAGTGCTGATGACAAGGGCTTTTACATTTTAAATTCTCCGATAAACCAATTGGAGTTTATTCATCCATGGGAATCTGGTCGTGCTTAG
mRNA sequence
ATGTTTTTTCTTCATTCAAAGTGTGAAAGTTTAATTAATAGCCTTACAGTTTCGCCATTAAAGTTCAGTATTCAGAAGTGTGCACAAGGCAACCGACTCAAGCATCTTTA
CACCGGGCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACATCCTCTGGCTTGAGCATCTTTACA
CCGGGCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACC
GGGCGAGCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGG
GCGAGCCTGGACTGTCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGGGC
GAGCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACCTCCTCTGGCTTGAGCATCTTTACACCGGGCGA
GCCTGGACTGCCGAAGTCTACCTCGCCTCATCCAGAGTGAGACCATGTGAGCCTGGACCGACGAGTCTACCTCACATCCTCTGGCTTGAGCATCTTTACACCGGGCGAGC
CTGGACTGCCGAAGTCTACCTCGCCTCATCCAAAAAGCCAGGACCGAACACCTCTTGCCAAAGCCGAGCACCTCTTGCCAAGGCCGAGCACCTCTTGCCGAGGCCGAGCA
CAAACTTTAAAGTACGGTGGGGGTTCGGGGCAACGCCCCTAAGCAAAATTACAAAATGTGATGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGGCAAGCAGAAAGGGA
GATAGCTGTGCTCGGCCTCTTCCCGAGGCCGAGGCCGACCAGCACCACTTTTGTAATCTCTCACCCTTTTTCATTTTCATTTCATCTAACCCCTATTCAGGGAGTGTCTA
CTTCTATCATACTTTGCAGGATGACAACAAAGCTACACGACAATCGGGGGGAAATCGGGCTGGAAACCGGGCCAAGGAGGCGGAGCCGGTAAGCGGGACGGGCCAAGGCC
GAAGGGGTCGGGTTTTTGACCCGACCCCATGCTCGGCCTCGGCCCATGGCCGAGGCCGAGCACATGGTCGGCCTCGCCATGGGCCGAGGCCGACCCTCGGCCCGCTCGTG
CGGGCCGAGCTCGCTTGGTCCCGTCTGGTCCCCACCGCCTCTGGATGCCCCGGTTTCGCCTGGTTTGACCTAAAACGCCTCCGAAACCCTAAAAAGGCCAGGAGGACGAA
CAGGTATTTATATCCCTCTTCGCCACTGAAGAGGGGATCCCGAATTCTATCCCTAAACTCTACTCTCTATTCTCTGCTTTCTCCTCTTGCTCTTACTTTTCCACGCTCTA
CCGTTCTGTTTGCTGACTTAAGCATCGGAGCCGGTGTGGCGAGCACCACACCGGTGTGCAGGTTTACTGTCTTGCAGGCCACGTCTTCCCCCCTCAACTACAAATTTACC
GTTGGTGAGCATGTCATCGGAATCGGGAGGAACTATTGGTGGGAATTCAAGGTGAACTTTTCGGGAACGACGTTGTATTGGTGCGACTTTCACAACAAAGTAGCACATGC
TTCGTTTCAAGTTTTTTGGCCAGAAACAAAACGGTGGCTTCAAGATCGTTGCGGCAACATTTCAAACTGTATTTGGGCTGCTGATGACAAGGGTTTTTACATTTTAAATT
CTCCTATAACCAAATTTGAGTTTATTCATCCATGGGAACCTGAACCGTCCAACATACAGCCATTGACGCCAATGAAAGTGACGATTATGAACCATCAAACAAATGATAAT
CTGTGGGCTCATTGCCATTCCAAAGACGATGATTTAGGTCAGCAGGTCATCGGAATCGGGAAGAGCTACGGGTGGCAATTCAAGGTGAACTTTTCAGGAACGACGTTGTT
TTGGTGCGACTTTCACAACAAAGTAGCACATGTTTCGTTTCAAGTTTTTTGGCCAGAATCAACAAATGAGTGGCTTGAAGAACGATGCCGCATCAATTCAAACTGTATTT
GGAGTGCTGATGACAAGGGCTTTTACATTTTAAATTCTCCGATAAACCAATTGGAGTTTATTCATCCATGGGAATCTGGTCGTGCTTAG
Protein sequence
MFFLHSKCESLINSLTVSPLKFSIQKCAQGNRLKHLYTGRAWTVEVYLASSRVRPCEPGPTSLPHILWLEHLYTGRAWTVEVYLASSRVRPCEPGPTSLPHLLWLEHLYT
GRAWTAEVYLASSRVRPCEPGPTSLPHLLWLEHLYTGRAWTVEVYLASSRVRPCEPGPTSLPHLLWLEHLYTGRAWTAEVYLASSRVRPCEPGPTSLPHLLWLEHLYTGR
AWTAEVYLASSRVRPCEPGPTSLPHILWLEHLYTGRAWTAEVYLASSKKPGPNTSCQSRAPLAKAEHLLPRPSTNFKVRWGFGATPLSKITKCDARPLPEAEADQASRKG
DSCARPLPEAEADQHHFCNLSPFFIFISSNPYSGSVYFYHTLQDDNKATRQSGGNRAGNRAKEAEPVSGTGQGRRGRVFDPTPCSASAHGRGRAHGRPRHGPRPTLGPLV
RAELAWSRLVPTASGCPGFAWFDLKRLRNPKKARRTNRYLYPSSPLKRGSRILSLNSTLYSLLSPLALTFPRSTVLFADLSIGAGVASTTPVCRFTVLQATSSPLNYKFT
VGEHVIGIGRNYWWEFKVNFSGTTLYWCDFHNKVAHASFQVFWPETKRWLQDRCGNISNCIWAADDKGFYILNSPITKFEFIHPWEPEPSNIQPLTPMKVTIMNHQTNDN
LWAHCHSKDDDLGQQVIGIGKSYGWQFKVNFSGTTLFWCDFHNKVAHVSFQVFWPESTNEWLEERCRINSNCIWSADDKGFYILNSPINQLEFIHPWESGRA
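The protein sequence above can be checked against the CDS by translating it. The sketch below is a minimal illustration that assumes the standard nuclear genetic code and reading frame 1. The `translate` helper is hypothetical and is not a CuGenDB tool. Only the first 30 nt and the last 18 nt of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGTTTTTTCTTCATTCAAAGTGTGAAAGT"))  # start of the CDS
print(translate("GAATCTGGTCGTGCTTAG"))              # end of the CDS
```

The two fragments translate to MFFLHSKCES and ESGRA, matching the start and end of the listed protein. Passing the full CDS with its line breaks removed would be the complete check.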