; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037961 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037961
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionprotein SPOROCYTELESS
Genome locationchr2:11044298..11045509
RNA-Seq ExpressionLag0037961
SyntenyLag0037961
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR014855 - Plant transcription factor NOZZLE
IPR040356 - SPEAR family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591999.1 hypothetical protein SDJN03_14345, partial [Cucurbita argyrosperma subsp. sororia]1.2e-7356.55Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL
        MA+P++     KP EP KTR  R TAAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPL FP   A  ND   G +
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL

Query:  LGFDRQGLPVQRLGNGGFIG---GGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------NI
        LGF         +GNGGF G   GGLM +E +PHGG A VD RLLIGN V ASRELSSIP L      PPPP   VSDR DICF KKRVNF+      NI
Subjt:  LGFDRQGLPVQRLGNGGFIG---GGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------NI

Query:  IATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEELL
        I TAE P     A FDF+GL+  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E E+ K A EE L
Subjt:  IATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEELL

Query:  GSFAEE---------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
        G FAEE               V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  GSFAEE---------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

KAG7024875.1 hypothetical protein SDJN02_13694, partial [Cucurbita argyrosperma subsp. argyrosperma]6.6e-7757.58Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL
        MA+P++     KP EP KTR  R TAAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPL FP   A  ND   G +
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL

Query:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N
        LGF         +GNGGF     GGGLM +E +PHGG A VDPRLLIGN V ASRELSSIP L      PPPP   VSDR DICFKKKRVNF+      N
Subjt:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N

Query:  IIATAETP----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEELL
        II TAE P    A FDF+GL+  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E E+ K A EE L
Subjt:  IIATAETP----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEELL

Query:  GSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
        G FAEE            V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  GSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

XP_022936226.1 protein virilizer homolog [Cucurbita moschata]7.3e-7657.14Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL
        MA+P++     KP EP KTR  R TAAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPL F    A  ND   G +
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL

Query:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N
        LGF         +GNGGF     GGGLM +E +PHGG A VDPRLLIGN V ASRELSSIP L      PPPP   VSDR DICFKKKRVNF+      N
Subjt:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N

Query:  IIATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEEL
        II TAE P     A FDF+GL+  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E E+ K A EE 
Subjt:  IIATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEEL

Query:  LGSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
        LG FAEE            V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  LGSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

XP_022975733.1 protein SPOROCYTELESS [Cucurbita maxima]3.6e-7558.17Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGG---APPNDCVG
        MATP++     KP EP KTR  R TAAKNP+QKK PQRGLGVAQLERLRLQE WKKMT++ PPH      PFLL     +FPL FP  G   AP  +  G
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGG---APPNDCVG

Query:  GTLLGFDRQGLPVQRLGNGGF--IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL--PPPPAAAVSDR-DICFKKKRVNFN------NIIA
          +LGF         +GN GF   GGGLM +E +PHGG A VDPRLLIGN V ASRELSSIP L  PPPP   VSDR DICFKKKRVNF+      NII 
Subjt:  GTLLGFDRQGLPVQRLGNGGF--IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL--PPPPAAAVSDR-DICFKKKRVNFN------NIIA

Query:  TAE-TPAGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEQLKMAKEELLGSFAEE-
        TAE  P  FDF+GLS  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG      LMEYEFF R+NGR G+EFE+LK   EE LG FAEE 
Subjt:  TAE-TPAGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEQLKMAKEELLGSFAEE-

Query:  -----------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
                   V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  -----------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

XP_038900102.1 protein SPOROCYTELESS-like [Benincasa hispida]1.5e-7357.46Show/hide
Query:  MATPL------LKP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHF-QFQSP-FLLNPTSHNFPLHFPDGGAP----PN
        MATP+       KP EP KTRP R T A+NP+QKK PQRGLGVAQLERLRLQ+ W K+TEM PPHHF Q  SP FLL+ T  NFPL FP   AP     +
Subjt:  MATPL------LKP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHF-QFQSP-FLLNPTSHNFPLHFPDGGAP----PN

Query:  DCVGGTLLGFDRQGLPVQRLGN-GGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPP--PAAAVSDR-DICFKKKRVNFNNI-----
           GG +LGFD QGL VQR+GN GGF+ GG    E Y HGG      +LIGN  V ASRELSSIPKLPPP  P +  SD  DICFKKKRVNF+N+     
Subjt:  DCVGGTLLGFDRQGLPVQRLGN-GGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPP--PAAAVSDR-DICFKKKRVNFNNI-----

Query:  --IATAETP----AGFDFIGLSPT-STGELTSAA-----AHGDLGFS-TFNFNQGGAEVKQVHRRAAGGGGGV----LMEYEFFGRENGRGGSEFEQLKM
          IA AETP    AGFDF+GLS T ST EL ++      A+ D GFS  FNFNQG +          GGG G     LMEYEFF R+N R G+E E+LKM
Subjt:  --IATAETP----AGFDFIGLSPT-STGELTSAA-----AHGDLGFS-TFNFNQGGAEVKQVHRRAAGGGGGV----LMEYEFFGRENGRGGSEFEQLKM

Query:  AKEELL-----GSFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
         KEEL          EEV  VDHGEGSCIT+SC D INGG+ +S+ALDLSLKLSF
Subjt:  AKEELL-----GSFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

TrEMBL top hitse value%identityAlignment
A0A1S3BZ18 uncharacterized protein LOC103494987 isoform X22.6e-5852.19Show/hide
Query:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPLHFP   APP     DC+ 
Subjt:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-

Query:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II
           G +LGFD  G  VQR+G NGGF+          P GG      +LIGN  V ASRELSSIPKL   P A  SDR D CF KKRVNF+N       I 
Subjt:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II

Query:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G
        A AETP+ FDF+GL   ST EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E E+LKM KEEL        
Subjt:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G

Query:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCIT+SC D INGG+ +S+ALDLSLKLSF
Subjt:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

A0A1S3BZ57 uncharacterized protein LOC103494987 isoform X12.7e-6052.48Show/hide
Query:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPLHFP   APP     DC+ 
Subjt:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-

Query:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II
           G +LGFD  G  VQR+G NGGF+          P GG      +LIGN  V ASRELSSIPKL   P A  SDR D CFKKKRVNF+N       I 
Subjt:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II

Query:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G
        A AETP+ FDF+GL   ST EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E E+LKM KEEL        
Subjt:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G

Query:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCIT+SC D INGG+ +S+ALDLSLKLSF
Subjt:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

A0A5A7T067 Protein SPOROCYTELESS2.7e-6052.48Show/hide
Query:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPLHFP   APP     DC+ 
Subjt:  MATPLLK-----PEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPP----NDCV-

Query:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II
           G +LGFD  G  VQR+G NGGF+          P GG      +LIGN  V ASRELSSIPKL   P A  SDR D CFKKKRVNF+N       I 
Subjt:  --GGTLLGFDRQGLPVQRLG-NGGFIGGGLMAVEAYPHGGRAVDPRLLIGN-GVGASRELSSIPKLPPPPAAAVSDR-DICFKKKRVNFNN-------II

Query:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G
        A AETP+ FDF+GL   ST EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E E+LKM KEEL        
Subjt:  ATAETPAGFDFIGLSPTSTGELTSAAA---HGDLGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEQLKMAKEELL------G

Query:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCIT+SC D INGG+ +S+ALDLSLKLSF
Subjt:  SFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

A0A6J1F6X7 protein virilizer homolog3.5e-7657.14Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL
        MA+P++     KP EP KTR  R TAAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPL F    A  ND   G +
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTL

Query:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N
        LGF         +GNGGF     GGGLM +E +PHGG A VDPRLLIGN V ASRELSSIP L      PPPP   VSDR DICFKKKRVNF+      N
Subjt:  LGFDRQGLPVQRLGNGGF----IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL------PPPPAAAVSDR-DICFKKKRVNFN------N

Query:  IIATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEEL
        II TAE P     A FDF+GL+  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E E+ K A EE 
Subjt:  IIATAETP-----AGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEQLKMAKEEL

Query:  LGSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
        LG FAEE            V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  LGSFAEE------------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

A0A6J1IK53 protein SPOROCYTELESS1.8e-7558.17Show/hide
Query:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGG---APPNDCVG
        MATP++     KP EP KTR  R TAAKNP+QKK PQRGLGVAQLERLRLQE WKKMT++ PPH      PFLL     +FPL FP  G   AP  +  G
Subjt:  MATPLL-----KP-EPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGG---APPNDCVG

Query:  GTLLGFDRQGLPVQRLGNGGF--IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL--PPPPAAAVSDR-DICFKKKRVNFN------NIIA
          +LGF         +GN GF   GGGLM +E +PHGG A VDPRLLIGN V ASRELSSIP L  PPPP   VSDR DICFKKKRVNF+      NII 
Subjt:  GTLLGFDRQGLPVQRLGNGGF--IGGGLMAVEAYPHGGRA-VDPRLLIGNGVGASRELSSIPKL--PPPPAAAVSDR-DICFKKKRVNFN------NIIA

Query:  TAE-TPAGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEQLKMAKEELLGSFAEE-
        TAE  P  FDF+GLS  S        A  D  FS  NFNQ GA VKQ+HR+ AG GGG      LMEYEFF R+NGR G+EFE+LK   EE LG FAEE 
Subjt:  TAE-TPAGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEQLKMAKEELLGSFAEE-

Query:  -----------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF
                   V  VDHGEGSCIT+SC+D INGG+ +S+ LDLSLKLSF
Subjt:  -----------VAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACTCCACTGCTCAAACCCGAACCCACAAAAACCAGACCCAAACGAGCCACCGCCGCCAAAAACCCCAACCAGAAAAAGCAGCCTCAGCGCGGCCTCGGCGTCGC
TCAGCTCGAGCGCTTAAGGCTTCAAGAAACTTGGAAGAAAATGACCGAAATGCCCCCTCCCCACCATTTCCAATTCCAATCTCCCTTCCTCCTTAACCCCACTTCACACA
ATTTCCCCCTGCACTTCCCCGACGGCGGCGCTCCGCCCAATGACTGCGTCGGCGGGACCCTTTTGGGGTTTGACCGTCAGGGTTTGCCCGTGCAGAGGCTCGGAAATGGA
GGGTTTATCGGCGGTGGATTGATGGCGGTGGAGGCCTATCCTCACGGCGGCAGAGCGGTGGATCCGAGACTTCTGATCGGAAATGGCGTTGGGGCCTCGAGGGAGCTCTC
TTCAATCCCAAAATTGCCACCGCCGCCGGCGGCGGCCGTTTCTGATCGGGATATTTGCTTCAAGAAGAAACGAGTCAACTTCAACAACATTATCGCGACCGCAGAAACGC
CGGCGGGTTTCGATTTCATCGGACTGAGTCCAACTTCCACCGGAGAATTAACCAGCGCCGCCGCTCACGGAGATTTGGGATTCAGTACTTTCAACTTCAACCAGGGCGGA
GCAGAGGTGAAGCAAGTCCACCGGAGAGCGGCGGGGGGCGGCGGCGGCGTATTGATGGAGTACGAATTCTTTGGGAGGGAAAATGGCAGAGGGGGCTCGGAGTTCGAGCA
GCTGAAAATGGCAAAGGAAGAATTATTGGGTTCATTTGCAGAAGAAGTGGCAGTGGTGGATCATGGAGAAGGTTCTTGTATTACTTCGAGCTGCACTGATTTCATTAATG
GCGGTAGCAGTAGTTCCAGTGCTCTTGATTTGTCTCTCAAGCTTTCATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGACTCCACTGCTCAAACCCGAACCCACAAAAACCAGACCCAAACGAGCCACCGCCGCCAAAAACCCCAACCAGAAAAAGCAGCCTCAGCGCGGCCTCGGCGTCGC
TCAGCTCGAGCGCTTAAGGCTTCAAGAAACTTGGAAGAAAATGACCGAAATGCCCCCTCCCCACCATTTCCAATTCCAATCTCCCTTCCTCCTTAACCCCACTTCACACA
ATTTCCCCCTGCACTTCCCCGACGGCGGCGCTCCGCCCAATGACTGCGTCGGCGGGACCCTTTTGGGGTTTGACCGTCAGGGTTTGCCCGTGCAGAGGCTCGGAAATGGA
GGGTTTATCGGCGGTGGATTGATGGCGGTGGAGGCCTATCCTCACGGCGGCAGAGCGGTGGATCCGAGACTTCTGATCGGAAATGGCGTTGGGGCCTCGAGGGAGCTCTC
TTCAATCCCAAAATTGCCACCGCCGCCGGCGGCGGCCGTTTCTGATCGGGATATTTGCTTCAAGAAGAAACGAGTCAACTTCAACAACATTATCGCGACCGCAGAAACGC
CGGCGGGTTTCGATTTCATCGGACTGAGTCCAACTTCCACCGGAGAATTAACCAGCGCCGCCGCTCACGGAGATTTGGGATTCAGTACTTTCAACTTCAACCAGGGCGGA
GCAGAGGTGAAGCAAGTCCACCGGAGAGCGGCGGGGGGCGGCGGCGGCGTATTGATGGAGTACGAATTCTTTGGGAGGGAAAATGGCAGAGGGGGCTCGGAGTTCGAGCA
GCTGAAAATGGCAAAGGAAGAATTATTGGGTTCATTTGCAGAAGAAGTGGCAGTGGTGGATCATGGAGAAGGTTCTTGTATTACTTCGAGCTGCACTGATTTCATTAATG
GCGGTAGCAGTAGTTCCAGTGCTCTTGATTTGTCTCTCAAGCTTTCATTCTAG
Protein sequenceShow/hide protein sequence
MATPLLKPEPTKTRPKRATAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLHFPDGGAPPNDCVGGTLLGFDRQGLPVQRLGNG
GFIGGGLMAVEAYPHGGRAVDPRLLIGNGVGASRELSSIPKLPPPPAAAVSDRDICFKKKRVNFNNIIATAETPAGFDFIGLSPTSTGELTSAAAHGDLGFSTFNFNQGG
AEVKQVHRRAAGGGGGVLMEYEFFGRENGRGGSEFEQLKMAKEELLGSFAEEVAVVDHGEGSCITSSCTDFINGGSSSSSALDLSLKLSF