; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg016160 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg016160
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein SPOROCYTELESS
Genome locationscaffold9:37113803..37115038
RNA-Seq ExpressionSpg016160
SyntenySpg016160
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR014855 - Plant transcription factor NOZZLE
IPR040356 - SPEAR family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591999.1 hypothetical protein SDJN03_14345, partial [Cucurbita argyrosperma subsp. sororia]7.9e-7858.01Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MA+P++     KP EP KTR  R  AAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPLQFP   A A   ND   
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGFIG---GGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN-----
        G +LGF         +GNGGF G   GGLM +E +PHGGGA VD RLLIGNSV ASRELSSIP L      PPPPPP +SDR DICF KKRVNF+     
Subjt:  GTLLGFDHQGLPVQRLGNGGFIG---GGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN-----

Query:  -NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPKE
         NII TAE P     A FDF+GL+  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E EE K   E
Subjt:  -NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPKE

Query:  ELLGLFAEE---------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
        E LGLFAEE               V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  ELLGLFAEE---------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

KAG7024875.1 hypothetical protein SDJN02_13694, partial [Cucurbita argyrosperma subsp. argyrosperma]4.5e-8159.05Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MA+P++     KP EP KTR  R  AAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPLQFP   A A   ND   
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----
        G +LGF         +GNGGF     GGGLM +E +PHGGGA VDPRLLIGNSV ASRELSSIP L      PPPPPP +SDR DICFKKKRVNF+    
Subjt:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----

Query:  --NIIATAETP----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPKE
          NII TAE P    A FDF+GL+  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E EE K   E
Subjt:  --NIIATAETP----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPKE

Query:  ELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
        E LGLFAEE            V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  ELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

XP_022936226.1 protein virilizer homolog [Cucurbita moschata]4.9e-8058.61Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MA+P++     KP EP KTR  R  AAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPLQF    A A   ND   
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----
        G +LGF         +GNGGF     GGGLM +E +PHGGGA VDPRLLIGNSV ASRELSSIP L      PPPPPP +SDR DICFKKKRVNF+    
Subjt:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----

Query:  --NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPK
          NII TAE P     A FDF+GL+  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E EE K   
Subjt:  --NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPK

Query:  EELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
        EE LGLFAEE            V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  EELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

XP_022975733.1 protein SPOROCYTELESS [Cucurbita maxima]1.8e-8260.17Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MATP++     KP EP KTR  R  AAKNP+QKK PQRGLGVAQLERLRLQE WKKMT++ PPH      PFLL     +FPLQFP  GA +    +  G
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGF--IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL--PPPPPPAISDR-DICFKKKRVNFN------NIIA
          +LGF         +GN GF   GGGLM +E +PHGGGA VDPRLLIGNSV ASRELSSIP L  PPPPPP +SDR DICFKKKRVNF+      NII 
Subjt:  GTLLGFDHQGLPVQRLGNGGF--IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL--PPPPPPAISDR-DICFKKKRVNFN------NIIA

Query:  TAE-TPAGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEELKMPKEELLGLFAEE-
        TAE  P  FDF+GLS  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG      LMEYEFF R+NGR G+EFEELK P EE LGLFAEE 
Subjt:  TAE-TPAGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEELKMPKEELLGLFAEE-

Query:  -----------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
                   V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  -----------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

XP_038900102.1 protein SPOROCYTELESS-like [Benincasa hispida]2.6e-8160.67Show/hide
Query:  MATPL------LKP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHF-QFQSP-FLLNPTSHNFPLQFPDGGAPAVVAND
        MATP+       KP EP KTRP R   A+NP+QKK PQRGLGVAQLERLRLQ+ W K+TEM PPHHF Q  SP FLL+ T  NFPLQFP   AP +VA D
Subjt:  MATPL------LKP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHF-QFQSP-FLLNPTSHNFPLQFPDGGAPAVVAND

Query:  C-VGGTLLGFDHQGLPVQRLGN-GGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPP--PPPAISDR-DICFKKKRVNFNNI-----
           GG +LGFDHQGL VQR+GN GGF+ GG    E Y HGGG     +LIGN SV ASRELSSIPKLPPP  PP   SD  DICFKKKRVNF+N+     
Subjt:  C-VGGTLLGFDHQGLPVQRLGN-GGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPP--PPPAISDR-DICFKKKRVNFNNI-----

Query:  --IATAETP----AGFDFIGLSPT-SPGELTSAA-----AHGDFGFS-TFNFNQGGAEVKQVHRRAAGGGGGV----LMEYEFFGRENGRGGSEFEELKM
          IA AETP    AGFDF+GLS T S  EL ++      A+ DFGFS  FNFNQG +          GGG G     LMEYEFF R+N R G+E EELKM
Subjt:  --IATAETP----AGFDFIGLSPT-SPGELTSAA-----AHGDFGFS-TFNFNQGGAEVKQVHRRAAGGGGGV----LMEYEFFGRENGRGGSEFEELKM

Query:  PKEELLGLFA------EEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
        PKEE L LF       EEV  VDHGEGSCITTSC D INGG+ +S+ALDLSLKLSF
Subjt:  PKEELLGLFA------EEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

TrEMBL top hitse value%identityAlignment
A0A1S3BZ18 uncharacterized protein LOC103494987 isoform X29.5e-6152.77Show/hide
Query:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPL FP    P ++  DC+  
Subjt:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--

Query:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA
          G +LGFDH G  VQR+G NGGF+          P GG      +LIGN SV ASRELSSIPKL   P    SDR D CF KKRVNF+N       I A
Subjt:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA

Query:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----
         AETP+ FDF+GL   S  EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E EELKMPKEE L LF     
Subjt:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----

Query:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCITTSC D INGG+ +S+ALDLSLKLSF
Subjt:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

A0A1S3BZ57 uncharacterized protein LOC103494987 isoform X11.0e-6253.06Show/hide
Query:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPL FP    P ++  DC+  
Subjt:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--

Query:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA
          G +LGFDH G  VQR+G NGGF+          P GG      +LIGN SV ASRELSSIPKL   P    SDR D CFKKKRVNF+N       I A
Subjt:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA

Query:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----
         AETP+ FDF+GL   S  EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E EELKMPKEE L LF     
Subjt:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----

Query:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCITTSC D INGG+ +S+ALDLSLKLSF
Subjt:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

A0A5A7T067 Protein SPOROCYTELESS1.0e-6253.06Show/hide
Query:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--
        MATPL +       P K R  R    KNPNQKK PQRGLGVAQLERLRLQE WK +TE+ PP        FLL+ T  NFPL FP    P ++  DC+  
Subjt:  MATPLLK-----PEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCV--

Query:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA
          G +LGFDH G  VQR+G NGGF+          P GG      +LIGN SV ASRELSSIPKL   P    SDR D CFKKKRVNF+N       I A
Subjt:  -GGTLLGFDHQGLPVQRLG-NGGFIGGGLMAVEAYPHGGGAVDPRLLIGN-SVGASRELSSIPKLPPPPPPAISDR-DICFKKKRVNFNN-------IIA

Query:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----
         AETP+ FDF+GL   S  EL + +    H D G+   ++F+     +KQV      GG G  LMEYEFF R+NGR G+E EELKMPKEE L LF     
Subjt:  TAETPAGFDFIGLSPTSPGELTSAAA---HGDFGFS-TFNFNQGGAEVKQVHRRAAGGG-GGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLF-----

Query:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
           EEV  +DHGEGSCITTSC D INGG+ +S+ALDLSLKLSF
Subjt:  --AEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

A0A6J1F6X7 protein virilizer homolog2.4e-8058.61Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MA+P++     KP EP KTR  R  AAKNP+ KK PQRGLGVAQLERLRLQE WKKMTE+ PPH FQ  SPFLL     +FPLQF    A A   ND   
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----
        G +LGF         +GNGGF     GGGLM +E +PHGGGA VDPRLLIGNSV ASRELSSIP L      PPPPPP +SDR DICFKKKRVNF+    
Subjt:  GTLLGFDHQGLPVQRLGNGGF----IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL------PPPPPPAISDR-DICFKKKRVNFN----

Query:  --NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPK
          NII TAE P     A FDF+GL+  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG       LMEYEFF R+NGR G+E EE K   
Subjt:  --NIIATAETP-----AGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG------VLMEYEFFGRENGRGGSEFEELKMPK

Query:  EELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
        EE LGLFAEE            V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  EELLGLFAEE------------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

A0A6J1IK53 protein SPOROCYTELESS8.8e-8360.17Show/hide
Query:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG
        MATP++     KP EP KTR  R  AAKNP+QKK PQRGLGVAQLERLRLQE WKKMT++ PPH      PFLL     +FPLQFP  GA +    +  G
Subjt:  MATPLL-----KP-EPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVG

Query:  GTLLGFDHQGLPVQRLGNGGF--IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL--PPPPPPAISDR-DICFKKKRVNFN------NIIA
          +LGF         +GN GF   GGGLM +E +PHGGGA VDPRLLIGNSV ASRELSSIP L  PPPPPP +SDR DICFKKKRVNF+      NII 
Subjt:  GTLLGFDHQGLPVQRLGNGGF--IGGGLMAVEAYPHGGGA-VDPRLLIGNSVGASRELSSIPKL--PPPPPPAISDR-DICFKKKRVNFN------NIIA

Query:  TAE-TPAGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEELKMPKEELLGLFAEE-
        TAE  P  FDF+GLS  S        A  DF FS  NFNQ GA VKQ+HR+ AG GGG      LMEYEFF R+NGR G+EFEELK P EE LGLFAEE 
Subjt:  TAE-TPAGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFNQGGAEVKQVHRRAAGGGGG-----VLMEYEFFGRENGRGGSEFEELKMPKEELLGLFAEE-

Query:  -----------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF
                   V  VDHGEGSCITTSC+D INGG+ +S+ LDLSLKLSF
Subjt:  -----------VAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACTCCACTGCTCAAACCCGAACCCACAAAAACCAGACCCAGACGAGCCGCCGCCGCCAAAAACCCCAACCAGAAAAAGCAGCCTCAGCGCGGCCTCGGCGTCGC
TCAGCTCGAGCGCTTAAGGCTTCAAGAGACTTGGAAGAAAATGACCGAAATGCCCCCTCCCCACCATTTCCAATTCCAATCTCCCTTCCTCCTTAACCCCACTTCACACA
ATTTCCCCCTGCAGTTCCCCGACGGCGGCGCTCCGGCCGTCGTTGCCAATGACTGCGTCGGCGGGACCCTTTTGGGGTTTGACCATCAGGGTTTGCCCGTGCAGAGGCTC
GGAAATGGAGGGTTTATCGGCGGTGGATTGATGGCGGTGGAGGCCTATCCTCACGGCGGCGGAGCGGTGGATCCAAGACTTCTGATAGGAAATAGCGTTGGGGCCTCGAG
GGAGCTCTCTTCAATCCCAAAATTGCCGCCGCCGCCGCCGCCGGCCATTTCTGATCGGGATATTTGCTTCAAGAAGAAACGAGTCAACTTCAACAACATTATCGCGACCG
CAGAAACGCCGGCGGGTTTCGATTTCATCGGACTGAGTCCAACTTCCCCCGGAGAATTAACCAGCGCCGCCGCTCACGGAGATTTCGGATTCAGTACTTTCAACTTCAAC
CAGGGCGGAGCAGAGGTGAAGCAAGTCCACCGGAGAGCGGCGGGGGGCGGCGGCGGCGTATTGATGGAGTACGAATTCTTTGGGAGGGAAAATGGCAGAGGAGGCTCGGA
GTTCGAGGAGCTGAAAATGCCAAAGGAAGAATTATTGGGTTTATTTGCAGAAGAAGTGGCAGTGGTGGATCATGGAGAAGGTTCTTGTATTACTACAAGCTGCACTGATT
TCATTAATGGCGGTAGCAGTAGTTCCAGTGCTCTTGATTTGTCTCTCAAGCTTTCATTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTACTCCACTGCTCAAACCCGAACCCACAAAAACCAGACCCAGACGAGCCGCCGCCGCCAAAAACCCCAACCAGAAAAAGCAGCCTCAGCGCGGCCTCGGCGTCGC
TCAGCTCGAGCGCTTAAGGCTTCAAGAGACTTGGAAGAAAATGACCGAAATGCCCCCTCCCCACCATTTCCAATTCCAATCTCCCTTCCTCCTTAACCCCACTTCACACA
ATTTCCCCCTGCAGTTCCCCGACGGCGGCGCTCCGGCCGTCGTTGCCAATGACTGCGTCGGCGGGACCCTTTTGGGGTTTGACCATCAGGGTTTGCCCGTGCAGAGGCTC
GGAAATGGAGGGTTTATCGGCGGTGGATTGATGGCGGTGGAGGCCTATCCTCACGGCGGCGGAGCGGTGGATCCAAGACTTCTGATAGGAAATAGCGTTGGGGCCTCGAG
GGAGCTCTCTTCAATCCCAAAATTGCCGCCGCCGCCGCCGCCGGCCATTTCTGATCGGGATATTTGCTTCAAGAAGAAACGAGTCAACTTCAACAACATTATCGCGACCG
CAGAAACGCCGGCGGGTTTCGATTTCATCGGACTGAGTCCAACTTCCCCCGGAGAATTAACCAGCGCCGCCGCTCACGGAGATTTCGGATTCAGTACTTTCAACTTCAAC
CAGGGCGGAGCAGAGGTGAAGCAAGTCCACCGGAGAGCGGCGGGGGGCGGCGGCGGCGTATTGATGGAGTACGAATTCTTTGGGAGGGAAAATGGCAGAGGAGGCTCGGA
GTTCGAGGAGCTGAAAATGCCAAAGGAAGAATTATTGGGTTTATTTGCAGAAGAAGTGGCAGTGGTGGATCATGGAGAAGGTTCTTGTATTACTACAAGCTGCACTGATT
TCATTAATGGCGGTAGCAGTAGTTCCAGTGCTCTTGATTTGTCTCTCAAGCTTTCATTCTAG
Protein sequenceShow/hide protein sequence
MATPLLKPEPTKTRPRRAAAAKNPNQKKQPQRGLGVAQLERLRLQETWKKMTEMPPPHHFQFQSPFLLNPTSHNFPLQFPDGGAPAVVANDCVGGTLLGFDHQGLPVQRL
GNGGFIGGGLMAVEAYPHGGGAVDPRLLIGNSVGASRELSSIPKLPPPPPPAISDRDICFKKKRVNFNNIIATAETPAGFDFIGLSPTSPGELTSAAAHGDFGFSTFNFN
QGGAEVKQVHRRAAGGGGGVLMEYEFFGRENGRGGSEFEELKMPKEELLGLFAEEVAVVDHGEGSCITTSCTDFINGGSSSSSALDLSLKLSF