; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0005775 (gene) of Chayote v1 genome

Gene IDSed0005775
OrganismSechium edule (Chayote v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 2170 .
Genome locationLG01:7002011..7003591
RNA-Seq ExpressionSed0005775
SyntenySed0005775
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578929.1 TOM1-like protein 3, partial [Cucurbita argyrosperma subsp. sororia]8.8e-8075Show/hide
Query:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV
        EDCI FRGWDSAA  DDSQSESGVCSPTLWG  S      HRPRNRSLSPTSRT+AIARGQQELMEMVR MPESSYELSLKDLVEHHLSNS+RQ G  S+
Subjt:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV

Query:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG
         R DSASETSFRRD SK RSETRALVTRSRSVDSGGFYLKMF P+P   +++KKK NLRTDSG         K PPVDR+WWRKRS      NEGSVSG 
Subjt:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG

Query:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
             GSSNSTSS R  SRNSESQG CWFCISPMRSK  E
Subjt:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

XP_004144253.1 uncharacterized protein LOC101219576 [Cucumis sativus]3.0e-8871.22Show/hide
Query:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGS------DHRPRNRSLSPTSRTRAIARGQQELMEMVRAM
        MGDI +K P K       P        EDCI FRGWDS AA+ DDSQSESGV SPTLW   S       HR RNRSLSPTSRT+AIARGQQELMEMVR M
Subjt:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGS------DHRPRNRSLSPTSRTRAIARGQQELMEMVRAM

Query:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------
        PESSYELSLKDLVEHHL+NS+RQ  G   S+ R DS+SETSFRRD SK R ETRALVTRSRSVDSGGFYLKMFFP+P   +++KKK NLRTDSG      
Subjt:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------

Query:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
           K PPVD+DWWRKRS+V+ GEN+GS+SGGSMTSSGSSNSTSS RSNSRNSESQGSCWFCISPMRSK  E
Subjt:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

XP_008441584.1 PREDICTED: uncharacterized protein LOC103485667 [Cucumis melo]3.3e-8770.85Show/hide
Query:  MGDIKSKSPRK-ATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAM
        MGDI +K P K    + +   +  +   EDCI FRGWDS AA+ DDSQSESGV SPTLW   S      HR RNRSLSPTSRT+AIARGQQELMEMVR M
Subjt:  MGDIKSKSPRK-ATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAM

Query:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------
        PESSYELSLKDLVEHHL+NS+RQ  G   S+ R DS+SETSFRRD SK R ETRALVTRSRSVDSGGFYLKMFFP+P   +++KKK NLRTDSG      
Subjt:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------

Query:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
           K PPVD+DWWRKRS+V+ GEN+GS+SGGSMTSSGSSNSTSS RSNSRNSESQGSCWFCISPMRSK  E
Subjt:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

XP_022938490.1 uncharacterized protein LOC111444710 [Cucurbita moschata]8.8e-8075Show/hide
Query:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV
        EDCI FRGWDSAA  DDSQSESGVCSPTLWG  S      HRPRNRSLSPTSRT+AIARGQQELMEMVR MPESSYELSLKDLVEHHLSNS+RQ G  S+
Subjt:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV

Query:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG
         R DSASETSFRRD SK RSETRALVTRSRSVDSGGFYLKMF P+P   +++KKK NLRTDSG         K PPVDR+WWRKRS      NEGSVSG 
Subjt:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG

Query:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
             GSSNSTSS R  SRNSESQG CWFCISPMRSK  E
Subjt:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

XP_038885358.1 uncharacterized protein LOC120075766 [Benincasa hispida]5.0e-9171.69Show/hide
Query:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMP
        MGDI++K PRK+ H+ +    +     EDCI FRGWDS AA+ DDSQSESGV SPTLWG  S      HRPRNRSLSPTSR +AIARGQQELMEMVR MP
Subjt:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMP

Query:  ESSYELSLKDLVEHHLSNSQRQ-----AGGSVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG-----
        ESSYELSLKDLVEHHL+NS+RQ     A  S+ R DS+SETSFRRD+SK RSETR LVTRSRSVDSGGFYLKMF P+P   +++KKK NLRTDSG     
Subjt:  ESSYELSLKDLVEHHLSNSQRQ-----AGGSVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG-----

Query:  ---SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
            K PPV++DWWRKRSAVA GENEGS+SGGSM SSGSSNSTSS RSNSRNSESQGSCWFCISPMRSK  E
Subjt:  ---SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

TrEMBL top hitse value%identityAlignment
A0A0A0KCW4 Uncharacterized protein1.5e-8871.22Show/hide
Query:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGS------DHRPRNRSLSPTSRTRAIARGQQELMEMVRAM
        MGDI +K P K       P        EDCI FRGWDS AA+ DDSQSESGV SPTLW   S       HR RNRSLSPTSRT+AIARGQQELMEMVR M
Subjt:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGS------DHRPRNRSLSPTSRTRAIARGQQELMEMVRAM

Query:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------
        PESSYELSLKDLVEHHL+NS+RQ  G   S+ R DS+SETSFRRD SK R ETRALVTRSRSVDSGGFYLKMFFP+P   +++KKK NLRTDSG      
Subjt:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------

Query:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
           K PPVD+DWWRKRS+V+ GEN+GS+SGGSMTSSGSSNSTSS RSNSRNSESQGSCWFCISPMRSK  E
Subjt:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

A0A1S3B3A7 uncharacterized protein LOC1034856671.6e-8770.85Show/hide
Query:  MGDIKSKSPRK-ATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAM
        MGDI +K P K    + +   +  +   EDCI FRGWDS AA+ DDSQSESGV SPTLW   S      HR RNRSLSPTSRT+AIARGQQELMEMVR M
Subjt:  MGDIKSKSPRK-ATHSQNKPREQQDPIPEDCIFFRGWDS-AAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAM

Query:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------
        PESSYELSLKDLVEHHL+NS+RQ  G   S+ R DS+SETSFRRD SK R ETRALVTRSRSVDSGGFYLKMFFP+P   +++KKK NLRTDSG      
Subjt:  PESSYELSLKDLVEHHLSNSQRQAGG---SVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG------

Query:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
           K PPVD+DWWRKRS+V+ GEN+GS+SGGSMTSSGSSNSTSS RSNSRNSESQGSCWFCISPMRSK  E
Subjt:  --SKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

A0A6J1FJY2 uncharacterized protein LOC1114447104.3e-8075Show/hide
Query:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV
        EDCI FRGWDSAA  DDSQSESGVCSPTLWG  S      HRPRNRSLSPTSRT+AIARGQQELMEMVR MPESSYELSLKDLVEHHLSNS+RQ G  S+
Subjt:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV

Query:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG
         R DSASETSFRRD SK RSETRALVTRSRSVDSGGFYLKMF P+P   +++KKK NLRTDSG         K PPVDR+WWRKRS      NEGSVSG 
Subjt:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG

Query:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
             GSSNSTSS R  SRNSESQG CWFCISPMRSK  E
Subjt:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

A0A6J1JND4 uncharacterized protein LOC1114868786.8e-7866.54Show/hide
Query:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPE
        MGD+  K PR   H+            EDCI FRG DSA   DDSQSESGVCSPTLWG  S      HR RNR+LSPTSRT+AIARGQQELMEMVR MPE
Subjt:  MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPE

Query:  SSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIPM--IASKKKPNLRTDS--------GSKL
        SSYELSLKDLVEHHL   ++Q   SV++ DS SETSF RD  KKRSETRALVTRSRSV+SGGFYLKMFFP+P+  I++KKK NLR+DS          K 
Subjt:  SSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIPM--IASKKKPNLRTDS--------GSKL

Query:  PPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
        P VDRDWWRKRS+  +GEN GSVSG   +SS +SNSTSS RSNSRNSES+GSCWFCISP+RSK  E
Subjt:  PPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

A0A6J1K1H6 uncharacterized protein LOC1114897762.1e-7974.58Show/hide
Query:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV
        EDCI FRGWDSAA  DDSQ ESGVCSPTLWG  S      HRPRNRSLSPTSRT+AIARGQQELMEMVR MPESSYELSLKDLVEHHLSNS+RQ G  S+
Subjt:  EDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSD-----HRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAG-GSV

Query:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG
         R DSASETSFRRD SK RSETRALVTRSRSVDSGGFYLKMF P+P   +++KKK NLRTDSG         K PPVDR+WWRKRS      NEGSVSG 
Subjt:  DRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIP--MIASKKKPNLRTDSG--------SKLPPVDRDWWRKRSAVAAGENEGSVSGG

Query:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE
             GSSNSTSS R  SRNSESQG CWFCISPMRSK  E
Subjt:  SMTSSGSSNSTSSLRSNSRNSESQGSCWFCISPMRSKKTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21390.1 embryo defective 21703.8e-2035.87Show/hide
Query:  AVGDDSQSESGVCSPTLWGGGSD------HRPRNR-SLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLS-NSQRQAGGSVDRCDSASETS
        ++ +D  S+SGVCSPTLW           HRP +  SLSP S+ +AIARGQ+ELMEMV  MPES YELSLKDLVE  ++  ++R+    + +  +     
Subjt:  AVGDDSQSESGVCSPTLWGGGSD------HRPRNR-SLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLS-NSQRQAGGSVDRCDSASETS

Query:  FRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIPMIA----SKKKPNLRTDSGSKLPP-----------VDRDWWRKRSAVAAGENEGSVSGGSMTSS
         R+  S KR +      RS   ++ GF LK+ F + + A    +KKK   + D   K+ P            D++WW +            +S  S   S
Subjt:  FRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIPMIA----SKKKPNLRTDSGSKLPP-----------VDRDWWRKRSAVAAGENEGSVSGGSMTSS

Query:  GSSNSTSSLRSNSRNSESQGSCW
        GSS+S +S+RS S   + + SC+
Subjt:  GSSNSTSSLRSNSRNSESQGSCW

AT1G76980.1 BEST Arabidopsis thaliana protein match is: embryo defective 2170 (TAIR:AT1G21390.1)6.0e-1834.7Show/hide
Query:  DSQSESGVCSPTLWGGGSDHRPRN-----RSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASK
        D  S+SGVCSP LW       P +     ++LSP ++ + IARGQ+ELM+MV  MPES YELSLKDLVE  ++  + +    + +     E   R+   K
Subjt:  DSQSESGVCSPTLWGGGSDHRPRN-----RSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASK

Query:  KRSETRALVTRSRSVDSGGFYLKMFFPIPMIASKKKPNLRTDS--------------GSKLPPV--------DRDWW-------RKRSAVAAGENEGS--
         +S+      R+  V++ GF LK+ FP+  + +KKK N + D+               S  P +        D+DWW       R+  +V +  N GS  
Subjt:  KRSETRALVTRSRSVDSGGFYLKMFFPIPMIASKKKPNLRTDS--------------GSKLPPV--------DRDWW-------RKRSAVAAGENEGS--

Query:  VSGGSMTSSGSSNSTSSLR
         SGGS + S S  S +SLR
Subjt:  VSGGSMTSSGSSNSTSSLR

AT1G76980.2 FUNCTIONS IN: molecular_function unknown3.5e-1834.53Show/hide
Query:  DSQSESGVCSPTLWGGGSDHRPRN-----RSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASK
        D  S+SGVCSP LW       P +     ++LSP ++ + IARGQ+ELM+MV  MPES YELSLKDLVE  ++  + +    + +     E   R+   K
Subjt:  DSQSESGVCSPTLWGGGSDHRPRN-----RSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHLSNSQRQAGGSVDRCDSASETSFRRDASK

Query:  KRSETRALVTRSRSVDSGGFYLKMFFPIPMIASKKKPNLRTDS--------------GSKLPPV--------DRDWW-------RKRSAVAAGENEGS--
         +S+      R+  V++ GF LK+ FP+  + +KKK N + D+               S  P +        D+DWW       R+  +V +  N GS  
Subjt:  KRSETRALVTRSRSVDSGGFYLKMFFPIPMIASKKKPNLRTDS--------------GSKLPPV--------DRDWW-------RKRSAVAAGENEGS--

Query:  VSGGSMTSSGSSNSTSSLRSNSR
         SGGS + S S  S +SLR  +R
Subjt:  VSGGSMTSSGSSNSTSSLRSNSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACATCAAATCGAAATCCCCAAGAAAAGCTACACACAGCCAAAACAAACCTCGAGAACAGCAGGACCCTATACCCGAAGATTGCATCTTTTTCAGAGGCTGGGA
CAGCGCCGCCGTCGGCGATGACTCCCAATCAGAATCCGGCGTCTGTTCCCCCACCCTCTGGGGTGGTGGTTCGGACCACCGGCCGCGCAATCGCAGCCTCTCGCCGACGT
CCCGGACTCGAGCCATCGCCCGAGGACAACAGGAGCTCATGGAGATGGTCAGGGCCATGCCCGAATCCTCCTACGAGCTCTCCCTCAAGGATCTCGTCGAACACCACCTG
AGCAATTCTCAGCGCCAGGCCGGCGGCTCCGTCGACAGATGCGATTCCGCCTCAGAAACTTCCTTCCGACGAGACGCGAGCAAGAAGAGGAGTGAAACCAGAGCACTCGT
TACCAGAAGTAGAAGCGTTGATAGCGGTGGATTTTACCTCAAAATGTTCTTCCCCATACCGATGATTGCGTCCAAAAAGAAGCCCAATCTTAGAACCGATTCGGGCTCCA
AGCTGCCGCCAGTGGATAGAGACTGGTGGAGGAAGAGATCGGCGGTGGCCGCCGGCGAGAATGAAGGGAGTGTCTCCGGTGGAAGTATGACCAGCAGCGGGAGTAGTAAT
AGTACTAGCAGCCTAAGAAGCAATAGCAGGAACTCTGAATCACAAGGGAGTTGCTGGTTTTGCATTAGTCCAATGAGAAGTAAAAAAACAGAGTAA
mRNA sequenceShow/hide mRNA sequence
CATGGGAGACATCAAATCGAAATCCCCAAGAAAAGCTACACACAGCCAAAACAAACCTCGAGAACAGCAGGACCCTATACCCGAAGATTGCATCTTTTTCAGAGGCTGGG
ACAGCGCCGCCGTCGGCGATGACTCCCAATCAGAATCCGGCGTCTGTTCCCCCACCCTCTGGGGTGGTGGTTCGGACCACCGGCCGCGCAATCGCAGCCTCTCGCCGACG
TCCCGGACTCGAGCCATCGCCCGAGGACAACAGGAGCTCATGGAGATGGTCAGGGCCATGCCCGAATCCTCCTACGAGCTCTCCCTCAAGGATCTCGTCGAACACCACCT
GAGCAATTCTCAGCGCCAGGCCGGCGGCTCCGTCGACAGATGCGATTCCGCCTCAGAAACTTCCTTCCGACGAGACGCGAGCAAGAAGAGGAGTGAAACCAGAGCACTCG
TTACCAGAAGTAGAAGCGTTGATAGCGGTGGATTTTACCTCAAAATGTTCTTCCCCATACCGATGATTGCGTCCAAAAAGAAGCCCAATCTTAGAACCGATTCGGGCTCC
AAGCTGCCGCCAGTGGATAGAGACTGGTGGAGGAAGAGATCGGCGGTGGCCGCCGGCGAGAATGAAGGGAGTGTCTCCGGTGGAAGTATGACCAGCAGCGGGAGTAGTAA
TAGTACTAGCAGCCTAAGAAGCAATAGCAGGAACTCTGAATCACAAGGGAGTTGCTGGTTTTGCATTAGTCCAATGAGAAGTAAAAAAACAGAGTAAAACAGCAATCAAT
TATGTAATTTCCTTAAGAAGATGGTGTTTTGGTATATTTCCGCAAAATCACGGCTGAATTTGGTAGCTAAGCTTGTCTATATCCATTTGTTTTTTCTTTTCCTACTTTTC
TGAAAAAAAAATATTCCTTGTCTTTATGCCCTTTTTCATTTCTCTTATCATTTCCACCGTACTATCTATGTGTATATATTTATTTTTGAGTTAAACAAGTATTCCAAGGG
G
Protein sequenceShow/hide protein sequence
MGDIKSKSPRKATHSQNKPREQQDPIPEDCIFFRGWDSAAVGDDSQSESGVCSPTLWGGGSDHRPRNRSLSPTSRTRAIARGQQELMEMVRAMPESSYELSLKDLVEHHL
SNSQRQAGGSVDRCDSASETSFRRDASKKRSETRALVTRSRSVDSGGFYLKMFFPIPMIASKKKPNLRTDSGSKLPPVDRDWWRKRSAVAAGENEGSVSGGSMTSSGSSN
STSSLRSNSRNSESQGSCWFCISPMRSKKTE