; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0009653 (gene) of Chayote v1 genome

Gene IDSed0009653
OrganismSechium edule (Chayote v1)
DescriptionATP-dependent tryptophan/phenylalanine/tyrosine adenylase
Genome locationLG07:985374..986568
RNA-Seq ExpressionSed0009653
SyntenySed0009653
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576126.1 hypothetical protein SDJN03_26765, partial [Cucurbita argyrosperma subsp. sororia]5.9e-8969.88Show/hide
Query:  MKLKNKGKVHPSP------SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVH
        MK+KNKGKVHPSP      S SSSS SDGDFF+V  YLP A+LAL+S+L VDDREVLAFMMRRSME SS SS +  NKF KR+ KKS APRA+S  ACVH
Subjt:  MKLKNKGKVHPSP------SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVH

Query:  APPSFSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------V
        +PPSF+CFDCYMS+W+RWNSSPNGELIHQAIEAFEEQLA+GEKSSKNVKGK++++IGR+SS+KP  + APP PP PEV PLP +DEGS           V
Subjt:  APPSFSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------V

Query:  DAAEGSGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        +AAEG GE  R+EDE A     ++VP+PP SN KGLARKVWPDVLGLFNSRLWSLWGP+
Subjt:  DAAEGSGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

KAG6592803.1 hypothetical protein SDJN03_12279, partial [Cucurbita argyrosperma subsp. sororia]8.5e-9676.19Show/hide
Query:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS
        MK+KNKGKVHPSPS SSSS SDGDFFDV  YLPAAILAL++VL +DDREVLAFMMRRSME S+PSSSL ENKF KR SKKS APRA S   CVHAPPS S
Subjt:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS

Query:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG
        CFDCYMS+WNRWNSSPNGELIHQAIEAFEEQLA+GEKS KN+KGKRKD+IGRRSSEKP +++   PPP PEV PL  +D+GS          V+AAEGSG
Subjt:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG

Query:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        ELPR+E  AA     VVVP+ P SNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

KAG7020289.1 hypothetical protein SDJN02_16972, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-9475.79Show/hide
Query:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS
        MK+KNKGKVHPSPS SSSS SDGDFFDV  YLPAAILAL++VL +DDREVLAFMMRRSME S+PSSSL ENKF KR SKKS APRA S   CVHAPPS S
Subjt:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS

Query:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG
        CFDCYMS+WNRWNSSPNGELIHQAIEAFEEQLA+GEKS KN+KGKRKD+IGRRSSEKP +++   P P PEV PL  +D+GS          V+AAEGSG
Subjt:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG

Query:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        ELPR+E  AA     VVVP+ P SNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

XP_022960393.1 uncharacterized protein LOC111461129 [Cucurbita moschata]1.0e-9375.4Show/hide
Query:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS
        MK+KNKGKVHPSPS S SS SDGDFFDV  YLPAAILAL++VL +DDREVLAFMMRRSME S+PSSS  ENKF KR SKKS APRA S   CVHAPPS S
Subjt:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS

Query:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG
        CFDCYMS+WNRWNSSPNGELIHQAIEAFEEQLA+GEKS KN+K KRKD+IGRRSSEKP ++A   PPP PEV PL  +D+GS          V+AAEGSG
Subjt:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG

Query:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        ELPR+E  AA     VVVP+ P SNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

XP_023547231.1 uncharacterized protein LOC111806103 [Cucurbita pepo subsp. pepo]3.5e-8971.37Show/hide
Query:  MKLKNKGKVHPSP--SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPS
        MK+KNKGKVHPSP  S SSSS SDGDFF+V  YLP AILAL+SVL +DDREVLAFMMRRSME SS SS +  NKF KR+ KKS  PRA+S  ACVH+PPS
Subjt:  MKLKNKGKVHPSP--SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPS

Query:  FSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAE
        F+CFDCYMSFW+RWNSSPNGELIHQAIEAFEEQLA+GEKSSKNVKGK++++IGR+SS+KP  + APP PP PEV PLP  DEGS           V+AAE
Subjt:  FSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAE

Query:  GSGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        G GE  R+EDE A     ++VP+PP SN KGLARKVWPDVLGLFNSRLWSLWGP+
Subjt:  GSGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

TrEMBL top hitse value%identityAlignment
A0A1S3CC21 uncharacterized protein LOC1034990741.0e-8672.16Show/hide
Query:  MKLKNKGKVHPSP-SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF
        MK+KNKGKVHPSP S SSSS SDG+FFDVL YLP AI ALVSVL VDDREVLAFMMRRSME SSPSSS+   KF KR SKKS  PRA S  ACVHAPPS 
Subjt:  MKLKNKGKVHPSP-SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF

Query:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEG
        +CFDCYMS+W+RWNSSPNGELIHQAIEAFEEQLA GEKSSKNVKGKRKD+IGRRS +K   I +PP  P PE  PL  +DEGS           V+  E 
Subjt:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEG

Query:  SGELPRSEDEAAAPPPA-VVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        SGE PRSE+EA     A VV+P+PP + HKGLARKVWPDVLGLFNSRLWSLW PN
Subjt:  SGELPRSEDEAAAPPPA-VVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A5D3DM01 Uncharacterized protein5.0e-8671.21Show/hide
Query:  MKLKNKGKVHPSP---SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPP
        MK+KNKGKVHPSP   S SSSS SDG+FFDVL YLP AI ALVSVL VDDREVLAFMMRRSME SSPSSS+   +F KR SKKS  PRA S  ACVHAPP
Subjt:  MKLKNKGKVHPSP---SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPP

Query:  SFSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAA
        S +CFDCYMS+W+RWNSSPNGELIHQAIEAFEEQLA GEKSSKNVKGKRKD+IGRRS +K   I +PP  P PE  PL  +DEGS           V+  
Subjt:  SFSCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAA

Query:  EGSGELPRSEDEAAAPPPA-VVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        E SGE PRSE+EA     A VV+P+PP + HKGLARKVWPDVLGLFNSRLWSLW PN
Subjt:  EGSGELPRSEDEAAAPPPA-VVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1GQW4 uncharacterized protein LOC1114562854.9e-8970.87Show/hide
Query:  MKLKNKGKVHPSP-SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF
        MK+KNKGKVHPSP S SSSS SDGDFF+V  YLP A+LAL+S+L VDDREVLAFMMRRSME SS SS +  NKF KR+ KKS APRA+S  ACVH+PPSF
Subjt:  MKLKNKGKVHPSP-SPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF

Query:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEG
        +CFDCYMS+W+RWNSSPNGELIHQAIEAFEEQLA+GEKSSKNVKGK++++IGR+SS+KP  + APP PP P V PLP +DEGS           V+AAEG
Subjt:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEG

Query:  SGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
         GE  R+EDE A     ++VP+PP SN KGLARKVWPDVLGLFNSRLWSLWGP+
Subjt:  SGELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1H8S8 uncharacterized protein LOC1114611295.0e-9475.4Show/hide
Query:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS
        MK+KNKGKVHPSPS S SS SDGDFFDV  YLPAAILAL++VL +DDREVLAFMMRRSME S+PSSS  ENKF KR SKKS APRA S   CVHAPPS S
Subjt:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS

Query:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG
        CFDCYMS+WNRWNSSPNGELIHQAIEAFEEQLA+GEKS KN+K KRKD+IGRRSSEKP ++A   PPP PEV PL  +D+GS          V+AAEGSG
Subjt:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS----------VDAAEGSG

Query:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        ELPR+E  AA     VVVP+ P SNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  ELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1JUQ5 uncharacterized protein LOC1114880302.3e-8670.63Show/hide
Query:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS
        MK+KNKGKVHPSPS SS   SDGDFF+V  YLP AILAL+SVL  DDREVLAFMMRRSME SS SS +  NKF KR+ KKS APRA+S  ACVH+PPSF+
Subjt:  MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFS

Query:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEGS
        CFDCYMS+W+RWNSSPNGELIHQAIEAFEEQLA+GEKS KNVKGK++++IGR+SS+KP  + A   PP PEV PLP +DEGS           V+AAEG 
Subjt:  CFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGS-----------VDAAEGS

Query:  GELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGP
        GE  R+EDE A     V+VP+PP SN KGLARKVWPDVLGLFNSRLWSLWGP
Subjt:  GELPRSEDEAAAPPPAVVVPAPPLSNHKGLARKVWPDVLGLFNSRLWSLWGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein4.9e-1737.97Show/hide
Query:  KLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEAS--SPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF
        KL  KG VHPSP    S+        +L  LP AI +L +VL+ +DREVLA+++  +  +   +P+S L + K  K++   + +P         H     
Subjt:  KLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEAS--SPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF

Query:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGK--RKDRIGRRSS
         CF CY S+W RW+SSP+ +LIH+ I+AFE+ L   +   KNV GK  R+ R G+ SS
Subjt:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGK--RKDRIGRRSS

AT1G24270.1 unknown protein1.1e-2144Show/hide
Query:  MKLKNKGKVHPSPS-PSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF
        MK+  KGKVHPSP  PSSSS +  D   V K L +AIL LVSVL+ +D EVLA+++ RS+  ++  S        K+ S K+P       + C       
Subjt:  MKLKNKGKVHPSPS-PSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF

Query:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDR
         CFDCY S+W++W+SS N ELI+Q IEAFE+ L   E S+ +   K K R
Subjt:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDR

AT1G62422.1 unknown protein1.6e-1536.14Show/hide
Query:  KLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF--
        KL  KG VHPSP P+    +D  F   L  LP AIL+LV+ L+V+DREVLA+++  S + S+  S L +NK                     H  P F  
Subjt:  KLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSF--

Query:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVK--GKRKDRIGRRSSEKPAEIAA
         CF CY S+W RW++SP  +LIH+ I+A+E+ L   +K     K  GK   R+    + + +E+ +
Subjt:  SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKSSKNVK--GKRKDRIGRRSSEKPAEIAA

AT5G13090.1 unknown protein9.5e-3742.5Show/hide
Query:  MKLKNKGKVHPSPSP-----SSSSFS------DGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASA
        MK+K KGKV+PSP P     SSSS S      D D   VLK LPA IL LVSVL+ ++REVLA+++ R    S   +S  +NK  K+S+K S        
Subjt:  MKLKNKGKVHPSPSP-----SSSSFS------DGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASA

Query:  IACVHAPPSF--SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKS----SKNVKGKRKDRIGRR---SSEKPA--------EIAAPPPPP----
            H PP F   CFDCY ++W RW+SSPN ELIH+ IEAFE     GE++    SK+ +GK+K++ GRR   S  KPA        + + P   P    
Subjt:  IACVHAPPSF--SCFDCYMSFWNRWNSSPNGELIHQAIEAFEEQLASGEKS----SKNVKGKRKDRIGRR---SSEKPA--------EIAAPPPPP----

Query:  ----SPEVPPLPGVDEGSVDAAEGSGELPRSEDEAAAPPPAVVVPAPP--LSNHKGLARKVWPDVLGLFNSRLWSLWGPN
            S  V     + E  V   E   E+   ED        VV PA    ++ HKGLARKV PDVLGLF+S  W LW PN
Subjt:  ----SPEVPPLPGVDEGSVDAAEGSGELPRSEDEAAAPPPAVVVPAPP--LSNHKGLARKVWPDVLGLFNSRLWSLWGPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTAAAGAACAAAGGTAAAGTTCACCCATCTCCATCTCCTTCTTCTTCTTCTTTTTCCGATGGGGATTTCTTTGATGTTCTGAAATATCTGCCGGCGGCGATTTT
GGCTCTGGTCTCCGTTTTGAATGTCGACGATCGAGAGGTTTTGGCCTTCATGATGCGAAGATCCATGGAAGCCTCCTCGCCGTCGTCTTCTCTACCGGAAAACAAGTTTC
CCAAGAGATCTTCCAAGAAATCCCCTGCTCCACGCGCCGCTTCTGCCATCGCGTGTGTTCACGCGCCGCCTTCGTTTAGCTGCTTTGACTGTTACATGAGTTTCTGGAAC
CGGTGGAACTCGTCGCCGAACGGCGAGCTGATTCATCAGGCCATTGAGGCTTTTGAAGAGCAATTGGCCAGCGGGGAAAAATCAAGCAAGAACGTCAAAGGGAAGAGGAA
GGACAGAATCGGCCGGCGGTCGTCGGAGAAGCCTGCCGAAATTGCTGCTCCTCCGCCGCCGCCGTCGCCGGAAGTTCCTCCGCTGCCGGGAGTTGATGAGGGTTCTGTGG
ATGCGGCGGAGGGGAGCGGAGAGCTGCCGCGGAGTGAGGATGAGGCAGCGGCGCCGCCGCCGGCGGTGGTTGTTCCGGCGCCGCCCCTGAGCAATCACAAGGGTTTGGCC
CGGAAGGTATGGCCGGATGTGTTAGGGTTATTCAATTCTCGTTTATGGAGTCTTTGGGGTCCAAATTAG
mRNA sequenceShow/hide mRNA sequence
GCCTTAATTTCCAAGTAAATTAGAATCCGAATAATAAATAAATAAATAAATAAATAAATAAACCCTTTGGTTTTTTTTTTTGCTCTCTCTCCAAAACCCACCATTTTGAA
GAAGCTTAGAAGGTAAAAAGCTCAGATACAGAAAACCCATTTCGTTTTTGTCCAAAAACCCTCCAAATTAAGAAAAAAAGTGGCCGCCATGAAATTAAAGAACAAAGGTA
AAGTTCACCCATCTCCATCTCCTTCTTCTTCTTCTTTTTCCGATGGGGATTTCTTTGATGTTCTGAAATATCTGCCGGCGGCGATTTTGGCTCTGGTCTCCGTTTTGAAT
GTCGACGATCGAGAGGTTTTGGCCTTCATGATGCGAAGATCCATGGAAGCCTCCTCGCCGTCGTCTTCTCTACCGGAAAACAAGTTTCCCAAGAGATCTTCCAAGAAATC
CCCTGCTCCACGCGCCGCTTCTGCCATCGCGTGTGTTCACGCGCCGCCTTCGTTTAGCTGCTTTGACTGTTACATGAGTTTCTGGAACCGGTGGAACTCGTCGCCGAACG
GCGAGCTGATTCATCAGGCCATTGAGGCTTTTGAAGAGCAATTGGCCAGCGGGGAAAAATCAAGCAAGAACGTCAAAGGGAAGAGGAAGGACAGAATCGGCCGGCGGTCG
TCGGAGAAGCCTGCCGAAATTGCTGCTCCTCCGCCGCCGCCGTCGCCGGAAGTTCCTCCGCTGCCGGGAGTTGATGAGGGTTCTGTGGATGCGGCGGAGGGGAGCGGAGA
GCTGCCGCGGAGTGAGGATGAGGCAGCGGCGCCGCCGCCGGCGGTGGTTGTTCCGGCGCCGCCCCTGAGCAATCACAAGGGTTTGGCCCGGAAGGTATGGCCGGATGTGT
TAGGGTTATTCAATTCTCGTTTATGGAGTCTTTGGGGTCCAAATTAGAAATTCAGGAAAAAAGAAAAATGAGCATATTTTGATAATTTTCCACTTTTCTTTATTATTTTT
TTTTGAGACCCAATATGAGAGAGAGTAAATTCCTAAAAAAAAAAAAAATTAAAGTAGGGTTATGTGGCAATGGAATATATATGTATGGTCCCATCTTTCATCAACTTTTG
AGAATTATGCTCTCTTTTTTTTCAATTATTTTTTTTGTTGTTTGATATTGAATATATTGTTTATTTTTGGGTTGTGTTTGGCTGGTAAGAAAATG
Protein sequenceShow/hide protein sequence
MKLKNKGKVHPSPSPSSSSFSDGDFFDVLKYLPAAILALVSVLNVDDREVLAFMMRRSMEASSPSSSLPENKFPKRSSKKSPAPRAASAIACVHAPPSFSCFDCYMSFWN
RWNSSPNGELIHQAIEAFEEQLASGEKSSKNVKGKRKDRIGRRSSEKPAEIAAPPPPPSPEVPPLPGVDEGSVDAAEGSGELPRSEDEAAAPPPAVVVPAPPLSNHKGLA
RKVWPDVLGLFNSRLWSLWGPN