; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0008193 (gene) of Chayote v1 genome

Gene IDSed0008193
OrganismSechium edule (Chayote v1)
DescriptionPyriculol/pyriculariol biosynthesis cluster transcription factor 1
Genome locationLG04:3060420..3063415
RNA-Seq ExpressionSed0008193
SyntenySed0008193
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7024021.1 hypothetical protein SDJN02_15050, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-4141.42Show/hide
Query:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL
        M++   + + K+K ++ ++     E    S L  + L IG+W+  A N+G+++ K +FAK+ LVWE++ NG  EKVEIEWSNIIG++A M EN+ GILE+
Subjt:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL

Query:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----
        EL QPPKFY++ +   PR+H+QW DGLDFT+GQ   N RRH ++FPP +L++H+ +L   D RL ELS++P+PT N+PYF S+ + +  + +        
Subjt:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----

Query:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN
         + P  N N+THQ     L    N + +  ++A  +Y N
Subjt:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN

XP_008450546.1 PREDICTED: uncharacterized protein LOC103492114 isoform X1 [Cucumis melo]2.5e-4143.4Show/hide
Query:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF
        K+K+++   +   H  E    S L  + L IG+W+  A N+G+++ K +FAK+QLVWE++ NG  +K+EIEWSNIIG++A + E++PGILE+EL+ PPKF
Subjt:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF

Query:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ
        Y++ +   PR+H+QW DG DFTEGQ   N RRH ++FPP +L++H+ +L   D  L ELSQ+P+PT +  YFSS+   S + +            +PSF 
Subjt:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ

Query:  PNY--NQTHQLS
        P+   N+ H L+
Subjt:  PNY--NQTHQLS

XP_022961543.1 uncharacterized protein LOC111462091 [Cucurbita moschata]1.1e-4141.42Show/hide
Query:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL
        M++   + + K+K ++ ++     E    S L  + L IG+W+  A N+G+++ K +FAK+ LVWE++ NG  EKVEIEWSNIIG++A M EN+ GILE+
Subjt:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL

Query:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----
        EL QPPKFY++ +   PR+H+QW DGLDFT+GQ   N RRH ++FPP +L++H+ +L   D RL ELS++P+PT N+PYF S+ + +  + +        
Subjt:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----

Query:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN
         + P  N N+THQ     L    N + +  ++A  +Y N
Subjt:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN

XP_022968749.1 uncharacterized protein LOC111467893 [Cucurbita maxima]1.7e-4242.26Show/hide
Query:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL
        M++   + + K+K +  ++     E    S L  + L IG+W+  A N+G+++ K +FAK+ LVWE++ NG  EKVEIEWSNIIG++A M EN+ GILE+
Subjt:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL

Query:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSF---
        EL QPPKFY++ +   PR+H+QW DGLDFTEGQ   N RRH ++FPP +L++H+ +L   D RL ELS++P+PT N+PYF S+ + +  + +        
Subjt:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSF---

Query:  ----QPNYNQTHQLSHAAL----NLILQESVDALPSYSN
              N N+THQ     L    N + +  V+A  SY N
Subjt:  ----QPNYNQTHQLSHAAL----NLILQESVDALPSYSN

XP_022968797.1 uncharacterized protein LOC111467928 isoform X1 [Cucurbita maxima]1.0e-4249.73Show/hide
Query:  IKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQ
        +KL IG W+    N  +++VK  F KKQL +E+  NGC  K+EI+WSNIIG++A+M +NEPG+LE+ELS+PPKFY++    +   HSQW DG DFT G Q
Subjt:  IKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQ

Query:  ASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVD
        AS CRRH +MFPP +L++HF RLI+ D RL  LSQ+PYPT N PYF SQ  L  + A+  P F+ +   T     ++  L   +S D
Subjt:  ASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVD

TrEMBL top hitse value%identityAlignment
A0A1S3BPE4 uncharacterized protein LOC103492114 isoform X11.2e-4143.4Show/hide
Query:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF
        K+K+++   +   H  E    S L  + L IG+W+  A N+G+++ K +FAK+QLVWE++ NG  +K+EIEWSNIIG++A + E++PGILE+EL+ PPKF
Subjt:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF

Query:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ
        Y++ +   PR+H+QW DG DFTEGQ   N RRH ++FPP +L++H+ +L   D  L ELSQ+P+PT +  YFSS+   S + +            +PSF 
Subjt:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ

Query:  PNY--NQTHQLS
        P+   N+ H L+
Subjt:  PNY--NQTHQLS

A0A5A7UBN7 Uncharacterized protein1.2e-4143.4Show/hide
Query:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF
        K+K+++   +   H  E    S L  + L IG+W+  A N+G+++ K +FAK+QLVWE++ NG  +K+EIEWSNIIG++A + E++PGILE+EL+ PPKF
Subjt:  KKKKLVTTKKAKPHK-ELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKF

Query:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ
        Y++ +   PR+H+QW DG DFTEGQ   N RRH ++FPP +L++H+ +L   D  L ELSQ+P+PT +  YFSS+   S + +            +PSF 
Subjt:  YRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAV----------SLPSFQ

Query:  PNY--NQTHQLS
        P+   N+ H L+
Subjt:  PNY--NQTHQLS

A0A6J1HC39 uncharacterized protein LOC1114620915.3e-4241.42Show/hide
Query:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL
        M++   + + K+K ++ ++     E    S L  + L IG+W+  A N+G+++ K +FAK+ LVWE++ NG  EKVEIEWSNIIG++A M EN+ GILE+
Subjt:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL

Query:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----
        EL QPPKFY++ +   PR+H+QW DGLDFT+GQ   N RRH ++FPP +L++H+ +L   D RL ELS++P+PT N+PYF S+ + +  + +        
Subjt:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLP-----

Query:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN
         + P  N N+THQ     L    N + +  ++A  +Y N
Subjt:  SFQP--NYNQTHQLSHAAL----NLILQESVDALPSYSN

A0A6J1HUI1 uncharacterized protein LOC111467928 isoform X14.8e-4349.73Show/hide
Query:  IKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQ
        +KL IG W+    N  +++VK  F KKQL +E+  NGC  K+EI+WSNIIG++A+M +NEPG+LE+ELS+PPKFY++    +   HSQW DG DFT G Q
Subjt:  IKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQ

Query:  ASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVD
        AS CRRH +MFPP +L++HF RLI+ D RL  LSQ+PYPT N PYF SQ  L  + A+  P F+ +   T     ++  L   +S D
Subjt:  ASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVD

A0A6J1I0K3 uncharacterized protein LOC1114678938.2e-4342.26Show/hide
Query:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL
        M++   + + K+K +  ++     E    S L  + L IG+W+  A N+G+++ K +FAK+ LVWE++ NG  EKVEIEWSNIIG++A M EN+ GILE+
Subjt:  MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILEL

Query:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSF---
        EL QPPKFY++ +   PR+H+QW DGLDFTEGQ   N RRH ++FPP +L++H+ +L   D RL ELS++P+PT N+PYF S+ + +  + +        
Subjt:  ELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSF---

Query:  ----QPNYNQTHQLSHAAL----NLILQESVDALPSYSN
              N N+THQ     L    N + +  V+A  SY N
Subjt:  ----QPNYNQTHQLSHAAL----NLILQESVDALPSYSN

SwissProt top hitse value%identityAlignment
G4N2B2 Pyriculol/pyriculariol biosynthesis cluster transcription factor 11.5e-0426.7Show/hide
Query:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVING--CIEKVEIEWSNI--IGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEG
        L IG W     N  ++IV  F++  +      IN      K+E  +S I  I ++ + +  + G + +EL++PP F+  +   +P  +  +  G DFTE 
Subjt:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVING--CIEKVEIEWSNI--IGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEG

Query:  QQASNCRRHLVMFPPGMLNQHFVRLIKSD---NRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTH
        QQAS C  H +   P +L+    +L+  +   NR   ++  P P  +        + +  +  + PS QPN+ Q H
Subjt:  QQASNCRRHLVMFPPGMLNQHFVRLIKSD---NRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTH

Arabidopsis top hitse value%identityAlignment
AT1G54300.1 unknown protein4.8e-1934.84Show/hide
Query:  IGNWKFHAMNDGEMIVKCFFAKKQLVWEVVIN-------GCIEKVEIEWSNIIGMKASM-NENEPGILELELSQPPKFYRQRDSVNPR--EHSQWDD-GL
        IG W   A N  +++ K +FAKK+L+WE +             K+EI+W+++   + S+ + +E GIL++EL + P F+ +    NP+  +H+QW     
Subjt:  IGNWKFHAMNDGEMIVKCFFAKKQLVWEVVIN-------GCIEKVEIEWSNIIGMKASM-NENEPGILELELSQPPKFYRQRDSVNPR--EHSQWDD-GL

Query:  DFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSS
        DFT G  ASN RRH + FPPG+L ++  +L+ +D+   +L + P+P + + YF S
Subjt:  DFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSS

AT2G24100.1 unknown protein2.1e-3031.5Show/hide
Query:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQAS
        L IG W++ +  +G+++ KC+FAK +LVWEV+  G   K+EI+WS+I+ +KA++ E+EPG L + L++ P F+R+ +   PR+H+ W    DFT+GQ + 
Subjt:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQAS

Query:  NCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVS------LPSFQPNYNQTHQLSHAALNLILQESVDALPSYSNML
        N R+H +  PPG++N+HF +L++ D+RL  LS++P      P+F S+  +  + +VS       P    + ++   LSH AL+        A+      +
Subjt:  NCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVS------LPSFQPNYNQTHQLSHAALNLILQESVDALPSYSNML

Query:  APTNVNLQSS-------SGVDNHDIECSIGNMSMMNNMEALQ----LTSNNTIPINGHMDFNNEVSSNNNFSN
           N N  S          +  +D    + + +  NN E  +    L S+NT       D  + +S  N+F N
Subjt:  APTNVNLQSS-------SGVDNHDIECSIGNMSMMNNMEALQ----LTSNNTIPINGHMDFNNEVSSNNNFSN

AT3G05770.1 unknown protein1.8e-1833.9Show/hide
Query:  TTKKAKPHKELWMVSFLRPIKLL-IGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCI-------EKVEIEWSNIIGMKASMN-ENEPGILELELSQPP
        T+   K  ++L  ++F  PI  + IG+  F A N  +++ K +FAKK+L+WE +    +        K+EI+W+++   + S+N  +E GIL++EL + P
Subjt:  TTKKAKPHKELWMVSFLRPIKLL-IGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCI-------EKVEIEWSNIIGMKASMN-ENEPGILELELSQPP

Query:  KFYRQRDSVNPR--EHSQWDD-GLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYF
         F+ +    NP+  +H+QW     DFT G QAS  RRH + FPPG+L ++  +L+ +D+   +L + P+P + + YF
Subjt:  KFYRQRDSVNPR--EHSQWDD-GLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYF

AT4G30780.1 unknown protein3.6e-3035.12Show/hide
Query:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQAS
        L IG W++ +  +G+++ KC+FAK +LVWEV+  G   K+EI+WS+I+ +KA+  E+ PG L L L++ P F+R+ +   PR+H+ W    DFT+GQ + 
Subjt:  LLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYRQRDSVNPREHSQWDDGLDFTEGQQAS

Query:  NCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVDALPSYSNMLAPTNVN
        N R+H +    G++N+HF +L++ D+RL  LS++P    ++PYF ++  +  +         P+ ++ H   +  LNL    S+       N+ +P  V 
Subjt:  NCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESVDALPSYSNMLAPTNVN

Query:  LQSSS
         QSSS
Subjt:  LQSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAACAAATCTTGAAACCTGAAAAGAAAAAGAAATTAGTTACAACAAAGAAAGCCAAACCCCATAAAGAATTATGGATGGTCTCCTTTTTAAGACCTATCAAACT
TTTAATTGGCAACTGGAAGTTTCATGCTATGAACGATGGCGAGATGATTGTAAAGTGCTTTTTTGCAAAGAAGCAACTAGTTTGGGAAGTTGTGATCAATGGATGTATTG
AAAAGGTTGAAATTGAATGGTCAAATATCATAGGAATGAAGGCTTCAATGAACGAAAACGAACCCGGAATTCTCGAGCTCGAGCTTTCGCAGCCACCAAAATTCTATAGA
CAACGTGACAGCGTGAATCCGCGGGAACATTCCCAATGGGATGATGGATTGGATTTTACAGAAGGACAGCAAGCTTCTAATTGCAGGAGGCATTTGGTTATGTTTCCTCC
TGGGATGTTGAACCAACACTTTGTGAGATTAATAAAAAGTGATAATAGATTGCTTGAGTTGAGCCAAAAGCCATATCCAACCAATAATAATCCTTACTTTTCTTCACAAA
CTTTACTTTCTGCTGAGGCTGCTGTTTCTTTGCCTTCATTTCAACCTAATTATAACCAAACTCACCAACTTTCTCATGCTGCTCTCAACCTAATTCTACAAGAATCAGTT
GATGCCTTGCCATCATATTCAAACATGTTGGCACCAACAAATGTCAATCTTCAATCAAGCTCAGGTGTTGATAATCATGACATTGAATGTTCAATTGGCAACATGTCCAT
GATGAACAACATGGAAGCTCTTCAATTAACTTCAAATAACACAATTCCAATTAATGGGCACATGGATTTCAATAATGAAGTTTCTTCAAATAACAACTTCTCAAATTGGG
CAAGTACTTATGGAATTGATTCAAGCATCGATTGTATTACAAAGTGCCGCTCGATGATGAATGTCGATGAACCAACATGTTTTAATGGAGTTTCAAGTGATTACTTGTTG
AATCATGTTAATCTCGAAAACAGAAACTCGACGTATTTTCCCAGTGTTTCGAGTAGTTTCTTCTCGAATTGGCCGACGGATCCAAATGGGGTTTGGAAGCCCTTCGGCCA
TGAACAAGGCAGAACACAAAGGGATCCAAAAATTGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAACAAATCTTGAAACCTGAAAAGAAAAAGAAATTAGTTACAACAAAGAAAGCCAAACCCCATAAAGAATTATGGATGGTCTCCTTTTTAAGACCTATCAAACT
TTTAATTGGCAACTGGAAGTTTCATGCTATGAACGATGGCGAGATGATTGTAAAGTGCTTTTTTGCAAAGAAGCAACTAGTTTGGGAAGTTGTGATCAATGGATGTATTG
AAAAGGTTGAAATTGAATGGTCAAATATCATAGGAATGAAGGCTTCAATGAACGAAAACGAACCCGGAATTCTCGAGCTCGAGCTTTCGCAGCCACCAAAATTCTATAGA
CAACGTGACAGCGTGAATCCGCGGGAACATTCCCAATGGGATGATGGATTGGATTTTACAGAAGGACAGCAAGCTTCTAATTGCAGGAGGCATTTGGTTATGTTTCCTCC
TGGGATGTTGAACCAACACTTTGTGAGATTAATAAAAAGTGATAATAGATTGCTTGAGTTGAGCCAAAAGCCATATCCAACCAATAATAATCCTTACTTTTCTTCACAAA
CTTTACTTTCTGCTGAGGCTGCTGTTTCTTTGCCTTCATTTCAACCTAATTATAACCAAACTCACCAACTTTCTCATGCTGCTCTCAACCTAATTCTACAAGAATCAGTT
GATGCCTTGCCATCATATTCAAACATGTTGGCACCAACAAATGTCAATCTTCAATCAAGCTCAGGTGTTGATAATCATGACATTGAATGTTCAATTGGCAACATGTCCAT
GATGAACAACATGGAAGCTCTTCAATTAACTTCAAATAACACAATTCCAATTAATGGGCACATGGATTTCAATAATGAAGTTTCTTCAAATAACAACTTCTCAAATTGGG
CAAGTACTTATGGAATTGATTCAAGCATCGATTGTATTACAAAGTGCCGCTCGATGATGAATGTCGATGAACCAACATGTTTTAATGGAGTTTCAAGTGATTACTTGTTG
AATCATGTTAATCTCGAAAACAGAAACTCGACGTATTTTCCCAGTGTTTCGAGTAGTTTCTTCTCGAATTGGCCGACGGATCCAAATGGGGTTTGGAAGCCCTTCGGCCA
TGAACAAGGCAGAACACAAAGGGATCCAAAAATTGTTTAG
Protein sequenceShow/hide protein sequence
MEEQILKPEKKKKLVTTKKAKPHKELWMVSFLRPIKLLIGNWKFHAMNDGEMIVKCFFAKKQLVWEVVINGCIEKVEIEWSNIIGMKASMNENEPGILELELSQPPKFYR
QRDSVNPREHSQWDDGLDFTEGQQASNCRRHLVMFPPGMLNQHFVRLIKSDNRLLELSQKPYPTNNNPYFSSQTLLSAEAAVSLPSFQPNYNQTHQLSHAALNLILQESV
DALPSYSNMLAPTNVNLQSSSGVDNHDIECSIGNMSMMNNMEALQLTSNNTIPINGHMDFNNEVSSNNNFSNWASTYGIDSSIDCITKCRSMMNVDEPTCFNGVSSDYLL
NHVNLENRNSTYFPSVSSSFFSNWPTDPNGVWKPFGHEQGRTQRDPKIV