CuGenDBv2

Sed0013911 (gene) of Chayote v1 genome

Gene ID: Sed0013911
Organism: Sechium edule (Chayote v1)
Description: trihelix transcription factor ASR3
Genome location: LG13:25800024..25802347
RNA-Seq Expression: Sed0013911
Synteny: Sed0013911
Gene Ontology terms: NA
InterPro domains: IPR001005 - SANT/Myb domain
                  IPR044822 - Myb/SANT-like DNA-binding domain 4
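The genome location field above has the form `scaffold:start..end`. A minimal sketch for splitting it into its parts follows. It assumes the coordinates are 1-based and inclusive, which this page does not state; `GenomeLocation` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    scaffold: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'scaffold:start..end' string into its components."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.groups()
    return GenomeLocation(scaffold, int(start), int(end))


loc = parse_location("LG13:25800024..25802347")
print(loc.scaffold, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the locus spans 2324 positions on LG13.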


Homology
GenBank top hits | e value | % identity
KAG6608224.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. sororia] | 1.1e-110 | 74.56
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MKE+DG  G   SGSRRTRS+IA +WTAADC+VLVNVIAAVEADC KALSS+QKWKIIAENCTSLDV RNS+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC
        YWC+ESGRRKELGLPDNFDEE+FKAIDNV  MRANQSDTEPDSD EAA E VDE  EPG KRQRR S+  RNQ+LEK+V CEE  E+PP++S EV+   C
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC

Query:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        YIKSHGEK  D+TE EEQ M KKLLE AEKVQAIVSENAEYATSDEKN N+  R + +RRQGSKLI+CL DFL+T++DL  LLED E
Subjt:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

KAG7037576.1 Trihelix transcription factor ASR3, partial [Cucurbita argyrosperma subsp. argyrosperma] | 6.3e-111 | 74.56
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MKE+DG  G   SGSRRTRS+IA +WTAADC+VLVNVIAAVEADC KALSS+QKWKI+AENCTSLDV RNS+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC
        YWC+ESGRRKELGLPDNFDEE+FKAIDNV  MRANQSDTEPDSD EAA E VDE  EPG KRQRR S+  RN++LEK+VKCEE  E+PP++S EV+   C
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC

Query:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        YIKSHGEK  D+TE EEQ MAKKLLE AEKVQAIVSENAEYATSDEKN N+  R + +RRQGSKLI+CL DFL+T++DL  LLED E
Subjt:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

XP_022139752.1 trihelix transcription factor ASR3 [Momordica charantia] | 1.5e-112 | 74.22
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MK++DG+ G   SGSRRTRSQIA +WTAA+C+VLVNVIAAVEADCLKALSSYQKWKI+AENCTSLDV R S+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE---EEKPPLTSLEVKPHE
        YW +ESGRRKELGLP+NFD+ELFKAIDNVA MRANQSDTEPDSD EA  E++DE+ EPG KRQRRRSI KR+QALEK+++CE   EEKPPL S E +P E
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE---EEKPPLTSLEVKPHE

Query:  CYIKSHGEKVADNTEL-EEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        C+IKS+GEK  D+ EL EEQMM KKLLE  E++QAIVSENAEYATSDEKN NHRID VRRQG+ LIRCLGD L+ ++DL GL ED E
Subjt:  CYIKSHGEKVADNTEL-EEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

XP_023524323.1 trihelix transcription factor ASR3-like [Cucurbita pepo subsp. pepo] | 2.4e-110 | 74.14
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MKE+DG  G   SGSRRTRS+IA +WTAADC+VLVNVIAAVEADC KALSS+QKWKI+AENCTSLDV RNS+QCRRKWD +L++HD I+QWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC---EEEKPPLTSLEVKPHE
        YWC+ES RRKELGLPDNFDEELFKAIDNV  MRANQSDTEPDSD EAA E VDE  EPG KRQRR S+  RNQALEK+VKC   EEE+PP++S EV+   
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC---EEEKPPLTSLEVKPHE

Query:  CYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEK----NVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        CYIKSHGEK  D+TE EEQ MAKKLLE AEKVQAIVSENAEYATSDEK    N N R + +RRQGSKLI+CL DFL+T++DL  LLED E
Subjt:  CYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEK----NVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

XP_038897371.1 trihelix transcription factor ASR3 [Benincasa hispida] | 1.8e-110 | 70.68
Query:  KEKDGHSG---SGSRRTRSQIAA--EWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGE
        KE  G+ G   SGSRRTRSQIA   +WTAADC+VLVNVIAAVEADCLKALSSYQKWKI+AENCTSLDVVR S+QCRRKWD +L++HD IKQWEL M + +
Subjt:  KEKDGHSG---SGSRRTRSQIAA--EWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGE

Query:  SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC------------------
        SYWC+ESGRRKELGLPDNFDEELFKAIDNVA MRANQSDTEPDSD EAA E +DE+ EPG KRQRRRS+ K NQ LEK+++C                  
Subjt:  SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC------------------

Query:  -----EEEKPPLTSLEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLR
             EEEKP L+  EV+P ECYIK++G KV DN E +EQMMAK LLE AEKVQAIVSENAEYATSDEKN   + +LVR QGSKLIRCLGD L+T++DLR
Subjt:  -----EEEKPPLTSLEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLR

Query:  GLLEDCE
        GLLEDCE
Subjt:  GLLEDCE

TrEMBL top hits | e value | % identity
A0A0A0LDW0 Myb-like domain-containing protein | 1.0e-106 | 68.93
Query:  KEKDGHSG---SGSRRTRSQIAAE--WTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGE
        KE  G+ G   SGSRRTRSQIA    WTAADC+VLVNVIAAVEADCLKALSSYQKWKI+AENCTSLDVVR S+QCRRKWD +L++HD IKQWEL M D +
Subjt:  KEKDGHSG---SGSRRTRSQIAAE--WTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGE

Query:  SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE-----------------
        SYWC+ SGRRKELGLP+NFDEELFKAIDNVA MRANQSDTEPDSD EAA    DE+ EPG KRQRRRS+ K NQ LEK+++CE                 
Subjt:  SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE-----------------

Query:  --------EEKPPLTSLEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHD
                EEKP L+S E++P ECYIKS+  KV DN E +EQMMAK LLE AEKVQAIVSENAEY TSDEK    + +LVR QGSKLIRCLGD L+T++D
Subjt:  --------EEKPPLTSLEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHD

Query:  LRGLLEDCE
        LRGLLEDCE
Subjt:  LRGLLEDCE

A0A6J1CEU7 trihelix transcription factor ASR3 | 7.3e-113 | 74.22
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MK++DG+ G   SGSRRTRSQIA +WTAA+C+VLVNVIAAVEADCLKALSSYQKWKI+AENCTSLDV R S+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE---EEKPPLTSLEVKPHE
        YW +ESGRRKELGLP+NFD+ELFKAIDNVA MRANQSDTEPDSD EA  E++DE+ EPG KRQRRRSI KR+QALEK+++CE   EEKPPL S E +P E
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE---EEKPPLTSLEVKPHE

Query:  CYIKSHGEKVADNTEL-EEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        C+IKS+GEK  D+ EL EEQMM KKLLE  E++QAIVSENAEYATSDEKN NHRID VRRQG+ LIRCLGD L+ ++DL GL ED E
Subjt:  CYIKSHGEKVADNTEL-EEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

A0A6J1FEH7 trihelix transcription factor ASR3-like | 5.2e-103 | 68.28
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MK+++G+ G   SGSRRTRSQIA EWTAA+C+VLVNVI AVEADCLKALSSYQKWKI+AE+CT+L+V R S+QCR+KW+ +L++HD IKQWEL M + +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE-------EEKPPLTSLEV
        YWC+ESGRRKELGLPDNFDEELFKAIDNV+ MRANQSDTEPD+D EAA E  DE+ EPG KRQRR S+ KRNQ LEK+++ +       EE+P L+S E 
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCE-------EEKPPLTSLEV

Query:  KPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
           +CYIK++G    D+ E EEQMM KKLLE AE VQ IVSENAE ATSDEKN   + +L+RRQGSKLIRCLGDFL+T++DLR LLEDCE
Subjt:  KPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

A0A6J1FMB3 trihelix transcription factor ASR3-like | 2.0e-110 | 74.22
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MKE+DG  G   SGSRRTRS+IA +WTAADC+VLVNVIAAVEADC KALSS+QKWKI+AENCTSLDV RNS+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC
        YWC+ESGRRKELGLPDNFDEE+FKAIDNV  MRANQSDTEPDSD EAA E VDE  EPG KRQRR S+  RNQ+LEK+VKCEE  E+P ++S EV+   C
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEE--EKPPLTSLEVKPHEC

Query:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
        YIKSHGEK  D+TE EEQ MAKKLLE AEKVQAIVSENAEYATSDEKN N+  R + +R QGSKLI+CL DFL+T++DL  LLED E
Subjt:  YIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEKNVNH--RIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

A0A6J1J3M2 trihelix transcription factor ASR3 | 7.1e-108 | 71.38
Query:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES
        MKE+DG  G   SGSRRTRS+IA +WTAA+C+VLVNVIAAVEADC KALSS+QKWKI+AENCTSLDV RNS+QCRRKWD +L++HD IKQWEL M D +S
Subjt:  MKEKDGHSG---SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGES

Query:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC----------EEEKPPLTS
        YWC+ESGRRKELGLPDNFDEELFKAIDNV LMRANQSDTEPDSD EAA E VDE  EPG KRQRR S+  RNQALEK+VKC          EEE+P ++S
Subjt:  YWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKC----------EEEKPPLTS

Query:  LEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEK----NVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
         EV+   CYIKSHGEK  DNTE EEQ M KKLLE AEKVQAIVSENA+YA S EK    N N+R + +RRQGSKLI+CL DFL+T++DL  L ED E
Subjt:  LEVKPHECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSENAEYATSDEK----NVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G31310.1 hydroxyproline-rich glycoprotein family protein | 2.6e-06 | 27.18
Query:  KWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMAD---------------GE--SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQ
        +WK I + C     +R+ +QC  KWD+++  +  ++++E    +               GE  SYW ME   RKE  LP N   + ++A+  V   +   
Subjt:  KWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMAD---------------GE--SYWCMESGRRKELGLPDNFDEELFKAIDNVALMRANQ

Query:  SDT
        S T
Subjt:  SDT

AT4G31270.1 sequence-specific DNA binding transcription factors | 2.1e-48 | 40.69
Query:  SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELG-MADGESYWCMESGRRKE
        SGSRRTRSQ+A EW   DC+VLVN IAAVEADC  ALSS+QKW +I ENC +LDV RN +QCRRKWD ++  ++ IK+WE      G SYW + S +RK 
Subjt:  SGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELG-MADGESYWCMESGRRKE

Query:  LGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVD---EVVEPGSKRQRRRSI----PKRNQALEKTVKCEEEKPPLTSL---------EVKP
        L LP + D ELF+AI+ V +++  ++ TE DSD E A ++VD   E+   GSKR R+R++     K+ +     V+    + P+T+          E KP
Subjt:  LGLPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVD---EVVEPGSKRQRRRSI----PKRNQALEKTVKCEEEKPPLTSL---------EVKP

Query:  HECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSEN--AEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
         E       E    N E + ++M  KL    + + AIV  N   +  T D  +++ ++  VR+QG +LI CL + +ST++ L  + ++ E
Subjt:  HECYIKSHGEKVADNTELEEQMMAKKLLEAAEKVQAIVSEN--AEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE


Sequences
CDS sequence
ATGAAGGAGAAGGACGGCCACAGTGGATCGGGTTCTCGCCGGACGCGGTCGCAAATAGCGGCGGAGTGGACGGCGGCGGACTGCATTGTTCTTGTTAACGTGATTGCGGC
GGTGGAGGCCGATTGCTTGAAAGCTTTGTCCAGTTACCAGAAATGGAAGATTATTGCGGAGAACTGCACGTCCTTGGATGTCGTTCGGAACTCGGATCAGTGCCGGAGAA
AGTGGGACCATATGCTGGTTAAACATGATGCTATCAAGCAATGGGAGTTGGGGATGGCGGATGGTGAATCGTATTGGTGTATGGAGAGTGGAAGGAGAAAAGAATTGGGA
CTTCCTGATAACTTTGACGAGGAGTTGTTTAAAGCAATTGATAATGTCGCTTTGATGAGGGCGAATCAGTCGGATACTGAGCCGGATAGCGATCTTGAGGCTGCGGCCGA
GATCGTTGATGAAGTTGTAGAGCCTGGCTCTAAAAGGCAAAGACGGCGTTCAATACCTAAGAGAAATCAAGCCCTTGAGAAAACTGTAAAATGTGAAGAAGAAAAACCTC
CATTGACCTCTCTGGAAGTAAAGCCACATGAATGCTACATCAAAAGCCACGGAGAAAAGGTGGCTGATAACACGGAACTCGAAGAGCAAATGATGGCAAAGAAACTGCTT
GAAGCTGCAGAAAAAGTTCAAGCAATTGTGTCTGAAAATGCAGAATATGCAACTTCTGATGAAAAGAACGTCAACCATCGAATCGATTTGGTAAGGCGTCAAGGGAGCAA
GCTTATCAGATGCCTTGGGGATTTTCTCAGCACCGTTCATGATCTCCGAGGCCTGCTCGAAGATTGTGAGTGA
mRNA sequence
ATGAAGGAGAAGGACGGCCACAGTGGATCGGGTTCTCGCCGGACGCGGTCGCAAATAGCGGCGGAGTGGACGGCGGCGGACTGCATTGTTCTTGTTAACGTGATTGCGGC
GGTGGAGGCCGATTGCTTGAAAGCTTTGTCCAGTTACCAGAAATGGAAGATTATTGCGGAGAACTGCACGTCCTTGGATGTCGTTCGGAACTCGGATCAGTGCCGGAGAA
AGTGGGACCATATGCTGGTTAAACATGATGCTATCAAGCAATGGGAGTTGGGGATGGCGGATGGTGAATCGTATTGGTGTATGGAGAGTGGAAGGAGAAAAGAATTGGGA
CTTCCTGATAACTTTGACGAGGAGTTGTTTAAAGCAATTGATAATGTCGCTTTGATGAGGGCGAATCAGTCGGATACTGAGCCGGATAGCGATCTTGAGGCTGCGGCCGA
GATCGTTGATGAAGTTGTAGAGCCTGGCTCTAAAAGGCAAAGACGGCGTTCAATACCTAAGAGAAATCAAGCCCTTGAGAAAACTGTAAAATGTGAAGAAGAAAAACCTC
CATTGACCTCTCTGGAAGTAAAGCCACATGAATGCTACATCAAAAGCCACGGAGAAAAGGTGGCTGATAACACGGAACTCGAAGAGCAAATGATGGCAAAGAAACTGCTT
GAAGCTGCAGAAAAAGTTCAAGCAATTGTGTCTGAAAATGCAGAATATGCAACTTCTGATGAAAAGAACGTCAACCATCGAATCGATTTGGTAAGGCGTCAAGGGAGCAA
GCTTATCAGATGCCTTGGGGATTTTCTCAGCACCGTTCATGATCTCCGAGGCCTGCTCGAAGATTGTGAGTGA
Protein sequence
MKEKDGHSGSGSRRTRSQIAAEWTAADCIVLVNVIAAVEADCLKALSSYQKWKIIAENCTSLDVVRNSDQCRRKWDHMLVKHDAIKQWELGMADGESYWCMESGRRKELG
LPDNFDEELFKAIDNVALMRANQSDTEPDSDLEAAAEIVDEVVEPGSKRQRRRSIPKRNQALEKTVKCEEEKPPLTSLEVKPHECYIKSHGEKVADNTELEEQMMAKKLL
EAAEKVQAIVSENAEYATSDEKNVNHRIDLVRRQGSKLIRCLGDFLSTVHDLRGLLEDCE
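
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code, which this page does not state explicitly; `CODON_TABLE` and `translate` are hypothetical names, not part of any database tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line-wrapped input
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAAGGAGAAGGACGGCCACAGTGGATCG"))  # MKEKDGHSGS
```

Passing the full CDS (the wrapped lines can be pasted as is, since whitespace is stripped) and comparing the result with the listed protein sequence gives a quick consistency check on the record.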