; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G009000 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G009000
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionRWP-RK domain-containing protein
Genome locationCG_Chr05:9839757..9842003
RNA-Seq ExpressionClCG05G009000
SyntenyClCG05G009000
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR003035 - RWP-RK domain
IPR044607 - Transcription factor RKD-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK29190.1 protein RKD4 [Cucumis melo var. makuwa]2.2e-4855.22Show/hide
Query:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE------EKKLRL
        NPQ+L N     DF+WL E LFP H  + E+                                          IER  FSS+WKEEEE      EK+LRL
Subjt:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE------EKKLRL

Query:  LRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHT
        + N    ++RK S VLGLEEI+K+FHIPISEAAK+MN     LK     L I RWPHRK KSLNSLIQNVKEMGLTNEVKGLEEHKRL+EE+P++DLTHT
Subjt:  LRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHT

Query:  TKRLRQACFKANYKKKRRIIRSNNIATLHH
         KRLRQACFKANYKK RR +RSNNIA+LHH
Subjt:  TKRLRQACFKANYKKKRRIIRSNNIATLHH

XP_008466842.1 PREDICTED: protein RKD4 [Cucumis melo]8.2e-4854.55Show/hide
Query:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR
        NPQ+L N     DF+WL E LFP H  + E+                                          IER  FSS+WKEEEE       EK+LR
Subjt:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR

Query:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH
        L+ N    ++RK S VLGLEEI+K+FHIPIS+AAK+MN     LK     L I RWPHRK KSLNSLIQNVKEMGLTNEVKGLEEHKRL+EE+P++DLTH
Subjt:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH

Query:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH
        T KRLRQACFKANYKK RR +RSNNIA+LHH
Subjt:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH

XP_022968161.1 protein RKD4 [Cucurbita maxima]8.8e-4262.5Show/hide
Query:  PQSLQNPLDFDWLQELFPFHKGIREMIERPEFSSSWKEEEE--EKKLRL-LRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-IS
        P+ +    D D  +EL    + +  MIE     S+WKEEEE  EK++RL +RNGR++  SSVLGLEEIQKHFHIPI+EAAK M+     LK     L I 
Subjt:  PQSLQNPLDFDWLQELFPFHKGIREMIERPEFSSSWKEEEE--EKKLRL-LRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-IS

Query:  RWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLH
        RWPHRKLKSLNSLI NV+EMGLTNE+KGLEEHKRL+EELP++DLTH T+RLRQACFKANYKKKRR    +N+A LH
Subjt:  RWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLH

XP_031738302.1 protein RKD4 [Cucumis sativus]8.5e-4553.07Show/hide
Query:  NPQSLQN--PLDFDWLQE-LFPFHKGIREM--IERPEF----------------------------------------SSSWKEEEE-----EKKLRLLR
        NPQ+  N    DF+WL E LFPFH  + E+  +++ EF                                        SS+WKEEEE     EK+LRL+R
Subjt:  NPQSLQN--PLDFDWLQE-LFPFHKGIREM--IERPEF----------------------------------------SSSWKEEEE-----EKKLRLLR

Query:  N---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTK
        N    ++RK S VLGLEEI+K++HIPISEAAK+MN     LK     L I RWPHRKLKS NSLIQNVKEMGLTNEVKGLEE KRL+ E P++DLTHTTK
Subjt:  N---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTK

Query:  RLRQACFKANYKKKRRIIRSNNIATLHH
        RLRQACFK  Y+K RR +RSNNIA+LHH
Subjt:  RLRQACFKANYKKKRRIIRSNNIATLHH

XP_038886661.1 protein RKD2 [Benincasa hispida]7.2e-6061.86Show/hide
Query:  MESNTKL-NPQSLQ-NPLDFDWLQE-LFPF---------------------------------------------------HKGIREMIERPEFSSSWKE
        MES+TKL NPQ+LQ NP DF+WL E LFPF                                                    +GI EM ER  FSSSWKE
Subjt:  MESNTKL-NPQSLQ-NPLDFDWLQE-LFPF---------------------------------------------------HKGIREMIERPEFSSSWKE

Query:  EEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPD
        EEEEK+LRL+RNGR+RK S VLGLEEIQKHFH+PISEAAKQMN     LK     L I RWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRL+EELPD
Subjt:  EEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPD

Query:  LDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLHH
        +DLTH+TKRLRQACFKANYKK+R  IRSN+IATLHH
Subjt:  LDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLHH

TrEMBL top hitse value%identityAlignment
A0A0A0L6Q6 RWP-RK domain-containing protein2.8e-4167.33Show/hide
Query:  ERPEFSSSWKEEEEEKKLRLLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVK
        E  E     +EEE EK+LRL+RN    ++RK S VLGLEEI+K++HIPISEAAK+MN     LK     L I RWPHRKLKS NSLIQNVKEMGLTNEVK
Subjt:  ERPEFSSSWKEEEEEKKLRLLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVK

Query:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLHH
        GLEE KRL+ E P++DLTHTTKRLRQACFK  Y+K RR +RSNNIA+LHH
Subjt:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLHH

A0A1S3CS47 protein RKD44.0e-4854.55Show/hide
Query:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR
        NPQ+L N     DF+WL E LFP H  + E+                                          IER  FSS+WKEEEE       EK+LR
Subjt:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR

Query:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH
        L+ N    ++RK S VLGLEEI+K+FHIPIS+AAK+MN     LK     L I RWPHRK KSLNSLIQNVKEMGLTNEVKGLEEHKRL+EE+P++DLTH
Subjt:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH

Query:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH
        T KRLRQACFKANYKK RR +RSNNIA+LHH
Subjt:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH

A0A5D3E104 Protein RKD41.0e-4855.22Show/hide
Query:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE------EKKLRL
        NPQ+L N     DF+WL E LFP H  + E+                                          IER  FSS+WKEEEE      EK+LRL
Subjt:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE------EKKLRL

Query:  LRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHT
        + N    ++RK S VLGLEEI+K+FHIPISEAAK+MN     LK     L I RWPHRK KSLNSLIQNVKEMGLTNEVKGLEEHKRL+EE+P++DLTHT
Subjt:  LRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHT

Query:  TKRLRQACFKANYKKKRRIIRSNNIATLHH
         KRLRQACFKANYKK RR +RSNNIA+LHH
Subjt:  TKRLRQACFKANYKKKRRIIRSNNIATLHH

A0A6J1HU37 protein RKD44.3e-4262.5Show/hide
Query:  PQSLQNPLDFDWLQELFPFHKGIREMIERPEFSSSWKEEEE--EKKLRL-LRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-IS
        P+ +    D D  +EL    + +  MIE     S+WKEEEE  EK++RL +RNGR++  SSVLGLEEIQKHFHIPI+EAAK M+     LK     L I 
Subjt:  PQSLQNPLDFDWLQELFPFHKGIREMIERPEFSSSWKEEEE--EKKLRL-LRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-IS

Query:  RWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLH
        RWPHRKLKSLNSLI NV+EMGLTNE+KGLEEHKRL+EELP++DLTH T+RLRQACFKANYKKKRR    +N+A LH
Subjt:  RWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLH

E5GBA6 Transcription factor4.0e-4854.55Show/hide
Query:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR
        NPQ+L N     DF+WL E LFP H  + E+                                          IER  FSS+WKEEEE       EK+LR
Subjt:  NPQSLQN---PLDFDWLQE-LFPFHKGIREM------------------------------------------IERPEFSSSWKEEEE-------EKKLR

Query:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH
        L+ N    ++RK S VLGLEEI+K+FHIPIS+AAK+MN     LK     L I RWPHRK KSLNSLIQNVKEMGLTNEVKGLEEHKRL+EE+P++DLTH
Subjt:  LLRN---GRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTH

Query:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH
        T KRLRQACFKANYKK RR +RSNNIA+LHH
Subjt:  TTKRLRQACFKANYKKKRRIIRSNNIATLHH

SwissProt top hitse value%identityAlignment
O81791 Protein RKD55.1e-0827.54Show/hide
Query:  KGIREMIERPEFSSSWKEE-EEEKKLRLLRNGRKRKSSSV--LGLEEIQKHFHIPISEAAKQMN-----CLKEGADNLISRWPHRKLKSLNSLIQNVKE-
        + + E  E  EF +   E+ E + K  +L+  ++  S  V  L LEE+ K+F + I EA++ +        K+  +  I RWPHRK+KSL+ LI +++  
Subjt:  KGIREMIERPEFSSSWKEE-EEEKKLRLLRNGRKRKSSSV--LGLEEIQKHFHIPISEAAKQMN-----CLKEGADNLISRWPHRKLKSLNSLIQNVKE-

Query:  ------------MGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIAT
                    M +  + + LE  KR + + P +++   TK+ RQ  FK  ++  R      ++ T
Subjt:  ------------MGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIAT

Q9CA66 Protein RKD24.6e-1741.67Show/hide
Query:  EEEEEKKLRLLRNGRKRKSSS---VLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVK
        E   E  +R++ +     +SS    L  E + ++F++PI++AA  +N     LK     L I RWPHRKL SLN+LI NVKE+           L + ++
Subjt:  EEEEEKKLRLLRNGRKRKSSS---VLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVK

Query:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYK-KKRRIIRSN
         LE+ KR +E+LPDL+    TKRLRQACFKAN+K KK+R ++S+
Subjt:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYK-KKRRIIRSN

Q9FGD1 Protein RKD37.0e-1839.57Show/hide
Query:  SSWKEEEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM-----------GLTNE
        ++ KE+  +K++ + R  R+    + +  E ++++F++PI++AAK++N     LK+    L I RWPHRKL SLN+LI N+K++            L N 
Subjt:  SSWKEEEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM-----------GLTNE

Query:  VKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKR
        ++ LE  K+++EE+PDL+    TKRLRQACFKA YK++R
Subjt:  VKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKR

Q9LVU8 Protein RKD43.3e-2352.25Show/hide
Query:  KRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQAC
        K+K    L + EI++ F  PI +AAK++N     LK+    L I RWPHRKLKSLNSLI+N+K +G+  EVK LEEH+ L+E+ PD +L+  TK+LRQAC
Subjt:  KRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQAC

Query:  FKANYKKKRRI
        FKANYK+++ +
Subjt:  FKANYKKKRRI

Q9M9U9 Protein RKD11.9e-1847.41Show/hide
Query:  SSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVKGLEEHKRLMEELPDLDLTHTTK
        S  L  E I  +F++PI++AA+++N     LK+    L I RWPHRKL SL  LI NVKE+           L N ++ LE+ K+ +E+LPDL     TK
Subjt:  SSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVKGLEEHKRLMEELPDLDLTHTTK

Query:  RLRQACFKANYKKKRR
        RLRQACFKAN+K+KRR
Subjt:  RLRQACFKANYKKKRR

Arabidopsis top hitse value%identityAlignment
AT1G18790.1 RWP-RK domain-containing protein1.3e-1947.41Show/hide
Query:  SSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVKGLEEHKRLMEELPDLDLTHTTK
        S  L  E I  +F++PI++AA+++N     LK+    L I RWPHRKL SL  LI NVKE+           L N ++ LE+ K+ +E+LPDL     TK
Subjt:  SSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVKGLEEHKRLMEELPDLDLTHTTK

Query:  RLRQACFKANYKKKRR
        RLRQACFKAN+K+KRR
Subjt:  RLRQACFKANYKKKRR

AT1G74480.1 RWP-RK domain-containing protein3.2e-1841.67Show/hide
Query:  EEEEEKKLRLLRNGRKRKSSS---VLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVK
        E   E  +R++ +     +SS    L  E + ++F++PI++AA  +N     LK     L I RWPHRKL SLN+LI NVKE+           L + ++
Subjt:  EEEEEKKLRLLRNGRKRKSSS---VLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM----------GLTNEVK

Query:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYK-KKRRIIRSN
         LE+ KR +E+LPDL+    TKRLRQACFKAN+K KK+R ++S+
Subjt:  GLEEHKRLMEELPDLDLTHTTKRLRQACFKANYK-KKRRIIRSN

AT4G35590.1 RWP-RK domain-containing protein3.6e-0927.54Show/hide
Query:  KGIREMIERPEFSSSWKEE-EEEKKLRLLRNGRKRKSSSV--LGLEEIQKHFHIPISEAAKQMN-----CLKEGADNLISRWPHRKLKSLNSLIQNVKE-
        + + E  E  EF +   E+ E + K  +L+  ++  S  V  L LEE+ K+F + I EA++ +        K+  +  I RWPHRK+KSL+ LI +++  
Subjt:  KGIREMIERPEFSSSWKEE-EEEKKLRLLRNGRKRKSSSV--LGLEEIQKHFHIPISEAAKQMN-----CLKEGADNLISRWPHRKLKSLNSLIQNVKE-

Query:  ------------MGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIAT
                    M +  + + LE  KR + + P +++   TK+ RQ  FK  ++  R      ++ T
Subjt:  ------------MGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIAT

AT5G53040.1 RWP-RK domain-containing protein2.3e-2452.25Show/hide
Query:  KRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQAC
        K+K    L + EI++ F  PI +AAK++N     LK+    L I RWPHRKLKSLNSLI+N+K +G+  EVK LEEH+ L+E+ PD +L+  TK+LRQAC
Subjt:  KRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQAC

Query:  FKANYKKKRRI
        FKANYK+++ +
Subjt:  FKANYKKKRRI

AT5G66990.1 RWP-RK domain-containing protein5.0e-1939.57Show/hide
Query:  SSWKEEEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM-----------GLTNE
        ++ KE+  +K++ + R  R+    + +  E ++++F++PI++AAK++N     LK+    L I RWPHRKL SLN+LI N+K++            L N 
Subjt:  SSWKEEEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMN----CLKEGADNL-ISRWPHRKLKSLNSLIQNVKEM-----------GLTNE

Query:  VKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKR
        ++ LE  K+++EE+PDL+    TKRLRQACFKA YK++R
Subjt:  VKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTCCAACACCAAACTAAATCCTCAAAGCCTTCAGAATCCACTCGACTTTGACTGGCTGCAAGAACTCTTCCCATTCCACAAGGGAATTAGAGAGATGATTGAAAG
ACCAGAGTTTTCATCAAGTTGGAAAGAAGAGGAAGAGGAAAAGAAATTGAGATTATTGAGGAATGGGAGGAAGAGAAAAAGTAGTTCAGTTTTGGGATTGGAAGAGATTC
AAAAGCACTTTCATATACCAATATCAGAGGCAGCAAAACAGATGAATTGCTTAAAAGAAGGTGCAGACAACTTAATATCAAGATGGCCCCATAGAAAGCTCAAGAGCTTG
AATTCTCTCATTCAAAATGTTAAGGAGATGGGGTTAACAAATGAGGTAAAAGGGTTGGAGGAGCACAAGAGGCTGATGGAGGAATTGCCAGATTTGGACCTCACACACAC
AACCAAAAGGCTGAGGCAAGCTTGTTTCAAAGCCAATTACAAGAAGAAGAGAAGAATTATTAGGTCCAATAATATTGCTACTCTTCATCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGTCCAACACCAAACTAAATCCTCAAAGCCTTCAGAATCCACTCGACTTTGACTGGCTGCAAGAACTCTTCCCATTCCACAAGGGAATTAGAGAGATGATTGAAAG
ACCAGAGTTTTCATCAAGTTGGAAAGAAGAGGAAGAGGAAAAGAAATTGAGATTATTGAGGAATGGGAGGAAGAGAAAAAGTAGTTCAGTTTTGGGATTGGAAGAGATTC
AAAAGCACTTTCATATACCAATATCAGAGGCAGCAAAACAGATGAATTGCTTAAAAGAAGGTGCAGACAACTTAATATCAAGATGGCCCCATAGAAAGCTCAAGAGCTTG
AATTCTCTCATTCAAAATGTTAAGGAGATGGGGTTAACAAATGAGGTAAAAGGGTTGGAGGAGCACAAGAGGCTGATGGAGGAATTGCCAGATTTGGACCTCACACACAC
AACCAAAAGGCTGAGGCAAGCTTGTTTCAAAGCCAATTACAAGAAGAAGAGAAGAATTATTAGGTCCAATAATATTGCTACTCTTCATCACTAA
Protein sequenceShow/hide protein sequence
MESNTKLNPQSLQNPLDFDWLQELFPFHKGIREMIERPEFSSSWKEEEEEKKLRLLRNGRKRKSSSVLGLEEIQKHFHIPISEAAKQMNCLKEGADNLISRWPHRKLKSL
NSLIQNVKEMGLTNEVKGLEEHKRLMEELPDLDLTHTTKRLRQACFKANYKKKRRIIRSNNIATLHH