CuGenDBv2

Clc01G21990 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc01G21990
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Unknown protein
Genome location: ClcChr01:33316494..33317869
RNA-Seq Expression: Clc01G21990
Synteny: Clc01G21990
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG7033980.1 hypothetical protein SDJN02_03706, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.2e-78 | 78.43
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRK+TVHPSPPIISDFLSFLP+AIF LTVALSADDKEVLAYLISCSNT+ASLSNLS++R++GRK   GKVGVDHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEII+AYEDGL KSK T + QRN KKE+RKKN ES  GESS+GKGK  E   S QQE+ R  N +       EEEGEERGSV RFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
         AWG
Subjt:  GAWG

TYK13010.1 uncharacterized protein E5676_scaffold255G006090 [Cucumis melo var. makuwa] | 6.7e-93 | 90.69
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SR+NGRK+A  KVG+DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEE+ EE GEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

XP_004134788.1 uncharacterized protein LOC101204826 [Cucumis sativus] | 2.6e-92 | 90.69
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALT+ALSADDKEVLAYLISCSN+TASLSNLS  R+NGRK+A  KVGVDHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESS GKGKTNEVL +  QETGRQRNEKEEEEEE E EGEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

XP_008440055.1 PREDICTED: uncharacterized protein LOC103484646 [Cucumis melo] | 4.0e-93 | 91.18
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SR+NGRK+A  KVG+DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEEE EE GEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

XP_038882712.1 uncharacterized protein LOC120073876 [Benincasa hispida] | 5.5e-95 | 91.71
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSNTTASLSNLS SR+N RK+A GKVGVDHAP+FDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSE--QQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKI
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNESA GESSLGKGKTNEVLS+  QQ+TGRQRNEKEEEEE  +EEG ERGSVRRFVSFVGEKI
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSE--QQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKI

Query:  WGAWG
        WGAWG
Subjt:  WGAWG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0KMY4 Uncharacterized protein | 1.2e-92 | 90.69
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALT+ALSADDKEVLAYLISCSN+TASLSNLS  R+NGRK+A  KVGVDHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESS GKGKTNEVL +  QETGRQRNEKEEEEEE E EGEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

A0A1S3B0U5 uncharacterized protein LOC103484646 | 1.9e-93 | 91.18
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SR+NGRK+A  KVG+DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEEE EE GEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

A0A5D3CNJ0 Uncharacterized protein | 3.3e-93 | 90.69
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SR+NGRK+A  KVG+DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEE+ EE GEERGSVRRFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
        GAWG
Subjt:  GAWG

A0A6J1EKS6 uncharacterized protein LOC111433506 | 1.9e-69 | 71.29
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKS+VHPS PIISDFLSFLP+ IFALTVALSADDKEVLAYLI+CSNT        ++R+  RK+ +GK GVDHAPLFDCDCFMCYRRYW RWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESS-----LGKGKTNEVLS-EQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFV
        PNRQLIHE+I+AYEDGL KSKA  ++QRNCKKERRKK NES   ES+     +GK K NE     QQE+ R  N KEEEEE       ERGSVRRFVSFV
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESS-----LGKGKTNEVLS-EQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFV

Query:  GEKIWGAWG
        GEKIWGAWG
Subjt:  GEKIWGAWG

A0A6J1IPN3 uncharacterized protein LOC111478801 | 1.6e-76 | 76.96
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRK+TVHPSPPIISDFLSFLP+ IF LTVALSADDKEVLAYLISCSNT+ASLSNLS +R++GRK   GKVGVDHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW
        PNRQLIHEII+AYEDGL K K T S QRN KKERRKKN ES   ESS+GKGK  E   S QQE+ R  N +       + EGEERGSV RFVSFVGEKIW
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGEKIW

Query:  GAWG
         AWG
Subjt:  GAWG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G12020.1 unknown protein | 1.8e-35 | 44.1
Query:  MKKLCRKSTVHPSPPIISD---FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARW
        MKKL RK TVHPSPP I      L+ LP AIF+L   LS +D+EVLAYLIS ++ +   +   TSR N  K     +  +H+PLF CDCF CY  YW RW
Subjt:  MKKLCRKSTVHPSPPIISD---FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWARW

Query:  DSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESA------------------IGESSLGKGKTNEVLSEQQETGRQRNEKEEEE-----
        DSSP+RQLIHEIIDA+ED L K+K    N    KK+RRK++ +S+                  +GES +            Q+ G      E  E     
Subjt:  DSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESA------------------IGESSLGKGKTNEVLSEQQETGRQRNEKEEEE-----

Query:  EEGEEEGEERGSVRRFVSFVGEKIWGAWG
        +  E+  EE+G+VRRFVSF+GEK++G WG
Subjt:  EEGEEEGEERGSVRRFVSFVGEKIWGAWG

AT1G24270.1 unknown protein | 7.8e-23 | 44.74
Query:  SSSSAMKKLCRKSTVHPSPPIIS-------DFLS---FLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCD
        SS SAM K+ +K  VHPSPP+ S       D LS    L SAI  L   LSA+D EVLAYLI+ S  T ++  +S  ++   K          APL DC 
Subjt:  SSSSAMKKLCRKSTVHPSPPIIS-------DFLS---FLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCD

Query:  CFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKK
        CF CY  YW++WDSS NR+LI++II+A+ED LT+ + + S+     K+R KK
Subjt:  CFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKK

AT1G62422.1 unknown protein | 2.8e-36 | 46.38
Query:  MKKLCRKSTVHPSPP--IISD--FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWAR
        MKKLCRK TVHPSPP  I +D  FLS LP AI +L  ALS +D+EVLAYLIS S  +  +S L  ++ +            H+PLF CDCF CY  YW R
Subjt:  MKKLCRKSTVHPSPP--IISD--FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDHAPLFDCDCFMCYRRYWAR

Query:  WDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGE
        WD+SP RQLIHEIIDAYED L   K         KK+RRK++ +++   +S+G  + +E+ S   E     +EK +    GEE  +E+GSV + +SF+G+
Subjt:  WDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEGEEEGEERGSVRRFVSFVGE

Query:  KIWGAWG
        +  G WG
Subjt:  KIWGAWG

AT5G13090.1 unknown protein | 2.2e-17 | 35.78
Query:  KNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDH-A
        K   +  P P P   SSSS+   L  +     S       L  LP+ I  L   LS++++EVLAYLI+   T +   N  +S +N  K  + K   +H  
Subjt:  KNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLATGKVGVDH-A

Query:  PLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYED--GLTKSKATTSNQRNCKKE---RRKKNNES--AIGESSLGKGKTNEVLSEQQETGRQRNE---
        P+FDC+CF CY  YW RWDSSPNR+LIHEII+A+E+  G   S + + ++R  KKE   RR  +++S  A+  +  G   +  V+    ET    +    
Subjt:  PLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYED--GLTKSKATTSNQRNCKKE---RRKKNNES--AIGESSLGKGKTNEVLSEQQETGRQRNE---

Query:  -----KEEEEEEGEEEGE
              E E  EGE E E
Subjt:  -----KEEEEEEGEEEGE


Sequences
CDS sequence
ATGGACACACGTCATCAACTCCGATGCCTCCTTCTCTTCGGAGTAGAAAATAAAAACCTCAAAAACTCTAAACACCGCTTTCCATTTCCAAAACCCTTTTTTCTCTCTTC
TTCTTCCGCCATGAAGAAGCTTTGCCGGAAAAGCACCGTCCATCCATCGCCGCCGATAATTTCCGATTTCCTTTCCTTTTTACCCTCCGCGATATTCGCCCTCACCGTCG
CTCTCTCCGCCGATGACAAAGAAGTCCTCGCCTATCTCATCTCTTGTTCCAACACCACCGCTTCTCTCTCCAACTTATCCACCAGCCGCAGGAACGGTCGGAAACTCGCC
ACTGGTAAGGTCGGTGTCGATCACGCTCCGCTCTTTGACTGCGATTGTTTTATGTGCTATCGACGATACTGGGCGAGATGGGATTCTTCCCCCAATCGGCAACTTATTCA
TGAAATAATCGATGCTTATGAAGATGGATTAACGAAATCGAAAGCCACAACAAGCAATCAGAGGAATTGCAAGAAAGAAAGACGGAAGAAGAACAACGAATCGGCTATCG
GTGAGTCAAGCTTAGGGAAAGGCAAGACAAACGAGGTATTATCGGAGCAGCAGGAGACGGGTCGGCAGAGGAATGAAAAAGAGGAGGAGGAGGAAGAAGGAGAAGAAGAA
GGAGAAGAAAGAGGATCGGTGAGAAGATTCGTGAGTTTTGTAGGTGAGAAAATTTGGGGTGCTTGGGGTTAA
mRNA sequence
TAAAGAAAGAGATATATGGACTGATGATCGTGGAGGGTCCCACTCTGTCTCCTTTAAATCAATCACAACCGTACGATGAGAAAGAGAAGTAATCCATCAGTGACATTGCA
GCATAAGAATCGTGTTGTGCGTCGCACGTGAATATTACCCAATACATCATGGACACACGTCATCAACTCCGATGCCTCCTTCTCTTCGGAGTAGAAAATAAAAACCTCAA
AAACTCTAAACACCGCTTTCCATTTCCAAAACCCTTTTTTCTCTCTTCTTCTTCCGCCATGAAGAAGCTTTGCCGGAAAAGCACCGTCCATCCATCGCCGCCGATAATTT
CCGATTTCCTTTCCTTTTTACCCTCCGCGATATTCGCCCTCACCGTCGCTCTCTCCGCCGATGACAAAGAAGTCCTCGCCTATCTCATCTCTTGTTCCAACACCACCGCT
TCTCTCTCCAACTTATCCACCAGCCGCAGGAACGGTCGGAAACTCGCCACTGGTAAGGTCGGTGTCGATCACGCTCCGCTCTTTGACTGCGATTGTTTTATGTGCTATCG
ACGATACTGGGCGAGATGGGATTCTTCCCCCAATCGGCAACTTATTCATGAAATAATCGATGCTTATGAAGATGGATTAACGAAATCGAAAGCCACAACAAGCAATCAGA
GGAATTGCAAGAAAGAAAGACGGAAGAAGAACAACGAATCGGCTATCGGTGAGTCAAGCTTAGGGAAAGGCAAGACAAACGAGGTATTATCGGAGCAGCAGGAGACGGGT
CGGCAGAGGAATGAAAAAGAGGAGGAGGAGGAAGAAGGAGAAGAAGAAGGAGAAGAAAGAGGATCGGTGAGAAGATTCGTGAGTTTTGTAGGTGAGAAAATTTGGGGTGC
TTGGGGTTAATGAATTTGAACAATCGCAAAGGTAGATTTCTTCTTCTTCTTCTTCTTCTTCTTTAAATCATCTAAGAAGATGAGCTCAAAATAGGATTCTGTTTTGGTTT
CTCTGTTTCGGATTTTTGTTTGCTCTGTTCTTGTAAATTACTTCCCTAATGTAAAAATGAATGATTATATGGTGTTCTATATAAATTCCACTCTGGAAATCTTCTGGATG
TTCTTGCTGAAATTTGGTAATTGAGGTTAAATTTTAGGCTTAGTTAAGTGTGTTTGATTATAATTGAAGAAGCAGAATTACATTCCATTTGTGAGATTCTAACGTTGAAT
TTCCTTGTAAACTGGAAAACGATGTCGTTAAAGGGAAGGCCTGACCCACCATTTTTCTGTATATATATGGAAAGAATATGGAATGCAAGAGAGTTGCAATTGCGAAACGT
GCATGCATGGCCTTTGCAAATGAAATTGACACCCAATTGGAAGTATTTTGGGGATT
Protein sequence
MDTRHQLRCLLLFGVENKNLKNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRRNGRKLA
TGKVGVDHAPLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEGEEE
GEERGSVRRFVSFVGEKIWGAWG
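
The relationship between the CDS and protein sequences listed above can be checked with a short translation sketch. This is a minimal illustration, not part of the CuGenDB record: it assumes the standard nuclear genetic code (NCBI translation table 1), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record's CDS is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from a wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the CDS sequence in this record.
print(translate("ATGGACACACGTCATCAACTC"))  # MDTRHQL
print(translate("GGTGCTTGGGGTTAA"))        # GAWG
```

Applied to the full CDS with line breaks removed, the result can be compared against the protein sequence in this record, which begins with MDTRHQL and ends with GAWG.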