CuGenDBv2

CcUC04G076080 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC04G076080
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Unknown protein
Genome location: CicolChr04:31391361..31392098
RNA-Seq Expression: CcUC04G076080
Synteny: CcUC04G076080
Gene Ontology terms: NA
InterPro domains: NA
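
The Genome location field above follows a `seqid:start..end` pattern. As a minimal sketch, the snippet below splits such a string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CicolChr04:31391361..31392098")
print(seqid, span_length(start, end))  # prints: CicolChr04 738
```

Under that assumption the locus spans 738 bp, a multiple of three (246 codons).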


Homology
GenBank top hits (e value, % identity, alignment)
KAG7033980.1 hypothetical protein SDJN02_03706, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 6.2e-78; identity 77.67%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRK+TVHPSPPIISDFLSFLP+AIF LTVALSADDKEVLAYLISCSNT+ASLSNLS++RK+GRK   GKVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEII+AYEDGL KSK T + QRN KKE+RKKN ES  GESS+GKGK  E   S QQE+ R  N +EEE         GEERGSV RFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IW AWG
Subjt:  IWGAWG

TYK13010.1 uncharacterized protein E5676_scaffold255G006090 [Cucumis melo var. makuwa]; e value 1.4e-93; identity 90.78%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SRKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEE+EE   GGEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

XP_004134788.1 uncharacterized protein LOC101204826 [Cucumis sativus]; e value 3.4e-92; identity 89.81%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALT+ALSADDKEVLAYLISCSN+TASLSNLS  RKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESS GKGKTNEVL +  QETGRQRNEKEEEEEEE   G GEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

XP_008440055.1 PREDICTED: uncharacterized protein LOC103484646 [Cucumis melo]; e value 8.0e-94; identity 91.26%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SRKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEEEEE   GGEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

XP_038882712.1 uncharacterized protein LOC120073876 [Benincasa hispida]; e value 2.1e-94; identity 90.82%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSNTTASLSNLS SRKN RK+A GKVG DHAP+FDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSE--QQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGE
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNESA GESSLGKGKTNEVLS+  QQ+TGRQRNEKEEEEE+EE    G ERGSVRRFVSFVGE
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSE--QQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGE

Query:  KIWGAWG
        KIWGAWG
Subjt:  KIWGAWG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMY4 Uncharacterized protein; e value 1.6e-92; identity 89.81%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALT+ALSADDKEVLAYLISCSN+TASLSNLS  RKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESS GKGKTNEVL +  QETGRQRNEKEEEEEEE   G GEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

A0A1S3B0U5 uncharacterized protein LOC103484646; e value 3.9e-94; identity 91.26%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SRKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEEEEE   GGEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

A0A5D3CNJ0 Uncharacterized protein; e value 6.6e-94; identity 90.78%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKSTVHPSPPIISDFLSFLP+AIFALTVALSADDKEVLAYLISCSN+TAS SNLS SRKNGRK+A  KVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEIIDAYEDGLTKSKATTS QRNCKKERRKKNNES  GESSLGKGKTNEVL +  QETGRQRNEKEEEEE+EE   GGEERGSVRRFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQ-QETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IWGAWG
Subjt:  IWGAWG

A0A6J1EKS6 uncharacterized protein LOC111433506; e value 7.4e-69; identity 70.62%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRKS+VHPS PIISDFLSFLP+ IFALTVALSADDKEVLAYLI+CSNT        ++RK  RK+ +GK G DHAPLFDCDCFMCYRRYW RWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESS-----LGKGKTNEVLS-EQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVS
        PNRQLIHE+I+AYEDGL KSKA  ++QRNCKKERRKK NES   ES+     +GK K NE     QQE+ R  N KEEEEE         ERGSVRRFVS
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESS-----LGKGKTNEVLS-EQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVS

Query:  FVGEKIWGAWG
        FVGEKIWGAWG
Subjt:  FVGEKIWGAWG

A0A6J1IPN3 uncharacterized protein LOC111478801; e value 8.1e-76; identity 76.21%
Query:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS
        MKKLCRK+TVHPSPPIISDFLSFLP+ IF LTVALSADDKEVLAYLISCSNT+ASLSNLS +RK+GRK   GKVG DHAPLFDCDCFMCYRRYWARWDSS
Subjt:  MKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARWDSS

Query:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK
        PNRQLIHEII+AYEDGL K K T S QRN KKERRKKN ES   ESS+GKGK  E   S QQE+ R  N ++ E         GEERGSV RFVSFVGEK
Subjt:  PNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVL-SEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFVGEK

Query:  IWGAWG
        IW AWG
Subjt:  IWGAWG

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G12020.1 unknown protein; e value 4.8e-36; identity 43.42%
Query:  MKKLCRKSTVHPSPPIISD---FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARW
        MKKL RK TVHPSPP I      L+ LP AIF+L   LS +D+EVLAYLIS ++ +   +   TSR N  K     +  +H+PLF CDCF CY  YW RW
Subjt:  MKKLCRKSTVHPSPPIISD---FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWARW

Query:  DSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEEEEGGG---------------
        DSSP+RQLIHEIIDA+ED L K+K    N    K  R++    S++  SS      +E+ S   E+           E  ++GGG               
Subjt:  DSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEEEEGGG---------------

Query:  -----GEERGSVRRFVSFVGEKIWGAWG
              EE+G+VRRFVSF+GEK++G WG
Subjt:  -----GEERGSVRRFVSFVGEKIWGAWG

AT1G24270.1 unknown protein; e value 4.6e-23; identity 45.39%
Query:  SSSSAMKKLCRKSTVHPSPPIIS-------DFLS---FLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCD
        SS SAM K+ +K  VHPSPP+ S       D LS    L SAI  L   LSA+D EVLAYLI+ S  T ++  +S  +K   K          APL DC 
Subjt:  SSSSAMKKLCRKSTVHPSPPIIS-------DFLS---FLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCD

Query:  CFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKK
        CF CY  YW++WDSS NR+LI++II+A+ED LT+ + + S+     K+R KK
Subjt:  CFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKK

AT1G62422.1 unknown protein; e value 3.1e-35; identity 45.45%
Query:  MKKLCRKSTVHPSPP--IISD--FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWAR
        MKKLCRK TVHPSPP  I +D  FLS LP AI +L  ALS +D+EVLAYLIS S  +  +S L  ++++            H+PLF CDCF CY  YW R
Subjt:  MKKLCRKSTVHPSPP--IISD--FLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDHAPLFDCDCFMCYRRYWAR

Query:  WDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFV
        WD+SP RQLIHEIIDAYED L   K         KK+RRK++ +++   +S+G  + +E+ S   E     +EK+     EE     +E+GSV + +SF+
Subjt:  WDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEEEEGGGGEERGSVRRFVSFV

Query:  GEKIWGAWG
        G++  G WG
Subjt:  GEKIWGAWG

AT5G13090.1 unknown protein; e value 1.4e-16; identity 34.91%
Query:  KNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDH-A
        K   +  P P P   SSSS+   L  +     S       L  LP+ I  L   LS++++EVLAYLI+   T +   N  +S KN  K  + K   +H  
Subjt:  KNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLATGKVGFDH-A

Query:  PLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYED--GLTKSKATTSNQRNCKKE---RRKKNNES--AIGESSLGKGKTNEVLSEQQETGRQRN----
        P+FDC+CF CY  YW RWDSSPNR+LIHEII+A+E+  G   S + + ++R  KKE   RR  +++S  A+  +  G   +  V+    ET    +    
Subjt:  PLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYED--GLTKSKATTSNQRNCKKE---RRKKNNES--AIGESSLGKGKTNEVLSEQQETGRQRN----

Query:  -----------EKEEEEEEEEEGGGGEERGSV
                   E E E+E   E  G EE  +V
Subjt:  -----------EKEEEEEEEEEGGGGEERGSV


Sequences
CDS sequence
ATGGACACACGTCATCAACTCCAATGCCTCCTTCTCTTCGGAGTAGAAAATAAAAACCTCAAAAACTCTAAACACCGCTTTCCATTCCCAAAACCCTTTTTTCTCTCTTC
TTCTTCCGCCATGAAGAAGCTTTGCCGGAAAAGCACCGTCCATCCATCGCCGCCGATAATTTCCGACTTCCTTTCCTTTTTACCCTCCGCCATATTCGCCCTCACCGTCG
CTCTCTCCGCCGATGACAAAGAAGTCCTCGCCTATCTCATCTCTTGTTCCAACACCACCGCTTCTCTCTCCAACTTATCCACCAGCCGCAAGAACGGTCGGAAACTCGCC
ACTGGTAAGGTCGGTTTCGATCACGCTCCGCTCTTTGACTGTGATTGTTTTATGTGCTATCGACGATACTGGGCGAGATGGGATTCTTCCCCCAATCGGCAACTTATTCA
TGAAATAATCGATGCTTATGAAGATGGATTAACGAAATCGAAAGCCACAACAAGCAATCAGAGGAATTGCAAGAAAGAAAGACGGAAGAAGAACAACGAATCGGCTATCG
GTGAGTCAAGCTTAGGGAAAGGCAAGACGAACGAGGTATTATCGGAGCAGCAGGAGACGGGTCGGCAGAGGAATGAAAAAGAGGAGGAGGAGGAGGAGGAAGAAGAAGGA
GGAGGAGGAGAAGAAAGAGGATCGGTCAGAAGATTCGTGAGTTTTGTAGGTGAGAAAATTTGGGGTGCTTGGGGTTAA
mRNA sequence
ATGGACACACGTCATCAACTCCAATGCCTCCTTCTCTTCGGAGTAGAAAATAAAAACCTCAAAAACTCTAAACACCGCTTTCCATTCCCAAAACCCTTTTTTCTCTCTTC
TTCTTCCGCCATGAAGAAGCTTTGCCGGAAAAGCACCGTCCATCCATCGCCGCCGATAATTTCCGACTTCCTTTCCTTTTTACCCTCCGCCATATTCGCCCTCACCGTCG
CTCTCTCCGCCGATGACAAAGAAGTCCTCGCCTATCTCATCTCTTGTTCCAACACCACCGCTTCTCTCTCCAACTTATCCACCAGCCGCAAGAACGGTCGGAAACTCGCC
ACTGGTAAGGTCGGTTTCGATCACGCTCCGCTCTTTGACTGTGATTGTTTTATGTGCTATCGACGATACTGGGCGAGATGGGATTCTTCCCCCAATCGGCAACTTATTCA
TGAAATAATCGATGCTTATGAAGATGGATTAACGAAATCGAAAGCCACAACAAGCAATCAGAGGAATTGCAAGAAAGAAAGACGGAAGAAGAACAACGAATCGGCTATCG
GTGAGTCAAGCTTAGGGAAAGGCAAGACGAACGAGGTATTATCGGAGCAGCAGGAGACGGGTCGGCAGAGGAATGAAAAAGAGGAGGAGGAGGAGGAGGAAGAAGAAGGA
GGAGGAGGAGAAGAAAGAGGATCGGTCAGAAGATTCGTGAGTTTTGTAGGTGAGAAAATTTGGGGTGCTTGGGGTTAA
Protein sequence
MDTRHQLQCLLLFGVENKNLKNSKHRFPFPKPFFLSSSSAMKKLCRKSTVHPSPPIISDFLSFLPSAIFALTVALSADDKEVLAYLISCSNTTASLSNLSTSRKNGRKLA
TGKVGFDHAPLFDCDCFMCYRRYWARWDSSPNRQLIHEIIDAYEDGLTKSKATTSNQRNCKKERRKKNNESAIGESSLGKGKTNEVLSEQQETGRQRNEKEEEEEEEEEG
GGGEERGSVRRFVSFVGEKIWGAWG