; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G16230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G16230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationChr1:11746523..11747707
RNA-Seq ExpressionCSPI01G16230
SyntenyCSPI01G16230
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652887.1 hypothetical protein Csa_017671 [Cucumis sativus]8.2e-159100Show/hide
Query:  MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP
        MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP
Subjt:  MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP

Query:  LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID
        LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID
Subjt:  LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID

Query:  AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
Subjt:  AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

KAG7019402.1 hypothetical protein SDJN02_18363, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-7778.5Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYR+RGTVHPSP IISDHLSFLPT ILTLAAALS  DRE+LAYLISS SNDFT V N SSHRGKA HQK AAA  G DHPP FSC CF+CYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQ+IHEIIDAYEE LAESK GKNNKKERKKRN     G VS PG+GKGSE A K EE RVTE E A+GGE  AEKG VR IVS +GEKIWG WN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

XP_004150423.1 uncharacterized protein LOC101221021 [Cucumis sativus]1.6e-106100Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

XP_008458978.1 PREDICTED: uncharacterized protein LOC103498228 [Cucumis melo]9.9e-8884.77Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

XP_038894832.1 uncharacterized protein LOC120083238 [Benincasa hispida]8.7e-9274.23Show/hide
Query:  HPHAFPLPLRQYLQHLLQSPLALY---PTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSL
        H HA   PL++  Q   Q PLA +   P+   T  P  K  P F    +  SL I   FWSPMKKLYRKRGTVHPSP IISDHLSFLPT ILTLAAALSL
Subjt:  HPHAFPLPLRQYLQHLLQSPLALY---PTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSL

Query:  HDREVLAYLISSCSNDFTAVINSSSHRGKATHQK-HAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKR
         DREVLAYLISSCSNDFTAV N S+HRGKA HQK  AAA GG DHPPAFSC CF+CYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESK GKNNKKERKKR
Subjt:  HDREVLAYLISSCSNDFTAVINSSSHRGKATHQK-HAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKR

Query:  NNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        N+ GPVSGPGEGK SE A +EEE RVTERE AEGGEE  EKG VRRIVS +GE+IWGSWN
Subjt:  NNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

TrEMBL top hitse value%identityAlignment
A0A0A0LVV3 Uncharacterized protein4.0e-159100Show/hide
Query:  MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP
        MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP
Subjt:  MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSP

Query:  LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID
        LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID
Subjt:  LIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIID

Query:  AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
Subjt:  AYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

A0A1S3C9P0 uncharacterized protein LOC1034982284.8e-8884.77Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

A0A5D3CKA3 Uncharacterized protein4.8e-8884.77Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYRK GTVHPSP +ISDHLSFLPT ILTL++ALSL DREVLAYLISSCSNDFT V NSS+HRGKA H KHAA M G DHPPAFSCYCFQCYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQLIHEIIDAYE+KLAE+KVGKNNKKERKKRN+ G VSGPGEGKG+EAA K EEW+VT     EGGEE AEKGPVRRIVSLLGEKIWGSWN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

A0A6J1ENT0 uncharacterized protein LOC1114342291.7e-7778Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        MKKLYR+RGTVHPSP IISDHLSFLPT ILTLAAALS  DRE+LAYLISS SNDFT V N S HRGKA HQK AAA  G DHPP FSC CF+CYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN
        WDSSPNRQ+IHEIIDAYEE LAESK GKNNKKERKKRN     G VS PG+GKGSE A K EE RVTE E A+GGE  AEKG VR IVS +GEKIWG WN
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN

A0A6J1KLN6 uncharacterized protein LOC1114956872.6e-7375.37Show/hide
Query:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR
        M KLYR+RGTVHPSP IISDHLSFLPT ILTLAAALS  DRE+LAYLISS SNDFT V N S HRGKA  QK AAA  G DHPPAFSC CF+CYTSYWVR
Subjt:  MKKLYRKRGTVHPSPLIISDHLSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVR

Query:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAE---GGEEGAEKGPVRRIVSLLGEKIWG
        WDSSPNRQ+IHEIIDAYEE LAESK GKNNKKERKKRN     G VS  G+GKGSE A K EE RVTE E A+   GGE  AEKG VR IV  +GEKIWG
Subjt:  WDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRN---NRGPVSGPGEGKGSEAATKEEEWRVTEREVAE---GGEEGAEKGPVRRIVSLLGEKIWG

Query:  SWN
         WN
Subjt:  SWN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein6.3e-3244.59Show/hide
Query:  MKKLYRKRGTVHPSPLII--SDH-LSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSY
        MKKLYRK GTVHPSP  I  +DH L+ LP  I +LAA LS  DREVLAYLIS+ S  ++   N +S   K    K A      +H P F C CF CYTSY
Subjt:  MKKLYRKRGTVHPSPLII--SDH-LSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSY

Query:  WVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNN---KKERKKRNNRGP---VSGPGEGKGSEAATKEEEWRV--------TEREVAEGGEEGA-------
        WVRWDSSP+RQLIHEIIDA+E+ L ++K  K N   KK+R+KR+ +      S       SE  ++  E  V        +E     GG  G        
Subjt:  WVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNN---KKERKKRNNRGP---VSGPGEGKGSEAATKEEEWRV--------TEREVAEGGEEGA-------

Query:  -----------EKGPVRRIVSLLGEKIWGSW
                   EKG VRR VS +GEK++G W
Subjt:  -----------EKGPVRRIVSLLGEKIWGSW

AT1G24270.1 unknown protein2.6e-1740.69Show/hide
Query:  KRGTVHPSPLIIS-------DHLS---FLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTS
        K+G VHPSP + S       D LS    L + IL L + LS  D EVLAYLI+   N      N  S + K +H+            P   C CF CYTS
Subjt:  KRGTVHPSPLIIS-------DHLS---FLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYEEKLAESKV-----GKNNKKERKK
        YW +WDSS NR+LI++II+A+E+ L   ++      K NKK  KK
Subjt:  YWVRWDSSPNRQLIHEIIDAYEEKLAESKV-----GKNNKKERKK

AT1G62422.1 unknown protein4.8e-3246.83Show/hide
Query:  MKKLYRKRGTVHPS--PLIISDH--LSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTS
        MKKL RK GTVHPS  P I +D   LS LP  IL+L AALS+ DREVLAYLIS+  +      N  S   K     H        H P F C CF CYTS
Subjt:  MKKLYRKRGTVHPS--PLIISDH--LSFLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTS

Query:  YWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNR--GPVSGPGEGKGSEAATKEEEWR--VTEREVAEGGEEG-AEKGPVRRIVSLLGEK
        YWVRWD+SP RQLIHEIIDAYE+ L      K  KK+R+KR+ +  G V+  G  + SE  +   E+    +E++   GGEE   EKG V +++S +G++
Subjt:  YWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNR--GPVSGPGEGKGSEAATKEEEWR--VTEREVAEGGEEG-AEKGPVRRIVSLLGEK

Query:  IWGSW
          G W
Subjt:  IWGSW

AT5G13090.1 unknown protein3.0e-1833.64Show/hide
Query:  RKRGTVHPSP---------LIISDHLS-----------FLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPA
        +K+G V+PSP            S+HL+            LP  IL L + LS  +REVLAYLI+      T + +  +   K   +K +        PP 
Subjt:  RKRGTVHPSP---------LIISDHLS-----------FLPTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPA

Query:  FSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNR------------------------GPVSGP-GEGKGSEAATKEEE
        F C CF CYT+YW RWDSSPNR+LIHEII+A+E    E      +K +R K+  +                         PV  P  E   SE++     
Subjt:  FSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERKKRNNR------------------------GPVSGP-GEGKGSEAATKEEE

Query:  WRVTEREVAEGGEE
         R++E EVAEG  E
Subjt:  WRVTEREVAEGGEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCAAGAATCTGGACTGTACGATTAGGGGTGTACTGTGTGTACTCAAATTCCCGCGTTTCACGTGAATATTTCCATCCCCATGCATTCCCACTGCCACTGCGGCA
GTACCTCCAGCATCTTTTACAATCCCCACTCGCACTTTATCCCACGCCCCTTCATACCCCGACACCCCCAATAAAATCTCTTCCTCTTTTCCCACTCCAACCCATTCCTC
CCTCACTCTCTATTCACTCTCTTTTTTGGTCTCCGATGAAGAAGCTCTACCGCAAAAGAGGAACCGTCCACCCGTCTCCCCTTATCATCTCCGATCACCTCTCCTTTCTC
CCCACCGTCATTCTCACCCTCGCCGCCGCCCTCTCTCTCCACGACCGTGAGGTTTTAGCCTACCTTATCTCCTCCTGCTCCAATGACTTCACCGCCGTCATCAATTCCTC
CAGCCACCGCGGCAAGGCAACTCACCAGAAACACGCCGCTGCCATGGGTGGTTTGGACCATCCCCCGGCGTTCTCCTGCTACTGTTTCCAATGCTACACAAGCTACTGGG
TCAGATGGGATTCCTCACCGAATCGGCAGCTGATTCACGAAATCATCGACGCTTATGAAGAGAAATTGGCTGAGAGCAAAGTTGGGAAGAACAATAAGAAAGAGAGGAAG
AAGAGAAATAATAGGGGACCGGTTTCCGGTCCGGGTGAGGGGAAAGGGTCTGAAGCGGCGACGAAGGAAGAAGAGTGGAGGGTGACGGAGAGGGAGGTGGCAGAAGGCGG
CGAGGAGGGAGCGGAGAAAGGGCCAGTGAGAAGGATTGTGAGTTTGCTAGGGGAAAAAATTTGGGGAAGTTGGAATTAA
mRNA sequenceShow/hide mRNA sequence
TTAGGACGATTTAAATTGAAGCAGAACCCGGTGAGGTGAGATAAGATAATAAGCAAGAAATATTTAATAAAGTGATTCAATGTAGGCCCCCACCTCCATGAAATCAAGAA
TCTGGACTGTACGATTAGGGGTGTACTGTGTGTACTCAAATTCCCGCGTTTCACGTGAATATTTCCATCCCCATGCATTCCCACTGCCACTGCGGCAGTACCTCCAGCAT
CTTTTACAATCCCCACTCGCACTTTATCCCACGCCCCTTCATACCCCGACACCCCCAATAAAATCTCTTCCTCTTTTCCCACTCCAACCCATTCCTCCCTCACTCTCTAT
TCACTCTCTTTTTTGGTCTCCGATGAAGAAGCTCTACCGCAAAAGAGGAACCGTCCACCCGTCTCCCCTTATCATCTCCGATCACCTCTCCTTTCTCCCCACCGTCATTC
TCACCCTCGCCGCCGCCCTCTCTCTCCACGACCGTGAGGTTTTAGCCTACCTTATCTCCTCCTGCTCCAATGACTTCACCGCCGTCATCAATTCCTCCAGCCACCGCGGC
AAGGCAACTCACCAGAAACACGCCGCTGCCATGGGTGGTTTGGACCATCCCCCGGCGTTCTCCTGCTACTGTTTCCAATGCTACACAAGCTACTGGGTCAGATGGGATTC
CTCACCGAATCGGCAGCTGATTCACGAAATCATCGACGCTTATGAAGAGAAATTGGCTGAGAGCAAAGTTGGGAAGAACAATAAGAAAGAGAGGAAGAAGAGAAATAATA
GGGGACCGGTTTCCGGTCCGGGTGAGGGGAAAGGGTCTGAAGCGGCGACGAAGGAAGAAGAGTGGAGGGTGACGGAGAGGGAGGTGGCAGAAGGCGGCGAGGAGGGAGCG
GAGAAAGGGCCAGTGAGAAGGATTGTGAGTTTGCTAGGGGAAAAAATTTGGGGAAGTTGGAATTAAGTGATTGGATTTTGCATTGATGAATGAATTTGGAGATTTTTCGT
TTTTATGTTTGTGAATTAATTAGTTTGCAGTAATTAGGAAGGAATTGAAGAAGAAGCAAAGGGGTGTTCTTATAAACATAATTAAGATATCATCTTCTTCTTTTTCTTTG
TTTTGATTCTGGTTAATGTTGTTGTTAGTAAGTTGTATATAGAATCTCATGTTTTTCATGTACAAATTTATCAATATATATATAA
Protein sequenceShow/hide protein sequence
MKSRIWTVRLGVYCVYSNSRVSREYFHPHAFPLPLRQYLQHLLQSPLALYPTPLHTPTPPIKSLPLFPLQPIPPSLSIHSLFWSPMKKLYRKRGTVHPSPLIISDHLSFL
PTVILTLAAALSLHDREVLAYLISSCSNDFTAVINSSSHRGKATHQKHAAAMGGLDHPPAFSCYCFQCYTSYWVRWDSSPNRQLIHEIIDAYEEKLAESKVGKNNKKERK
KRNNRGPVSGPGEGKGSEAATKEEEWRVTEREVAEGGEEGAEKGPVRRIVSLLGEKIWGSWN