; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G032100 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G032100
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCla97Chr02:5163807..5164685
RNA-Seq ExpressionCla97C02G032100
SyntenyCla97C02G032100
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]2.8e-9386.14Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YI
         I
Subjt:  YI

KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]1.6e-7267.66Show/hide
Query:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA
        S     A     + L+S+R   NLG R++SLRR  S SP TT  IEPPYPWST+R AVVQTL  L SNQILTITG+V+C+QC+R Y +EYD VSKF EI 
Subjt:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA

Query:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR
         FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGR
Subjt:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR

Query:  F
        F
Subjt:  F

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]1.0e-10387.1Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]3.8e-9882.49Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDLKSNQIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD  SKFEEIASFVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKYFCS T NHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]4.2e-7368.16Show/hide
Query:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA
        + V   A     + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTITG+V+C+QC+R Y IEYD VSKF EI 
Subjt:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA

Query:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR
        SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGR
Subjt:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR

Query:  F
        F
Subjt:  F

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein1.8e-9882.49Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDLKSNQIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD  SKFEEIASFVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKYFCS T NHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

A0A1S3BHR1 uncharacterized protein LOC1034897704.9e-10487.1Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

A0A5A7T547 Uncharacterized protein1.3e-9386.14Show/hide
Query:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN
        SL PP  DL  RPS  P     A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTE IEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDVRCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YI
         I
Subjt:  YI

A0A6J1GLD4 uncharacterized protein LOC1114553882.0e-7368.16Show/hide
Query:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA
        + V   A     + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTITG+V+C+QC+R Y IEYD VSKF EI 
Subjt:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA

Query:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR
        SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGR
Subjt:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR

Query:  F
        F
Subjt:  F

A0A6J1I5V9 uncharacterized protein LOC1114709681.3e-7266.67Show/hide
Query:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA
        + V   A     + L+S+R    LG R++SLRR    SP TT  IEPPYPWST+R AVV TL+ L  NQILTITGDV+C+QC+R Y IEY+ VSKF EI 
Subjt:  SAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIA

Query:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR
        SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGR
Subjt:  SFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGR

Query:  FRRV
        F R+
Subjt:  FRRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein1.7e-4336.62Show/hide
Query:  ENITNQSNEPHGDPD-LQLSL-----------RP-----PAGDPS--PQPFSLWSSVGD--------PSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPH-
        + +TNQ+++   D + L LSL           RP     P   P   P P + W +  D        P P P S    +   +S  F   P    L  H 
Subjt:  ENITNQSNEPHGDPD-LQLSL-----------RP-----PAGDPS--PQPFSLWSSVGD--------PSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPH-

Query:  --PFSLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQ
          P  L+PP  +L+P P   PVT          S+RI R+            S   + ++ I PP+PW+TNRR  +Q+L  L+SNQI TITG+V+CR C+
Subjt:  --PFSLSPPVRDLSPRPSAVPVTACQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQ

Query:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL
        + Y + Y+   +F E+  F    K   RDRA + W  P    C  CG E   +PVI     +INWLFLLLG+ LG   L  LK FC ++ NHRTGAK+R+
Subjt:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL

Query:  LYLTYITLCHQVDP
        LYLTY+ LC  + P
Subjt:  LYLTYITLCHQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)6.7e-3734.26Show/hide
Query:  QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSM
        Q  E  G+  +QL    P  +  P P           PQP  + S            +   + + P   S+  P   L  +PS   +   Q N  A  ++
Subjt:  QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSM

Query:  RITRNLGTRRSSLRRCNSRSPRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRD
           R         RR + R     ER      I PPYPW+T +   +Q+  DL SN I  I+G V C+ C R   +EY+   KF E+  +++ NK   R 
Subjt:  RITRNLGTRRSSLRRCNSRSPRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRD

Query:  RAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF
        RAP SW  P    CR C  E   +PV+     +INWLFLLLG+MLG   L+ L+YFC   + HRTG+K+R++Y+TY++LC Q+DP G F
Subjt:  RAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown6.1e-2231.62Show/hide
Query:  QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSM
        Q  E  G+  +QL    P  +  P P           PQP  + S            +   + + P   S+  P   L  +PS   +   Q N  A  ++
Subjt:  QSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTSM

Query:  RITRNLGTRRSSLRRCNSRSPRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRD
           R         RR + R     ER      I PPYPW+T +   +Q+  DL SN I  I+G V C+ C R   +EY+   KF E+  +++ NK   R 
Subjt:  RITRNLGTRRSSLRRCNSRSPRTTER------IEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRD

Query:  RAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHL
        RAP SW  P    CR C  E   +PV+     +INWLFLLLG+MLG   L+ L
Subjt:  RAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACATCACGAACCAAAGCAATGAACCCCACGGCGACCCTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTC
TGGTCATCGGTCGGAGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTCAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTC
TCACCCCATCCATTCTCACTCTCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGCCTGTCAGGCAAATGCAAATGCACTAACAAGC
ATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACAGAGAGGATCGAGCCACCATATCCATGGTCA
ACAAACCGACGAGCCGTGGTTCAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCGATGCCGACAATGCCAAAGACAATACAAT
ATCGAATACGACACCGTCTCAAAATTCGAGGAGATTGCAAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAAT
TACCCGACGTGTCGATTTTGCGGACATGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTT
GGAGCTTTGAATCTGAATCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGC
CACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACATCACGAACCAAAGCAATGAACCCCACGGCGACCCTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTC
TGGTCATCGGTCGGAGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTCAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTC
TCACCCCATCCATTCTCACTCTCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGCCTGTCAGGCAAATGCAAATGCACTAACAAGC
ATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACAGAGAGGATCGAGCCACCATATCCATGGTCA
ACAAACCGACGAGCCGTGGTTCAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCGATGCCGACAATGCCAAAGACAATACAAT
ATCGAATACGACACCGTCTCAAAATTCGAGGAGATTGCAAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAAT
TACCCGACGTGTCGATTTTGCGGACATGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTT
GGAGCTTTGAATCTGAATCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGC
CACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA
Protein sequenceShow/hide protein sequence
MENITNQSNEPHGDPDLQLSLRPPAGDPSPQPFSLWSSVGDPSPQPFSLRSSVRDLLSQPFSLRPPVRDLSPHPFSLSPPVRDLSPRPSAVPVTACQANANALTS
MRITRNLGTRRSSLRRCNSRSPRTTERIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVRCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPN
YPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV