; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G031410 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G031410
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCiama_Chr02:5801746..5802708
RNA-Seq ExpressionCaUC02G031410
SyntenyCaUC02G031410
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]8.0e-9486.14Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP  DL  RPS  P  +  A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YI
         I
Subjt:  YI

KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]3.0e-7266.51Show/hide
Query:  PSAVPVTVCQANANA-------LTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYNIEYDT
        PSA       A + A       L+S+R   NLG R++SLRR  S SP TT  IEPPYPWST+R AVVQTL  L SNQILTITG+V+C+QC+R Y +EYD 
Subjt:  PSAVPVTVCQANANA-------LTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYNIEYDT

Query:  VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLC
        VSKF EI  FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYITLC
Subjt:  VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLC

Query:  HQVDPSGRF
         Q+DPSGRF
Subjt:  HQVDPSGRF

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]2.3e-10487.1Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP  DL  RPS  P  +  A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]2.9e-9983.41Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDLKSNQIL ITGDVQCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD  SKFEEIASFVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKYFCS T NHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]6.0e-7365.6Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQ
        SL+ P    +   +A   TV  A A    + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTITG+V+C+QC+
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQ

Query:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL
        R Y IEYD VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL
Subjt:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL

Query:  LYLTYITLCHQVDPSGRF
        +YLTYITLC Q+DPSGRF
Subjt:  LYLTYITLCHQVDPSGRF

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein1.4e-9983.41Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP   LS +PSA P+    A  NA+T+MR+TR+LGTRRSS +RCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDLKSNQIL ITGDVQCRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD  SKFEEIASFVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG LNLNHLKYFCS T NHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

A0A1S3BHR1 uncharacterized protein LOC1034897701.1e-10487.1Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP  DL  RPS  P  +  A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YITLCHQVDPSGRFRRV
        YITLCHQVDPSGRF RV
Subjt:  YITLCHQVDPSGRFRRV

A0A5A7T547 Uncharacterized protein3.9e-9486.14Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN
        SL PP  DL  RPS  P  +  A ANALT+ RITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRA+V+TLNDL+S+QIL ITGDV+CRQCQ +Y 
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYN

Query:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT
        IEYD VSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG LNLNHLKYFCSYTNNHRTGAKNRLLYLT
Subjt:  IEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLT

Query:  YI
         I
Subjt:  YI

A0A6J1GLD4 uncharacterized protein LOC1114553882.9e-7365.6Show/hide
Query:  SLSPPVRDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQ
        SL+ P    +   +A   TV  A A    + L+S+R   NLG R++SLR   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTITG+V+C+QC+
Subjt:  SLSPPVRDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQ

Query:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL
        R Y IEYD VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL
Subjt:  RQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRL

Query:  LYLTYITLCHQVDPSGRF
        +YLTYITLC Q+DPSGRF
Subjt:  LYLTYITLCHQVDPSGRF

A0A6J1I5V9 uncharacterized protein LOC1114709683.2e-7264.65Show/hide
Query:  RDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYNIE
        + L+   +    TV  A A    + L+S+R    LG R++SLRR    SP TT  IEPPYPWST+R AVV TL+ L  NQILTITGDV+C+QC+R Y IE
Subjt:  RDLSPRPSAVPVTVCQANA----NALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYNIE

Query:  YDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYI
        Y+ VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL LNHLKYFCSYT NHRTG+K+RL+YLTYI
Subjt:  YDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYI

Query:  TLCHQVDPSGRFRRV
        TLC Q+DPSGRF R+
Subjt:  TLCHQVDPSGRFRRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein5.3e-4336.48Show/hide
Query:  MENITNQSNEPHGDLDLQLSLRPPAGDPSPQPFSLRSSVGDPS--PQPFSLRSSVRDLLSQPFSLRSSVRDLLSQPFSLRPPVRDLLSQPFSLRPPVRDL
        M N T+  ++    L L L+L   +     +P      +  P   P P +   +  D L    + RS V D    P S + P+   +S  F   P    L
Subjt:  MENITNQSNEPHGDLDLQLSLRPPAGDPSPQPFSLRSSVGDPS--PQPFSLRSSVRDLLSQPFSLRSSVRDLLSQPFSLRPPVRDLLSQPFSLRPPVRDL

Query:  SPQ---PFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQC
              P  L+PP  +L+P P   PVT          S+RI R+            S   + ++TI PP+PW+TNRR  +Q+L  L+SNQI TITG+VQC
Subjt:  SPQ---PFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQC

Query:  RQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGA
        R C++ Y + Y+   +F E+  F    K   RDRA + W  P    C  CG E   +PVI     +INWLFLLLG+ LG   L  LK FC ++ NHRTGA
Subjt:  RQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGA

Query:  KNRLLYLTYITLCHQVDP
        K+R+LYLTY+ LC  + P
Subjt:  KNRLLYLTYITLCHQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)7.9e-3936.4Show/hide
Query:  PPVRDLLSQPFSLRPPVRDLSPQPFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTL
        PP    +  P   +P    L P   +    V   +PR    P    + N+    +  + RN+G R                 I PPYPW+T +   +Q+ 
Subjt:  PPVRDLLSQPFSLRPPVRDLSPQPFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTL

Query:  NDLKSNQILTITGDVQCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNL
         DL SN I  I+G V C+ C R   +EY+   KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+     +INWLFLLLG+MLG   L
Subjt:  NDLKSNQILTITGDVQCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNL

Query:  NHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF
        + L+YFC   + HRTG+K+R++Y+TY++LC Q+DP G F
Subjt:  NHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown7.2e-2433.5Show/hide
Query:  PPVRDLLSQPFSLRPPVRDLSPQPFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTL
        PP    +  P   +P    L P   +    V   +PR    P    + N+    +  + RN+G R                 I PPYPW+T +   +Q+ 
Subjt:  PPVRDLLSQPFSLRPPVRDLSPQPFSLSPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTL

Query:  NDLKSNQILTITGDVQCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNL
         DL SN I  I+G V C+ C R   +EY+   KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+     +INWLFLLLG+MLG   L
Subjt:  NDLKSNQILTITGDVQCRQCQRQYNIEYDTVSKFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNL

Query:  NHL
        + L
Subjt:  NHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACATCACAAACCAAAGCAATGAACCCCACGGCGACCTTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTC
CGGTCATCGGTCGGGGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTTAGGGATCTTTTATCCCAACCATTCTCACTCCGGTCGTCGGTTAGGGATCTT
TTATCCCAACCATTCTCACTCCGGCCGCCGGTTAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTCTCACCCCAACCATTCTCACTC
TCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGTCTGTCAGGCAAATGCAAATGCACTAACAAGCATGAGAATCACTCGCAATTTA
GGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACGGAGACGATCGAGCCACCATATCCATGGTCAACAAACCGACGAGCCGTGGTT
CAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCAATGCCGACAATGCCAAAGACAATACAATATTGAATACGACACTGTCTCG
AAATTCGAGGAGATTGCGAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAATTACCCGACGTGTAGATTTTGC
GGACACGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTTGGAGCTTTGAATCTGAATCAT
CTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGC
CGTTTCCGTCGAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACATCACAAACCAAAGCAATGAACCCCACGGCGACCTTGACCTCCAACTCTCACTTCGGCCGCCGGCCGGGGATCCCTCACCCCAACCATTCTCACTC
CGGTCATCGGTCGGGGATCCCTCACCGCAACCATTCTCACTCCGGTCGTCGGTTAGGGATCTTTTATCCCAACCATTCTCACTCCGGTCGTCGGTTAGGGATCTT
TTATCCCAACCATTCTCACTCCGGCCGCCGGTTAGGGATCTTTTATCCCAACCATTCTCACTCCGGCCGCCGGTCAGAGATCTCTCACCCCAACCATTCTCACTC
TCGCCGCCGGTCAGGGATCTCTCACCCCGACCGTCAGCTGTCCCCGTCACCGTCTGTCAGGCAAATGCAAATGCACTAACAAGCATGAGAATCACTCGCAATTTA
GGAACTCGTCGATCATCTCTCCGTCGCTGCAATTCCCGATCACCAAGGACAACGGAGACGATCGAGCCACCATATCCATGGTCAACAAACCGACGAGCCGTGGTT
CAAACCCTAAACGACCTGAAATCAAATCAAATCCTCACAATCACTGGAGACGTCCAATGCCGACAATGCCAAAGACAATACAATATTGAATACGACACTGTCTCG
AAATTCGAGGAGATTGCGAGCTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGGGCACCGAGGTCGTGGATGAACCCTAATTACCCGACGTGTAGATTTTGC
GGACACGAGAATGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTTTTGGGAGAAATGCTTGGAGCTTTGAATCTGAATCAT
CTGAAATACTTCTGCAGTTACACTAACAATCATCGAACTGGTGCAAAGAATCGTCTTCTTTATCTAACTTATATCACTTTGTGCCACCAAGTTGATCCTTCTGGC
CGTTTCCGTCGAGTTTGA
Protein sequenceShow/hide protein sequence
MENITNQSNEPHGDLDLQLSLRPPAGDPSPQPFSLRSSVGDPSPQPFSLRSSVRDLLSQPFSLRSSVRDLLSQPFSLRPPVRDLLSQPFSLRPPVRDLSPQPFSL
SPPVRDLSPRPSAVPVTVCQANANALTSMRITRNLGTRRSSLRRCNSRSPRTTETIEPPYPWSTNRRAVVQTLNDLKSNQILTITGDVQCRQCQRQYNIEYDTVS
KFEEIASFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSG
RFRRV