; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi11G010870 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi11G010870
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr11:18974802..18975503
RNA-Seq ExpressionLsi11G010870
SyntenyLsi11G010870
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]8.4e-9379.55Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M  +ITNQT      LDLQLSLRPP+G L  +PS  A+   +ANA+TN RI R LGTRRSSLRRCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+S+Q
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDVRCRQCQ +Y IEY+ VSKFEEIASFVE+NKNLFRDRAPRSWM+PNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYI
        SYTNNHRTGAKNRLLYL  I
Subjt:  SYTNNHRTGAKNRLLYLIYI

KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]3.3e-7362.84Show/hide
Query:  LDLQLSLRPPA---GVLSPQPSVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQ
        +DL+LSL  P+      S   S  A  + + ++++R    LG R++SLRR  S SP  T  IEPPYPWST+R AVVQTL YL SNQILTITG+V+C+QC+
Subjt:  LDLQLSLRPPA---GVLSPQPSVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQ

Query:  RQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRL
        R Y++EY+ VSKF EI  FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL L+HLKYFCSYT NHRTG+K+RL
Subjt:  RQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRL

Query:  LYLIYITLCHQVDPSGRF
        +YL YITLC Q+DPSGRF
Subjt:  LYLIYITLCHQVDPSGRF

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]2.4e-10380.85Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M  +ITNQT      LDLQLSLRPP+G L  +PS  A+   +ANA+TN RI R LGTRRSSLRRCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+S+Q
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDVRCRQCQ +Y IEY+ VSKFEEIASFVE+NKNLFRDRAPRSWM+PNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV
        SYTNNHRTGAKNRLLYL YITLCHQVDPSGRF RV
Subjt:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]6.5e-10177.87Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAVCQA--NAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M+  I NQ NE H  LDL+LSLRPP+G LS QPS   +  A  NA+TNMR+ R LGTRRSS +RCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+SNQ
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAVCQA--NAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDV+CRQCQ +Y IEY+  SKFEEIASFVE+NKN FRDRAP+SWM+PNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV
        S T NHRTGAKNRLLYL YITLCHQVDPSGRF RV
Subjt:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]1.3e-7261.88Show/hide
Query:  LDLQLSLRPPAGVLSPQPSVVAVCQANA--------ITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVR
        +DL+LSL  P+   +   +  A   A A        ++++R    LG R++SLR   S SP  T  IEPPYPWST+R AVV TL+YL SNQILTITG+V+
Subjt:  LDLQLSLRPPAGVLSPQPSVVAVCQANA--------ITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVR

Query:  CRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTG
        C+QC+R Y+IEY+ VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL L+HLKYFCSYT NHRTG
Subjt:  CRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTG

Query:  AKNRLLYLIYITLCHQVDPSGRF
        +K+RL+YL YITLC Q+DPSGRF
Subjt:  AKNRLLYLIYITLCHQVDPSGRF

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein3.1e-10177.87Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAVCQA--NAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M+  I NQ NE H  LDL+LSLRPP+G LS QPS   +  A  NA+TNMR+ R LGTRRSS +RCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+SNQ
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAVCQA--NAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDV+CRQCQ +Y IEY+  SKFEEIASFVE+NKN FRDRAP+SWM+PNYPTCRFCGHENGARPVIP +WRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV
        S T NHRTGAKNRLLYL YITLCHQVDPSGRF RV
Subjt:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV

A0A1S3BHR1 uncharacterized protein LOC1034897701.1e-10380.85Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M  +ITNQT      LDLQLSLRPP+G L  +PS  A+   +ANA+TN RI R LGTRRSSLRRCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+S+Q
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDVRCRQCQ +Y IEY+ VSKFEEIASFVE+NKNLFRDRAPRSWM+PNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV
        SYTNNHRTGAKNRLLYL YITLCHQVDPSGRF RV
Subjt:  SYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV

A0A5A7T547 Uncharacterized protein4.1e-9379.55Show/hide
Query:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ
        M  +ITNQT      LDLQLSLRPP+G L  +PS  A+   +ANA+TN RI R LGTRRSSLRRCNSRSPR TETIEPPYPWSTNRRA+V+TLN L+S+Q
Subjt:  MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAV--CQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQ

Query:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC
        IL ITGDVRCRQCQ +Y IEY+ VSKFEEIASFVE+NKNLFRDRAPRSWM+PNYPTCRFCGHENGARPVIP EWRKINWLFLLLGEMLG L+L+HLKYFC
Subjt:  ILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFC

Query:  SYTNNHRTGAKNRLLYLIYI
        SYTNNHRTGAKNRLLYL  I
Subjt:  SYTNNHRTGAKNRLLYLIYI

A0A6J1GLD4 uncharacterized protein LOC1114553886.1e-7361.88Show/hide
Query:  LDLQLSLRPPAGVLSPQPSVVAVCQANA--------ITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVR
        +DL+LSL  P+   +   +  A   A A        ++++R    LG R++SLR   S SP  T  IEPPYPWST+R AVV TL+YL SNQILTITG+V+
Subjt:  LDLQLSLRPPAGVLSPQPSVVAVCQANA--------ITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVR

Query:  CRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTG
        C+QC+R Y+IEY+ VSKF EI SFVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL L+HLKYFCSYT NHRTG
Subjt:  CRQCQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTG

Query:  AKNRLLYLIYITLCHQVDPSGRF
        +K+RL+YL YITLC Q+DPSGRF
Subjt:  AKNRLLYLIYITLCHQVDPSGRF

A0A6J1I5V9 uncharacterized protein LOC1114709685.2e-7265.5Show/hide
Query:  SVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVE
        +  A  + + ++++R    LG R++SLRR    SP  T  IEPPYPWST+R AVV TL+YL  NQILTITGDV+C+QC+R Y+IEY  VSKF EI SFVE
Subjt:  SVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQRQYKIEYETVSKFEEIASFVE

Query:  KNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV
         N   FRDRAP+ WM PNYPTCRFCG E G +PVIP EW KINW+FLLLGEM+GAL L+HLKYFCSYT NHRTG+K+RL+YL YITLC Q+DPSGRF R+
Subjt:  KNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRLLYLIYITLCHQVDPSGRFRRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein3.5e-4441.46Show/hide
Query:  LRPPAGVLSPQPSVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQRQYKIEYET
        L PP+  L+P P                 ++  T    + R  S   + ++TI PP+PW+TNRR  +Q+L YL+SNQI TITG+V+CR C++ Y++ Y  
Subjt:  LRPPAGVLSPQPSVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQCQRQYKIEYET

Query:  VSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRLLYLIYITLC
          +F E+  F    K   RDRA + W  P    C  CG E   +PVI     +INWLFLLLG+ LG  +L  LK FC ++ NHRTGAK+R+LYL Y+ LC
Subjt:  VSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRLLYLIYITLC

Query:  HQVDP
          + P
Subjt:  HQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)6.3e-3837.9Show/hide
Query:  LSLRPPAGVLSPQPSVVAVCQANAITNMRIAR-KLGTRRSSLRRCNSRSPRM-------TETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQC
        +S+R P     P   V+   Q N +  + +A  + G       R NS+ P            I PPYPW+T +   +Q+   L SN I  I+G V C+ C
Subjt:  LSLRPPAGVLSPQPSVVAVCQANAITNMRIAR-KLGTRRSSLRRCNSRSPRM-------TETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQC

Query:  QRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNR
         R   +EY    KF E+  +++ NK   R RAP SW +P    CR C  E   +PV+     +INWLFLLLG+MLG  +L  L+YFC   + HRTG+K+R
Subjt:  QRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNR

Query:  LLYLIYITLCHQVDPSGRF
        ++Y+ Y++LC Q+DP G F
Subjt:  LLYLIYITLCHQVDPSGRF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.5e-2335.52Show/hide
Query:  LSLRPPAGVLSPQPSVVAVCQANAITNMRIAR-KLGTRRSSLRRCNSRSPRM-------TETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQC
        +S+R P     P   V+   Q N +  + +A  + G       R NS+ P            I PPYPW+T +   +Q+   L SN I  I+G V C+ C
Subjt:  LSLRPPAGVLSPQPSVVAVCQANAITNMRIAR-KLGTRRSSLRRCNSRSPRM-------TETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQC

Query:  QRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHL
         R   +EY    KF E+  +++ NK   R RAP SW +P    CR C  E   +PV+     +INWLFLLLG+MLG  +L  L
Subjt:  QRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACAACATCACAAACCAAACCAATGAACCCCACGGCGATCTTGACCTTCAACTCTCACTCCGGCCACCGGCCGGGGTTCTCTCACCCCAACCGTCCGTCGTCGC
CGTCTGTCAAGCAAATGCAATAACCAACATGAGAATTGCTCGCAAATTAGGAACTCGTCGATCATCTCTCCGCCGGTGCAATTCCCGATCACCAAGGATGACGGAGACAA
TCGAGCCACCATATCCATGGTCAACTAACCGACGAGCCGTGGTTCAAACCCTAAACTACTTACAATCGAATCAAATCCTTACAATCACCGGGGACGTCCGGTGCCGGCAA
TGCCAAAGACAATACAAGATCGAATACGAGACCGTCTCGAAATTTGAGGAGATTGCAAGCTTTGTGGAGAAGAACAAGAACTTGTTTCGCGACCGAGCACCGAGGTCGTG
GATGAGCCCTAATTACCCGACGTGTCGATTTTGCGGACACGAGAACGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTATTGGGAG
AAATGCTTGGAGCTTTGAGTCTGAGTCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACGGGTGCAAAAAATCGTCTTCTTTATCTAATTTATATCACTTTG
TGCCACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACAACATCACAAACCAAACCAATGAACCCCACGGCGATCTTGACCTTCAACTCTCACTCCGGCCACCGGCCGGGGTTCTCTCACCCCAACCGTCCGTCGTCGC
CGTCTGTCAAGCAAATGCAATAACCAACATGAGAATTGCTCGCAAATTAGGAACTCGTCGATCATCTCTCCGCCGGTGCAATTCCCGATCACCAAGGATGACGGAGACAA
TCGAGCCACCATATCCATGGTCAACTAACCGACGAGCCGTGGTTCAAACCCTAAACTACTTACAATCGAATCAAATCCTTACAATCACCGGGGACGTCCGGTGCCGGCAA
TGCCAAAGACAATACAAGATCGAATACGAGACCGTCTCGAAATTTGAGGAGATTGCAAGCTTTGTGGAGAAGAACAAGAACTTGTTTCGCGACCGAGCACCGAGGTCGTG
GATGAGCCCTAATTACCCGACGTGTCGATTTTGCGGACACGAGAACGGAGCGAGGCCGGTGATCCCGGGGGAATGGAGAAAGATCAATTGGTTGTTCTTGCTATTGGGAG
AAATGCTTGGAGCTTTGAGTCTGAGTCATCTGAAATACTTCTGCAGTTACACTAACAATCATCGAACGGGTGCAAAAAATCGTCTTCTTTATCTAATTTATATCACTTTG
TGCCACCAAGTTGATCCTTCTGGCCGTTTCCGTCGAGTTTGA
Protein sequenceShow/hide protein sequence
MENNITNQTNEPHGDLDLQLSLRPPAGVLSPQPSVVAVCQANAITNMRIARKLGTRRSSLRRCNSRSPRMTETIEPPYPWSTNRRAVVQTLNYLQSNQILTITGDVRCRQ
CQRQYKIEYETVSKFEEIASFVEKNKNLFRDRAPRSWMSPNYPTCRFCGHENGARPVIPGEWRKINWLFLLLGEMLGALSLSHLKYFCSYTNNHRTGAKNRLLYLIYITL
CHQVDPSGRFRRV