; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi05G001076 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi05G001076
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationchr5:43247488..43248198
RNA-Seq ExpressionBhi05G001076
SyntenyBhi05G001076
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036575.1 uncharacterized protein E6C27_scaffold191G00850 [Cucumis melo var. makuwa]1.4e-9882.81Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN
        M  +ITNQT      LDLQLSLRPP+GDL    P P A+  ARANA+TN RITRNLGTRRSS RRCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+L+S+
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN

Query:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF
        QIL I+G+V CRQCQ +YTIEYDMVSKFEEIA+FVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP+EWRKINWLFLLLGEMLGVLNLNHLKYF
Subjt:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF

Query:  CSYTNNHRTGAKNRLLYLTYI
        CSYTNNHRTGAKNRLLYLT I
Subjt:  CSYTNNHRTGAKNRLLYLTYI

KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]1.5e-7363.3Show/hide
Query:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQ
        +DL+LSL  P+   +  S    A      + ++++R   NLG R++S RR  S SP TT  IEPPYPWST+R AVVQTL  L SNQILTI+GEV C+QC+
Subjt:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQ

Query:  RQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRL
        R Y +EYD+VSKF EI  FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP+EW KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG+K+RL
Subjt:  RQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRL

Query:  LYLTYITLCHQVDPSGRF
        +YLTYITLC Q+DPSGRF
Subjt:  LYLTYITLCHQVDPSGRF

XP_008447299.1 PREDICTED: uncharacterized protein LOC103489770 [Cucumis melo]5.0e-10983.9Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN
        M  +ITNQT      LDLQLSLRPP+GDL    P P A+  ARANA+TN RITRNLGTRRSS RRCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+L+S+
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN

Query:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF
        QIL I+G+V CRQCQ +YTIEYDMVSKFEEIA+FVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP+EWRKINWLFLLLGEMLGVLNLNHLKYF
Subjt:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF

Query:  CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
        CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Subjt:  CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV

XP_011659748.1 uncharacterized protein LOC105436256 [Cucumis sativus]5.7e-10580.17Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDL-SPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKS
        MK  I NQ +  H  LDL+LSLRPP+G L S PS  P+    AR NA+TNMR+TR+LGTRRSS +RCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+LKS
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDL-SPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKS

Query:  NQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKY
        NQIL I+G+V CRQCQ +YTIEYDM SKFEEIA+FVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP++WRKINWLFLLLGEMLGVLNLNHLKY
Subjt:  NQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKY

Query:  FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
        FCS T NHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Subjt:  FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]5.8e-7363.23Show/hide
Query:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANA-----ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVS
        +DL+LSL  P+   +  S    A   A A A     ++++R   NLG R++S R   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTI+GEV 
Subjt:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANA-----ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVS

Query:  CRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTG
        C+QC+R Y IEYD+VSKF EI +FVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP+EW KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG
Subjt:  CRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTG

Query:  AKNRLLYLTYITLCHQVDPSGRF
        +K+RL+YLTYITLC Q+DPSGRF
Subjt:  AKNRLLYLTYITLCHQVDPSGRF

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein2.8e-10580.17Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDL-SPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKS
        MK  I NQ +  H  LDL+LSLRPP+G L S PS  P+    AR NA+TNMR+TR+LGTRRSS +RCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+LKS
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDL-SPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKS

Query:  NQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKY
        NQIL I+G+V CRQCQ +YTIEYDM SKFEEIA+FVEENKN FRDRAP+SWMNPNYPTCRFCGHENGARPVIP++WRKINWLFLLLGEMLGVLNLNHLKY
Subjt:  NQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKY

Query:  FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
        FCS T NHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Subjt:  FCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV

A0A1S3BHR1 uncharacterized protein LOC1034897702.4e-10983.9Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN
        M  +ITNQT      LDLQLSLRPP+GDL    P P A+  ARANA+TN RITRNLGTRRSS RRCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+L+S+
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN

Query:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF
        QIL I+G+V CRQCQ +YTIEYDMVSKFEEIA+FVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP+EWRKINWLFLLLGEMLGVLNLNHLKYF
Subjt:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF

Query:  CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
        CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRF RV
Subjt:  CSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV

A0A5A7T547 Uncharacterized protein6.6e-9982.81Show/hide
Query:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN
        M  +ITNQT      LDLQLSLRPP+GDL    P P A+  ARANA+TN RITRNLGTRRSS RRCNSRSP+TTE IEPPYPWSTNRRA+V+TLN+L+S+
Subjt:  MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSN

Query:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF
        QIL I+G+V CRQCQ +YTIEYDMVSKFEEIA+FVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIP+EWRKINWLFLLLGEMLGVLNLNHLKYF
Subjt:  QILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYF

Query:  CSYTNNHRTGAKNRLLYLTYI
        CSYTNNHRTGAKNRLLYLT I
Subjt:  CSYTNNHRTGAKNRLLYLTYI

A0A6J1GLD4 uncharacterized protein LOC1114553882.8e-7363.23Show/hide
Query:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANA-----ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVS
        +DL+LSL  P+   +  S    A   A A A     ++++R   NLG R++S R   S SP TT  IEPPYPWST+R AVV TL+ L SNQILTI+GEV 
Subjt:  LDLQLSLRPPAGDLSPPSPLPVAVCRARANA-----ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVS

Query:  CRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTG
        C+QC+R Y IEYD+VSKF EI +FVE N   FRDRAP+ WM PNYPTCRFCG E G +PVIP+EW KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG
Subjt:  CRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTG

Query:  AKNRLLYLTYITLCHQVDPSGRF
        +K+RL+YLTYITLC Q+DPSGRF
Subjt:  AKNRLLYLTYITLCHQVDPSGRF

A0A6J1I5V9 uncharacterized protein LOC1114709688.4e-7066.84Show/hide
Query:  ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRA
        ++++R    LG R++S RR    SP TT  IEPPYPWST+R AVV TL+ L  NQILTI+G+V C+QC+R Y IEY++VSKF EI +FVE N   FRDRA
Subjt:  ITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRA

Query:  PRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV
        P+ WM PNYPTCRFCG E G +PVIP+EW KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGRF R+
Subjt:  PRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYITLCHQVDPSGRFRRV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein6.0e-4442.79Show/hide
Query:  LRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIE
        L PP+ +L+PP   PV     +     ++RI R+            S   K ++ I PP+PW+TNRR  +Q+L  L+SNQI TI+GEV CR C++ Y + 
Subjt:  LRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIE

Query:  YDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYI
        Y++  +F E+  F    K   RDRA + W  P    C  CG E   +PVI E   +INWLFLLLG+ LG   L  LK FC ++ NHRTGAK+R+LYLTY+
Subjt:  YDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYI

Query:  TLCHQVDP
         LC  + P
Subjt:  TLCHQVDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.6e-4140.76Show/hide
Query:  RPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEY
        RPP G     S  PVA             + RN+G R                 I PPYPW+T +   +Q+  +L SN I  ISG+V C+ C R  T+EY
Subjt:  RPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEY

Query:  DMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYIT
        ++  KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+ E   +INWLFLLLG+MLG   L+ L+YFC   + HRTG+K+R++Y+TY++
Subjt:  DMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTYIT

Query:  LCHQVDPSGRF
        LC Q+DP G F
Subjt:  LCHQVDPSGRF

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.5e-2638.29Show/hide
Query:  RPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEY
        RPP G     S  PVA             + RN+G R                 I PPYPW+T +   +Q+  +L SN I  ISG+V C+ C R  T+EY
Subjt:  RPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVSCRQCQRQYTIEY

Query:  DMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHL
        ++  KF E+  +++ NK   R RAP SW  P    CR C  E   +PV+ E   +INWLFLLLG+MLG   L+ L
Subjt:  DMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAAACAATATCACAAACCAAACCAGTGCGCCCCACGGCAACCTTGACCTTCAACTCTCACTCCGGCCACCAGCTGGGGATCTCTCGCCCCCATCGCCCCTGCCTGT
TGCCGTTTGTCGGGCTAGGGCAAATGCAATAACAAACATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCATCGAGACGTTGCAATTCTCGATCACCAAAGACGA
CCGAGGCAATCGAGCCACCATATCCATGGTCCACGAACCGACGAGCCGTGGTTCAAACCCTAAACGAGTTGAAGTCAAATCAAATCCTCACGATCTCTGGGGAAGTCAGC
TGCCGGCAATGCCAAAGACAATACACAATCGAATATGACATGGTTTCGAAATTCGAGGAGATTGCAAACTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGAGCACC
AAGGTCATGGATGAACCCTAATTACCCAACGTGTCGATTTTGTGGACACGAGAACGGAGCGAGGCCTGTGATTCCGGAGGAATGGAGAAAGATCAATTGGTTGTTCTTGC
TTCTGGGAGAAATGCTTGGAGTTTTGAATTTGAATCATCTGAAATACTTTTGCAGTTACACTAACAATCATCGAACAGGTGCAAAAAATCGTCTTCTTTATCTAACTTAT
ATCACTTTATGCCACCAAGTTGATCCTTCCGGCCGATTCCGTCGAGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAAACAATATCACAAACCAAACCAGTGCGCCCCACGGCAACCTTGACCTTCAACTCTCACTCCGGCCACCAGCTGGGGATCTCTCGCCCCCATCGCCCCTGCCTGT
TGCCGTTTGTCGGGCTAGGGCAAATGCAATAACAAACATGAGAATCACTCGCAATTTAGGAACTCGTCGATCATCATCGAGACGTTGCAATTCTCGATCACCAAAGACGA
CCGAGGCAATCGAGCCACCATATCCATGGTCCACGAACCGACGAGCCGTGGTTCAAACCCTAAACGAGTTGAAGTCAAATCAAATCCTCACGATCTCTGGGGAAGTCAGC
TGCCGGCAATGCCAAAGACAATACACAATCGAATATGACATGGTTTCGAAATTCGAGGAGATTGCAAACTTTGTGGAGGAGAACAAGAACTTGTTTCGCGATCGAGCACC
AAGGTCATGGATGAACCCTAATTACCCAACGTGTCGATTTTGTGGACACGAGAACGGAGCGAGGCCTGTGATTCCGGAGGAATGGAGAAAGATCAATTGGTTGTTCTTGC
TTCTGGGAGAAATGCTTGGAGTTTTGAATTTGAATCATCTGAAATACTTTTGCAGTTACACTAACAATCATCGAACAGGTGCAAAAAATCGTCTTCTTTATCTAACTTAT
ATCACTTTATGCCACCAAGTTGATCCTTCCGGCCGATTCCGTCGAGTTTGA
Protein sequenceShow/hide protein sequence
MKNNITNQTSAPHGNLDLQLSLRPPAGDLSPPSPLPVAVCRARANAITNMRITRNLGTRRSSSRRCNSRSPKTTEAIEPPYPWSTNRRAVVQTLNELKSNQILTISGEVS
CRQCQRQYTIEYDMVSKFEEIANFVEENKNLFRDRAPRSWMNPNYPTCRFCGHENGARPVIPEEWRKINWLFLLLGEMLGVLNLNHLKYFCSYTNNHRTGAKNRLLYLTY
ITLCHQVDPSGRFRRV