; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10003949 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10003949
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionsegmentation polarity homeobox protein engrailed
Genome locationChr08:12018530..12019330
RNA-Seq ExpressionHG10003949
SyntenyHG10003949
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647203.1 hypothetical protein Csa_018951 [Cucumis sativus]8.7e-8680.89Show/hide
Query:  QDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPSPISSHHYFSSPYNQIPHILRINSLKAT
        QDKLVVIPQPLSPL TT T    PSLSL+NKISPYPP PSP SSSISSFTCLSS  T SSTNTSFSTASSSPSPISSHHYF SPYNQ PH+  INSLKA 
Subjt:  QDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPSPISSHHYFSSPYNQIPHILRINSLKAT

Query:  AFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTVAPINIAPLKNGGRPKSRSPARGSEMKK
        AF P P+KP+SP ++RH SPQRVSRS PQKR RPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V +API      NG RPKSRSP R S MKK
Subjt:  AFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTVAPINIAPLKNGGRPKSRSPARGSEMKK

Query:  EITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
        EITCIHRISSKIDEVAV+E VG+LDSVVAMEDIDNPLISLDCFIFL
Subjt:  EITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

XP_008441600.1 PREDICTED: putative protein TPRXL [Cucumis melo]6.7e-10281.82Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS
        MGSCISKCKPKMM+ QPP+FDFNNLVQDKLVVIPQPLSPL       TT+  PSLSL+NKISPYPP PSP SSSISSFTCLSS  TSSSTNTSFSTASSS
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS

Query:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV
        PSPISSHHYF SPYNQ PH+ RINSLKA AF   P+KP+SP VVRH SPQRVSRSTPQKRLRPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V
Subjt:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV

Query:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
         +API      NG RPKSRSP RGS MKKEITCIHRISSKID+VAV+E VG+LDSVVAMED+DNPLISLDCFIFL
Subjt:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

XP_011657327.1 putative protein TPRXL [Cucumis sativus]9.0e-9981.68Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPS
        MGSCISKCKPKMMK QPP+FDFNNL VQDKLVVIPQPLSPL TT T    PSLSL+NKISPYPP PSP SSSISSFTCLSS  T SSTNTSFSTASSSPS
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPS

Query:  PISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTV
        PISSHHYF SPYNQ PH+  INSLKA AF P P+KP+SP ++RH SPQRVSRS PQKR RPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V +
Subjt:  PISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTV

Query:  APINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
        API      NG RPKSRSP R S MKKEITCIHRISSKIDEVAV+E VG+LDSVVAMEDIDNPLISLDCFIFL
Subjt:  APINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

XP_023550659.1 proline-rich receptor-like protein kinase PERK2 [Cucurbita pepo subsp. pepo]7.7e-8271.89Show/hide
Query:  MGSCISKCKPKMMK----QQPPVFDFNNLVQDKLVVIPQ--PLSPLTTTTTIPSLSLNNKISPYPP-PSPSSSISSFTCLSSTTTSSSTNTSFSTASSSP
        MGSCISKCKPK +K      PP+FDFNN+VQDKLVVIPQ  PL+   T+   PSLSL+NKISPYPP PSPS   SS TCLSSTTT+++TN+SFSTASS  
Subjt:  MGSCISKCKPKMMK----QQPPVFDFNNLVQDKLVVIPQ--PLSPLTTTTTIPSLSLNNKISPYPP-PSPSSSISSFTCLSSTTTSSSTNTSFSTASSSP

Query:  SPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVV----RHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKC
        SPI SH YF SPYNQ PH++RINSLKA+AF P P  PVSPVV    RH SPQRVSRSTPQKR+R ASPSP +RQKSFRKEV QRPL SPSP+RR +GEKC
Subjt:  SPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVV----RHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKC

Query:  RVTVAPINIAPLKNGGRPKSRSPARGSEMKKE-ITCIHRISSKIDEVAVREVV---GELDSVVAMEDIDNPLISLDCFIFL
        RV    I  A +K  GR KSRSPARG EMKKE ITCIHRISSKIDE A RE V   G+LDS  AMEDIDNPLISLDCFIFL
Subjt:  RVTVAPINIAPLKNGGRPKSRSPARGSEMKKE-ITCIHRISSKIDEVAVREVV---GELDSVVAMEDIDNPLISLDCFIFL

XP_038886331.1 proline-rich receptor-like protein kinase PERK2 [Benincasa hispida]3.4e-9881.92Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDF-NNLVQDKLVVIPQPLSPL----TTTTTIPSLSLNNKISPYPPPSPSSSISSFTCLSSTTTSSSTNTSFSTASSSPSP
        MGSCISKCKPKMMK QPP+FDF NNLVQDKLVVIPQPLSPL    TTTTTIPSLSLNNKISPY PPSPSSSISSFTCL     SSSTNTSFSTASSSPSP
Subjt:  MGSCISKCKPKMMKQQPPVFDF-NNLVQDKLVVIPQPLSPL----TTTTTIPSLSLNNKISPYPPPSPSSSISSFTCLSSTTTSSSTNTSFSTASSSPSP

Query:  ISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTVAP
        ISSHH F SPYNQ   +LRINSLKATAFPP PIKPVSP+VRH SPQRV RSTPQKR+RPASPSP IRQKSFRKEVL +PL SPSP+RRF+ EKCRV VA 
Subjt:  ISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTVAP

Query:  INIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
                   PKSRSPAR S MKKEITCIHRISSKIDEVAV+E VG+LDSVVAMEDIDNPLISLDCFIFL
Subjt:  INIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

TrEMBL top hitse value%identityAlignment
A0A0A0KIF4 Uncharacterized protein4.4e-9981.68Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPS
        MGSCISKCKPKMMK QPP+FDFNNL VQDKLVVIPQPLSPL TT T    PSLSL+NKISPYPP PSP SSSISSFTCLSS  T SSTNTSFSTASSSPS
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTT---IPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSSPS

Query:  PISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTV
        PISSHHYF SPYNQ PH+  INSLKA AF P P+KP+SP ++RH SPQRVSRS PQKR RPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V +
Subjt:  PISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTV

Query:  APINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
        API      NG RPKSRSP R S MKKEITCIHRISSKIDEVAV+E VG+LDSVVAMEDIDNPLISLDCFIFL
Subjt:  APINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

A0A1S3B4I5 Uncharacterized protein3.2e-10281.82Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS
        MGSCISKCKPKMM+ QPP+FDFNNLVQDKLVVIPQPLSPL       TT+  PSLSL+NKISPYPP PSP SSSISSFTCLSS  TSSSTNTSFSTASSS
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS

Query:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV
        PSPISSHHYF SPYNQ PH+ RINSLKA AF   P+KP+SP VVRH SPQRVSRSTPQKRLRPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V
Subjt:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV

Query:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
         +API      NG RPKSRSP RGS MKKEITCIHRISSKID+VAV+E VG+LDSVVAMED+DNPLISLDCFIFL
Subjt:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

A0A2C9WER9 Uncharacterized protein9.9e-3544.16Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTTIPSLSLNNKISPYPPPSP---SSSISSFTCLSSTTTSSSTNTSFSTASSS---P
        MG CISKCKPK +    P+ DF ++ VQDKLV+   P   + T    PS    NKISP  PPSP   SSS SSFTC S++ TS S  +S ST SSS   P
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNL-VQDKLVVIPQPLSPLTTTTTIPSLSLNNKISPYPPPSP---SSSISSFTCLSSTTTSSSTNTSFSTASSS---P

Query:  SPIS-SHHYFSSPYNQIPHILRINSLKATAFPPVP--------IKPVSPVVRHSSPQRV-SRSTPQKRLRPASPSPIIRQKSFRKEVLQ---------RP
           S S+ +  S   + PH++RINS+K  +   VP          PVS   +    QRV   STPQKR+R  SP+P+ RQKSFR+E  +         R 
Subjt:  SPIS-SHHYFSSPYNQIPHILRINSLKATAFPPVP--------IKPVSPVVRHSSPQRV-SRSTPQKRLRPASPSPIIRQKSFRKEVLQ---------RP

Query:  LSSPSPTRRFTGEKCR--------VTVAPINIAPLKNGGRPKSRSPAR-----------------GSEMKKEITCIHRISSKIDEVAVREVVGELDSVVA
        L SPSP+RRF GE  R          ++   +A   N       S  R                 GS +K   TCIHRISSKIDEVAV E +   DS   
Subjt:  LSSPSPTRRFTGEKCR--------VTVAPINIAPLKNGGRPKSRSPAR-----------------GSEMKKEITCIHRISSKIDEVAVREVVGELDSVVA

Query:  MEDIDNPLISLDCFIFL
        MEDIDNPLISLDCFIFL
Subjt:  MEDIDNPLISLDCFIFL

A0A5D3D583 TPRXL protein3.2e-10281.82Show/hide
Query:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS
        MGSCISKCKPKMM+ QPP+FDFNNLVQDKLVVIPQPLSPL       TT+  PSLSL+NKISPYPP PSP SSSISSFTCLSS  TSSSTNTSFSTASSS
Subjt:  MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPL------TTTTTIPSLSLNNKISPYPP-PSP-SSSISSFTCLSSTTTSSSTNTSFSTASSS

Query:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV
        PSPISSHHYF SPYNQ PH+ RINSLKA AF   P+KP+SP VVRH SPQRVSRSTPQKRLRPASPSP IRQKSFRKEVLQRPLSSPSPTRRF+ EKC+V
Subjt:  PSPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSP-VVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRV

Query:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL
         +API      NG RPKSRSP RGS MKKEITCIHRISSKID+VAV+E VG+LDSVVAMED+DNPLISLDCFIFL
Subjt:  TVAPINIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL

A0A6J1FG04 uncharacterized protein LOC1114451476.1e-7769.34Show/hide
Query:  MGSCISKCKPKMMK-QQPPVFDFNNLVQDKLVVIPQPLSPLTTTTT-----IPSLSLNNKISPYPP-PSPSSSISSFTCLSSTTTSSSTNTSFSTASSSP
        MGSCISKCKPK +K   PP+FDFNN+VQDKLVVIPQP  PL    T      PSLSL+NKISPYPP PSPS   SS TCLSS+TT+++TN+SFSTASS  
Subjt:  MGSCISKCKPKMMK-QQPPVFDFNNLVQDKLVVIPQPLSPLTTTTT-----IPSLSLNNKISPYPP-PSPSSSISSFTCLSSTTTSSSTNTSFSTASSSP

Query:  SPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVV----RHSSPQRVSR------STPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRR
        SPI  H YF SPYNQ PH++RINSLKA+ F P P   VSPVV    RH SPQRVSR      STPQKR+R ASPSP +RQKSFRKEV QRPL SPSP+RR
Subjt:  SPISSHHYFSSPYNQIPHILRINSLKATAFPPVPIKPVSPVV----RHSSPQRVSR------STPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRR

Query:  FTGEKCRVTVAPINIAPLKNGGRPKSRSPARGSEMKKE-ITCIHRISSKIDEVAVREVV---GELDSVVAMEDIDNPLISLDCFIFL
         +GEKCRV    I  A +K  GR KSRSPARG EMKKE ITCIHRISSKIDE A RE V   G+LDS  AMEDIDNPLISLDCFIFL
Subjt:  FTGEKCRVTVAPINIAPLKNGGRPKSRSPARGSEMKKE-ITCIHRISSKIDEVAVREVV---GELDSVVAMEDIDNPLISLDCFIFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21510.1 unknown protein5.6e-1433.33Show/hide
Query:  MGSCISKCKPKM--MKQQPPVFDFNNLVQDKLVV--IPQPLSPLTT-------TTTIPSLSLNNKISPYPPPSPSSSISSFTCLSSTTTS----SSTNTS
        MG CISKC PK    K+     +    V +K+ +   P  +SPL            +P+  +N      PPPSP   ++SF+ +  +TTS    SS+N+S
Subjt:  MGSCISKCKPKM--MKQQPPVFDFNNLVQDKLVV--IPQPLSPLTT-------TTTIPSLSLNNKISPYPPPSPSSSISSFTCLSSTTTS----SSTNTS

Query:  FSTASS---SPSPISSHHYFSSPYNQIPHILRINSLK------ATAFPPVPIK---PVSPVVRHSSPQRVSR-----STPQKRLRPASPS--PIIRQKSF
         STASS   S     S+ +  + Y +  H+ RINSL+       T  P  P +   PV P    ++P R +      S   KR R  SP+   + RQKSF
Subjt:  FSTASS---SPSPISSHHYFSSPYNQIPHILRINSLK------ATAFPPVPIK---PVSPVVRHSSPQRVSR-----STPQKRLRPASPS--PIIRQKSF

Query:  RKE-----------------VLQRP----------LSSPSPTRRFTGEKCRVTVAP-INIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVR
        R++                  L+ P          L SPSP+RRF      +TV+  +    L   GR K    +  SE +     IHRISSKID+  +R
Subjt:  RKE-----------------VLQRP----------LSSPSPTRRFTGEKCRVTVAP-INIAPLKNGGRPKSRSPARGSEMKKEITCIHRISSKIDEVAVR

Query:  EVV-GELDSVVAM-EDIDNPLISLDCFIFL
        EV+  + + VV + E++ NPLI LDCFIFL
Subjt:  EVV-GELDSVVAM-EDIDNPLISLDCFIFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGATGATGAAACAACAACCACCTGTTTTTGATTTCAACAATCTTGTCCAAGACAAGCTTGTTGTAATTCCCCAACCACT
TTCTCCATTAACAACAACAACAACAATTCCTTCTCTCTCTCTTAACAACAAAATCTCTCCTTATCCTCCCCCTTCCCCTTCTTCTTCCATTTCTTCTTTCACTTGTCTCT
CTTCAACTACAACTTCATCATCAACCAACACCTCTTTCTCAACAGCATCTTCTTCACCTTCCCCAATTTCCTCACATCACTACTTTTCTTCTCCCTACAACCAAATCCCT
CACATCCTAAGAATCAATTCCCTTAAAGCTACCGCCTTTCCACCGGTCCCCATCAAGCCGGTTTCCCCCGTCGTTCGTCATTCGTCCCCACAAAGGGTGTCGAGATCCAC
ACCCCAAAAGAGACTCCGACCGGCTTCTCCATCGCCAATAATTCGGCAGAAGAGCTTCAGAAAGGAGGTTCTACAACGGCCTCTCTCGTCACCGTCACCGACTAGACGCT
TCACTGGAGAGAAATGTAGGGTGACCGTGGCTCCGATTAACATAGCGCCGTTGAAGAATGGCGGTCGTCCAAAAAGCCGATCGCCGGCGAGGGGTAGTGAGATGAAGAAG
GAAATAACTTGCATTCATAGGATCAGTTCAAAGATTGATGAAGTTGCTGTGAGAGAAGTGGTTGGAGAATTAGATTCAGTGGTGGCTATGGAAGATATTGATAATCCTTT
AATCTCGTTGGATTGCTTTATCTTTCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATCTTGCATTAGCAAATGCAAACCCAAGATGATGAAACAACAACCACCTGTTTTTGATTTCAACAATCTTGTCCAAGACAAGCTTGTTGTAATTCCCCAACCACT
TTCTCCATTAACAACAACAACAACAATTCCTTCTCTCTCTCTTAACAACAAAATCTCTCCTTATCCTCCCCCTTCCCCTTCTTCTTCCATTTCTTCTTTCACTTGTCTCT
CTTCAACTACAACTTCATCATCAACCAACACCTCTTTCTCAACAGCATCTTCTTCACCTTCCCCAATTTCCTCACATCACTACTTTTCTTCTCCCTACAACCAAATCCCT
CACATCCTAAGAATCAATTCCCTTAAAGCTACCGCCTTTCCACCGGTCCCCATCAAGCCGGTTTCCCCCGTCGTTCGTCATTCGTCCCCACAAAGGGTGTCGAGATCCAC
ACCCCAAAAGAGACTCCGACCGGCTTCTCCATCGCCAATAATTCGGCAGAAGAGCTTCAGAAAGGAGGTTCTACAACGGCCTCTCTCGTCACCGTCACCGACTAGACGCT
TCACTGGAGAGAAATGTAGGGTGACCGTGGCTCCGATTAACATAGCGCCGTTGAAGAATGGCGGTCGTCCAAAAAGCCGATCGCCGGCGAGGGGTAGTGAGATGAAGAAG
GAAATAACTTGCATTCATAGGATCAGTTCAAAGATTGATGAAGTTGCTGTGAGAGAAGTGGTTGGAGAATTAGATTCAGTGGTGGCTATGGAAGATATTGATAATCCTTT
AATCTCGTTGGATTGCTTTATCTTTCTGTAG
Protein sequenceShow/hide protein sequence
MGSCISKCKPKMMKQQPPVFDFNNLVQDKLVVIPQPLSPLTTTTTIPSLSLNNKISPYPPPSPSSSISSFTCLSSTTTSSSTNTSFSTASSSPSPISSHHYFSSPYNQIP
HILRINSLKATAFPPVPIKPVSPVVRHSSPQRVSRSTPQKRLRPASPSPIIRQKSFRKEVLQRPLSSPSPTRRFTGEKCRVTVAPINIAPLKNGGRPKSRSPARGSEMKK
EITCIHRISSKIDEVAVREVVGELDSVVAMEDIDNPLISLDCFIFL